Return-Path: Delivered-To: apmail-jackrabbit-users-archive@locus.apache.org Received: (qmail 52896 invoked from network); 25 Feb 2008 17:45:22 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 25 Feb 2008 17:45:22 -0000 Received: (qmail 77790 invoked by uid 500); 25 Feb 2008 17:45:16 -0000 Delivered-To: apmail-jackrabbit-users-archive@jackrabbit.apache.org Received: (qmail 77769 invoked by uid 500); 25 Feb 2008 17:45:16 -0000 Mailing-List: contact users-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@jackrabbit.apache.org Delivered-To: mailing list users@jackrabbit.apache.org Received: (qmail 77760 invoked by uid 99); 25 Feb 2008 17:45:16 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Feb 2008 09:45:16 -0800 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of seancallan@gmail.com designates 209.85.132.241 as permitted sender) Received: from [209.85.132.241] (HELO an-out-0708.google.com) (209.85.132.241) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Feb 2008 17:44:40 +0000 Received: by an-out-0708.google.com with SMTP id c37so373610anc.85 for ; Mon, 25 Feb 2008 09:44:48 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to:subject:mime-version:content-type; bh=L99jzflIY0IKfSjmCf/jf0d2AX+YBC7EA6j0Cr7+/ps=; b=BqbZI3DL7EmZPDK+qSiyK+x3QOklJz0HCi9P9ORjGRn1orcfazSbFQMNOAfHY8WCjaFurCiLMl1TjPPLg2musTl/hMpggVl6O8BE937Hf4BqPRzVz1b/xsCh97uy3BERKpybt15oHkPESzTRRueINuVqa92PIWn0wtyTb8WF4tk= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:mime-version:content-type; b=XvyIOzObNXx0NeaEPtEpEHiAGxKQDeTmvKm5QHY3KHkwAqPXZVBf2zNui7rnGdOshIVFGt1pDEN39bmqoYf5FRCQpOfyEuoCbGsnY27XpFhiK6Ieo6kZiKkJZQdg+S6GzNg6klBLX6hJ56W251EWgs/IIhqvzc+6F5RI64VBKEg= Received: by 10.100.215.5 with SMTP id n5mr1925457ang.41.1203961487837; Mon, 25 Feb 2008 09:44:47 -0800 (PST) Received: by 10.100.44.16 with HTTP; Mon, 25 Feb 2008 09:44:47 -0800 (PST) Message-ID: <245ef33b0802250944t611aa7cax97078514170f2f08@mail.gmail.com> Date: Mon, 25 Feb 2008 12:44:47 -0500 From: "Sean Callan" To: users@jackrabbit.apache.org Subject: Saving & Searching Binary Content MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_5526_3618080.1203961487833" X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_5526_3618080.1203961487833 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline Hi guys, I've got a quick question about how I'm doing my saving and searching on binary content. At this time I use a my own node definition as follows: [webfmis:resource] > nt:resource, nt:unstructured - webfmis:fileName (string) - webfmis:size (string) When a file is uploaded I create a new webfmis:resource and load in their data as follows: n.setProperty("jcr:mimeType", mimeType); n.setProperty("jcr:data", file.getInputStream()); n.setProperty("jcr:lastModified", Calendar.getInstance()); n.setProperty("webfmis:fileName", filename); n.setProperty("webfmis:size", file.getSize()); My repository.xml file has all of the text extractors listed in it and everything seems to be working fine. If search for a word that happens to be in the webfmis:fileName I get it returned, however, I cannot get any values within the binary content to be indexed and returned. What am I doing here? These are the various xpaths I've tried to use: //*[jcr:contains(.,'rapdily')] //element(*, nt:resource)[(jcr:contains(.,'rapdily'))] //element(*, webfmis:resource)[(jcr:contains(.,'rapdily'))] //webfmis:resource[jcr:contains(.,'rapdily')] I'm not sure what I'm doing that is preventing the indexing and searching, I thought perhaps it was the jcr:encodingType property but I did away with that but there was no improvement. If I just look for webfmis:resource, or nt:resource I get the number of files in the system. Any help would be greatly appreciated. Thanks, Sean ------=_Part_5526_3618080.1203961487833--