Return-Path: Delivered-To: apmail-jackrabbit-users-archive@locus.apache.org Received: (qmail 58626 invoked from network); 5 Mar 2008 16:59:06 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 5 Mar 2008 16:59:06 -0000 Received: (qmail 21122 invoked by uid 500); 5 Mar 2008 16:59:01 -0000 Delivered-To: apmail-jackrabbit-users-archive@jackrabbit.apache.org Received: (qmail 21108 invoked by uid 500); 5 Mar 2008 16:59:01 -0000 Mailing-List: contact users-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@jackrabbit.apache.org Delivered-To: mailing list users@jackrabbit.apache.org Received: (qmail 21099 invoked by uid 99); 5 Mar 2008 16:59:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Mar 2008 08:59:01 -0800 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [192.18.19.6] (HELO sineb-mail-1.sun.com) (192.18.19.6) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Mar 2008 16:58:12 +0000 Received: from fe-apac-04.sun.com (fe-apac-04.sun.com [192.18.19.175] (may be forged)) by sineb-mail-1.sun.com (8.13.6+Sun/8.12.9) with ESMTP id m25GwZP6017990 for ; Wed, 5 Mar 2008 16:58:36 GMT Received: from conversion-daemon.mail-apac.sun.com by mail-apac.sun.com (Sun Java System Messaging Server 6.2-6.01 (built Apr 3 2006)) id <0JX900001LYZKO00@mail-apac.sun.com> (original mail from Prakash.R@Sun.COM) for users@jackrabbit.apache.org; Thu, 06 Mar 2008 00:58:28 +0800 (SGT) Received: from [123.236.221.201] by mail-apac.sun.com (Sun Java System Messaging Server 6.2-6.01 (built Apr 3 2006)) with ESMTPSA id <0JX900433OHDAKS1@mail-apac.sun.com> for users@jackrabbit.apache.org; Thu, 06 Mar 2008 00:58:28 +0800 (SGT) Date: Wed, 05 Mar 2008 22:40:39 +0530 From: "Prakash Reddy K. L. V." Subject: Re: Search in binary Content In-reply-to: <510143ac0803050653l69c75276ha860ef50fc65edb0@mail.gmail.com> Sender: Prakash.R@Sun.COM To: users@jackrabbit.apache.org Message-id: <47CED40F.8090504@sun.com> MIME-version: 1.0 Content-type: multipart/alternative; boundary="Boundary_(ID_iAD/I0ahiW/2UTBmihSqpA)" References: <47CEAB02.9070809@sun.com> <510143ac0803050653l69c75276ha860ef50fc65edb0@mail.gmail.com> User-Agent: Thunderbird 2.0.0.12 (Windows/20080213) X-Virus-Checked: Checked by ClamAV on apache.org --Boundary_(ID_iAD/I0ahiW/2UTBmihSqpA) Content-type: text/plain; format=flowed; charset=ISO-8859-1 Content-transfer-encoding: 7BIT Hi Jukka, I am sorry for giving out the wrong info but I somehow have this vague memory of reading somewhere that jackrabbit does not support this. Thanks for clarifying this. Sorry again. Prakash Jukka Zitting wrote: > Hi, > > On Wed, Mar 5, 2008 at 4:15 PM, Prakash Reddy K. L. V. > wrote: > >> Jackrabbit does not support searching in binary content. >> > > It does, but there are certain restrictions before that happens. > > You need to put your binary content in a jcr:data property, and have a > related jcr:mimeType string property with the exact MIME type of the > binary data. > > Then, if you've configured the appropriate Jackrabbit text extractors > in the repository configuration file and have all the required parser > libraries (e.g. pdfbox for PDFs) available, Jackrabbit will index such > binary properties. > > BR, > > Jukka Zitting > --Boundary_(ID_iAD/I0ahiW/2UTBmihSqpA)--