Return-Path: Delivered-To: apmail-jackrabbit-users-archive@locus.apache.org Received: (qmail 52611 invoked from network); 28 Mar 2008 06:57:58 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 28 Mar 2008 06:57:58 -0000 Received: (qmail 92431 invoked by uid 500); 28 Mar 2008 06:57:57 -0000 Delivered-To: apmail-jackrabbit-users-archive@jackrabbit.apache.org Received: (qmail 92415 invoked by uid 500); 28 Mar 2008 06:57:57 -0000 Mailing-List: contact users-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@jackrabbit.apache.org Delivered-To: mailing list users@jackrabbit.apache.org Received: (qmail 92406 invoked by uid 99); 28 Mar 2008 06:57:57 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Mar 2008 23:57:57 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jukka.zitting@gmail.com designates 209.85.200.171 as permitted sender) Received: from [209.85.200.171] (HELO wf-out-1314.google.com) (209.85.200.171) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 28 Mar 2008 06:57:17 +0000 Received: by wf-out-1314.google.com with SMTP id 23so128021wfg.13 for ; Thu, 27 Mar 2008 23:57:28 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; bh=HXd1d4BznZDIB4qN9A0KKiKJZl/dpdnpLUpjG6RQhsM=; b=pCaLpOzAcY8ucpahVkrL0lVEb1Gg6Fv9Mzgev2HHM9NgIbtRfS34Wu71wpP5cTzB206i3JI4fBudRbyQXhpsnHBrjp6UQbxoQEgMr3+XwgWuvNrgYhFu9nHx3weyj558f4qJB9x2SgB15FRl7w+UNfivnFDygiSC5t9HyXTIk8U= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=message-id:date:from:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=VkP6KEJY+QIPVkKoBOEftgqliahLObcD/Vr1J0qP2Q4mQCPtN+jxel7loMH3ZVdacV2s9Ckc4N7yTAt5u7xkQUHFGZ/NpYWU1anEt2BliQIffw86NSJvvT4O9KovGstNvrNBngub2sZ2dpWMhplYbJI4vUQ98BBBWmjJAwVEjlk= Received: by 10.142.203.13 with SMTP id a13mr1909173wfg.224.1206687448031; Thu, 27 Mar 2008 23:57:28 -0700 (PDT) Received: by 10.142.126.3 with HTTP; Thu, 27 Mar 2008 23:57:27 -0700 (PDT) Message-ID: <510143ac0803272357x78a6a926jd2e175ac78566857@mail.gmail.com> Date: Fri, 28 Mar 2008 08:57:27 +0200 From: "Jukka Zitting" To: users@jackrabbit.apache.org Subject: Re: how can I say to jackrabbit to index a text when I put a TIFF in the repository? In-Reply-To: <1206686591.6269.7.camel@antares> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline References: <41668.192.168.1.70.1206640770.webmail@192.168.1.70> <510143ac0803272326j36e201bblce32d388b515f169@mail.gmail.com> <1206686591.6269.7.camel@antares> X-Virus-Checked: Checked by ClamAV on apache.org Hi, On Fri, Mar 28, 2008 at 8:43 AM, Paco Avila wrote: > El vie, 28-03-2008 a las 08:26 +0200, Jukka Zitting escribi=F3: > > Or just a normal string property with the text to be indexed. > > But, in this case, the query can't be: > > /jcr:root//element(*,my:document)[jcr:contains(nt:resource,'hola > mundo')] > > and should be something like (if I store the text in my:docText > property: > > /jcr:root//element(*,my:document)[jcr:contains(my:docText,'hola > mundo')] > > because Lucene is not indexing the "document text version". You could use jcr:contains(., 'hola mundo') that looks in all properties of a node. Alternatively, you could also put the text in a TIFF comment and implement a custom TextExtractor class that pulls that comment for Jackrabbit to index as the text version of the TIFF file. > By the way, can I get the text generated by text-extractors or > it is only used by Lucene engine? No, it's only used for Lucene. But of course you can instantiate and run the text extractors manually on any binary property you like. BR, Jukka Zitting