jackrabbit-users mailing list archives

From "Philipp Koch" <philipp.k...@gmail.com>
Subject Re: how can I say to jackrabbit to index a text when I put a TIFF in the repository?
Date Fri, 28 Mar 2008 07:47:49 GMT
> What do you mean with "document version"? the text from the TIFF image?
I was referring to my:docText in your example.

You could then call jcr:contains(jcr:content, 'hola mundo').

Regards,
philipp
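
For reference, the three query variants discussed in this thread differ only in the first argument of jcr:contains: a single property (my:docText), a child node (jcr:content), or "." for all properties of the matched node. Below is a minimal sketch of building such XPath statements as plain strings. The class ContainsQueryBuilder and its methods are hypothetical helpers for illustration, not part of Jackrabbit or the JCR API. The escaping only doubles apostrophes so that the phrase remains a valid XPath string literal; it does not handle the full-text search syntax of jcr:contains itself.

```java
// Hypothetical helper (not part of Jackrabbit or the JCR API): it only builds
// the XPath statement as a string; executing the query is left to the caller.
final class ContainsQueryBuilder {

    private ContainsQueryBuilder() {}

    /** Doubles apostrophes so the phrase stays a valid XPath string literal. */
    static String escapeLiteral(String phrase) {
        return phrase.replace("'", "''");
    }

    /**
     * @param nodeType node type to match, e.g. "my:document"
     * @param scope    what jcr:contains searches: ".", "my:docText" or "jcr:content"
     * @param phrase   text to search for
     */
    static String build(String nodeType, String scope, String phrase) {
        return "/jcr:root//element(*, " + nodeType + ")"
                + "[jcr:contains(" + scope + ", '" + escapeLiteral(phrase) + "')]";
    }

    public static void main(String[] args) {
        // The three scopes mentioned in the thread.
        System.out.println(build("my:document", "jcr:content", "hola mundo"));
        System.out.println(build("my:document", "my:docText", "hola mundo"));
        System.out.println(build("my:document", ".", "hola mundo"));
    }
}
```

In a JCR 1.0 application the resulting string would typically be passed to QueryManager.createQuery(statement, Query.XPATH); that step is omitted here so the sketch compiles without the JCR libraries.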


On Fri, Mar 28, 2008 at 8:33 AM, Paco Avila <pavila@git.es> wrote:
> What do you mean with "document version"? the text from the TIFF image?
>
>  On Fri, 28-03-2008 at 08:19 +0100, Philipp Koch wrote:
>
>
> > Why don't you just add a mixin node type (containing the
>  > document version property) to the nt:resource node while uploading, and
>  > store the doc version as an additional property on the nt:resource node?
>  > This would solve your problem, if I understood your use case
>  > correctly.
>  >
>  > regards,
>  > philipp
>  >
>  > On Fri, Mar 28, 2008 at 7:57 AM, Jukka Zitting <jukka.zitting@gmail.com> wrote:
>  > > Hi,
>  > >
>  > >
>  > >  On Fri, Mar 28, 2008 at 8:43 AM, Paco Avila <pavila@git.es> wrote:
>  > >  > On Fri, 28-03-2008 at 08:26 +0200, Jukka Zitting wrote:
>  > >
>  > > >  > Or just a normal string property with the text to be indexed.
>  > >  >
>  > >
>  > > >  But, in this case, the query can't be:
>  > >  >
>  > >  >   /jcr:root//element(*,my:document)[jcr:contains(nt:resource,'hola mundo')]
>  > >  >
>  > >  >  and should be something like this (if I store the text in the
>  > >  >  my:docText property):
>  > >  >
>  > >  >   /jcr:root//element(*,my:document)[jcr:contains(my:docText,'hola mundo')]
>  > >  >
>  > >  >  because Lucene is not indexing the "document text version".
>  > >
>  > >  You could use jcr:contains(., 'hola mundo') that looks in all
>  > >  properties of a node.
>  > >
>  > >  Alternatively, you could also put the text in a TIFF comment and
>  > >  implement a custom TextExtractor class that pulls that comment for
>  > >  Jackrabbit to index as the text version of the TIFF file.
>  > >
>  > >
>  > >  >  By the way, can I get the text generated by the text extractors, or
>  > >  >  is it only used by the Lucene engine?
>  > >
>  > >  No, it's only used for Lucene. But of course you can instantiate and
>  > >  run the text extractors manually on any binary property you like.
>  > >
>  > >  BR,
>  > >
>  > >  Jukka Zitting
>  > >
>  --
>
> Paco Avila <pavila@git.es>
>  GIT Consultors
>
>
