lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karl Wettin (JIRA)" <>
Subject [jira] Commented: (LUCENE-580) Pre-analyzed fields
Date Wed, 25 Apr 2007 08:32:15 GMT


Karl Wettin commented on LUCENE-580:

25 apr 2007 kl. 10.23 skrev Michael Busch (JIRA):
> What kind of use cases do you have in mind with the cached field?

I made the inital implementation for text mining purposes -- I needed the term vector prior
to inserting the document to the index. Back then I analyzed, cached it up, did my secondary
analysis of the vector, and finally reconstructed the token stream and passed it to the field.
I think it would be easier to just pass a token stream to an extention of CachedPreAn.. that
also features a termFreqVector(), termPosVector(), et c.


> Pre-analyzed fields
> -------------------
>                 Key: LUCENE-580
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Analysis
>    Affects Versions: 1.9
>            Reporter: Karl Wettin
>         Assigned To: Michael Busch
>            Priority: Minor
>         Attachments: preanalyze.tar, trunk.diff
> Adds the possibility to set a TokenStream at Field constrution time, available as tokenStreamValue
in addition to stringValue, readerValue and binaryValue.
> There might be some problems with mixing stored fields with the same name as a field
with tokenStreamValue.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message