lucene-java-user mailing list archives

From Erik Hatcher <e...@ehatchersolutions.com>
Subject Re: index phrases
Date Tue, 21 Jun 2005 15:13:57 GMT

On Jun 21, 2005, at 10:01 AM, Roxana Angheluta wrote:

> Dear all,
>
> I am using Lucene for indexing documents.
>
> I would like to include phrases (up to a maximum length given as a
> parameter) in the index. I know this is non-standard for searching,
> where a PhraseQuery can be built from the term positions. However, I
> am not interested in searching, but rather in using the indexed
> terms for some statistics.
>
> What would be an efficient way to do this? Is it possible to build  
> phrases in a filter after tokenization?

Roxana, could you give us a concrete example of what you want to do?

A TokenFilter could certainly be used to aggregate multiple terms
into a single term that represents a phrase. This would happen
during analysis: the filter wraps the tokenizer's output, so the
phrases are built at index time, right after tokenization.
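
To make the idea concrete, here is a minimal sketch in plain Java of
the sliding-window logic such a filter might use. It is not Lucene's
TokenFilter API: the class name PhraseTokens and the method phrases
are made up for illustration, tokens are plain strings, and joining
them with a single space is an assumption. In a real filter the same
loop would sit behind the filter's next-token method.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

/**
 * Hypothetical helper (not part of Lucene): turns a token sequence into
 * phrase terms of up to maxLen consecutive tokens.
 */
final class PhraseTokens {

    /** Returns every phrase of 1..maxLen consecutive tokens, joined by one space. */
    static List<String> phrases(Iterable<String> tokens, int maxLen) {
        if (maxLen < 1) {
            throw new IllegalArgumentException("maxLen must be >= 1");
        }
        List<String> out = new ArrayList<>();
        Deque<String> window = new ArrayDeque<>(); // the last maxLen tokens seen
        for (String token : tokens) {
            window.addLast(token);
            if (window.size() > maxLen) {
                window.removeFirst(); // keep the window bounded
            }
            // Emit every phrase that ends at the current token, shortest first.
            List<String> w = new ArrayList<>(window);
            for (int len = 1; len <= w.size(); len++) {
                out.add(String.join(" ", w.subList(w.size() - len, w.size())));
            }
        }
        return out;
    }
}
```

For the tokens "new", "york", "city" and a maximum length of 2, this
yields the terms: new, york, new york, city, york city. Emitting the
single tokens as well as the phrases is a design choice for
statistics; start the inner loop at a larger length to drop them.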

     Erik

