lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erik Hatcher <>
Subject Re: index phrases
Date Tue, 21 Jun 2005 15:13:57 GMT

On Jun 21, 2005, at 10:01 AM, Roxana Angheluta wrote:

> Dear all,
> I am using Lucene for indexing documents.
> I would like to include phrases (of a certain maximum length given  
> as a parameter) in the index. I know this is non-standard for e.g.  
> searching, where a PhraseQuery can be built which makes use of the  
> terms positions. However, I am not interested in searching, but  
> rather in using the indexing terms for some statistics.
> What would be an efficient way to do this? Is it possible to build  
> phrases in a filter after tokenization?

Roxana- could you give us a concrete example of what you're wanting  
to do?

A TokenFilter could certainly be used to aggregate multiple terms  
into a single term that represents a phrase.  This would happen  
during the analysis process, which occurs along with tokenization.


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message