lucene-java-user mailing list archives

From Daniel Naber
Subject Re: Getting irrelevant results using fuzzy query
Date Wed, 18 Jun 2008 19:10:33 GMT
On Wednesday, 18 June 2008, László Monda wrote:

> Additional info: Lucene seems to do the right thing when only a few
> documents are present, but goes crazy when there are about 1.5 million
> documents in the index.

Lucene works well with more documents than that (I'm currently using it 
with 9 million), but a fuzzy query has to iterate over all terms in the 
index, which makes it slow. This can be avoided by setting the prefixLength 
parameter of the FuzzyQuery constructor to 1 or 2, so that only terms 
sharing that many leading characters with the query term are compared. Or 
maybe you should use an n-gram index; see the spellchecker in the contrib 
area.


