lucene-java-user mailing list archives

From Daniel Naber <lucenelist2...@danielnaber.de>
Subject Re: Getting irrelevant results using fuzzy query
Date Wed, 18 Jun 2008 19:10:33 GMT
On Wednesday, 18 June 2008, László Monda wrote:

> Additional info: Lucene seems to do the right thing when only a few
> documents are present, but goes crazy when there are about 1.5 million
> documents in the index.

Lucene works well with more documents than that (I am currently using it
with 9 million), but a fuzzy query has to iterate over all terms in the
index, which makes it slow. This can be avoided by setting the prefixLength
parameter of the FuzzyQuery constructor to 1 or 2. Alternatively, you could
use an n-gram index; see the spellchecker in the contrib area.

Regards
 Daniel

-- 
http://www.danielnaber.de
