lucene-java-user mailing list archives

From László Monda <l...@monda.hu>
Subject Re: Getting irrelevant results using fuzzy query
Date Mon, 23 Jun 2008 11:10:05 GMT
On Wed, 2008-06-18 at 21:10 +0200, Daniel Naber wrote:
> On Wednesday, 18 June 2008, László Monda wrote:
> 
> > Additional info: Lucene seems to do the right thing when only a few
> > documents are present, but goes crazy when there are about 1.5 million
> > documents in the index.
> 
> Lucene works well with more documents (currently using it with 9 million),
> but the fuzzy query requires iteration over all terms, which makes this
> query slow. This can be avoided by setting the prefixLength parameter of the
> FuzzyQuery constructor to 1 or 2. Or maybe you should use an n-gram index,
> see the spellchecker in the contrib area.

Thanks for the suggestion. I don't have any performance problems yet,
but I do have serious problems with the relevance of the results of
fuzzy queries.
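
For reference, here is a minimal sketch of the prefixLength idea quoted
above. It is not Lucene code: the class and method names are hypothetical,
and it uses a plain edit-distance cutoff (maxEdits) as a simplification,
whereas FuzzyQuery works with a minimum-similarity threshold. It only
illustrates why requiring a shared prefix lets most terms be skipped
before any distance is computed, and how that also changes which terms
can match at all.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only, not Lucene's implementation. All names are hypothetical.
final class FuzzyPrefixSketch {

    // Classic Levenshtein edit distance, computed with two rolling rows.
    static int editDistance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Returns the terms that share the first prefixLength characters of the
    // query and lie within maxEdits of it. With prefixLength 0 every term is
    // compared; with 1 or 2 most terms are rejected by the cheap prefix check.
    static List<String> fuzzyMatches(List<String> terms, String query,
                                     int prefixLength, int maxEdits) {
        int p = Math.min(prefixLength, query.length());
        String prefix = query.substring(0, p);
        List<String> hits = new ArrayList<>();
        for (String term : terms) {
            if (!term.startsWith(prefix)) {
                continue; // skipped without computing the edit distance
            }
            if (editDistance(term, query) <= maxEdits) {
                hits.add(term);
            }
        }
        return hits;
    }
}
```

Note that a non-zero prefix length also excludes terms whose typo is in
the first characters, so it trades some recall for speed.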

-- 
Laci  <http://monda.hu>

