lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mark Miller <>
Subject Re: query with stemming, prefix and fuzzy?
Date Fri, 30 Jan 2009 19:27:34 GMT
Gert Brinkmann wrote:
> <int name="distinct">57971</int> <!-- is this much? -->
Its a lot for a small index. The fuzzy query will enumerate all of those 
terms and calculate an edit distance. Its not an insane amount of work, 
but it jives with the slowness you see. Doing that 60,000 times for a 
query is not that fast.

Unfortunately, without the prefix setting, FuzzyQueries are slow, slow 
with that many uniques. Solr should def allow the prefix to be set. 
There was talk a couple years back about changing the default prefix 
value in Lucene because its so slow - but it didn't happen. The 
developers decided that you could tweak it yourself if you needed to be 
able to scale (if you add a prefix length, up to that length won't be 
fuzzy). Unfortunately, Solr hasnt yet given this option to my knowledge.

- Mark

View raw message