lucene-dev mailing list archives

From Mark Bennett <>
Subject Re: FuzzyQuery vs SlowFuzzyQuery docs? -- was: Re: [jira] [Commented] (LUCENE-2667) Fix FuzzyQuery's defaults, so its fast.
Date Fri, 09 Nov 2012 23:48:49 GMT
Hi Robert,

On Thu, Sep 13, 2012 at 7:39 PM, Robert Muir <> wrote:

> ...
> ... I'm strongly against having this
> unscalable garbage in lucene's core.
> There is no use case for ed > 2, thats just crazy.

I promise you there ARE use cases for edit distances > 2, especially with
longer words.  Due to NDA I can't go into details.

Also ed>2 can be useful when COMBINING that low-quality part of the search
with other sub-queries, or additional business rules.  Maybe instead of
boiling an ocean this lets you just boil the sea.  ;-)

I won't comment on the quality of the older Levenshtein code, its likely
very slow performance, or where the code should live.

But your statement about "no use case for ed > 2" is simply not true.
(Whether you'd agree with any of those use cases is certainly another matter.)

I understand your concerns about it not being the default.  (Or maybe it
could carry a giant warning message or something, whatever.)
