lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <>
Subject [jira] [Commented] (LUCENE-4024) FuzzyQuery should never do edit distance > 2
Date Mon, 07 May 2012 19:52:48 GMT


Robert Muir commented on LUCENE-4024:

Did you look at mmmm say the changes to package.html?
 /** Implements the fuzzy search query. The similarity measurement
- * is based on the Levenshtein (edit distance) algorithm.
+ * is based on the Damerau-Levenshtein (optimal string alignment) algorithm.

+   * @param transpositions true if transpositions should be treated as a primitive
+   *        edit operation. If this is false, comparisons will implement the classic
+   *        Levenshtein algorithm.

-<p>Lucene supports fuzzy searches based on the Levenshtein Distance, or Edit Distance
+<p>Lucene supports fuzzy searches based on Damerau-Levenshtein Distance. 

> FuzzyQuery should never do edit distance > 2
> --------------------------------------------
>                 Key: LUCENE-4024
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>            Reporter: Michael McCandless
>             Fix For: 4.0
>         Attachments: LUCENE-4024.patch
> Edit distance 1 and 2 are now very very fast compared to 3.x (100X-200X faster) ... but
edit distance 3 will fallback to the super-slow scan all terms in 3.x, which is not graceful
> Not sure how to fix it ... mabye we have a SlowFuzzyQuery?  And FuzzyQuery throws exc
if you try to ask it to be slow?  Or, we add boolean (off by default) that you must turn on
to allow slow one..?

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message