lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Thomas Morton (JIRA)" <j...@apache.org>
Subject [jira] Commented: (LUCENE-1550) Add N-Gram String Matching for Spell Checking
Date Wed, 22 Apr 2009 03:19:47 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12701362#action_12701362
] 

Thomas Morton commented on LUCENE-1550:
---------------------------------------

The implementations returns a normalized edit distance (normalized by string length) and specifically
1 if the strings are the same and 0 if that are maximally different.  0 in that case makes
sense as the number of edits is equal to the number of characters in the longest string, so:

1- (2 edits /2 length) = 0


> Add N-Gram String Matching for Spell Checking
> ---------------------------------------------
>
>                 Key: LUCENE-1550
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1550
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: contrib/spellchecker
>    Affects Versions: 2.9
>            Reporter: Thomas Morton
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 2.9
>
>         Attachments: LUCENE-1550.patch, LUCENE-1550.patch, LUCENE-1550.patch
>
>
> N-Gram version of edit distance based on paper by Grzegorz Kondrak, "N-gram similarity
and distance". Proceedings of the Twelfth International Conference on String Processing and
Information Retrieval (SPIRE 2005), pp. 115-126,  Buenos Aires, Argentina, November 2005.

> http://www.cs.ualberta.ca/~kondrak/papers/spire05.pdf

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message