lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Rennie (JIRA)" <j...@apache.org>
Subject [jira] Created: (LUCENE-1417) Allowing for distance measures that incorporate frequency/popularity for SuggestWord comparison
Date Fri, 10 Oct 2008 16:21:44 GMT
Allowing for distance measures that incorporate frequency/popularity for SuggestWord comparison
-----------------------------------------------------------------------------------------------

                 Key: LUCENE-1417
                 URL: https://issues.apache.org/jira/browse/LUCENE-1417
             Project: Lucene - Java
          Issue Type: Improvement
          Components: contrib/spellchecker
    Affects Versions: 2.4
            Reporter: Jason Rennie


Spelling suggestions are currently ordered first by a string edit distance measure, then by
popularity/frequency.  This limits the ability of popularity/frequency to affect suggestions.
 I think it would be better for the distance measure to accept popularity/frequency as an
argument and provide a distance/score that incorporates any popularity/frequency considerations.
 I.e. change StringDistance.getDistance to accept an additional argument: frequency of the
potential suggestion.

The new SuggestWord.compareTo function would only order by score.  We could achieve the existing
behavior by adding a small inverse frequency value to the distances.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message