commons-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LANG-1199) Fix implementation of StringUtils.getJaroWinklerDistance()
Date Sun, 05 Jun 2016 16:12:59 GMT

    [ https://issues.apache.org/jira/browse/LANG-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15315929#comment-15315929
] 

ASF GitHub Bot commented on LANG-1199:
--------------------------------------

Github user PascalSchumacher commented on the issue:

    https://github.com/apache/commons-lang/pull/148
  
    merged: https://github.com/apache/commons-lang/commit/c35a04102274b330aa833030c15f722b00062812


> Fix implementation of StringUtils.getJaroWinklerDistance()
> ----------------------------------------------------------
>
>                 Key: LANG-1199
>                 URL: https://issues.apache.org/jira/browse/LANG-1199
>             Project: Commons Lang
>          Issue Type: Bug
>          Components: lang.*
>    Affects Versions: 3.4
>            Reporter: M. Steiger
>            Assignee: Pascal Schumacher
>             Fix For: 3.5
>
>
> The current implementation of StringUtils.getJaroWinklerDistance() does not compute the
correct result in some cases. See #LANG-944 for the initial code contribution.
> StringUtils.getJaroWinklerDistance("Haus Ingeborg", "Ingeborg Esser") == 0.0
> This is due to the incorrect computation of common characters, which causes the algorithm
to exit prematurely.
> In contrast, the implementation in Lucene gives ~0.63, which is about right.
>     JaroWinklerDistance d = new JaroWinklerDistance();
>     getDistance("Haus Ingeborg", "Ingeborg Esser");
> See https://lucene.apache.org/core/3_0_3/api/contrib-spellchecker/org/apache/lucene/search/spell/JaroWinklerDistance.html



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message