commons-issues mailing list archives

From "Gary Gregory (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CODEC-235) Revised / Alternate NYSIIS
Date Wed, 31 May 2017 17:51:05 GMT

    [ https://issues.apache.org/jira/browse/CODEC-235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16031586#comment-16031586 ]

Gary Gregory commented on CODEC-235:
------------------------------------

OK, sounds good. Feel free to submit a patch with unit tests. You can create a PR on GitHub
or attach a diff file here, it's up to you. You'll need to come up with a class name and Javadoc
that make it clear how the new class differs from the current Nysiis class. Perhaps the Javadoc
on the existing Nysiis class can be improved as well?

> Revised / Alternate NYSIIS
> --------------------------
>
>                 Key: CODEC-235
>                 URL: https://issues.apache.org/jira/browse/CODEC-235
>             Project: Commons Codec
>          Issue Type: New Feature
>            Reporter: Juan Garcia
>            Priority: Minor
>
> I have been dabbling in phonetic algorithms lately, and it is pleasing to see that I can
find something under the commons umbrella for this area as well, so thanks a ton for that.
> As for the feature request: I believe NYSIIS as implemented here follows the original
release from the 1970s. Not being savvy in this area, it took me too long to realize
that the results I was seeing from Oracle's implementation, as referenced here https://docs.oracle.com/cd/E18150_01/javadocs/SunMasterIndex/com/sun/mdm/index/phonetic/impl/Nysiis.html,
differ from what exists in commons-codec 1.10.
> A series of searches brought me to this cool page, http://www.dropby.com/NYSIIS.html, which
illustrates the differences between the original NYSIIS and the refined / alternate NYSIIS
as each replacement occurs.
> I would gladly put in more research on specifications, sample data for tests, the tests
themselves, and even development if this is something the team wishes to see become part of
commons-codec. Other Google searches do yield some implementations that I am using for the
time being, but having this already packaged in a library I use daily would be very welcome
in my book.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
