commons-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject DO NOT REPLY [Bug 29080] - soundex encoding
Date Wed, 02 Jun 2004 01:04:30 GMT
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=29080>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=29080

soundex encoding





------- Additional Comments From ggregory@seagullsw.com  2004-06-02 01:04 -------
Instead of a ArrayIndexOutOfBoundsException, you will now get an
IllegalArgumentException with a "The character is not mapped: " message. Now in
CVS HEAD.

This is perhaps not ideal but note that the default behavior of the Soundex
class is to use the Soundex.US_ENGLISH_MAPPING constant. Users that desire a
different mapping can provide their own through the Soundex(char[]) constructor.

I've posted a note on commons-dev on this topic titled "[codec] Soudex issue
with accented character." with no replies so far. See
http://www.mail-archive.com/commons-dev@jakarta.apache.org/msg41974.html

Whether or not this is "as designed" depends on whether you think "fancy"
characters should be handled in Soundex.US_ENGLISH_MAPPING.

---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org


Mime
View raw message