commons-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Oliver Heger <>
Subject Re: [LANG] New class called StringAlgorithms?
Date Sat, 18 Jan 2014 17:12:37 GMT

Am 18.01.2014 17:40, schrieb Emmanuel Bourg:
> Le 18/01/2014 16:04, Benedikt Ritter a écrit :
>> About putting this into codec: I still don't think this is a good fit for
>> this contribution. Codec is about, well decoding and encoding stuff. Jaro
>> Winkler and Levenshtein Distance are more like scores or metrics that help
>> in comparing strings.
> The point is, string metrics and soundex algorithm are often used to
> find similarities between words. That's a bit odd to have them in
> separate packages. That being said, string metrics doesn't look like a
> good fit for codec since it doesn't encode anything.

>From a logic PoV I agree with Emmanuel that a separate Text component
would make sense. It could also contain other stuff like special search
algorithms or trie implementations.

>From an organizational PoV I also understand Gary: It is unlikely that
we have the energy and man power to keep such a new component alive -
except someone steps up now?

So I am on the fence. In past we have always tried to keep [lang] very
focused and lean.


> Emmanuel Bourg
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message