commons-dev mailing list archives

From: Benedikt Ritter
Subject: Re: [LANG] New class called StringAlgorithms?
Date: Sat, 18 Jan 2014 15:04:54 GMT

2014/1/18 Gary Gregory:

> A new component just for this? Ugh. I'd rather find it a home in one of
> our current components. A new component in this case seems like an orphan
> in waiting. If we really want a new component instead of Lang or Codec, I
> could see a Text component which would get lang's text package as well...
> But a new component would be my last choice.

I agree. The new implementation (the Jaro-Winkler distance) was contributed
by someone. I want to give people the feeling that the stuff they
contribute is usable shortly afterwards. So my goal is to get this into
[lang] and release it in early February.

About putting this into [codec]: I still don't think it is a good fit for
this contribution. Codec is about, well, decoding and encoding stuff. The
Jaro-Winkler and Levenshtein distances are more like scores or metrics that
help in comparing strings.
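
As a rough illustration of what "metric" means here, below is a minimal
sketch of the Levenshtein distance. The class name StringMetricsSketch and
the method name levenshtein are made up for this example. This is not the
[lang] implementation and not a proposal for the final API. It only shows
that the result is a number describing how far apart two strings are,
rather than an encoded or decoded form of either one.

```java
// Hypothetical illustration only: the class and method names are assumptions
// and do not correspond to any existing Commons API.
public final class StringMetricsSketch {

    private StringMetricsSketch() {
    }

    // Levenshtein distance: the minimum number of single-character
    // insertions, deletions, or substitutions needed to turn a into b.
    public static int levenshtein(CharSequence a, CharSequence b) {
        // previous[j] holds the distance between the first i-1 chars of a
        // and the first j chars of b; current[j] is the row being filled.
        int[] previous = new int[b.length() + 1];
        int[] current = new int[b.length() + 1];

        // Turning an empty prefix of a into the first j chars of b
        // takes j insertions.
        for (int j = 0; j <= b.length(); j++) {
            previous[j] = j;
        }

        for (int i = 1; i <= a.length(); i++) {
            // Turning the first i chars of a into an empty string
            // takes i deletions.
            current[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                current[j] = Math.min(
                        Math.min(current[j - 1] + 1, previous[j] + 1),
                        previous[j - 1] + cost);
            }
            // Reuse the two rows instead of allocating a full matrix.
            int[] swap = previous;
            previous = current;
            current = swap;
        }
        return previous[b.length()];
    }
}
```

A call such as StringMetricsSketch.levenshtein("lang", "land") yields a
small integer (one substitution), which is a score and not a transformed
string. That is the reason it sits oddly next to encoders and decoders.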

Proposal: I'll implement something in [lang], so everybody can have a look
and comment. Seeing code is always better for discussion than just talking
about what could be done.


> Gary
> -------- Original message --------
> From: Emmanuel Bourg
> Date: 01/17/2014 17:25 (GMT-05:00)
> To: Commons Developers List
> Subject: Re: [LANG] New class called StringAlgorithms?
> On 17/01/2014 14:07, Gary Gregory wrote:
> > All of this is starting to feel like drifting away from [lang] but toward
> > what I am not sure, maybe [codec].
> What about creating a commons-string-metric component (or any better
> name)? Wikipedia has a list of about 20 string metric algorithms; the
> field looks broad enough to deserve a dedicated component.
> Emmanuel Bourg

