commons-dev mailing list archives

From Rob Tompkins <chtom...@gmail.com>
Subject Re: [text] Next steps.
Date Tue, 29 Nov 2016 22:11:52 GMT

> On Nov 29, 2016, at 2:02 PM, Benedikt Ritter <britter@apache.org> wrote:
> 
> +1 for an early release without the name parser. We can implement the
> algorithms from the book in 1.1.

Sounds good. I'll start heading in that direction.

-Rob

> 
> Benedikt
> 
> Gary Gregory <garydgregory@gmail.com> wrote on Tue, Nov 29, 2016 at 00:34:
> 
>> +1 to removing the human name code for 1.0. Put it in a branch IMO.
>> 
>> Gary
>> 
>> On Mon, Nov 28, 2016 at 3:12 PM, Bruno P. Kinoshita <
>> brunodepaulak@yahoo.com.br.invalid> wrote:
>> 
>>> Hi Rob,
>>> 
>>> First of all, kudos for the great work moving things from [lang] into
>>> [text].
>>> 
>>> I got a copy of the Lothaire book last weekend, but haven't had a chance
>>> to read it yet.
>>> 
>>> There was also some discussion around the name-parser, and since we
>>> couldn't reach a consensus, I think we could either try to have another
>>> discussion thread, or stash it somewhere so that it doesn't block a
>>> release.
>>> 
>>> 
>>> I would also like to implement more edit distances and string
>>> similarities, as well as look into the duration unit parser, probably
>>> adapting code from github.com/jchampemont/gunip
>>> 
>>> 
>>> But I'd vote for (4). First moving the human name parser elsewhere,
>>> reviewing the edit distances,
>>> and checking if there's anything else we could put into this initial
>>> release from [lang].
>>> 
>>> Once it has been released, we will be able to add things from the Lothaire
>>> book, more edit distances, maybe bring back the name parser, as well as
>>> any enhancements and bug fixes.
>>> 
>>> Bruno
>>> 
>>>> ________________________________
>>>> From: Rob Tompkins <chtompki@gmail.com>
>>>> To: Commons Developers List <dev@commons.apache.org>
>>>> Sent: Tuesday, 29 November 2016 11:45 AM
>>>> Subject: [text] Next steps.
>>>> 
>>>> 
>>>> Hello,
>>>> 
>>>> I'm a tad curious what folks (along with Gary, Benedikt, and Bruno) think
>>>> the next steps are for text, in the hopeful thought that we are eventually
>>>> heading towards a 1.0 release. Some thoughts that come to mind are:
>>>> 
>>>> (1) Go over lang with a fine-tooth comb and see what we think should move,
>>>> (2) Go through the Lothaire "Applied Combinatorics on Words" book
>>>> (http://lipn.univ-paris13.fr/~duchamp/Books&more/Lothaire/(Encyclopedia_of_Mathematics_and_its_Applications_)M._Lothaire-Applied_Combinatorics_On_Words-Cambridge_University_Press(2005).pdf)
>>>> and minimally implement some of the standard algorithms.
>>>> (3) Implement, from the Lothaire book, some of the more complex stuff:
>>>> heavier pattern matching, and/or natural language processing,
>>>> and/or
>>>> (4) Go straight for a release.
>>>> 
>>>> I'm less in favor of (4) because I think there are probably some smaller
>>>> bits of code in lang that should come over. I like the idea of (2) before
>>>> heading out the door. Regarding (3), I would have to do considerable
>>>> reading to make considerable headway here. I'm not opposed to doing that,
>>>> but it would prolong getting to a 1.0 release if we predicated the release
>>>> upon my getting that done.
>>>> 
>>>> So, what do you guys think?
>>>> 
>>>> Cheers,
>>>> -Rob
>>>> 
>>>> 
>>>> 
>>> 
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: dev-unsubscribe@commons.apache.org
>>> For additional commands, e-mail: dev-help@commons.apache.org
>>> 
>>> 
>> 
>> 
>> --
>> E-Mail: garydgregory@gmail.com | ggregory@apache.org
>> Blog: http://garygregory.wordpress.com
>> Home: http://garygregory.com/
>> Tweet! http://twitter.com/GaryGregory
>> 

