Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 190BD200BC8 for ; Wed, 23 Nov 2016 13:07:00 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 17ADE160AFD; Wed, 23 Nov 2016 12:07:00 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 3C497160AFA for ; Wed, 23 Nov 2016 13:06:59 +0100 (CET) Received: (qmail 88840 invoked by uid 500); 23 Nov 2016 12:06:58 -0000 Mailing-List: contact dev-help@opennlp.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@opennlp.apache.org Delivered-To: mailing list dev@opennlp.apache.org Received: (qmail 88822 invoked by uid 99); 23 Nov 2016 12:06:58 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Nov 2016 12:06:58 +0000 Received: from mail-oi0-f45.google.com (mail-oi0-f45.google.com [209.85.218.45]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id ADBC51A00A8 for ; Wed, 23 Nov 2016 12:06:57 +0000 (UTC) Received: by mail-oi0-f45.google.com with SMTP id b126so11800619oia.2 for ; Wed, 23 Nov 2016 04:06:57 -0800 (PST) X-Gm-Message-State: AKaTC02bjtwnRJkpGzRU3AnHDBYbDleWlIvuXi5cxqzWcuj26RhWj0gwm3+EvZldJ7H+35pgc/lQp7RkbhNfwQ== X-Received: by 10.202.80.149 with SMTP id e143mr1224456oib.93.1479902816987; Wed, 23 Nov 2016 04:06:56 -0800 (PST) MIME-Version: 1.0 Received: by 10.182.72.130 with HTTP; Wed, 23 Nov 2016 04:06:16 -0800 (PST) In-Reply-To: References: <1478541540.3316.7.camel@gmail.com> From: Rodrigo Agerri Date: Wed, 23 Nov 2016 13:06:16 +0100 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: Next release To: "dev@opennlp.apache.org" Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable archived-at: Wed, 23 Nov 2016 12:07:00 -0000 Hello, I have added the lemmatizer info to RELEASE in opennlp-distr. Commit pushed to new branch 889. Rodrigo R On Thu, Nov 10, 2016 at 11:42 AM, William Colen w= rote: > Cool. There is a lot of PlainTextByLineStream references in deprecated > methods, specially main methods. I will ignore them and you can remove th= e > main method when you go through each tool. > I will focus on PlainTextByLineStream that are not inside deprecated > methods. > > > 2016-11-10 6:39 GMT-02:00 Joern Kottmann : > >> Ok, I created a couple of issues and will go through them rather quickly= . >> >> J=C3=B6rn >> >> On Thu, Nov 10, 2016 at 3:36 AM, William Colen >> wrote: >> >> > J=C3=B6rn, I can help removing deprecated code. I started with >> > PlainTextByLineStream. It is used everywhere so there is a lot to chan= ge. >> > >> > >> > 2016-11-08 9:08 GMT-02:00 Joern Kottmann : >> > >> > > I suggest we remove more deprecated code, there is still a lot which >> > could >> > > be removed and is really old. >> > > It is a bit of a boring task, if anyone has some spare cycles help >> would >> > be >> > > welcome. >> > > >> > > J=C3=B6rn >> > > >> > > On Tue, Nov 8, 2016 at 9:59 AM, Aliaksandr Autayeu < >> > aliaksandr@autayeu.com >> > > > >> > > wrote: >> > > >> > > > +1 for 1.7 (also due to lemmatized changes and removal of deprecat= ed >> > > code). >> > > > >> > > > On 8 November 2016 at 09:48, Rodrigo Agerri >> > wrote: >> > > > >> > > > > Hello, >> > > > > >> > > > > +1 1.7.0 in next release and +1 for a yearly release >> > > > > >> > > > > Just to provide some info, the main changes in the lemmatizer ha= ve >> > > been: >> > > > > >> > > > > 1. Added a supervised statistical lemmatizer, usable from the CL= I >> and >> > > > > API. The supervised lemmaitzer now provides a much better covera= ge >> > for >> > > > > unknown words with respect to the previously existing >> > dictionary-based >> > > > > one. >> > > > > 2. The lemmatizer component has been rewritten and the API >> therefore >> > > > > has substantially changed. Thus, the changes in the >> Dictionary-based >> > > > > lemmatizer are not backward compatible. In any case, I do not th= ink >> > > > > that so many people was using it and the change at using the API= is >> > > > > minor. >> > > > > >> > > > > The new statistical lemmatizer can support the Dictionary-based >> > > > > lemmatizers often used to provide features for components such a= s >> > Word >> > > > > Sense Disambiguation, Opinion Mining/Sentiment Analysis, etc. In >> this >> > > > > regard, it will be nice to aim at working on the development of >> those >> > > > > two components for their release. Maybe the next release is too >> > close, >> > > > > but definitely for the next one. >> > > > > >> > > > > Cheers, >> > > > > >> > > > > Rodrigo >> > > > > >> > > > > On Mon, Nov 7, 2016 at 7:01 PM, Russ, Daniel (NIH/CIT) [E] >> > > > > wrote: >> > > > > > Also the lemmatizer has significantly changed. I vote 1.7 >> > > > > > >> > > > > > On 11/7/16, 12:59 PM, "Joern Kottmann" >> wrote: >> > > > > > >> > > > > > Hello all, >> > > > > > >> > > > > > since our last release it has been a while and we received >> > quite >> > > a >> > > > > few >> > > > > > changes which would be nice to get released. >> > > > > > >> > > > > > There are still some open Jira issues, but mostly smaller >> > things >> > > > that >> > > > > > can be wrapped up rather quickly. >> > > > > > >> > > > > > Is there anything important missing which should go into t= he >> > next >> > > > > > release? Otherwise I think we should also aim for more >> frequent >> > > > > > released and just make one again early next year, with all >> the >> > > > stuff >> > > > > we >> > > > > > might miss out now. >> > > > > > >> > > > > > We took in a patch - as part of OPENNLP-830 - to replace o= ur >> > > > > self-made >> > > > > > hash table with the java.util.HashMap. This change is not >> > > backward >> > > > > > compatible for folks who extend AbstractModel. >> > > > > > >> > > > > > Should we go with 1.6.1 as a next version or should we mak= e >> > 1.7.0 >> > > > to >> > > > > > reflect that? >> > > > > > >> > > > > > Previously we only had backward incompatible changes in >> > versions >> > > > > which >> > > > > > bumped by the second number. Maybe that is better choice. = It >> > will >> > > > > > probably break some peoples code when they update. >> > > > > > >> > > > > > We also have lots of deprecated API still in OpenNLP, shou= ld >> we >> > > try >> > > > > to >> > > > > > remove as much as possible of it now? >> > > > > > >> > > > > > J=C3=B6rn >> > > > > > >> > > > > > >> > > > > >> > > > >> > > >> > >>