Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id C2F40200D10 for ; Sat, 26 Aug 2017 01:07:11 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id BFA4616D7C1; Fri, 25 Aug 2017 23:07:11 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 0F7D316D7C0 for ; Sat, 26 Aug 2017 01:07:10 +0200 (CEST) Received: (qmail 45816 invoked by uid 500); 25 Aug 2017 23:07:10 -0000 Mailing-List: contact dev-help@opennlp.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@opennlp.apache.org Delivered-To: mailing list dev@opennlp.apache.org Received: (qmail 45802 invoked by uid 99); 25 Aug 2017 23:07:09 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 25 Aug 2017 23:07:09 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 539A11812FA for ; Fri, 25 Aug 2017 23:07:09 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.071 X-Spam-Level: X-Spam-Status: No, score=-0.071 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id BxKY093SYwUV for ; Fri, 25 Aug 2017 23:07:08 +0000 (UTC) Received: from mail-qt0-f182.google.com (mail-qt0-f182.google.com [209.85.216.182]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id CFC4B5FC5D for ; Fri, 25 Aug 2017 23:07:07 +0000 (UTC) Received: by mail-qt0-f182.google.com with SMTP id u11so5769049qtu.1 for ; Fri, 25 Aug 2017 16:07:07 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:content-transfer-encoding:mime-version:subject:date:references :to:in-reply-to:message-id; bh=RcxSfY/59de8P2OZZIx3V6Yra7tapZYV+biQBj4Vo2E=; b=kqFDfLfS0JvfAZX4YgYNTzm5W/HoYAiBs5+f5Cg8pYgUVE9CVlfWbrlQ1SDscCB4pt uGIIWdMabTaVDuqYI5iCCzgOVVl2pSOVIYI2XVWL7Vv9BIQNVK1uZVItApGsv0oQv+sf c4wPuCS+iy8PKC2Sj3/82GYb0jdKIzliagI/8UGGIH6kzRlpMDt4G03+BvswFu1JAWl8 CHuTn3Oji3wcl3q8W68SLSoJxX5s5LnxVswMhj6zNf+PtFdrGaPckIiy+Sl4SrSuDBSQ Z2yQTWMXPBdnjifrG7m3Ktlju11r0dWFd4s/q8sRSHLy/6b4PjDJoSc7whLMEfWB4MwZ DD8w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:content-transfer-encoding:mime-version :subject:date:references:to:in-reply-to:message-id; bh=RcxSfY/59de8P2OZZIx3V6Yra7tapZYV+biQBj4Vo2E=; b=umVoX7Za4WkF3yqOrRhWVOAGFJza0hsM29QU8RpWzMtCYy56Bn10rlMR4Hn8NoImcN e+m++ktihd5oreBI7o9Pl7KEmb532tPPkOQvCT5KkNzxmIxObUPKtC1GfaCgFEjO8wXG NfE2IOYC7/P7ST/vZLvrLleOI+OhuJPVq0mRLo0oUe8dm0iuGtLVTmzTd1xHzx+bsZSI yjEshgMsrMJS3BSd5caq0fcAPMEqz3x/lUVPeQHZxBpUMNEOi9vdkabQEx8asPrW1xkb wMS1Mz2KvQaE1cZq05QDBql/DllvPTIvuGQMOWWFp4jdAz2MLyJrBa/sJnrcBOyavlEB TVFQ== X-Gm-Message-State: AHYfb5juyx+t2ciDzQd0sWfJTTv01dSFyjbeR3iIguJIZvMLqdR9nYcZ xTDao450QRpT4kc3+m0= X-Received: by 10.200.0.196 with SMTP id d4mr70359qtg.3.1503702426663; Fri, 25 Aug 2017 16:07:06 -0700 (PDT) Received: from ?IPv6:2601:581:c301:5a23:c8ce:752f:e532:4a24? ([2601:581:c301:5a23:c8ce:752f:e532:4a24]) by smtp.gmail.com with ESMTPSA id o67sm4709290qte.71.2017.08.25.16.07.04 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 25 Aug 2017 16:07:06 -0700 (PDT) From: Daniel Russ Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (Mac OS X Mail 10.3 \(3273\)) Subject: Re: Early stopping NameFinderME Date: Fri, 25 Aug 2017 19:07:03 -0400 References: To: dev@opennlp.apache.org In-Reply-To: Message-Id: X-Mailer: Apple Mail (2.3273) archived-at: Fri, 25 Aug 2017 23:07:12 -0000 J=C3=B6rn, Currently, GISTrainer has a private static final variable = LLThreshold, which controls if the change in the log likelihood between = two iterations is too small. We could make this parameter. I am = concerned about using the accuracy to train the model. If we use = accuracy, the weight space may be flat. =20 Saurabh, you use the term =E2=80=9Cearly stopping=E2=80=9D. In deep = learning, early stopping is used to prevent overtraining and improve = generalization to unseen data. I am not sure early stopping serves the = same purpose with GIS training. Does anyone know if early stopping = improves generalization for a maxent problem? Daniel > On Aug 24, 2017, at 4:48 AM, Joern Kottmann = wrote: >=20 > You are the first one who ever asked this question. I think we have = this as > an option already on the gis trainer but it is not exposed all the way > through. >=20 > Please open a jira and I can look at it next week. >=20 > J=C3=B6rn >=20 > On Aug 21, 2017 5:11 PM, "Saurabh Jain" = wrote: >=20 >> Hi All >>=20 >> How can we use early stopping while training/crossvalidating custom = data >> with NameFinder ? What I want if change in likelihood value or = accuracy of >> model is less than 0.05 between two steps (differ by 5 i.e compare = x+5 step >> output with x step) then training should stop. I could not find = anything >> regarding this in documentation. Can some one please help ? >>=20 >> -- >> *Thanks & Regards* >>=20 >>=20 >> *Saurabh Jain * >> *AI Developer* >>=20 >> *Active Intelligence * >>=20 >> *"* >> *To do a thing yesterday was the best time . Second best time is = today .=E2=80=9D * >>=20