Return-Path: X-Original-To: apmail-opennlp-users-archive@www.apache.org Delivered-To: apmail-opennlp-users-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 902ED923C for ; Wed, 4 Apr 2012 12:31:01 +0000 (UTC) Received: (qmail 46057 invoked by uid 500); 4 Apr 2012 12:31:01 -0000 Delivered-To: apmail-opennlp-users-archive@opennlp.apache.org Received: (qmail 45989 invoked by uid 500); 4 Apr 2012 12:31:01 -0000 Mailing-List: contact users-help@opennlp.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@opennlp.apache.org Delivered-To: mailing list users@opennlp.apache.org Received: (qmail 45979 invoked by uid 99); 4 Apr 2012 12:31:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Apr 2012 12:31:01 +0000 X-ASF-Spam-Status: No, hits=-0.5 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jimpil1985@gmail.com designates 209.85.214.49 as permitted sender) Received: from [209.85.214.49] (HELO mail-bk0-f49.google.com) (209.85.214.49) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Apr 2012 12:30:52 +0000 Received: by mail-bk0-f49.google.com with SMTP id jk13so298983bkc.22 for ; Wed, 04 Apr 2012 05:30:32 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding; bh=8Zv0QOmlXy/BY0Ca+wQCMyPjRBEKLUHl2kJrRvogscY=; b=Oqizi+2HwB6uy6dV6gagStmIlhnmYZBU+b2psnVTanStjLawC21QfmlAWdIqCOW0bM wrdfL6tmu2Ng2FQ+mmYi6oxw4TYb+HEXmvaBhSGGHBpjgZ7ILOHO7PALaAOS3NyUofJX kDLuQ5+/twRAuCZxPBcROVKsRXCPy1vOV3ZdQ2Pb236/HAYYWNUWkvUmMLUtilNnU6Mo oDAuGm/OOc77owUDg7SUdapKmLf5ndyAj+hTRLVGjwKPKx2JMw9vblPHYoq6mpp5Yd5J n2W5GJgQkqYOthg4+32lyzF3zKPbHIoB6f5GYFfH8nppyihBgvkddScMHtle/fzljuKX obzg== Received: by 10.204.133.195 with SMTP id g3mr7051638bkt.73.1333542632116; Wed, 04 Apr 2012 05:30:32 -0700 (PDT) Received: from [192.168.0.3] (host86-175-184-43.wlms-broadband.com. [86.175.184.43]) by mx.google.com with ESMTPS id p19sm1542024bka.1.2012.04.04.05.30.30 (version=SSLv3 cipher=OTHER); Wed, 04 Apr 2012 05:30:31 -0700 (PDT) Message-ID: <4F7C3EE4.8060105@gmail.com> Date: Wed, 04 Apr 2012 13:30:28 +0100 From: "Jim - FooBar();" User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:11.0) Gecko/20120310 Thunderbird/11.0 MIME-Version: 1.0 To: users@opennlp.apache.org Subject: Re: will version 1.5.3 be announced here? References: <4F7871B5.80504@gmail.com> <4F788FC5.1010400@gmail.com> <4F79963C.6060308@gmail.com> <4F7ACAEB.3040501@gmail.com> <4F7AD2F1.3050509@gmail.com> <4F7AD3C9.9050800@gmail.com> <4F7ADE17.9020205@gmail.com> <4F7C2691.8010104@gmail.com> <4F7C364C.5020905@gmail.com> <4F7C370D.2000509@gmail.com> <4F7C3CD9.70703@gmail.com> In-Reply-To: <4F7C3CD9.70703@gmail.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 8bit Also how come the Evaluator is an abstract class with abstract methods? Shouldn't it be an interface? Jim On 04/04/12 13:21, Jim - FooBar(); wrote: > On 04/04/12 12:57, J�rn Kottmann wrote: >> Doing this via our API should work right? You just create an >> evaluator and pass in >> the DictionaryNameFinder instance and the test data. > > Hmm...it is not that simple! what about the maxent model? Let's say you > > --create a class that extends Evaluator (as all evaluators do) > --allow the constructor of that class to take variable number of > arguments (of type TokenNameFinder) so we can use them later on > --pretty much copy the code from TokenNameFinderEvaluator and paste it > in the new class > --make sure the code asks both (or however many) name-finders before > it classifies a prediction as right or wrong. > > Up to here everything is quite straight forward...now the problems > begin!!! > The Dictionary has to be evaluated on separate data than the > model...That is because the Dictionary can only deal with the > "default tag" and nothing else! A quick workaround would be to retrain > the maxent model with "default" tags so i can evaluate it on the > dictionary's test-set but then what about multiple types??? > > Jim > > p.s. Jorn, we had this discussion before... I do believe that the > Dictionary should be able to deal with whatever tag the user supplied > just like the maxent model does...