From: Thilo Goetz
Date: Thu, 06 Mar 2008 06:39:58 +0100
To: uima-user@incubator.apache.org
Subject: Re: Bug with POS tagger in Sandbox

Julien Nioche wrote:
> Hi guys,
>
> I don't think there is a separate list for the sandbox so I am sending to
> the main one. I've found a problem with the HMM Tagger which generates :
>
> java.lang.ArrayIndexOutOfBoundsException: -2
>     at java.util.ArrayList.get(ArrayList.java:323)
>     at org.apache.uima.examples.tagger.Viterbi.process(Viterbi.java:328)
>
> when I analyze the following sentence "pH The test." (tokenized as
> [pH][The][test][.])
>
> Who usually looks after that part of the code?
>
> Thanks
>
> Julien
>

Julien,

is this with the latest version of the tagger (as of
last Friday, I think)? If so, please open a Jira issue.
Thanks.

--Thilo