Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 18179 invoked from network); 11 Dec 2006 20:10:46 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 11 Dec 2006 20:10:46 -0000 Received: (qmail 82051 invoked by uid 500); 11 Dec 2006 20:10:51 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 82007 invoked by uid 500); 11 Dec 2006 20:10:51 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 81996 invoked by uid 99); 11 Dec 2006 20:10:50 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 11 Dec 2006 12:10:50 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 11 Dec 2006 12:10:42 -0800 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 4E2CD7140D1 for ; Mon, 11 Dec 2006 12:10:22 -0800 (PST) Message-ID: <1468428.1165867822317.JavaMail.jira@brutus> Date: Mon, 11 Dec 2006 12:10:22 -0800 (PST) From: "Doron Cohen (JIRA)" To: java-dev@lucene.apache.org Subject: [jira] Commented: (LUCENE-740) Bugs in contrib/snowball/.../SnowballProgram.java -> Kraaij-Pohlmann gives Index-OOB Exception In-Reply-To: <6343836.1165866021069.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ http://issues.apache.org/jira/browse/LUCENE-740?page=comments#action_12457462 ] Doron Cohen commented on LUCENE-740: ------------------------------------ In addition to SnowballProgram bug fix there are few updates in snowball.tartarus.org comparing to snowball stemmers in Lucene, and Hungarian stemmer was added. Any reason not to update all the stemmers with this fix? > Bugs in contrib/snowball/.../SnowballProgram.java -> Kraaij-Pohlmann gives Index-OOB Exception > ---------------------------------------------------------------------------------------------- > > Key: LUCENE-740 > URL: http://issues.apache.org/jira/browse/LUCENE-740 > Project: Lucene - Java > Issue Type: Bug > Components: Analysis > Affects Versions: 1.9 > Environment: linux amd64 > Reporter: Andreas Kohn > Priority: Minor > Attachments: lucene-1.9.1-SnowballProgram.java > > > (copied from mail to java-user) > while playing with the various stemmers of Lucene(-1.9.1), I got an > index out of bounds exception: > lucene-1.9.1>java -cp > build/contrib/snowball/lucene-snowball-1.9.2-dev.jar > net.sf.snowball.TestApp Kp bla.txt > Exception in thread "main" java.lang.reflect.InvocationTargetException > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:64) > at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > at java.lang.reflect.Method.invoke(Method.java:615) > at net.sf.snowball.TestApp.main(TestApp.java:56) > Caused by: java.lang.StringIndexOutOfBoundsException: String index out > of range: 11 > at java.lang.StringBuffer.charAt(StringBuffer.java:303) > at net.sf.snowball.SnowballProgram.find_among_b(SnowballProgram.java:270) > at net.sf.snowball.ext.KpStemmer.r_Step_4(KpStemmer.java:1122) > at net.sf.snowball.ext.KpStemmer.stem(KpStemmer.java:1997) > This happens when executing > lucene-1.9.1>java -cp > build/contrib/snowball/lucene-snowball-1.9.2-dev.jar > net.sf.snowball.TestApp Kp bla.txt > bla.txt contains just this word: 'spijsvertering'. > After some debugging, and some tests with the original snowball > distribution from snowball.tartarus.org, it seems that the attached > change is needed to avoid the exception. > (The change comes from tartarus' SnowballProgram.java) -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org