lucene-dev mailing list archives

From "Jesper Badstue (JIRA)" <j...@apache.org>
Subject [jira] Created: (LUCENE-2049) ArabicStemFilter removes numbers
Date Tue, 10 Nov 2009 12:02:27 GMT
ArabicStemFilter removes numbers
--------------------------------

                 Key: LUCENE-2049
                 URL: https://issues.apache.org/jira/browse/LUCENE-2049
             Project: Lucene - Java
          Issue Type: Bug
          Components: contrib/analyzers
    Affects Versions: 2.9.1, 2.9
            Reporter: Jesper Badstue


It seems that if a word consists of or contains a number, the word is removed by the Arabic stemmer.

The bug can be reproduced by adding the following unit test to TestArabicStemFilter.java:

public void testNonArabicNumbers() throws IOException {
    check("123", "123");
}

This test fails because the returned string is empty instead of "123".
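
One possible explanation, which is an assumption and has not been verified against the Lucene source: if the tokenizer that feeds the filter in the test only emits letter characters, the digits would be dropped before the stemmer ever sees them. The standalone sketch below uses a hypothetical LetterOnlySketch.tokenize helper (not a Lucene API) to show how such a letter-only tokenizer would produce no tokens for "123":

```java
import java.util.ArrayList;
import java.util.List;

public class LetterOnlySketch {
    // Hypothetical helper, not part of Lucene: splits text into runs of
    // letters and discards every other character, including digits.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c)) {
                current.append(c);
            } else if (current.length() > 0) {
                // A non-letter ends the current token.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("123"));    // prints []
        System.out.println(tokenize("abc123")); // prints [abc]
    }
}
```

If this is what happens, the empty result would come from the tokenizer rather than from ArabicStemFilter itself, so it may be worth checking which tokenizer the check() helper uses.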

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org

