lucene-dev mailing list archives

From "luciano aparecido dos santos (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENE-4907) Tokenizer
Date Thu, 04 Apr 2013 18:05:17 GMT
luciano aparecido dos santos created LUCENE-4907:
----------------------------------------------------

             Summary: Tokenizer
                 Key: LUCENE-4907
                 URL: https://issues.apache.org/jira/browse/LUCENE-4907
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/analysis
    Affects Versions: 4.2
         Environment: WINDOWS 7, NETBEANS 7.2.1
            Reporter: luciano aparecido dos santos


Hi,

    I'm trying to make search accent-insensitive, so that words still match once special
characters such as accents are removed ("ANDRÉ" equals "ANDRE").
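
To make the goal concrete, here is a minimal sketch of accent folding using only the JDK's java.text.Normalizer. It is not the ICUTokenizer approach and not part of Lucene; the class name AccentFolding and the method fold are hypothetical names chosen for illustration. Within Lucene itself, a token filter such as ASCIIFoldingFilter (analyzers-common) or ICUFoldingFilter (analyzers-icu) is probably the more usual place for this step, since a tokenizer on its own does not remove accents.

```java
import java.text.Normalizer;

public class AccentFolding {

    // Hypothetical helper: decompose each character into base letter plus
    // combining marks (NFD), then drop the combining marks (category M).
    static String fold(String input) {
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // "\u00C9" is É, written as an escape so the source file encoding does not matter.
        String folded = fold("ANDR\u00C9");
        System.out.println(folded);                 // prints "ANDRE"
        System.out.println(folded.equals("ANDRE")); // prints "true"
    }
}
```

Applying the same folding to both the indexed text and the query text is what would make the two spellings match.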
I found the class ICUTokenizer, but using it throws the following exception:

Exception in thread "main" java.lang.NoSuchMethodError: com.ibm.icu.text.UnicodeSet.freeze()Lcom/ibm/icu/text/UnicodeSet;
	at org.apache.lucene.analysis.icu.segmentation.LaoBreakIterator.<clinit>(LaoBreakIterator.java:73)
	at org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig.<clinit>(DefaultICUTokenizerConfig.java:73)
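
A NoSuchMethodError generally means that the class loaded at run time is not the version the calling code was compiled against. One possible cause here, which is an assumption and not something the report confirms, is a different icu4j jar on the classpath than the one the analyzers-icu module expects. The sketch below reports where a class was loaded from; WhichJar and locationOf are hypothetical names used only for illustration.

```java
import java.security.CodeSource;

public class WhichJar {

    // Hypothetical helper: returns the jar or directory a class was loaded from.
    static String locationOf(String className) {
        try {
            // initialize=false: locate the class without running its static initializers.
            Class<?> cls = Class.forName(className, false, WhichJar.class.getClassLoader());
            CodeSource source = cls.getProtectionDomain().getCodeSource();
            return source == null
                    ? "(bootstrap or unknown location)"
                    : String.valueOf(source.getLocation());
        } catch (ClassNotFoundException e) {
            return "(not on classpath)";
        }
    }

    public static void main(String[] args) {
        // The class named in the stack trace above.
        System.out.println(locationOf("com.ibm.icu.text.UnicodeSet"));
    }
}
```

If the reported location is not the icu4j jar that ships alongside the Lucene release in use, that mismatch would be the first thing to check.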


      Can somebody help me?

Page ref.: http://lucene.apache.org/core/4_2_0/analyzers-icu/index.html



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

