lucene-dev mailing list archives

From Geir Ove Grønmo <gr...@ontopia.net>
Subject Re: [Lucene-dev] Katakana characters in queries (a bug?)
Date Mon, 22 Oct 2001 09:01:35 GMT
* Geir Ove Grønmo
| There seems to be a bug in the lucene-1.2-rc1.jar distribution. Searching
| for the following string returns an error message from the query parser.
| 
| String katakana = "\u30AB\u30BF\u30AB\u30CA";
| 
| - - - 
| org.apache.lucene.queryParser.TokenMgrError: Lexical error at line 1, column 10.  Encountered: "\u00ab" (171), after : ""
| 	at org.apache.lucene.queryParser.QueryParserTokenManager.getNextToken(Unknown Source)
| 	at org.apache.lucene.queryParser.QueryParser.jj_ntk(Unknown Source)
| 	at org.apache.lucene.queryParser.QueryParser.Clause(Unknown Source)
| 	at org.apache.lucene.queryParser.QueryParser.Query(Unknown Source)
| 	at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
| 	at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
|         ...
| - - -
| 
| This query used to work in the 1.0 release.

Can anybody confirm this bug?
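
One observation that may help whoever looks into this: the character reported in the error, "\u00ab" (171), has the same value as the low 8 bits of the first katakana character, \u30AB. The sketch below only checks that arithmetic for each character in the test string. Whether the query parser actually discards the high byte somewhere is an assumption; nothing in this report confirms it. The class and method names are made up for illustration and are not part of Lucene.

```java
// Hypothetical helper, not part of Lucene: prints the low 8 bits of each
// character in the katakana test string from the report above.
class KatakanaLowByte {

    // Returns the low 8 bits of every UTF-16 code unit in the input string.
    static int[] lowBytes(String s) {
        int[] out = new int[s.length()];
        for (int i = 0; i < s.length(); i++) {
            out[i] = s.charAt(i) & 0xFF;
        }
        return out;
    }

    public static void main(String[] args) {
        String katakana = "\u30AB\u30BF\u30AB\u30CA";
        for (int b : lowBytes(katakana)) {
            // Same "escape (decimal)" layout as the TokenMgrError message.
            System.out.printf("\\u%04x (%d)%n", b, b);
        }
    }
}
```

The first value it prints is \u00ab (171), which matches the character named in the lexical error.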

Geir O.

