lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject DO NOT REPLY [Bug 6091] New: - QueryParser not recognizing asterisk with UTF-8 index
Date Tue, 29 Jan 2002 11:20:38 GMT
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=6091>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=6091

QueryParser not recognizing asterisk with UTF-8 index

           Summary: QueryParser not recognizing asterisk with UTF-8 index
           Product: Lucene
           Version: unspecified
          Platform: All
        OS/Version: Linux
            Status: NEW
          Severity: Normal
          Priority: Other
         Component: QueryParser
        AssignedTo: lucene-dev@jakarta.apache.org
        ReportedBy: tero@favorin.com


Version: 1.2-RC3

I've created an index of UTF-8 encoded documents and making sure that all
queries are converted to UTF-8. When searching the index with query containing
non-ascii UTF-8 characters and an asterisk, no results are found even though
there are documents that contain the query word. Searching does work when query
doesn't contain non-ascii UTF-8 characters or without asterisk works always.
Test results with swedish words:
"födde" - works ok, returns documents.
"född*" - doesn't return any results.
"född" - works ok, returns documents.
"kom*" - works ok, returns documents.

--
To unsubscribe, e-mail:   <mailto:lucene-dev-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-dev-help@jakarta.apache.org>


Mime
View raw message