Return-Path: Delivered-To: apmail-jakarta-lucene-dev-archive@apache.org Received: (qmail 90540 invoked from network); 29 Jan 2002 11:20:26 -0000 Received: from unknown (HELO nagoya.betaversion.org) (192.18.49.131) by daedalus.apache.org with SMTP; 29 Jan 2002 11:20:26 -0000 Received: (qmail 15372 invoked by uid 97); 29 Jan 2002 11:20:40 -0000 Delivered-To: qmlist-jakarta-archive-lucene-dev@jakarta.apache.org Received: (qmail 15323 invoked by uid 97); 29 Jan 2002 11:20:39 -0000 Mailing-List: contact lucene-dev-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Developers List" Reply-To: "Lucene Developers List" Delivered-To: mailing list lucene-dev@jakarta.apache.org Received: (qmail 15312 invoked by uid 50); 29 Jan 2002 11:20:38 -0000 Date: 29 Jan 2002 11:20:38 -0000 Message-ID: <20020129112038.15311.qmail@nagoya.betaversion.org> From: bugzilla@apache.org To: lucene-dev@jakarta.apache.org Cc: Subject: DO NOT REPLY [Bug 6091] New: - QueryParser not recognizing asterisk with UTF-8 index X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT . ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND INSERTED IN THE BUG DATABASE. http://nagoya.apache.org/bugzilla/show_bug.cgi?id=6091 QueryParser not recognizing asterisk with UTF-8 index Summary: QueryParser not recognizing asterisk with UTF-8 index Product: Lucene Version: unspecified Platform: All OS/Version: Linux Status: NEW Severity: Normal Priority: Other Component: QueryParser AssignedTo: lucene-dev@jakarta.apache.org ReportedBy: tero@favorin.com Version: 1.2-RC3 I've created an index of UTF-8 encoded documents and making sure that all queries are converted to UTF-8. When searching the index with query containing non-ascii UTF-8 characters and an asterisk, no results are found even though there are documents that contain the query word. Searching does work when query doesn't contain non-ascii UTF-8 characters or without asterisk works always. Test results with swedish words: "födde" - works ok, returns documents. "född*" - doesn't return any results. "född" - works ok, returns documents. "kom*" - works ok, returns documents. -- To unsubscribe, e-mail: For additional commands, e-mail: