Date: 9 Apr 2003 10:26:18 -0000
Message-ID: <20030409102618.18773.qmail@nagoya.betaversion.org>
From: bugzilla@apache.org
To: lucene-dev@jakarta.apache.org
Mailing-List: contact lucene-dev-help@jakarta.apache.org; run by ezmlm
List-Id: "Lucene Developers List"
Reply-To: "Lucene Developers List"
Subject: DO NOT REPLY [Bug 18847] New: - Parser bug when dealing with new lines.

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT . ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=18847

Summary: Parser bug when dealing with new lines.
Product: Lucene
Version: CVS Nightly - Specify date in submission
Platform: Other
OS/Version: Other
Status: NEW
Severity: Normal
Priority: Other
Component: QueryParser
AssignedTo: lucene-dev@jakarta.apache.org
ReportedBy: konrad@xtramind.com

Hi,

there seems to be a bug in the Lucene parser in the treatment of spaces and newlines. When we search for:

weltbank && identifier: weltbank

over a certain collection of texts, we get 82 hits. If we write this as:

weltbank &&
identifier: weltbank

with no space after the &&, there are still 82 results. But the same query with a space or tab after the && and before the newline gives back 0 results! Even worse, the same query with a space before the "identifier:", but not after the &&:

weltbank &&
 identifier: weltbank

gives 106 hits, i.e., the same as

weltbank || identifier: weltbank

Arrrgh!

There is a simple workaround, flattening the query string, but this is probably only a problem of newlines not being treated like other whitespace in the parser code.
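
The "flattening" workaround mentioned above could look roughly like the sketch below. It assumes that flattening means collapsing every run of whitespace (spaces, tabs, newlines) into a single space before the string is handed to the query parser. The class name QueryFlattener and the method flatten are hypothetical and not part of Lucene; the call into the parser itself is left out.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of Lucene: normalizes a query string
// so that newlines and tabs cannot be treated differently from spaces.
final class QueryFlattener {
    // Matches one or more whitespace characters (space, tab, CR, LF, ...).
    private static final Pattern WHITESPACE_RUN = Pattern.compile("\\s+");

    private QueryFlattener() {}

    // Replaces each whitespace run with a single space and trims the ends.
    static String flatten(String query) {
        return WHITESPACE_RUN.matcher(query).replaceAll(" ").trim();
    }

    public static void main(String[] args) {
        // The variant from the report with a space after && and before the newline.
        String multiLine = "weltbank && \n identifier: weltbank";
        System.out.println(flatten(multiLine));
    }
}
```

The flattened string would then be passed to the query parser in place of the original. Note that this also collapses whitespace inside quoted phrases, which may or may not matter depending on the analyzer in use.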