Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7F750E6D3 for ; Wed, 19 Dec 2012 16:32:33 +0000 (UTC) Received: (qmail 42993 invoked by uid 500); 19 Dec 2012 16:32:31 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 42950 invoked by uid 500); 19 Dec 2012 16:32:31 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 42926 invoked by uid 99); 19 Dec 2012 16:32:31 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 19 Dec 2012 16:32:31 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [193.196.8.10] (HELO linux3.ids-mannheim.de) (193.196.8.10) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 19 Dec 2012 16:32:26 +0000 Received: from linux2.ids-mannheim.de ([10.0.1.1]) by linux3.ids-mannheim.de with smtp (Exim 4.72) (envelope-from ) id 1TlMYu-0002yx-8o for java-user@lucene.apache.org; Wed, 19 Dec 2012 17:32:04 +0100 Received: (qmail 3133 invoked from network); 19 Dec 2012 16:32:09 -0000 Received: from unknown (HELO ?10.99.1.49?) (10.99.1.49) by linux2.ids-mannheim.de with SMTP; 19 Dec 2012 16:32:09 -0000 Message-ID: <50D1EC03.4030109@ids-mannheim.de> Date: Wed, 19 Dec 2012 17:32:03 +0100 From: Carsten Schnober Organization: Institut =?ISO-8859-15?Q?f=FCr_Deutsche_Sprache?= User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/17.0 Thunderbird/17.0 MIME-Version: 1.0 To: java-user@lucene.apache.org References: <50C9F8F6.60208@ids-mannheim.de> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-15 Content-Transfer-Encoding: 8bit X-SA-Do-Not-Run: Yes X-SA-Exim-Connect-IP: 10.0.1.1 X-SA-Exim-Rcpt-To: java-user@lucene.apache.org X-SA-Exim-Mail-From: schnober@ids-mannheim.de X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on linux3.ids-mannheim.de X-Spam-Level: Subject: Re: Boolean and SpanQuery: different results X-SA-Exim-Version: 4.2.1 (built Mon, 03 Jul 2006 09:34:15 +0200) X-SA-Exim-Scanned: Yes (on linux3.ids-mannheim.de) X-Virus-Checked: Checked by ClamAV on apache.org X-Old-Spam-Status: No, score=-2.1 required=3.0 tests=BAYES_00,GREYLIST_ISWHITE, RDNS_NONE,TO_NO_BRKTS_NORDNS autolearn=no version=3.3.2 Am 13.12.2012 18:00, schrieb Jack Krupansky: > Can you provide some examples of terms that don't work and the index > token stream they fail on? > > Make sure that the Analyzer you are using doesn't do any magic on the > indexed terms - your query term is unanalyzed. Maybe multiple, but > distinct, index terms are analyzing to the same, but unexpected term. Apart from the answer I've already given myself, here's another note about the issue. I've been using WhitespaceAnalyzer for both indexing and query parsing, but apparently, the query parser lowercased by default while WhitespaceAnalyzer did not. Therefore, QueryParser.setLowercaseExpandedTerms(false) is necessary in order to get the same results. Best, Carsten -- Institut f�r Deutsche Sprache | http://www.ids-mannheim.de Projekt KorAP | http://korap.ids-mannheim.de Tel. +49-(0)621-43740789 | schnober@ids-mannheim.de Korpusanalyseplattform der n�chsten Generation Next Generation Corpus Analysis Platform --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org