From: Antony Bowesman
Organization: Teamware Group
Date: Tue, 21 Nov 2006 14:28:13 +1100
To: java-user@lucene.apache.org
Subject: Re: Q: Wildcard searching with german umlauts (ä, ö, ß, ...)
Message-ID: <4562724D.5010003@teamware.com>
In-Reply-To: <4561A39E.9060405@joanneum.at>

Stephan Spat wrote:
> Hello again!
>
> It replaces german umlauts, e.g. ä <=> a, ü <=> u, ... . So no umlauts
> are in the index. For searching I use the same Analyzer. When I do a
> simple search for a word with umlauts there is no problem. But if I
> additionally use wildcards, I suppose the word is not analyzed, and so a
> word with umlauts and wildcards is not found in the index?!! (for
> example: gr�*). Is this assumption correct?

I came across this class this morning:

AnalyzingQueryParser - Overrides Lucene's default QueryParser so that Fuzzy-, Prefix-, Range-, and WildcardQuerys are also passed through the given analyzer, but ? and * don't get removed from the search terms.

Read the warning regarding German, though.

Antony
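
To illustrate the behaviour discussed above, here is a minimal, self-contained sketch in plain Java. It does not use Lucene and is not how AnalyzingQueryParser is implemented; it only mimics the idea. The class name, the fold() helper (a stand-in for the umlaut-replacing analyzer, with the ö => o mapping assumed), the sample words, and the regex-based wildcard matcher are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.regex.Pattern;

final class UmlautWildcardSketch {

    // Hypothetical stand-in for the umlaut-replacing analyzer: lower-cases
    // the text and folds a-umlaut to a, o-umlaut to o, u-umlaut to u.
    // Every other character, including '?' and '*', passes through unchanged.
    static String fold(String text) {
        StringBuilder out = new StringBuilder();
        for (char c : text.toLowerCase(Locale.ROOT).toCharArray()) {
            switch (c) {
                case '\u00e4': out.append('a'); break;
                case '\u00f6': out.append('o'); break;
                case '\u00fc': out.append('u'); break;
                default: out.append(c);
            }
        }
        return out.toString();
    }

    // Simple wildcard match: '*' matches any run of characters,
    // '?' matches exactly one character, everything else is literal.
    static boolean wildcardMatches(String pattern, String term) {
        StringBuilder regex = new StringBuilder();
        for (char c : pattern.toCharArray()) {
            if (c == '*') {
                regex.append(".*");
            } else if (c == '?') {
                regex.append('.');
            } else {
                regex.append(Pattern.quote(String.valueOf(c)));
            }
        }
        return Pattern.matches(regex.toString(), term);
    }

    // Returns the indexed terms that match the wildcard pattern.
    static List<String> search(List<String> indexedTerms, String pattern) {
        List<String> hits = new ArrayList<>();
        for (String term : indexedTerms) {
            if (wildcardMatches(pattern, term)) {
                hits.add(term);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // Index-time analysis: only folded terms are stored.
        List<String> index = Arrays.asList(
                fold("Gr\u00fcn"), fold("Gruppe"), fold("Haus"));

        String query = "gr\u00fc*";

        // Wildcard term used as typed: the umlaut never occurs in the index.
        System.out.println(search(index, query));        // prints []

        // Wildcard term folded first, with '*' kept intact.
        System.out.println(search(index, fold(query)));  // prints [grun, gruppe]
    }
}
```

Note that the folded pattern also matches "gruppe", which the person searching may not expect. That is a side effect of folding in this sketch, and with a real analyzer the result depends on what it does to partial words.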