From java-user-return-54190-apmail-lucene-java-user-archive=lucene.apache.org@lucene.apache.org Mon Nov 19 10:08:38 2012 Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7810EDC63 for ; Mon, 19 Nov 2012 10:08:38 +0000 (UTC) Received: (qmail 61315 invoked by uid 500); 19 Nov 2012 10:08:36 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 61271 invoked by uid 500); 19 Nov 2012 10:08:36 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 61248 invoked by uid 99); 19 Nov 2012 10:08:35 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 19 Nov 2012 10:08:35 +0000 X-ASF-Spam-Status: No, hits=1.0 required=5.0 tests=SPF_SOFTFAIL X-Spam-Check-By: apache.org Received-SPF: softfail (athena.apache.org: transitioning domain of LFechner@hubwoo.com does not designate 62.153.204.50 as permitted sender) Received: from [62.153.204.50] (HELO mailgate.de.cc-hubwoo.com) (62.153.204.50) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 19 Nov 2012 10:08:28 +0000 Received: from mailgate.de.cc-hubwoo.com (localhost [127.0.0.1]) by mailgate.de.cc-hubwoo.com (Proxmox) with ESMTP id 749B15C8261 for ; Mon, 19 Nov 2012 06:08:10 +0100 (CET) Received: from deexch01.de.cc-hubwoo.com (unknown [192.168.168.13]) by mailgate.de.cc-hubwoo.com (Proxmox) with ESMTP id 567145C8085 for ; Mon, 19 Nov 2012 06:08:10 +0100 (CET) X-MimeOLE: Produced By Microsoft Exchange V6.5 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Subject: RE: German 'ue' -> 'u' conversion Date: Mon, 19 Nov 2012 11:08:07 +0100 Message-ID: In-Reply-To: <4F7DF008E6B16049AABE8779C772586F2C1F8A4D@R2-ExchDB1.BEUMER.com> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: German 'ue' -> 'u' conversion Thread-Index: Ac3GOSlweMcSn7rwT/CS3zxTDYR20AABCWsw References: <4F7DF008E6B16049AABE8779C772586F2C1F8A4D@R2-ExchDB1.BEUMER.com> From: "Lutz Fechner" To: X-Virus-Checked: Checked by ClamAV on apache.org Hi, we use a modified ISOLatin1AccentFilter bit to replace German accents by = ae, oe, ue and so on for that purpose. In the code you will see a switch for the characters. You need to change it from=20 case '\u00E4' : // small =E4 output[outputPos++] =3D 'a'; output[outputPos++] =3D 'e'; break; To something like this case '\u00E4' : // small =E4 output[outputPos++] =3D 'a'; =20 break; for the characters you want to replace. Best Regards Lutz Fechner -----Original Message----- From: Dyga, Adam [mailto:adam.dyga@beumergroup.com]=20 Sent: Montag, 19. November 2012 10:47 To: java-user@lucene.apache.org Subject: German 'ue' -> 'u' conversion Hello, I have two questin regarding handling German umlauts in Lucene: 1. I'm trying to find a way to convert German Umlauts written as 'ue', = 'ae', etc to folded form 'u', 'a' and so on. This is done by GermanAnalyzer (and German2StemFilter used by it), but = unfortunately it also does stemming which is very undesired in my case. Is there any other filter that can do only the 'ua' -> 'u' conversion? 2. Is there any filter that does '=FC' -> 'ue' (NOT 'u') conversion? = What I'm trying to achieve is that word "=FCber" should be found in the = index whenever the user searches for " =FCber" or "ueber" , but NOT = "uber".=20 Regards, AD --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org