lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: How to search for europian word with and without special characters
Date Tue, 20 Jun 2006 15:47:00 GMT
I think you'll want to write your own Analyzer + Tokenizer, detect tokens with umlauts, and
then emit two tokens at the same position (think of them as synonyms), one being the original
one with the umlaut, and the other one with the umlaut transformed according to the rules
(e.g. ü -> ue).  Hm, I wonder if GermanAnalyzer already does this... maybe, have a look.

Otis

----- Original Message ----
From: Supriya Kumar Shyamal <supriya.shyamal@artnology.com>
To: java-user@lucene.apache.org
Sent: Tuesday, June 20, 2006 8:09:18 AM
Subject: How to search for europian word with and without special characters

Hi All,

I have a question regarding the indexing and searching for german 
characters. For eg. when I search for the word "müller" also I want to 
search for the word "mueller". How to achieve this in lucene.

Thanks,
supriya

-- 
Mit freundlichen Grüßen / Regards
 
Supriya Kumar Shyamal

Software Developer
tel +49 (30) 443 50 99 -22
fax +49 (30) 443 50 99 -99
email supriya.shyamal@artnology.com
___________________________
artnology GmbH
Milastr. 4
10437 Berlin
___________________________

http://www.artnology.com
__________________________________________________________________________

 News / Aktuelle Projekte:
 * artnology gewinnt Ausschreibung des Bundesministeriums des Innern:
   Softwarelösung für die Verwaltung der Sammlung zeitgenössischer
   Kunstwerke zur kulturellen Repräsentation des Bundes.

 Projektreferenzen:
 * Globaler eShop und Corporate-Site für Springer: www.springeronline.com
 * E-Detailing-Portal für Novartis: www.interaktiv.novartis.de
 * Service-Center-Plattform für Biogen: www.ms-life.de
 * eCRM-System für Grünenthal: www.gruenenthal.com

___________________________________________________________________________ 


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org





---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message