lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alex Murzaku" <li...@lissus.com>
Subject RE: Accentuated characters
Date Thu, 12 Dec 2002 17:39:27 GMT
Something flexible and elegant would also be a simple fst.
Here is one built for lucene:
 http://sourceforge.net/projects/normalizer/

-----Original Message-----
From: stephane vaucher [mailto:vaucher@LUB.UMontreal.CA] 
Sent: Thursday, December 12, 2002 12:23 PM
To: Lucene Users List
Subject: Re: Accentuated characters


Thanks for the reference. I basically work with french, english, or 
bilingual texts. I'll take a quick look at the lib, but it might be an 
overkill.

Cheers,
Stephane

Alex Murzaku wrote:

>IBM's ICU4J has a normalizer which should do what you need. It's a big 
>library, but if you deal with multilingual text often, it might make 
>your life easier.
>
>
>
>-----------------------------------------------------------------------
>-
>
>--
>To unsubscribe, e-mail:
<mailto:lucene-user-unsubscribe@jakarta.apache.org>
>For additional commands, e-mail: 
><mailto:lucene-user-help@jakarta.apache.org>
>



--
To unsubscribe, e-mail:
<mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail:
<mailto:lucene-user-help@jakarta.apache.org>



--
To unsubscribe, e-mail:   <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>


Mime
View raw message