lucene-java-user mailing list archives

From "Patrick Turcotte" <pat...@gmail.com>
Subject Re: how to handle words with accent?
Date Tue, 31 Oct 2006 16:39:28 GMT
Should both results be returned in both cases?

If so, take a look at the ISOLatin1AccentFilter class. It removes those
accents, and you can apply it at both indexing and search time if needed.
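
To illustrate the idea, here is a minimal sketch of accent folding in plain Java. It does not use the Lucene API: AccentFolding and fold are hypothetical names made up for this example, and java.text.Normalizer stands in for whatever the filter does internally. The point is only that the same folding is applied to the indexed term and to the query term, so both end up as the same string.

```java
import java.text.Normalizer;

// Hypothetical helper, not part of Lucene: it only illustrates accent folding.
class AccentFolding {

    // Decompose each accented character into a base letter plus combining
    // marks (NFD), then drop the combining marks.
    static String fold(String text) {
        String decomposed = Normalizer.normalize(text, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // "societ\u00e0" is "società", written with an escape so the source
        // file compiles regardless of its encoding.
        String indexed = fold("societ\u00e0"); // term as it appears in a document
        String query = fold("societa");        // term as typed by the user
        System.out.println(indexed + " / " + query + " equal: " + indexed.equals(query));
    }
}
```

Note that this only covers the accent. Whether the trailing ' in societa' is dropped depends on how the analyzer tokenizes the query.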

Patrick

On 10/31/06, Valerio Schiavoni <valerio.schiavoni@gmail.com> wrote:
>
> Hello,
> I use Lucene to index documents in Italian. Many terms end with accented
> letters: società, fedeltà, etc.
>
> What happens now is that if a user searches for societa' (note the a and
> the ' character), they don't get the same results as when searching for
> società.
>
> What is the best practice for handling such situations?
> I haven't tuned Lucene in any way, and I'm using the default analyzer.
>
> thanks for any suggestions,
> valerio
> --
> http://valerioschiavoni.blogspot.com
> http://jroller.com/page/vschiavoni
>
>
