lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
Subject special character with lucene
Date Mon, 28 Feb 2005 15:01:05 GMT
I would like to build a search engine using several different languages - 
f.e. Spanish names, French names, ...

- Using a different analyzer for each language would be one solution.

- But how about replacing each special character (Umlaute, ...ä, ö, ...) 
with its html special character before indexing and doing the same with 
each search query before searching??

This seems to me the simplest approach to handling this issues - ?

What are the drawbacks? No Stem search? Other considerations?

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message