lucene-dev mailing list archives

From Philipp_Bre...@sonydadc.com
Subject special character with lucene
Date Mon, 28 Feb 2005 15:01:05 GMT
Hello,
I would like to build a search engine that handles several different languages,
e.g. Spanish names, French names, ...

- Using a different analyzer for each language would be one solution.

- But how about replacing each special character (umlauts etc.: ä, ö, ...)
with its HTML character entity before indexing, and doing the same with
each search query before searching?
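
As a rough sketch of this second idea (plain Java, no Lucene calls; the class name EntityEscaper and the choice of numeric entities such as &#228; are only assumptions for illustration), the replacement step might look like this:

```java
// Hypothetical helper, not part of Lucene: turns every non-ASCII
// character into a numeric HTML entity (e.g. ä becomes &#228;).
final class EntityEscaper {
    private EntityEscaper() {}

    static String escape(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (cp < 128) {
                // Plain ASCII is copied through unchanged.
                out.append((char) cp);
            } else {
                // Everything else is written as &#<decimal code point>;
                out.append("&#").append(cp).append(';');
            }
            i += Character.charCount(cp);
        }
        return out.toString();
    }
}
```

The same escape() call would be applied to the text of each document before it is indexed and to each query string before it is searched, so that both sides see the same form.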

This seems to me the simplest way to handle this issue. Is it?

What are the drawbacks? No stemming? Other considerations?

Greetings,
Philipp