lucene-java-user mailing list archives

From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: Indexing documents for several languages
Date Wed, 14 May 2003 17:58:32 GMT

> What are the best practices?
> 
> Do I have to create one index per language?

That would be one option.

> Do I have to write a custom analyzer that dynamically detects the
> Document language and apply the right stemming, stop word list, etc?

This is more complex, but sounds more attractive to me.

One person had language-recognition code that picked a different
Analyzer depending on the detected language.  Unfortunately, he has
not contributed the code yet.
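
To make the second option more concrete, here is a rough sketch of the
idea in plain Java.  It is not the code mentioned above and it does not
use any Lucene classes: LanguageRouter, its stop word lists and its
detect/analyze methods are all made up for illustration.  Detection
here is just a count of stop word hits per language, which is a crude
stand-in for real language recognition.  In an actual setup, each
language code would presumably map to a suitable Analyzer instead of a
bare stop word set.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical helper, not part of Lucene: guesses a document's language
// and applies that language's stop word list.
class LanguageRouter {
    // Tiny illustrative stop word lists; real lists are much longer.
    private static final Map<String, Set<String>> STOP_WORDS = new LinkedHashMap<>();
    static {
        STOP_WORDS.put("en", new HashSet<>(Arrays.asList("the", "and", "is", "of", "in", "are")));
        STOP_WORDS.put("fr", new HashSet<>(Arrays.asList("le", "la", "les", "et", "est", "dans")));
        STOP_WORDS.put("de", new HashSet<>(Arrays.asList("der", "die", "das", "und", "ist", "in")));
    }

    // Lowercase the text and split it on anything that is not a letter.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Return the language whose stop words occur most often in the text.
    // Falls back to "en" when nothing matches (an arbitrary choice).
    static String detect(String text) {
        String best = "en";
        int bestHits = 0;
        List<String> tokens = tokenize(text);
        for (Map.Entry<String, Set<String>> entry : STOP_WORDS.entrySet()) {
            int hits = 0;
            for (String t : tokens) {
                if (entry.getValue().contains(t)) {
                    hits++;
                }
            }
            if (hits > bestHits) {
                bestHits = hits;
                best = entry.getKey();
            }
        }
        return best;
    }

    // Tokenize the text and drop the stop words of the detected language.
    static List<String> analyze(String text) {
        Set<String> stop = STOP_WORDS.get(detect(text));
        List<String> kept = new ArrayList<>();
        for (String t : tokenize(text)) {
            if (!stop.contains(t)) {
                kept.add(t);
            }
        }
        return kept;
    }
}
```

Calling LanguageRouter.analyze("Der Hund und die Katze") detects "de"
and returns [hund, katze].  Stemming would slot in at the same place as
the stop word filter, chosen by the same language code.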

Otis

