lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: Best practices for multiple languages?
Date Tue, 18 Jan 2011 19:14:44 GMT
Hi Clemens,

If you will be searching individual languages, go with language-specific 
indices.  Wunder likes to give an example of "die" in German vs. English. :)

Otis
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/



----- Original Message ----
> From: Clemens Wyss <clemensdev@mysign.ch>
> To: "java-user@lucene.apache.org" <java-user@lucene.apache.org>
> Sent: Tue, January 18, 2011 12:53:57 PM
> Subject: Best practices for multiple languages?
> 
> What is the "best practice" to support multiple languages, i.e. 
>Lucene-Documents  that have multiple language content/fields? 
>
> Should
> a) each language be  indexed in a seperate index/directory or should
> b) the Documents (in a single  directory) hold the diverse localized fields?
> 
> We most often will be  searching "language dependent" which (at least 
>performance wise) mandates  one-directory-per-language...
> 
> Any (lucene specific) white papers on this  topic?
> 
> Thx in  advance
> Clemens
> 
> ---------------------------------------------------------------------
> To  unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For  additional commands, e-mail: java-user-help@lucene.apache.org
> 
> 

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message