lucene-java-user mailing list archives

From Vinaya Kumar Thimmappa
Subject Re: Best practices for multiple languages?
Date Wed, 19 Jan 2011 06:53:35 GMT
I think we should be using Lucene with the Snowball JARs, which means one
index for all languages (of course, the size of the index is always a matter of

Hope this helps.
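
For reference, the two layouts from the question can be sketched with nothing but the JDK. This is only an illustration under assumptions: the class name `LanguageLayout`, the helper names `indexDirFor` and `fieldFor`, the `indexes` root and the `body` field name are all made up for this example, and the Lucene side (opening the directory, choosing a stemmer per language) is left out.

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Locale;

// Hypothetical helper, not part of Lucene: it only derives names for the two layouts.
final class LanguageLayout {

    // Option (a): one index directory per language, e.g. <root>/de
    public static Path indexDirFor(Path root, Locale locale) {
        return root.resolve(locale.getLanguage());
    }

    // Option (b): a single index with one field per language, e.g. "body_de"
    public static String fieldFor(String baseName, Locale locale) {
        return baseName + "_" + locale.getLanguage();
    }

    public static void main(String[] args) {
        Path root = Paths.get("indexes"); // assumed root directory
        System.out.println(indexDirFor(root, Locale.GERMAN));
        System.out.println(fieldFor("body", Locale.FRENCH));
    }
}
```

With option (b), a language-dependent search would query only the field for that language (for example `body_de`), so the other languages' fields are not searched even though they live in the same index.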

On Tuesday 18 January 2011 11:23 PM, Clemens Wyss wrote:
> What is the "best practice" for supporting multiple languages, i.e. Lucene Documents that have content/fields in multiple languages?
> Should
> a) each language be indexed in a separate index/directory, or should
> b) the Documents (in a single directory) hold the various localized fields?
> We will most often be searching in a "language-dependent" way, which (at least performance-wise) mandates one directory per language...
> Any (lucene specific) white papers on this topic?
> Thx in advance
> Clemens
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:
