lucene-java-user mailing list archives

From Vinaya Kumar Thimmappa <vthimma...@ariba.com>
Subject Re: Best practices for multiple languages?
Date Wed, 19 Jan 2011 06:53:35 GMT
I think we should use Lucene with the Snowball JARs, which means one
index for all languages (of course, the size of the index is always a
concern).
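
A minimal sketch of how a single index could keep languages apart, assuming
each localized value goes into its own field and each field is stemmed with
the matching Snowball stemmer. The class, the method names and the
"content_de" naming convention are hypothetical and not part of Lucene; the
stemmer names are the ones I believe the Snowball analyzer accepts, so please
check them against the version you use.

```java
import java.util.Locale;

// Plain-Java sketch (no Lucene dependency): helper logic for option (b),
// a single index whose documents hold one field per language.
class LocalizedFields {

    // Hypothetical naming convention: base field name plus the ISO 639
    // language code, e.g. "content" + German -> "content_de".
    static String fieldFor(String baseField, Locale locale) {
        return baseField + "_" + locale.getLanguage();
    }

    // Assumed mapping from language code to a Snowball stemmer name.
    // Returns null when no stemmer is configured for the language, so the
    // caller can fall back to a non-stemming analyzer.
    static String stemmerFor(Locale locale) {
        switch (locale.getLanguage()) {
            case "en": return "English";
            case "de": return "German";
            case "fr": return "French";
            default:   return null;
        }
    }

    public static void main(String[] args) {
        Locale[] supported = { Locale.ENGLISH, Locale.GERMAN, Locale.FRENCH };
        for (Locale locale : supported) {
            // One field per language, each paired with its own stemmer.
            System.out.println(fieldFor("content", locale) + " -> " + stemmerFor(locale));
        }
    }
}
```

A language-dependent search would then query only the field for the user's
language (for example the one returned by fieldFor("content", Locale.GERMAN)),
so the documents can stay in one directory.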

Hope this helps.
-vinaya

On Tuesday 18 January 2011 11:23 PM, Clemens Wyss wrote:
> What is the "best practice" to support multiple languages, i.e. Lucene Documents
> that have multiple-language content/fields?
> Should
> a) each language be indexed in a separate index/directory or should
> b) the Documents (in a single directory) hold the diverse localized fields?
>
> We will most often be searching "language dependent", which (at least performance-wise)
> mandates one directory per language...
>
> Any (lucene specific) white papers on this topic?
>
> Thx in advance
> Clemens
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org