lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shai Erera <ser...@gmail.com>
Subject Re: Best practices for multiple languages?
Date Tue, 18 Jan 2011 19:27:56 GMT
Hi

There are two types of multi-language docs:
1) Docs in different languages -- every document is one language
2) Each document has fields in different languages

I've dealt with both, and there are different solutions to each. Which of
them is yours?

Shai

On Tue, Jan 18, 2011 at 7:53 PM, Clemens Wyss <clemensdev@mysign.ch> wrote:

> What is the "best practice" to support multiple languages, i.e.
> Lucene-Documents that have multiple language content/fields?
> Should
> a) each language be indexed in a seperate index/directory or should
> b) the Documents (in a single directory) hold the diverse localized fields?
>
> We most often will be searching "language dependent" which (at least
> performance wise) mandates one-directory-per-language...
>
> Any (lucene specific) white papers on this topic?
>
> Thx in advance
> Clemens
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message