lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Clemens Wyss <clemens...@mysign.ch>
Subject AW: Best practices for multiple languages?
Date Wed, 19 Jan 2011 07:09:21 GMT
> 1) Docs in different languages -- every document is one language
> 2) Each document has fields in different languages
We mainly have 1)-models

Clemens

> -----Urspr√ľngliche Nachricht-----
> Von: Shai Erera [mailto:serera@gmail.com]
> Gesendet: Dienstag, 18. Januar 2011 20:28
> An: java-user@lucene.apache.org
> Betreff: Re: Best practices for multiple languages?
> 
> Hi
> 
> There are two types of multi-language docs:
> 1) Docs in different languages -- every document is one language
> 2) Each document has fields in different languages
> 
> I've dealt with both, and there are different solutions to each. Which of them
> is yours?
> 
> Shai
> 
> On Tue, Jan 18, 2011 at 7:53 PM, Clemens Wyss <clemensdev@mysign.ch>
> wrote:
> 
> > What is the "best practice" to support multiple languages, i.e.
> > Lucene-Documents that have multiple language content/fields?
> > Should
> > a) each language be indexed in a seperate index/directory or should
> > b) the Documents (in a single directory) hold the diverse localized fields?
> >
> > We most often will be searching "language dependent" which (at least
> > performance wise) mandates one-directory-per-language...
> >
> > Any (lucene specific) white papers on this topic?
> >
> > Thx in advance
> > Clemens
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> > For additional commands, e-mail: java-user-help@lucene.apache.org
> >
> >

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message