lucene-java-user mailing list archives

From "Bruce Ritchie" <br...@jivesoftware.com>
Subject RE: Indexing multiple languages
Date Fri, 03 Jun 2005 20:49:06 GMT
> Tansley, Robert wrote:
> > What if we're trying to index multiple languages in the same site?
> > Is it best to have:
> >
> > 1/ one index for all languages
> > 2/ one index for all languages, with an extra language field so
> >    searches can be constrained to a particular language
> > 3/ separate indices for each language?
> 
> I'd use 2/.  In particular, use the same field for the content,
> title, etc., even when produced by different analyzers.  Have a
> "lang" field that names the language of the document.

We use 2/ and apply filters when we want to restrict a search to a particular language. Just be
sure to use the same analyzer for indexing and for searching within a given language.
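
To make the idea concrete, here is a rough, self-contained sketch of approach 2/ in plain Java. It is a toy in-memory index, not Lucene code: the class MultiLangIndex, its methods, and the two crude "analyzers" are all made up for illustration. It only shows the shape of the setup: one shared content field, a lang value stored per document, the analyzer picked by language at both index and search time, and a language filter applied to the hits.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import java.util.function.Function;

// Toy illustration only; none of these names come from Lucene.
class MultiLangIndex {
    // Hypothetical per-language analyzers: text -> list of terms.
    private final Map<String, Function<String, List<String>>> analyzers = new HashMap<>();
    // One shared "content" field for all languages: term -> document ids.
    private final Map<String, Set<Integer>> contentPostings = new HashMap<>();
    // The "lang" field: document id -> language code.
    private final Map<Integer, String> langOfDoc = new HashMap<>();

    MultiLangIndex() {
        analyzers.put("en", text -> analyze(text, Locale.ENGLISH, MultiLangIndex::stemEnglish));
        analyzers.put("de", text -> analyze(text, Locale.GERMAN, MultiLangIndex::stemGerman));
    }

    // Lowercase, split on anything that is not a letter or digit, then stem.
    private static List<String> analyze(String text, Locale locale, Function<String, String> stem) {
        List<String> terms = new ArrayList<>();
        for (String token : text.toLowerCase(locale).split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                terms.add(stem.apply(token));
            }
        }
        return terms;
    }

    // Deliberately crude stand-ins for real stemmers.
    private static String stemEnglish(String t) {
        return (t.length() > 3 && t.endsWith("s")) ? t.substring(0, t.length() - 1) : t;
    }

    private static String stemGerman(String t) {
        if (t.length() > 4 && t.endsWith("en")) return t.substring(0, t.length() - 2);
        if (t.length() > 3 && t.endsWith("e")) return t.substring(0, t.length() - 1);
        return t;
    }

    private Function<String, List<String>> analyzerFor(String lang) {
        Function<String, List<String>> analyzer = analyzers.get(lang);
        if (analyzer == null) {
            throw new IllegalArgumentException("no analyzer for language: " + lang);
        }
        return analyzer;
    }

    // Index a document: record its language, analyze with that language's analyzer.
    void add(int docId, String lang, String content) {
        langOfDoc.put(docId, lang);
        for (String term : analyzerFor(lang).apply(content)) {
            contentPostings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
        }
    }

    // Search one language: same analyzer as at index time, all terms must match,
    // then keep only hits whose lang field equals the requested language.
    List<Integer> search(String lang, String query) {
        Set<Integer> hits = null;
        for (String term : analyzerFor(lang).apply(query)) {
            Set<Integer> docs = contentPostings.getOrDefault(term, Collections.<Integer>emptySet());
            if (hits == null) {
                hits = new TreeSet<>(docs);
            } else {
                hits.retainAll(docs);
            }
        }
        List<Integer> result = new ArrayList<>();
        if (hits != null) {
            for (int id : hits) {
                if (lang.equals(langOfDoc.get(id))) {
                    result.add(id);
                }
            }
        }
        return result;
    }

    public static void main(String[] args) {
        MultiLangIndex index = new MultiLangIndex();
        index.add(1, "en", "Searching a Lucene index");
        index.add(2, "de", "Suche im Lucene Index");
        index.add(3, "en", "Indexing documents");

        System.out.println(index.search("en", "index"));     // [1]
        System.out.println(index.search("de", "Index"));     // [2]
        System.out.println(index.search("en", "documents")); // [3]
    }
}
```

The term "index" lands in the shared field for both document 1 and document 2, so only the language filter keeps the German hit out of an English search. The "documents" query matches document 3 only because the query goes through the same English analyzer that produced the indexed term; run it through the German one and it would miss.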


Regards,

Bruce Ritchie
