lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "James Adams" <James.Ad...@framfab.com>
Subject RE: Multiple Language Indexing and Searching
Date Tue, 06 Sep 2005 12:32:37 GMT

Surely it's best to have a specific analyzer for each language?

Would support for multiple Analyzers with a single index require a
different IndexWriter for each Analzser/language?  Would you then need
to manage the disk access of these regarding locking etc, so two
IndexWriter's can not do so at the same time?




-----Original Message-----
From: Olivier Jaquemet [mailto:olivier.jaquemet@jalios.com] 
Sent: 06 September 2005 13:21
To: java-user@lucene.apache.org
Subject: Re: Multiple Language Indexing and Searching

Gusenbauer Stefan wrote:

>I think nutch uses ngramj for language classification but i don't know
>what type of saving language information they use. In our application
>for example i save the language in an extra field in the document
>because lucene is supporting multiple fields with the same names we
>would be able to handle different languages. but for now we don't need
it
>  
>
But then, if you do so, you do not benefit from any specialized Analyzer

you could use for each language, do you?
Then again, maybe it's not that interesting to use specialized analyzers

for each language?.



---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message