lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gusenbauer Stefan <gusenba...@eduhi.at>
Subject Re: Multiple Language Indexing and Searching
Date Tue, 06 Sep 2005 12:35:18 GMT
Olivier Jaquemet wrote:

> Gusenbauer Stefan wrote:
>
>> I think nutch uses ngramj for language classification but i don't know
>> what type of saving language information they use. In our application
>> for example i save the language in an extra field in the document
>> because lucene is supporting multiple fields with the same names we
>> would be able to handle different languages. but for now we don't
>> need it
>>  
>>
> But then, if you do so, you do not benefit from any specialized
> Analyzer you could use for each language, do you?
> Then again, maybe it's not that interesting to use specialized
> analyzers for each language?.
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
>
sorry this was my fault we handle different languages with different
analyzers at this time but not documents with multiple languages this
was a typing and thinking mistake by myself.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message