lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: Analyze other language using English Analyzer
Date Tue, 03 Mar 2009 12:54:56 GMT

Ganesh,

You should really try to detect the language of the document and then, based on the language
identified, pick the appropriate analyzer.
See Snowball analzyers in Lucene contrib
See http://www.sematext.com/product-language-identifier.html

 Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: Ganesh <emailgane@yahoo.co.in>
> To: java-user@lucene.apache.org
> Sent: Tuesday, March 3, 2009 1:15:29 AM
> Subject: Analyze other language using English Analyzer
> 
> Hello all,
> 
> I am using default English Snowball analyzer to index and search English 
> documents. There may be chances to index European, Chinese documents. What will 
> be the impact to use English Analyzer for European or Chinese language 
> documents? Whether i could do index and search as expected?
> 
> The application will be installed in English OS but the chances of getting other 
> language documents are high. I will not be able to detect the language of the 
> document.
> 
> Regards
> Ganesh 
> Send instant messages to your online friends http://in.messenger.yahoo.com 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message