lucene-java-user mailing list archives

From Mathieu Lecarme <math...@garambrogne.net>
Subject Re: Lucene and Eastern languages (Japanese, Korean and Chinese)
Date Wed, 25 Jul 2007 06:59:52 GMT
On Tuesday, 24 July 2007 at 13:01 -0700, Shaw, James wrote:
> Hi, guys,
> I found Analyzers for Japanese, Korean and Chinese, but not stemmers;
> the Snowball stemmers only include European languages.  Does stemming
> not make sense for ideograph-based languages (i.e., no stemming is
> needed for Japanese, Korean and Chinese)?
No.

> Also for spell checking, does the default Lucene SpellChecker work for
> Japanese, Korean and Chinese?  Does edit distance make sense for these
> languages?
Japanese uses groups of ideograms, and Levenshtein distance doesn't make
much sense on words of only a few characters, but I'm not a CJK expert.
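
To illustrate the point about short words, here is a rough sketch in plain
Java. It is not the Lucene SpellChecker code; the class and method names
(EditDistanceDemo, levenshtein, similarity) are made up for this example,
and the normalization by the longer word's length is an assumption about
how one might score a match. A single changed character in a two-character
word already alters half of the word, while the same edit in an
eight-letter word is a small change.

```java
public class EditDistanceDemo {

    // Classic dynamic-programming Levenshtein distance, counted in
    // Unicode code points so that each ideogram is one unit.
    static int levenshtein(String a, String b) {
        int[] s = a.codePoints().toArray();
        int[] t = b.codePoints().toArray();
        int[] prev = new int[t.length + 1];
        int[] curr = new int[t.length + 1];
        for (int j = 0; j <= t.length; j++) {
            prev[j] = j; // distance from the empty prefix of s
        }
        for (int i = 1; i <= s.length; i++) {
            curr[0] = i;
            for (int j = 1; j <= t.length; j++) {
                int cost = (s[i - 1] == t[j - 1]) ? 0 : 1;
                int insertOrDelete = Math.min(curr[j - 1] + 1, prev[j] + 1);
                curr[j] = Math.min(insertOrDelete, prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[t.length];
    }

    // Hypothetical score: 1.0 for identical strings, lower as the
    // distance grows relative to the longer string.
    static double similarity(String a, String b) {
        int maxLen = Math.max(a.codePointCount(0, a.length()),
                              b.codePointCount(0, b.length()));
        if (maxLen == 0) {
            return 1.0;
        }
        return 1.0 - (double) levenshtein(a, b) / maxLen;
    }

    public static void main(String[] args) {
        // Two different two-character words: distance 1, similarity 0.5
        System.out.println(levenshtein("東京", "東北") + " " + similarity("東京", "東北"));
        // Two spellings of one eight-letter word: distance 1, similarity 0.875
        System.out.println(levenshtein("analyzer", "analyser") + " " + similarity("analyzer", "analyser"));
    }
}
```

Both pairs are one edit apart, but in the first case the two strings are
unrelated words, so a distance of 1 says very little about whether one is
a misspelling of the other.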

M.

