lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Phil Whelan <phil...@gmail.com>
Subject Re: Searching doubt
Date Wed, 05 Aug 2009 00:19:29 GMT
(sorry, tangent. I'll be quick)

On Tue, Aug 4, 2009 at 8:42 AM, Shai Erera<serera@gmail.com> wrote:
> Interesting ... I don't have access to a Japanese dictionary, so I just
> extract bi-grams.

Shai - if you're interested in parsing Japanese, check out Kakasi. It
can split into words and convert Kanji->Katakana/Hirugana/Romaji -
after which I would index them all.
http://kakasi.namazu.org/
http://www.kawao.com/java/kakasi/api/com/kawao/kakasi/Kakasi.html
http://kakasi.namazu.org/stable/kakasi-2.3.4.tar.gz <-- contain GPL
Japanese dictionary

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message