lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Che Dong" <ched...@hotmail.com>
Subject Re: Japanese Analyzer
Date Sat, 31 Jan 2004 03:50:19 GMT
As I know: for east Asian Languages(which without space for word segment in natural), as an
non-dictionary based solution, bigram based word segment maybe the best way.

Regards

Che, Dong

----- Original Message ----- 
From: "Erik Hatcher" <erik@ehatchersolutions.com>
To: "Lucene Users List" <lucene-user@jakarta.apache.org>
Sent: Saturday, January 31, 2004 1:14 AM
Subject: Re: Japanese Analyzer


> On Jan 29, 2004, at 1:45 PM, Otis Gospodnetic wrote:
> > --- "Weir, Michael" <Michael.Weir@cognos.com> wrote:
> >> Is the CJKAnalyzer the best to use for Japanese?  If not, which is?
> >> If so,
> >> from where can I download it?
> 
> There is also a ChineseTokenizer/Analyzer in the sandbox as well.  It 
> may have value for Japanese as well?
> 
> Erik
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
> 
> 
Mime
View raw message