lucene-dev mailing list archives

From: "Che Dong" <ched...@hotmail.com>
Subject: Re: sigram?
Date: Tue, 09 Dec 2003 09:39:32 GMT
It means tokenizing Chinese/Japanese text (which by nature has no spaces to
segment words) character by character, so each character becomes its own token.
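
To make the idea concrete, here is a minimal sketch in plain Java of
character-by-character tokenization. It is not the StandardTokenizer code; the
class name SigramSketch, the helper isCjk, and the choice of Han, Hiragana and
Katakana as the "CJK" scripts are assumptions made for illustration only.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not taken from Lucene's StandardTokenizer.
class SigramSketch {

    // Assumption: treat Han, Hiragana and Katakana code points as "CJK".
    static boolean isCjk(int cp) {
        Character.UnicodeScript s = Character.UnicodeScript.of(cp);
        return s == Character.UnicodeScript.HAN
            || s == Character.UnicodeScript.HIRAGANA
            || s == Character.UnicodeScript.KATAKANA;
    }

    // Emit each CJK character as its own token; group other letters and
    // digits into words, splitting on whitespace and punctuation.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder word = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (isCjk(cp)) {
                flush(word, tokens);
                tokens.add(new String(Character.toChars(cp)));
            } else if (Character.isLetterOrDigit(cp)) {
                word.appendCodePoint(cp);
            } else {
                flush(word, tokens);
            }
        }
        flush(word, tokens);
        return tokens;
    }

    // Close the pending non-CJK word, if any.
    static void flush(StringBuilder word, List<String> tokens) {
        if (word.length() > 0) {
            tokens.add(word.toString());
            word.setLength(0);
        }
    }

    public static void main(String[] args) {
        System.out.println(tokenize("Lucene 中文検索"));
    }
}
```

With this sketch, the input "Lucene 中文検索" yields the tokens Lucene, 中, 文,
検, 索: the Latin word stays whole, while each CJK character stands alone.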

Regards

Che, Dong
----- Original Message ----- 
From: "Erik Hatcher" <erik@ehatchersolutions.com>
To: "Lucene List" <lucene-dev@jakarta.apache.org>
Sent: Tuesday, December 09, 2003 7:11 AM
Subject: sigram?


> Could someone define "sigram" for me?  It is used as a type of token in 
> StandardTokenizer.  I know it relates to the CJK stuff, but I'm curious 
> about the term "sigram" and what it means, specifically in the context 
> of the StandardTokenizer.
> 
> Thanks, 
> Erik
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-dev-help@jakarta.apache.org
> 
> 