lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject DO NOT REPLY [Bug 18933] - Add support for Chinese, Japanese, and Korean to the core build.
Date Fri, 11 Apr 2003 07:44:45 GMT
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=18933>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=18933

Add support for Chinese, Japanese, and Korean to the core build.





------- Additional Comments From chedong@yeah.net  2003-04-11 07:44 -------
for most without natural space based word segment languages(like Chinese 
Japanese Korean) I use bigram to and make query parser default boolean query 
with 'and' relationship(It's an option in QueryParser of 1.3 release).

for example: 
C1C2C3C4 will segment to  C1C2 C2C3 C3C4 while indexing
and query parsed to C1C2 +C2C3 +C3C4 while searching.

Regards

Che, Dong
http://www.chedong.com/tech/

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org


Mime
View raw message