lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alexandre Rafalovitch <>
Subject Re: Preferred Scema/Config for Chinese Language Cores?
Date Fri, 05 Dec 2014 03:14:46 GMT
I have a couple of links that may be useful, though I have not tried
Chinese indexing myself: (12 articles on CJK!)

Also, may be worth checking out the commercial offering from - one of the big issues with Chinese is that
tokenization rules are mostly dictionary-based and commercial
dictionaries could be significantly better than free ones :-)

Personal: and @arafalov
Solr resources and newsletter: and @solrstart
Solr popularizers community:

On 4 December 2014 at 22:07, Tom Zimmermann <> wrote:
> Hi ,
> We are setting up our first Chinese language index and our team has found
> multiple conflicting bits of information regarding the proper configuration
> for tokenizing, filtering etc. Does anyone out there have a good
> functioning example we could work from our some links with guidance.
> Thanks,
> Tom

View raw message