lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Trivial Update of "LanguageDetection" by KojiSekiguchi
Date Mon, 17 Oct 2011 02:06:45 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "LanguageDetection" page has been changed by KojiSekiguchi:
http://wiki.apache.org/solr/LanguageDetection?action=diff&rev1=13&rev2=14

Comment:
use ja for Japanese rather than jp

  '''Default:''' (Empty - not used)
  
  == langid.map.lcmap ==
- If this parameter is specified, it will be used as a language code map. A typical usage
is to map multiple detected languages to the same field name. I.e. to map both Japanese, Korean
and Chinese texts to the same schema field "*_cjk", do: {{{langid.map.lcmap=jp:cjk zh:cjk
ko:cjk}}}. Another use is if your language identification outputs something like en_US or
en_GB but you want only one field with *_en, you say {{{langid.map.lcmap=en_GB:en en_US:en}}}.
Note that this setting does not affect the language codes written to langField.
+ If this parameter is specified, it will be used as a language code map. A typical usage
is to map multiple detected languages to the same field name. I.e. to map both Japanese, Korean
and Chinese texts to the same schema field "*_cjk", do: {{{langid.map.lcmap=ja:cjk zh:cjk
ko:cjk}}}. Another use is if your language identification outputs something like en_US or
en_GB but you want only one field with *_en, you say {{{langid.map.lcmap=en_GB:en en_US:en}}}.
Note that this setting does not affect the language codes written to langField.
  
  '''Value:''' A space separated list of language code mappings, on the form <from>:<to>
  

Mime
View raw message