lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <>
Subject [Solr Wiki] Update of "AnalyzersTokenizersTokenFilters" by naomidushay
Date Wed, 03 Nov 2010 20:45:40 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "AnalyzersTokenizersTokenFilters" page has been changed by naomidushay.
The comment on this change is: clarifying word delimiter filter factory explanation - no way
to change default of which non-alphanum chars it splits on .


  Splits words into subwords and performs optional transformations on subword groups. By default,
words are split into subwords with the following rules:
-  * split on intra-word delimiters (by default, all non alpha-numeric characters).
+  * split on intra-word delimiters (all non alpha-numeric characters).
    * `"Wi-Fi" -> "Wi", "Fi"`
   * split on case transitions (can be turned off - see splitOnCaseChange parameter)
    * `"PowerShot" -> "Power", "Shot"`

View raw message