lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <>
Subject [Solr Wiki] Update of "AnalyzersTokenizersTokenFilters" by RobertMuir
Date Fri, 05 Feb 2010 15:32:35 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "AnalyzersTokenizersTokenFilters" page has been changed by RobertMuir.
The comment on this change is: sorry, forgot one more gotcha.


  <!> Gotchas:
   * Although the Lovins stemmer is described as faster than Porter/Porter2, practically it
is much slower in Solr, as it is implemented using reflection.
   * Neither the Lovins nor the Finnish stemmer produce correct output (as of Solr 1.4), due
to a [[|known bug in Snowball]]
+  * The Non-English stemmers are sensitive to diacritics. Think carefully before removing
these with something like `ASCIIFoldingFilterFactory` before stemming, as this could cause
unwanted results.
  ==== solr.WordDelimiterFilterFactory ====

View raw message