lucene-solr-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "AnalyzersTokenizersTokenFilters" by RobertMuir
Date Fri, 05 Feb 2010 15:32:35 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "AnalyzersTokenizersTokenFilters" page has been changed by RobertMuir.
The comment on this change is: sorry, forgot one more gotcha.
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters?action=diff&rev1=74&rev2=75

--------------------------------------------------

  <!> Gotchas:
   * Although the Lovins stemmer is described as faster than Porter/Porter2, in practice it
is much slower in Solr, because it is implemented using reflection.
   * Neither the Lovins nor the Finnish stemmer produces correct output (as of Solr 1.4), due
to a [[http://article.gmane.org/gmane.comp.search.snowball/1139|known bug in Snowball]].
+  * The non-English stemmers are sensitive to diacritics. Think carefully before removing
diacritics with a filter such as `ASCIIFoldingFilterFactory` ahead of the stemmer, as this
could cause unwanted results.
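
To make the diacritics gotcha concrete, here is a minimal Python sketch of what folding does to the text a stemmer would receive. It is not Solr code: the helper `fold_diacritics` is a hypothetical stand-in that only approximates `ASCIIFoldingFilterFactory` by dropping combining marks, and the word pairs are illustrative examples chosen here, not taken from the wiki page.

```python
import unicodedata


def fold_diacritics(text):
    """Hypothetical stand-in for ASCII folding (an assumption, not Solr's code).

    Decomposes each character (NFKD) and drops the combining marks, so that
    an accented letter is reduced to its base letter.
    """
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Illustrative word pairs that differ only by diacritics.
pairs = [
    ("schön", "schon"),   # German: "beautiful" vs "already"
    ("pêche", "péché"),   # French: "peach" / "fishing" vs "sin"
]

for first, second in pairs:
    same = fold_diacritics(first) == fold_diacritics(second)
    print(first, second, "identical after folding:", same)
```

Once two distinct words have been folded to the same string, no stemmer placed later in the chain can tell them apart. If folding is still wanted, placing it after the stemmer rather than before it may be worth testing against real data.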
  
  <<Anchor(WordDelimiterFilter)>>
  ==== solr.WordDelimiterFilterFactory ====
