lucene-solr-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "LanguageAnalysis" by ManosLaliotis
Date Wed, 15 Feb 2012 10:12:03 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "LanguageAnalysis" page has been changed by ManosLaliotis:
http://wiki.apache.org/solr/LanguageAnalysis?action=diff&rev1=17&rev2=18

Comment:
ICUFoldingFilterFactory has been available since Solr3.1

  
  For some languages in non-Latin writing systems (Arabic, Greek, Hindi, Persian), there are
filters to support the idea of "diacritics-insensitive search" already included in Solr. These
filters are described above under the relevant languages.
  
- For other languages, the ASCIIFoldingFilterFactory won't do the foldings that you need.
One solution is to use the ICUFoldingFilter <!> [[Lucene3.1]], which implements a [[http://unicode.org/reports/tr30/tr30-4.html|similar
idea]] across all of Unicode. Unfortunately, this filter is not yet integrated into Solr,
so for now you must make the factory yourself.
+ For other languages, the ASCIIFoldingFilterFactory won't do the foldings that you need.
One solution is to use {{{solr.analysis.ICUFoldingFilterFactory}}} <!> [[Solr3.1]],
which implements a [[http://unicode.org/reports/tr30/tr30-4.html|similar idea]] across all
of Unicode
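
As a rough illustration of the change described above, a field type using the ICU folding filter might be declared in schema.xml along the following lines. This is only a sketch under assumptions: the field type name {{{text_icu_folded}}} and the choice of tokenizer are hypothetical and not taken from the wiki page, the factory is written in the usual {{{solr.}}} shorthand, and the ICU analysis jars (shipped in the analysis-extras contrib) are assumed to be on Solr's classpath.

{{{
<!-- Hypothetical field type: the name and tokenizer are illustrative only -->
<fieldType name="text_icu_folded" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Split text into tokens before folding -->
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- Unicode-wide folding (case, accents and similar variants) -->
    <filter class="solr.ICUFoldingFilterFactory"/>
  </analyzer>
</fieldType>
}}}

Because no separate index-time and query-time analyzers are declared, the same folding applies on both sides, which is what makes diacritics-insensitive matching work.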
  
  === Stopwords ===
  
