lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "UnicodeCollation" by RobertMuir
Date Fri, 04 Dec 2009 18:34:55 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "UnicodeCollation" page has been changed by RobertMuir.
http://wiki.apache.org/solr/UnicodeCollation?action=diff&rev1=2&rev2=3

--------------------------------------------------

  == Overview ==
  [[http://en.wikipedia.org/wiki/Unicode_collation_algorithm|Unicode Collation]] is a method
to sort text in a language-sensitive way. It is primarily intended for sorting, but can also
be used for advanced search purposes.
  
+ Unicode Collation in Solr is fast, all the work is done at index time. 
+ The way it works is that instead of just using a KeywordTokenizerFactory to create a sort
field, you use KeywordTokenizerFactory followed by CollationKeyFilterFactory.
+ At index time this indexes special "sort keys" into the sort field, so that at search you
just sort on the sort field, and it comes back in collated order.
+ 
- Unicode Collation in Solr is fast, all the work is done at index time. For more information,
see the [[http://lucene.apache.org/solr/api/org/apache/solr/analysis/CollationKeyFilterFactory.html|Javadocs]].
+ For more information, see the [[http://lucene.apache.org/solr/api/org/apache/solr/analysis/CollationKeyFilterFactory.html|Javadocs]].
  
  <<TableOfContents>>
  

Mime
View raw message