lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "UnicodeCollation" by RobertMuir
Date Fri, 04 Dec 2009 03:55:50 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "UnicodeCollation" page has been changed by RobertMuir.
http://wiki.apache.org/solr/UnicodeCollation?action=diff&rev1=2&rev2=3

--------------------------------------------------

  == Sorting text for multiple languages ==
  There are two approaches to supporting multiple languages:
  
-  * If there is a small list (FIXME: small list of Languages? Fields?), consider defining
collated fields for each language and using copyField.
+  * If there is a small list of languages you wish to support, consider defining collated
fields for each language and using copyField.
-  * If there is a very large list, an alternative is to use the "Unicode default" collator.
- 
+  * However, adding a large number of sort fields can increase disk and indexing costs. An
alternative approach is to use the Unicode "default" collator.
+    
  The Unicode default, or "ROOT" Locale, has rules that are designed to work well in general
for most languages. To use it, simply define the language as the empty string.
  
  This Unicode default sort is still significantly more advanced than the standard Solr sort.
@@ -60, +60 @@

    </analyzer>
  </fieldType>
  }}}
+ 
  == Sorting text with custom rules ==
  For advanced usage, you can define your own set of rules that determine how the sorting
takes place. Its easiest not to start from scratch, but instead to take existing rules that
are close to what you want, and "tailor" or customize them.
  

Mime
View raw message