lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chris Hostetter <hossman_luc...@fucit.org>
Subject Re: special characters "ø" indexing/searching
Date Fri, 19 Nov 2010 23:25:42 GMT

: I've managed to do this by adding 
: 
: <filter class="solr.ISOLatin1AccentFilterFactory"/>
: 
: To the fieldType that the field is using.  It seems to work well.  Can
: anyone advise if this is not a good idea?

ISOLatin1AccentFilterFactory works fine, but because it's a TokenFilter it 
changes the Tokens, which means you may see oddities with token offset 
info (used in highlighting)

Instead you might wnat to use <charFilter 
class="solr.MappingCharFilterFactory" ... /> with a mapping file where you 
define the mappings you want to add.

(CharFilter's operate on the raw Character stream before tokenization, and 
fix the character offset information as they replace characters)



-Hoss

Mime
View raw message