lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chris Hostetter <hossman_luc...@fucit.org>
Subject Re: special characters "ø" indexing/searching
Date Fri, 19 Nov 2010 23:39:46 GMT

: Shouldn't all ISO Latin accented characters translate one to one with
: unaccented characters?

a) i have no idea

b) the OP never actually said that all their characters were in the 
ISOLAttin1 range, just that they had tried using 
ISOLatin1AccentFilterFactory, which brings up the excellent point that if 
they have other "special" characters outside of ISOLatin1 that's all hte 
more reason why they might wnat to consider using MappingCharFilterFactory

: 
: On Fri, Nov 19, 2010 at 3:25 PM, Chris Hostetter
: <hossman_lucene@fucit.org>wrote:
: 
: > : I've managed to do this by adding
: > :
: > : <filter class="solr.ISOLatin1AccentFilterFactory"/>
: > :
: > : To the fieldType that the field is using.  It seems to work well.  Can
: > : anyone advise if this is not a good idea?
: >
: > ISOLatin1AccentFilterFactory works fine, but because it's a TokenFilter it
: > changes the Tokens, which means you may see oddities with token offset
: > info (used in highlighting)
: >
: 

-Hoss

Mime
View raw message