lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: special characters "ø" indexing/searching
Date Fri, 19 Nov 2010 23:32:12 GMT
Shouldn't all ISO Latin accented characters translate one to one with
unaccented characters?

On Fri, Nov 19, 2010 at 3:25 PM, Chris Hostetter
<hossman_lucene@fucit.org>wrote:

> : I've managed to do this by adding
> :
> : <filter class="solr.ISOLatin1AccentFilterFactory"/>
> :
> : To the fieldType that the field is using.  It seems to work well.  Can
> : anyone advise if this is not a good idea?
>
> ISOLatin1AccentFilterFactory works fine, but because it's a TokenFilter it
> changes the Tokens, which means you may see oddities with token offset
> info (used in highlighting)
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message