lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Koji Sekiguchi <k...@r.email.ne.jp>
Subject Re: Normalizing multiple Chars with MappingCharFilter possible?
Date Tue, 24 Nov 2009 11:30:02 GMT
Andreas Kahl wrote:
> Hello everyone,
>
> is it possible to normalize Strings like '`e' (2 chars) => 'e' (in contrast to 'é'
(1 char) => 'e') with org.apache.lucene.analysis.MappingCharFilter?
>
> I am asking this because I am considering to index some multilingual and multi-alphabetic
data with Solr which uses such Strings as a substitution for 'real' Unicode characters. 
>
> Thanks for your advice. 
>
> Andreas
>
>
>   
Yes. It should work.
MappingCharFilter supports:

* char-to-char
* string-to-char
* char-to-string
* string-to-string

without misalignment of original offsets (i.e. highlighter works
correctly with MappingCharFilters).

Koji

-- 
http://www.rondhuit.com/en/


Mime
View raw message