lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Robert Muir <rcm...@gmail.com>
Subject Re: CJKWidthFilter vs ICUFoldingFilter
Date Wed, 14 Nov 2012 19:17:20 GMT
On Wed, Nov 14, 2012 at 9:47 AM, Scott Smith <ssmith@mainstreamdata.com> wrote:
> Reading the documentation for these two filters seems to imply that CJKWidthFilter is
a subset of ICUFoldingFilter.  Is that true?  I'm basically using the CjkAnalyzer (from Lucene
4.0) but adding ICUFoldingFilter because I need umlauts and accent characters removed from
any German, French, etc.
>
> Can I just use the ICUFoldingFilter?

Yes. its a subset of NFKC, which is a subset of ICUFolding filter :)

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message