incubator-lucy-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Robert Muir <rcm...@gmail.com>
Subject Re: [lucy-dev] Unicode integration
Date Thu, 17 Nov 2011 12:37:40 GMT
On Thu, Nov 17, 2011 at 7:30 AM, Nick Wellnhofer <wellnhofer@aevum.de> wrote:
>
> I'm not sure about the last point but NFKC, CaseFolding, and removal of
> Default_Ignorable_Code_Points are supported.
>

The point of the derived property is that there are sneaky
interactions between these.

in icu, this form is "nfkc_cf" and you get it like any other
normalizer, and accomplish this
in a single pass over the text.




-- 
lucidimagination.com

Mime
View raw message