lucy-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nick Wellnhofer <wellnho...@aevum.de>
Subject Re: [lucy-dev] Unicode integration
Date Thu, 17 Nov 2011 12:30:35 GMT
On 17/11/2011 01:46, Robert Muir wrote:
> Does your unicode library also support "NFKC_CaseFold" ? It might be a
> nice default:
>
> # Derived Property:   NFKC_Casefold (NFKC_CF)
> #   This property removes certain variations from characters: case,
> compatibility, and default-ignorables.
> #   It is used for loose matching and certain types of identifiers.
> #   It is constructed by applying NFKC, CaseFolding, and removal of
> Default_Ignorable_Code_Points.
> #   The process of applying these transformations is repeated until a
> stable result is produced.

I'm not sure about the last point but NFKC, CaseFolding, and removal of 
Default_Ignorable_Code_Points are supported.

Nick

Mime
View raw message