From: Nick Wellnhofer
Date: Thu, 17 Nov 2011 13:30:35 +0100
To: lucy-dev@incubator.apache.org
Message-ID: <4EC4FE6B.5010104@aevum.de>
Subject: Re: [lucy-dev] Unicode integration

On 17/11/2011 01:46, Robert Muir wrote:
> Does your unicode library also support "NFKC_CaseFold"? It might be a
> nice default:
>
> # Derived Property: NFKC_Casefold (NFKC_CF)
> # This property removes certain variations from characters: case,
> compatibility, and default-ignorables.
> # It is used for loose matching and certain types of identifiers.
> # It is constructed by applying NFKC, CaseFolding, and removal of
> Default_Ignorable_Code_Points.
> # The process of applying these transformations is repeated until a
> stable result is produced.

I'm not sure about the last point, but NFKC, CaseFolding, and removal of
Default_Ignorable_Code_Points are supported.

Nick
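
For illustration, the construction quoted above (NFKC, case folding, removal of default-ignorable code points, repeated until the result is stable) can be sketched in Python with the standard library. This is only a rough sketch, not the Lucy implementation and not the official NFKC_Casefold mapping: the names `IGNORABLE_SUBSET`, `fold_once`, and `loose_fold` are made up for this example, and the ignorable set is a small hand-picked subset of Default_Ignorable_Code_Point, since Python's `unicodedata` does not expose that property.

```python
import unicodedata

# Assumption: a small illustrative subset only. The full
# Default_Ignorable_Code_Point set is defined in the Unicode Character
# Database (DerivedCoreProperties.txt) and is much larger.
IGNORABLE_SUBSET = (
    {0x00AD, 0x2060, 0xFEFF}          # soft hyphen, word joiner, BOM/ZWNBSP
    | set(range(0x200B, 0x2010))      # U+200B..U+200F
    | set(range(0xFE00, 0xFE10))      # U+FE00..U+FE0F (variation selectors)
)


def fold_once(text):
    """Apply the three steps once, in the order the quoted description lists them."""
    text = unicodedata.normalize("NFKC", text)   # compatibility normalization
    text = text.casefold()                       # full Unicode case folding
    return "".join(ch for ch in text if ord(ch) not in IGNORABLE_SUBSET)


def loose_fold(text, max_rounds=10):
    """Repeat fold_once until the string stops changing (or max_rounds is hit)."""
    for _ in range(max_rounds):
        folded = fold_once(text)
        if folded == text:
            return folded
        text = folded
    return text
```

With this sketch, `loose_fold("Straße")` returns `"strasse"`, and a soft hyphen inside a word is dropped. The loop is there because case folding can produce a string that is no longer in NFKC form, so a single pass is not guaranteed to be stable.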