Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 553AD200B88 for ; Thu, 22 Sep 2016 18:10:10 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 53B76160AAD; Thu, 22 Sep 2016 16:10:10 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 16BF9160AA9 for ; Thu, 22 Sep 2016 18:10:08 +0200 (CEST) Received: (qmail 23071 invoked by uid 500); 22 Sep 2016 16:10:08 -0000 Mailing-List: contact general-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@lucene.apache.org Delivered-To: mailing list general@lucene.apache.org Received: (qmail 23051 invoked by uid 99); 22 Sep 2016 16:10:07 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 22 Sep 2016 16:10:07 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 01233C03BC for ; Thu, 22 Sep 2016 16:10:07 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.951 X-Spam-Level: ** X-Spam-Status: No, score=2.951 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_HELO_PASS=-0.001, SPF_SOFTFAIL=0.972] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (1024-bit key) header.d=quickplaytest.onmicrosoft.com Received: from mx2-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id ncQhFIY5B05m for ; Thu, 22 Sep 2016 16:10:04 +0000 (UTC) Received: from NAM02-BL2-obe.outbound.protection.outlook.com (mail-bl2nam02on0099.outbound.protection.outlook.com [104.47.38.99]) by mx2-lw-eu.apache.org (ASF Mail Server at mx2-lw-eu.apache.org) with ESMTPS id C4CA85FB37 for ; Thu, 22 Sep 2016 16:10:03 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=quickplaytest.onmicrosoft.com; s=selector1-quickplay-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version; bh=tTwZbjEsQl3SKXV8snpusnko1iuEebF60+pDyBX2eEs=; b=gsyzge0QFZIrQDJRfxgVghHyJ34XmNhUvnzp11H+GrKhlYP5MYgn8dJdc2A8lnSZ7ABUwkdUwj6pTYjFGtHJJIqWyep4CWd3xL3jlk/3gB8DTzKgug7FT0N4Ym1sIDmQgYDyEpPQBgpqLaAawF3QDmYlvUzoDUNr8PgYDiiJWjQ= Received: from BN6PR04MB0213.namprd04.prod.outlook.com (10.168.224.22) by BN6PR04MB0211.namprd04.prod.outlook.com (10.168.224.20) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA384_P384) id 15.1.629.8; Thu, 22 Sep 2016 16:09:55 +0000 Received: from BN6PR04MB0213.namprd04.prod.outlook.com ([10.168.224.22]) by BN6PR04MB0213.namprd04.prod.outlook.com ([10.168.224.22]) with mapi id 15.01.0629.015; Thu, 22 Sep 2016 16:09:55 +0000 From: Satish Chennupati To: "general@lucene.apache.org" Subject: issue with case-insensitive sorting Thread-Topic: issue with case-insensitive sorting Thread-Index: AdIU6lcf+grR8PsbTRil81HFh1ntLg== Date: Thu, 22 Sep 2016 16:09:55 +0000 Message-ID: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: yes X-MS-TNEF-Correlator: authentication-results: spf=none (sender IP is ) smtp.mailfrom=satish.chennupati@quickplay.com; x-originating-ip: [66.207.203.114] x-ms-office365-filtering-correlation-id: fc11b799-3f61-4c10-1c4a-08d3e302e4b8 x-microsoft-exchange-diagnostics: 1;BN6PR04MB0211;6:413M5y3puMPdQuYJqNBq9xa5BsgO23khugbsMfQQ4AWijByw15GhewhrT1M+p51TmDnanld55xwsJ1vXqS6QQkWgGK9ObwDEWHMFH5ge82pqchw8WiJbXLzWvlXatZgoSe4OjbWgFZCMQRwJAU/vspps8kBivTznG2ZBeLH22KEqAxkXhbRN7iZIe5B+yWuUjQAPwqZHS0yEB4Wl+v7C/TLS1J2/B5bmWvrFnCNWpT0lCVuTb4AcGPPRQ+1QNdja+ApjZ3Duo2CoHTyloX+5zo2s2jvbRty01sZoICtDf+8=;5:8rx7Vuix4EVespGqCz4zCvP9A7BSnC0lYizZdIVamGMu71fEGPuGQiBAyD0z3PgMqKH+784ykBsK0GIEkmr2wAkBeeiTkz0jrGbqOqLtpnqvZImBl42PEZJmCH6I8/jFyCpUP1D/PuitQBpTlwIhhQ==;24:o0wi7Bdxi9rNLa9vmU2nRPIvHSIZj7MtcLPzs06d/uIdu6aBdp8QQoSee0RT8Ph0iUyD6FLSzgd2k2INMFxu7jot5/+MulLktwcldvSvVDU=;7:Wa3G/VcwkhX3j+W06qzPWCFx4RyuBntRcy2z358HU4P8oyzkSf93Jvx7pw7GaZQcOPfua78kGGo39ar8pEaYcMAU5Ora7WHuva5n4PMx1C9Bf/c2DcFLfnMQ9IaSyC8k/yKt8LxympI7pC9P6516guTqTZYgT7b+lGLYUrgxKUQ9Te6WNjFLfF3JSf5YfW4XfTkKztFnztr8vFhq7rsuyIFOqWbcnLK76fQamu7bH4wwRWOHlkpPanfaelKy4gPrz984gOeeyrz9h02XkitKS1derKpaWGFlv8nbzNoIX4kTLUHKghO5Qbl0N9V6tqES x-microsoft-antispam: UriScan:;BCL:0;PCL:0;RULEID:;SRVR:BN6PR04MB0211; x-microsoft-antispam-prvs: x-exchange-antispam-report-test: UriScan:(158342451672863)(788757137089)(21748063052155); x-exchange-antispam-report-cfa-test: BCL:0;PCL:0;RULEID:(102415321)(6040176)(601004)(2401047)(5005006)(8121501046)(10201501046)(3002001);SRVR:BN6PR04MB0211;BCL:0;PCL:0;RULEID:;SRVR:BN6PR04MB0211; x-forefront-prvs: 0073BFEF03 x-forefront-antispam-report: SFV:NSPM;SFS:(10019020)(7916002)(199003)(36304003)(189002)(252514010)(99286002)(2906002)(19580395003)(110136003)(16236675004)(586003)(3280700002)(5890100001)(2501003)(66066001)(11100500001)(7846002)(7736002)(68736007)(7696004)(9686002)(74316002)(9326002)(87936001)(33656002)(3480700004)(5002640100001)(17760045003)(19625215002)(189998001)(10400500002)(92566002)(450100001)(2900100001)(97736004)(76576001)(86362001)(5660300001)(19580405001)(19300405004)(77096005)(18206015028)(101416001)(107886002)(15975445007)(105586002)(8936002)(19627595001)(81166006)(1730700003)(2351001)(99936001)(8676002)(790700001)(102836003)(3846002)(6116002)(54356999)(50986999)(229853001)(122556002)(81156014)(106356001)(3660700001);DIR:OUT;SFP:1102;SCL:1;SRVR:BN6PR04MB0211;H:BN6PR04MB0213.namprd04.prod.outlook.com;FPR:;SPF:None;PTR:InfoNoRecords;A:1;MX:1;LANG:en; received-spf: None (protection.outlook.com: quickplay.com does not designate permitted sender hosts) spamdiagnosticoutput: 1:99 spamdiagnosticmetadata: NSPM Content-Type: multipart/related; boundary="_004_BN6PR04MB02130C11653D3B3B4B1CDC1CF9C90BN6PR04MB0213namp_"; type="multipart/alternative" MIME-Version: 1.0 X-OriginatorOrg: quickplay.com X-MS-Exchange-CrossTenant-originalarrivaltime: 22 Sep 2016 16:09:55.5562 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: 4b509505-af5a-41f6-8419-7ffcd3944011 X-MS-Exchange-Transport-CrossTenantHeadersStamped: BN6PR04MB0211 archived-at: Thu, 22 Sep 2016 16:10:10 -0000 --_004_BN6PR04MB02130C11653D3B3B4B1CDC1CF9C90BN6PR04MB0213namp_ Content-Type: multipart/alternative; boundary="_000_BN6PR04MB02130C11653D3B3B4B1CDC1CF9C90BN6PR04MB0213namp_" --_000_BN6PR04MB02130C11653D3B3B4B1CDC1CF9C90BN6PR04MB0213namp_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Hi, Right now when I perform a sort I am getting result as following: "[\"18 hotfix3\"]" "[\"Godzilla\"]" "[\"Godzilla, King of the Monsters!\"]" "[\"Harry Potter and the Sorcerers Stone\"]" "[\"How to Train Your Dragon\"]" "[\"Jurassic Park\"]" "[\"My Big Fat Greek Wedding\"]" "[\"National Treasure\"]" "[\"Palmer\"]" "[\"Patch Adams\"]" "[\"Rajan\"]" "[\"Sanity\"]" "[\"Stardust\"]" "[\"Superman\"]" "[\"The Amazing Spider-Man 2\"]" "[\"The Godfather\"]" "[\"The Lord of the Rings: The Fellowship of the Ring\"]" "[\"The Matrix\"]" "[\"V for Vendetta\"]" "[\"abcdefgh\"]" "[\"autoui1466571231695\"]" "[\"autoui1466605339320\"]" "[\"name\"]" "[\"test\"]" "[\"test2\"]" The field type has been defined as follows : And for sorting purpose we have a dynamic field in place that used the abov= e field type Issue: even though we have lowercase filter factory in place the sort doesn= 't happen case-insensitive. Satish Chennupati Senior Server Engineer Cell: (416) 918-9959 Email: satish.chennupati@quickplay.com [Quickplay] This email and any attachments are for the sole use of the intended recipie= nts and may be privileged or confidential. Any distribution, printing or ot= her use by anyone else is prohibited. If you are not an intended recipient,= please contact the sender immediately, and permanently delete this email a= nd attachments. Le pr?sent courriel et les documents qui y sont joints sont exclusivement r= ?serv?s ? l'utilisation des destinataires concern?s et peuvent ?tre de natu= re privil?gi?e ou confidentielle. Toute distribution, impression ou autre u= tilisation est interdite aux autres personnes. Si vous ne faites pas partie= des destinataires concern?s, veuillez en informer imm?diatement l'exp?dite= ur, ainsi que supprimer ce courriel et les documents joints de mani?re perm= anente. --_000_BN6PR04MB02130C11653D3B3B4B1CDC1CF9C90BN6PR04MB0213namp_ Content-Type: text/html; charset="us-ascii" Content-Transfer-Encoding: quoted-printable

Hi,

 

Right now when I perform a sort I  am getting r= esult as following:

 

 

"[\"18 hotfix3\"]"

"[\"Godzilla\"]"

"[\"Godzilla, King of the Monsters!\"= ]"

"[\"Harry Potter and the Sorcerers Stone\&= quot;]"

"[\"How to Train Your Dragon\"]"=

"[\"Jurassic Park\"]"=

"[\"My Big Fat Greek Wedding\"]"=

"[\"National Treasure\"]"

"[\"Palmer\"]"

"[\"Patch Adams\"]"

"[\"Rajan\"]"

"[\"Sanity\"]"

"[\"Stardust\"]"

"[\"Superman\"]"

"[\"The Amazing Spider-Man 2\"]"=

"[\"The Godfather\"]"=

"[\"The Lord of the Rings: The Fellowship = of the Ring\"]"

"[\"The Matrix\"]"

"[\"V for Vendetta\"]"

"[\"abcdefgh\"]"

"[\"autoui1466571231695\"]"=

"[\"autoui1466605339320\"]"=

"[\"name\"]"

"[\"test\"]"

"[\"test2\"]"

 

The field type has been defined as follows :

 

       <fi= eldType class=3D"org.apache.solr.schema.TextField" name=3D"T= extField" sortMissingLast=3D"true">

       &= nbsp;    <analyzer>

       &= nbsp;        <tokenizer class=3D"= ;solr.KeywordTokenizerFactory"/>

       &= nbsp;        <!-- lower case everythi= ng -->

       &= nbsp;        <filter class=3D"so= lr.LowerCaseFilterFactory"/>

       &= nbsp;        <!-- remove lead/trail w= hitespace -->

       &= nbsp;        <filter class=3D"so= lr.TrimFilterFactory"/>

       &= nbsp;        <!-- pad and trim number= s to an even 6 digits with leading 0's -->

       &= nbsp;        <filter class=3D"so= lr.PatternReplaceFilterFactory"

       &= nbsp;           &nbs= p;    pattern=3D"(\d+)" replacement=3D"00= 000$1" replace=3D"all"/>

       &= nbsp;        <filter class=3D"so= lr.PatternReplaceFilterFactory"

       &= nbsp;           &nbs= p;    pattern=3D"0*([0-9]{6,})" replacement=3D&quo= t;$1" replace=3D"all" />

        =            

       &= nbsp;    </analyzer>

        = </fieldType>

 

 

And for sorting purpose we have a dynamic field in p= lace that used the above field type

        = <!-- fields for sorting -->

        = <dynamicField indexed=3D"true" multiValued=3D"false"= name=3D"sort_str*" stored=3D"false" type=3D"SortT= extField"/>

 

Issue: even though we have lowercase filter facto= ry in place the sort doesn’t happen case-insensitive.<= /p>

 

Satish Chennupati

Senior Server Engineer

Cell:         (416) 918-9959

Email:      satis= h.chennupati@quickplay.com

3D"Quickplay"

 




This email and any attachments are for the sole use of the intended recipie= nts and may be privileged or confidential. Any distribution, printing or ot= her use by anyone else is prohibited. If you are not an intended recipient,= please contact the sender immediately, and permanently delete this email and attachments.

Le présent courriel et les documents qui y sont joints sont exclusiv= ement réservés à l'utilisation des destinataires conce= rnés et peuvent être de nature privilégiée ou co= nfidentielle. Toute distribution, impression ou autre utilisation est inter= dite aux autres personnes. Si vous ne faites pas partie des destinataires concernés= , veuillez en informer immédiatement l'expéditeur, ainsi que = supprimer ce courriel et les documents joints de manière permanente. --_000_BN6PR04MB02130C11653D3B3B4B1CDC1CF9C90BN6PR04MB0213namp_-- --_004_BN6PR04MB02130C11653D3B3B4B1CDC1CF9C90BN6PR04MB0213namp_--