Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 65FC47918 for ; Sat, 15 Oct 2011 00:22:33 +0000 (UTC) Received: (qmail 97614 invoked by uid 500); 15 Oct 2011 00:22:31 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 97546 invoked by uid 500); 15 Oct 2011 00:22:30 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 97538 invoked by uid 99); 15 Oct 2011 00:22:30 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 15 Oct 2011 00:22:30 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of shaikhjamir@gmail.com designates 74.125.82.42 as permitted sender) Received: from [74.125.82.42] (HELO mail-ww0-f42.google.com) (74.125.82.42) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 15 Oct 2011 00:22:22 +0000 Received: by wwn22 with SMTP id 22so1499742wwn.5 for ; Fri, 14 Oct 2011 17:22:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:from:date:message-id:subject:to:content-type; bh=1qAAiKkTUAefJEjcPj+tUZr6T3twRtmPzYJFdfA1uzQ=; b=qqpkTrHueHp62ZHR1NWSTU8nQz6XYxQ5ZEzEapNOC6MR/28FX2DIdyi0SO889h+tE0 ihYODLjBZPvkayexuFNOlaSPXo3dJuom1f76ilT1/bl4jc7R336VrXL/73t/d2SxqRVM PtFTg4WCetZXyquvC+gVYTYUsreJoQbEPaAfo= Received: by 10.216.137.81 with SMTP id x59mr3569323wei.26.1318638122171; Fri, 14 Oct 2011 17:22:02 -0700 (PDT) MIME-Version: 1.0 Received: by 10.216.49.206 with HTTP; Fri, 14 Oct 2011 17:21:42 -0700 (PDT) From: Jamir Shaikh Date: Fri, 14 Oct 2011 17:21:42 -0700 Message-ID: Subject: Case insensitive Keyword Analyser To: java-user@lucene.apache.org Content-Type: multipart/alternative; boundary=0016e6d78530dbfe5604af4b5ce7 X-Virus-Checked: Checked by ClamAV on apache.org --0016e6d78530dbfe5604af4b5ce7 Content-Type: text/plain; charset=ISO-8859-1 Hi Guys, Use Case: Field: Name Data: Jose , Jose Sam, jose, jose jacob, jose , joseph, josef , S. Jose, B. jose etc. There is a field (Name), I want to index this field. I will be searching this field for a Wildcard query e.g. jose* This should return all names starting with jose. Search: Jose* (should return all names starting with jose) Solution: 1. Using Standard analyser. Problem with Standard Analyser: If I use Standard Analyser in addition to correct results it returns results like S. Jose, B. jose which do not start with Jose. 2. Using Keyword Analyser. Problem with Keyword Analyser: Keyword Analyser is case sensitive so it misses names like Jose, Jose Sam, This happens becuase a search Jose* will be changed to jose* (all small letters) So is there any analyser available which will take care of such use case. What I am searching is a Case insensitive Keyword Analyser. Or let me know if there is any other approach to handle this use case. Thanks, Jamir --0016e6d78530dbfe5604af4b5ce7--