Message-ID: <3F0BBEA1.1090001@siderean.com>
Date: Wed, 09 Jul 2003 00:05:05 -0700
From: Cormac Twomey
To: Lucene Developers List
Subject: Fuzzy queries are case sensitive; doesn't behave as documented
In-Reply-To: <3F0BB347.4060406@dstc.edu.au>

Folks,

I raised this issue a while back but it went
unanswered so I'm trying again. Anyhow, FuzzyTermEnum.java appears to have two problems:

1) FuzzyTermEnum searches are case sensitive. Presumably this is not as designed? See bug #18014 for a candidate patch.

2) The "Query Syntax" page on the website states, in the "Fuzzy Searches" section, that:

"Terms found by the fuzzy search will automatically get a boost factor of 0.2"

I've found this not to be the case. In my previous email ( http://www.mail-archive.com/lucene-dev@jakarta.apache.org/msg02819.html ) I documented a test case for this. For me, fuzzy matches sometimes rank ahead of exact matches. As this is not as clear-cut a bug as the case sensitivity issue, I didn't post it as a bug.

Any feedback you have would be much appreciated.

Thanks,
--Cormac Twomey
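
To illustrate why case matters for point 1, here is a minimal, self-contained sketch of a plain Levenshtein edit distance. It is not the code in FuzzyTermEnum.java and not the patch attached to bug #18014; the class and method names (FuzzyCaseSketch, editDistance, editDistanceIgnoreCase) are hypothetical and exist only for this example. It assumes that a fuzzy match is decided by comparing characters exactly, so terms that differ only in case count as edits.

```java
import java.util.Locale;

// Hypothetical illustration only; not Lucene code.
public class FuzzyCaseSketch {

    // Plain Levenshtein distance; characters are compared exactly,
    // so 'L' and 'l' count as different characters.
    static int editDistance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // distance from the empty prefix of a
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // distance to the empty prefix of b
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(
                        Math.min(curr[j - 1] + 1, prev[j] + 1), // insert, delete
                        prev[j - 1] + cost);                    // substitute or match
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Assumed workaround: lowercase both terms before comparing.
    static int editDistanceIgnoreCase(String a, String b) {
        return editDistance(a.toLowerCase(Locale.ROOT), b.toLowerCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        System.out.println(editDistance("Lucene", "lucene"));           // 1
        System.out.println(editDistance("LUCENE", "lucene"));           // 6
        System.out.println(editDistanceIgnoreCase("LUCENE", "lucene")); // 0
    }
}
```

Under this assumption, a query term typed in upper case can end up as far from the indexed lower-case term as a completely different word, which would be consistent with the case sensitivity described in point 1.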