Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 66A65DAC6 for ; Sat, 10 Nov 2012 00:38:22 +0000 (UTC) Received: (qmail 75045 invoked by uid 500); 10 Nov 2012 00:38:21 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 74981 invoked by uid 500); 10 Nov 2012 00:38:21 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 74974 invoked by uid 99); 10 Nov 2012 00:38:21 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 10 Nov 2012 00:38:21 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of rcmuir@gmail.com designates 209.85.216.176 as permitted sender) Received: from [209.85.216.176] (HELO mail-qc0-f176.google.com) (209.85.216.176) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 10 Nov 2012 00:38:14 +0000 Received: by mail-qc0-f176.google.com with SMTP id n41so3401640qco.35 for ; Fri, 09 Nov 2012 16:37:54 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=lZKSo6FAFuGqUyc9zMKqT1Z1GBWecs46pjfk00fs/VY=; b=pdGdmfZBBBRCBk/3njor99BeWauaElTRXLRljWLO/E3WKxcThAOVVHX64U9zVmp4z8 DRMYZDTMTGQX9FhGV4Dwl0MBTdBIOLrXIkU1RCv6p0yM+iWFvL9pFjou0AZhnU4bcLk3 JvPGex6bq5G4GIV42Vhxw/PI+XYhm8s2lFMhCHOESe6dTF2ozHKDlGk8s2kRquWYtwcK 3zBjhkngtMq1INr335+kuAUTIds01uNWbt/6XMLFK8YRPb1cNOahcH7++ClJk+ZjnEz2 AXrF1xH1knYvTYdXjQL3/DuCcAlRjPoZr5KQtb2On8WsgXBQ3m+267UthLi0gZjt+0BS dWbg== Received: by 10.49.103.162 with SMTP id fx2mr22148663qeb.1.1352507873944; Fri, 09 Nov 2012 16:37:53 -0800 (PST) MIME-Version: 1.0 Received: by 10.49.105.168 with HTTP; Fri, 9 Nov 2012 16:37:33 -0800 (PST) In-Reply-To: <7C09B40B9AF7446190E18140406F076D@JackKrupansky> References: <1574581308.74146.1347527827817.JavaMail.jiratomcat@arcas> <7C09B40B9AF7446190E18140406F076D@JackKrupansky> From: Robert Muir Date: Fri, 9 Nov 2012 19:37:33 -0500 Message-ID: Subject: Re: FuzzyQuery vs SlowFuzsyQuery docs? -- was: Re: [jira] [Commented] (LUCENE-2667) Fix FuzzyQuery's defaults, so its fast. To: dev@lucene.apache.org Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org I'm -1 for having unscalable shit in lucene's core. This query should have never been added. I don't care if a few people complain because they aren't using lowercasefilter or some other insanity. Fix your analysis chain. I don't have any sympathy. On Fri, Nov 9, 2012 at 7:35 PM, Jack Krupansky wrote: > +1 for permitting a choice of fuzzy query implementation. > > I agree that we want a super-fast fuzzy query for simple variations, but I > also agree that we should have the option to trade off speed for function. > > But I am also sympathetic to assuring that any core Lucene features be as > performant as possible. > > Ultimately, if there was a single fuzzy query implementation that did > everything for everybody all of the time, that would be the way to go, but > if choices need to be made to satisfy competing goals, we should support > going that route. > > -- Jack Krupansky > > From: Mark Bennett > Sent: Friday, November 09, 2012 3:48 PM > To: dev@lucene.apache.org > Subject: Re: FuzzyQuery vs SlowFuzsyQuery docs? -- was: Re: [jira] > [Commented] (LUCENE-2667) Fix FuzzyQuery's defaults, so its fast. > > Hi Robert, > > On Thu, Sep 13, 2012 at 7:39 PM, Robert Muir wrote: >> >> ... >> ... I'm strongly against having this >> unscalable garbage in lucene's core. >> >> There is no use case for ed > 2, thats just crazy. > > > I promise you there ARE use cases for edit distances > 2, especially with > longer words. Due to NDA I can't go into details. > > Also ed>2 can be useful when COMBINING that low-quality part of the search > with other sub-queries, or additional business rules. Maybe instead of > boiling an ocean this lets you just boil the sea. ;-) > > I won't comment on the quality of the older Levenstein code, or the likely > very slow performance, nor where the code should live, etc. > > But your statement about "no use case for ed > 2" is simply not true. > (whether you'd agree with any of them or not is certainly another matter) > > I understand your concerns about not having it be the default. (or maybe > having a giant warning message or something, whatever) > >> -- >> lucidworks.com >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org >> For additional commands, e-mail: dev-help@lucene.apache.org >> > --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org