Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 07BB911ABE for ; Mon, 22 Sep 2014 13:08:21 +0000 (UTC) Received: (qmail 32224 invoked by uid 500); 22 Sep 2014 13:08:15 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 32149 invoked by uid 500); 22 Sep 2014 13:08:15 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 32136 invoked by uid 99); 22 Sep 2014 13:08:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 22 Sep 2014 13:08:14 +0000 X-ASF-Spam-Status: No, hits=3.6 required=5.0 tests=BUG6152_INVALID_DATE_TZ_ABSURD,HTML_MESSAGE,INVALID_DATE_TZ_ABSURD,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of nathaniel.rudavsky@gmail.com designates 74.125.82.50 as permitted sender) Received: from [74.125.82.50] (HELO mail-wg0-f50.google.com) (74.125.82.50) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 22 Sep 2014 13:08:08 +0000 Received: by mail-wg0-f50.google.com with SMTP id l18so1719850wgh.21 for ; Mon, 22 Sep 2014 06:07:47 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=date:from:subject:to:message-id:mime-version:content-type; bh=ba1HpMZGH4cRmmK0KxUUGSkGz7paYtm1KDXrdiLiDfs=; b=GcYphSIKeDKN/sARbvNM9ORKjObHO61dwp3AdJCCd4WoWvO3I2fWsdX5TJq/pT/gY5 6q89z9hVUA1Y8hmxQ8Ae0eyGc3Ox+4gWWVH3cbgn/coxUcCba9M9k/hnUciGEZIZOgfs yGrHI2Ig8vPnbX0VEDka5b2hQWTB4j8CgDSu4EuTGtKz08uVwt7nhyf3WACrHWW2l94h m/vxstPaNXZ8oqNnpNXoiPOgWDHmWDE7A/T4J+13mDhu1y1rw5ehHwc7Q/KJGkNTYdZn GqR9xy+E46e74To8O91PMYDRyy6tzHmEonWr2wohQoY9lpf/dody4dR9SvwOo9Yyj9BK YOPQ== X-Received: by 10.194.171.228 with SMTP id ax4mr20051721wjc.3.1411391267090; Mon, 22 Sep 2014 06:07:47 -0700 (PDT) Received: from [192.168.99.10] ([212.76.254.67]) by mx.google.com with ESMTPSA id y5sm12257940wje.32.2014.09.22.06.07.45 for (version=TLSv1.2 cipher=RC4-SHA bits=128/128); Mon, 22 Sep 2014 06:07:46 -0700 (PDT) Date: Mon, 22 Sep 2014 13:09:45 +0002 From: Nathaniel Rudavsky-Brody Subject: fuzzy terms, DirectSolrSpellChecker and alternativeTermCount To: solr-user@lucene.apache.org Message-Id: <1411391265.3344.6@smtp.gmail.com> X-Mailer: geary/0.6.0 MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="=-TkcvZiU+h3hnIyosqc/0" X-Virus-Checked: Checked by ClamAV on apache.org --=-TkcvZiU+h3hnIyosqc/0 Content-Type: text/plain; charset=utf-8; format=flowed Hello, I'm trying find the best way to "fake" the terms component for fuzzy queries. That is, I need the full set of index terms for each of the two queries "quidam~1" and "quidam~2". I tried defining two suggesters with FuzzyLookupFactory, with maxEdits=1 and 2 respectively, but the results for "quidam~1" include suffixes like "quodammodo", which makes sense for a suggester but isn't what I want here. Now I'm trying with the spell-checker. As far as I can see, IndexBasedSpellChecker doesn't let me set maxEdits, so I can't use it to distinguish between my two queries. DirectSolrSpellChecker seems like it should work, ie: fuzzy1 solr.DirectSolrSpellChecker 1 ... fuzzy2 solr.DirectSolrSpellChecker 2 ... However the parameter spellcheck.alternativeTermCount has no effect, so the query "spellcheck.q=quidam" gives no results, but "spellcheck.q=quiam" (which doesn't exist in the index) gives the expected terms. Am I missing something? Or is there a better way to do this? Many thanks for any help and ideas, Nathaniel --=-TkcvZiU+h3hnIyosqc/0--