Return-Path: X-Original-To: apmail-incubator-lucy-user-archive@www.apache.org Delivered-To: apmail-incubator-lucy-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9DE5390BD for ; Mon, 28 Nov 2011 08:08:09 +0000 (UTC) Received: (qmail 23308 invoked by uid 500); 28 Nov 2011 08:08:08 -0000 Delivered-To: apmail-incubator-lucy-user-archive@incubator.apache.org Received: (qmail 23281 invoked by uid 500); 28 Nov 2011 08:08:07 -0000 Mailing-List: contact lucy-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: lucy-user@incubator.apache.org Delivered-To: mailing list lucy-user@incubator.apache.org Received: (qmail 23273 invoked by uid 99); 28 Nov 2011 08:08:05 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 28 Nov 2011 08:08:05 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of gorankent@gmail.com designates 209.85.212.47 as permitted sender) Received: from [209.85.212.47] (HELO mail-vw0-f47.google.com) (209.85.212.47) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 28 Nov 2011 08:07:59 +0000 Received: by vbbfc21 with SMTP id fc21so3006779vbb.6 for ; Mon, 28 Nov 2011 00:07:38 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; bh=+aEdVUwQe7R2hjiTtDp9fI1XFjfw/JYS1mSamU0OVvU=; b=giE+J1+RChX9SRtijGkiGaZk43r+ivSOsQSBLkoDHRmlk3ZX/2yI4gAtxv8Ns2Lr8a iyXVrgRE9U0iRh/7ASMZi5lFn8wnPBoefmorbA57F7vBv179c38Zd+ulEbVi3vrdlMu6 QWKRuqJGjbofh4Q34memBt0WkGJfwNY3SEQGU= MIME-Version: 1.0 Received: by 10.52.173.113 with SMTP id bj17mr41904430vdc.98.1322467658591; Mon, 28 Nov 2011 00:07:38 -0800 (PST) Received: by 10.52.188.10 with HTTP; Mon, 28 Nov 2011 00:07:38 -0800 (PST) In-Reply-To: <20111127173416.GA11639@rectangular.com> References: <20111126183422.GB22818@rectangular.com> <20111127173416.GA11639@rectangular.com> Date: Mon, 28 Nov 2011 10:07:38 +0200 Message-ID: From: goran kent To: lucy-user@incubator.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Subject: Re: [lucy-user] ClusterSearcher and excerpt/highlighting On Sun, Nov 27, 2011 at 7:34 PM, Marvin Humphrey w= rote: > This should be solved with r1206768. > > =A0 =A0http://svn.apache.org/viewvc?view=3Drevision&revision=3D1206768 > > That symptom results from a mismatch between the fetched doc and the fetc= hed > highlighting data (called the "DocVector"). =A0It was happening because t= he doc > id for the fetched hit was not being adjusted by ClusterSearcher. Excellent! Thank you very much, that looks really great - highlighting is now working as expected and the results look compelling. Minor point: I've noticed when searching for [toddler swimming lessons], the following are highlighted in their entirety: toddler to do swimming lessons toddlers to swimming lessons as well as a slew of *correct* highlighting all over the show - which is -great-. It looks like the highlighting algo is ignoring stopwords (in effect highlighting them as well) - frankly, I don't give a toss about it. The results look amazing and I'm happy. All hail the CreamyGoodness! --=20 Regards, gk