From java-user-return-25906-apmail-lucene-java-user-archive=lucene.apache.org@lucene.apache.org Fri Feb 02 15:58:48 2007 Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 77177 invoked from network); 2 Feb 2007 15:58:48 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 2 Feb 2007 15:58:48 -0000 Received: (qmail 38258 invoked by uid 500); 2 Feb 2007 15:58:47 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 38231 invoked by uid 500); 2 Feb 2007 15:58:47 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 38220 invoked by uid 99); 2 Feb 2007 15:58:47 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Feb 2007 07:58:47 -0800 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: domain of markrmiller@gmail.com designates 66.249.82.226 as permitted sender) Received: from [66.249.82.226] (HELO wx-out-0506.google.com) (66.249.82.226) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Feb 2007 07:58:37 -0800 Received: by wx-out-0506.google.com with SMTP id i29so873433wxd for ; Fri, 02 Feb 2007 07:58:16 -0800 (PST) DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:user-agent:mime-version:to:subject:references:in-reply-to:content-type:content-transfer-encoding; b=rCy3P1vIxzDuG/iwvnolkxYhjt0VCL5cenxroBamei/+gHW6i98b7UrQiQxLM1KCNDsuk3cvDk4r5IxlDH6Ysys9tI/7d7S195zTxbusCRkYYLkrPXZ/brkddsWcifSp+3qpCO2OM9I38O8QQ+mPVdbpipEPygT6arh8CcfiazE= Received: by 10.90.89.5 with SMTP id m5mr5187176agb.1170431895593; Fri, 02 Feb 2007 07:58:15 -0800 (PST) Received: from ?192.168.1.103? ( [216.66.114.42]) by mx.google.com with ESMTP id 26sm8771335wrl.2007.02.02.07.58.14; Fri, 02 Feb 2007 07:58:14 -0800 (PST) Message-ID: <45C35F89.2040408@gmail.com> Date: Fri, 02 Feb 2007 10:58:01 -0500 From: Mark Miller User-Agent: Thunderbird 1.5.0.9 (Windows/20061207) MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: Re: Multiword Highlighting References: <574156.14791.qm@web50307.mail.yahoo.com> In-Reply-To: <574156.14791.qm@web50307.mail.yahoo.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org I have been away from this for a week, but my interest has started building again. The whole spans implementation seems to work great for finding the actual hits but there is a somewhat annoying limitation: because I am using Spans it seems I can only either highlight the entire found span or just the first and last token of the found span. First and last token works great for any span involving two query tokens (the only type I am concerned with at the moment), but a 3 word span would not have the middle word highlighted (unless you highlight the whole darn span). Other than that, the implementation is pretty darn simple and seems to work well. It wouldn't be too hard to set the option of complete span highlighting or first and last token. Still interested in considering this for Contrib? Perhaps you want to wait for someone to merge the idea with the current Contrib highlighter (add fragments) as Mark H. suggested in his last email on the subject. Or there just may not be much interest -- the other recent highlighters haven't really gone anywhere that I have seen (though I don't think they attempted 'actual' hit highlighting). If there is interest, suggested package name? Otis Gospodnetic wrote: > For what it's worth Mark (Miller), there *is* a need for "just highlight the query terms without trying to get excerpts" functionality - something a la Google cache (different colours...mmm, nice). I've had people ask me for this before, and I know I could use this functionality, too. Please contrib to contrib/ if you end up working on this. > > Otis > -- > Simpy -- http://www.simpy.com/ -- Tag. Search. Share. > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org