Return-Path: Delivered-To: apmail-lucene-solr-user-archive@locus.apache.org Received: (qmail 59926 invoked from network); 30 May 2008 21:34:51 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 30 May 2008 21:34:51 -0000 Received: (qmail 37638 invoked by uid 500); 30 May 2008 21:34:50 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 37613 invoked by uid 500); 30 May 2008 21:34:50 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 37602 invoked by uid 99); 30 May 2008 21:34:50 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 30 May 2008 14:34:50 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of mike.klaas@gmail.com designates 209.85.200.168 as permitted sender) Received: from [209.85.200.168] (HELO wf-out-1314.google.com) (209.85.200.168) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 30 May 2008 21:34:01 +0000 Received: by wf-out-1314.google.com with SMTP id 28so42347wfc.20 for ; Fri, 30 May 2008 14:34:17 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:from:to:in-reply-to:content-type:content-transfer-encoding:mime-version:subject:date:references:x-mailer; bh=yRBahsC5OV0laC+ea5ZSMDoYq3bl+wrUtivl0PgFz/Y=; b=LXUYbTqo+S9rW3enVN0xWFSbr04yUoMHyRX00CAW5CnpTr7cfYbB8y4XvYG6wyqkmC099BZYObbxEJ7HAkwv+YsN58kwZ4kZ3peIH64K5i1FwSVZU91MIyvLt4DSD8Bcrxj/kpcl1XIPqfjmyMJ4X4CTTArS/PPSsF4lBv9WGy0= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:from:to:in-reply-to:content-type:content-transfer-encoding:mime-version:subject:date:references:x-mailer; b=ZIGI7tijKTpITeC0uOK92TtSE1M1F2g1qhjlGvzNUINuLP2bjU/KCGyJrDrqc677sKCyrdNTqtaumkvw8b239m1SUC2rqmRzwiBdliamxZgjHaBcT5IyOtmAyHZyBoa7oSJBQ9GkWG+X+RvJFTG6OtYmDYqjse3WtmODpn1fbVs= Received: by 10.142.136.16 with SMTP id j16mr338857wfd.292.1212183257830; Fri, 30 May 2008 14:34:17 -0700 (PDT) Received: from ?192.168.1.120? ( [24.86.255.85]) by mx.google.com with ESMTPS id 22sm366069wfi.14.2008.05.30.14.34.16 (version=TLSv1/SSLv3 cipher=RC4-MD5); Fri, 30 May 2008 14:34:17 -0700 (PDT) Message-Id: <4370D9B7-E96A-4FB7-84A2-DAC80D1B4992@gmail.com> From: Mike Klaas To: solr-user@lucene.apache.org In-Reply-To: <4BCE10CFEBFE2E4F8FD2B77E2E8E5042042690EA4C@mse19be2.mse19.exchange.ms> Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Mime-Version: 1.0 (Apple Message framework v919.2) Subject: Re: highlighting and hyperlink Date: Fri, 30 May 2008 14:34:13 -0700 References: <4BCE10CFEBFE2E4F8FD2B77E2E8E5042042690EA4C@mse19be2.mse19.exchange.ms> X-Mailer: Apple Mail (2.919.2) X-Virus-Checked: Checked by ClamAV on apache.org On 30-May-08, at 2:25 PM, Kevin Xiao wrote: > Hi > > I am not sure if there are any discussions about this, I could not > find the search function in mailing list archives. :) Anyway, here > is my problem: > > In my document, I have a hyperlink, say, breast cancer, but when I applied solr > highlighting on search term 'cancer', that hyperlink becomes: href="../home/home.nb?q=breast+cancer span>">breast cancer. > Obviously I don't want highlighting the first cancer (in red). > > Is there a flag to turn that off, or I have to write something > myself without using solr highlighting feature? No, Solr has no idea that you are highlighting html text. The best thing to do would be to use a tokenizer that doesn't generate terms for urls inside the href of anchor tags (this will also produce the nice result of not matching keywords inside hidden urls). -Mike