Subject: Term offsets for highlighting
Date: Fri, 16 Apr 2010 12:49:41 -0300
From: "Stephen Greene"
Reply-To: java-user@lucene.apache.org

Hello,

I am trying to determine begin and end offsets for terms and phrases matching a query.
Is there a way to do this using either the highlighter or the fast vector highlighter in contrib? I have already tried extending the highlighter; that approach matched individual terms but did not match phrases.

The reason I am attempting this is that the document content is stored in a separate database. I need a list of the begin and end offsets for all matches so that I can highlight the text coming from that database.

Thank you in advance for your assistance.

Best regards,

Steve
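
For illustration, here is a minimal sketch of how such a list of offsets might be applied to text fetched from the separate database. It assumes the offsets are already available as [start, end) character positions into the stored text. The class `OffsetHighlighter`, its `highlight` method, and the `<b>` markers are hypothetical and are not part of Lucene or its contrib highlighters.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical helper, not part of Lucene: applies externally obtained
// match offsets to text that lives outside the index.
final class OffsetHighlighter {

    // Wraps each [start, end) span of `text` in the given pre/post markers.
    // Spans may arrive in any order; overlapping spans are trimmed so that
    // no character is wrapped twice.
    static String highlight(String text, List<int[]> spans, String pre, String post) {
        List<int[]> sorted = new ArrayList<>(spans);
        sorted.sort(Comparator.comparingInt((int[] s) -> s[0]));

        StringBuilder out = new StringBuilder();
        int cursor = 0; // first character not yet copied to the output
        for (int[] span : sorted) {
            int start = Math.max(span[0], cursor);
            int end = Math.min(span[1], text.length());
            if (start >= end) {
                continue; // empty, out of range, or already covered
            }
            out.append(text, cursor, start); // unhighlighted gap
            out.append(pre).append(text, start, end).append(post);
            cursor = end;
        }
        out.append(text, cursor, text.length()); // trailing text
        return out.toString();
    }

    public static void main(String[] args) {
        // Assumed input: text from the external database plus match offsets.
        String text = "the quick brown fox";
        List<int[]> spans = Arrays.asList(new int[] {4, 9}, new int[] {10, 15});
        System.out.println(highlight(text, spans, "<b>", "</b>"));
    }
}
```

The sketch only covers the final step. It works only if the offsets refer to exactly the same character sequence that was analyzed at index time, so the text stored in the database must match the indexed text character for character.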