Return-Path: Delivered-To: apmail-jakarta-lucene-dev-archive@www.apache.org Received: (qmail 95098 invoked from network); 2 Nov 2004 05:03:01 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 2 Nov 2004 05:03:01 -0000 Received: (qmail 31552 invoked by uid 500); 2 Nov 2004 05:02:58 -0000 Delivered-To: apmail-jakarta-lucene-dev-archive@jakarta.apache.org Received: (qmail 31522 invoked by uid 500); 2 Nov 2004 05:02:57 -0000 Mailing-List: contact lucene-dev-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Developers List" Reply-To: "Lucene Developers List" Delivered-To: mailing list lucene-dev@jakarta.apache.org Received: (qmail 31508 invoked by uid 99); 2 Nov 2004 05:02:57 -0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: neutral (hermes.apache.org: local policy) Received: from [195.92.193.19] (HELO cmailm3.svr.pol.co.uk) (195.92.193.19) by apache.org (qpsmtpd/0.28) with ESMTP; Mon, 01 Nov 2004 21:02:55 -0800 Received: from modem-2032.dasyure.dialup.pol.co.uk ([81.78.55.240]) by cmailm3.svr.pol.co.uk with esmtp (Exim 4.41) id 1COqok-0001vT-RU for lucene-dev@jakarta.apache.org; Tue, 02 Nov 2004 05:02:52 +0000 Message-ID: <418714F9.2080704@open.ac.uk> Date: Tue, 02 Nov 2004 05:02:49 +0000 From: Murray Altheim Organization: Knowledge Media Institute User-Agent: Mozilla Thunderbird 0.7.3 (X11/20040803) X-Accept-Language: en-us, en MIME-Version: 1.0 To: Lucene Developers List Subject: Question on showing excerpts References: <20041029140952.86849.qmail@web12702.mail.yahoo.com> <6CDD372A-29B6-11D9-8E51-000A95BC61B6@ehatchersolutions.com> <41825612.5080302@open.ac.uk> In-Reply-To: <41825612.5080302@open.ac.uk> Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N [I'm gathering that the consensus on the PorterStemmer is to deprecate the existing one in favour of using Snowball, so I've dropped the issue.] Another questio: the Lucene FAQ includes this question: 35. How can I show excerpts with the hit results? How about highlighting the matched words? http://lucene.sourceforge.net/cgi-bin/faq/faqmanager.cgi?file=chapter.search&toc=faq#q35 I'm interested in showing excerpts if the amount of effort isn't enormous, and while I understand that for each document type the results will be different, what I'm wondering is how to locate the offsets within each search result that indicate the locations of each hit within the searched document, so that I won't have to duplicate Lucene's existing efforts in creating the excerpt. Where might I find in the Lucene API or code the hooks I need? Is this information readily available, or is it buried within the engine or the index? Sorry if this is obvious -- I couldn't locate it. And thanks very much for any assistance (and no rush at all...). Murray ...................................................................... Murray Altheim http://kmi.open.ac.uk/people/murray/ Knowledge Media Institute The Open University, Milton Keynes, Bucks, MK7 6AA, UK . The Rise of Pseudo Fascism -- David Neiwert Part 1: The Morphing of the Conservative Movement http://dneiwert.blogspot.com/2004_09_19_dneiwert_archive.html#109028353137888956 Part 2: The Architecture of Fascism http://dneiwert.blogspot.com/2004_09_26_dneiwert_archive.html#109563628314780505 Part 3: The Pseudo-Fascist Campaign http://dneiwert.blogspot.com/2004_10_03_dneiwert_archive.html#109596147171278590 Part 4: The Apocalyptic One-Party State http://dneiwert.blogspot.com/2004_10_10_dneiwert_archive.html#109694976530359103 Part 5: Warfare By Other Means http://dneiwert.blogspot.com/2004_10_17_dneiwert_archive.html#109755467135245579 Part 6: Breaking Down the Barriers http://dneiwert.blogspot.com/2004_10_24_dneiwert_archive.html#109858062597237163 Part 7: It Can Happen Here http://dneiwert.blogspot.com/2004_10_31_dneiwert_archive.html#109902109250035295 --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-dev-help@jakarta.apache.org