Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 57911 invoked from network); 20 Sep 2007 22:16:27 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 20 Sep 2007 22:16:27 -0000 Received: (qmail 10312 invoked by uid 500); 20 Sep 2007 22:16:13 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 10274 invoked by uid 500); 20 Sep 2007 22:16:12 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 10263 invoked by uid 99); 20 Sep 2007 22:16:12 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 Sep 2007 15:16:12 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of markrmiller@gmail.com designates 64.233.162.234 as permitted sender) Received: from [64.233.162.234] (HELO nz-out-0506.google.com) (64.233.162.234) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 Sep 2007 22:18:13 +0000 Received: by nz-out-0506.google.com with SMTP id i28so506014nzi for ; Thu, 20 Sep 2007 15:15:50 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:user-agent:mime-version:to:subject:references:in-reply-to:content-type:content-transfer-encoding; bh=P2QZ833BqQ4cPMIzn5AD8+fRhs68eP4svYaKdFEFOus=; b=O9AOQNalrtrWvW/dzJbIciCbgqiGRGtAzBFntM52EvuH/4CdP9pSXbXPonYan9ynDSMx8Kehjjj5vyc7hTyGRL/wHb+At1rJVTqb2Kl4krKActbtwdNt2XfT5uoB7bDI5FiOhiWM4+PcBB4tkhOFW/zywBANN753a7YrVauE8kM= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:user-agent:mime-version:to:subject:references:in-reply-to:content-type:content-transfer-encoding; b=XaLFDKyhTsZZVtqRmdCd6ea4GiAdNSY5JbDh9pUIcXUqH+PIDGAQrZNk7oLKGcFS5bCBpmFnHdYU4/UmvSUnver1nO1yTSmPRuGf4jxdja4iE6HNPhYIYNn4KwkflXrcak5+Gzv69DLz7nWvjQEMp2k3rpqcqi55vtxSJRQ/d2Y= Received: by 10.65.53.3 with SMTP id f3mr5164831qbk.1190326549654; Thu, 20 Sep 2007 15:15:49 -0700 (PDT) Received: from ?192.168.1.108? ( [69.124.234.183]) by mx.google.com with ESMTPS id e14sm25737qba.2007.09.20.15.15.48 (version=SSLv3 cipher=RC4-MD5); Thu, 20 Sep 2007 15:15:48 -0700 (PDT) Message-ID: <46F2F0FF.7070400@gmail.com> Date: Thu, 20 Sep 2007 18:15:27 -0400 From: Mark Miller User-Agent: Thunderbird 2.0.0.6 (Windows/20070728) MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: Re: highlighting and fragments References: <46F2E853.50800@mac.com> In-Reply-To: <46F2E853.50800@mac.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Lucene's storing functionality is just a simple storage mechanism. You can certainly and easily use your own storage mechanism. When you get your user created id back from Lucene due to a hit, just pass that id to your storage system to get the original text and then feed that to the Highlighter. Your storage system/code might be slower than Lucene, but I don't believe there is anything about Lucene's system that would give it an inside advantage. - Mark Michael J. Prichard wrote: > Hello Folks, > > I wanted to stay away from storing text in the indexes in order to > keep them smaller. I have a requirement now though to provide > highlighting and, more so, fragments of the content so they will be > displayed on the UI. > > Do you all prefer to store the text in the index to make this easier > or would you suggest retrieving the text from the source after doing > your search. From I can tell you need to run through the Hits anyway.... > > I am trying to keep the indexes as small as possible (they are still > HUGE...but...) so storing fields is not really what I want to do. I > will if it is the best and most efficient way to do so. > > Thanks, > Michael > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org