Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4A2B9F2AE for ; Fri, 5 Jul 2013 13:34:29 +0000 (UTC) Received: (qmail 77769 invoked by uid 500); 5 Jul 2013 13:34:25 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 77702 invoked by uid 500); 5 Jul 2013 13:34:23 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 77690 invoked by uid 99); 5 Jul 2013 13:34:22 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 05 Jul 2013 13:34:22 +0000 X-ASF-Spam-Status: No, hits=0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE X-Spam-Check-By: apache.org Received-SPF: error (athena.apache.org: local policy) Received: from [88.149.128.104] (HELO smtpi4.ngi.it) (88.149.128.104) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 05 Jul 2013 13:34:17 +0000 Received: from [127.0.0.1] (81-174-56-138.v4.ngi.it [81.174.56.138]) by smtpi4.ngi.it (Postfix) with ESMTP id 0503642775 for ; Fri, 5 Jul 2013 15:33:32 +0200 (CEST) Message-ID: <51D6CB2C.30201@robertoragusa.it> Date: Fri, 05 Jul 2013 15:33:32 +0200 From: Roberto Ragusa User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130514 Thunderbird/17.0.6 MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: Re: Regarding Lucene Highlighting feature. References: In-Reply-To: X-Enigmail-Version: 1.5.1 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org On 07/05/2013 01:27 PM, VIGNESH S wrote: > Hi, > > I think using CompressingStoredFieldsFormat Feature introduced in Lucene > 4.1 may help reduce index size. > > Any other comments and suggestions are welcome in this topic.. > Do you have access to the original documents, outside Lucene? If so, you can avoid storing anything. When you want to highlight, you read the document again, build a new index (in RAM, with stored=true), do the search again (in an index with only one document), extract highlights, destroy the index. I've done that in the past; it works beautifully. And the performance is not bad at all. Well, I actually do this for each _field_ in a document I want to highlight (for reasons I will not go to explain). Best regards. -- Roberto Ragusa mail at robertoragusa.it --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org