lucene-java-user mailing list archives

Subject Re: Regarding Lucene Highlighting feature.
Date Wed, 10 Jul 2013 11:29:39 GMT
Hi Roberto,

Thanks for the reply.

My actual use case is to highlight the first occurrence of the search word
in the sentence in which it occurs.
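
A minimal sketch of that use case in plain Java, outside Lucene. It assumes
the field text is available as a String, uses a plain case-insensitive
substring match instead of an analyzer, and relies on the JDK's
java.text.BreakIterator for sentence boundaries. The class name
FirstHitHighlighter and the <b> tags are illustrative choices, not part of
any Lucene API.

```java
import java.text.BreakIterator;
import java.util.Locale;

final class FirstHitHighlighter {

    /**
     * Returns the sentence that contains the first case-insensitive
     * occurrence of term, with that occurrence wrapped in <b>...</b>.
     * Returns null when the term is empty or does not occur in the text.
     */
    static String highlightFirst(String text, String term) {
        if (term.isEmpty()) {
            return null;
        }
        // Find the first occurrence without lower-casing the whole text,
        // so the offsets always refer to the original string.
        int hit = -1;
        for (int i = 0; i + term.length() <= text.length(); i++) {
            if (text.regionMatches(true, i, term, 0, term.length())) {
                hit = i;
                break;
            }
        }
        if (hit < 0) {
            return null;
        }
        // Locate the boundaries of the sentence that contains the hit.
        BreakIterator sentences = BreakIterator.getSentenceInstance(Locale.ENGLISH);
        sentences.setText(text);
        int end = sentences.following(hit);   // first boundary after the hit
        int start = sentences.previous();     // boundary just before that one
        int hitEnd = hit + term.length();
        if (end == BreakIterator.DONE || end < hitEnd) {
            end = text.length();              // defensive fallback
        }
        String marked = text.substring(start, hit)
                + "<b>" + text.substring(hit, hitEnd) + "</b>"
                + text.substring(hitEnd, end);
        return marked.trim();
    }

    public static void main(String[] args) {
        String text = "Lucene is a search library. "
                + "Highlighting marks the search term. The end.";
        System.out.println(highlightFirst(text, "highlighting"));
    }
}
```

A substring match like this will also hit inside longer words; a real
setup would match on analyzed tokens and their offsets instead.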

In my case, I do not have access to the original documents.

I am looking for the best way to reduce the index disk space.

I tried SimpleHighlighter and FastVectorHighlighter, both of which increase
the index size considerably, since I need to use Store.YES while indexing.

I read that PostingsHighlighter uses compression when storing and that it
may help reduce the index size.

Has anyone tried using PostingsHighlighter?

Also, are there any other efficient compression techniques I can apply that
reduce disk space without increasing search time?
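
To make the space/time trade-off concrete, here is a rough sketch of
compressing a field value before it is stored and expanding it again at
highlight time. It uses only java.util.zip from the JDK; the class name
FieldCompression is hypothetical and this is not Lucene's own stored-field
compression. Decompression costs CPU only for the documents that are
actually fetched for highlighting, not for the search itself.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

final class FieldCompression {

    /** Compresses a field value with DEFLATE at the highest compression level. */
    static byte[] compress(String value) {
        byte[] input = value.getBytes(StandardCharsets.UTF_8);
        Deflater deflater = new Deflater(Deflater.BEST_COMPRESSION);
        deflater.setInput(input);
        deflater.finish();
        ByteArrayOutputStream out = new ByteArrayOutputStream(input.length);
        byte[] buf = new byte[4096];
        while (!deflater.finished()) {
            int n = deflater.deflate(buf);
            out.write(buf, 0, n);
        }
        deflater.end();
        return out.toByteArray();
    }

    /** Restores the original field value from bytes produced by compress(). */
    static String decompress(byte[] data) {
        Inflater inflater = new Inflater();
        inflater.setInput(data);
        ByteArrayOutputStream out = new ByteArrayOutputStream(data.length * 4);
        byte[] buf = new byte[4096];
        try {
            while (!inflater.finished()) {
                int n = inflater.inflate(buf);
                // No progress and nothing left to read means the data is incomplete.
                if (n == 0 && (inflater.needsInput() || inflater.needsDictionary())) {
                    throw new IllegalArgumentException("truncated or unsupported data");
                }
                out.write(buf, 0, n);
            }
        } catch (DataFormatException e) {
            throw new IllegalArgumentException("not valid DEFLATE data", e);
        } finally {
            inflater.end();
        }
        return new String(out.toByteArray(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 200; i++) {
            sb.append("Highlighting marks the search term. ");
        }
        byte[] packed = compress(sb.toString());
        System.out.println(sb.length() + " chars -> " + packed.length + " bytes");
    }
}
```

The resulting byte array could be kept in a binary stored field; how much
it saves depends entirely on how repetitive the text is.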

Thanks and Regards
Vignesh Srinivasan

On Fri, Jul 5, 2013 at 7:03 PM, Roberto Ragusa wrote:

> On 07/05/2013 01:27 PM, VIGNESH S wrote:
> > Hi,
> >
> > I think using CompressingStoredFieldsFormat Feature introduced in Lucene
> > 4.1 may help reduce index size.
> >
> > Any other comments and suggestions are welcome in this topic..
> >
> Do you have access to the original documents, outside Lucene?
> If so, you can avoid storing anything.
> When you want to highlight, you read the document again, build a
> new index (in RAM, with stored=true), do the search again (in an index
> with only one document), extract highlights, destroy the index.
> I've done that in the past; it works beautifully.
> And the performance is not bad at all.
> Well, I actually do this for each _field_ in a document I want to highlight
> (for reasons I will not go into here).
> Best regards.
> --
>    Roberto Ragusa    mail at

