lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael J. Prichard" <>
Subject Re: highlighting and fragments
Date Fri, 21 Sep 2007 20:10:28 GMT
One index is around 3,000,000 items.  I think around 10 fields.  I store 
some and don't others.  I index email content and attachment content.  I 
store some smaller fields and not the content fields.  That current 
index is around 10GB but that is nothing that is about to come down the 
pike.  Maybe looking at indexes 10x that.



Erick Erickson wrote:
> Out of curiosity, how big is huge? And how many documents and
> fields?
> And a silly question, are you storing your fields or not (i.e.
> Field.Store.NO????
> Erick
> On 9/20/07, Michael J. Prichard <> wrote:
>> Hello Folks,
>> I wanted to stay away from storing text in the indexes in order to keep
>> them smaller.  I have a requirement now though to provide highlighting
>> and, more so, fragments of the content so they will be displayed on the
>> UI.
>> Do you all prefer to store the text in the index to make this easier or
>> would you suggest retrieving the text from the source after doing your
>> search.  From I can tell you need to run through the Hits anyway....
>> I am trying to keep the indexes as small as possible (they are still
>> HUGE...but...) so storing fields is not really what I want to do.  I
>> will if it is the best and most efficient way to do so.
>> Thanks,
>> Michael
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail:
>> For additional commands, e-mail:

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message