lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael J. Prichard" <>
Subject Re: Seeking Advice
Date Thu, 16 Aug 2007 01:01:37 GMT
I actually know from experience.  Around 20% +/- 5% of emails will have 
attachments.  If that helps.  Again, I say index as much info as you 
can.  Store what you think it necessary.

Erick Erickson wrote:
> Rather than use efficiency arguments to drive the behavior of the
> app, I'd recommend that you define the expected behavior and
> make that behavior happen as necessary.
> What would you estimate is the ratio of meta-data to attachments?
> And what is the ratio of documents that have multiple attachments?
> I actually suspect that the number of e-mails that have multiple
> attachments is small enough that storing the meta-data with each
> document would result in a minuscule size increase,
> but you'll only find that by gathering some statistics <G>....
> Erick

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message