lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Giulio Cesare Solaroli <giulio.ces...@gmail.com>
Subject Re: one huge index or many small ones?
Date Thu, 04 Nov 2004 16:26:21 GMT
Hi Javier,

I suggest you to build a single index, with all the information you
need to find the right mail you are looking for. You than can use
Lucene alone to find you messages.

Giulio Cesare


On Thu, 4 Nov 2004 17:00:35 +0100, javier muguruza <jmuguruza@gmail.com> wrote:
> Hi,
> 
> We are going to move from a just-in-time perl based search to using
> lucene in our project. I have to index emails (bodies and also
> attachements). I keep in the filesystem all the bodies and attachments
> for a long period of time. I have to find emails that fullfil certain
> conditions, some of the conditions are take care of at a different
> level, so in the end I have a SUBSET of emails I have to run through
> lucene.
> 
> I was assuming that the best way would be to create an index for each
> email. Having an unique index for a group of emails (say a day worth
> of email) seems too coarse grained, imagine a day has 10000 emails,
> and some queries will like to look in only a handful of the
> emails...But the problem with having one index per emails is the
> massive number of emails...imagine having 100000 indexes
> 
> Anyway, any idea about that? I just wanted to check wether someones
> feels I am wrong.
> 
> Thanks
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
> 
>

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message