lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rob Staveley (Tom)" <rstave...@seseit.com>
Subject RE: Date ranges - getting the approach right
Date Thu, 20 Jul 2006 13:17:14 GMT
Sorry for the delayed response. It takes me a while to get my head around
Lucene.

I've got parallel indexes, which means that chorological ordering by  doc ID
would need to be a bit more sophisticated. It strikes me that there must be
some performance advantage doing it though.

I'll see if I can get good enough performance without it, though.

-----Original Message-----
From: Erick Erickson [mailto:erickerickson@gmail.com] 
Sent: 16 July 2006 15:03
To: java-user@lucene.apache.org
Subject: Re: Date ranges - getting the approach right

Thanks for the clarification. Let me re-state this and see if I got it
right.

1> if you never do any deletions (or recalculate your "special records"
after deletion/optimization), this could work as-is.

2> the safe way to do this would be to find the miniminum doc ID for the
start date, the maximum doc ID for the end date and make the filter by
flipping all the bits in the filter in between. Assuming that you indexed in
date-sorted order in the first place. There really can't be anything in the
system to do anything like this for you since it relies on the meta-data
that the mails were indexed in some specific order.

I actually like the second, it's less prone for getting out of whack.....

Thanks for the
Erick

Mime
View raw message