lucene-java-user mailing list archives

From Doug Cutting <cutt...@apache.org>
Subject Re: Best Practices for Distributing Lucene Indexing and Searching
Date Wed, 09 Mar 2005 20:14:07 GMT
Yonik Seeley wrote:
> I'm trying to support an interface where documents can be added one at
> a time at a high rate (via HTTP POST).  You don't know all of the
> documents ahead of time, so you can't delete them all ahead of time.

A simple solution is to queue documents as they're posted.  When either 
(a) the queue has 1000 documents or (b) one minute has passed, process 
the queue.  First delete each document in the queue by its unique id, 
then add them.  Reset the queue and repeat.
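The queue-and-flush loop above could be sketched roughly as follows. This is only an illustration, not code from the thread: `IndexSink` and `BatchingQueue` are hypothetical names, and `IndexSink` is a stand-in for whatever delete-by-id and add calls your index exposes (it is not a Lucene API). Two further assumptions: a document posted twice within one batch keeps only its latest version, and the one-minute timer is measured from the first document queued since the last flush. The caller passes in the current time, so some timer thread would need to call `tick` periodically.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for the index; replace with real delete/add calls.
interface IndexSink {
    void deleteById(String id);
    void add(String id, String body);
}

// Buffers posted documents and flushes them in batches: delete all, then add all.
class BatchingQueue {
    private final IndexSink sink;
    private final int maxDocs;        // e.g. 1000
    private final long maxAgeMillis;  // e.g. 60000 for one minute
    // Keyed by unique id so a re-posted document replaces its queued version
    // (an assumption; the original message does not discuss duplicates).
    private final Map<String, String> pending = new LinkedHashMap<>();
    private long oldestMillis;

    BatchingQueue(IndexSink sink, int maxDocs, long maxAgeMillis) {
        this.sink = sink;
        this.maxDocs = maxDocs;
        this.maxAgeMillis = maxAgeMillis;
    }

    // Queue one document; flushes if the size or age limit has been reached.
    synchronized void post(String id, String body, long nowMillis) {
        if (pending.isEmpty()) {
            oldestMillis = nowMillis;
        }
        pending.remove(id);      // re-insert so the latest version goes last
        pending.put(id, body);
        flushIfDue(nowMillis);
    }

    // Call periodically (e.g. from a timer) so a quiet queue still gets flushed.
    synchronized void tick(long nowMillis) {
        flushIfDue(nowMillis);
    }

    private void flushIfDue(long nowMillis) {
        if (pending.isEmpty()) {
            return;
        }
        if (pending.size() >= maxDocs || nowMillis - oldestMillis >= maxAgeMillis) {
            flush();
        }
    }

    // First delete every queued document by id, then add them, then reset.
    synchronized void flush() {
        for (String id : pending.keySet()) {
            sink.deleteById(id);
        }
        for (Map.Entry<String, String> e : pending.entrySet()) {
            sink.add(e.getKey(), e.getValue());
        }
        pending.clear();
    }
}
```

Doing all deletes before all adds keeps the two phases separate, which matters if deleting and adding go through different objects that cannot be open at the same time.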

> Big Big Question:
> Will a term enumerator enumerate in the order documents were added to
> the index (for a single term of say id:a)?

Yes.

> Another Big Big Question:
> If the former idea doesn't work, can we depend on the ordering of the
> docids?  Will docs added later always have higher internal docids than
> ones added earlier?

Yes.

Doug


