lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael McCandless <luc...@mikemccandless.com>
Subject Re: IndexWriter.deleteDocuments(Query query)
Date Wed, 01 Apr 2009 21:29:10 GMT
On Wed, Apr 1, 2009 at 5:22 PM, John Wang <john.wang@gmail.com> wrote:
> Hi Michael:
>
>    1) Yes, we use TermDocs, exactly what IndexWriter.deleteDocuments(Term)
> is doing under the cover.

This part I understand :)

>    2) We iterate the docid->uid mapping, for each docid, get the
> corresponding ui and check that to see if that is in the deleted set. If so,
> add the docid to the list. There is no uid->docid lookup needed.

Does this mean you iterate all docs in the index, and only when you
come across a UID that's deleted, you add to deleted set?

Do you have a separate payload field per document?  (I'm still unclear
how you use payloads to encode the full docID -> UID map).

>      However, in our sharded architecture, we partition by continuous uids,
> in which case we keep both mappings since we know the range of the the uid.
> In which case, uid->docid mapping is available.

OK

Mike

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message