lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doron Cohen <>
Subject Re: modify existing non-indexed field
Date Fri, 07 Jul 2006 22:52:22 GMT
> dan2000 <> wrote on 07/07/2006 15:03:35:
> but if you remove it and add it again, you'll need to re-index it again.
> don't you? When you do re-index, you'll have to close the reader, which
> would pause the search. Any better way of doint it?

INHO yes and no -

There's no need to close the searcher (or the reader associated with the
searcher) in order to add that doc again, however it is required to close
the reader that is used for deleting a document in order to open a writer
for adding the updated document. Also, in order for this document to become
visible to search, a new searcher must be opened. This does not necessarily
means "pausing the search" - it is possible to open a new searcher in the
background (as done in Solr) and when it is ready - to switch to serve new
queries with the new searcher and close (and free) the old searcher once
its serviced queries are "completed". Solr ( does something similar, and it also warms
newly created searchers.

If the the update of these non-idexed field happens a lot, perhaps a side
data store would be more adequate, i.e. Lucene would store a unique docID
(non-indexed field) for each document, and at search time that ID would be
used to look-up the (updatable) values in the side data store. This means
additional code complexity and, search performance penalty if many result
documents are of interest for every search.

The approach of updating the documents <delete/add> moves the overhead to
indexing while making search faster and the application code simpler. If
this is the selected approach, especially if the updates cannot be easily
"batched" (i.e. delete 1000 docs to be updated, add back those updated 1000
docs), then the following may be of interest - a related proposal to "allow
updating documents with no need to repeatedly open/close reader/writer":

Hope this helps,

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message