lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Deepak Shakya <just...@gmail.com>
Subject Re: Lucene updateDocument deletes the document, but the counts keep increasing
Date Wed, 17 Oct 2012 13:09:03 GMT
Oh is it. So whenever in future these segments gets merged, i will have my
document count going down right?

On Wed, Oct 17, 2012 at 6:33 PM, Ian Lea <ian.lea@gmail.com> wrote:

> Yes, IndexWriter.updateDocument() deletes and then adds. See the
> javadocs.  So your index will have deleted docs.  Why do you care?
> They'll go away eventually as segments get merged.
>
> If you really do care, see IndexWriter,forceMergeDeletes().  See also
> the javadoc for that: This is often a horribly costly operation;
> rarely is it warranted.
>
>
> --
> Ian.
>
>
> On Wed, Oct 17, 2012 at 1:52 PM, Deepak Shakya <justdpk@gmail.com> wrote:
> > I am using updateDocument() method to update my document in the lucene
> > index. Here is how I am doing it.
> >
> > writer.updateDocument(new Term(Constants.DOC_ID_FIELD,
> > doc.get(Constants.DOC_ID_FIELD)), doc);
> >
> > I check my index data with Luke, and find that on second run of the
> > indexing, Luke tells that Deleted Document - not available. So basically,
> > the document is marked as deleted, but it still resides in the index.
> >
> > I don't want to keep these marked deleted documents. Am I doing it wrong?
> >
> > Also, my understanding was that, when I am updating the document, it
> > deletes the old one and then adds the new one. Is that not the case?
> >
> > --
> > With Regards,
> > Deepak Shakya
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>


-- 
With Regards,
Deepak Shakya
http://www.google.com/profiles/justdpk

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message