lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Yonik Seeley <ysee...@gmail.com>
Subject Re: Document Duplication for Multiple Segment Merge
Date Fri, 14 Oct 2005 17:18:26 GMT
There is no concept in Lucene of document identity linked to any fields of a
document.
You need to handle removal of duplicates yourself.

-Yonik
Now hiring -- http://tinyurl.com/7m67g


On 10/14/05, Michael Ji <fji_00@yahoo.com> wrote:
>
> hi,
>
> When Nutch's IndexMerger.java is called, the indexes
> from multiple segment directories are merged to one
> target directory.
>
> I wonder how lucene deals with the case when identical
> documents existing in two segments. Is the older
> document ( lower time stamp ) deleted?
>
> thanks,
>
> Michael Ji,
>
>
>
> __________________________________
> Yahoo! Music Unlimited
> Access over 1 million songs. Try it free.
> http://music.yahoo.com/unlimited/
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-dev-help@lucene.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message