lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Daniel Noll <dan...@nuix.com>
Subject Re: Index Dedupe
Date Tue, 02 Oct 2007 04:00:07 GMT
On Tuesday 02 October 2007 12:25:47 Johnny R. Ruiz III wrote:
> Hi,
>
> I can't seem to find a way to delete duplicate in lucene index.  I hve  a
> unique key so it seems to be straight forward.  But I can't find a simple
> way  to do it except for putting  each record in the index into HashMap. 
> Are there any method in lucene package that I could use?

I would use termDocs() to iterate all the terms in that field.  Then skip the 
first doc for each term and delete all subsequent ones.

Daniel

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message