lucene-java-user mailing list archives

From Vjeran Marcinko
Subject Duplicate filtering
Date Tue, 20 Sep 2016 05:17:57 GMT

I'm pretty new to Lucene, so I'm looking for some brief guidelines: how 
would I implement duplicate-document filtering based on a field that 
defines uniqueness, so that the first document with a given value stays 
and any later duplicates are filtered out?

I know a third-party contrib library used to exist for this, but it has 
been abandoned/deprecated in the newer versions of Lucene.
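
To make the desired "first one wins" behaviour concrete, here is a minimal 
sketch in plain Java. It deliberately uses no Lucene API: documents are 
modelled as simple maps, and the class name FirstWinsDedup, the method 
keepFirstPerKey and the field name passed in are all hypothetical names 
chosen for illustration. It also assumes that documents lacking the key 
field should be kept rather than treated as duplicates of each other.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

class FirstWinsDedup {

    /**
     * Returns the documents in their original order, keeping only the first
     * document seen for each distinct value of keyField.
     * Assumption: documents without the key field are always kept.
     */
    static List<Map<String, String>> keepFirstPerKey(
            List<Map<String, String>> docs, String keyField) {
        Set<String> seen = new HashSet<>();
        List<Map<String, String>> unique = new ArrayList<>();
        for (Map<String, String> doc : docs) {
            String key = doc.get(keyField);
            // Set.add returns false when the value was already present,
            // which is exactly the "later duplicate" case to drop.
            if (key == null || seen.add(key)) {
                unique.add(doc);
            }
        }
        return unique;
    }
}
```

Applied to search results in rank order, this would keep the 
highest-ranked document for each key value; whether the filtering is 
better done at index time or at query time is part of what I'm asking.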

