lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From suneethad <>
Subject Remove Duplicates
Date Tue, 12 Feb 2002 03:50:14 GMT
Hello team,
    I have indexed  a set of files based on some categories but I find
the urls to crawl that I've given has a lot of duplication . How
can I remove them .I want to refine the hit results too.
    Secondly can I index also some database values along with the file

View raw message