lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From suneethad <suneet...@india.adventnet.com>
Subject Remove Duplicates
Date Tue, 12 Feb 2002 03:50:14 GMT
Hello team,
    I have indexed  a set of files based on some categories but I find
the urls to crawl that I've given has a lot of duplication . How
can I remove them .I want to refine the hit results too.
    Secondly can I index also some database values along with the file
contents.
Regards,
Suneetha


Mime
View raw message