nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tim Pease <>
Subject purging 404 URLs with SolrClean
Date Thu, 14 Jul 2011 21:07:08 GMT
I've noticed that SolrClean does not mark URLs as purged from Solr. Will running the SolrClean
task multiple times send the same URLs to Solr for deletion? If so, what is the best strategy
to mark these documents in the crawl DB so they are repeatedly deleted from Solr?

View raw message