nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Matt Poff <matt.headfi...@gmail.com>
Subject Re: How parse *only* specific URLs under a domain... -depth 1 -topN 1 does not work as desired
Date Fri, 03 Feb 2012 10:39:14 GMT

>I'm not a nutch expert, but I would try to run a crawl with -depth 0.

Tried that, but a depth of zero genetates no results at all.

On 3/02/2012, at 10:07 PM, Markus Jelsma <markus.jelsma@openindex.io> wrote:

> you can inject the url's you want and use the noAdditions switch when updating 
> the crawldb.

Thanks - that sounds perfect.
Mime
  • Unnamed multipart/alternative (inline, 7-Bit, 0 bytes)
View raw message