nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Xiao Li <shinelee.thew...@gmail.com>
Subject Just fetch a specified URL list
Date Mon, 06 Feb 2012 05:07:08 GMT
I have compiled a URL list (1 million URLs). I just want to use Nutch to
only crawl these URLs. How can I do it? I have tried to specified the
parameter "-depth 1 -topN 1000000". But Nutch still crawls some non-on-list
URLs.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message