nutch-user mailing list archives

From "Naomi Dushay" <Na...@cs.cornell.edu>
Subject RE: Error trying to crawl.
Date Mon, 02 May 2005 13:45:00 GMT
Ian,

Did you create a nutch db with the appropriate tool?  Seems like it's saying
it can't find that db.  Or maybe you didn't use a command line arg to tell it
where the db is?

- Naomi Dushay
National Science Digital Library
Cornell University

> -----Original Message-----
> From: Ian Reardon [mailto:ian_reardon@hotmail.com]
> Sent: Saturday, April 30, 2005 4:42 PM
> To: nutch-user@incubator.apache.org
> Subject: Error trying to crawl.
> 
> I am getting this error when I try to do a simple crawl:
> 
> ~/nutch-0.6$ bin/nutch crawl sites.crawl -dir crawl.test -depth 2 -delay 5 > crawl.log
> 050430 163851 No NutchFileSystem indicated, so defaulting to local fs.
> 050430 163851 loading file:/home/reardoni/nutch-0.6/conf/nutch-default.xml
> 050430 163851 loading file:/home/reardoni/nutch-0.6/conf/crawl-tool.xml
> 050430 163851 loading file:/home/reardoni/nutch-0.6/conf/nutch-site.xml
> 050430 163851 crawl started in: crawl.test
> 050430 163851 rootUrlFile = 5
> 050430 163851 threads = 10
> 050430 163851 depth = 2
> 050430 163851 Created webdb at LocalFS,/home/reardoni/nutch-0.6/crawl.test/db
> Exception in thread "main" java.io.FileNotFoundException: 5 (No such file or directory)
>         at java.io.FileInputStream.open(Native Method)
>         at java.io.FileInputStream.<init>(FileInputStream.java:106)
>         at java.io.FileReader.<init>(FileReader.java:55)
>         at net.nutch.db.WebDBInjector.injectURLFile(WebDBInjector.java:359)
>         at net.nutch.db.WebDBInjector.main(WebDBInjector.java:510)
>         at net.nutch.tools.CrawlTool.main(CrawlTool.java:121)
> 
> 
> Any ideas?
> 
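
One detail in the quoted log may help narrow this down: it prints "rootUrlFile = 5", and the exception names "5" as the missing file, so the value that reached the injector as the URL file appears to have been 5 rather than sites.crawl. The sketch below is not Nutch code. It is a minimal, standalone Java illustration (the class and method names are made up for this example) of how opening a nonexistent path through java.io.FileReader, the same class shown in the stack trace, produces a FileNotFoundException carrying that path.

```java
import java.io.FileNotFoundException;
import java.io.FileReader;
import java.io.IOException;

// Hypothetical helper for illustration only; not part of Nutch.
public class UrlFileCheck {

    // Tries to open the given path with java.io.FileReader and
    // returns a short description of what happened.
    static String tryOpen(String path) {
        try (FileReader reader = new FileReader(path)) {
            return "opened " + path;
        } catch (FileNotFoundException e) {
            // The exception message starts with the path that was passed in.
            return "not found: " + e.getMessage();
        } catch (IOException e) {
            // Only reachable if closing the reader fails.
            return "I/O error: " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        // "5" is the value the quoted log reported as rootUrlFile.
        System.out.println(tryOpen("5"));
    }
}
```

Running this from a directory with no file named 5 should report the path as not found, which is consistent with the trace above. The exact wording after the path depends on the operating system.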

