nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Markus Jelsma <markus.jel...@openindex.io>
Subject Re: error "java.net.SocketException: Connection reset" in crawl with nutch
Date Tue, 06 Dec 2011 09:09:38 GMT
This is a TCP error and this is going to happen occasionally if you set up 
many connections. There may be little you can do about this:

http://wiki.apache.org/nutch/OptimizingCrawls

> hi, i crawl 4 sites with:
> 
> topN=1000000
> depth=3
> http.max.delays=1000
> http.timeout=80000
> nutch1.3
> 
> i have "java.net.SocketException: Connection reset" error in crawl log.
> help me.
> 
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/error-java-net-SocketException-Connecti
> on-reset-in-crawl-with-nutch-tp3563015p3563015.html Sent from the Nutch -
> User mailing list archive at Nabble.com.

Mime
  • Unnamed multipart/alternative (inline, 7-Bit, 0 bytes)
View raw message