nutch-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Nutch Wiki] Update of "ErrorMessages" by ChiragChaman
Date Sat, 09 Apr 2005 01:14:35 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by ChiragChaman:
http://wiki.apache.org/nutch/ErrorMessages

New page:

= Error messages, reasons and solutions =

Please feel free to add error messages, reasons and solutions!

Please report bugs to the mailing list!

  * Fetching
  * Updating
  * Searching

== Errors Fetching ==

'''Why do I get error "123456 104934 fetch of http://mydomain/index.html failed with: net.nutch.net.protocols.http.HttpError:
HTTP Error: 401" when crawling?'''
  * An HTTP 401 error is returned from a remote webserver when you not authorized to view
the page. Currently nutch does not support HTTP authentication but it will be trivial to add
when the new HTTPClient fetcher code is committed.

'''/etc/host.conf: line 1: cannot specify more then 4 services'''
  * Please have a look at http://sources.redhat.com/ml/bug-glibc/2002-07/msg00269.html

== Errors Updating ==

'''Until updating my DB I got a OutOfMemoryException or a 'to many files open' error.'''
  * The problems is that nutch opens more files then your OS allows to open. You can check
the limits of your machine with "ulimit -a". In case you run nutch as superuser you can set
the limit of open files for the actual session with "ulimit -n 65536". To change this limit
permanently please read: http://bbcr.uwaterloo.ca/~brecht/servers/openfiles.html

== Errors Searching ==

'''Tomcat reports root cause: java.lang.OutOfMemoryError and does not find anything.'''
  * Try to give java / tomcat some more memory. Add to catalina.sh (linux): JAVA_OPTS=-Xmx256m

== Errors installing ==

See GettingNutchRunningOnDebian for some help.

Mime
View raw message