nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lewis John Mcgibbney <>
Subject Re: Nutch Hadoop Optimization
Date Thu, 15 Dec 2011 17:47:29 GMT
This is overwhelmingly weighted towards Hadoop configuration.

There are some guidance notes on the Nutch wiki for performance issues
so you may wish to give them a try first.

On Thu, Dec 15, 2011 at 4:22 PM, Bai Shen <> wrote:
> So I have Nutch running on a hadoop cluster with three data nodes.  The
> machines are all pretty beefy, but Nutch isn't performing any faster than
> when I was running in pseudo mode on one machine.
> How to I set Nutch in order to take full advantage of the cluster?
> Thanks.


View raw message