hadoop-common-user mailing list archives

From Mark question <markq2...@gmail.com>
Subject Re: connection between slaves and master
Date Wed, 11 Jan 2012 19:09:41 GMT
Exactly right. Thanks, Praveen.
Mark

On Tue, Jan 10, 2012 at 1:54 AM, Praveen Sripati
<praveensripati@gmail.com> wrote:

> Mark,
>
> > [mark@node67 ~]$ telnet node77
>
> You need to specify the port number along with the server name, like
> `telnet node77 1234`.
>
> > 2012-01-09 10:04:03,436 INFO org.apache.hadoop.ipc.Client: Retrying
> connect to server: localhost/127.0.0.1:12123. Already tried 0 time(s).
>
> Slaves are not able to connect to the master. The `fs.default.name` and
> `mapred.job.tracker` properties should point to the master, not to
> localhost, when the master and slaves are on different machines.
>
> Praveen
>
> On Mon, Jan 9, 2012 at 11:41 PM, Mark question <markq2011@gmail.com>
> wrote:
>
> > Hello guys,
> >
> > I'm requesting a number of machines from a PBS scheduler to run Hadoop,
> > and even though all Hadoop daemons start normally on the master and
> > slaves, the slaves don't have worker tasks in them. Digging into that,
> > there seems to be some blocking between nodes (I don't know how else to
> > describe it): on a slave, "telnet master-node" should be able to
> > connect, but I get this error:
> >
> > [mark@node67 ~]$ telnet node77
> >
> > Trying 192.168.1.77...
> > telnet: connect to address 192.168.1.77: Connection refused
> > telnet: Unable to connect to remote host: Connection refused
> >
> > The log on the slave nodes shows the same thing, even though the
> > datanode and tasktracker were started from the master (?):
> >
> > 2012-01-09 10:04:03,436 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/127.0.0.1:12123. Already tried 0 time(s).
> > 2012-01-09 10:04:04,439 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/127.0.0.1:12123. Already tried 1 time(s).
> > 2012-01-09 10:04:05,442 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/127.0.0.1:12123. Already tried 2 time(s).
> > 2012-01-09 10:04:06,444 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/127.0.0.1:12123. Already tried 3 time(s).
> > 2012-01-09 10:04:07,446 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/127.0.0.1:12123. Already tried 4 time(s).
> > 2012-01-09 10:04:08,448 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/127.0.0.1:12123. Already tried 5 time(s).
> > 2012-01-09 10:04:09,450 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/127.0.0.1:12123. Already tried 6 time(s).
> > 2012-01-09 10:04:10,452 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/127.0.0.1:12123. Already tried 7 time(s).
> > 2012-01-09 10:04:11,454 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/127.0.0.1:12123. Already tried 8 time(s).
> > 2012-01-09 10:04:12,456 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/127.0.0.1:12123. Already tried 9 time(s).
> > 2012-01-09 10:04:12,456 INFO org.apache.hadoop.ipc.RPC: Server at localhost/127.0.0.1:12123 not available yet, Zzzzz...
> >
> > Any suggestions on what I can do?
> >
> > Thanks,
> > Mark
> >
>
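
The two checks discussed in this thread (probing the master on an explicit port, and looking for `fs.default.name` / `mapred.job.tracker` values that still point at localhost) could be scripted roughly as below. This is only a minimal sketch and was not posted by either participant: the function names `can_connect` and `localhost_properties` are made up for illustration, and `SAMPLE_CONF`, including its `hdfs://localhost:12123` and `node77:12124` values, is a hypothetical config assembled from the host and port seen in the messages above, not Mark's actual files.

```python
import socket
import xml.etree.ElementTree as ET

# Property names taken from Praveen's reply.
MASTER_PROPERTIES = ("fs.default.name", "mapred.job.tracker")
LOOPBACK_HOSTS = ("localhost", "127.0.0.1")


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds.

    Roughly what `telnet host port` tests, without the interactive session.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False


def localhost_properties(xml_text):
    """Return {name: value} for master properties that point at a loopback host."""
    root = ET.fromstring(xml_text)
    flagged = {}
    for prop in root.iter("property"):
        name = (prop.findtext("name") or "").strip()
        value = (prop.findtext("value") or "").strip()
        if name in MASTER_PROPERTIES and any(h in value for h in LOOPBACK_HOSTS):
            flagged[name] = value
    return flagged


# Hypothetical config contents, for illustration only.
SAMPLE_CONF = """<configuration>
  <property><name>fs.default.name</name><value>hdfs://localhost:12123</value></property>
  <property><name>mapred.job.tracker</name><value>node77:12124</value></property>
</configuration>"""
```

Calling `localhost_properties(SAMPLE_CONF)` flags only the `fs.default.name` entry, since that is the one still naming localhost. On a slave, `can_connect("node77", 12123)` returning `False` would be consistent with the retry messages in the log, though it cannot say whether the cause is configuration, a firewall, or a daemon that is not listening.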
