incubator-mesos-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Joachim Karnbach-Mink <...@infinipool.com>
Subject Re: Process exited when connecting slaves
Date Thu, 09 Jun 2011 08:04:04 GMT
Hi Matei,

yes, you're right, now it's working. I replaced 127.0.0.1 with the real ip
address and it's working. The --ip option doesn't seems to work, I tried
this but this is ignored, the slaves sent the localhost address. Maybe this
is added in the latest release, currently I'm using the alpha-03.

But this "workaround" is OK for me.

Thanks a lot,
Joachim

2011/6/9 Matei Zaharia <matei@eecs.berkeley.edu>

> Hi Joachim,
>
> The problem here is that the slave is reporting IP address 127.0.1.1 for
> itself. The master tries to open a reverse connection to the slave (to
> 1@127.0.1.1:42553) and fails, so it thinks that the slave has died. You
> can fix this by either configuring /etc/hosts on the slave node to lists its
> external IP for its hostname rather than 127.0.1.1, or passing the --ip
> argument to mesos-slave.
>
> Matei
>
> On Jun 9, 2011, at 12:39 AM, Joachim Karnbach-Mink wrote:
>
> > Hi all,
> >
> > I currently setup my first Mesos cluster on Ubuntu Lucid 64Bit and run in
> > some trouble. The master starts without any problems:
> >
> > ./bin/mesos-master --ip=192.168.1.163
> > I0609 09:30:26.087971 28453 logging.cpp:40] Logging to
> > /home/jkm/mesos-mesos-81c4e62/logs
> > I0609 09:30:26.088764 28453 main.cpp:66] Build: 2011-06-08 11:05:22 by
> jkm
> > I0609 09:30:26.088783 28453 main.cpp:67] Starting Mesos master
> > I0609 09:30:26.090169 28453 webui.cpp:63] Starting master web UI on port
> > 8080
> > I0609 09:30:26.090363 28456 webui.cpp:31] Web UI thread started
> > I0609 09:30:26.099850 28454 master.cpp:258] Master started at mesos://
> > 1@192.168.1.163:5050
> > I0609 09:30:26.100067 28454 master.cpp:268] Master ID: 201106090930-0
> > I0609 09:30:26.100087 28454 master.cpp:1124] Creating "simple" allocator
> > I0609 09:30:26.100646 28454 master.cpp:286] New master detected ... maybe
> > it's us!
> > I0609 09:30:26.109102 28456 webui.cpp:43] Loading webui/master/webui.py
> > Bottle server starting up (using WSGIRefServer())...
> > Listening on http://0.0.0.0:8080/
> >
> > But if I want to connect a slave via the network I got Process exited
> after
> > registering.
> >
> > On the slave:
> > ./bin/mesos-slave --url=mesos://1@192.168.1.163:5050
> > I0609 09:32:27.657230  8397 logging.cpp:40] Logging to
> > /home/jkm/mesos-mesos-81c4e62/logs
> > I0609 09:32:27.657889  8397 main.cpp:66] Creating "process" isolation
> module
> > I0609 09:32:27.657948  8397 main.cpp:74] Build: 2011-06-08 11:34:46 by
> jkm
> > I0609 09:32:27.657965  8397 main.cpp:75] Starting Mesos slave
> > I0609 09:32:27.660188  8397 webui.cpp:72] Starting slave web UI on port
> 8081
> > I0609 09:32:27.660663  8398 slave.cpp:149] Slave started at
> > 1@127.0.1.1:42553
> > I0609 09:32:27.661268  8398 slave.cpp:175] New master at
> > 1@192.168.1.163:5050 with ID:0
> > I0609 09:32:27.661519  8400 webui.cpp:32] Web UI thread started
> > I0609 09:32:27.679116  8400 webui.cpp:45] Loading webui/slave/webui.py
> > Bottle server starting up (using WSGIRefServer())...
> > Listening on http://0.0.0.0:8081/
> >
> > On the master:
> > I0609 09:32:27.666820 28454 master.cpp:481] Registering slave
> > 201106090930-0-0 at 1@127.0.1.1:42553
> > I0609 09:32:27.667151 28454 simple_allocator.cpp:36] Added slave
> > 201106090930-0-0
> > I0609 09:32:27.667266 28454 master.cpp:722] Process exited:
> > 1@127.0.1.1:42553
> > I0609 09:32:27.667290 28454 master.cpp:734] slave 201106090930-0-0
> > disconnected
> > I0609 09:32:27.667328 28454 simple_allocator.cpp:45] Removed slave
> > 201106090930-0-0
> >
> > If I start a slave session on the master this works fine:
> > On the master:
> > I0609 09:35:12.966112 28454 master.cpp:481] Registering slave
> > 201106090930-0-1 at 1@127.0.0.1:58401
> > I0609 09:35:12.966351 28454 simple_allocator.cpp:36] Added slave
> > 201106090930-0-1
> >
> > Anybody an idea where I can have a look at? In the log files there are no
> > additional informations. I tried this with different Mesos versions but
> all
> > have the same behavior.
> >
> > Thanks a lot,
> > Joachim
>
>


-- 
Joachim Karnbach-Mink
Technical Manager
global infinipool GmbH

Gerhard-Jahn-Platz 1
D-35037 Marburg

P: +49 6421 983004 0
M: +49 1525 374 11 11
F: +49 6421 983004 90
W: http://infinipool.com/

visit my techblog: http://techblog.infinipool.com

Registration Court: Amtsgericht Marburg, HRB 5255
Managing Director: Martin Scholl
Tax ID: DE255485693

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message