incubator-mesos-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dmitriy Lyubimov <dlie...@gmail.com>
Subject Re: Mesos, lxc and ubuntu 12
Date Thu, 13 Jun 2013 01:01:15 GMT
also, listing cgroups produces



On Wed, Jun 12, 2013 at 5:42 PM, Dmitriy Lyubimov <dlieu.7@gmail.com> wrote:

> and here's the list of subsystems if it is relevant.
> lssubsys -am
> cpuset
> cpu /sys/fs/cgroup/cpu
> cpuacct /sys/fs/cgroup/cpuacct
> memory /sys/fs/cgroup/memory
> devices /sys/fs/cgroup/devices
> freezer /sys/fs/cgroup/freezer
> blkio
> perf_event
>
>
>
> On Wed, Jun 12, 2013 at 5:34 PM, Dmitriy Lyubimov <dlieu.7@gmail.com>wrote:
>
>>
>>
>>
>>
>>
>> On Wed, Jun 12, 2013 at 5:24 PM, Benjamin Mahler <
>> benjamin.mahler@gmail.com> wrote:
>>
>>> Cgroups does not allow one to mount the same subsystem across cgroup
>>> hierarchies. Do you have multiple cgroup hierarchies present on your
>>> machine?
>>>
>>
>> I frankly know nothing of cgroups. how do i check? not that i know of .
>>
>>>
>>> Ideally this will work with a stock ubuntu 12 OS, but it's possible that
>>> ubuntu already mounts a cgroup hierarchy with the freezer subsystem in a
>>> location we did not expect.
>>>
>>> What are the contents of the root directory on that machine?
>>>
>>  bin   cdrom  etc     home        initrd.img.old  lib64       media  opt
>>   root  sbin     srv  tmp  var      vmlinuz.old
>> boot  dev    hadoop  initrd.img  lib             lost+found  mnt    proc
>>  run   selinux  sys  usr  vmlinuz
>>
>>
>>>
>>> On Wed, Jun 12, 2013 at 5:19 PM, Dmitriy Lyubimov <dlieu.7@gmail.com>wrote:
>>>
>>>> ok thanks.
>>>>
>>>> now i switched to cgroups and can't get slave to start. The fatal error
>>>> says
>>>>
>>>> F0612 17:17:04.053773 10059 cgroups_isolation_module.cpp:161] Required
>>>> subsystem 'freezer' is already in use
>>>>
>>>> Any hints appreciated.
>>>>
>>>> thank you.
>>>>
>>>>
>>>> On Wed, Jun 12, 2013 at 4:59 PM, Vinod Kone <vinodkone@gmail.com>wrote:
>>>>
>>>>> No problem. Instead of giving --isolation=lxc, you could give
>>>>> --isolation=cgroups. Also for more flags, start mesos slave with --help.
>>>>> Unfortunately, we have been a bit behind on the documentation, so the
only
>>>>> place you can look at are our header files (e.g.,
>>>>> src/slave/cgroups_isolation.hpp). That said, if your kernel supports
it
>>>>> cgroups should work out of the box with mesos.
>>>>>
>>>>> HTH,
>>>>>
>>>>>
>>>>>
>>>>> -- Vinod
>>>>>
>>>>>
>>>>> On Wed, Jun 12, 2013 at 4:52 PM, Dmitriy Lyubimov <dlieu.7@gmail.com>wrote:
>>>>>
>>>>>> Oops. I am just starting with this. I see it clearly not working..
 I
>>>>>> just downloaded 0.11 and trying to set up spark 0.7.2 with it. it
works ok
>>>>>> with "process" isolation. I assumed lxc would be preferrable since
it is
>>>>>> being advertised feature on the Mesos home page.
>>>>>>
>>>>>> I will snoop around the docs looking for cgroups isolation. If you
>>>>>> can point me to manual, i'd be grateful too.
>>>>>>
>>>>>>
>>>>>>
>>>>>> On Wed, Jun 12, 2013 at 4:48 PM, Vinod Kone <vinodkone@gmail.com>wrote:
>>>>>>
>>>>>>> Hi Dmitry,
>>>>>>>
>>>>>>> What version of mesos are you using? Lxc support has been deprecated
>>>>>>> for a while now. You should use the new cgroups isolation.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On Wed, Jun 12, 2013 at 4:26 PM, Dmitriy Lyubimov <dlieu.7@gmail.com
>>>>>>> > wrote:
>>>>>>>
>>>>>>>> Hello,
>>>>>>>>
>>>>>>>> is there anything speicific to ubuntu 12 that needs to be
done to
>>>>>>>> make Mesos work with LCX?
>>>>>>>>
>>>>>>>> I set things up according to ubuntu docs,
>>>>>>>> https://help.ubuntu.com/12.10/serverguide/lxc.html#lxc-creation
>>>>>>>>
>>>>>>>> and all container examples there seem to be happily working.
>>>>>>>>
>>>>>>>> However, some mesos unit tests are failing (which i suspect
are
>>>>>>>> relating to lxc) as well as lxc isolation mode fails to spawn
tasks.
>>>>>>>>
>>>>>>>> (I am actually on ubuntu 12-04 LTS).
>>>>>>>>
>>>>>>>> Is there any speicific way to troubleshoot this? Is LXC in
Mesos
>>>>>>>> even working with Ubuntu 12?
>>>>>>>>
>>>>>>>> thank you in advance. (slave output enclosed).
>>>>>>>> -d
>>>>>>>>
>>>>>>>> I0612 16:24:20.682698 26452 slave.cpp:474] Got assigned task
0 for
>>>>>>>> framework 201306121623-16777343-5050-26417-0000
>>>>>>>> I0612 16:24:20.683425 26452 paths.hpp:234] Created executor
>>>>>>>> directory
>>>>>>>> '/tmp/mesos/slaves/201306121623-16777343-5050-26417-0/frameworks/201306121623-16777343-5050-26417-0000/executors/Task
>>>>>>>> 0 ("/home/dmitr...)/runs/9156d4fa-a177-464b-906f-fb62c8b9b363'
>>>>>>>> I0612 16:24:20.683630 26453 lxc_isolation_module.cpp:121]
Launching
>>>>>>>> Task 0 ("/home/dmitr...) (/usr/local/libexec/mesos/mesos-executor)
in
>>>>>>>> /tmp/mesos/slaves/201306121623-16777343-5050-26417-0/frameworks/201306121623-16777343-5050-26417-0000/executors/Task
>>>>>>>> 0 ("/home/dmitr...)/runs/9156d4fa-a177-464b-906f-fb62c8b9b363
with
>>>>>>>> resources ' for framework 201306121623-16777343-5050-26417-0000
>>>>>>>> I0612 16:24:20.683945 26453 lxc_isolation_module.cpp:152]
Forked
>>>>>>>> executor at = 26570
>>>>>>>> lxc-execute: No such file or directory - failed to create
>>>>>>>> '/sys/fs/cgroup/cpuset//lxc/mesos_executor_Task 0
>>>>>>>> ("/home/dmitr...)_framework_201306121623-16777343-5050-26417-0000'
directory
>>>>>>>> lxc-execute: failed to spawn 'mesos_executor_Task 0
>>>>>>>> ("/home/dmitr...)_framework_201306121623-16777343-5050-26417-0000'
>>>>>>>> lxc-execute: No such file or directory - failed to remove
cgroup
>>>>>>>> '/sys/fs/cgroup/cpuset//lxc/mesos_executor_Task 0
>>>>>>>> ("/home/dmitr...)_framework_201306121623-16777343-5050-26417-0000'
>>>>>>>> I0612 16:24:21.451616 26452 lxc_isolation_module.cpp:322]
Telling
>>>>>>>> slave of lost executor Task 0 ("/home/dmitr...) of framework
>>>>>>>> 201306121623-16777343-5050-26417-0000
>>>>>>>> I0612 16:24:21.451709 26452 lxc_isolation_module.cpp:239]
Stopping
>>>>>>>> container mesos_executor_Task 0
>>>>>>>> ("/home/dmitr...)_framework_201306121623-16777343-5050-26417-0000
>>>>>>>> I0612 16:24:21.452199 26454 slave.cpp:998] Executor 'Task
0
>>>>>>>> ("/home/dmitr...)' of framework 201306121623-16777343-5050-26417-0000
has
>>>>>>>> exited with status 255
>>>>>>>> sh: 1: Syntax error: "(" unexpected
>>>>>>>> E0612 16:24:21.453227 26452 lxc_isolation_module.cpp:248]
Failed to
>>>>>>>> stop container mesos_executor_Task 0
>>>>>>>> ("/home/dmitr...)_framework_201306121623-16777343-5050-26417-0000,
lxc-stop
>>>>>>>> returned: 512
>>>>>>>> I0612 16:24:21.453385 26454 slave.cpp:829] Status update:
task 0 of
>>>>>>>> framework 201306121623-16777343-5050-26417-0000 is now in
state TASK_FAILED
>>>>>>>> E0612 16:24:21.453583 26453 lxc_isolation_module.cpp:273]
ERROR!
>>>>>>>> Asked to update resources for an unknown executor!
>>>>>>>> I0612 16:24:21.453891 26451 gc.cpp:97] Scheduling
>>>>>>>> /tmp/mesos/slaves/201306121623-16777343-5050-26417-0/frameworks/201306121623-16777343-5050-26417-0000/executors/Task
>>>>>>>> 0 ("/home/dmitr...)/runs/9156d4fa-a177-464b-906f-fb62c8b9b363
for removal
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>
>>>>>
>>>>
>>>
>>
>

Mime
View raw message