mesos-issues mailing list archives

From "Jie Yu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MESOS-7166) port_mapping isolator: netns and veth are not GC-ed
Date Fri, 24 Feb 2017 17:45:44 GMT

    [ https://issues.apache.org/jira/browse/MESOS-7166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15883179#comment-15883179 ]

Jie Yu commented on MESOS-7166:
-------------------------------

We definitely clean up orphan containers' rules and veths. Can you restart the agent and paste
the logs?

> port_mapping isolator: netns and veth are not GC-ed
> ---------------------------------------------------
>
>                 Key: MESOS-7166
>                 URL: https://issues.apache.org/jira/browse/MESOS-7166
>             Project: Mesos
>          Issue Type: Bug
>          Components: isolation, network
>            Reporter: Pierre Cheynier
>
> While testing the port_mapping isolator for a few days in a preproduction environment where many containers are continuously started, sometimes fail, and are destroyed, I ran into this issue: some agent hosts still have their network configuration, meaning that the netns, the veth interfaces, and the tc rules are still there.
> Here is my setup:
> * CentOS 7.2
> * LTS kernel (4.4.21 at that time)
> * libnl 3.2.28
> * Mesos 1.0.2, compiled using:
> {noformat}
> ./configure \
>     CFLAGS="%{optflags}" \
>     CXXFLAGS="%{optflags}" \
>     --disable-silent-rules \
>     --prefix=%{_prefix} \
>     --bindir=%{_bindir} \
>     --libdir=%{_libdir} \
>     --includedir=%{_includedir} \
>     --disable-python \
>     --disable-python-dependency-install \
>     --enable-libevent \
>     --enable-ssl \
>     --enable-optimize \
>     --with-network-isolator
> {noformat}
> I have logs that apparently show that at some point the container was considered an orphan (maybe due to an operation on the host, such as an agent configuration update).
> {noformat}
> Feb 24 13:21:30 mesos-slave049-par mesos-slave[48375]: I0224 13:21:30.421066 48395 containerizer.cpp:690] Removing orphan container a8e05a03-7499-4566-bcba-53d8bf204e5f
> Feb 24 13:21:30 mesos-slave049-par mesos-slave[48375]: I0224 13:21:30.421968 48395 linux_launcher.cpp:349] Using pid namespace to destroy container a8e05a03-7499-4566-bcba-53d8bf204e5f
> {noformat}
> It would be nice if the agent's isolator could clean these up itself.
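
To help confirm whether an agent host really has leftover network namespaces, a rough diagnostic could look like the sketch below. It is not part of Mesos. It assumes that the port_mapping isolator keeps one handle per container under `/var/run/netns`, named after the container's init pid; that layout, the function name `stale_netns_handles`, and both default paths are assumptions for illustration and should be checked against your agent. A handle whose pid no longer appears in `/proc` is reported as a candidate leak. Because pids can be reused, a handle that is not reported may still be stale.

```python
import os


def stale_netns_handles(netns_dir="/var/run/netns", proc_dir="/proc"):
    """Return pid-named entries in netns_dir that have no directory in proc_dir.

    Assumption (not confirmed by this thread): each handle is named after
    the init pid of the container that owns the namespace.
    """
    try:
        names = os.listdir(netns_dir)
    except FileNotFoundError:
        # No handle directory at all, so there is nothing to report.
        return []

    # Only purely numeric names can be pids; ignore everything else.
    pid_names = sorted(
        (n for n in names if n.isascii() and n.isdigit()), key=int
    )

    # A handle is a candidate leak when its pid is no longer running.
    return [
        n for n in pid_names
        if not os.path.isdir(os.path.join(proc_dir, n))
    ]


if __name__ == "__main__":
    for handle in stale_netns_handles():
        print("possibly leaked netns handle:", handle)
```

Run on an affected agent, the output can be pasted alongside the agent logs to show which handles outlived their containers.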



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
