Return-Path: X-Original-To: apmail-incubator-mesos-dev-archive@minotaur.apache.org Delivered-To: apmail-incubator-mesos-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 27084979F for ; Sat, 16 Mar 2013 20:34:14 +0000 (UTC) Received: (qmail 61083 invoked by uid 500); 16 Mar 2013 20:34:14 -0000 Delivered-To: apmail-incubator-mesos-dev-archive@incubator.apache.org Received: (qmail 61044 invoked by uid 500); 16 Mar 2013 20:34:13 -0000 Mailing-List: contact mesos-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mesos-dev@incubator.apache.org Delivered-To: mailing list mesos-dev@incubator.apache.org Received: (qmail 61019 invoked by uid 99); 16 Mar 2013 20:34:13 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 16 Mar 2013 20:34:13 +0000 Date: Sat, 16 Mar 2013 20:34:12 +0000 (UTC) From: "Benjamin Mahler (JIRA)" To: mesos-dev@incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Assigned] (MESOS-396) Slave GarbageCollector needs to delete the parent executor directories. It currently only deletes the executor run directories. MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MESOS-396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler reassigned MESOS-396: ------------------------------------- Assignee: Benjamin Mahler > Slave GarbageCollector needs to delete the parent executor directories. It currently only deletes the executor run directories. > ------------------------------------------------------------------------------------------------------------------------------- > > Key: MESOS-396 > URL: https://issues.apache.org/jira/browse/MESOS-396 > Project: Mesos > Issue Type: Bug > Reporter: Benjamin Mahler > Assignee: Benjamin Mahler > Priority: Blocker > > The result of this is that long lived slaves accumulate a large number of empty executor directories. All that remains in these directories is a broken link to the 'latest' run. > Over time, as the slave approaches having LINK_MAX empty executor directories, the slave will crash from mkdir failing, as was found in MESOS-391. > The fix is that we have to schedule the executor parent directories for deletion, however the GC module does not know whether the parent executor can be deleted! This is because there could be more tasks launched with the same executor id, since having scheduled the directory for deletion. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira