hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vinod Kumar Vavilapalli (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-2899) [HOD] hdfs:///mapredsystem directory not cleaned up after deallocation
Date Thu, 20 Mar 2008 06:12:24 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-2899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12580679#action_12580679
] 

Vinod Kumar Vavilapalli commented on HADOOP-2899:
-------------------------------------------------

Very minor changes :
   * In testing/testHodCleanup.py, test class testUnresponsiveJobTracker has the log message
"Job Tracker did not exit even after a minute. Not going to try and cleanup the system directory".
The time 'minute' should instead depend on the number of retries.
   * In the same class, the mrSysDir parameter used is "/user/yhemanth/mapredsystem/hoduser.123.abc.com".
Need to change this.

Even that, these are pretty cosmetic, so they can be checked in later, given the want of time.

+1 for the fix. OK for commit.

> [HOD] hdfs:///mapredsystem directory not cleaned up after deallocation 
> -----------------------------------------------------------------------
>
>                 Key: HADOOP-2899
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2899
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: contrib/hod
>    Affects Versions: 0.16.0
>            Reporter: Luca Telloli
>            Assignee: Hemanth Yamijala
>             Fix For: 0.17.0
>
>         Attachments: 2899.1.patch
>
>
> Each submitted job creates a hdfs:///mapredsystem directory, created by (I guess) the
hodring process. Problem is that it's not cleaned up at the end of the process; a use case
would be:
> - user A allocates a cluster, the hodring is svrX, so a /mapredsystem/srvX directory
is created
> - user A deallocates the cluster, but that directory is not cleaned up
> - user B allocates a cluster, and the first node chosen as hodring is svrX, so hodring
tries to write hdfs:///mapredsystem but it fails
> - allocation succeeds, but there's no hodring running; looking at
> 0-jobtracker/logdir/hadoop.log under the temporary directory I can read:
> 2008-02-26 17:28:42,567 WARN org.apache.hadoop.mapred.JobTracker: Error starting tracker:
org.apache.hadoop.ipc.RemoteException: org.apache.hadoop.fs.permission.AccessControlException:
Permission denied: user=B, access=WRITE, inode="mapredsystem":hadoop:supergroup:rwxr-xr-x
> I guess a possible solution would be to clean up those directories during the deallocation
process. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message