hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eli Collins (Resolved) (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (MAPREDUCE-2924) TaskTracker number of failed disks to tolerate should be configurable
Date Sat, 05 Nov 2011 01:43:51 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-2924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Eli Collins resolved MAPREDUCE-2924.
------------------------------------

    Resolution: Won't Fix

Thought about this some.. I think leaving the current behavior as is (TT keeps running regardless
# disk failures) but using a health script that shutsdown the TT when the DN goes down makes
more sense. The DN already has logic for shutting down given a sufficient # of disk failures,
and it doesn't make sense for the TT to keep running if the DN isn't running. Do think we
still need to fix MAPREDUCE-2657, otherwise restarting a cluster may result in a bunch of
TTs that were running not coming up because they tolerated a disk failure while running but
won't while starting.
                
> TaskTracker number of failed disks to tolerate should be configurable
> ---------------------------------------------------------------------
>
>                 Key: MAPREDUCE-2924
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2924
>             Project: Hadoop Map/Reduce
>          Issue Type: Sub-task
>          Components: tasktracker
>    Affects Versions: 0.20.204.0
>            Reporter: Eli Collins
>
> Like HDFS-1161 but for the TT. The user should be able to configure how many valid disks
are needed for operation. Currently the TT will start and accept tasks even if eg only 1 of
its 12 disks is working, which leads to poor performance of jobs with tasks that use this
machine.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message