tez-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bikas Saha (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TEZ-1141) DAGStatus.Progress should include number of failed attempts
Date Thu, 23 Oct 2014 01:58:33 GMT

    [ https://issues.apache.org/jira/browse/TEZ-1141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14180906#comment-14180906
] 

Bikas Saha commented on TEZ-1141:
---------------------------------

Hive may not show but is it useful? IMO, when there is cluster outage that results in bad
nodes and retries or preemption goes awry, we will never know until killed attempts are exposed.
Either via the UI or via the progress. Does the UI show this?

> DAGStatus.Progress should include number of failed attempts
> -----------------------------------------------------------
>
>                 Key: TEZ-1141
>                 URL: https://issues.apache.org/jira/browse/TEZ-1141
>             Project: Apache Tez
>          Issue Type: Improvement
>    Affects Versions: 0.5.0
>            Reporter: Bikas Saha
>            Assignee: Hitesh Shah
>         Attachments: TEZ-1141.1.patch
>
>
> Currently its impossible to know whether a job is seeing a lot of issues and failures
because we only report running tasks. Eventually the job fails but before that we have no
indication that a bunch of task failures have been happening.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message