hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sergey Shelukhin (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (HIVE-10280) LLAP: Handle errors while sending source state updates to the daemons
Date Wed, 30 Mar 2016 18:06:25 GMT

    [ https://issues.apache.org/jira/browse/HIVE-10280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15218475#comment-15218475
] 

Sergey Shelukhin edited comment on HIVE-10280 at 3/30/16 6:05 PM:
------------------------------------------------------------------

The code looks reasonable... the logic though, would it mean one temp failure will make AM
discard all tasks on the node? 
I also assume it's safe to mark running tasks as killed from AM perspective (wrt potential
future events from them, etc.); -however should we try to send kill to them (and ignore the
failures) so they don't hog resources- actually it may be a good idea to send a kill if we
received a status update from some task that we declared dead.


was (Author: sershe):
The code looks reasonable... the logic though, would it mean one temp failure will make AM
discard all tasks on the node? 
I also assume it's safe to mark running tasks as killed from AM perspective (wrt potential
future events from them, etc.); however should we try to send kill to them (and ignore the
failures) so they don't hog resources?

> LLAP: Handle errors while sending source state updates to the daemons
> ---------------------------------------------------------------------
>
>                 Key: HIVE-10280
>                 URL: https://issues.apache.org/jira/browse/HIVE-10280
>             Project: Hive
>          Issue Type: Sub-task
>          Components: llap
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>         Attachments: HIVE-10280.1.patch
>
>
> Will likely be handled as marking the node as bad. May need a retry policy in place though
before marking a node bad to handle temporary network glitches.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message