asterixdb-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF subversion and git services (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ASTERIXDB-2003) Failure during job start will cause tasks to hang
Date Thu, 27 Jul 2017 20:10:00 GMT

    [ https://issues.apache.org/jira/browse/ASTERIXDB-2003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16103821#comment-16103821
] 

ASF subversion and git services commented on ASTERIXDB-2003:
------------------------------------------------------------

Commit 54a507007e08cd84652774263bd7e1fe9ede8a0f in asterixdb's branch refs/heads/master from
[~mhubail]
[ https://git-wip-us.apache.org/repos/asf?p=asterixdb.git;h=54a5070 ]

[ASTERIXDB-2003][FAIL] Abort jobs failing during job start

- user model changes: no
- storage format changes: no
- interface changes: no

Details:
- Prevent NPE or unmodifiable list in JobCleanupWork and
  JobletCleanupNotificationWork.
- Abort job if a failure happens during job start

Change-Id: If6fe4ed9084270f9f22ee4b4c71936d679c8b883
Reviewed-on: https://asterix-gerrit.ics.uci.edu/1904
Sonar-Qube: Jenkins <jenkins@fulliautomatix.ics.uci.edu>
Reviewed-by: Yingyi Bu <buyingyi@gmail.com>
Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>
BAD: Jenkins <jenkins@fulliautomatix.ics.uci.edu>
Integration-Tests: Jenkins <jenkins@fulliautomatix.ics.uci.edu>


> Failure during job start will cause tasks to hang
> -------------------------------------------------
>
>                 Key: ASTERIXDB-2003
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-2003
>             Project: Apache AsterixDB
>          Issue Type: Bug
>          Components: FAIL - Failure handling/reporting, HYR - Hyracks
>            Reporter: Murtadha Hubail
>            Assignee: Murtadha Hubail
>
> If a job fails during its start and some tasks have already been submitted to some NC
but not yet started, those tasks will not be aborted because the JobCleanup triggered due
to the failure will remove the joblet from the NC before the received tasks are added to it.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message