hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Lowe (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-5623) TestJobCleanup fails because of RejectedExecutionException and NPE.
Date Fri, 22 Nov 2013 14:19:35 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-5623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13829994#comment-13829994

Jason Lowe commented on MAPREDUCE-5623:

bq. Or, I've noticed that Job#getCounters() can return null in some cases

I think that's much more likely to be the case of what's happening here.  It's not like the
JobImpl was just created at the time we're trying to get the counters for it, as we're waiting
for job completion before attempting to get them.

Unfortunately I can't reproduce the issue locally to dig deeper.  I think knowing what the
AM logs looked like for the job and whether the client was redirected to the history server
before the null counters could shed a lot of light on the problem.  If the client was redirected,
examining the .jhist file would also be interesting.

> TestJobCleanup fails because of RejectedExecutionException and NPE.
> -------------------------------------------------------------------
>                 Key: MAPREDUCE-5623
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5623
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>            Reporter: Tsuyoshi OZAWA
>            Assignee: Tsuyoshi OZAWA
>         Attachments: MAPREDUCE-5623.1.patch
> org.apache.hadoop.mapred.TestJobCleanup can fail because of RejectedExecutionException
by NonAggregatingLogHandler. This problem is described in YARN-1409. TestJobCleanup can still
fail after fixing RejectedExecutionException, because of NPE by Job#getCounters()'s returning
> {code}
> -------------------------------------------------------------------------------
> Test set: org.apache.hadoop.mapred.TestJobCleanup
> -------------------------------------------------------------------------------
> Tests run: 3, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 140.933 sec <<<
FAILURE! - in org.apache.hadoop.mapred.TestJobCleanup
> testCustomAbort(org.apache.hadoop.mapred.TestJobCleanup)  Time elapsed: 31.068 sec  <<<
> java.lang.NullPointerException: null
>         at org.apache.hadoop.mapred.TestJobCleanup.testFailedJob(TestJobCleanup.java:199)
>         at org.apache.hadoop.mapred.TestJobCleanup.testCustomAbort(TestJobCleanup.java:296)
> {code}

This message was sent by Atlassian JIRA

View raw message