hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joydeep Sen Sarma (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-1422) skip counter update when RunningJob.getCounters() returns null
Date Thu, 29 Jul 2010 23:05:16 GMT

    [ https://issues.apache.org/jira/browse/HIVE-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893852#action_12893852

Joydeep Sen Sarma commented on HIVE-1422:

i looked at the hadoop source for 20 a bit. looks like both getCounters() and getJob() can
return null (in case the job cannot be found). on 0.20 - completed jobs are looked up from
persistent store - so i think this is pretty hard to happen (if it does - it seems like a
hadoop bug). but for 17 (and maybe other versions in between) - we need to guard against these.

> skip counter update when RunningJob.getCounters() returns null
> --------------------------------------------------------------
>                 Key: HIVE-1422
>                 URL: https://issues.apache.org/jira/browse/HIVE-1422
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.6.0
>            Reporter: John Sichi
>            Assignee: Joydeep Sen Sarma
>             Fix For: 0.7.0
>         Attachments: HIVE-1422.1.patch
> Under heavy load circumstances on some Hadoop versions, we may get a NPE from trying
to dereference a null Counters object.  I don't have a unit test which can reproduce it, but
here's an example stack from a production cluster we saw today:
> 10/06/21 13:01:10 ERROR exec.ExecDriver: Ended Job = job_201005200457_701060 with exception
> java.lang.NullPointerException
> at org.apache.hadoop.hive.ql.exec.Operator.updateCounters(Operator.java:999)
> at org.apache.hadoop.hive.ql.exec.ExecDriver.updateCounters(ExecDriver.java:503)
> at org.apache.hadoop.hive.ql.exec.ExecDriver.progress(ExecDriver.java:390)
> at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:697)
> at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:107)
> at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:55)
> at org.apache.hadoop.hive.ql.exec.TaskRunner.run(TaskRunner.java:47)

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message