hadoop-mapreduce-issues mailing list archives

From "Koji Noguchi (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAPREDUCE-3688) Need better Error message if AM is killed/throws exception
Date Wed, 06 Mar 2013 14:32:13 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-3688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Koji Noguchi updated MAPREDUCE-3688:
------------------------------------

    Attachment: mapreduce-3688-h0.23-v02.patch

Another common error is the ApplicationMaster running out of memory when the number of tasks is large.
This patch adds the error message to stdout so that the OOM shows up in the diagnostics.

{quote}
Diagnostics:	 Application application_1362579399138_0003 failed 1 times due to AM Container
for appattempt_1362579399138_0003_000001 exited with exitCode: 255 due to: Error starting
MRAppMaster: java.lang.OutOfMemoryError: Java heap space at 
{quote}

I forgot to mention: having these messages in the UI also means they show up on the jobclient
(console) side as well.
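
A minimal sketch of the idea in the comment above, not the attached patch: a launcher catches any Throwable raised during startup, writes a one-line "Error starting ..." summary to stdout, and returns a non-zero exit code. The class and method names (AmLauncherSketch, describeFailure, runAndReport) are hypothetical, the OutOfMemoryError is simulated, and the exit code 1 is arbitrary.

```java
import java.io.PrintStream;

// Hypothetical sketch; names are illustrative and are not Hadoop APIs.
class AmLauncherSketch {

    /** Builds a one-line summary from the component name and the Throwable's toString(). */
    static String describeFailure(String component, Throwable t) {
        return "Error starting " + component + ": " + t;
    }

    /** Runs the startup action; on any Throwable prints the summary and returns 1. */
    static int runAndReport(String component, Runnable startup, PrintStream out) {
        try {
            startup.run();
            return 0;
        } catch (Throwable t) { // Throwable also covers Errors such as OutOfMemoryError
            out.println(describeFailure(component, t));
            return 1; // arbitrary non-zero code chosen for this sketch
        }
    }

    public static void main(String[] args) {
        int code = runAndReport("MRAppMaster", () -> {
            // Simulated failure; a real OOM would be raised by the JVM itself.
            throw new OutOfMemoryError("Java heap space");
        }, System.out);
        System.exit(code);
    }
}
```

Writing the summary to stdout matters only if whatever collects the container output forwards it into the diagnostics, which is what the comment above describes for this patch.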
                
> Need better Error message if AM is killed/throws exception
> ----------------------------------------------------------
>
>                 Key: MAPREDUCE-3688
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3688
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mr-am, mrv2
>    Affects Versions: 0.23.1
>            Reporter: David Capwell
>            Assignee: Sandy Ryza
>             Fix For: 0.23.2
>
>         Attachments: mapreduce-3688-h0.23-v01.patch, mapreduce-3688-h0.23-v02.patch
>
>
> We need better error messages in the UI if the AM gets killed or throws an Exception.
> If the following error gets thrown: 
> java.lang.NumberFormatException: For input string: "9223372036854775807l" // last char is an L
> then the UI should show this exception.  Instead I get the following:
> Application application_1326504761991_0018 failed 1 times due to AM Container for appattempt_1326504761991_0018_000001
> exited with exitCode: 1 due to: Exception from container-launch: org.apache.hadoop.util.Shell$ExitCodeException
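
The exception quoted in the description can be reproduced in isolation. The sketch below assumes the value was parsed with Long.parseLong, which the report does not state; the class and method names are hypothetical. Long.parseLong accepts only digits and an optional leading sign, so a trailing "l" (valid in a Java source literal) is rejected.

```java
// Hypothetical sketch; the class and method names are illustrative only.
class TrailingLSketch {

    /** Returns the parsed value as text, or the exception's toString() if parsing fails. */
    static String parseOrDescribe(String raw) {
        try {
            return String.valueOf(Long.parseLong(raw));
        } catch (NumberFormatException e) {
            // This is the text a UI would need to surface for the user to see the cause.
            return e.toString();
        }
    }

    public static void main(String[] args) {
        System.out.println(parseOrDescribe("9223372036854775807"));  // parses as Long.MAX_VALUE
        System.out.println(parseOrDescribe("9223372036854775807l")); // trailing 'l' is rejected
    }
}
```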

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
