hadoop-mapreduce-issues mailing list archives

From "Jason Lowe (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAPREDUCE-4730) AM crashes due to OOM while serving up map task completion events
Date Wed, 24 Oct 2012 16:32:13 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-4730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jason Lowe updated MAPREDUCE-4730:

    Status: Open  (was: Patch Available)

Thanks for the review, Vinod.  I'll work on a test case for EventFetcher.

bq. I don't remember us running into it when we ran AMScalability. Can you file a bug, more
details will be great to have.

I'll run the test case again and see if I can get more details on what is causing the connect
timeouts for tasks when they are launched en masse.  Is AMScalability capable of emulating
the kind of simultaneous connect storm that a large cluster will exhibit?
> AM crashes due to OOM while serving up map task completion events
> -----------------------------------------------------------------
>                 Key: MAPREDUCE-4730
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4730
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: applicationmaster, mrv2
>    Affects Versions: 0.23.3
>            Reporter: Jason Lowe
>            Assignee: Jason Lowe
>            Priority: Blocker
>         Attachments: MAPREDUCE-4730.patch, MAPREDUCE-4730.patch
> We're seeing a repeatable OOM crash in the AM for a task with around 30000 maps and 3000
> reducers.  Details to follow.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
