hadoop-yarn-issues mailing list archives

From "Andrey Klochkov (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-1183) MiniYARNCluster shutdown takes several minutes intermittently
Date Fri, 13 Sep 2013 04:39:57 GMT

    [ https://issues.apache.org/jira/browse/YARN-1183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13766237#comment-13766237 ]

Andrey Klochkov commented on YARN-1183:
---------------------------------------

bq. MiniYARNCluster is used by several tests. This might bite us if and when we run tests in parallel.
The concurrency level won't make any difference even in that case. BTW, I'm actually running the MR tests
in parallel now. That is when this issue with the cluster shutdown working incorrectly becomes
more evident.

Thanks for catching the problem with the synchronized block; I'm fixing it.
                
> MiniYARNCluster shutdown takes several minutes intermittently
> -------------------------------------------------------------
>
>                 Key: YARN-1183
>                 URL: https://issues.apache.org/jira/browse/YARN-1183
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Andrey Klochkov
>         Attachments: YARN-1183--n2.patch, YARN-1183.patch
>
>
> As described in MAPREDUCE-5501, M/R tests sometimes leave MRAppMaster Java processes alive
> for several minutes after the corresponding test completes successfully. A concurrency
> issue in the MiniYARNCluster shutdown logic leads to this: sometimes the RM stops before an
> app master sends its last report, and the app master then keeps retrying for >6 minutes.
> In some cases this causes failures in subsequent tests, and it slows the tests down
> because the lingering app masters consume resources.
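
The ordering problem described above can be illustrated with a minimal, self-contained sketch. This is not the YARN-1183 patch and it uses no Hadoop classes: `FakeResourceManager`, `acceptFinalReport`, and `stopAfterReports` are hypothetical names invented for illustration. The assumption is only that a shutdown path which waits (with a bound) for outstanding final reports before stopping avoids leaving a reporter retrying against a stopped service.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicBoolean;

public class ShutdownOrderSketch {

    /** Hypothetical stand-in for the RM; not a Hadoop class. */
    static final class FakeResourceManager {
        private final AtomicBoolean running = new AtomicBoolean(true);
        private final CountDownLatch pendingReports;

        FakeResourceManager(int appMasters) {
            pendingReports = new CountDownLatch(appMasters);
        }

        /** Called by an app master; returns false if the RM has already stopped. */
        boolean acceptFinalReport() {
            if (!running.get()) {
                return false;
            }
            pendingReports.countDown();
            return true;
        }

        /** Waits (bounded) for outstanding final reports, then stops. Returns true if all arrived. */
        boolean stopAfterReports(long timeout, TimeUnit unit) {
            boolean allReported = false;
            try {
                allReported = pendingReports.await(timeout, unit);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            running.set(false);
            return allReported;
        }
    }

    /** Runs one simulated app master against the fake RM. */
    static boolean demo() {
        FakeResourceManager rm = new FakeResourceManager(1);
        AtomicBoolean accepted = new AtomicBoolean(false);
        Thread appMaster = new Thread(() -> accepted.set(rm.acceptFinalReport()));
        appMaster.start();
        boolean allReported = rm.stopAfterReports(5, TimeUnit.SECONDS);
        try {
            appMaster.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
        return allReported && accepted.get();
    }

    public static void main(String[] args) {
        System.out.println("final report accepted before stop: " + demo());
    }
}
```

If the fake RM were stopped first, `acceptFinalReport` would return false, which corresponds to the case in which the real app master keeps retrying.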

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
