hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Anubhav Dhoot (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (YARN-2096) testQueueMetricsOnRMRestart has race condition
Date Fri, 23 May 2014 06:53:02 GMT

     [ https://issues.apache.org/jira/browse/YARN-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Anubhav Dhoot updated YARN-2096:

    Attachment: YARN-2096.patch

Fixed 2 race conditions by
First one) waiting for appropriate transitions before checking metrics and
 Second one) resetting metrics before the events are triggered.

> testQueueMetricsOnRMRestart has race condition
> ----------------------------------------------
>                 Key: YARN-2096
>                 URL: https://issues.apache.org/jira/browse/YARN-2096
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Anubhav Dhoot
>            Assignee: Anubhav Dhoot
>         Attachments: YARN-2096.patch
> org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart.testQueueMetricsOnRMRestart
fails randomly because of a race condition.
> The test validates that metrics are incremented, but does not wait for all transitions
to finish before checking for the values.
> It also resets metrics after kicking off recovery of second RM. The metrics that need
to be incremented race with this reset causing test to fail randomly.
> We need to wait for the right transitions.

This message was sent by Atlassian JIRA

View raw message