hadoop-hdfs-dev mailing list archives

From "Steve Loughran (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HDFS-2966) TestNameNodeMetrics tests can fail under load
Date Fri, 17 Feb 2012 18:07:59 GMT
TestNameNodeMetrics tests can fail under load

                 Key: HDFS-2966
                 URL: https://issues.apache.org/jira/browse/HDFS-2966
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: test
    Affects Versions: 0.24.0
         Environment: OS/X running intellij IDEA, firefox, winxp in a virtualbox.
            Reporter: Steve Loughran
            Priority: Minor

I've managed to recreate HDFS-540 and HDFS-2434 by the simple technique of running the HDFS
tests on a desktop without enough memory for all the programs trying to run. Processes got swapped
out and the tests failed because the DN heartbeats didn't come in on time.

Both tests rely on {{waitForDeletion()}} to block until the delete operation
has completed, but all it does is sleep for the same number of seconds as there are datanodes.
This is too brittle: it may work on a lightly loaded system, but not on a system under heavy
load where replication takes longer than expected.

Immediate fix: double or triple the sleep time?
Better fix: have the thread block until all the DN heartbeats have finished.
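
One way to approximate the "better fix" is to replace the fixed sleep with a polling wait that returns as soon as a condition holds and fails loudly after a timeout. The sketch below is only an illustration: {{PollingWait}} and {{waitFor()}} are hypothetical names, not part of the Hadoop test code, and the condition a real test would poll (for example, that the metrics reflect the completed deletion) is not specified in this report.

```java
import java.util.concurrent.TimeoutException;
import java.util.function.BooleanSupplier;

/** Hypothetical helper: poll a condition instead of sleeping for a fixed time. */
public final class PollingWait {
    private PollingWait() {}

    /**
     * Blocks until condition returns true, checking every intervalMillis.
     * Throws TimeoutException if the condition is still false after timeoutMillis.
     */
    public static void waitFor(BooleanSupplier condition,
                               long intervalMillis,
                               long timeoutMillis)
            throws InterruptedException, TimeoutException {
        // Use the monotonic clock so wall-clock adjustments do not affect the deadline.
        long deadline = System.nanoTime() + timeoutMillis * 1_000_000L;
        while (!condition.getAsBoolean()) {
            if (System.nanoTime() - deadline >= 0) {
                throw new TimeoutException(
                    "Condition not met within " + timeoutMillis + " ms");
            }
            Thread.sleep(intervalMillis);
        }
    }
}
```

On a lightly loaded machine this returns almost immediately; on a heavily loaded one it keeps waiting up to the timeout, so the timeout can be generous without slowing down the normal case.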
