hadoop-hdfs-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aaron T. Myers (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HDFS-5064) Standby checkpoints should not block concurrent readers
Date Sat, 03 Aug 2013 00:25:48 GMT
Aaron T. Myers created HDFS-5064:

             Summary: Standby checkpoints should not block concurrent readers
                 Key: HDFS-5064
                 URL: https://issues.apache.org/jira/browse/HDFS-5064
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: ha, namenode
    Affects Versions: 2.1.1-beta
            Reporter: Aaron T. Myers
            Assignee: Aaron T. Myers

We've observed an issue which causes fetches of the {{/jmx}} page of the NN to take a long
time to load when the standby is in the process of creating a checkpoint.

Even though both creating the checkpoint and gathering the statistics for {{/jmx}} take only
the FSNS read lock, the issue is that since the FSNS uses a _fair_ RW lock, a single writer
attempting to get the lock will block all threads attempting to get only the read lock for
the duration of the checkpoint. This will cause {{/jmx}}, and really any thread only attempting
to get the read lock, to block for the duration of the checkpoint, even though they should
be able to proceed concurrently with the checkpointing thread.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message