hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Suresh Srinivas (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HDFS-1207) 0.20-append: stallReplicationWork should be volatile
Date Fri, 02 Sep 2011 22:02:09 GMT

     [ https://issues.apache.org/jira/browse/HDFS-1207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Suresh Srinivas updated HDFS-1207:
----------------------------------

    Fix Version/s: 0.20.205.0

I applied the attached patch to 0.20-security.

> 0.20-append: stallReplicationWork should be volatile
> ----------------------------------------------------
>
>                 Key: HDFS-1207
>                 URL: https://issues.apache.org/jira/browse/HDFS-1207
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: name-node
>    Affects Versions: 0.20-append
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>             Fix For: 0.20-append, 0.20.205.0
>
>         Attachments: hdfs-1207.txt
>
>
> the stallReplicationWork member in FSNamesystem is accessed by multiple threads without
synchronization, but isn't marked volatile. I believe this is responsible for about 1% failure
rate on TestFileAppend4.testAppendSyncChecksum* on my 8-core test boxes (looking at logs I
see replication happening even though we've supposedly disabled it)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message