Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: core-dev@hadoop.apache.org
Message-ID: <1747012991.1215036885198.JavaMail.jira@brutus>
Date: Wed, 2 Jul 2008 15:14:45 -0700 (PDT)
From: "Koji Noguchi (JIRA)" <jira@apache.org>
To: core-dev@hadoop.apache.org
Subject: [jira] Created: (HADOOP-3685) Unbalanced replication target
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit

Unbalanced replication target 
------------------------------

                 Key: HADOOP-3685
                 URL: https://issues.apache.org/jira/browse/HADOOP-3685
             Project: Hadoop Core
          Issue Type: Bug
          Components: dfs
    Affects Versions: 0.17.0
            Reporter: Koji Noguchi
            Priority: Critical


In HADOOP-3633, namenode was assigning some datanodes to receive  hundreds of blocks in a short period which caused datanodes to go out of memroy(threads).
Most of them were from remote rack.

Looking at the code, 

{noformat}
    166           chooseLocalRack(results.get(1), excludedNodes, blocksize,
    167                           maxNodesPerRack, results);
{noformat}

was sometimes not choosing the local rack of the writer(source).  

As a result, when a datanode goes down, other datanodes on the same rack were getting large number of blocks from remote racks.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.