hadoop-hdfs-issues mailing list archives

From "Benoy Antony (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-6441) Add ability to exclude/include few datanodes while balancing
Date Mon, 28 Jul 2014 01:14:38 GMT

    [ https://issues.apache.org/jira/browse/HDFS-6441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14075821#comment-14075821
] 

Benoy Antony commented on HDFS-6441:
------------------------------------

Updated the patch to include the changes from HDFS-6010.
The main change is the ability to specify the datanodes to include or exclude, either on the
command line or in a file.
The second change is the ability to specify a datanode by host name or IP address, with
or without a port.
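
To illustrate the second change, here is a minimal sketch of how a host entry given with or without a port might be matched against a datanode. It is not the code in the patch: the names `HostSpec`, `parse_host_spec` and `matches` are hypothetical, and the sketch assumes IPv4 addresses or host names only (IPv6 literals are not handled).

```python
from typing import NamedTuple, Optional


class HostSpec(NamedTuple):
    """A user-supplied entry: host name or IP, optionally with a port."""
    host: str
    port: Optional[int]


def parse_host_spec(spec: str) -> HostSpec:
    """Parse 'host', 'host:port', 'ip' or 'ip:port' (hypothetical helper)."""
    spec = spec.strip()
    host, sep, port = spec.rpartition(":")
    # Treat the text after the last ':' as a port only if it is numeric.
    if sep and host and port.isdigit():
        return HostSpec(host, int(port))
    return HostSpec(spec, None)


def matches(spec: HostSpec, host_name: str, ip_addr: str, port: int) -> bool:
    """Return True if the entry refers to the given datanode.

    ASSUMPTION: an entry without a port matches the datanode on any port.
    """
    if spec.host not in (host_name, ip_addr):
        return False
    return spec.port is None or spec.port == port
```

Under these assumptions, `parse_host_spec("10.0.0.5")` matches a datanode at 10.0.0.5 regardless of its port, while `parse_host_spec("10.0.0.5:50010")` matches only when the port is also 50010.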

Datanodes can be excluded while balancing as follows:
{panel}
run-balancer.sh -exclude -f <hosts-file>
{panel}
OR
{panel}
run-balancer.sh -exclude comma-separated-list-of-hosts
{panel}
The specified datanodes will not be used while balancing.

Specific datanodes can be included while balancing as follows:
{panel}
run-balancer.sh -include -f <hosts-file> 
{panel}
OR
{panel}
run-balancer.sh -include comma-separated-list-of-hosts
{panel}
With this, only the specified datanodes will be used for balancing.
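
The include/exclude semantics described above can be sketched as a simple filter. This is only an illustration, not the balancer's actual implementation: the function names are hypothetical, the hosts file is assumed to contain one host per line, and the behaviour when both lists are given together is an assumption (excluded nodes are dropped from the included set).

```python
def parse_host_list(arg):
    """Split a comma-separated list of hosts, ignoring blank entries."""
    return [h.strip() for h in arg.split(",") if h.strip()]


def read_hosts_file_text(text):
    """Read hosts from file contents.

    ASSUMPTION: one host per line, blank lines ignored.
    """
    return [line.strip() for line in text.splitlines() if line.strip()]


def nodes_to_balance(all_nodes, include=(), exclude=()):
    """Return the datanodes that take part in balancing.

    An empty include list means "all nodes"; excluded nodes are never used.
    """
    include, exclude = set(include), set(exclude)
    return [
        node for node in all_nodes
        if (not include or node in include) and node not in exclude
    ]
```

For example, with datanodes `dn1`, `dn2` and `dn3`, excluding `"dn2,dn3"` leaves only `dn1` for balancing, and including a file that lists `dn1` and `dn3` restricts balancing to those two nodes.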


> Add ability to exclude/include few datanodes while balancing
> ------------------------------------------------------------
>
>                 Key: HDFS-6441
>                 URL: https://issues.apache.org/jira/browse/HDFS-6441
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: balancer
>    Affects Versions: 2.4.0
>            Reporter: Benoy Antony
>            Assignee: Benoy Antony
>         Attachments: HDFS-6441.patch, HDFS-6441.patch, HDFS-6441.patch, HDFS-6441.patch,
> HDFS-6441.patch, HDFS-6441.patch, HDFS-6441.patch, HDFS-6441.patch, HDFS-6441.patch, HDFS-6441.patch,
> HDFS-6441.patch
>
>
> In some use cases, it is desirable to ignore a few data nodes while balancing. The administrator
> should be able to specify a list of data nodes in a file similar to the hosts file and the
> balancer should ignore these data nodes while balancing so that no blocks are added/removed
> on these nodes.
> Similarly it will be beneficial to specify that only a particular list of datanodes should
> be considered for balancing.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
