hadoop-hdfs-issues mailing list archives

From "Yongjun Zhang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-6133) Make Balancer support exclude specified path
Date Tue, 10 Feb 2015 04:14:40 GMT

    [ https://issues.apache.org/jira/browse/HDFS-6133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14313513#comment-14313513 ]

Yongjun Zhang commented on HDFS-6133:

Hi [~zhaoyunjiong],

Nice work! Thanks, Nicholas, for the review; the patch looks good to me too. One nit:

{{DFSOutputStream#getPinnings}} has a nested loop, so the search cost is NumNodesInPipeline
* NumFavoredNodes. From the application's point of view, I wonder how many favoredNodes there
could be on a big cluster. If needed, we could build a hash set in {{setFavoredNodes}}
for quicker lookup. This may not be needed if the number is small. In any case, I don't think
we need to make this change in this jira; it is just something to watch out for.
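
To illustrate the trade-off raised in the nit, here is a minimal sketch that compares a nested-loop search with a hash-set lookup. It is not the code from the patch: the class `PinningSketch`, both method names, and the use of plain strings for datanode addresses are assumptions made for illustration only.

```java
import java.util.Set;

// Hypothetical sketch; NOT the actual DFSOutputStream#getPinnings code.
// Datanodes are represented as plain address strings for simplicity.
final class PinningSketch {

    // Nested-loop version: up to pipeline.length * favored.length comparisons.
    static boolean[] pinningsNested(String[] pipeline, String[] favored) {
        boolean[] pinned = new boolean[pipeline.length];
        for (int i = 0; i < pipeline.length; i++) {
            for (String f : favored) {
                if (pipeline[i].equals(f)) {
                    pinned[i] = true;
                    break; // no need to scan the remaining favored nodes
                }
            }
        }
        return pinned;
    }

    // Set-based version: the set is built once (for example, when the favored
    // nodes are first supplied), then each pipeline node costs one lookup.
    static boolean[] pinningsWithSet(String[] pipeline, Set<String> favoredSet) {
        boolean[] pinned = new boolean[pipeline.length];
        for (int i = 0; i < pipeline.length; i++) {
            pinned[i] = favoredSet.contains(pipeline[i]);
        }
        return pinned;
    }
}
```

Both methods return the same result. The set-based one swaps the inner scan for an expected constant-time lookup, which only matters if the favored-node list grows large.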


> Make Balancer support exclude specified path
> --------------------------------------------
>                 Key: HDFS-6133
>                 URL: https://issues.apache.org/jira/browse/HDFS-6133
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: balancer & mover, namenode
>            Reporter: zhaoyunjiong
>            Assignee: zhaoyunjiong
>         Attachments: HDFS-6133-1.patch, HDFS-6133-10.patch, HDFS-6133-2.patch, HDFS-6133-3.patch,
>                      HDFS-6133-4.patch, HDFS-6133-5.patch, HDFS-6133-6.patch, HDFS-6133-7.patch,
>                      HDFS-6133-8.patch, HDFS-6133-9.patch, HDFS-6133.patch
> Currently, running the Balancer destroys the RegionServer's data locality.
> If getBlocks could exclude blocks belonging to files that have a specific path prefix, such as
> "/hbase", then we could run the Balancer without destroying the RegionServer's data locality.

This message was sent by Atlassian JIRA
