hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Junping Du (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HDFS-7967) Reduce the performance impact of the balancer
Date Fri, 27 Apr 2018 02:08:00 GMT

     [ https://issues.apache.org/jira/browse/HDFS-7967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Junping Du updated HDFS-7967:
    Target Version/s: 2.8.5  (was: 2.8.4)

> Reduce the performance impact of the balancer
> ---------------------------------------------
>                 Key: HDFS-7967
>                 URL: https://issues.apache.org/jira/browse/HDFS-7967
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: namenode
>    Affects Versions: 2.0.0-alpha
>            Reporter: Daryn Sharp
>            Assignee: Daryn Sharp
>            Priority: Critical
>         Attachments: HDFS-7967-branch-2.8.patch, HDFS-7967-branch-2.patch, HDFS-7967.branch-2-1.patch,
HDFS-7967.branch-2.001.patch, HDFS-7967.branch-2.002.patch, HDFS-7967.branch-2.8-1.patch,
HDFS-7967.branch-2.8.001.patch, HDFS-7967.branch-2.8.002.patch, HDFS-7967.branch-2.8.003.patch
> The balancer needs to query for blocks to move from overly full DNs.  The block lookup
is extremely inefficient.  An iterator of the node's blocks is created from the iterators
of its storages' blocks.  A random number is chosen corresponding to how many blocks will
be skipped via the iterator.  Each skip requires costly scanning of triplets.
> The current design also only considers node imbalances while ignoring imbalances within
the nodes's storages.  A more efficient and intelligent design may eliminate the costly skipping
of blocks via round-robin selection of blocks from the storages based on remaining capacity.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org

View raw message