hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Virajith Jalaparti (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HDFS-12809) [READ] Fix the randomized selection of locations in {{ProvidedBlocksBuilder}}.
Date Mon, 13 Nov 2017 22:56:00 GMT

     [ https://issues.apache.org/jira/browse/HDFS-12809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Virajith Jalaparti updated HDFS-12809:
--------------------------------------
    Description: Calling {{getBlockLocations}} on files that have a PROVIDED replica, results
in the datanode locations being selected at random. Currently, this randomization uses the
datanode uuids to pick a node at random ({{ProvidedDescriptor#choose}}, {{ProvidedDescriptor#chooseRandom}}).
Depending on the distribution of the datanode UUIDs, this can lead to large number of iterations
(which may not terminate) before a location is chosen. This JIRA aims to replace this with
a more efficient randomization strategy.  (was: Calling {{getBlockLocations}} on files that
have a PROVIDED replica, results in the datanode locations being selected at random. Currently,
this randomization uses the datanode uuids to pick a node at random ({{ProvidedDescriptor#choose}},
{{ProvidedDescriptor#chooseRandom}}). Depending on the distribution of the datanode UUIDs,
this can lead to large number of iterations before a location is chosen. This JIRA aims to
replace this with a more efficient randomization strategy.)

> [READ] Fix the randomized selection of locations in {{ProvidedBlocksBuilder}}.
> ------------------------------------------------------------------------------
>
>                 Key: HDFS-12809
>                 URL: https://issues.apache.org/jira/browse/HDFS-12809
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Virajith Jalaparti
>
> Calling {{getBlockLocations}} on files that have a PROVIDED replica, results in the datanode
locations being selected at random. Currently, this randomization uses the datanode uuids
to pick a node at random ({{ProvidedDescriptor#choose}}, {{ProvidedDescriptor#chooseRandom}}).
Depending on the distribution of the datanode UUIDs, this can lead to large number of iterations
(which may not terminate) before a location is chosen. This JIRA aims to replace this with
a more efficient randomization strategy.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org


Mime
View raw message