hadoop-common-user mailing list archives

From "moonwatcher32329@yahoo.com" <moonwatcher32...@yahoo.com>
Subject block distribution with varying disk sizes
Date Tue, 09 Sep 2008 15:05:01 GMT

Does Hadoop distribute blocks according to how many blocks a node currently holds, or according
to how much disk space the node currently has free?
Suppose I have many machines with identical CPUs but different disk sizes. If blocks
are distributed according to remaining disk space, then the nodes with larger disks would
store more data. Would this cause performance problems during the map phase?
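
To make the two alternatives concrete, here is a toy Python sketch of what I mean. It is purely illustrative: the function names, the capacities, and both greedy policies are hypothetical, and neither is claimed to be what Hadoop actually does. Capacities are measured in blocks for simplicity.

```python
def place_by_block_count(capacities, num_blocks):
    """Hypothetical policy A: each new block goes to the node holding the fewest blocks."""
    counts = [0] * len(capacities)
    for _ in range(num_blocks):
        # Ties are broken in favour of the lowest node index.
        target = min(range(len(counts)), key=lambda n: counts[n])
        counts[target] += 1
    return counts


def place_by_free_space(capacities, num_blocks):
    """Hypothetical policy B: each new block goes to the node with the most free space."""
    counts = [0] * len(capacities)
    for _ in range(num_blocks):
        # Free space is capacity minus blocks already stored, both in blocks.
        target = max(range(len(counts)), key=lambda n: capacities[n] - counts[n])
        counts[target] += 1
    return counts


# Assumed cluster: two small-disk nodes and one large-disk node, identical CPUs.
capacities = [100, 100, 400]

by_count = place_by_block_count(capacities, 300)
by_space = place_by_free_space(capacities, 300)

print("by block count:", by_count)
print("by free space: ", by_space)
```

In this toy setup, policy A spreads the 300 blocks evenly across the three nodes, while policy B puts all of them on the large-disk node until its free space drops to the level of the others. With identical CPUs, that second layout is the case I am worried about, since most of the input data would be local to a single machine.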
