hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Cosmin Lehene <cleh...@adobe.com>
Subject Re: map/reduce locality
Date Wed, 26 Nov 2008 11:36:23 GMT
It doesn't currently do that. However this seems to be on HBase roadmap. See Data-Locality

The Hadoop map reduce framework does makes a best effort at running tasks on the server hosting
the task data after the dictum that its cheaper moving the processing to the data rather than
the inverse. HBase needs smarts to assign regions to the region server that is running on
the server hosting the regions' data. HBase needs to supply map reduce hints such that the
Hadoop framework runs tasks beside the region server hosting the task input. These changes
will make for savings in network I/O.



On 11/26/08 1:32 PM, "David Faitelson" <david@proactivemodeling.com> wrote:


Does HBase/Hadoop create map tasks on the same data node that
contains the region for the map task?

I know that Bigtable does something like that but I could not find
any mention of this optimization in the documentation of HBase.


View raw message