hbase-user mailing list archives

From Stack <st...@duboce.net>
Subject Re: How could I re-calculate every entries in hbase efficiently through mapreduce?
Date Fri, 25 Mar 2011 18:19:10 GMT
On Thu, Mar 24, 2011 at 7:36 PM, Stanley Xu <wenhao.xu@gmail.com> wrote:
> But I have two doubts here:
> 1. It looks the partitioner will do a lots of shuffling, I am wondering why
> it couldn't just do the put on the local region since the read and write on
> the same entry should be on the same region, isn't it?
>

There is a little on this in the mapreduce package javadoc:
http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/package-summary.html#sink


> 2. If the job failed for any reason(like timeout), the HBase might be in a
> partial-updated status, is it?
>

This is a problem you'll have to figure out.  Turn off speculative execution, for one.
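
A rough sketch of what turning off speculative execution could look like,
assuming the Hadoop 0.20-era property names (newer releases call them
mapreduce.map.speculative and mapreduce.reduce.speculative). The class and
method names are made up for illustration and are not part of Hadoop or HBase.
With speculation on, two attempts of the same task can both issue their Puts,
which makes a half-finished job harder to reason about.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper for illustration only; not a Hadoop or HBase class.
final class SpeculativeExecutionOff {

    // Assumed property names: the Hadoop 0.20-era keys that control
    // speculative execution for map and reduce tasks.
    static Map<String, String> settings() {
        Map<String, String> props = new LinkedHashMap<>();
        props.put("mapred.map.tasks.speculative.execution", "false");
        props.put("mapred.reduce.tasks.speculative.execution", "false");
        return props;
    }

    // Renders the settings as generic "-D key=value" options, which a job
    // launched through ToolRunner / GenericOptionsParser will pick up.
    static List<String> asGenericOptions(Map<String, String> props) {
        List<String> args = new ArrayList<>();
        for (Map.Entry<String, String> e : props.entrySet()) {
            args.add("-D");
            args.add(e.getKey() + "=" + e.getValue());
        }
        return args;
    }

    public static void main(String[] argv) {
        // Prints the flags to paste into a "hadoop jar ..." command line.
        System.out.println(String.join(" ", asGenericOptions(settings())));
    }
}
```

If the job is configured in code rather than on the command line, the same
key/value pairs can be applied with Configuration.set(key, value) before the
job is submitted.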

St.Ack
