hbase-user mailing list archives

From Jason Rutherglen <jason.rutherg...@gmail.com>
Subject MapReduce job reading directly from the HBase files in HDFS
Date Fri, 06 May 2011 19:27:20 GMT
Is there an open issue for this, or any particular reason that an MR job needs
to read HBase data through the region server? It seems possible to also
provide functionality that lets MR execute directly over the HFile(s) stored
in HDFS, giving performance comparable to typical MR jobs that execute
against files in HDFS.

Jason
