hbase-user mailing list archives

From Calvin <calvin.li...@gmail.com>
Subject hbase bulk writes
Date Mon, 30 Nov 2009 22:33:40 GMT
I have a large number of sequentially ordered rows that I would like to write
to an HBase table.  What is the preferred way to do bulk writes of multi-column
tables in HBase?  Using the get/put interface seems fairly slow, even when I
batch writes with table.put(List<Put>).
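
A minimal sketch of the batching pattern described above, kept free of any
HBase dependency so it runs on its own.  BatchingWriter, its batchSize
parameter, and the sink callback are hypothetical names introduced for
illustration only; the sink stands in for a call such as
table.put(List<Put>), and this is not presented as the HBase client's own
buffering.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

/**
 * Hypothetical helper: buffers rows and hands them to a sink in fixed-size
 * batches. The sink is a stand-in for something like table.put(List<Put>).
 */
class BatchingWriter<T> {
    private final int batchSize;
    private final Consumer<List<T>> sink;
    private final List<T> buffer = new ArrayList<>();
    private int flushCount = 0;

    BatchingWriter(int batchSize, Consumer<List<T>> sink) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        this.batchSize = batchSize;
        this.sink = sink;
    }

    /** Adds one row; flushes automatically once the buffer is full. */
    void add(T row) {
        buffer.add(row);
        if (buffer.size() >= batchSize) {
            flush();
        }
    }

    /** Sends any buffered rows to the sink as one batch. */
    void flush() {
        if (buffer.isEmpty()) {
            return;
        }
        // Pass a copy so the sink may keep the list after the buffer is cleared.
        sink.accept(new ArrayList<>(buffer));
        buffer.clear();
        flushCount++;
    }

    /** Number of batches handed to the sink so far. */
    int getFlushCount() {
        return flushCount;
    }
}
```

The caller adds rows one at a time and calls flush() once at the end so the
final partial batch is not lost.  A suitable batch size depends on row size
and cluster setup and would have to be measured.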

I have followed the directions on:
   * http://wiki.apache.org/hadoop/PerformanceTuning
   * http://ryantwopointoh.blogspot.com/2009/01/performance-of-hbase-importing.html

Are there any other resources for improving the throughput of my bulk
writes?  On
http://hadoop.apache.org/hbase/docs/current/api/org/apache/hadoop/hbase/mapreduce/package-summary.html
I see there's a way to write HFiles directly, but HFileOutputFormat can only
write a single column family at a time
(https://issues.apache.org/jira/browse/HBASE-1861).

Thanks!

-Calvin
