hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bryant Khau (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HBASE-13034) Importing rows with bulkupload can overload single regionservers
Date Thu, 12 Feb 2015 22:15:16 GMT
Bryant Khau created HBASE-13034:
-----------------------------------

             Summary: Importing rows with bulkupload can overload single regionservers
                 Key: HBASE-13034
                 URL: https://issues.apache.org/jira/browse/HBASE-13034
             Project: HBase
          Issue Type: Improvement
          Components: hbase, mapreduce, regionserver
    Affects Versions: 0.98.0
            Reporter: Bryant Khau
            Priority: Minor


Exporting a table with a common schema, like hashes as the key, will result in a sorted exported
file. When imported with org.apache.hadoop.hbase.mapreduce.Import, region servers can be overloaded
one by one by requests by the MapReduce job, since the rows are imported in sequential order,
and a regions span ranges in sequential order. This is more likely to happen with lots of
data and not a lot of regions. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message