hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] Created: (HADOOP-2075) [hbase] Bulk load and dump tools
Date Thu, 18 Oct 2007 18:33:51 GMT
[hbase] Bulk load and dump tools

                 Key: HADOOP-2075
                 URL: https://issues.apache.org/jira/browse/HADOOP-2075
             Project: Hadoop
          Issue Type: New Feature
          Components: contrib/hbase
            Reporter: stack
            Priority: Minor

Hbase needs tools to facilitate bulk upload and possibly dumping.  Going via the current APIs,
particularly if the dataset is large and cell content is small, uploads can take a long time
even when using many concurrent clients.

PNUTS folks talked of need for a different API to manage bulk upload/dump.

Another notion would be to somehow have the bulk loader tools somehow write regions directly
in hdfs.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message