hadoop-common-dev mailing list archives

From "stack (JIRA)" <j...@apache.org>
Subject [jira] Created: (HADOOP-2075) [hbase] Bulk load and dump tools
Date Thu, 18 Oct 2007 18:33:51 GMT
[hbase] Bulk load and dump tools
--------------------------------

                 Key: HADOOP-2075
                 URL: https://issues.apache.org/jira/browse/HADOOP-2075
             Project: Hadoop
          Issue Type: New Feature
          Components: contrib/hbase
            Reporter: stack
            Priority: Minor


Hbase needs tools to facilitate bulk upload and possibly bulk dump.  With the current APIs,
uploads can take a long time even when using many concurrent clients, particularly if the
dataset is large and cell content is small.

The PNUTS folks talked of the need for a different API to manage bulk upload/dump.

Another notion would be to have the bulk loader tools write regions directly
in HDFS.
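
To make the "write regions directly" notion concrete, here is a rough sketch of the
partitioning step such a tool might start with: sort the cells by row key, then cut the
sorted run into contiguous key ranges, one per would-be region.  Everything in it is an
assumption for illustration only (the cell tuple layout, the partition_into_regions name,
the max_cells_per_region threshold); it is not an hbase API, and it does not write any
files to HDFS.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical cell layout for illustration: (row key, column, value).
Cell = Tuple[str, str, str]


def partition_into_regions(
    cells: Iterable[Cell], max_cells_per_region: int
) -> List[Dict]:
    """Sort cells by (row, column) and cut them into contiguous key ranges.

    Each returned dict stands in for one region: its first and last row key
    plus the sorted cells that a loader could then write out as one file.
    """
    if max_cells_per_region < 1:
        raise ValueError("max_cells_per_region must be at least 1")

    ordered = sorted(cells, key=lambda c: (c[0], c[1]))
    groups: List[List[Cell]] = []
    current: List[Cell] = []
    for cell in ordered:
        # Only cut at a row boundary so a single row never spans two regions.
        if len(current) >= max_cells_per_region and cell[0] != current[-1][0]:
            groups.append(current)
            current = []
        current.append(cell)
    if current:
        groups.append(current)

    return [
        {"start_key": g[0][0], "end_key": g[-1][0], "cells": g} for g in groups
    ]


if __name__ == "__main__":
    sample = [
        ("r3", "c", "v3"),
        ("r1", "c", "v1"),
        ("r2", "c", "v2"),
        ("r2", "d", "v2b"),
        ("r4", "c", "v4"),
    ]
    for region in partition_into_regions(sample, max_cells_per_region=2):
        print(region["start_key"], region["end_key"], len(region["cells"]))
```

A real tool would still have to write each range in whatever on-disk format the region
servers expect and register the new regions with the master; the sketch only covers the
sort-and-split idea.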



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

