hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "haosdent (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HBASE-9537) completebulkload does 'copy' StoreFiles instead of 'cut'
Date Wed, 19 Feb 2014 11:31:21 GMT

     [ https://issues.apache.org/jira/browse/HBASE-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

haosdent updated HBASE-9537:

    Attachment: HBASE-9537.patch

> completebulkload does 'copy' StoreFiles instead of 'cut'
> --------------------------------------------------------
>                 Key: HBASE-9537
>                 URL: https://issues.apache.org/jira/browse/HBASE-9537
>             Project: HBase
>          Issue Type: Bug
>          Components: HFile, mapreduce, regionserver
>    Affects Versions: 0.94.11
>            Reporter: M. BagherEsmaeily
>         Attachments: HBASE-9537.patch, LoadIncrementalHFiles.log, region.log
> I was using HBase complete bulk load to transfer the output of ImportTsv to a table in
HBase, and I noticed that it copies the output instead of cutting. This takes long time for
my gigabytes of data.
> In HBase documentation (http://hbase.apache.org/book/ops_mgt.html#completebulkload) I
read that the files would be moved not copied. Can anyone help me with this?
> I use Hbase 0.94.11 and Hadoop 1.2.1. The file system of bulkload output directory and
hbase cluster are the same, too.
> I've also coded a MapReduce job using HFileOutputFormat. When I use LoadIncrementalHFiles
to move the output of my job to HBase table, it still copies instead of cut.

This message was sent by Atlassian JIRA

View raw message