hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Kannan Muthukkaruppan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-3967) Support deletes in HFileOutputFormat based bulk import mechanism
Date Sat, 28 Apr 2012 08:10:45 GMT

    [ https://issues.apache.org/jira/browse/HBASE-3967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13264252#comment-13264252

Kannan Muthukkaruppan commented on HBASE-3967:

The point of the JIRA was to really just provide a way to be able to bulk import delete mutations
in addition to put mutations. We solved this on 89-fb branch by introducing a RowMutation
(which extends Row) and its constructor can take a "Put" or "Delete". And by using a RowMutationSortReducer
(that is variant of PutSortReducer, except that it handles Deletes & Puts). I will dig
up the commit revs on 89-fb branch and try to post the links shortly for you to take a look.
Unless there are any technical objections, we should just port the same approach to trunk.
> Support deletes in HFileOutputFormat based bulk import mechanism
> ----------------------------------------------------------------
>                 Key: HBASE-3967
>                 URL: https://issues.apache.org/jira/browse/HBASE-3967
>             Project: HBase
>          Issue Type: Sub-task
>            Reporter: Kannan Muthukkaruppan
>            Priority: Critical
>             Fix For: 0.96.0
>         Attachments: diff.patch
> During bulk imports, it'll be useful to be able to do delete mutations (either to delete
data that already exists in HBase or was inserted earlier during this run of the import).

> For example, we have a use case, where we are processing a log of data which may have
both inserts and deletes in the mix and we want to upload that into HBase using the bulk import

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message