hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Todd Lipcon (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-4552) multi-CF bulk load is not atomic across column families
Date Mon, 17 Oct 2011 17:20:11 GMT

    [ https://issues.apache.org/jira/browse/HBASE-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13129013#comment-13129013
] 

Todd Lipcon commented on HBASE-4552:
------------------------------------

The trick is making sure it's atomic inside the region server - not just that the client sends
all of the files for a given region in one RPC. If there are any concurrent scanners, then
they should either see all of the new data or none of the new data on a given row. So we need
some region-wide coordination. I think probably we have to take a write-lock on HRegion#lock
                
> multi-CF bulk load is not atomic across column families
> -------------------------------------------------------
>
>                 Key: HBASE-4552
>                 URL: https://issues.apache.org/jira/browse/HBASE-4552
>             Project: HBase
>          Issue Type: Bug
>          Components: regionserver
>    Affects Versions: 0.92.0
>            Reporter: Todd Lipcon
>             Fix For: 0.92.0
>
>
> Currently the bulk load API simply imports one HFile at a time. With multi-column-family
support, this is inappropriate, since different CFs show up separately. Instead, the IPC endpoint
should take a of CF -> HFiles, so we can online them all under a single region-wide lock.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message