hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-1861) Multi-Family support for bulk upload tools (HFileOutputFormat / loadtable.rb)
Date Thu, 17 Dec 2009 18:03:18 GMT

    [ https://issues.apache.org/jira/browse/HBASE-1861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12792031#action_12792031

stack commented on HBASE-1861:

Not to my knowledge.   Thinking on it, this case is a little tougher than the single family

1. In single family case, we just write single files and read the file metadata to create
region (We extract from the file its start and end rows and use these conjuring the region
description).  In the multiple family case, somehow you'll have to tie all files in a region
together -- perhaps in metadata or with a file suffix or prefix.  I was thinking that you'd
keep a running tab on the size of the file in each family and then as soon as any one file
went over the region maximum file size limit, you'd rotate all files.
2. The loadtables.rb script would need to change to read across all files in a region to find
the least first row and the maximum last row by looking at all file metadatas. 

If you want to discuss this issue more, put up some questions and I'll have a stab at them.

> Multi-Family support for bulk upload tools (HFileOutputFormat / loadtable.rb)
> -----------------------------------------------------------------------------
>                 Key: HBASE-1861
>                 URL: https://issues.apache.org/jira/browse/HBASE-1861
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: mapreduce
>    Affects Versions: 0.20.0
>            Reporter: Jonathan Gray
>             Fix For: 0.21.0
> Add multi-family support to bulk upload tools from HBASE-48.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message