hbase-issues mailing list archives

From "yi liang (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HBASE-3727) MultiHFileOutputFormat
Date Thu, 14 Jul 2016 18:48:21 GMT

     [ https://issues.apache.org/jira/browse/HBASE-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

yi liang updated HBASE-3727:
----------------------------
    Release Note: 
MultiHFileOutputFormat supports writing HFiles for multiple tables. It outputs directories
and HFiles as follows:
     --table1
       --family1
       --family2
         --Hfiles
     --table2
       --family3
         --hfiles
       --family4

Each family directory and its HFiles match the output of HFileOutputFormat2.
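
The layout above can be sketched as simple path composition. This is only an illustration, not code from the patch: the class MultiHFileLayoutSketch, the method familyDir, and the output path are hypothetical names chosen for the example.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper, not part of HBase: it only illustrates the
// <output>/<table>/<family> layout described in the release note.
class MultiHFileLayoutSketch {

    // Returns the directory that would hold the HFiles of one column family.
    static Path familyDir(Path outputDir, String table, String family) {
        return outputDir.resolve(table).resolve(family);
    }

    public static void main(String[] args) {
        Path out = Paths.get("bulkload-output"); // assumed output path
        System.out.println(familyDir(out, "table1", "family2"));
        System.out.println(familyDir(out, "table2", "family3"));
    }
}
```

Under this sketch, removing the table level leaves the family-level path that a single-table job would produce.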

  was:
MultiHFileOutputFormat will output directories and HFiles as follows:
     --tableDir1
       --familyDir1
       --familyDir2
         --Hfiles
     --tableDir2
       --familyDir3
         --hfiles
       --familyDir4

This creates a three-level directory tree: the first level uses the table name as the parent
directory, the second uses the column family name as a child directory, and all HFiles for a
family sit under its column family directory. Apart from the table-level directory, the other
two levels follow HFileOutputFormat2.

There is only one major modification to HFileOutputFormat2: the anonymous class returned as
the RecordWriter becomes a named class, HFileRecordWriter, which extends RecordWriter.


> MultiHFileOutputFormat
> ----------------------
>
>                 Key: HBASE-3727
>                 URL: https://issues.apache.org/jira/browse/HBASE-3727
>             Project: HBase
>          Issue Type: New Feature
>    Affects Versions: 2.0.0
>            Reporter: Andrew Purtell
>            Assignee: yi liang
>            Priority: Minor
>         Attachments: HBASE-3727-V3.patch, HBASE-3727-V4.patch, HBASE-3727-V5.patch, MH2.patch,
>                      MultiHFileOutputFormat.java, MultiHFileOutputFormat.java, MultiHFileOutputFormat.java,
>                      TestMultiHFileOutputFormat.java
>
>
> Like MultiTableOutputFormat, but outputting HFiles. The key is the table name as an
> ImmutableBytesWritable (IBW). Creates sub-writers (code cut and pasted from HFileOutputFormat)
> on demand that produce HFiles in per-table subdirectories of the configured output path.
> Does not currently support partitioning for existing tables / incremental update.
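
The on-demand sub-writer idea in the description can be sketched roughly as below. This is a simplified illustration under stated assumptions, not the attached implementation: OnDemandWritersSketch and TableWriter are hypothetical names, the key is a plain String rather than an ImmutableBytesWritable, and the stand-in writer only records what it receives instead of producing HFiles.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: routes each record to a per-table writer that is
// created the first time its table name is seen.
class OnDemandWritersSketch {

    // Stand-in for a real HFile sub-writer; it only collects the cells.
    static class TableWriter {
        final String tableDir;
        final List<String> cells = new ArrayList<>();

        TableWriter(String tableDir) { this.tableDir = tableDir; }

        void write(String cell) { cells.add(cell); }
    }

    private final String outputPath;
    private final Map<String, TableWriter> writers = new HashMap<>();

    OnDemandWritersSketch(String outputPath) { this.outputPath = outputPath; }

    // Looks up the writer for the table, creating it under
    // <outputPath>/<tableName> on first use, then forwards the cell.
    void write(String tableName, String cell) {
        writers.computeIfAbsent(tableName, t -> new TableWriter(outputPath + "/" + t))
               .write(cell);
    }

    int writerCount() { return writers.size(); }

    TableWriter writerFor(String tableName) { return writers.get(tableName); }

    public static void main(String[] args) {
        OnDemandWritersSketch out = new OnDemandWritersSketch("bulkload-output");
        out.write("table1", "row1/family1:q1");
        out.write("table2", "row1/family3:q1");
        out.write("table1", "row2/family2:q1");
        System.out.println("writers created: " + out.writerCount());
    }
}
```

Creating writers lazily means a task only opens output for the tables that actually appear in its input.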



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
