hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Kevin Peterson (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HBASE-1963) Output to multiple tables from Hadoop MR without use of HTable
Date Tue, 10 Nov 2009 00:27:32 GMT

     [ https://issues.apache.org/jira/browse/HBASE-1963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Kevin Peterson updated HBASE-1963:
----------------------------------

     Component/s: mapreduce
    Release Note: MultiTableOutputFormat allows output from a map/reduce job to be written
to multiple tables. An example illustrates use for creating secondary indexes from an existing
table.

I'll make the ImmutableBytesWritable change tonight.

> Output to multiple tables from Hadoop MR without use of HTable
> --------------------------------------------------------------
>
>                 Key: HBASE-1963
>                 URL: https://issues.apache.org/jira/browse/HBASE-1963
>             Project: Hadoop HBase
>          Issue Type: New Feature
>          Components: mapreduce
>    Affects Versions: 0.20.1
>            Reporter: Kevin Peterson
>            Priority: Minor
>             Fix For: 0.21.0
>
>         Attachments: HBASE-1963.patch
>
>
> o.a.h.h.mapreduce.TableOutputFormat allows writing to a single table as output from a
map/reduce job in the natural way. It requires that the user specify the table name ahead
of time and can only write to one table. I had a need to write to multiple tables from the
same job (write my data to one table, and also write to index tables), and I wanted to have
a consistent API whether writing to one or many tables.
> Attached MultiTableOutputFormat takes the table name as the key and the Put or Delete
as the value. Also included is an example demonstrating the usage.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message