hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrew Mains (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HBASE-13356) HBase should provide an InputFormat supporting multiple scans in mapreduce jobs over snapshots
Date Mon, 27 Apr 2015 01:08:40 GMT

     [ https://issues.apache.org/jira/browse/HBASE-13356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Andrew Mains updated HBASE-13356:
---------------------------------
    Attachment: HBASE-13356.2.patch

Thanks for the review! I believe I've addressed all of the comments and hadoop QA errors in
this second patch (it goes through everything in test-patch.sh, at least on my local). Let
me know if there's anything else I can fix up.

> HBase should provide an InputFormat supporting multiple scans in mapreduce jobs over
snapshots
> ----------------------------------------------------------------------------------------------
>
>                 Key: HBASE-13356
>                 URL: https://issues.apache.org/jira/browse/HBASE-13356
>             Project: HBase
>          Issue Type: New Feature
>          Components: mapreduce
>            Reporter: Andrew Mains
>            Assignee: Andrew Mains
>            Priority: Minor
>         Attachments: HBASE-13356.2.patch, HBASE-13356.patch
>
>
> Currently, HBase supports the pushing of multiple scans to mapreduce jobs over live tables
(via MultiTableInputFormat) but only supports a single scan for mapreduce jobs over table
snapshots. It would be handy to support multiple scans over snapshots as well, probably through
another input format (MultiTableSnapshotInputFormat?). To mimic the functionality present
in MultiTableInputFormat, the new input format would likely have to take in the names of all
snapshots used in addition to the scans.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message