flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Fabian Hueske (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-2168) Add HBaseTableSource
Date Wed, 11 Jan 2017 17:13:48 GMT

    [ https://issues.apache.org/jira/browse/FLINK-2168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15818868#comment-15818868

Fabian Hueske commented on FLINK-2168:

For now we should only support scan and not get. 
Specifying which columns to read is exactly the functionality that the `ProjectableTableSource`
When the table source is defined, it maps some columns of an HBase table to a relational schema
with unique names. During optimization, the table source is configured with the actual columns
to fetch.

We definitely need some kind of InputFormat to talk to HBase. Flink features a {{TableInputFormat}}
which could serve as a basis for the HBase table source. The deserialization of byte arrays
can either happen in the input format or in a subsequent Map function.

> Add HBaseTableSource
> --------------------
>                 Key: FLINK-2168
>                 URL: https://issues.apache.org/jira/browse/FLINK-2168
>             Project: Flink
>          Issue Type: New Feature
>          Components: Table API & SQL
>    Affects Versions: 0.9
>            Reporter: Fabian Hueske
>            Assignee: ramkrishna.s.vasudevan
>            Priority: Minor
>              Labels: starter
> Add a {{HBaseTableSource}} to read data from a HBase table. The {{HBaseTableSource}}
should implement the {{ProjectableTableSource}} (FLINK-3848) and {{FilterableTableSource}}
(FLINK-3849) interfaces.
> The implementation can be based on Flink's {{TableInputFormat}}.

This message was sent by Atlassian JIRA

View raw message