flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Fabian Hueske (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-2168) Add HBaseTableSource
Date Tue, 10 Jan 2017 11:35:58 GMT

    [ https://issues.apache.org/jira/browse/FLINK-2168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15814733#comment-15814733

Fabian Hueske commented on FLINK-2168:

I think we should limit this issue to implement a projectable {{BatchTableSource}} for HBase.
As such, all we need to do is to return a {{DataSet}} of a specific type. There is no need
to do any query parsing or query push down at the moment.

A user needs to configure the table source with the following information:
* name of the HBase table to scan
* a list of columns to scan with types and deserializers to convert the {{byte[]}} into the
column type.

Given this information, the table source needs to connect to HBase, fetch the columns, deserialize
the fields, and create the output records.
A projectable table source also needs to be able to restrict the column based on a set of
columns it is configured with. But which columns is automatically figured out by Calcite.

> Add HBaseTableSource
> --------------------
>                 Key: FLINK-2168
>                 URL: https://issues.apache.org/jira/browse/FLINK-2168
>             Project: Flink
>          Issue Type: New Feature
>          Components: Table API & SQL
>    Affects Versions: 0.9
>            Reporter: Fabian Hueske
>            Assignee: ramkrishna.s.vasudevan
>            Priority: Minor
>              Labels: starter
> Add a {{HBaseTableSource}} to read data from a HBase table. The {{HBaseTableSource}}
should implement the {{ProjectableTableSource}} (FLINK-3848) and {{FilterableTableSource}}
(FLINK-3849) interfaces.
> The implementation can be based on Flink's {{TableInputFormat}}.

This message was sent by Atlassian JIRA

View raw message