flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-10126) There should be a Scala DataSource
Date Tue, 25 Sep 2018 02:17:00 GMT

    [ https://issues.apache.org/jira/browse/FLINK-10126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16626676#comment-16626676
] 

ASF GitHub Bot commented on FLINK-10126:
----------------------------------------

yanghua commented on issue #6738: [FLINK-10126] There should be a Scala DataSource
URL: https://github.com/apache/flink/pull/6738#issuecomment-424184780
 
 
   hi @StephanEwen , Before providing the API directly, I evaluated the implementation of
a peer `DataSource` in the flink-scala module. However, It will break the existing source
API. It was found that `ExecutionEnvironment.scala` released some `readXXX` APIs that returned
`DataSet.scala`. Inside these APIs, we got a `DataSource` (flink-java) and then called `wrap(source)`
to convert it to a `DataSet.scala`. Considering that there are only two or three APIs exposed
in the `DataSource.java` (one of which `withParameters` has been exposed through the `DataSet`),
I chose to expose the API in the `DataSource` to the `DataSet.scala` with minimal changes
and without breaking the current API. In fact, in this JIRA's description, reporter is more
concerned with getting the APIs provided by `DataSource.java`.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> There should be a Scala DataSource
> ----------------------------------
>
>                 Key: FLINK-10126
>                 URL: https://issues.apache.org/jira/browse/FLINK-10126
>             Project: Flink
>          Issue Type: Improvement
>            Reporter: Alexis Sarda-Espinosa
>            Assignee: vinoyang
>            Priority: Minor
>              Labels: datasource, pull-request-available, scala
>
> In Java, an ExecutionEnvironment's createInput method returns a DataSource, whereas the
Scala version returns a DataSet. There is no Scala DataSource wrapper, and the Scala DataSet
does not provide the Java DataSource methods, such as getSplitDataProperties.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message