beam-commits mailing list archives

From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (BEAM-53) PubSubIO: reimplement in Java
Date Tue, 12 Apr 2016 20:17:25 GMT

    [ https://issues.apache.org/jira/browse/BEAM-53?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15237894#comment-15237894 ]

ASF GitHub Bot commented on BEAM-53:
------------------------------------

GitHub user mshields822 opened a pull request:

    https://github.com/apache/incubator-beam/pull/171

    [BEAM-53] Java-only Pubsub sink for streaming.

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/mshields822/incubator-beam pubsub-sink

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-beam/pull/171.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #171
    
----
commit 2cf71eb4e0f65327c42c51143bce72698ebd7e09
Author: Mark Shields <markshields@google.com>
Date:   2016-04-05T01:09:27Z

    Pub/sub sink

commit 15c372da1757d95049db8428ccf6951774532812
Author: Mark Shields <markshields@google.com>
Date:   2016-04-11T22:11:51Z

    TopicPath instead of String

commit c260329e41c5ba605b6d98ca18a9951675055c66
Author: Mark Shields <markshields@google.com>
Date:   2016-04-12T01:02:16Z

    Getters

commit 08dd9022114c62e35adfae89bd41a97d19c831a8
Author: Mark Shields <markshields@google.com>
Date:   2016-04-12T01:04:36Z

    one more getter

----


> PubSubIO: reimplement in Java
> -----------------------------
>
>                 Key: BEAM-53
>                 URL: https://issues.apache.org/jira/browse/BEAM-53
>             Project: Beam
>          Issue Type: New Feature
>          Components: runner-core
>            Reporter: Daniel Halperin
>            Assignee: Mark Shields
>
> PubSubIO is currently only partially implemented in Java: the DirectPipelineRunner uses a non-scalable API in a single-threaded manner.
> In contrast, the DataflowPipelineRunner uses an entirely different code path implemented in the Google Cloud Dataflow service.
> We need to reimplement PubSubIO in Java in order to support other runners in a scalable way.
> Additionally, we can take this opportunity to add new features:
> * getting timestamp from an arbitrary lambda in arbitrary formats rather than from a message attribute in only 2 formats.
> * exposing metadata and attributes in the elements produced by PubSubIO.Read
> * setting metadata and attributes in the messages written by PubSubIO.Write
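
The first proposed feature in the list above (timestamps from an arbitrary lambda rather than from a fixed attribute format) could look roughly like the following. This is a minimal sketch in plain Java and not Beam's API: `Message`, `fromAttribute` and `fromFunction` are hypothetical names, and the two attribute formats are assumed here to be milliseconds since the Unix epoch and an RFC 3339 string in UTC ("Z") form.

```java
import java.nio.charset.StandardCharsets;
import java.time.Instant;
import java.util.Collections;
import java.util.Map;
import java.util.function.Function;

class TimestampExtractionSketch {
    /** Hypothetical stand-in for a Pub/sub message: a payload plus string attributes. */
    static final class Message {
        final byte[] payload;
        final Map<String, String> attributes;

        Message(byte[] payload, Map<String, String> attributes) {
            this.payload = payload;
            this.attributes = attributes;
        }
    }

    /**
     * Attribute-based extraction, limited to two formats (an assumption of this sketch):
     * milliseconds since the Unix epoch, or an RFC 3339 string ending in "Z".
     */
    static Instant fromAttribute(Message message, String attributeName) {
        String value = message.attributes.get(attributeName);
        if (value == null) {
            throw new IllegalArgumentException("missing attribute: " + attributeName);
        }
        try {
            return Instant.ofEpochMilli(Long.parseLong(value));
        } catch (NumberFormatException e) {
            // Not a number, so fall back to the string format.
            return Instant.parse(value);
        }
    }

    /** Function-based extraction: the caller decides where the timestamp lives and how it is encoded. */
    static Instant fromFunction(Message message, Function<Message, Instant> extractor) {
        return extractor.apply(message);
    }

    public static void main(String[] args) {
        // Timestamp carried in a "ts" attribute as epoch milliseconds.
        Message withAttribute =
            new Message(new byte[0], Collections.singletonMap("ts", "1460492245000"));
        System.out.println(fromAttribute(withAttribute, "ts"));

        // Timestamp carried in the payload as epoch seconds, a format the attribute path cannot read.
        Message withPayload =
            new Message("1460492245".getBytes(StandardCharsets.UTF_8), Collections.emptyMap());
        Function<Message, Instant> secondsInPayload =
            m -> Instant.ofEpochSecond(Long.parseLong(new String(m.payload, StandardCharsets.UTF_8)));
        System.out.println(fromFunction(withPayload, secondsInPayload));
    }
}
```

The function-based form subsumes the attribute-based one: `m -> fromAttribute(m, "ts")` is itself a valid extractor, so the existing behaviour could remain available as a default.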



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
