beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel Halperin (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (BEAM-1316) DoFn#startBundle and #finishBundle should not be able to output
Date Thu, 26 Jan 2017 16:47:24 GMT

    [ https://issues.apache.org/jira/browse/BEAM-1316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15840014#comment-15840014
] 

Daniel Halperin commented on BEAM-1316:
---------------------------------------

What if my output includes a list of filenames paired with file sizes, or element counts --
aka, information that may only be known after I flush to external systems?

> DoFn#startBundle and #finishBundle should not be able to output
> ---------------------------------------------------------------
>
>                 Key: BEAM-1316
>                 URL: https://issues.apache.org/jira/browse/BEAM-1316
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-java-core
>            Reporter: Thomas Groh
>
> While within startBundle and finishBundle, the window in which elements are output is
not generally defined. Elements must always be output from within a windowed context, or the
{{WindowFn}} used by the {{PCollection}} may not operate appropriately.
> startBundle and finishBundle are suitable for operational duties, similarly to {{setup}}
and {{teardown}}, but within the scope of some collection of input elements. This includes
actions such as clearing field state within a DoFn and ensuring all live RPCs complete successfully
before committing inputs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message