beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <>
Subject [jira] [Commented] (BEAM-3042) Add tracking of bytes read / time spent when reading side inputs
Date Wed, 06 Dec 2017 21:08:00 GMT


ASF GitHub Bot commented on BEAM-3042:

pabloem opened a new pull request #4222: [BEAM-3042] Updating Dataflow Api protos
   This is necessary to be able to report the new Dataflow metrics.
   r: @chamikaramj 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:

> Add tracking of bytes read / time spent when reading side inputs
> ----------------------------------------------------------------
>                 Key: BEAM-3042
>                 URL:
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-py-core
>            Reporter: Pablo Estrada
>            Assignee: Pablo Estrada
> It is difficult for Dataflow users to understand how modifying a pipeline or data set
can affect how much inter-transform IO is used in their job. The intent of this feature request
is to help users understand how side inputs behave when they are consumed.
> This will allow users to understand how much time and how much data their pipeline uses
to read/write to inter-transform IO. Users will also be able to modify their pipelines and
understand how their changes affect these IO metrics.
> For further information, please review the internal Google doc go/insights-transform-io-design-doc.

This message was sent by Atlassian JIRA

View raw message