beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vikas Kedigehalli (JIRA)" <>
Subject [jira] [Created] (BEAM-1956) Flatten operation should respect input type hints.
Date Thu, 13 Apr 2017 00:10:41 GMT
Vikas Kedigehalli created BEAM-1956:

             Summary: Flatten operation should respect input type hints.
                 Key: BEAM-1956
             Project: Beam
          Issue Type: Bug
          Components: sdk-py
            Reporter: Vikas Kedigehalli
            Assignee: Vikas Kedigehalli
             Fix For: First stable release

Input type hints are currently not respected by the Flatten operation and instead `Any` type
is chosen as a fallback. This could lead to using a pickle coder even if there was a custom
coder type hint provided for input PCollections. 

Also, this could lead to undesirable results, particularly, when a Flatten operation is followed
by a GroupByKey operation which requires the key coder to be deterministic. Even if the user
provides deterministic coder type hints to their PCollections, defaulting to Any would result
in using the pickle coder (non-deterministic). As a result of this, CoGroupByKey is broken
in such scenarios where input PCollection coder is deterministic for the type while pickle
coder is not.   

This message was sent by Atlassian JIRA

View raw message