tez-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bikas Saha (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TEZ-1396) Grouping should generate consistent groups when given the same set of splits
Date Tue, 21 Oct 2014 21:59:33 GMT

     [ https://issues.apache.org/jira/browse/TEZ-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Bikas Saha updated TEZ-1396:
----------------------------
    Attachment: TEZ-1396.1.patch

Attaching patch split from TEZ-1656. Was reviewed in TEZ-1656.
Makes the grouping repeatable with an option to disable it to avoid hot-spotting.

> Grouping should generate consistent groups when given the same set of splits
> ----------------------------------------------------------------------------
>
>                 Key: TEZ-1396
>                 URL: https://issues.apache.org/jira/browse/TEZ-1396
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>         Attachments: TEZ-1396.1.patch
>
>
> Currently, it seems like Grouping can end up generating a different set of groups on
different invocations of the same set of splits and target tasks.
> The order likely gets affected by the randomization in the block location report from
HDFS.
> This should be consistent for better cache utilization.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message