[ https://issues.apache.org/jira/browse/TEZ-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bikas Saha updated TEZ-1396:
----------------------------
Attachment: TEZ-1396.1.patch
Attaching patch split from TEZ-1656. Was reviewed in TEZ-1656.
Makes the grouping repeatable with an option to disable it to avoid hot-spotting.
> Grouping should generate consistent groups when given the same set of splits
> ----------------------------------------------------------------------------
>
> Key: TEZ-1396
> URL: https://issues.apache.org/jira/browse/TEZ-1396
> Project: Apache Tez
> Issue Type: Improvement
> Reporter: Siddharth Seth
> Assignee: Siddharth Seth
> Attachments: TEZ-1396.1.patch
>
>
> Currently, it seems like Grouping can end up generating a different set of groups on
different invocations of the same set of splits and target tasks.
> The order likely gets affected by the randomization in the block location report from
HDFS.
> This should be consistent for better cache utilization.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
|