hive-dev mailing list archives

From "Hive QA (JIRA)" <>
Subject [jira] [Commented] (HIVE-8349) DISTRIBUTE BY should work with tez auto-parallelism enabled
Date Thu, 16 Oct 2014 03:40:34 GMT


Hive QA commented on HIVE-8349:

{color:red}Overall{color}: -1 at least one test failed

Here are the results of testing the latest attachment:

{color:red}ERROR:{color} -1 due to 2 failed/errored test(s), 6558 tests executed
*Failed tests:*

Test results:
Console output:
Test logs:

Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 2 tests failed

This message is automatically generated.

 - PreCommit-HIVE-TRUNK-Build

> DISTRIBUTE BY should work with tez auto-parallelism enabled
> -----------------------------------------------------------
>                 Key: HIVE-8349
>                 URL:
>             Project: Hive
>          Issue Type: Bug
>          Components: Physical Optimizer, Tez
>    Affects Versions: 0.14.0
>            Reporter: Gopal V
>            Assignee: Gopal V
>             Fix For: 0.14.0
>         Attachments: HIVE-8349.1.patch, HIVE-8349.2.patch, HIVE-8349.3.patch, HIVE-8349.4.patch
> The current implementation of DISTRIBUTE BY does not work when Tez auto-parallelism is turned on, because of hashCode distribution issues.
> With DISTRIBUTE BY, the key is actually zero bytes, and partitioning is driven only by the hashCode; this adversely affects the uniform hashing implementation.
> In an ideal scenario, the edge should go from the ordered kv input to the unordered partitioned edge, to speed up the processing massively.
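
The hashing problem described in the issue can be illustrated with a small, self-contained sketch. This is not Hive or Tez code: the class and method names below are hypothetical, and the sketch assumes a partitioner that derives the partition from the serialized key bytes alone. Under that assumption a zero-byte key always hashes to the same value, so every row lands in one partition, whereas a partition derived from a per-row hash code spreads rows out.

```java
import java.util.Arrays;

// Illustrative only: hypothetical names, not taken from Hive or Tez.
public class DistributeBySketch {

    // Assumed behaviour: the partition is derived only from the serialized key bytes.
    static int partitionByKeyBytes(byte[] keyBytes, int numPartitions) {
        return (Arrays.hashCode(keyBytes) & Integer.MAX_VALUE) % numPartitions;
    }

    // Alternative: the partition is derived from a hash code computed per row
    // (for example, over the DISTRIBUTE BY columns).
    static int partitionByHashCode(int hashCode, int numPartitions) {
        return (hashCode & Integer.MAX_VALUE) % numPartitions;
    }

    // Every row carries the same zero-byte key, as described in the issue.
    static int[] countByKeyBytes(int rows, int numPartitions) {
        int[] counts = new int[numPartitions];
        byte[] emptyKey = new byte[0];
        for (int i = 0; i < rows; i++) {
            counts[partitionByKeyBytes(emptyKey, numPartitions)]++;
        }
        return counts;
    }

    // Each row gets a distinct hash code; the row index stands in for the
    // hash of the distribution columns.
    static int[] countByHashCode(int rows, int numPartitions) {
        int[] counts = new int[numPartitions];
        for (int i = 0; i < rows; i++) {
            counts[partitionByHashCode(Integer.hashCode(i), numPartitions)]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        // All rows collapse into a single partition.
        System.out.println(Arrays.toString(countByKeyBytes(1000, 4)));
        // Rows are spread across all partitions.
        System.out.println(Arrays.toString(countByHashCode(1000, 4)));
    }
}
```

In this toy setup the first call puts all 1000 rows into one partition, which is the kind of skew the issue attributes to zero-byte keys; the actual partitioning logic in Hive and Tez may differ in its details.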

This message was sent by Atlassian JIRA
