hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hive QA (JIRA)" <>
Subject [jira] [Commented] (HIVE-13089) Rounding in Stats for equality expressions
Date Sat, 20 Feb 2016 03:01:18 GMT


Hive QA commented on HIVE-13089:

Here are the results of testing the latest attachment:

{color:red}ERROR:{color} -1 due to no test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 7 failed/errored test(s), 9800 tests executed
*Failed tests:*
TestSparkCliDriver-timestamp_lazy.q-bucketsortoptimize_insert_4.q-date_udf.q-and-12-more -
did not produce a TEST-*.xml file

Test results:
Console output:
Test logs:

Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 7 tests failed

This message is automatically generated.

ATTACHMENT ID: 12788525 - PreCommit-HIVE-TRUNK-Build

> Rounding in Stats for equality expressions
> ------------------------------------------
>                 Key: HIVE-13089
>                 URL:
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>    Affects Versions: 2.1.0
>            Reporter: Jesus Camacho Rodriguez
>            Assignee: Jesus Camacho Rodriguez
>         Attachments: HIVE-13089.patch
> Currently we divide numRows(long) by countDistinct(long), thus ignoring the decimals.
We should do proper rounding.
> This is specially useful for equality expressions over columns whose values are unique.
As NDV estimates allow for a certain error, if countDistinct > numRows, we end up with
0 rows in the estimate for the expression.

This message was sent by Atlassian JIRA

View raw message