drill-dev mailing list archives

From "Victoria Markman (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-2148) Wrong result with grouping on a column of date type with streaming aggregation
Date Tue, 03 Feb 2015 03:12:34 GMT
Victoria Markman created DRILL-2148:
---------------------------------------

             Summary: Wrong result with grouping on a column of date type with streaming aggregation
                 Key: DRILL-2148
                 URL: https://issues.apache.org/jira/browse/DRILL-2148
             Project: Apache Drill
          Issue Type: Bug
          Components: Execution - Operators
    Affects Versions: 0.8.0
            Reporter: Victoria Markman
            Assignee: Chris Westin
            Priority: Critical


Disable hash aggregation and run the query below:
{code}

alter system set `planner.enable_hashagg` = false;

select
        c_date,
        COUNT(*)
from    t1
group by
        c_date
order by
        c_date;

{code}

The query returns a wrong result. Because NULLs are sorted in the middle (see DRILL-2084),
they are folded into one of the unrelated groups.
We might have the same problem with merge join on date, time, and timestamp columns.
The attached parquet file was used for this query.
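
To illustrate the failure mode described above, here is a minimal Python sketch, not Drill code. It assumes, purely for illustration, that a date is stored as (is_set, days_since_epoch), that a NULL carries a raw value of 0, and that both the sort and the group-boundary check look only at the raw value. The names `rows`, `streaming_count`, `raw`, and `null_aware` are hypothetical. Under those assumptions the NULL rows land in the middle of the sorted input and are counted as part of the neighbouring non-NULL group.

```python
from itertools import groupby

# Each row is (is_set, days_since_epoch). A NULL is modeled as (False, 0);
# this storage layout is an assumption made for the sketch, not Drill's.
rows = [(True, 10), (False, 0), (True, -5), (True, 0), (False, 0), (True, 10)]


def streaming_count(sorted_rows, key):
    """Count runs of consecutive rows whose key compares equal.

    This mimics a streaming aggregate: it never looks back, so it is only
    correct when the input ordering keeps each group contiguous.
    """
    return [(k, sum(1 for _ in grp)) for k, grp in groupby(sorted_rows, key=key)]


# Hypothetical faulty behaviour: sort and boundary check both ignore is_set.
raw = lambda r: r[1]
faulty = streaming_count(sorted(rows, key=raw), raw)

# Null-aware behaviour: NULLs sort last and form a group of their own.
null_aware = lambda r: (not r[0], r[1] if r[0] else 0)
correct = streaming_count(sorted(rows, key=null_aware), null_aware)

print("faulty :", faulty)
print("correct:", correct)
```

In the faulty run, the two NULL rows are folded into the group for raw value 0, so that group reports a count of 3 instead of 1 and no NULL group appears. In the null-aware run, the NULLs form their own group with a count of 2.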




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
