drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Victoria Markman (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DRILL-2148) Wrong result with grouping on a column of date type with streaming aggregation
Date Tue, 03 Feb 2015 03:20:35 GMT

     [ https://issues.apache.org/jira/browse/DRILL-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Victoria Markman updated DRILL-2148:
------------------------------------
    Attachment: t1.res

> Wrong result with grouping on a column of date type with streaming aggregation
> ------------------------------------------------------------------------------
>
>                 Key: DRILL-2148
>                 URL: https://issues.apache.org/jira/browse/DRILL-2148
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Operators
>    Affects Versions: 0.8.0
>            Reporter: Victoria Markman
>            Assignee: Chris Westin
>            Priority: Critical
>         Attachments: 0_0_0.parquet, t1.out, t1.res
>
>
> Disable hash aggregation  and run query below:
> {code}
> alter system set `planner.enable_hashagg` = false;
> select
>         c_date,
>         COUNT(*)
> from    t1
> group by
>         c_date
> order by
>         c_date;
> {code}
> You will get wrong result. Because NULLs are sorted in the middle ( see DRILL-2084 )
they are folded in one of the non related groups.
> We might have the same problem with the merge join on date, time and timestamp columns.
> Attached is a parquet file that was used in this query.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message