hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matt McCline (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-7405) Vectorize GROUP BY on the Reduce-Side (Part 1 – Basic)
Date Fri, 25 Jul 2014 03:45:39 GMT

    [ https://issues.apache.org/jira/browse/HIVE-7405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14074019#comment-14074019
] 

Matt McCline commented on HIVE-7405:
------------------------------------

(Note: This patch also contains the large changes for HIVE-7029 Vectorize ReduceWork.)

> Vectorize GROUP BY on the Reduce-Side (Part 1 – Basic)
> ------------------------------------------------------
>
>                 Key: HIVE-7405
>                 URL: https://issues.apache.org/jira/browse/HIVE-7405
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Matt McCline
>            Assignee: Matt McCline
>         Attachments: HIVE-7405.1.patch
>
>
> Vectorize the basic case that does not have any count distinct aggregation.
> Add a 4th processing mode in VectorGroupByOperator for reduce where each input VectorizedRowBatch
has only values for one key at a time.  Thus, the values in the batch can be aggregated quickly.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message