hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matt McCline (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-12369) Native Vector GroupBy (Part 1)
Date Sun, 22 Apr 2018 22:05:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-12369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Matt McCline updated HIVE-12369:
--------------------------------
    Attachment: HIVE-12369.0991.patch

> Native Vector GroupBy (Part 1)
> ------------------------------
>
>                 Key: HIVE-12369
>                 URL: https://issues.apache.org/jira/browse/HIVE-12369
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>            Reporter: Matt McCline
>            Assignee: Matt McCline
>            Priority: Critical
>         Attachments: HIVE-12369.01.patch, HIVE-12369.02.patch, HIVE-12369.05.patch, HIVE-12369.06.patch,
HIVE-12369.091.patch, HIVE-12369.094.patch, HIVE-12369.095.patch, HIVE-12369.096.patch, HIVE-12369.097.patch,
HIVE-12369.098.patch, HIVE-12369.099.patch, HIVE-12369.0991.patch
>
>
> Implement Native Vector GroupBy using fast hash table technology developed for Native
Vector MapJoin, etc.
> Patch is currently limited to a single COUNT aggregation.  Or, no aggregation also known
as duplicate reduction.
> Here are examples of new kinds of classes introduces that stored the count in the slot
table and don't allocate hash elements:
> {noformat}
>   COUNT(column)  VectorGroupByHashLongKeySingleCountColumnOperator      
>   COUNT(key)     VectorGroupByHashLongKeySingleCountKeyOperator            
>   COUNT(*)       VectorGroupByHashLongKeySingleCountStarOperator           
> {noformat}
> And the duplicate reduction operator a single key.  Example:
> {noformat}
>   VectorGroupByHashLongKeyDuplicateReductionOperator
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message