hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ferdinand Xu (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-14815) Implement Parquet vectorization reader for Primitive types
Date Wed, 02 Nov 2016 08:56:59 GMT

     [ https://issues.apache.org/jira/browse/HIVE-14815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Ferdinand Xu updated HIVE-14815:
--------------------------------
    Summary: Implement Parquet vectorization reader for Primitive types   (was: Implement
Parquet vectorization reader )

> Implement Parquet vectorization reader for Primitive types 
> -----------------------------------------------------------
>
>                 Key: HIVE-14815
>                 URL: https://issues.apache.org/jira/browse/HIVE-14815
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Ferdinand Xu
>            Assignee: Ferdinand Xu
>         Attachments: HIVE-14815.1.patch, HIVE-14815.2.patch, HIVE-14815.3.patch, HIVE-14815.patch
>
>
> Parquet doesn't provide a vectorized reader which can be used by Hive directly. Also
for Decimal Column batch, it consists of a batch of HiveDecimal which is a Hive type which
is unknown for Parquet. To support Hive vectorization execution engine in Hive, we have to
implement the vectorized Parquet reader in Hive side. To limit the performance impacts, we
need to implement a page level vectorized reader.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message