hive-issues mailing list archives

From "Ferdinand Xu (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-14815) Support vectorization for Parquet
Date Thu, 22 Sep 2016 07:14:20 GMT

     [ https://issues.apache.org/jira/browse/HIVE-14815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ferdinand Xu updated HIVE-14815:
--------------------------------
    Description: Parquet does not provide a vectorized reader that Hive can use directly.
In addition, a decimal column batch consists of HiveDecimal values, and HiveDecimal is a
Hive type that is unknown to Parquet. To support Hive's vectorized execution engine, we have
to implement the vectorized Parquet reader on the Hive side. To limit the performance impact,
we need to implement a page-level vectorized reader.

> Support vectorization for Parquet
> ---------------------------------
>
>                 Key: HIVE-14815
>                 URL: https://issues.apache.org/jira/browse/HIVE-14815
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Ferdinand Xu
>            Assignee: Ferdinand Xu
>
> Parquet does not provide a vectorized reader that Hive can use directly. In addition, a
> decimal column batch consists of HiveDecimal values, and HiveDecimal is a Hive type that
> is unknown to Parquet. To support Hive's vectorized execution engine, we have to implement
> the vectorized Parquet reader on the Hive side. To limit the performance impact, we need
> to implement a page-level vectorized reader.
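
As a rough illustration of what "page level" could mean here, the sketch below fills a fixed-size column batch by bulk-copying values out of already-decoded pages instead of pulling one row at a time. It is only a sketch under stated assumptions: Page, LongBatch and PageLevelReader are hypothetical stand-ins written for this example, not Parquet or Hive classes, and decoding, null handling and decimal conversion are left out.

```java
import java.util.Arrays;
import java.util.List;

public class PageBatchSketch {
    /** Hypothetical stand-in for one decoded data page of a single column. */
    static final class Page {
        final long[] values;
        Page(long... values) { this.values = values; }
    }

    /** Hypothetical stand-in for a column vector that holds one batch of values. */
    static final class LongBatch {
        final long[] vector;
        int size;
        LongBatch(int capacity) { this.vector = new long[capacity]; }
    }

    /** Fills batches with bulk copies from the current page; a batch may span pages. */
    static final class PageLevelReader {
        private final List<Page> pages;
        private int pageIndex = 0;
        private int offsetInPage = 0;

        PageLevelReader(List<Page> pages) { this.pages = pages; }

        /** Refills the batch and returns false once no values remain. */
        boolean nextBatch(LongBatch batch) {
            batch.size = 0;
            while (batch.size < batch.vector.length && pageIndex < pages.size()) {
                long[] page = pages.get(pageIndex).values;
                int free = batch.vector.length - batch.size;
                int n = Math.min(free, page.length - offsetInPage);
                // One bulk copy per page segment instead of a per-row call.
                System.arraycopy(page, offsetInPage, batch.vector, batch.size, n);
                batch.size += n;
                offsetInPage += n;
                if (offsetInPage == page.length) {
                    // Page exhausted (or empty): move on to the next one.
                    pageIndex++;
                    offsetInPage = 0;
                }
            }
            return batch.size > 0;
        }
    }

    public static void main(String[] args) {
        PageLevelReader reader =
            new PageLevelReader(Arrays.asList(new Page(1, 2, 3), new Page(4, 5)));
        LongBatch batch = new LongBatch(4);
        while (reader.nextBatch(batch)) {
            // Prints [1, 2, 3, 4] and then [5].
            System.out.println(Arrays.toString(Arrays.copyOf(batch.vector, batch.size)));
        }
    }
}
```

A real reader would also have to decode each page's encoding and convert decimal values into HiveDecimal on the Hive side, which this sketch does not attempt.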



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
