hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dong Chen (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-8128) Improve Parquet Vectorization
Date Wed, 29 Jul 2015 04:31:04 GMT

    [ https://issues.apache.org/jira/browse/HIVE-8128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14645436#comment-14645436
] 

Dong Chen commented on HIVE-8128:
---------------------------------

Thanks for your review! [~Ferd]

The build failure in the pre-commit test is caused by the dependency downloading of Parquet.
This patch is developed based on the parquet code at https://github.com/nezihyigitbasi-nflx/parquet-mr/commits/vector.
In that code base, parquet version is 1.8.0rc2-SNAPSHOT, which is referred to in this Hive
patch. Since that code is still not merged into the parquet master, we cannot point to a proper
parquet version for download for pre-commit test.

Could we find a workaround? [~spena], [~Ferd]

> Improve Parquet Vectorization
> -----------------------------
>
>                 Key: HIVE-8128
>                 URL: https://issues.apache.org/jira/browse/HIVE-8128
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Brock Noland
>            Assignee: Dong Chen
>             Fix For: parquet-branch
>
>         Attachments: HIVE-8128-parquet.patch.POC, HIVE-8128.1-parquet.patch, HIVE-8128.6-parquet.patch,
HIVE-8128.6-parquet.patch, testParquetFile
>
>
> We'll want to do is finish the vectorization work (e.g. VectorizedOrcSerde, VectorizedOrcSerde)
which was partially done in HIVE-5998.
> As discussed in PARQUET-131, we will work out Hive POC based on the new Parquet vectorized
API, and then finish the implementation after finilized.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message