hive-dev mailing list archives

From "Brock Noland (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-5783) Native Parquet Support in Hive
Date Tue, 21 Jan 2014 17:00:22 GMT

    [ https://issues.apache.org/jira/browse/HIVE-5783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13877610#comment-13877610 ]

Brock Noland commented on HIVE-5783:
------------------------------------

[~leftylev], good call.

I think we should create a document under "File Formats". I will volunteer for that effort.

> Native Parquet Support in Hive
> ------------------------------
>
>                 Key: HIVE-5783
>                 URL: https://issues.apache.org/jira/browse/HIVE-5783
>             Project: Hive
>          Issue Type: New Feature
>          Components: Serializers/Deserializers
>            Reporter: Justin Coffey
>            Assignee: Justin Coffey
>            Priority: Minor
>             Fix For: 0.13.0
>
>         Attachments: HIVE-5783.patch, HIVE-5783.patch, HIVE-5783.patch, HIVE-5783.patch, HIVE-5783.patch
>
>
> Problem Statement:
> Hive would be easier to use if it had native Parquet support. Our organization, Criteo,
> uses Hive extensively. Therefore we built the Parquet Hive integration and would like to now
> contribute that integration to Hive.
> About Parquet:
> Parquet is a columnar storage format for Hadoop and integrates with many Hadoop ecosystem
> tools such as Thrift, Avro, Hadoop MapReduce, Cascading, Pig, Drill, Crunch, and Hive. Pig,
> Crunch, and Drill all contain native Parquet integration.
> Changes Details:
> Parquet was built with dependency management in mind and therefore only a single Parquet
> jar will be added as a dependency.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
