crunch-dev mailing list archives

From "Gabriel Reid (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CRUNCH-450) Adding ORC file format support in Crunch
Date Tue, 12 Aug 2014 12:37:11 GMT

    [ https://issues.apache.org/jira/browse/CRUNCH-450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14094024#comment-14094024 ]

Gabriel Reid commented on CRUNCH-450:
-------------------------------------

Looks really cool. I was only able to take a pretty quick look through, but it looks good
to me from what I've seen.

One minor thing: the version of the hive-exec dependency should probably be set in a
property in the root pom (that's how it's done for everything else), so maybe you could
just make that change while committing the patch.

> Adding ORC file format support in Crunch
> ----------------------------------------
>
>                 Key: CRUNCH-450
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-450
>             Project: Crunch
>          Issue Type: New Feature
>          Components: Core, IO
>            Reporter: Zhong Wang
>            Assignee: Josh Wills
>             Fix For: 0.11.0
>
>         Attachments: CRUNCH-450-final.patch, CRUNCH-450-newapi.patch, CRUNCH-450-submodule.1.patch, CRUNCH-450-submodule.2.patch, CRUNCH-450-submodule.patch, CRUNCH-450.patch
>
>
> This JIRA adds ORC file format support in Crunch by:
> --
> 1. Adding input source and output target for ORC
> 2. Adding a new type family - OrcTypeFamily to serialize / deserialize objects into OrcStruct
> 3. Supporting column pruning optimization



--
This message was sent by Atlassian JIRA
(v6.2#6252)
