crunch-dev mailing list archives

From "Josh Wills (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CRUNCH-340) HCatSource
Date Mon, 10 Feb 2014 07:04:20 GMT

    [ https://issues.apache.org/jira/browse/CRUNCH-340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13896272#comment-13896272 ]

Josh Wills commented on CRUNCH-340:
-----------------------------------

I worry a bit here about adding in the new dependencies, esp. since (IMHO) Hive dependencies
are a nightmare even by Apache standards. ;-) I think at a minimum we'd want crunch-hcat as
a submodule separate from core. I'd be down to support this if a critical mass of folks in
the community have use cases for it, so let's start a thread on user@ and dev@ to discuss.

> HCatSource
> ----------
>
>                 Key: CRUNCH-340
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-340
>             Project: Crunch
>          Issue Type: New Feature
>            Reporter: Chao Shi
>         Attachments: crunch-340.patch
>
>
> This patch adds HCatSource, which enables Crunch pipelines to read from Hive tables. This
> is the very first version, leaving a few TODOs in the code.
> It adds a new dependency from crunch-core to hcatalog (as well as several hive components).
> I guess maybe we should create a new subproject (e.g. crunch-hcatalog) rather than add it
> into crunch-core.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
