crunch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Josh Wills (JIRA)" <>
Subject [jira] [Commented] (CRUNCH-340) HCatSource
Date Mon, 10 Feb 2014 07:04:20 GMT


Josh Wills commented on CRUNCH-340:

I worry a bit here about adding in the new dependencies, esp. since (IMHO) Hive dependencies
are a nightmare even by Apache standards. ;-) I think at a minimum we'd want crunch-hcat as
a submodule separate from core. I'd be down to support this if a critical mass of folks in
the community have use cases for it, so let's start a thread on user@ and dev@ to discuss.

> HCatSource
> ----------
>                 Key: CRUNCH-340
>                 URL:
>             Project: Crunch
>          Issue Type: New Feature
>            Reporter: Chao Shi
>         Attachments: crunch-340.patch
> This patch adds HCatSource, which enables crunch pipeline to read from Hive tables. This
is the very first version, leaving a few TODOs in code.
> It adds new dependency from crunch-core to hcatalog (as well as several hive components).
I guess maybe we should create a new subproject (e.g. crunch-hcatalog) rather than add it
into crunch-core.

This message was sent by Atlassian JIRA

View raw message