hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "wolfgang hoschek (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-5881) Integrate Hive with Morphlines
Date Tue, 05 Aug 2014 16:00:12 GMT

     [ https://issues.apache.org/jira/browse/HIVE-5881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

wolfgang hoschek updated HIVE-5881:
-----------------------------------

    Description: 
Integrate Hive with morphlines. 

Specifically, add support to Hive to call a morphline as a UDTF to leverage existing morphlines
ETL functionality. Often, some flexible massaging needs to happen to get the input data into
the shape that's desired, and morphline logic helps do this in a user-friendly, pluggable,
efficient, pipelined manner. This issue basically boils down to transforming an input row
into a morphline record, feeding the record into the morphline processing API, and finally
converting zero or more morphline output records into corresponding Hive rows.

Some background is here:

http://kitesdk.org/docs/current/kite-morphlines/morphlinesReferenceGuide.html

http://kitesdk.org/docs/current/kite-morphlines/index.html

  was:
Integrate Hive with morphlines. 

Specifically, add support to Hive to call a morphline as a UDTF to leverage existing morphlines
ETL functionality. Often, some flexible massaging needs to happen to get the input data into
the shape that's desired, and morphline logic helps do this in a user-friendly, pluggable,
efficient, pipelined manner. This issue basically boils down to transforming an input row
into a morphline record, feeding the record into the morphline processing API, and finally
converting zero or more morphline output records into corresponding Hive rows.

Some background is here:

http://cloudera.github.io/cdk/docs/current/cdk-morphlines/morphlinesReferenceGuide.html

http://cloudera.github.io/cdk/docs/current/cdk-morphlines/index.html


> Integrate Hive with Morphlines
> ------------------------------
>
>                 Key: HIVE-5881
>                 URL: https://issues.apache.org/jira/browse/HIVE-5881
>             Project: Hive
>          Issue Type: New Feature
>          Components: UDF
>    Affects Versions: 0.12.0
>            Reporter: wolfgang hoschek
>
> Integrate Hive with morphlines. 
> Specifically, add support to Hive to call a morphline as a UDTF to leverage existing
morphlines ETL functionality. Often, some flexible massaging needs to happen to get the input
data into the shape that's desired, and morphline logic helps do this in a user-friendly,
pluggable, efficient, pipelined manner. This issue basically boils down to transforming an
input row into a morphline record, feeding the record into the morphline processing API, and
finally converting zero or more morphline output records into corresponding Hive rows.
> Some background is here:
> http://kitesdk.org/docs/current/kite-morphlines/morphlinesReferenceGuide.html
> http://kitesdk.org/docs/current/kite-morphlines/index.html



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message