hadoop-hive-dev mailing list archives

From "Edward Capriolo (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-599) Embedded Hive SQL into Python
Date Wed, 15 Jul 2009 18:56:14 GMT

    [ https://issues.apache.org/jira/browse/HIVE-599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12731625#action_12731625 ]

Edward Capriolo commented on HIVE-599:

I am not saying that Python bindings are a bad idea, but I do not understand what would be
achieved with Python that could not be achieved by running a standalone Java program. I read
some of the links above, and it looks like 'langlets' are Python apps that abstract the input and
output streams of external processes into useful Python functions.

Should all the langlets just be wrappers around 'hivelets'?

For example, a 'hivelet' might use the query API to run 'show tables', or I might just use
the metastore API to read the tables. So in my implementation of langlets, we do most of the
heavy lifting in Java. This is what I was getting at when I said 'simple java overlay'.
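
A minimal sketch of the langlet-over-hivelet idea described above, assuming a hypothetical standalone Java program packaged as hivelet.jar that prints one table name per line. The jar name, its "show-tables" argument, and the Python function names are illustrative assumptions only; nothing like them ships with Hive.

```python
import subprocess
from typing import List, Sequence

# Hypothetical command line for a standalone Java "hivelet". Both hivelet.jar
# and the "show-tables" argument are assumptions made for illustration.
HIVELET_CMD = ["java", "-jar", "hivelet.jar", "show-tables"]


def parse_table_names(output: str) -> List[str]:
    """Turn the hivelet's stdout (one table name per line) into a list."""
    return [line.strip() for line in output.splitlines() if line.strip()]


def show_tables(cmd: Sequence[str] = HIVELET_CMD) -> List[str]:
    """Langlet: run the Java hivelet and expose its output as a Python list.

    The heavy lifting (query API or metastore access) stays in the Java
    process; Python only wraps its output stream.
    """
    result = subprocess.run(
        list(cmd), capture_output=True, text=True, check=True
    )
    return parse_table_names(result.stdout)
```

With this split, the Python side never talks to Hive directly, so the same hivelet could back wrappers in other languages.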

> Embedded Hive SQL into Python
> -----------------------------
>                 Key: HIVE-599
>                 URL: https://issues.apache.org/jira/browse/HIVE-599
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Ashish Thusoo
>            Assignee: Ashish Thusoo
> While Hive does SQL, it would be very powerful to be able to embed that SQL in languages
> like Python in such a way that the Hive query is also able to invoke Python functions seamlessly.
> One possibility is to explore integration with Dumbo. Another is to see if the internal map_reduce.py
> tool can be open sourced as a Hive contrib.
> Other thoughts?

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
