hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "He Yongqiang (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-1624) Patch to allows scripts in S3 location
Date Mon, 13 Sep 2010 11:40:33 GMT

    [ https://issues.apache.org/jira/browse/HIVE-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12908729#action_12908729
] 

He Yongqiang commented on HIVE-1624:
------------------------------------

S3 -> client -> cluster maybe better than directly downloading the script from S3 to
TaskTracker node.
There may be thousands of concurrent downloading request to S3 for downloading a script. (I
agree that the script can be cached in local machine, but right now hive does not do any cache
clean up).
S3 -> client -> cluster will be able to use hadoop distributed cache.

> Patch to allows scripts in S3 location
> --------------------------------------
>
>                 Key: HIVE-1624
>                 URL: https://issues.apache.org/jira/browse/HIVE-1624
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Vaibhav Aggarwal
>         Attachments: HIVE-1624.patch
>
>
> I want to submit a patch which allows user to run scripts located in S3.
> This patch enables Hive to download the hive scripts located in S3 buckets and execute
them. This saves users the effort of copying scripts to HDFS before executing them.
> Thanks
> Vaibhav

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message