Return-Path: Delivered-To: apmail-hadoop-core-commits-archive@www.apache.org Received: (qmail 36514 invoked from network); 19 May 2009 15:52:29 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 19 May 2009 15:52:29 -0000 Received: (qmail 26233 invoked by uid 500); 19 May 2009 15:52:28 -0000 Delivered-To: apmail-hadoop-core-commits-archive@hadoop.apache.org Received: (qmail 26181 invoked by uid 500); 19 May 2009 15:52:28 -0000 Mailing-List: contact core-commits-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-commits@hadoop.apache.org Received: (qmail 26172 invoked by uid 99); 19 May 2009 15:52:28 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 19 May 2009 15:52:28 +0000 X-ASF-Spam-Status: No, hits=-1999.1 required=10.0 tests=ALL_TRUSTED,URIBL_RHS_DOB X-Spam-Check-By: apache.org Received: from [140.211.11.130] (HELO eos.apache.org) (140.211.11.130) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 19 May 2009 15:52:18 +0000 Received: from eos.apache.org (localhost [127.0.0.1]) by eos.apache.org (Postfix) with ESMTP id 98230118BF for ; Tue, 19 May 2009 15:51:57 +0000 (GMT) Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit From: Apache Wiki To: core-commits@hadoop.apache.org Date: Tue, 19 May 2009 15:51:57 -0000 Message-ID: <20090519155157.24697.65539@eos.apache.org> Subject: [Hadoop Wiki] Update of "Hive/HiveAws/HivingS3nRemotely" by JoydeepSensarma X-Virus-Checked: Checked by ClamAV on apache.org Dear Wiki user, You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification. The following page has been changed by JoydeepSensarma: http://wiki.apache.org/hadoop/Hive/HiveAws/HivingS3nRemotely ------------------------------------------------------------------------------ location 's3n://data.s3ndemo.hive/tpch/lineitem'; }}} - The TPCH DDL statements are slightly modified versions of the original TPCH statements (since Hive does not support all the data types used in TPCH). + The TPCH DDL statements are slightly modified versions of the original TPCH statements (since Hive does not support all the data types used in TPCH). All the TPCH DDL statements for Hive can be be found [[attachment:TpchDdlForHive.sql here]] == Executing Queries == Hive can execute some queries without a Hadoop cluster. For example: @@ -117, +117 @@ The socket related options allow Hive CLI to communicate with the Hadoop cluster using a ssh tunnel (that will be established later). The job.ugi is specified to avoid issues with permissions on HDFS. {{{mapred.map.tasks}}} specification is a hack that works around [[https://issues.apache.org/jira/browse/HADOOP-5861 HADOOP-5861]] and may need to be set higher for large clusters. {{{mapred.reduce.tasks}}} is specified to let Hive determine the number of reducers (see [[https://issues.apache.org/jira/browse/HIVE-490 HIVE-490]]). === Links === - * [[../../HiveAws Hive and AWS]] presents general landscape and alternative on running Hive queries in AWS. + * [:../: Hive and AWS] presents general landscape and alternative on running Hive queries in AWS. * [[http://jsensarma.com/blog/2009/05/hive-hadoop-s3-ec2-it-works On issues and lessons learned during this integration effort]]