From: Jay Vyas
Subject: Re: Hive, datanucleus, jdbc, localmode.
Date: Sat, 28 Dec 2013 20:09:54 -0500
To: "user@hive.apache.org"

-Local mode should have clear instructions on how to run fully local hive jobs, with no hadoop installation.

-I like the hive_test repo but I'm not yet sure hive_test is 100% up to date with the simplest strategy for testing hive workflows on the JVM.


On Dec 28, 2013, at 4:19 PM, Lefty Leverenz <leftyleverenz@gmail.com> wrote:

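This sounds like something the documentation should cover. What information should be added to the Local Mode section? Should the wiki have a link to hive_test (for example, in Hive Developer FAQ)?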

-- Lefty


On Sat, Dec 28, 2013 at 8:02 AM, Edward Capriolo <edlinuxguru@gmail.com> wrote:
I do not think so. Local mode just implies that the job tracker is local (along with some of the temp storage directories); it does not imply that Hive will use Hadoop without forking.
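
For readers following along, here is a minimal sketch of the kind of settings usually meant by "local mode": a local job tracker plus local scratch, warehouse and metastore directories. The property names are the Hadoop 1.x / Hive 0.x era ones. The class name `LocalModeSettings` and the directory layout under `workDir` are hypothetical and only for illustration. As noted above, setting these does not by itself stop Hive from forking bin/hadoop.

```java
import java.nio.file.Path;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper that collects settings commonly associated with Hive local mode. */
final class LocalModeSettings {

    private LocalModeSettings() {}

    /**
     * Builds the settings for a given working directory.
     * The sub-directory names (scratch, warehouse, metastore_db) are assumptions of this sketch.
     */
    static Map<String, String> forWorkDir(Path workDir) {
        Path base = workDir.toAbsolutePath().normalize();
        Map<String, String> conf = new LinkedHashMap<>();
        // Run MapReduce in-process instead of submitting to a remote job tracker.
        conf.put("mapred.job.tracker", "local");
        // Use the local filesystem instead of HDFS.
        conf.put("fs.default.name", "file:///");
        // Keep Hive's temporary and warehouse data on local disk.
        conf.put("hive.exec.scratchdir", base.resolve("scratch").toString());
        conf.put("hive.metastore.warehouse.dir", base.resolve("warehouse").toString());
        // Embedded Derby metastore stored under the working directory.
        conf.put("javax.jdo.option.ConnectionURL",
                "jdbc:derby:;databaseName=" + base.resolve("metastore_db") + ";create=true");
        return conf;
    }
}
```

Calling `LocalModeSettings.forWorkDir(Paths.get("target", "hive-local"))` yields a map that could be written into a test hive-site.xml or applied to a configuration object; how it is applied depends on the Hive version in use.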


On Sat, Dec 28, 2013 at 10:43 AM, Jay Vyas <jayunit100@gmail.com> wrote:
Thanks... But are you sure this is the only way? Or is there some magic way to run hive in local mode that we both are missing out on ?:)...

- isn't hive in local mode supposed to be run simply via the jdbc://hive URL which runs local mode... Or maybe by the fork config parameter?

- For example, see the parameters in this file:

https://github.com/riptano/brisk/blob/master/resources/hive/conf/hive-site.xml


On Dec 28, 2013, at 10:22 AM, Edward Capriolo <edlinuxguru@gmail.com> wrote:

You can follow along to what I do here.

https://github.com/edwardcapriolo/hive_test

Essentially, Hive requires a HADOOP_HOME because it always wants to fork a bin/hadoop process. hive_test helps you unpack Hadoop inside target and change your HADOOP_HOME to some other directory.

It would be nice if there was some other way to do this.
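
Since the forked process is bin/hadoop under HADOOP_HOME, a test setup can at least fail fast when that launcher is missing. The sketch below is one way to check this; the class name `HadoopHomeCheck` is hypothetical, and it assumes a Unix-style layout where the launcher is a file named `hadoop` in `bin`.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Optional;

/** Hypothetical pre-flight check for the bin/hadoop launcher that Hive forks. */
final class HadoopHomeCheck {

    private HadoopHomeCheck() {}

    /** Returns true if home/bin/hadoop exists as a regular file. */
    static boolean hasHadoopLauncher(Path home) {
        return home != null && Files.isRegularFile(home.resolve("bin").resolve("hadoop"));
    }

    /** Returns HADOOP_HOME from the environment if it is set and contains bin/hadoop. */
    static Optional<Path> fromEnvironment() {
        String value = System.getenv("HADOOP_HOME");
        if (value == null || value.isEmpty()) {
            return Optional.empty();
        }
        Path home = Paths.get(value);
        return hasHadoopLauncher(home) ? Optional.of(home) : Optional.empty();
    }
}
```

A test harness could call `HadoopHomeCheck.fromEnvironment()` before starting any Hive session and skip or fail with a clear message when the result is empty.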


On Fri, Dec 27, 2013 at 10:27 PM, Jay Vyas <jayunit100@gmail.com> wrote:
Hi Hive:

I'm attempting to create a robust Eclipse-based dev environment for testing my Hive jobs in local mode; however, I run into ClassNotFound errors depending on which Maven dependencies I use. Also, it seems that when I change these dependencies from Hive 0.12 to Hive 0.11, I get other errors related to Hive trying to launch jobs by calling /usr/bin/hadoop.

Thus I am stuck: I can't run Hive 0.12 in local Java mode because of subtle DataNucleus class and API inconsistencies which are tough to resolve, and when going to Hive 0.11, it seems local mode is not natively detected via the jdbc URL...
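
When chasing class inconsistencies like these, it can help to list every classpath location that provides a given class; more than one hit, or none, usually points at conflicting or missing Maven dependencies. The following is a minimal diagnostic sketch, not anything specific to Hive; the class name `ClasspathProbe` is hypothetical, and which DataNucleus class to probe (for example `org.datanucleus.api.jdo.JDOPersistenceManagerFactory`) is an assumption that depends on the versions involved.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.URL;
import java.util.Collections;
import java.util.List;

/** Hypothetical diagnostic: lists every classpath location that provides a class. */
final class ClasspathProbe {

    private ClasspathProbe() {}

    /** Returns the URLs of all .class resources matching the fully qualified class name. */
    static List<URL> locations(String className) {
        String resource = className.replace('.', '/') + ".class";
        ClassLoader loader = Thread.currentThread().getContextClassLoader();
        if (loader == null) {
            loader = ClassLoader.getSystemClassLoader();
        }
        try {
            // getResources returns every match visible to the loader, not just the first.
            return Collections.list(loader.getResources(resource));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Printing `ClasspathProbe.locations(name)` from inside the Eclipse run configuration shows which jars the class is coming from there, which may differ from what Maven resolves on the command line.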

So I have 2 questions:

0) How do Hive 0.12 and 0.11 implement local mode differently?

And

1) What is the right way to run Hive in pure Java / local environments?

The Hive book suggests modifying configuration properties for local mode...

but I have also found that in Hive 0.12, using the jdbc://hive connection URL automagically launches jobs in local mode...

However, in 0.11, I see calls to /usr/bin/hadoop when running Java classes in a local Eclipse environment.

Thanks!

FYI, to see an example of my pom.xml, you can check out the github://jayunit100/bigpetstore pom.xml file.


