From: Ashok Kumar
To: user@hive.apache.org
Date: Tue, 1 Mar 2016 21:38:52 +0000 (UTC)
Subject: Re: Hive and Impala
Message-ID: <1892683924.2956880.1456868332338.JavaMail.yahoo@mail.yahoo.com>

Dr Mich,

My two cents here.

I don't have direct experience of Impala, but in my humble opinion I share your view that Hive provides the best metastore of all Big Data systems. Looking around, almost every product in one form or another uses Hive code somewhere. My colleagues inform me that Hive is one of the most stable Big Data products.

With the capabilities of Spark on Hive and Hive on Spark or Tez, plus of course MR, there is really little need for many other products in the same space. It is good to keep things simple.

Warmest

On Tuesday, 1 March 2016, 11:33, Mich Talebzadeh <mich.talebzadeh@gmail.com> wrote:

I have not heard of Impala anymore. I saw an article on LinkedIn titled

"Apache Hive Or Cloudera Impala? What is Best for me?"

"We can access all objects from Hive data warehouse with HiveQL which leverages the map-reduce architecture in background for data retrieval and transformation and this results in latency."

My response was

This statement is no longer valid, as you now have a choice of three engines: MR, Spark and Tez. I have not used Impala myself, as I don't think there is a need for it, with Hive on Spark or Spark using the Hive metastore providing whatever is needed. Hive is for Data Warehouse and provides what it says on the tin. Please also bear in mind that Hive offers ORC storage files that provide store index capabilities, further optimizing queries with additional stats at file, stripe and row group levels.

Anyway, the question is: with Hive on Spark or Spark using the Hive metastore, what can we not achieve that we can achieve with Impala?

Dr Mich Talebzadeh
LinkedIn https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
http://talebzadehmich.wordpress.com
