Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8B24A19DDF for ; Tue, 1 Mar 2016 11:33:22 +0000 (UTC) Received: (qmail 70409 invoked by uid 500); 1 Mar 2016 11:33:21 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 70334 invoked by uid 500); 1 Mar 2016 11:33:21 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 70324 invoked by uid 99); 1 Mar 2016 11:33:21 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 01 Mar 2016 11:33:21 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 980E91805A8 for ; Tue, 1 Mar 2016 11:33:20 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.179 X-Spam-Level: * X-Spam-Status: No, score=1.179 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id iYOvGssDGGMl for ; Tue, 1 Mar 2016 11:33:18 +0000 (UTC) Received: from mail-vk0-f52.google.com (mail-vk0-f52.google.com [209.85.213.52]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id EE7775F54E for ; Tue, 1 Mar 2016 11:33:17 +0000 (UTC) Received: by mail-vk0-f52.google.com with SMTP id c3so164157238vkb.3 for ; Tue, 01 Mar 2016 03:33:17 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to; bh=McqhwpElvVZNC/EpRCbUKJu+G4Mhj4H02UxhOSpkKT4=; b=UdNtiqxGIDtG1Gr8NESfqD5nBFa1LrM/fpBLExeoG7hbmqgboDfiX2Typ8dRszMA2Q gLdN/rra4eWDm5d4yYdd/27YDFjGOpY0hAhS2CFFuFcIA+Vr+fkOnwGHEqH4sDw5z7hh 3pKBSBiIGLCNYmMk2CuZ9NimUkM2L383xfTQiuV6fTqtCxEv5BgAy86ipyGXbIBLMh7/ EWy8Uel0XYuhg2tSSF/6qqLBRmxg15+o3zYfgnL6MqDfdkdLjVZilxkS1E6ZpwLz9wNg /R9IuRh7KIa8hLOy+UUvwZuPMnFlzGizN+6RqAnfMsm/9GDTVY5NAJs0Dcxfk2rDrys9 sypg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:date:message-id:subject:from:to; bh=McqhwpElvVZNC/EpRCbUKJu+G4Mhj4H02UxhOSpkKT4=; b=Nofm1eEF2Iyob8/+pYZ241tDFx2UBGdTVxlizY9zqHvzDGfaFmc5EmIqIBQ8nOopQV J4oel4BImaRY8nD5HCvVMsOOdXtbm0Pm0cN1mmEU0OxMNaTB8N6/Sn/RyP7QXMpF9pct iROiJfxituG3vVk+X5IQuUjZIC4U6HpMWxSakO0HguZDumZsiPOl89jtMJsu9WMe3Izr TWqytR8kYeqIjQLTyFXMXeO6mO+uzBl/xHl7rCfg3EFpw1G7puWEuuOa+7Ht/h3CNoiW TEC+94Rqr5XhRdfFfWpExDw5K25lrsnTrD1G40kRDtNDTUYYtZWZJLg/S8Vox5kaEZ7S iHzA== X-Gm-Message-State: AD7BkJJyrj/eMoN78Tja6Q1Ln9tyQ6029BjhdagMhjoNsWFPu7LC4DaS4bdr37SaYEkRfJ2Hu9n+WpUgtPWo8g== MIME-Version: 1.0 X-Received: by 10.31.58.193 with SMTP id h184mr15314479vka.111.1456831990607; Tue, 01 Mar 2016 03:33:10 -0800 (PST) Received: by 10.31.128.213 with HTTP; Tue, 1 Mar 2016 03:33:10 -0800 (PST) Date: Tue, 1 Mar 2016 11:33:10 +0000 Message-ID: Subject: Hive and Impala From: Mich Talebzadeh To: user@hive.apache.org Content-Type: multipart/alternative; boundary=001a1143ff704c6bc8052cfb21ae --001a1143ff704c6bc8052cfb21ae Content-Type: text/plain; charset=UTF-8 I have not heard of Impala anymore. I saw an article in LinkedIn titled "Apache Hive Or Cloudera Impala? What is Best for me?" "We can access all objects from Hive data warehouse with HiveQL which leverages the map-reduce architecture in background for data retrieval and transformation and this results in latency." My response was This statement is no longer valid as you have choices of three engines now with MR, Spark and Tez. I have not used Impala myself as I don't think there is a need for it with Hive on Spark or Spark using Hive metastore providing whatever needed. Hive is for Data Warehouse and provides what is says on the tin. Please also bear in mind that Hive offers ORC storage files that provide store Index capabilities further optimizing the queries with additional stats at file, stripe and row group levels. Anyway the question is with Hive on Spark or Spark using Hive metastore what we cannot achieve that we can achieve with Impala? Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw * http://talebzadehmich.wordpress.com --001a1143ff704c6bc8052cfb21ae Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
I have not heard of Impala anymore. I saw an article = in LinkedIn titled

"Apache Hive Or Cloudera I= mpala? What is Best for me?"

"We can acc= ess all objects from Hive data warehouse with HiveQL which leverages the ma= p-reduce architecture in background for data retrieval and transformation a= nd this results in latency."

My response was=

This statement is no longer valid as you have cho= ices of three engines now with MR, Spark and Tez. I have not used Impala my= self as I don't think there is a need for it with Hive on Spark or Spar= k using Hive metastore providing whatever needed. Hive is for Data Warehous= e and provides what is says on the tin. Please also bear in mind that Hive = offers ORC storage files that provide store Index capabilities further opti= mizing the queries with additional stats at file, stripe and row group leve= ls.=C2=A0

Anyway the question is with Hive on Spar= k or Spark using Hive metastore what we cannot achieve that we can achieve = with Impala?


--001a1143ff704c6bc8052cfb21ae--