Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8291A188B7 for ; Thu, 22 Oct 2015 21:38:31 +0000 (UTC) Received: (qmail 30499 invoked by uid 500); 22 Oct 2015 21:38:31 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 30418 invoked by uid 500); 22 Oct 2015 21:38:31 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 30405 invoked by uid 99); 22 Oct 2015 21:38:30 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 22 Oct 2015 21:38:30 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 3258D1A26F8 for ; Thu, 22 Oct 2015 21:38:30 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.101 X-Spam-Level: X-Spam-Status: No, score=-0.101 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-west.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id IwrNTN_VZBSm for ; Thu, 22 Oct 2015 21:38:26 +0000 (UTC) Received: from mail-io0-f173.google.com (mail-io0-f173.google.com [209.85.223.173]) by mx1-us-west.apache.org (ASF Mail Server at mx1-us-west.apache.org) with ESMTPS id 9FD0A2303D for ; Thu, 22 Oct 2015 21:38:25 +0000 (UTC) Received: by iow1 with SMTP id 1so105840503iow.1 for ; Thu, 22 Oct 2015 14:38:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=sKH067Mp4Lp87g5QRteUseEaiErgwSBPcKsHWf53uTU=; b=jSQf7vf73/A2Ht9pcfTDh/lhqeH7izqYiaTmbqG7i2gAWPq2hegkUSAzFrlR4n0iaP ek9Ek6zsXfF0gIEg7CVkG89OTL/OszvKjTkfREx2BK8pgq2yOLqOs5hhibpuG2xxli+T aLtrpqN+ee+K8OnU0XkyLkUIafPHBm0C5zTIu7iQUnmCM3J7zQVk+w9H1wtni8XG12hI ApLrn2HQyXGw81Vum1Q5HKXr8SnpHzekI9RJBIYuMyg7xmWkpmIblTYvQ1dIzrRGaff4 kre5sSOCTY/wz2NnP7v+lRz5JvfvjktATtW5t8Gt/bnEsXwlkrOOD8UztXuX9ucfGP7R 7apA== MIME-Version: 1.0 X-Received: by 10.107.133.157 with SMTP id p29mr22323209ioi.71.1445549905162; Thu, 22 Oct 2015 14:38:25 -0700 (PDT) Received: by 10.107.46.159 with HTTP; Thu, 22 Oct 2015 14:38:25 -0700 (PDT) Date: Thu, 22 Oct 2015 14:38:25 -0700 Message-ID: Subject: [DISCUSS] Re: deprecating MR in the first release of Hive 2.0 From: Thejas Nair To: dev Content-Type: text/plain; charset=UTF-8 (Adding [DISCUSS] to subject to bring it to attention of wider audience.) +1 Given how much investment is going into Tez and Spark execution modes, it makes sense to convey that better to the user community and recommend the use of the new modes over MR. Users who choose those modes are going to get better experience, and it will help to improve the overall perception of Hive. Once most users have moved to the new modes, we can start looking into removing MR support. (Though that is likely to take a while). On Wed, Oct 21, 2015 at 9:44 PM, Sergey Shelukhin wrote: > We have discussed the removal of hadoop-1 and MR support in Hive 2 line in the past.. > Hadoop-1 removal seems to be non-controversial and on track; before we cut the first release of Hive 2, I propose we deprecate MR. > > Tez and Spark engines provide vast perf improvements over MR; > Execution optimization work by most contributors for a long time has been done for these engines and is not portable to MR, so it is languishing further; > At the same time, supporting additional code has other development costs for new features or bugs, plus we have to run tests for it both in Apache and for local changes and to deploy code. > > However, MR is hard to remove. Plus, it may provide a baseline for some bugs in other engines (which is not bulletproof since MR logic can be incorrect), or to mock during perf benchmarks. > > Therefore, I propose that for now we add deprecation warnings suggesting the other alternatives: > > * to Hive configuration documentation. > * to Hive wiki. > * to release notes on Hive 2. > * in Beeline and CLI when using MR. > > Additionally, I propose we remove Minimr test driver from HiveQA runs for master. > > What do you think?