Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9F1BB185AB for ; Thu, 24 Sep 2015 00:11:23 +0000 (UTC) Received: (qmail 83087 invoked by uid 500); 24 Sep 2015 00:11:16 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 83002 invoked by uid 500); 24 Sep 2015 00:11:16 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 82990 invoked by uid 99); 24 Sep 2015 00:11:16 -0000 Received: from Unknown (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Sep 2015 00:11:16 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 1C6A6188600 for ; Thu, 24 Sep 2015 00:11:16 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.999 X-Spam-Level: ** X-Spam-Status: No, score=2.999 tagged_above=-999 required=6.31 tests=[HTML_MESSAGE=3, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id kk0NKobWkL7s for ; Thu, 24 Sep 2015 00:11:08 +0000 (UTC) Received: from mail-qg0-f47.google.com (mail-qg0-f47.google.com [209.85.192.47]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 20F4920F5E for ; Thu, 24 Sep 2015 00:11:07 +0000 (UTC) Received: by qgx61 with SMTP id 61so32914945qgx.3 for ; Wed, 23 Sep 2015 17:11:06 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-type; bh=dVqrlIFIhLRvPE/s4QcLFJmYpj+VO8aCIbjOHDB1O/k=; b=TluncnIYWCdRvRSt1Dq5ZT3pSD3ZQKLW2NnIAPKDXXXymHA85z8XlOjq54ZZjGiW5X 7wzqmun7R66+r29Lpj81i3juo/U3Yxz49ybE8YKRlrCV3v4zX96yKedAnyyGIfsXcuua +HHoxHsJQvGiEw2+S16iIfzz6SfgrLrMNx6VjVt0jfmdNvLwiB0hQoBSBzDsbU5tBxff t2l/c95Bgg+0jPIsOflQ103HwXkTKGzZ+JHOmP6h3j9utg8ZcERVRsA8IUEMgFZQSg/A 5/V+CEtPHI71xlvjJLHKhy7SM4IM7JJSGWcNs6UONtUgH2OK5U7P/dHFB3kVXPt/ZM4p i5eA== X-Gm-Message-State: ALoCoQnHUOzZxMkuxQJjVhZ1hjkl/PTGC3xaGX1YAPiybYV+OX2ca4838yxsqwq5HFLmHOGw8CJH X-Received: by 10.140.33.225 with SMTP id j88mr39429729qgj.30.1443053466119; Wed, 23 Sep 2015 17:11:06 -0700 (PDT) MIME-Version: 1.0 Received: by 10.140.44.10 with HTTP; Wed, 23 Sep 2015 17:10:46 -0700 (PDT) In-Reply-To: References: From: Szehon Ho Date: Wed, 23 Sep 2015 17:10:46 -0700 Message-ID: Subject: Re: HiveQA is getting annoyingly slow To: "dev@hive.apache.org" Content-Type: multipart/alternative; boundary=001a113abd103d9a450520731174 --001a113abd103d9a450520731174 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Thanks for the investigation, its very helpful. I think its a good idea to disable the most highly offending tests as you suggested if they are testing features that aren't heavily used (rcfile_merge1 and gbtoidx). I wasnt aware we are dropping support on MR anytime soon though? I think removing those two offending tests (or any other we identify) would be low-hanging fruit, on our end we'll also see whether we can play around with the build's configured batch sizes of MiniMR tests (unit of parallelism in the PTest) and see if we can get it down further. Hope that works? Thanks Szehon On Wed, Sep 23, 2015 at 4:28 PM, Sergey Shelukhin wrote: > HiveQA is taking too long to run. > I browsed test results a bit, the main offenders are obviously various > CliDrivers. > I think there=E2=80=99s a JIRA to speed up Tez CLI driver that is being w= orked on; > and Spark and HBase have tolerable runtimes. > > That leaves us base CliDriver and MR. > Base tests generally take 0-30seconds, 1-2 minutes at most, but there are > some ridiculous test runtimes (these are fairly consistent between runs): > testCliDriver_rcfile_merge1 31 min > testCliDriver_escape2 13 min > testCliDriver_escape1 8 min 10 sec > testCliDriver_dynpart_sort_opt_vectorization 4 min 47 se= c > testCliDriver_unionDistinct_1 4 min 32 sec > testCliDriver_dynpart_sort_optimization 4 min 2 sec > testCliDriver_rcfile_merge2 3 min 55 sec > testCliDriver_vector_leftsemi_mapjoin 3 min 53 sec > testCliDriver_archive_excludeHadoop20 3 min 13 sec > > > If we remove or rein in 3 tests the testCliDriver runtime will go down by > almost an hour. > Anyone particularly attached to rcfile tests? It=E2=80=99s all good to te= st > rcfile, but it=E2=80=99s a rarely use format with Avro, ORC and Parquet s= eemingly > having taken over (not speaking of Text), the test should not take half a= n > hour. I suggest we disable this test (rcfile_merge1) and file a JIRA to > investigate its perf if someone feels it=E2=80=99s important. > Another work item is to look at why escape tests take so long, it should > be a simple thing to test, not 21 minutes aggregate (most test finish in > 0-2 minutes). > > Then, MiniMR test takes 2 hours. Some GBY index specific tests are the > worst offenders (gbtoidx), to the tune of 35mins for 3 tests; as well as > smb_mapjoin for 15mins. > Since the plan was to drop MR support on master, how about starting by no= t > running these long MR tests and deprecating MR engine, while still keepin= g > it around before the task of removing it. > > --001a113abd103d9a450520731174--