Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 0E182200BF8 for ; Fri, 13 Jan 2017 23:03:29 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 0CADE160B3F; Fri, 13 Jan 2017 22:03:29 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 53F2B160B2E for ; Fri, 13 Jan 2017 23:03:28 +0100 (CET) Received: (qmail 78660 invoked by uid 500); 13 Jan 2017 22:03:27 -0000 Mailing-List: contact reviews-help@impala.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list reviews@impala.incubator.apache.org Received: (qmail 78649 invoked by uid 99); 13 Jan 2017 22:03:27 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Jan 2017 22:03:27 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id C1E1AC04BC for ; Fri, 13 Jan 2017 22:03:26 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.362 X-Spam-Level: X-Spam-Status: No, score=0.362 tagged_above=-999 required=6.31 tests=[RDNS_DYNAMIC=0.363, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id gtC_9ynbOpPg for ; Fri, 13 Jan 2017 22:03:25 +0000 (UTC) Received: from ip-10-146-233-104.ec2.internal (ec2-75-101-130-251.compute-1.amazonaws.com [75.101.130.251]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 83E455F3A1 for ; Fri, 13 Jan 2017 22:03:25 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by ip-10-146-233-104.ec2.internal (8.14.4/8.14.4) with ESMTP id v0DM3IVt001410; Fri, 13 Jan 2017 22:03:18 GMT Message-Id: <201701132203.v0DM3IVt001410@ip-10-146-233-104.ec2.internal> Date: Fri, 13 Jan 2017 22:03:17 +0000 From: "David Knupp (Code Review)" To: Dimitris Tsirogiannis , impala-cr@cloudera.com, reviews@impala.incubator.apache.org CC: Jim Apple , Harrison Sheinblatt Reply-To: dknupp@cloudera.com X-Gerrit-MessageType: newpatchset Subject: =?UTF-8?Q?=5BImpala-ASF-CR=5D_IMPALA-4482=3A_Use_ALTER_TABLE_/_RECOVER_PARTITIONS_when_loading_tpcds=2Estore_sales=0A?= X-Gerrit-Change-Id: Iaae97d1d44201aeeacacdd39adbae35753512950 X-Gerrit-ChangeURL: X-Gerrit-Commit: 1782782e1055e9678577f9382aae3e427cf6c976 In-Reply-To: References: MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Content-Disposition: inline User-Agent: Gerrit/2.12.2 archived-at: Fri, 13 Jan 2017 22:03:29 -0000 Hello Internal Jenkins, Dimitris Tsirogiannis, I'd like you to reexamine a change. Please visit http://gerrit.cloudera.org:8080/5177 to look at the new patch set (#5). Change subject: IMPALA-4482: Use ALTER TABLE / RECOVER PARTITIONS when loading tpcds.store_sales ...................................................................... IMPALA-4482: Use ALTER TABLE / RECOVER PARTITIONS when loading tpcds.store_sales This patch changes the way we load tpcds.store_sales test data. Before this, we were relying on a force_reload to build the table partitions based upon the data that had been copied over to HDFS from the warehouse snapshot. This worked on the local mini-cluster, but for some reason, it was selectively duplicating data when run on a remote cluster. This patch doesn't solve the mystery of why data duplication occurs on remote clusters, but it does resolve the immediate concern of loading test data by using Impala's recover partitions feature to automatically recognize the partitions in the HDFS directories. We just needed to add an ALTER TABLE store_sales RECOVER PARTITIONS to the tpcds schema template file. Tested by dropping the tpcds table on from a remote cluster setup, reloading the table, and running the tests in test_tpcds_queries.py. Tests that had been failng before are now passing. Change-Id: Iaae97d1d44201aeeacacdd39adbae35753512950 --- M testdata/datasets/tpcds/tpcds_schema_template.sql 1 file changed, 6 insertions(+), 0 deletions(-) git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/77/5177/5 -- To view, visit http://gerrit.cloudera.org:8080/5177 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: newpatchset Gerrit-Change-Id: Iaae97d1d44201aeeacacdd39adbae35753512950 Gerrit-PatchSet: 5 Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-Owner: David Knupp Gerrit-Reviewer: David Knupp Gerrit-Reviewer: Dimitris Tsirogiannis Gerrit-Reviewer: Harrison Sheinblatt Gerrit-Reviewer: Internal Jenkins Gerrit-Reviewer: Jim Apple