Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 5C7A9200BEB for ; Wed, 14 Dec 2016 04:05:02 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 5B1E8160B31; Wed, 14 Dec 2016 03:05:02 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id A255D160B23 for ; Wed, 14 Dec 2016 04:05:01 +0100 (CET) Received: (qmail 78825 invoked by uid 500); 14 Dec 2016 03:05:00 -0000 Mailing-List: contact reviews-help@impala.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list reviews@impala.incubator.apache.org Received: (qmail 78806 invoked by uid 99); 14 Dec 2016 03:05:00 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 14 Dec 2016 03:05:00 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 2AFB7D000F for ; Wed, 14 Dec 2016 03:05:00 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.362 X-Spam-Level: X-Spam-Status: No, score=0.362 tagged_above=-999 required=6.31 tests=[RDNS_DYNAMIC=0.363, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id apEGsLBLgKvu for ; Wed, 14 Dec 2016 03:04:57 +0000 (UTC) Received: from ip-10-146-233-104.ec2.internal (ec2-75-101-130-251.compute-1.amazonaws.com [75.101.130.251]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id D63E260E3B for ; Wed, 14 Dec 2016 03:04:56 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by ip-10-146-233-104.ec2.internal (8.14.4/8.14.4) with ESMTP id uBE34nvl003252; Wed, 14 Dec 2016 03:04:49 GMT Message-Id: <201612140304.uBE34nvl003252@ip-10-146-233-104.ec2.internal> Date: Wed, 14 Dec 2016 03:04:49 +0000 From: "Taras Bobrovytsky (Code Review)" To: impala-cr@cloudera.com, reviews@impala.incubator.apache.org CC: Matthew Jacobs , Alex Behm , Michael Brown Reply-To: tbobrovytsky@cloudera.com X-Gerrit-MessageType: comment Subject: =?UTF-8?Q?=5BImpala-ASF-CR=5D_IMPALA-4467=3A_Add_support_for_DML_statements_in_stress_test=0A?= X-Gerrit-Change-Id: Ia2aafdc6851cc0e1677a3c668d3350e47c4bfe40 X-Gerrit-ChangeURL: X-Gerrit-Commit: 02817f0db26ec60033f062ba3c7f0c6e5ee729bb In-Reply-To: References: MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Content-Disposition: inline User-Agent: Gerrit/2.12.2 archived-at: Wed, 14 Dec 2016 03:05:02 -0000 Taras Bobrovytsky has posted comments on this change. Change subject: IMPALA-4467: Add support for DML statements in stress test ...................................................................... Patch Set 4: (13 comments) http://gerrit.cloudera.org:8080/#/c/5093/4//COMMIT_MSG Commit Message: Line 24: following flag: --dml-mod-values 11 13 17. For each mod value 4 DML > Seems ok for now, but from a perspective we should consider changing these Actually a mod value is equivalent to %rows: %rows ~= 100 / mod_value Maybe instead we are more interested in the absolute number of rows? So a user could specify the number of rows that should be touched by a query, and the stress test would then automatically compute what the mod value should be for a given table? http://gerrit.cloudera.org:8080/#/c/5093/4/tests/stress/concurrent_select.py File tests/stress/concurrent_select.py: Line 368: self._select_probability = select_probability > check that this is indeed a float between 0 and 1 Do you really think that it makes sense to add input validation? We are not validating other inputs in other places. Line 747: self.modifies_table = False > This flag is pretty confusing. What do we use it for? Done Line 1297: return > add comment why we need to skip compute stats Done Line 1428: if table.name + "_original" in set(table.name for table in tables): > What does it mean if this is not true? Maybe add a LOG.debug message to exp Done Line 1429: cursor.execute("SHOW CREATE TABLE " + table.name) > Add a note or TODO that this will not work for certain types of Kudu table Yes, to my knowledge at this time we only use Kudu tables with a simple hash partitioning. Added a note. How do we create tables with more complex partition schemes? Line 1440: For each table in the database that cursor is connected to, create several queries, > Mention the limitations of this function: Added the two points that you suggested. I don't really see what other limitations I can add. Line 1443: exists a table with a '_original' suffix that has unmodified, for example, tpch data. > that has unmodified -> that is never modified Done Line 1450: continue > LOG.debug what is happening here Done Line 1452: if primary_key.exact_type not in (TinyInt, SmallInt, BigInt): > what about Int? Done Line 1456: continue > LOG.debug what is happening here because some of these limitations are not Done Line 1481: "UPDATE a SET {update_list} FROM {table_name} a JOIN {table_name}_original b " > This might give you strange results for tables with multiple primary keys. Maybe it's okay to keep it as is? It can potentially result in many rows having the data, but we are not really checking results in the stress test. (The update statement duplicates rows anyways, even if there is only 1 primary key column). Line 1559: cursor.execute("SHOW CREATE TABLE " + table_name) > same deal with SHOW CREATE TABLE as above, add a note or TODO that this pro Done -- To view, visit http://gerrit.cloudera.org:8080/5093 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: comment Gerrit-Change-Id: Ia2aafdc6851cc0e1677a3c668d3350e47c4bfe40 Gerrit-PatchSet: 4 Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-Owner: Taras Bobrovytsky Gerrit-Reviewer: Alex Behm Gerrit-Reviewer: Matthew Jacobs Gerrit-Reviewer: Michael Brown Gerrit-Reviewer: Taras Bobrovytsky Gerrit-HasComments: Yes