Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id B8023200BBD for ; Tue, 8 Nov 2016 17:52:25 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id B6B04160B0A; Tue, 8 Nov 2016 16:52:25 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 0A190160AD0 for ; Tue, 8 Nov 2016 17:52:24 +0100 (CET) Received: (qmail 35859 invoked by uid 500); 8 Nov 2016 16:52:24 -0000 Mailing-List: contact reviews-help@impala.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list reviews@impala.incubator.apache.org Received: (qmail 35840 invoked by uid 99); 8 Nov 2016 16:52:23 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Nov 2016 16:52:23 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 63A131A9B33 for ; Tue, 8 Nov 2016 16:52:23 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.363 X-Spam-Level: X-Spam-Status: No, score=0.363 tagged_above=-999 required=6.31 tests=[RDNS_DYNAMIC=0.363, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id 3YOWl_Ic1NyP for ; Tue, 8 Nov 2016 16:52:21 +0000 (UTC) Received: from ip-10-146-233-104.ec2.internal (ec2-75-101-130-251.compute-1.amazonaws.com [75.101.130.251]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 5E1A35FE24 for ; Tue, 8 Nov 2016 16:52:21 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by ip-10-146-233-104.ec2.internal (8.14.4/8.14.4) with ESMTP id uA8GqFnT014456; Tue, 8 Nov 2016 16:52:15 GMT Message-Id: <201611081652.uA8GqFnT014456@ip-10-146-233-104.ec2.internal> Date: Tue, 8 Nov 2016 16:52:15 +0000 From: "Matthew Jacobs (Code Review)" To: impala-cr@cloudera.com, reviews@impala.incubator.apache.org CC: Alex Behm Reply-To: mj@cloudera.com X-Gerrit-MessageType: comment Subject: =?UTF-8?Q?=5BImpala-ASF-CR=5D_IMPALA-3710=3A_Kudu_DML_should_ignore_conflicts=2C_pt2=0A?= X-Gerrit-Change-Id: I4eb1ad91dc355ea51de261c3a14df0f9d28c879c X-Gerrit-ChangeURL: X-Gerrit-Commit: a8d301192e80b70f0bbd8810ff5ecd8df895c343 In-Reply-To: References: MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Content-Disposition: inline User-Agent: Gerrit/2.12.2 archived-at: Tue, 08 Nov 2016 16:52:25 -0000 Matthew Jacobs has posted comments on this change. Change subject: IMPALA-3710: Kudu DML should ignore conflicts, pt2 ...................................................................... Patch Set 1: (7 comments) http://gerrit.cloudera.org:8080/#/c/4985/1/be/src/exec/data-sink.cc File be/src/exec/data-sink.cc: Line 119: if (!dst_stats->__isset.kudu_stats) dst_stats->__set_kudu_stats(TKuduDmlStats()); > pass 0 in c'tor of TKuduDmlStats? I can't bc there's no parameter, I think because I use optional fields. Line 161: ss << "NumRowErrors: " << stats.kudu_stats.num_row_errors << endl; > General question: Is there an easy way to distinguish the different types o It is unfortunate and probably not helpful enough, but I'm very worried about us presenting incorrect information to the user which I think is worse. The issue is that we can't trust the Kudu error codes enough. E.g. they re-use error codes for some different cases where I'm not confident enough that we'll always get it right. My goal is to work with them for the next release to get a more concrete contract. I was on the fence about having different Impala error codes at all, but I think it's worthwhile for de-duping the logs. If you think this is confusing, I'd prefer to just remove the warning logging w/ different error codes. http://gerrit.cloudera.org:8080/#/c/4985/1/be/src/exec/kudu-table-sink.cc File be/src/exec/kudu-table-sink.cc: Line 214: COUNTER_ADD(num_row_errors_, 1); > This is expensive right? How about we update a local counter and COUNTER_AD Done Line 305: if (!e.IsNotFound() && !e.IsAlreadyPresent() && status.ok()) { > Why not move to L325 into an else if? Done Line 310: // Kudu does not have yet have a way to programmatically differentiate between 'row > few words mixed up Done http://gerrit.cloudera.org:8080/#/c/4985/1/common/thrift/ImpalaInternalService.thrift File common/thrift/ImpalaInternalService.thrift: Line 430: // TODO: Refactor to reflect usage by other DML statements. > Do you intend to do that soon? The structures are a little confusing now. Yeah I'll go ahead and do this if you think the strategy is otherwise OK. Line 445: // The latest observed Kudu timestamp reported by the KuduSession at this partition. > is partition == table here? Hm yeah I guess it's not great to call this a partition... Table I think might be confusing too since there are many of these inserting across the cluster for the same table. How about I just remove 'at this partition'? -- To view, visit http://gerrit.cloudera.org:8080/4985 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: comment Gerrit-Change-Id: I4eb1ad91dc355ea51de261c3a14df0f9d28c879c Gerrit-PatchSet: 1 Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-Owner: Matthew Jacobs Gerrit-Reviewer: Alex Behm Gerrit-Reviewer: Matthew Jacobs Gerrit-HasComments: Yes