Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 0486C200C42 for ; Sat, 18 Feb 2017 01:16:28 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 01C12160B57; Sat, 18 Feb 2017 00:16:28 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 5275A160B6D for ; Sat, 18 Feb 2017 01:16:27 +0100 (CET) Received: (qmail 55485 invoked by uid 500); 18 Feb 2017 00:16:26 -0000 Mailing-List: contact reviews-help@impala.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list reviews@impala.incubator.apache.org Received: (qmail 55471 invoked by uid 99); 18 Feb 2017 00:16:26 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 18 Feb 2017 00:16:26 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id B01F01A79F2 for ; Sat, 18 Feb 2017 00:16:25 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.362 X-Spam-Level: X-Spam-Status: No, score=0.362 tagged_above=-999 required=6.31 tests=[RDNS_DYNAMIC=0.363, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id 536I1grcQq1L for ; Sat, 18 Feb 2017 00:16:25 +0000 (UTC) Received: from ip-10-146-233-104.ec2.internal (ec2-75-101-130-251.compute-1.amazonaws.com [75.101.130.251]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id B9EEC5F613 for ; Sat, 18 Feb 2017 00:16:24 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by ip-10-146-233-104.ec2.internal (8.14.4/8.14.4) with ESMTP id v1I0GNDW014950; Sat, 18 Feb 2017 00:16:23 GMT Message-Id: <201702180016.v1I0GNDW014950@ip-10-146-233-104.ec2.internal> Date: Sat, 18 Feb 2017 00:16:23 +0000 From: "Alex Behm (Code Review)" To: Michael Ho , impala-cr@cloudera.com, reviews@impala.incubator.apache.org CC: Dan Hecht Reply-To: alex.behm@cloudera.com X-Gerrit-MessageType: comment Subject: =?UTF-8?Q?=5BImpala-ASF-CR=5D_IMPALA-4821=3A_Update_AVG=28=29_for_DECIMAL_V2=0A?= X-Gerrit-Change-Id: I28f5ef0370938440eb5b1c6d29b2f24e6f88499f X-Gerrit-ChangeURL: X-Gerrit-Commit: 86782feba82929bb96a945c288d4417ca807182c In-Reply-To: References: MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Content-Disposition: inline User-Agent: Gerrit/2.12.7 archived-at: Sat, 18 Feb 2017 00:16:28 -0000 Alex Behm has posted comments on this change. Change subject: IMPALA-4821: Update AVG() for DECIMAL_V2 ...................................................................... Patch Set 3: (3 comments) http://gerrit.cloudera.org:8080/#/c/6038/3//COMMIT_MSG Commit Message: Line 12: 2. precision will be augmented if the scale is adjusted. In addition to my other comment, I think we should mention some of the limitations here, for example, that it won't be possible to compute the AVG() of some decimal types at all. http://gerrit.cloudera.org:8080/#/c/6038/3/fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java File fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java: Line 337: // an implicit divide. This is similar to the MS SQL server's behavior which takes Suggest these minor fixes and reordering: AVG() gets at least MIN_ADJUSTED_SCALE decimal places since it performs an implicit divide. However, the output type is not always the same as SUM()/COUNT(). Our behavior is similar to MS SQL Server's which takes the max of the input's scale and MIN_ADJUSTED_SCALE. The observations are ok, but at a high level: * the type is not the same as SUM()/COUNT() * the behavior is not the same as MS SQL Server's It would be good to provide a justification/intuition for why we chose yet another behavior (also in the commit msg). http://gerrit.cloudera.org:8080/#/c/6038/3/testdata/workloads/functional-query/queries/QueryTest/decimal-exprs.test File testdata/workloads/functional-query/queries/QueryTest/decimal-exprs.test: Line 208: ==== We should also add tests for those decimal types where AVG() cannot be computed (because we always overflow). -- To view, visit http://gerrit.cloudera.org:8080/6038 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: comment Gerrit-Change-Id: I28f5ef0370938440eb5b1c6d29b2f24e6f88499f Gerrit-PatchSet: 3 Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-Owner: Michael Ho Gerrit-Reviewer: Alex Behm Gerrit-Reviewer: Dan Hecht Gerrit-Reviewer: Michael Ho Gerrit-HasComments: Yes