Date: Thu, 19 Nov 2015 18:59:11 +0000 (UTC)
From: "Reynold Xin (JIRA)"
To: issues@spark.apache.org
Subject: [jira] [Commented] (SPARK-11850) Spark StdDev/Variance defaults are incompatible with Hive

    [ https://issues.apache.org/jira/browse/SPARK-11850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15014145#comment-15014145 ]

Reynold Xin commented on SPARK-11850:
-------------------------------------

Yup we decided that Hive was weird and not the example to follow.

> Spark StdDev/Variance defaults are incompatible with Hive
> ---------------------------------------------------------
>
>                 Key: SPARK-11850
>                 URL: https://issues.apache.org/jira/browse/SPARK-11850
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>    Affects Versions: 1.6.0
>            Reporter: Herman van Hovell
>
> The {{stddev}} and {{variance}} functions currently default to the 'sample' version, whereas Hive uses the 'population' version. See:
> * https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-Built-inAggregateFunctions(UDAF)
> * https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/FunctionRegistry.scala#L192-L196
> Is this on purpose? Or by accident?
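
For readers unfamiliar with the distinction the issue raises, here is a minimal sketch of 'sample' versus 'population' variance and standard deviation. It uses Python's standard statistics module purely for illustration; it is not Spark or Hive code, and the data values are made up.

```python
import math
import statistics

# Hypothetical data, chosen only so the arithmetic is easy to check by hand.
values = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

# 'Sample' versions divide the sum of squared deviations by (n - 1).
sample_var = statistics.variance(values)
sample_std = statistics.stdev(values)

# 'Population' versions divide the same sum by n.
population_var = statistics.pvariance(values)
population_std = statistics.pstdev(values)

print("sample variance:    ", sample_var)
print("population variance:", population_var)
print("sample stddev:      ", sample_std)
print("population stddev:  ", population_std)
```

Because the two definitions differ only in the divisor, the same query can return different numbers depending on which one an engine treats as the default, and the gap is largest for small groups.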