Return-Path: X-Original-To: apmail-spark-issues-archive@minotaur.apache.org Delivered-To: apmail-spark-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 39CEA17D45 for ; Sun, 13 Sep 2015 02:17:46 +0000 (UTC) Received: (qmail 25182 invoked by uid 500); 13 Sep 2015 02:17:45 -0000 Delivered-To: apmail-spark-issues-archive@spark.apache.org Received: (qmail 25153 invoked by uid 500); 13 Sep 2015 02:17:45 -0000 Mailing-List: contact issues-help@spark.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@spark.apache.org Received: (qmail 25142 invoked by uid 99); 13 Sep 2015 02:17:45 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 13 Sep 2015 02:17:45 +0000 Date: Sun, 13 Sep 2015 02:17:45 +0000 (UTC) From: "Narine Kokhlikyan (JIRA)" To: issues@spark.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (SPARK-10579) Extend statistical functions: Add Cardinality/Quantiles/Quartiles/Median in Statistics , e.g. for columns MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Narine Kokhlikyan created SPARK-10579: ----------------------------------------- Summary: Extend statistical functions: Add Cardinality/Quantiles/Quartiles/Median in Statistics , e.g. for columns Key: SPARK-10579 URL: https://issues.apache.org/jira/browse/SPARK-10579 Project: Spark Issue Type: New Feature Components: MLlib Reporter: Narine Kokhlikyan Priority: Minor Fix For: 1.6.0 Hi everyone, I think it would be good to extend statistical functions in mllib package, by adding Cardinality/Quantiles/Quartiles/Median for the columns, as many other ml and statistical libraries already have it. I couldn't find it in mllib package, hence would like to suggest it. Since this is my first time working with jira, I'd truly appreciate if someone could review this and let me know what do you think. Also, I'd really like to work on it and looking forward to hearing from you! Thanks, Narine -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org For additional commands, e-mail: issues-help@spark.apache.org