Return-Path: X-Original-To: apmail-spark-issues-archive@minotaur.apache.org Delivered-To: apmail-spark-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 259B811AB2 for ; Thu, 31 Jul 2014 20:32:39 +0000 (UTC) Received: (qmail 98841 invoked by uid 500); 31 Jul 2014 20:32:38 -0000 Delivered-To: apmail-spark-issues-archive@spark.apache.org Received: (qmail 98803 invoked by uid 500); 31 Jul 2014 20:32:38 -0000 Mailing-List: contact issues-help@spark.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@spark.apache.org Delivered-To: mailing list issues@spark.apache.org Received: (qmail 98780 invoked by uid 99); 31 Jul 2014 20:32:38 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 31 Jul 2014 20:32:38 +0000 Date: Thu, 31 Jul 2014 20:32:38 +0000 (UTC) From: "Xiangrui Meng (JIRA)" To: issues@spark.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Closed] (SPARK-2776) Add normalizeByCol method to mllib.util.MLUtils MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/SPARK-2776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-2776. -------------------------------- Resolution: Duplicate > Add normalizeByCol method to mllib.util.MLUtils > ----------------------------------------------- > > Key: SPARK-2776 > URL: https://issues.apache.org/jira/browse/SPARK-2776 > Project: Spark > Issue Type: New Feature > Reporter: Andres Perez > Priority: Minor > > Add the ability to compute the mean and standard deviations of each vector (LabeledPoint) component and normalize each vector in the RDD, using only RDD transformations. The result is an RDD of Vectors where each column has a mean of zero and standard deviation of one. > See https://github.com/apache/spark/pull/1698 -- This message was sent by Atlassian JIRA (v6.2#6252)