spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xiangrui Meng (JIRA)" <j...@apache.org>
Subject [jira] [Closed] (SPARK-2776) Add normalizeByCol method to mllib.util.MLUtils
Date Thu, 31 Jul 2014 20:32:38 GMT

     [ https://issues.apache.org/jira/browse/SPARK-2776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Xiangrui Meng closed SPARK-2776.
--------------------------------

    Resolution: Duplicate

> Add normalizeByCol method to mllib.util.MLUtils
> -----------------------------------------------
>
>                 Key: SPARK-2776
>                 URL: https://issues.apache.org/jira/browse/SPARK-2776
>             Project: Spark
>          Issue Type: New Feature
>            Reporter: Andres Perez
>            Priority: Minor
>
> Add the ability to compute the mean and standard deviations of each vector (LabeledPoint)
component and normalize each vector in the RDD, using only RDD transformations. The result
is an RDD of Vectors where each column has a mean of zero and standard deviation of one.
> See https://github.com/apache/spark/pull/1698



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message