spark-issues mailing list archives

From "Rahul K Bhojwani (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SPARK-2547) The clustering documentaion example provided for spark 0.9.1/docs is having a error
Date Thu, 17 Jul 2014 05:24:04 GMT
Rahul K Bhojwani created SPARK-2547:
---------------------------------------

             Summary: The clustering documentaion example provided for spark 0.9.1/docs is having a error
                 Key: SPARK-2547
                 URL: https://issues.apache.org/jira/browse/SPARK-2547
             Project: Spark
          Issue Type: Documentation
          Components: Documentation, Examples, MLlib, PySpark
    Affects Versions: 0.9.1
         Environment: All
            Reporter: Rahul K Bhojwani


The MLlib clustering documentation includes a KMeans example:

http://spark.apache.org/docs/0.9.1/mllib-guide.html#clustering-2

The following line in that example is wrong and misleading:

clusters = KMeans.train(parsedData, 2, maxIterations=10,runs=30, initialization_mode="random")

Note the keyword argument "initialization_mode" in the example. It does not match the
implementation of KMeans, which names this parameter "initializationMode".

Correction: 

clusters = KMeans.train(parsedData, 2, maxIterations=10,runs=30, initializationMode="random")
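
The effect of the misspelled keyword can be sketched without a Spark installation. The snippet below is only an illustration: train is a hypothetical stand-in, not the real KMeans.train. Its keyword names are taken from this report, and its default values, return value, and the try_train helper are assumptions made for the example. It shows how Python rejects a keyword argument that is not in a function's signature.

```python
# Hypothetical stand-in for KMeans.train. Only the keyword names come from
# the issue report; the defaults and the returned dict are assumptions.
def train(data, k, maxIterations=100, runs=1, initializationMode="k-means||"):
    return {
        "k": k,
        "maxIterations": maxIterations,
        "runs": runs,
        "initializationMode": initializationMode,
    }


def try_train(**kwargs):
    """Call the stand-in and report a TypeError as text instead of raising."""
    sample = [[0.0, 0.0], [9.0, 9.0]]  # made-up points, for illustration only
    try:
        return train(sample, 2, **kwargs)
    except TypeError as exc:
        return "TypeError: %s" % exc


# Keyword spelled as in the 0.9.1 documentation example.
wrong = try_train(maxIterations=10, runs=30, initialization_mode="random")

# Keyword spelled as in the proposed correction.
right = try_train(maxIterations=10, runs=30, initializationMode="random")

print(wrong)
print(right)
```

With this stand-in, the documented spelling fails with a TypeError about an unexpected keyword argument, while the corrected spelling is accepted.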





--
This message was sent by Atlassian JIRA
(v6.2#6252)
