spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joseph K. Bradley (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-7536) Audit MLlib Python API for 1.4
Date Thu, 14 May 2015 18:26:00 GMT

    [ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14544152#comment-14544152
] 

Joseph K. Bradley commented on SPARK-7536:
------------------------------------------

[~yanboliang] I realized I forgot to add in the JIRA description that this should include
comparing the APIs between 1.3 and 1.4 so we can note breaking changes.  We'll need to note
those changes (if any) in the user guide's Migration Guide section.

> Audit MLlib Python API for 1.4
> ------------------------------
>
>                 Key: SPARK-7536
>                 URL: https://issues.apache.org/jira/browse/SPARK-7536
>             Project: Spark
>          Issue Type: Sub-task
>          Components: MLlib, PySpark
>            Reporter: Joseph K. Bradley
>            Assignee: Yanbo Liang
>
> For new public APIs added to MLlib, we need to check the generated HTML doc and compare
the Scala & Python versions.  We need to track:
> * Inconsistency: Do class/method/parameter names match?
> * Docs: Is the Python doc missing or just a stub?  We want the Python doc to be as complete
as the Scala doc.
> * API changes: These should be very rare but are occasionally either necessary (intentional)
or accidental.  These must be recorded and added in the Programming Guide.
> ** Note: If the API change is for an Alpha/Experimental/DeveloperApi component, please
note that as well.
> * Missing classes/methods/parameters: We should create to-do JIRAs for functionality
missing from Python.
> ** classification
> *** StreamingLogisticRegressionWithSGD SPARK-7633
> ** clustering
> *** GaussianMixture SPARK-6258
> *** LDA SPARK-6259
> *** Power Iteration Clustering SPARK-5962
> *** StreamingKMeans SPARK-4118 
> ** evaluation
> *** MultilabelMetrics SPARK-6094 
> ** feature
> *** ElementwiseProduct SPARK-7605
> *** PCA SPARK-7604
> ** linalg
> *** Distributed linear algebra SPARK-6100
> ** pmml.export SPARK-7638
> ** regression
> *** StreamingLinearRegressionWithSGD SPARK-4127
> ** stat
> *** KernelDensity SPARK-7639
> ** util
> *** MLUtils SPARK-6263 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message