spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "yuhao yang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-18441) Add Smote in spark mlib and ml
Date Tue, 15 Nov 2016 08:44:58 GMT

    [ https://issues.apache.org/jira/browse/SPARK-18441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15666539#comment-15666539
] 

yuhao yang commented on SPARK-18441:
------------------------------------

Hi [~licl], I had an implementation of Smote at https://github.com/hhbyyh/Test/blob/master/Smote/src/SmoteTest.scala
. Hope it can help you. 

Right now Spark 2.1 is already in the QA phase (feature lock down), so the earliest possible
window would be 2.2. Before that, we need to resolve some issues, like if SmoteSampler is
a feature transformer, we need to find a way to disable it in the pipeline during the predict
phase. Besides, guess we need to collect more opinions from the community to see if this is
a common requirement. 

> Add Smote in spark mlib and ml
> ------------------------------
>
>                 Key: SPARK-18441
>                 URL: https://issues.apache.org/jira/browse/SPARK-18441
>             Project: Spark
>          Issue Type: Wish
>          Components: ML, MLlib
>    Affects Versions: 2.0.1
>            Reporter: lichenglin
>
> PLZ Add Smote in spark mlib and ml in case of  the "not balance of train data" for Classification



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message