flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael Gendelman (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (FLINK-1728) Add random forest ensemble method to machine learning library
Date Tue, 12 Sep 2017 13:45:00 GMT

    [ https://issues.apache.org/jira/browse/FLINK-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16162953#comment-16162953
] 

Michael Gendelman edited comment on FLINK-1728 at 9/12/17 1:44 PM:
-------------------------------------------------------------------

Hi,

What is the status of this? Are there any plans to implement this in near-term flink releases?

Thanks,
Michael


was (Author: micg):
What is the status of this? Are there any plans to implement this in near-term flink releases?

> Add random forest ensemble method to machine learning library
> -------------------------------------------------------------
>
>                 Key: FLINK-1728
>                 URL: https://issues.apache.org/jira/browse/FLINK-1728
>             Project: Flink
>          Issue Type: New Feature
>          Components: Machine Learning Library
>            Reporter: Till Rohrmann
>            Assignee: Mikio Braun
>              Labels: ML
>
> Random forests [2,3] are a well-established mean to mitigate the decision trees' weakness
of overfitting. Therefore this would be a valuable contribution to Flink's machine learning
library.
> Google [1] describes some of the techniques they used to do ensemble learning of MapReduce.
This could be helpful while implementing a distributed random forest.
> Resources:
> [1] [http://static.googleusercontent.com/media/research.google.com/en/us/pubs/archive/36296.pdf]
> [2] [http://www.stat.berkeley.edu/~breiman/randomforest2001.pdf]
> [3] [http://www.stat.berkeley.edu/~breiman/Using_random_forests_V3.1.pdf]



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message