flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Anna Beer (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-5785) Add an Imputer for preparing data
Date Mon, 27 Mar 2017 11:42:41 GMT

    [ https://issues.apache.org/jira/browse/FLINK-5785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15943106#comment-15943106

Anna Beer commented on FLINK-5785:

[~skonto] I made a pull request. The imputer works now for a DataSet of vectors but I'm not
sure if I loaded it up correctly, I'm new to github and all :S

> Add an Imputer for preparing data
> ---------------------------------
>                 Key: FLINK-5785
>                 URL: https://issues.apache.org/jira/browse/FLINK-5785
>             Project: Flink
>          Issue Type: New Feature
>          Components: Machine Learning Library
>            Reporter: Stavros Kontopoulos
>            Assignee: Stavros Kontopoulos
> We need to add an Imputer as described in [1].
> "The Imputer class provides basic strategies for imputing missing values, either using
the mean, the median or the most frequent value of the row or column in which the missing
values are located. This class also allows for different missing values encodings."
> References
> 1. http://scikit-learn.org/stable/modules/preprocessing.html#preprocessing
> 2. http://scikit-learn.org/stable/auto_examples/missing_values.html#sphx-glr-auto-examples-missing-values-py

This message was sent by Atlassian JIRA

View raw message