flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Anna Beer (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-5785) Add an Imputer for preparing data
Date Mon, 27 Mar 2017 13:03:42 GMT

    [ https://issues.apache.org/jira/browse/FLINK-5785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15943222#comment-15943222
] 

Anna Beer commented on FLINK-5785:
----------------------------------

[~Zentol] Thank you for the detailed description, hope I've done it right this time:
https://github.com/apache/flink/pull/3620

> Add an Imputer for preparing data
> ---------------------------------
>
>                 Key: FLINK-5785
>                 URL: https://issues.apache.org/jira/browse/FLINK-5785
>             Project: Flink
>          Issue Type: New Feature
>          Components: Machine Learning Library
>            Reporter: Stavros Kontopoulos
>            Assignee: Stavros Kontopoulos
>
> We need to add an Imputer as described in [1].
> "The Imputer class provides basic strategies for imputing missing values, either using
the mean, the median or the most frequent value of the row or column in which the missing
values are located. This class also allows for different missing values encodings."
> References
> 1. http://scikit-learn.org/stable/modules/preprocessing.html#preprocessing
> 2. http://scikit-learn.org/stable/auto_examples/missing_values.html#sphx-glr-auto-examples-missing-values-py



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message