flink-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Theodore Vasiloudis (JIRA)" <j...@apache.org>
Subject [jira] [Created] (FLINK-1901) Create sample operator for Dataset
Date Thu, 16 Apr 2015 13:50:58 GMT
Theodore Vasiloudis created FLINK-1901:

             Summary: Create sample operator for Dataset
                 Key: FLINK-1901
                 URL: https://issues.apache.org/jira/browse/FLINK-1901
             Project: Flink
          Issue Type: Improvement
          Components: Core
            Reporter: Theodore Vasiloudis

In order to be able to implement Stochastic Gradient Descent and a number of other machine
learning algorithms we need to have a way to take a random sample from a Dataset.

We need to be able to sample with or without replacement from the Dataset, choose the relative
size of the sample, and set a seed for reproducibility.

This message was sent by Atlassian JIRA

View raw message