flink-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chengxiang Li (JIRA)" <j...@apache.org>
Subject [jira] [Created] (FLINK-2535) Fixed size sample algorithm optimization
Date Mon, 17 Aug 2015 03:14:45 GMT
Chengxiang Li created FLINK-2535:
------------------------------------

             Summary: Fixed size sample algorithm optimization
                 Key: FLINK-2535
                 URL: https://issues.apache.org/jira/browse/FLINK-2535
             Project: Flink
          Issue Type: Improvement
          Components: Core
            Reporter: Chengxiang Li
            Priority: Minor


Fixed size sample algorithm is known to be less efficient than sample algorithms with fraction,
but sometime it's necessary. Some optimization could significantly reduce the storage size
and computation cost, such as the algorithm described in [this paper|http://machinelearning.wustl.edu/mlpapers/papers/icml2013_meng13a].



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message