flink-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Greg Hogan <c...@greghogan.com>
Subject Re: sampling function
Date Sat, 09 Jul 2016 15:43:39 GMT
Hi Do,

DataSet provides a stable @Public interface. DataSetUtils is marked
@PublicEvolving which is intended for public use, has stable behavior, but
method signatures may change. It's also good to limit DataSet to common
methods whereas the utility methods tend to be used for specific

I don't have the pulse of streaming but this sounds like a useful feature
that could be added.


On Sat, Jul 9, 2016 at 10:47 AM, Le Quoc Do <lequocdo@gmail.com> wrote:

> Hi all,
> I'm working on approximate computing using sampling techniques. I
> recognized that Flink supports the sample function for Dataset
> (org/apache/flink/api/java/utils/DataSetUtils.java). I'm just wondering why
> you didn't merge the function to org/apache/flink/api/java/DataSet.java
> since the sample function works as a transformation operator?
> The second question is that are you planning to support the sample
> function for DataStream (within windows) since I did not see it in
> DataStream code ?
> Thank you,
> Do

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message