flink-user mailing list archives

From zhangminglei <18717838...@163.com>
Subject Re: Streaming
Date Wed, 27 Jun 2018 09:34:18 GMT

From my side, I do not think distinct is very easy to deal with, even when Flink works
together with Kafka's exactly-once support.

For UV, we can use a Bloom filter to filter the PV events and get the UV in the end.
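
To make the Bloom filter idea concrete, here is a minimal sketch in plain Java. It is not Flink API: the class BloomUvCounter, its parameters and the hash scheme are all made up for illustration. Each page view (PV) is checked against the filter, and only users who look new are counted towards UV.

```java
import java.nio.charset.StandardCharsets;
import java.util.BitSet;

// Hypothetical helper (not part of Flink): approximate UV counting with a Bloom filter.
class BloomUvCounter {
    private final BitSet bits;
    private final int numBits;
    private final int numHashes;
    private long pv = 0; // total number of page views seen
    private long uv = 0; // approximate number of distinct users

    BloomUvCounter(int numBits, int numHashes) {
        this.bits = new BitSet(numBits);
        this.numBits = numBits;
        this.numHashes = numHashes;
    }

    // Seeded FNV-1a style 32-bit hash; chosen only to keep the sketch dependency-free.
    private static int hash(byte[] data, int seed) {
        int h = 0x811C9DC5 ^ seed;
        for (byte b : data) {
            h ^= (b & 0xFF);
            h *= 16777619;
        }
        return h;
    }

    // Records one page view. Returns true if the user was not in the filter yet.
    boolean add(String userId) {
        pv++;
        byte[] data = userId.getBytes(StandardCharsets.UTF_8);
        int h1 = hash(data, 0);
        int h2 = hash(data, 0x9E3779B9);
        boolean seenBefore = true;
        // Derive numHashes bit positions from two base hashes (double hashing).
        for (int i = 0; i < numHashes; i++) {
            int idx = Math.floorMod(h1 + i * h2, numBits);
            if (!bits.get(idx)) {
                seenBefore = false;
                bits.set(idx);
            }
        }
        if (!seenBefore) {
            uv++;
        }
        return !seenBefore;
    }

    long pv() { return pv; }
    long uv() { return uv; }
}
```

A Bloom filter can report a new user as already seen (a false positive) but never the reverse, so the UV from this sketch can only be too low, never too high. In a real Flink job the bit set would also have to live in checkpointed state; that wiring is left out here.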

Windows are usually used for aggregate operations, so I think all of these can be realized with windows.
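
As a rough illustration of what a window-based aggregate does, here is a sketch in plain Java, again not Flink API. The class WindowedTopN and its methods are hypothetical. It assigns events to fixed-size (tumbling) windows by timestamp, counts items per window, and reads the distinct count and the top N from those counts.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper (not part of Flink): per-window counts, distinct count and top N.
class WindowedTopN {
    // Start of the tumbling window that contains the given timestamp.
    static long windowStart(long timestampMs, long windowSizeMs) {
        return timestampMs - Math.floorMod(timestampMs, windowSizeMs);
    }

    // Groups events by window start and counts occurrences of each item per window.
    static Map<Long, Map<String, Long>> countPerWindow(
            long[] timestampsMs, String[] items, long windowSizeMs) {
        Map<Long, Map<String, Long>> counts = new TreeMap<>();
        for (int i = 0; i < timestampsMs.length; i++) {
            long start = windowStart(timestampsMs[i], windowSizeMs);
            counts.computeIfAbsent(start, k -> new HashMap<>())
                  .merge(items[i], 1L, Long::sum);
        }
        return counts;
    }

    // Returns up to n items with the highest counts; ties are broken by item name.
    static List<String> topN(Map<String, Long> counts, int n) {
        List<Map.Entry<String, Long>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((a, b) -> {
            int byCount = Long.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        List<String> result = new ArrayList<>();
        for (int i = 0; i < Math.min(n, entries.size()); i++) {
            result.add(entries.get(i).getKey());
        }
        return result;
    }
}
```

With a window size of one hour, the number of keys in a window's map is the exact distinct count for that hour, and topN(counts.get(windowStart), n) gives the top N. This keeps every distinct item in memory, which is where an approximate structure such as a Bloom filter may help.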

I am not familiar with this field, so I would still like to hear how others respond to this question.


> On Jun 27, 2018, at 5:12 PM, aitozi <gjying1314@gmail.com> wrote:
> Hi, community
> I am using Flink to handle a few use cases:
> 1. "distinct count" to calculate the UV/PV.
> 2. Calculating the top N over the past 1 hour or 1 day.
> Are these all realized with windows? Or is there a best practice for doing this?
> 3. And when dealing with distinct, if there is no need to do a keyBy
> beforehand, how does the window handle this?
> Thanks 
> Aitozi.
> --
> Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/
