flink-user mailing list archives

From Kürşat Kurt <kur...@kursatkurt.com>
Subject RE: Aggregation problem.
Date Sat, 08 Apr 2017 22:38:23 GMT


I have just upgraded Flink and can't use maxBy on a grouped DataSet.

I am getting the error below.


value maxBy is not a member of org.apache.flink.api.scala.GroupedDataSet




From: Kürşat Kurt [mailto:kursat@kursatkurt.com] 
Sent: Sunday, February 19, 2017 1:28 AM
To: user@flink.apache.org
Subject: RE: Aggregation problem.


Yes, it works.

Thank you Yassine.


From: Yassine MARZOUGUI [mailto:y.marzougui@mindlytix.com] 
Sent: Saturday, February 18, 2017 2:48 PM
To: user@flink.apache.org
Subject: RE: Aggregation problem.




I think this is expected output and not necessarily a bug. To get the element with the
maximum value, use maxBy() instead of max().


See this answer for more details: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Wrong-and-non-consistent-behavior-of-max-tp484p488.html
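
To illustrate the difference described above, here is a minimal sketch that uses plain Scala collections only, with no Flink dependency. It assumes that a field-wise max() guarantees only the aggregated field and that the other fields may come from a different record, while maxBy() returns one whole input record. The record layout, the sample values, and the helper names fieldMax and wholeMax are hypothetical and are not part of the Flink API.

```scala
// Plain Scala collections only; this is NOT Flink code.
// The record layout and sample values are made up for illustration.
object MaxVsMaxBy {
  type Rec = (String, Int, Double) // (key, id, score)

  // Mimics a field-wise max: only the aggregated field (score) is meaningful.
  // As an assumption, the remaining fields are taken from the last record seen.
  def fieldMax(group: Seq[Rec]): Rec =
    group.reduce((a, b) => (b._1, b._2, math.max(a._3, b._3)))

  // Mimics maxBy: returns one whole input record, the one with the largest score.
  def wholeMax(group: Seq[Rec]): Rec =
    group.reduce((a, b) => if (b._3 > a._3) b else a)

  def main(args: Array[String]): Unit = {
    val records = Seq(("a", 1, 9.0), ("a", 2, 3.0), ("b", 3, 5.0))
    // Group by the first field, then apply both variants to each group.
    records.groupBy(_._1).toSeq.sortBy(_._1).foreach { case (key, group) =>
      println(s"$key fieldMax=${fieldMax(group)} wholeMax=${wholeMax(group)}")
    }
  }
}
```

In this sketch the field-wise variant can produce a tuple that never existed in the input (the largest score paired with the id of another record), which would explain an unexpected result from max(). The whole-record variant always returns one of the original records.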





On Feb 18, 2017 12:28, "Kürşat Kurt" <kursat@kursatkurt.com> wrote:

OK, I have opened the issue with a test case.






From: Fabian Hueske [mailto:fhueske@gmail.com] 
Sent: Saturday, February 18, 2017 3:33 AM
To: user@flink.apache.org
Subject: Re: Aggregation problem.



This looks like a bug to me.

Can you open a JIRA issue and maybe add a small test case to reproduce it?

Thank you,



2017-02-18 1:06 GMT+01:00 Kürşat Kurt <kursat@kursatkurt.com>:



I have a DataSet like this:









This code, ds.groupBy(0).max(4).print(), prints:





...but I am expecting:





What is wrong with this code?


