flink-user mailing list archives

From Yassine MARZOUGUI <y.marzou...@mindlytix.com>
Subject RE: Aggregation problem.
Date Sat, 18 Feb 2017 11:48:04 GMT
Hi,

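I think this is expected output and not necessarily a bug. max() only
aggregates the given field, so the other fields of the result are not
guaranteed to come from the same element. To get the whole element that has
the maximum value, use maxBy() instead of max().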

See this answer for more details:
http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Wrong-and-non-consistent-behavior-of-max-tp484p488.html
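
To make the difference concrete, here is a minimal sketch in plain Java that
mimics the two semantics on the data from the original question. It does not
use Flink at all: the Row class and the maxScore/maxByScore helpers are
hypothetical names made up for this illustration, and they only model the
behaviour described above, not Flink's actual implementation.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class MaxVsMaxBy {

    /** Hypothetical row type mirroring the five-field tuples in the thread. */
    static final class Row {
        final int key;
        final String category;
        final double weight;
        final int count;
        final double score;

        Row(int key, String category, double weight, int count, double score) {
            this.key = key;
            this.category = category;
            this.weight = weight;
            this.count = count;
            this.score = score;
        }

        @Override
        public String toString() {
            return "(" + key + "," + category + "," + weight + "," + count + "," + score + ")";
        }
    }

    /** The six rows quoted in the original question. */
    static List<Row> sample() {
        List<Row> rows = new ArrayList<>();
        rows.add(new Row(0, "Auto", 0.4, 1, 5.8317538999854194E-5));
        rows.add(new Row(0, "Computer", 0.2, 1, 4.8828125E-5));
        rows.add(new Row(0, "Sports", 0.4, 2, 1.7495261699956258E-4));
        rows.add(new Row(1, "Auto", 0.4, 1, 1.7495261699956258E-4));
        rows.add(new Row(1, "Computer", 0.2, 1, 4.8828125E-5));
        rows.add(new Row(1, "Sports", 0.4, 1, 5.8317538999854194E-5));
        return rows;
    }

    /** max-style: per key, only the largest score itself is well defined. */
    static Map<Integer, Double> maxScore(List<Row> rows) {
        Map<Integer, Double> best = new LinkedHashMap<>();
        for (Row r : rows) {
            best.merge(r.key, r.score, (a, b) -> Math.max(a, b));
        }
        return best;
    }

    /** maxBy-style: per key, keep the whole row that holds the largest score. */
    static Map<Integer, Row> maxByScore(List<Row> rows) {
        Map<Integer, Row> best = new LinkedHashMap<>();
        for (Row r : rows) {
            best.merge(r.key, r, (a, b) -> b.score > a.score ? b : a);
        }
        return best;
    }

    public static void main(String[] args) {
        // Only the (key, maximum score) pairs; no claim about the other fields.
        System.out.println(maxScore(sample()));
        // Whole rows: Sports for key 0 and Auto for key 1, as the poster expected.
        for (Row r : maxByScore(sample()).values()) {
            System.out.println(r);
        }
    }
}
```

In Flink itself the corresponding change would be to call
ds.groupBy(0).maxBy(4).print() instead of ds.groupBy(0).max(4).print().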

Best,
Yassine

On Feb 18, 2017 12:28, "Kürşat Kurt" <kursat@kursatkurt.com> wrote:

OK, I have opened an issue with a test case.

Thanks.

https://issues.apache.org/jira/browse/FLINK-5840

*From:* Fabian Hueske [mailto:fhueske@gmail.com]
*Sent:* Saturday, February 18, 2017 3:33 AM
*To:* user@flink.apache.org
*Subject:* Re: Aggregation problem.



Hi,

This looks like a bug to me.

Can you open a JIRA issue and maybe add a small test case to reproduce it?

Thank you,

Fabian



2017-02-18 1:06 GMT+01:00 Kürşat Kurt <kursat@kursatkurt.com>:

Hi,



I have a DataSet like this:



*(0,Auto,0.4,1,5.8317538999854194E-5)*

*(0,Computer,0.2,1,4.8828125E-5)*

*(0,Sports,0.4,2,1.7495261699956258E-4)*

*(1,Auto,0.4,1,1.7495261699956258E-4)*

*(1,Computer,0.2,1,4.8828125E-5)*

*(1,Sports,0.4,1,5.8317538999854194E-5)*



This code, *ds.groupBy(0).max(4).print()*, prints:



*(0,Sports,0.4,1,1.7495261699956258E-4)*

*(1,Sports,0.4,1,1.7495261699956258E-4)*



...but I am expecting:



*(0,Sports,0.4,2,1.7495261699956258E-4)*

*(1,Auto,0.4,1,1.7495261699956258E-4)*



What is wrong with this code?
