flink-dev mailing list archives

From Till Rohrmann <trohrm...@apache.org>
Subject Re: Performance and accuracy of Flink iterations
Date Tue, 17 May 2016 12:39:36 GMT
Hi Greg,

as far as I know, there has been no exhaustive comparison of how closely
delta iterations match the accuracy of bulk iterations, or of how much
accuracy you lose. I think it strongly depends on the problem. For example,
graph algorithms such as connected components shouldn't suffer from it. In
contrast, the PageRank implementation with the THRESHOLD value is not
expected to produce the most accurate result; how far off it is depends on
the threshold value. Do you want to make such a comparison?

Cheers,
Till
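
The threshold effect described above can be sketched in a few lines. This is
plain Python, not the Flink API and not Flink's PageRank example: the graph,
the function names (bulk_pagerank, delta_pagerank, l1_error), and the
residual-pushing scheme are all assumptions chosen for illustration. The
delta variant keeps a workset of vertices whose pending change exceeds the
threshold and only those vertices propagate updates, so whatever stays below
the threshold is never applied.

```python
def bulk_pagerank(graph, damping=0.85, iterations=200):
    """Recompute every rank in every superstep (bulk-style iteration).

    Assumes every vertex has at least one outgoing edge.
    """
    n = len(graph)
    ranks = {v: 1.0 / n for v in graph}
    for _ in range(iterations):
        new_ranks = {v: (1.0 - damping) / n for v in graph}
        for v, targets in graph.items():
            share = damping * ranks[v] / len(targets)
            for u in targets:
                new_ranks[u] += share
        ranks = new_ranks
    return ranks


def delta_pagerank(graph, threshold, damping=0.85):
    """Only vertices whose pending change exceeds `threshold` stay active.

    Each vertex holds a rank and a residual (change not yet applied).
    Active vertices apply their residual and push a damped share of it
    to their out-neighbours; the loop ends when the workset is empty.
    """
    if threshold <= 0:
        raise ValueError("threshold must be positive to guarantee termination")
    n = len(graph)
    ranks = {v: 0.0 for v in graph}
    residual = {v: (1.0 - damping) / n for v in graph}
    while True:
        workset = [v for v in graph if residual[v] > threshold]
        if not workset:
            return ranks
        # Take the pending deltas out first so the superstep is synchronous.
        pushed = {v: residual[v] for v in workset}
        for v in workset:
            residual[v] = 0.0
        for v, delta in pushed.items():
            ranks[v] += delta
            share = damping * delta / len(graph[v])
            for u in graph[v]:
                residual[u] += share


def l1_error(reference, approx):
    """Sum of absolute per-vertex differences between two rank maps."""
    return sum(abs(reference[v] - approx[v]) for v in reference)


# Hypothetical toy graph: vertex -> list of out-neighbours.
GRAPH = {
    "a": ["b", "c"],
    "b": ["c"],
    "c": ["a"],
    "d": ["c"],
    "e": ["a", "d"],
}

if __name__ == "__main__":
    reference = bulk_pagerank(GRAPH)
    for threshold in (1e-2, 1e-4, 1e-6, 1e-8):
        error = l1_error(reference, delta_pagerank(GRAPH, threshold))
        print(f"threshold={threshold:g}  L1 error vs bulk={error:.3e}")
```

In this particular scheme the leftover residual is at most the threshold per
vertex, so the L1 error is bounded by n * threshold / (1 - damping). That
bound is a property of this sketch only; a real comparison would have to be
run against the actual Flink implementations.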

On Mon, May 16, 2016 at 3:10 PM, Greg Hogan <code@greghogan.com> wrote:

> Hi,
>
> This question has arisen with the HITS algorithm (Hubs and Authorities) but
> the question is the same as with PageRank, for which Stephan published an
> excellent discussion and comparison of bulk and delta iterations [0].
>
> Delta iterations are clearly faster. Has there been a comparison as to
> whether, when, or how delta iterations are more accurate?
>
> Greg
>
> [0] http://data-artisans.com/data-analysis-with-flink-a-case-study-and-tutorial/
>
