Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id B6C42200B72 for ; Fri, 12 Aug 2016 02:52:14 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id B52D5160A93; Fri, 12 Aug 2016 00:52:14 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 0A846160A94 for ; Fri, 12 Aug 2016 02:52:13 +0200 (CEST) Received: (qmail 51723 invoked by uid 500); 12 Aug 2016 00:52:13 -0000 Mailing-List: contact user-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@flink.apache.org Delivered-To: mailing list user@flink.apache.org Received: (qmail 51713 invoked by uid 99); 12 Aug 2016 00:52:13 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 12 Aug 2016 00:52:13 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 89CC2CB631 for ; Fri, 12 Aug 2016 00:52:12 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.179 X-Spam-Level: * X-Spam-Status: No, score=1.179 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx2-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id rm32E2aft9Jv for ; Fri, 12 Aug 2016 00:52:11 +0000 (UTC) Received: from mail-wm0-f52.google.com (mail-wm0-f52.google.com [74.125.82.52]) by mx2-lw-us.apache.org (ASF Mail Server at mx2-lw-us.apache.org) with ESMTPS id 8ADE25F238 for ; Fri, 12 Aug 2016 00:52:11 +0000 (UTC) Received: by mail-wm0-f52.google.com with SMTP id q128so1333092wma.1 for ; Thu, 11 Aug 2016 17:52:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:from:date:message-id:subject:to; bh=RDTC/egud+w9pzuUl0wdZJAdJ/+5dEIdF0S2hyWm8R8=; b=ithgDf3vjI6ikhxow7mtcg6nW9h7PIw9GOfUK7ngqpuCNl7ibHin4L5h+j5HNcbUJB bUsTbfno6lDq3Mru3x+fuXBDT+VmxZSwM0wSL3ixT4yM+BDk8pR5txIQwDvaPpPDbKN/ 9tkIXrZOcGI1QL9JfuxDTzgcH0sfcaIWaAhq9MnYBdd1CpkM6O1HCl4AC+muooSp+9GW VY8V/CXhr/gJ291qoCLm82tILZibeTdz/qOLT3MnHBGDEf7GPkj3pPCCFE1VMEVcvatm fbSAGTEDs3ckSi/KO9kv05wyWkbhQ6vTrblJwnFZTe3BCoQ0zk+mEFPeer2zkx8/+pCJ ee1A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=RDTC/egud+w9pzuUl0wdZJAdJ/+5dEIdF0S2hyWm8R8=; b=BPYT75V/HTz4cg8LRZARsRNsCy+X4nc58/FXLSIYwqxvlrzblwj7A/iYwe1CmQQFqZ phKATohMPctEHbaT6Dz7nQzT3oAvhU25ucLldztTi2gMENNbARnwf/Lk4Fcei9c7z/Zs bO1RuRdd8yYVigk6BsXCjtdIWCY4S5mMvqhL/NCEeADA1nJGoj1AZer79jkRtYsHHIuA ttXujUcfKcrp/QwK7mEdq1B9PzJq9zKe9fu+k8spy5xgclz/Ajw+sV8hhxouTQPHrF6C 8kkdFdyU2qHrmETPCMJSBwL9QqSd6DD3hhsG1O+KUQwrgMQY+trQptWP1heZC5vpnZlL x4YQ== X-Gm-Message-State: AEkoouv4wDgUl5Z2Z1ahwO8v1eg0FII/u5jB70F/5oEfzzyXbghifE7Z0LFasD67d1AlqaMNLkpNvXvIUjKr+w== X-Received: by 10.25.146.85 with SMTP id u82mr1936096lfd.222.1470963130418; Thu, 11 Aug 2016 17:52:10 -0700 (PDT) MIME-Version: 1.0 Received: by 10.25.26.198 with HTTP; Thu, 11 Aug 2016 17:51:49 -0700 (PDT) From: Elias Levy Date: Thu, 11 Aug 2016 17:51:49 -0700 Message-ID: Subject: Does Flink DataStreams using combiners? To: user@flink.apache.org Content-Type: multipart/alternative; boundary=001a11401dcaddcc540539d54ada archived-at: Fri, 12 Aug 2016 00:52:14 -0000 --001a11401dcaddcc540539d54ada Content-Type: text/plain; charset=UTF-8 I am wondering if Flink makes use of combiners to pre-reduce a keyed and windowed stream before shuffling the data among workers. I.e. will it use a combiner in something like: stream.flatMap {...} .assignTimestampsAndWatermarks(...) .keyBy(...) .timeWindow(...) .trigger(...) .sum("cnt") or will it shuffle the keyed input before the sum reduction? If it does make use of combiners, it would be useful to point this out in the documentation, particularly if it only applies to certain types of reducers, folds, etc. --001a11401dcaddcc540539d54ada Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
I am wondering if Flink makes use of combiners to pre-redu= ce a keyed and windowed stream before shuffling the data among workers.
I.e. will it use a combiner in something like:
stream.flatMap {...}
=C2=A0 =C2=A0 =C2=A0 .assi= gnTimestampsAndWatermarks(...)
=C2=A0 =C2=A0 =C2=A0 .keyBy(...)
=C2=A0 =C2=A0 =C2=A0 .timeWindow(...)
=C2=A0 =C2=A0 = =C2=A0 .trigger(...)
=C2=A0 =C2=A0 =C2=A0 .sum("cnt"= ;)

or will it shuffle the keyed input be= fore the sum reduction?

If it does make use of com= biners, it would be useful to point this out in the documentation, particul= arly if it only applies to certain types of reducers, folds, etc.
--001a11401dcaddcc540539d54ada--