Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id C9B1B200D45 for ; Thu, 23 Nov 2017 20:23:32 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id C708E160BFE; Thu, 23 Nov 2017 19:23:32 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 19605160BED for ; Thu, 23 Nov 2017 20:23:31 +0100 (CET) Received: (qmail 6735 invoked by uid 500); 23 Nov 2017 19:23:31 -0000 Mailing-List: contact user-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list user@flink.apache.org Received: (qmail 6725 invoked by uid 99); 23 Nov 2017 19:23:30 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 Nov 2017 19:23:30 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 319031807A7 for ; Thu, 23 Nov 2017 19:23:30 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.879 X-Spam-Level: * X-Spam-Status: No, score=1.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id XEHL_5VFz2x3 for ; Thu, 23 Nov 2017 19:23:29 +0000 (UTC) Received: from mail-pg0-f50.google.com (mail-pg0-f50.google.com [74.125.83.50]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 0749F5F568 for ; Thu, 23 Nov 2017 19:23:29 +0000 (UTC) Received: by mail-pg0-f50.google.com with SMTP id r12so14340743pgu.10 for ; Thu, 23 Nov 2017 11:23:28 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:from:date:message-id:subject:to; bh=OSBMHo2fG/kLPsE0boY2NFp+q4xlbpvqV0sKpuwjlpo=; b=rWI4W160PpFCmcuetQs8EkoYiztW+RxN8CU4apdPwAYQWuQQb3pxmpKWA3LxJvRPs6 iv0G/tS4AWhNEh0GD5zRsZPAAM6Oq7QWzZLwtAWMPnLFbEHLfSasb8RQx4dbBCKVdciw LKErSvHe9WoHjnX4YETpoSSe7us5f2mYiE2G8M36y7szUuU+89RqcvMLVyF4jh32emuK bCrrHXoi3WklymFaLwNZWbT0NyLk7dr9dsOmbDZXBsrgoGJ+0SDA0bKDi4UYPjsGPDTa Im4vyRn/0QgbfT8iBQHE0JDy00/AEl/Kze0BlvTdCieM3+zbnIkUsDfU7CZp/GA2JR9N T7Xg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=OSBMHo2fG/kLPsE0boY2NFp+q4xlbpvqV0sKpuwjlpo=; b=pqHpz8GdERCer5k1yHRJ3q88FmptV34myNI0M78Cl6dLnOKNhIADLET2YLfRHuYkCG GajQ9gAp3I+E2s7pA8dhR26crmf9D+YMu2bAjR/Zt/VyveL6psm2Jh+UzbbyshonnTRp JhD6WclVayqy0qoPlMwEd84pSZYIi4yqQrZgLuLxPi15X5sEqi9bV1Qydf6NeWr9Wa9X zDbV4oQA/gC/nB6RJpXcq7LGZzV3NxQb2IROqqoQ7ZJWYH4Hgu3pI4DlZ+AZdAS6TCiJ S3TbMM/SP2XcBprd4rYTq+CPfvxik8Ct7wbVq4t6ZO4IoYRJHxnq/AR08P2sagpxR9H5 zOxg== X-Gm-Message-State: AJaThX6/+CJEYo70lbzqzshVAJZM7DRPGOFYtQ2ur6lx6I+TM59MXWSc yd/mOpSUzvUQFInxgN5Jb1ox5mH+DZ/IOqpn07BLtA== X-Google-Smtp-Source: AGs4zMaqYCAW7LV9LZsmtfWpuVmwZcTm7J2gIrD+wnV43GM9b7wz03QJ0V4H4bYvqy9NkPa3pwc4f7I+mBE0qVZhGR4= X-Received: by 10.101.88.76 with SMTP id s12mr25572580pgr.215.1511465001876; Thu, 23 Nov 2017 11:23:21 -0800 (PST) MIME-Version: 1.0 Received: by 10.100.170.66 with HTTP; Thu, 23 Nov 2017 11:23:21 -0800 (PST) From: Shivam Sharma <28shivamsharma@gmail.com> Date: Fri, 24 Nov 2017 00:53:21 +0530 Message-ID: Subject: How & Where does flink stores data for aggregations. To: user@flink.apache.org Content-Type: multipart/alternative; boundary="089e08234c488703df055eab5e3c" archived-at: Thu, 23 Nov 2017 19:23:33 -0000 --089e08234c488703df055eab5e3c Content-Type: text/plain; charset="UTF-8" Hi All, I have a small question regarding where does Flink stores data for doing window aggregations. Lets say I am running following query on Flink table: SELECT name, count(*) FROM testTable GROUP BY TUMBLE(rowtime, INTERVAL '1' MINUTE), name So, If I understand above query properly so it must be saving data for 1 minute somewhere to find aggregations. If Flink is persisting this in memory then my concern is if I increase interval to a DAY or more then it will store the complete data for interval which can cross memory. If persistence is disk then latency will be there. Basically how do we solve such kind of use-cases using FLINK where aggregation interval are quite high. Thanks in advance -- Shivam Sharma --089e08234c488703df055eab5e3c Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Hi Al= l,

I have a small question re= garding where does Flink stores data for doing window aggregations. Lets sa= y I am running following query on Flink table:

SELECT= name, count(*)
FROM testTable
GROUP BY TUMBLE(rowtime, INTERVAL '= 1' MINUTE), name

S= o, If I understand above query properly so it must be saving data for 1 min= ute somewhere to find aggregations. If Flink is persisting this in memory t= hen my concern is if I increase interval to a DAY or more then it will stor= e the complete data for interval which can cross memory. If persistence is = disk then latency will be there.

Basically how do we solve such kind of use-cases using FLINK where aggr= egation interval are quite high.

Thanks in advance

--
<= div dir=3D"ltr">
Shiv= am Sharma

=
--089e08234c488703df055eab5e3c--