Return-Path: X-Original-To: apmail-flink-user-archive@minotaur.apache.org Delivered-To: apmail-flink-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5FA8D1780E for ; Tue, 30 Jun 2015 08:41:30 +0000 (UTC) Received: (qmail 65932 invoked by uid 500); 30 Jun 2015 08:41:30 -0000 Delivered-To: apmail-flink-user-archive@flink.apache.org Received: (qmail 65861 invoked by uid 500); 30 Jun 2015 08:41:30 -0000 Mailing-List: contact user-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@flink.apache.org Delivered-To: mailing list user@flink.apache.org Received: (qmail 65851 invoked by uid 99); 30 Jun 2015 08:41:29 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 30 Jun 2015 08:41:29 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 58CB11A63AB for ; Tue, 30 Jun 2015 08:41:29 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.9 X-Spam-Level: ** X-Spam-Status: No, score=2.9 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id ZTxGSVLdnfD8 for ; Tue, 30 Jun 2015 08:41:23 +0000 (UTC) Received: from mail-la0-f54.google.com (mail-la0-f54.google.com [209.85.215.54]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 6EEE924B1A for ; Tue, 30 Jun 2015 08:41:22 +0000 (UTC) Received: by lagh6 with SMTP id h6so4215589lag.2 for ; Tue, 30 Jun 2015 01:41:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=poyaJF8I4pO9YlD6vbZQf7mriAaFdnNaqSysp6X2y9c=; b=S2WJTuuiY3FnzmUGKzPqeiI39EBVbpMA9cgMqFf/x1GbQ90/0VkqKnvdzeR/zb0zNv Tw7zsFmNeOg5ZLgBTtYBwx2M8J5yvM81xo+FNjFTWORbokLpr34P9Yhjhn7caFUYU2Y1 9e++/PB78EFtA1zC0JhmTQsuzkr/KlqNXO+nEJ9D+Isu0+M/1xLO4h/AcvOKckV4bo/O hbeFpj7GNyQpIqfyxVbWnPznjMtvpHX+iUk7XoGDYh2E9aIZWI3afh6spsTGoLZhecm+ yQ9qfmtuWuSMUMQ4ij9nMnxCjWc1vrRfKj+UVjljBn5tTDwzkaisgCweERNnKNsjFlSr 7cCw== MIME-Version: 1.0 X-Received: by 10.112.12.102 with SMTP id x6mr18695499lbb.80.1435653681937; Tue, 30 Jun 2015 01:41:21 -0700 (PDT) Received: by 10.152.225.171 with HTTP; Tue, 30 Jun 2015 01:41:21 -0700 (PDT) In-Reply-To: References: <48A214C8-C8F9-443A-9D60-6B06CB477086@mail.polimi.it> Date: Tue, 30 Jun 2015 10:41:21 +0200 Message-ID: Subject: Re: Execution graph From: Fabian Hueske To: user@flink.apache.org Content-Type: multipart/alternative; boundary=001a11c3d55cbb83bf0519b82b0b --001a11c3d55cbb83bf0519b82b0b Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable As an addition, some operators can only be run with a parallelism of 1. For example data sources based on collections and (un-grouped) all reduces. In some cases, the parallelism of the following operators will as well be set to 1 to avoid a network shuffle. If you do: env.fromCollection(myCollection).map(new MyMapper()).groupBy(0).reduce(new MyReduce()).writeToFile(); the data source and mapper will be run with a parallelism of 1, the reducer and sink will be executed with the default parallelism. Best, Fabian 2015-06-30 10:25 GMT+02:00 Maximilian Michels : > Hi Michele, > > If you don't set the parallelism, the default parallelism is used. For th= e > visualization in the web client, a parallelism of one is used. When you r= un > your example from your IDE, the default parallelism is set to the number = of > (virtual) cores of your CPU. > > Moreover, Flink will currently not automatically set the parallelism in a > cluster environment. It will use the default parallelism or the user-set > parallelism. In your example, if you set the parallelism explicitly then = it > will also show up in the visualization. > > Best, > Max > > On Tue, Jun 30, 2015 at 7:11 AM, Michele Bertoni < > michele1.bertoni@mail.polimi.it> wrote: > >> Hi, I was trying to run my program in the flink web environment (the >> local one) >> when I run it I get the graph of the planned execution but in each node >> there is a "parallelism =3D 1=E2=80=9D, instead i think it runs with par= =3D 8 (8 core, >> i always get 8 output) >> >> what does that mean? >> is that wrong or is it really running with 1 degree of par? >> >> just a note: I never do any setParallelism() command, i leave it >> automatical >> >> thanks >> Best >> Michele > > > --001a11c3d55cbb83bf0519b82b0b Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
As an addition, some operators can only be run with a= parallelism of 1. For example data sources based on collections and (un-gr= ouped) all reduces. In some cases, the parallelism of the following operato= rs will as well be set to 1 to avoid a network shuffle.

I= f you do:

env.fromCollection(myCollection).map(new MyMapp= er()).groupBy(0).reduce(new MyReduce()).writeToFile();

th= e data source and mapper will be run with a parallelism of 1, the reducer a= nd sink will be executed with the default parallelism.

Be= st, Fabian

2015-06-30 10:25 GMT+02:00 Maximilian Michels &l= t;mxm@apache.org>= ;:
Hi Michele,

If you don't set the parallelism, the default= parallelism is used. For the visualization in the web client, a parallelis= m of one is used. When you run your example from your IDE, the default para= llelism is set to the number of (virtual) cores of your CPU.

<= div>Moreover, Flink will currently not automatically set the parallelism in= a cluster environment. It will use the default parallelism or the user-set= parallelism. In your example, if you set the parallelism explicitly then i= t will also show up in the visualization.

Best,
=
Max

On Tue, Jun 30, 2015 at 7:11 AM, = Michele Bertoni <michele1.bertoni@mail.polimi.it> wrote:
Hi, I was trying to run my pro= gram in the flink web environment (the local one)
when I run it I get the graph of the planned execution but in each node the= re is a "parallelism =3D 1=E2=80=9D, instead i think it runs with par = =3D 8 (8 core, i=C2=A0 always get 8 output)

what does that mean?
is that wrong or is it really running with 1 degree of par?

just a note: I never do any setParallelism() command, i leave it automatica= l

thanks
Best
Michele

<= /div>

--001a11c3d55cbb83bf0519b82b0b--