From: Vinay Patil
Date: Tue, 23 Aug 2016 21:20:46 -0500
Subject: Dealing with Multiple sinks in Flink
To: user@flink.apache.org

Hi,

In our Flink pipeline we currently write data to multiple S3 objects/folders based on some conditions. The issue I am facing is as follows.

Consider these S3 folders:

temp_bucket/processedData/20160823/
temp_bucket/rawData/20160822/
temp_bucket/errorData/20160821/

When the parallelism is set to 1, the data is written to all of the S3 folders above, but when I set it to a larger value, the data is written only to the first folder and not to the others.

I am testing the Flink job on EMR with 4 task managers having 16 slots; even with the parallelism set to 4, I face the same issue. (Running from the IDE gives the same output. I tested this with Flink 1.0.3 and 1.1.1.)

I do not understand why this is happening.

Regards,
Vinay Patil