From: Ufuk Celebi
Date: Tue, 26 Apr 2016 10:02:29 +0200
Subject: Re: Job hangs
To: user@flink.apache.org

Hey Timur,

is it possible to connect to the VMs and get stack traces of the Flink processes as well?
We can first have a look at the logs, but the stack traces will be helpful if we can't figure out what the issue is.

– Ufuk

On Tue, Apr 26, 2016 at 9:42 AM, Till Rohrmann wrote:
> Could you share the logs with us, Timur? That would be very helpful.
>
> Cheers,
> Till
>
> On Apr 26, 2016 3:24 AM, "Timur Fayruzov" wrote:
>>
>> Hello,
>>
>> Now I'm at the stage where my job seems to completely hang. Source code is
>> attached (it won't compile, but I think it gives a very good idea of what
>> happens). Unfortunately, I can't provide the datasets. Most of them are about
>> 100-500MM records, which I try to process on an EMR cluster with 40 tasks
>> and 6GB of memory for each.
>>
>> It was working for smaller input sizes. Any idea on what I can do
>> differently is appreciated.
>>
>> Thanks,
>> Timur