Subject: Re: memory flush on cluster
From: Stephan Ewen
To: user@flink.apache.org
Date: Tue, 23 Jun 2015 13:53:50 +0200

Currently, Flink does not cache anything across runs, except JAR files on the workers.

The reason the first run is slower may be:
 - Because in the first run, code is distributed in the cluster. In subsequent runs, the JAR files need not be redistributed.
 - Because the JIT takes a bit to kick in and compile code in the first run. In subsequent runs, the code is already JIT-ted.

The system should not freeze after 100 runs. Can you tell us a bit more of what you see? Can you identify which process hangs and send us a stack-trace of that one? Then we could look into this...

On Tue, Jun 23, 2015 at 10:56 AM, Pa Rö wrote:

> hi flink community,
>
> at the moment i am testing my flink app with a benchmark on a hadoop
> cluster (flink on yarn).
> my results show that flink needs more time for the first round than for
> all other rounds. maybe flink caches something in memory? and if i run the
> benchmark for 100 rounds my system freezes; i think the memory is full. is
> there a way to flush the memory after the execution?
>
> best regards,
> paul
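
The JIT point in the reply can be illustrated with a small timing harness. This is only a sketch in plain Java, not Flink code: the class name WarmupBenchmark, the stand-in workload() method, and the choice of how many warm-up rounds to drop are all assumptions made for illustration. In a real benchmark the workload would be the job submission itself, and the first round would also include the one-time JAR distribution.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical harness: times repeated rounds and reports a mean that
// excludes the first (warm-up) rounds, where JIT compilation still happens.
class WarmupBenchmark {

    /** Stand-in workload (assumption); a real benchmark would run the job here. */
    static long workload(int n) {
        long sum = 0;
        for (int i = 0; i < n; i++) {
            sum += (long) i * i % 7;
        }
        return sum;
    }

    /** Runs the workload `rounds` times and returns each round's wall-clock time in nanoseconds. */
    static List<Long> timeRounds(int rounds, int n) {
        List<Long> timings = new ArrayList<>();
        for (int r = 0; r < rounds; r++) {
            long start = System.nanoTime();
            long result = workload(n);
            long elapsed = System.nanoTime() - start;
            // Use the result so the computation cannot be discarded as dead code.
            if (result < 0) {
                throw new IllegalStateException("unexpected negative result");
            }
            timings.add(elapsed);
        }
        return timings;
    }

    /** Mean of the timings after dropping the first `warmup` rounds. */
    static double steadyStateMean(List<Long> timings, int warmup) {
        if (warmup < 0 || warmup >= timings.size()) {
            throw new IllegalArgumentException("warmup must be in [0, rounds)");
        }
        double total = 0;
        for (long t : timings.subList(warmup, timings.size())) {
            total += t;
        }
        return total / (timings.size() - warmup);
    }

    public static void main(String[] args) {
        List<Long> timings = timeRounds(20, 5_000_000);
        System.out.println("first round (ns):       " + timings.get(0));
        System.out.println("mean incl. warm-up (ns): " + steadyStateMean(timings, 0));
        System.out.println("mean excl. warm-up (ns): " + steadyStateMean(timings, 3));
    }
}
```

Comparing the first round with the mean of the later rounds shows how much of the difference is a one-time cost. For the hanging process, the JDK's jstack tool (jstack followed by the process id) prints a thread dump that can be sent to the list.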