Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id F1910200BA3 for ; Thu, 20 Oct 2016 11:17:56 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id F0272160AE0; Thu, 20 Oct 2016 09:17:56 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 3C8C1160ADB for ; Thu, 20 Oct 2016 11:17:56 +0200 (CEST) Received: (qmail 85935 invoked by uid 500); 20 Oct 2016 09:17:54 -0000 Mailing-List: contact dev-help@spark.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list dev@spark.apache.org Received: (qmail 85904 invoked by uid 99); 20 Oct 2016 09:17:52 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 Oct 2016 09:17:52 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 9226D180647 for ; Thu, 20 Oct 2016 09:17:52 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.748 X-Spam-Level: X-Spam-Status: No, score=0.748 tagged_above=-999 required=6.31 tests=[KAM_INFOUSMEBIZ=0.75, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id iIvkfp1Yerlz for ; Thu, 20 Oct 2016 09:17:50 +0000 (UTC) Received: from greenstation.de (greenstation.de [176.9.113.125]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 997025FBD6 for ; Thu, 20 Oct 2016 09:17:50 +0000 (UTC) Received: by greenstation.de (Postfix, from userid 115) id 702F3103576; Thu, 20 Oct 2016 11:17:49 +0200 (CEST) Received: from [192.168.42.86] (p5B13AE88.dip0.t-ipconnect.de [91.19.174.136]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by greenstation.de (Postfix) with ESMTPSA id 183E2103574 for ; Thu, 20 Oct 2016 11:17:48 +0200 (CEST) From: Andreas Hechenberger Subject: Get size of intermediate results To: dev@spark.apache.org Message-ID: Date: Thu, 20 Oct 2016 11:17:46 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:45.0) Gecko/20100101 Thunderbird/45.4.0 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit archived-at: Thu, 20 Oct 2016 09:17:57 -0000 Hey awesome Spark-Dev's :) i am new to spark and i read a lot but now i am stuck :( so please be kind, if i ask silly questions. I want to analyze some algorithms and strategies in spark and for one experiment i want to know the size of the intermediate results between iterations/jobs. Some of them are written to disk and some are in the cache, i guess. I am not afraid of looking into the code (i already did) but its complex and have no clue where to start :( It would be nice if someone can point me in the right direction or where i can find more information about the structure of spark core devel :) I already setup the devel environment and i can compile spark. It was really awesome how smoothly the setup was :) Thx for that. Servus Andy --------------------------------------------------------------------- To unsubscribe e-mail: dev-unsubscribe@spark.apache.org