From: Saikat Kanjilal
Date: Fri, 20 Oct 2017 15:40:56 -0700
Subject: Re: The plan for reef-1791
To: dev@reef.apache.org

Hello Folks,

I wanted to give a quick end-of-week status on REEF-1791. Here's what I have working so far:

* Successfully launching the LineCounter program using the DataLoader architecture against the Spark runtime, for a local file, running against a 1-node Hadoop YARN install
* Successfully invoking the flatMap function and having the REEF launcher run inside of it against all the predefined partitions

To do:

* Some code cleanup before I submit a PR. There are no unit tests yet; I will add them while we're fleshing out the PR
* Documentation around the chosen architecture

What are the major changes:

* Addition of a new runtime called reef-runtime-spark, which invokes the SparkContext and launches REEF within that context through a simple flatMap function for now
* I had to change all the REEF configuration-related classes (JavaConfigurationBuilderImpl) to implement the Serializable interface, as each closure in Spark requires that all the classes passed inside it be serializable. I am wondering about the impact of this (including performance impact) on the rest of the REEF codebase

Please let me know if there are any questions or additional feedback. Look for the CR, hopefully in the next week or so.

Thanks in advance.

On Tue, Oct 10, 2017 at 9:20 AM, Saikat Kanjilal wrote:

> Good morning Reef dev community
>
> I wanted to share some thoughts on how I am thinking we move forward with
> the implementation of reef-runtime-spark:
>
> 1. I have completed my first cut of the code based on discussions with
>    Sergiy and am ready to test it. I will do so both locally and either
>    on HDInsight or on a VM installed with Spark and Hadoop running on
>    YARN
> 2. Testing will take a bit of time as we need to work out all the bugs
>    that come up coordinating events with REEF and Spark containers
> 3. Next week I will be testing this on my Mac running Spark binaries on
>    Hadoop locally
> 4. Towards the end of the month I will transition to testing on AWS,
>    specifically running Spark on EMR and REEF on that setup. I think
>    running REEF on AWS/EMR is a big plus and will enable more users to
>    run Spark on REEF
> 5. I was going to wait to put out a code review until the first
>    successful tests go through. To reiterate, the goal for the first
>    phase is simply to run HelloReef on Spark
>
> If you have any concerns or feedback on this plan do let me know. As I
> mentioned in JIRA, I would really like to see us move to Java 8 sooner
> rather than later; it'll make the development of reef-runtime-spark a
> lot simpler.
>
> Thanks in advance for your help.
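
On the Serializable change described in the status update, here is a minimal sketch, in plain Java with no Spark or REEF dependency, of why a closure that is shipped to another process needs everything it captures to be serializable. PlainConfig, SerializableConfig, SerializableTask and the helper methods are hypothetical stand-ins invented for illustration; they are not REEF or Spark classes. The sketch assumes Java 8 lambdas and plain Java serialization, which to my knowledge is also what Spark relies on for closures.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.NotSerializableException;
import java.io.ObjectOutputStream;
import java.io.Serializable;

public class ClosureSerializationSketch {

    /** Hypothetical stand-in for a configuration class that is NOT Serializable. */
    static class PlainConfig {
        final String jobName;
        PlainConfig(String jobName) { this.jobName = jobName; }
    }

    /** The same hypothetical class after opting in to Java serialization. */
    static class SerializableConfig implements Serializable {
        private static final long serialVersionUID = 1L;
        final String jobName;
        SerializableConfig(String jobName) { this.jobName = jobName; }
    }

    /** A function type that is itself Serializable, so lambdas of this type can be written out. */
    interface SerializableTask extends Serializable {
        String run(String partition);
    }

    /** Returns true if Java serialization can write the task and everything it captured. */
    static boolean canSerialize(SerializableTask task) {
        try (ObjectOutputStream out = new ObjectOutputStream(new ByteArrayOutputStream())) {
            out.writeObject(task);
            return true;
        } catch (NotSerializableException e) {
            // Raised when the lambda captured an object whose class is not Serializable.
            return false;
        } catch (IOException e) {
            throw new IllegalStateException(e);
        }
    }

    /** Builds a task whose closure captures a non-serializable object. */
    static SerializableTask taskCapturing(PlainConfig config) {
        return partition -> config.jobName + ":" + partition;
    }

    /** Builds the same task, but the captured object is serializable. */
    static SerializableTask taskCapturing(SerializableConfig config) {
        return partition -> config.jobName + ":" + partition;
    }

    public static void main(String[] args) {
        System.out.println(canSerialize(taskCapturing(new PlainConfig("LineCounter"))));
        System.out.println(canSerialize(taskCapturing(new SerializableConfig("LineCounter"))));
    }
}
```

Running main prints false for the task that captured PlainConfig and true for the one that captured SerializableConfig. The same failure mode is what forces captured configuration objects to be serializable when they are referenced from inside a function passed to Spark; the performance question raised in the status update is not addressed by this sketch.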