Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 8A21B200C02 for ; Fri, 20 Jan 2017 08:37:58 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 88C56160B48; Fri, 20 Jan 2017 07:37:58 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id A9ED4160B39 for ; Fri, 20 Jan 2017 08:37:57 +0100 (CET) Received: (qmail 43128 invoked by uid 500); 20 Jan 2017 07:37:56 -0000 Mailing-List: contact dev-help@airflow.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@airflow.incubator.apache.org Delivered-To: mailing list dev@airflow.incubator.apache.org Delivered-To: moderator for dev@airflow.incubator.apache.org Received: (qmail 84609 invoked by uid 99); 20 Jan 2017 06:55:26 -0000 X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 5.879 X-Spam-Level: ***** X-Spam-Status: No, score=5.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_REPLY=1, HTML_MESSAGE=2, KAM_WORKFROMHOME=2.5, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:sender:in-reply-to:references:from:date:message-id :subject:to; bh=6bliw97SwXYaOgYG+kIi+iD67XgaZysQ/KiH6sMU8lk=; b=gd7/7A/rVQHknwnrHOwYHowb1jp9lS/HE1i5+ra9E0cfbqm0qdGcmnVZOJtPTdw359 z6raIponA26HT/3td/yHKPa0UqoOqhJA536eQIFZmkSv9vUv0ot0wrrUifhthbIBrxhU bPiVxxhXYB0Wev2ECr2GVMEBYGjuMuieCs7e1Lqmza0gHYWpg825ozNA3BGByyq+CDat DNyc/vQrRa4IJHVKa1n2BYcSVWzAsvnuDRf+lW92qJgmlbRczI52rWBjuRhpebAoKtPW 1wwTRs54Hb/zg7jVXeGVmL3xWEBow7sIUaxeD9htVD4VoTM4CTGA0dcC3H9+4f+Xh1US 7hZw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:sender:in-reply-to:references:from :date:message-id:subject:to; bh=6bliw97SwXYaOgYG+kIi+iD67XgaZysQ/KiH6sMU8lk=; b=jTOm6N0YmI9EMiDZ6P84SZEkUYT3+A78JnH3R+6j8NMs9vdfQc9vS7GX7yC64JSulv 6E4IbmePJ2kqdJzu7b2X6D9QJN3NqS3fk88Rc7ieC642SHn6uN0M+pqQCEURILgfFO94 ryMwNHe6Lmtknsaj0VDC0PaD/x6996HyOClJ0ooaOGIFdLrA9jJkj7z47jxx1X9rQAUv GlHwxjfONSqR2FlkPhIxhrYFke3mtg/Fw5W7eDvUJkc+6xBxW5teuZp/I3iGz6HLGbRe nrtsuY4nzf0NobSuQhg3NmKMZ3FeyxJv/sTbt//fk/P9SQZwjrAnJYQTH1dzHchE/er6 P3cQ== X-Gm-Message-State: AIkVDXJjKhZq+qyQmUEgOTLqIPeULwIFwoo1rmdNM74YWrHLXvEieju6kH9R3WdZvttA8WaIApJ6BvomHmxw2A== X-Received: by 10.237.47.161 with SMTP id m30mr11145885qtd.20.1484895321719; Thu, 19 Jan 2017 22:55:21 -0800 (PST) MIME-Version: 1.0 Sender: jayesh.senjaliya@gmail.com In-Reply-To: References: From: Jayesh Senjaliya Date: Thu, 19 Jan 2017 22:54:51 -0800 X-Google-Sender-Auth: x-g3O9koCZ49fz0T5RaChIjk3T0 Message-ID: Subject: Re: New book covers Airflow with PySpark: Agile Data Science 2.0 (O'Reilly, 2017) AND Airflow Meetup? To: dev@airflow.incubator.apache.org Content-Type: multipart/alternative; boundary=94eb2c1239ea2e1d850546812240 archived-at: Fri, 20 Jan 2017 07:37:58 -0000 --94eb2c1239ea2e1d850546812240 Content-Type: text/plain; charset=UTF-8 Hi Siddharth, I am Jayesh from Paypal, and at last meetup we briefly talked about hosting next one and I offered to host next Airflow meetup at Paypal office. If we can come up with some dates, I can talk to facilities and reserve space accordingly. Please let me know. Thanks Jayesh On Thu, Jan 19, 2017 at 6:58 PM, siddharth anand wrote: > Mid-March might be a good time given that we had 2 meet-ups recently. > > We have a wiki about Airflow meet-ups : > https://cwiki.apache.org/confluence/display/AIRFLOW/Meetups. Feel free to > ask this list if someone would like to host......I'd imagine interest would > primarily come from other members of the community, but we're open to all > ideas. Since the last meet-up was in SF, it would be great if the next one > were in the South Bay. > > -s > > On Thu, Jan 19, 2017 at 6:46 PM, Russell Jurney > wrote: > > > Siddharth, nice to hear from you. Great to hear! > > > > I'm just starting a consultancy called Data Syndrome around the book, > and I > > work from home, which doesn't put me in a great position to personally > host > > the meetup. If you need someone to organize it and to seek a venue, I can > > do that. How does that sound? I'm sure I could find someone to host it. > > > > When would be a good date, do you think? Late February? > > > > On Thu, Jan 19, 2017 at 5:19 PM, siddharth anand > > wrote: > > > > > Sounds like a great idea. We are looking for someone to host the next > > one.. > > > once one is announced, you can sign up as a speaker.. You are also > > welcome > > > to host a meet-up if you like. > > > -s > > > > > > On Thu, Jan 19, 2017 at 4:39 PM, Russell Jurney < > > russell.jurney@gmail.com> > > > wrote: > > > > > > > Hello! My name is Russell Jurney. I am a relatively new Airflow user > > and > > > > just joined the group. I am an Azkaban refugee, and an enemy of Oozie > > and > > > > the tyranny of XML. > > > > > > > > I wanted to tell you about my new book, out in pre-release, called > > Agile > > > > Data Science 2.0 (O'Reilly 2017). > > In > > > > the > > > > book, we use Airflow in chapter 2, Setup, in a way similar to the > > Airflow > > > > tutorial. Then, in chapter 8, Deploying Predictive Systems, we use > > > Airflow > > > > to deploy a predictive system built with PySpark and Spark MLlib. > > > > > > > > Some highlights in the code at http://github.com/rjurney/ > > > Agile_Data_Code_2 > > > > : > > > > > > > > - ch02/airflow_test.py > > > > > > > ch02/airflow_test.py> > > > > is > > > > a complete Airflow/PySpark tutorial along with > > > ch02/pyspark_task_one.py > > > > > > > ch02/pyspark_task_one.py> > > > > and > > > > ch02/pyspark_task_two.py > > > > > > > ch02/pyspark_task_two.py> > > > > - The airflow setup for chapter 8 is at ch08/airflow/setup.py > > > > > > > ch08/airflow/setup.py> > > > > . > > > > - The scripts that it operates on are in ch08/ > > > > > and > > > > show > > > > things like how to use '{{ ds }}' and other parameters to hook > your > > > > scripts > > > > into 'airflow backfill' and other features. > > > > - ch08/make_predictions.py > > > > > > > ch08/make_predictions.py> > > > > shows > > > > how to setup a PySpark environment in a script in a way that can > > work > > > > with > > > > Airflow. > > > > > > > > If there is any interest, I would love to present on something like > > > > "Building Predictive Systems with Spark and Airflow" at an upcoming > > > Airflow > > > > meetup. > > > > > > > > Thanks! > > > > -- > > > > Russell Jurney twitter.com/rjurney russell.jurney@gmail.com > relato.io > > > > > > > > > > > > > > > -- > > Russell Jurney twitter.com/rjurney russell.jurney@gmail.com relato.io > > > --94eb2c1239ea2e1d850546812240--