From: Maximiliano Felice
Date: Wed, 6 Jun 2018 14:42:48 -0700
Subject: Re: Revisiting Online serving of Spark models?
Mailing list: dev@spark.apache.org
To: Nick Pentreath
Cc: Chris Fregly, Felix Cheung, Holden Karau, Joseph Bradley, Leif Walsh, Saikat Kanjilal, dev

Hi!

Do we meet at the entrance?

See you

On Tue, Jun 5, 2018 at 3:07 PM, Nick Pentreath wrote:

> I will aim to join up at 4pm tomorrow (Wed) too. Look forward to it.
>
> On Sun, 3 Jun 2018 at 00:24 Holden Karau wrote:
>
>> On Sat, Jun 2, 2018 at 8:39 PM, Maximiliano Felice <
>> maximilianofelice@gmail.com> wrote:
>>
>>> Hi!
>>>
>>> We're already in San Francisco waiting for the summit. We even think
>>> that we spotted @holdenk this afternoon.
>>>
>> Unless you happened to be walking by my garage, probably not super likely;
>> I spent the day working on scooters/motorcycles (my style is a little less
>> unique in SF :)). Also, if you see me, feel free to say hi unless I look like
>> I haven't had my first coffee of the day. Love chatting with folks IRL :)
>>
>>> @chris, we're really interested in the Meetup you're hosting. My team
>>> will probably join it from the beginning if you have room for us, and I'll
>>> join it later after discussing the topics on this thread. I'll send you an
>>> email regarding this request.
>>>
>>> Thanks
>>>
>>> On Fri, Jun 1, 2018 at 7:26 AM, Saikat Kanjilal wrote:
>>>
>>>> @Chris This sounds fantastic, please send summary notes for Seattle
>>>> folks.
>>>>
>>>> @Felix I work in downtown Seattle; I'm wondering if we should host a tech
>>>> meetup around model serving in Spark at my work or somewhere else close.
>>>> Thoughts?
>>>> I'm actually in the midst of building microservices to manage
>>>> models, and when I say models I mean much more than machine learning
>>>> models (think OR and process models as well).
>>>>
>>>> Regards
>>>>
>>>> Sent from my iPhone
>>>>
>>>> On May 31, 2018, at 10:32 PM, Chris Fregly wrote:
>>>>
>>>> Hey everyone!
>>>>
>>>> @Felix: thanks for putting this together. i sent some of you a quick
>>>> calendar event - mostly for me, so i don't forget! :)
>>>>
>>>> Coincidentally, this is the focus of the *Advanced Spark and
>>>> TensorFlow Meetup* @5:30pm on June 6th (same night) here in SF!
>>>>
>>>> Everybody is welcome to come. Here's the link to the meetup, which
>>>> includes the signup link:
>>>> *https://www.meetup.com/Advanced-Spark-and-TensorFlow-Meetup/events/250924195/*
>>>>
>>>> We have an awesome lineup of speakers covering a lot of deep, technical
>>>> ground.
>>>>
>>>> For those who can't attend in person, we'll be broadcasting live - and
>>>> posting the recording afterward.
>>>>
>>>> All details are in the meetup link above...
>>>>
>>>> @holden/felix/nick/joseph/maximiliano/saikat/leif: you're more than
>>>> welcome to give a talk. I can move things around to make room.
>>>>
>>>> @joseph: I'd personally like an update on the direction of the
>>>> Databricks proprietary ML Serving export format, which is similar to PMML
>>>> but not a standard in any way.
>>>>
>>>> Also, the Databricks ML Serving Runtime is only available to Databricks
>>>> customers. This seems in conflict with the community efforts described
>>>> here. Can you comment on behalf of Databricks?
>>>>
>>>> Look forward to your response, joseph.
>>>>
>>>> See you all soon!
>>>>
>>>> --
>>>>
>>>> *Chris Fregly*
>>>> Founder @ *PipelineAI* (100,000 Users)
>>>> Organizer @ *Advanced Spark and TensorFlow Meetup* (85,000 Global Members)
>>>>
>>>> *San Francisco - Chicago - Austin - Washington DC - London - Dusseldorf*
>>>> *Try our PipelineAI Community Edition with GPUs and TPUs!!*
>>>>
>>>>
>>>> On May 30, 2018, at 9:32 AM, Felix Cheung wrote:
>>>>
>>>> Hi!
>>>>
>>>> Thank you! Let's meet then:
>>>>
>>>> June 6, 4pm
>>>>
>>>> Moscone West Convention Center
>>>> 800 Howard Street, San Francisco, CA 94103
>>>>
>>>> Ground floor (outside of the conference area - should be available for
>>>> all) - we will meet and decide where to go.
>>>>
>>>> (Would not send an invite because that would be too much noise for dev@)
>>>>
>>>> To paraphrase Joseph, we will use this to kick off the discussion and
>>>> post notes after and follow up online. As for Seattle, I would be very
>>>> interested to meet in person later and discuss ;)
>>>>
>>>>
>>>> _____________________________
>>>> From: Saikat Kanjilal
>>>> Sent: Tuesday, May 29, 2018 11:46 AM
>>>> Subject: Re: Revisiting Online serving of Spark models?
>>>> To: Maximiliano Felice
>>>> Cc: Felix Cheung, Holden Karau <holden@pigscanfly.ca>, Joseph Bradley,
>>>> Leif Walsh, dev
>>>>
>>>> Would love to join, but I am in Seattle; thoughts on how to make this
>>>> work?
>>>>
>>>> Regards
>>>>
>>>> Sent from my iPhone
>>>>
>>>> On May 29, 2018, at 10:35 AM, Maximiliano Felice <
>>>> maximilianofelice@gmail.com> wrote:
>>>>
>>>> Big +1 to a meeting with fresh air.
>>>>
>>>> Could anyone send the invites? I don't really know the place
>>>> Holden is talking about.
>>>>
>>>> 2018-05-29 14:27 GMT-03:00 Felix Cheung:
>>>>
>>>>> You had me at blue bottle!
>>>>>
>>>>> _____________________________
>>>>> From: Holden Karau
>>>>> Sent: Tuesday, May 29, 2018 9:47 AM
>>>>> Subject: Re: Revisiting Online serving of Spark models?
>>>>> To: Felix Cheung
>>>>> Cc: Saikat Kanjilal, Maximiliano Felice <
>>>>> maximilianofelice@gmail.com>, Joseph Bradley, Leif Walsh, dev
>>>>>
>>>>> I'm down for that. We could all go for a walk, maybe to the Blue Bottle
>>>>> at Mint Plaza, and grab coffee (if the weather holds, have our design
>>>>> meeting outside :p)?
>>>>>
>>>>> On Tue, May 29, 2018 at 9:37 AM, Felix Cheung <
>>>>> felixcheung_m@hotmail.com> wrote:
>>>>>
>>>>>> Bump.
>>>>>>
>>>>>> ------------------------------
>>>>>> *From:* Felix Cheung
>>>>>> *Sent:* Saturday, May 26, 2018 1:05:29 PM
>>>>>> *To:* Saikat Kanjilal; Maximiliano Felice; Joseph Bradley
>>>>>> *Cc:* Leif Walsh; Holden Karau; dev
>>>>>> *Subject:* Re: Revisiting Online serving of Spark models?
>>>>>>
>>>>>> Hi! How about we meet the community and discuss on June 6 at 4pm at
>>>>>> (near) the Summit?
>>>>>>
>>>>>> (I propose we meet at the venue entrance so we can accommodate
>>>>>> people who might not be in the conference.)
>>>>>>
>>>>>> ------------------------------
>>>>>> *From:* Saikat Kanjilal
>>>>>> *Sent:* Tuesday, May 22, 2018 7:47:07 AM
>>>>>> *To:* Maximiliano Felice
>>>>>> *Cc:* Leif Walsh; Felix Cheung; Holden Karau; Joseph Bradley; dev
>>>>>> *Subject:* Re: Revisiting Online serving of Spark models?
>>>>>>
>>>>>> I'm in the exact same boat as Maximiliano, have use cases as well
>>>>>> for model serving, and would love to join this discussion.
>>>>>>
>>>>>> Sent from my iPhone
>>>>>>
>>>>>> On May 22, 2018, at 6:39 AM, Maximiliano Felice <
>>>>>> maximilianofelice@gmail.com> wrote:
>>>>>>
>>>>>> Hi!
>>>>>>
>>>>>> I don't usually write a lot on this list, but I keep up to date with
>>>>>> the discussions and I'm a heavy user of Spark. This topic caught my
>>>>>> attention, as we're currently facing this issue at work. I'm attending
>>>>>> the summit and was wondering if it would be possible for me to join that
>>>>>> meeting.
I might be able to share some helpful use cases and ideas.
>>>>>>
>>>>>> Thanks,
>>>>>> Maximiliano Felice
>>>>>>
>>>>>> On Tue, May 22, 2018 at 9:14 AM, Leif Walsh wrote:
>>>>>>
>>>>>>> I'm with you on JSON being more readable than Parquet, but we've had
>>>>>>> success using pyarrow's parquet reader and have been quite happy with it so
>>>>>>> far. If your target is Python (and probably if not now, then soon, R), you
>>>>>>> should look into it.
>>>>>>>
>>>>>>> On Mon, May 21, 2018 at 16:52 Joseph Bradley wrote:
>>>>>>>
>>>>>>>> Regarding model reading and writing, I'll give quick thoughts here:
>>>>>>>> * Our approach was to use the same format but write JSON instead of
>>>>>>>> Parquet. It's easier to parse JSON without Spark, and using the same
>>>>>>>> format simplifies the architecture. Plus, some people want to check files
>>>>>>>> into version control, and JSON is nice for that.
>>>>>>>> * The reader/writer APIs could be extended to take format
>>>>>>>> parameters (just like DataFrame readers/writers) to handle JSON (and maybe,
>>>>>>>> eventually, handle Parquet in the online serving setting).
>>>>>>>>
>>>>>>>> This would be a big project, so proposing a SPIP might be best. If
>>>>>>>> people are around at the Spark Summit, that could be a good time to meet up
>>>>>>>> & then post notes back to the dev list.
>>>>>>>>
>>>>>>>> On Sun, May 20, 2018 at 8:11 PM, Felix Cheung <
>>>>>>>> felixcheung_m@hotmail.com> wrote:
>>>>>>>>
>>>>>>>>> Specifically, I'd like to bring part of the discussion to Model and
>>>>>>>>> PipelineModel, and the various ModelReader and SharedReadWrite
>>>>>>>>> implementations that rely on SparkContext. This is a big blocker on reusing
>>>>>>>>> trained models outside of Spark for online serving.
>>>>>>>>>
>>>>>>>>> What's the next step? Would folks be interested in getting
>>>>>>>>> together to discuss/get some feedback?
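[Editor's note: Joseph's point above - that a JSON export is easy to parse and score without Spark - can be sketched in a few lines. This is a minimal, hypothetical illustration only; the field names and model layout below are invented and are not MLlib's actual persisted format.]

```python
import json
import math

# Hypothetical JSON export of a fitted logistic-regression model.
# MLlib's real on-disk layout differs; this only shows why a text
# format is easy to consume without a SparkContext.
model_json = json.dumps({
    "class": "LogisticRegressionModel",
    "coefficients": [0.25, -0.5, 1.0],
    "intercept": 0.1,
})

def predict(model_str, features):
    """Score a single row using only the standard library."""
    model = json.loads(model_str)
    margin = model["intercept"] + sum(
        w * x for w, x in zip(model["coefficients"], features)
    )
    return 1.0 / (1.0 + math.exp(-margin))  # logistic (sigmoid) link

print(predict(model_json, [1.0, 2.0, 3.0]))
```

A serving process built this way needs no Spark JARs at all, which is exactly the deployment-footprint argument made in this thread.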
>>>>>>>>>
>>>>>>>>> _____________________________
>>>>>>>>> From: Felix Cheung
>>>>>>>>> Sent: Thursday, May 10, 2018 10:10 AM
>>>>>>>>> Subject: Re: Revisiting Online serving of Spark models?
>>>>>>>>> To: Holden Karau <holden@pigscanfly.ca>, Joseph Bradley <
>>>>>>>>> joseph@databricks.com>
>>>>>>>>> Cc: dev
>>>>>>>>>
>>>>>>>>> Huge +1 on this!
>>>>>>>>>
>>>>>>>>> ------------------------------
>>>>>>>>> *From:* holden.karau@gmail.com on behalf of Holden Karau
>>>>>>>>> *Sent:* Thursday, May 10, 2018 9:39:26 AM
>>>>>>>>> *To:* Joseph Bradley
>>>>>>>>> *Cc:* dev
>>>>>>>>> *Subject:* Re: Revisiting Online serving of Spark models?
>>>>>>>>>
>>>>>>>>> On Thu, May 10, 2018 at 9:25 AM, Joseph Bradley <
>>>>>>>>> joseph@databricks.com> wrote:
>>>>>>>>>
>>>>>>>>>> Thanks for bringing this up, Holden! I'm a strong supporter of
>>>>>>>>>> this.
>>>>>>>>>>
>>>>>>>>> Awesome! I'm glad other folks think something like this belongs
>>>>>>>>> in Spark.
>>>>>>>>>
>>>>>>>>>> This was one of the original goals for mllib-local: to have local
>>>>>>>>>> versions of MLlib models which could be deployed without the big Spark JARs
>>>>>>>>>> and without a SparkContext or SparkSession. There are related commercial
>>>>>>>>>> offerings like this :) but the overhead of maintaining those offerings is
>>>>>>>>>> pretty high. Building good APIs within MLlib to avoid copying logic across
>>>>>>>>>> libraries will be well worth it.
>>>>>>>>>>
>>>>>>>>>> We've talked about this need at Databricks and have also been
>>>>>>>>>> syncing with the creators of MLeap. It'd be great to get this
>>>>>>>>>> functionality into Spark itself. Some thoughts:
>>>>>>>>>> * It'd be valuable to have this go beyond adding transform()
>>>>>>>>>> methods taking a Row to the current Models.
>>>>>>>>>> Instead, it would be ideal to
>>>>>>>>>> have local, lightweight versions of models in mllib-local, outside of the
>>>>>>>>>> main mllib package (for easier deployment with smaller & fewer
>>>>>>>>>> dependencies).
>>>>>>>>>> * Supporting Pipelines is important. For this, it would be ideal
>>>>>>>>>> to utilize elements of Spark SQL, particularly Rows and Types, which could
>>>>>>>>>> be moved into a local sql package.
>>>>>>>>>> * This architecture may currently require some awkward APIs to
>>>>>>>>>> have model prediction logic in mllib-local, local model classes in
>>>>>>>>>> mllib-local, and regular (DataFrame-friendly) model classes in mllib. We
>>>>>>>>>> might find it helpful to break some DeveloperApis in Spark 3.0 to
>>>>>>>>>> facilitate this architecture while making it feasible for 3rd-party
>>>>>>>>>> developers to extend MLlib APIs (especially in Java).
>>>>>>>>>>
>>>>>>>>> I agree this could be interesting, and it feeds into the other
>>>>>>>>> discussion around when (or if) we should be considering Spark 3.0.
>>>>>>>>> I _think_ we could probably do it with optional traits people
>>>>>>>>> could mix in to avoid breaking the current APIs, but I could be wrong on
>>>>>>>>> that point.
>>>>>>>>>
>>>>>>>>>> * It could also be worth discussing local DataFrames. They might
>>>>>>>>>> not be as important as per-Row transformations, but they would be helpful
>>>>>>>>>> for batching for higher throughput.
>>>>>>>>>>
>>>>>>>>> That could be interesting as well.
>>>>>>>>>
>>>>>>>>>> I'll be interested to hear others' thoughts too!
>>>>>>>>>>
>>>>>>>>>> Joseph
>>>>>>>>>>
>>>>>>>>>> On Wed, May 9, 2018 at 7:18 AM, Holden Karau <
>>>>>>>>>> holden@pigscanfly.ca> wrote:
>>>>>>>>>>
>>>>>>>>>>> Hi y'all,
>>>>>>>>>>>
>>>>>>>>>>> With the renewed interest in ML in Apache Spark, now seems like
>>>>>>>>>>> as good a time as any to revisit the online serving situation in Spark ML.
>>>>>>>>>>> DB & others have done some excellent work moving a lot of the
>>>>>>>>>>> necessary tools into a local linear algebra package that doesn't depend on
>>>>>>>>>>> having a SparkContext.
>>>>>>>>>>>
>>>>>>>>>>> There are a few different commercial and non-commercial
>>>>>>>>>>> solutions around this, but currently our individual transform/predict
>>>>>>>>>>> methods are private, so they either need to copy or re-implement (or put
>>>>>>>>>>> themselves in org.apache.spark) to access them. How would folks feel about
>>>>>>>>>>> adding a new trait for ML pipeline stages to expose transformation of
>>>>>>>>>>> single-element inputs (or local collections) that could be optionally
>>>>>>>>>>> implemented by stages which support this? That way we can have less
>>>>>>>>>>> copy-and-paste code possibly getting out of sync with our model training.
>>>>>>>>>>>
>>>>>>>>>>> I think continuing to have online serving grow in different
>>>>>>>>>>> projects is probably the right path forward (folks have different needs),
>>>>>>>>>>> but I'd love to see us make it simpler for other projects to build
>>>>>>>>>>> reliable serving tools.
>>>>>>>>>>>
>>>>>>>>>>> I realize this maybe puts some of the folks in an awkward
>>>>>>>>>>> position with their own commercial offerings, but hopefully if we make it
>>>>>>>>>>> easier for everyone, the commercial vendors can benefit as well.
>>>>>>>>>>>
>>>>>>>>>>> Cheers,
>>>>>>>>>>>
>>>>>>>>>>> Holden :)
>>>>>>>>>>>
>>>>>>>>>>> --
>>>>>>>>>>> Twitter: https://twitter.com/holdenkarau
>>>>>>>>>>
>>>>>>>>>> --
>>>>>>>>>> Joseph Bradley
>>>>>>>>>> Software Engineer - Machine Learning
>>>>>>>>>> Databricks, Inc.
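[Editor's note: Holden's "optional trait" idea would presumably land in Scala, but the shape of it can be illustrated with a small, invented Python analogue - none of these class or method names exist in MLlib; this is only a sketch of opt-in single-element transformation with a feature test.]

```python
from abc import ABC, abstractmethod

class Transformer(ABC):
    """Stand-in for an ML pipeline stage (normally DataFrame -> DataFrame)."""
    @abstractmethod
    def transform(self, rows):
        ...

class LocalTransformSupport(ABC):
    """Optional mixin: stages that can score a single row without Spark."""
    @abstractmethod
    def transform_row(self, row):
        ...

class Scaler(Transformer, LocalTransformSupport):
    """Example stage that opts in to local serving."""
    def __init__(self, factor):
        self.factor = factor

    def transform(self, rows):
        # Batch path reuses the single-row logic, so the two can't drift.
        return [self.transform_row(r) for r in rows]

    def transform_row(self, row):
        return [x * self.factor for x in row]

def serve(stage, row):
    # Serving layers feature-test instead of copying model internals.
    if isinstance(stage, LocalTransformSupport):
        return stage.transform_row(row)
    raise TypeError(f"{type(stage).__name__} does not support local serving")

print(serve(Scaler(2.0), [1.0, 2.0]))  # prints [2.0, 4.0]
```

Because the mixin is optional, existing stages keep their current API, which matches Holden's point about avoiding breaking changes.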
>>>>>>>>>
>>>>>>>>> --
>>>>>>>>> Twitter: https://twitter.com/holdenkarau
>>>>>>>>
>>>>>>>> --
>>>>>>>> Joseph Bradley
>>>>>>>> Software Engineer - Machine Learning
>>>>>>>> Databricks, Inc.
>>>>>>>
>>>>>>> --
>>>>>>> Cheers,
>>>>>>> Leif
>>>>>
>>>>> --
>>>>> Twitter: https://twitter.com/holdenkarau
>>
>> --
>> Twitter: https://twitter.com/holdenkarau