systemml-dev mailing list archives

From dusenberr...@gmail.com
Subject Re: GSoc 2017
Date Thu, 12 Jan 2017 23:08:26 GMT
Yeah, helping to build out our Python DSL into a full replacement for the current "DML"
language would be great, and we'd be quite supportive!

-Mike

--

Mike Dusenberry
GitHub: github.com/dusenberrymw
LinkedIn: linkedin.com/in/mikedusenberry

Sent from my iPhone.


> On Jan 12, 2017, at 2:58 PM, fschueler@posteo.de wrote:
> 
> Hi Krishna,
> 
> cool to see that you're interested in SystemML!
> 
> From your list, I personally think that a) and d) would be well suited for projects,
> especially since a good Python DSL is a high priority.
> 
> We will apply as an organization to GSoC once organization applications are open (Jan.
> 19th), and I think we will find mentors for at least a) and d). If you already want to take
> a look at what is currently there, I suggest looking at our Python APIs and documentation.
> If you want to take on the DSL project, it might also be a good idea to look into the DML
> documentation and related papers to see what we need to support.
> 
> The proposals will probably circulate on the mailing list, too, so keep an eye on that :)
> 
> -Felix
> 
> On 12.01.2017 23:13, Krishna Kalyan wrote:
>> Hello All,
>> Thank you for your wonderful replies.
>> Tasks that I am interested in:
>> a) Support for Python DSLs
>> b) Python wrappers for all existing algorithms
>> c) GPU support
>> d) Perftest : automated performance tests of algorithms
>> I am also willing to work on the tasks that the SystemML community thinks are
>> important.
>> Regards,
>> Krishna
>> On Fri, Jan 6, 2017 at 10:14 PM, Mike Dusenberry <dusenberrymw@gmail.com>
>> wrote:
>>> Hi Krishna!  Welcome, and thanks for your interest!
>>> We would definitely be excited to collaborate with you on a GSOC project.
>>> We've started another thread to discuss possible new proposals, and we
>>> would also be quite interested in any particular proposal that you might
>>> like to generate tailored towards your interests.  Copied from the other
>>> thread, some possible ideas could include: building out a full ML demo to
>>> solve a real, large-scale problem that would benefit from a distributed
>>> approach; overall performance improvements that address a full class, or
>>> wider area, of ML algorithms, rather than a single, specific script;
>>> infrastructure for [performance] testing, and identification of wide areas
>>> of improvement; helping with building out fully-featured, clean,
>>> well-tested DSLs in Python & Scala (we've started, but it would be good to
>>> continue stressing them -- we could even aim to replace DML with the DSLs);
>>> etc.  Overall, we want to improve the ability of the user to work on a wide
>>> range of large-scale, distributed ML problems in a simple and easy manner
>>> on top of Spark.
>>> In the meantime, you could explore our recent open issues [1] and even
>>> begin discussions or contributions on any of the items.  You could also
>>> view our recent roadmap discussion thread on the mailing list, starting
>>> with the first email [2]:
>>> [1]:
>>> https://issues.apache.org/jira/issues/?jql=project%20%3D%20SYSTEMML%20AND%20resolution%20%3D%20Unresolved%20ORDER%20BY%20updated%20DESC%2C%20priority%20DESC
>>> [2]:
>>> http://mail-archives.apache.org/mod_mbox/incubator-systemml-dev/201701.mbox/%3C9eb780f0-ff28-c702-117c-bad74059930d@gmail.com%3E
>>> - Mike
>>> --
>>> Michael W. Dusenberry
>>> GitHub: github.com/dusenberrymw
>>> LinkedIn: linkedin.com/in/mikedusenberry
>>> On Fri, Jan 6, 2017 at 12:34 PM, Luciano Resende <luckbr1975@gmail.com>
>>> wrote:
>>> > As some folks have described on this thread, it would be great to get you
>>> > familiarized with SystemML.
>>> >
>>> > In parallel, I would look for a mentor from the active committer list and
>>> > start working on a project proposal which could be based on the recent
>>> > Roadmap discussion [1].
>>> >
>>> > If you are looking for some guidance on how Apache participates in GSoC,
>>> > take a look at the following resources [2] and [3], and don't hesitate to
>>> > ask questions here.
>>> >
>>> >
>>> > [1]
>>> > https://www.mail-archive.com/dev@systemml.incubator.apache.org/msg01199.html
>>> > [2] http://community.apache.org/gsoc.html
>>> > [3]
>>> > http://www.slideshare.net/luckbr1975/how-mentoring-can-help-you-start-contributing-to-open-source
>>> >
>>> > On Thu, Jan 5, 2017 at 3:15 PM, Krishna Kalyan <krishnakalyan3@gmail.com>
>>> > wrote:
>>> >
>>> > > Hello Developers,
>>> > > I am Krishna, currently a 2nd-year Master's student (MSc in Data Mining)
>>> > > in Barcelona, studying at Université Polytechnique de Catalogne.
>>> > > I am interested in contributing to SystemML this year under the GSoC
>>> > > program. Could anyone please guide me on how to go about it? (I understand
>>> > > that I need to write a proposal.)
>>> > >
>>> > > Related Experience:
>>> > > My Master's is mostly focused on data mining techniques. Before my
>>> > > Master's, I was a data engineer with IBM (India). I was responsible for
>>> > > managing a 50-node Hadoop cluster for more than a year. Most of my time was
>>> > > spent optimising and writing ETL (Apache Pig) jobs.
>>> > >
>>> > > I am the most comfortable with Python followed by R and Scala.
>>> > >
>>> > > My Webpage
>>> > > kkalyan.in
>>> > >
>>> > > My Spark Pull Requests
>>> > > https://github.com/apache/spark/pulls?utf8=%E2%9C%93&q=is%3Apr%20author%3Akrishnakalyan3%20
>>> > >
>>> > > Thank you so much,
>>> > > Krishna
>>> > >
>>> >
>>> >
>>> >
>>> > --
>>> > Luciano Resende
>>> > http://twitter.com/lresende1975
>>> > http://lresende.blogspot.com/
>>> >
