systemml-dev mailing list archives

From Mike Dusenberry <dusenberr...@gmail.com>
Subject Re: GSoc 2017
Date Fri, 06 Jan 2017 21:14:38 GMT
Hi Krishna!  Welcome, and thanks for your interest!

We would definitely be excited to collaborate with you on a GSOC project.
We've started another thread to discuss possible new proposals, and we
would also be quite interested in any particular proposal that you might
like to generate tailored towards your interests.  Copied from the other
thread, some possible ideas could include: building out a full ML demo to
solve a real, large-scale problem that would benefit from a distributed
approach; overall performance improvements that address a full class, or
wider area, of ML algorithms, rather than a single, specific script;
infrastructure for [performance] testing, and identification of wide areas
of improvement; helping with building out fully-featured, clean,
well-tested DSLs in Python & Scala (we've started, but it would be good to
continue stressing them -- we could even aim to replace DML with the DSLs);
etc.  Overall, we want to improve the ability of the user to work on a wide
range of large-scale, distributed ML problems in a simple and easy manner
on top of Spark.

In the meantime, you could explore our recent open issues [1] and even
begin discussions or contributions on any of the items.  You could also
view our recent roadmap discussion thread on the mailing list, starting
with the first email [2]:

[1]:
https://issues.apache.org/jira/issues/?jql=project%20%3D%20SYSTEMML%20AND%20resolution%20%3D%20Unresolved%20ORDER%20BY%20updated%20DESC%2C%20priority%20DESC
[2]:
http://mail-archives.apache.org/mod_mbox/incubator-systemml-dev/201701.mbox/%3C9eb780f0-ff28-c702-117c-bad74059930d@gmail.com%3E


- Mike

--

Michael W. Dusenberry
GitHub: github.com/dusenberrymw
LinkedIn: linkedin.com/in/mikedusenberry

On Fri, Jan 6, 2017 at 12:34 PM, Luciano Resende <luckbr1975@gmail.com>
wrote:

> As some folks have described on this thread, it would be great to get you
> familiarized with SystemML.
>
> In parallel, I would look for a mentor from the active committer list and
> start working on a project proposal which could be based on the recent
> Roadmap discussion [1].
>
> If you are looking for some guidance on how Apache participates in GSoC,
> take a look at the following resources [2] and [3], and don't hesitate to
> ask questions here.
>
>
> [1] https://www.mail-archive.com/dev@systemml.incubator.apache.org/msg01199.html
> [2] http://community.apache.org/gsoc.html
> [3] http://www.slideshare.net/luckbr1975/how-mentoring-can-help-you-start-contributing-to-open-source
>
> On Thu, Jan 5, 2017 at 3:15 PM, Krishna Kalyan <krishnakalyan3@gmail.com>
> wrote:
>
> > Hello Developers,
> > I am Krishna, currently a 2nd-year Master's student (MSc in Data Mining)
> > in Barcelona, studying at Université Polytechnique de Catalogne.
> > I am interested in contributing to SystemML this year under the GSoC
> > program. Could anyone please guide me on how to go about it? (I
> > understand that I need to write a proposal.)
> >
> > Related Experience:
> > My master's is mostly focussed on data mining techniques. Before my
> > master's, I was a data engineer with IBM (India). I was responsible for
> > managing a 50-node Hadoop cluster for more than a year. Most of my time
> > was spent optimising and writing ETL (Apache Pig) jobs.
> >
> > I am most comfortable with Python, followed by R and Scala.
> >
> > My Webpage
> > kkalyan.in
> >
> > My Spark Pull Requests
> > https://github.com/apache/spark/pulls?utf8=%E2%9C%93&q=is%3Apr%20author%3Akrishnakalyan3%20
> >
> > Thank you so much,
> > Krishna
> >
>
>
>
> --
> Luciano Resende
> http://twitter.com/lresende1975
> http://lresende.blogspot.com/
>
