spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Kai Jiang (JIRA)" <>
Subject [jira] [Commented] (SPARK-13489) GSoC 2016 project ideas for MLlib
Date Thu, 25 Feb 2016 12:46:18 GMT


Kai Jiang commented on SPARK-13489:

Here is a [post|]
I published on dev mailing list. (paste it here)
Hi All Spark Devs,

I am Kai Jiang, a master student majoring in Computer Science. Machine Learning and Distributed

System are my interests. Due to that, I've been contributing to Spark codebase since last
year. My
Pull Requests are related to MLlib, PySpark and SQL.(

Last time, I was impressed by the MechCoder's project mentored by mengxr. This year, I look
to having a chance to do something interesting and want to extend my future contribution with
into a GSoC project. Thus, I was wondering if there are some specific ideas, issues or suggestions
regarding MLlib (mainly), SQL or others could be gathered into a project. After looking into
the MLlib 2.0
Roadmap, I found there are many issues I am interested in (i.e Python/SparkR API for ML, PMML
etc.). If community has other ideas, I am very willing to work on some issues before GSoC.

I will put here a link of my very rough draft proposal later.

> GSoC 2016 project ideas for MLlib
> ---------------------------------
>                 Key: SPARK-13489
>                 URL:
>             Project: Spark
>          Issue Type: Brainstorming
>          Components: ML
>            Reporter: Xiangrui Meng
>            Assignee: Xiangrui Meng
>            Priority: Minor
> I want to use this JIRA to collect some GSoC project ideas for MLlib. Ideally, the student
should have contributed to Spark. And the content of the project could be divided into small
functional pieces so that it won't get stalled if the mentor is temporarily unavailable.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message