spark-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From enzo <e...@smartinsightsfromdata.com>
Subject Fwd: Question on Spark's graph libraries roadmap
Date Mon, 13 Mar 2017 20:47:02 GMT
Please see this email  trail:  no answer so far on the user@spark board.  Trying the developer
board for better luck

The question:

I am a bit confused by the current roadmap for graph and graph analytics in Apache Spark.

I understand that we have had for some time two libraries (the following is my understanding
- please amend as appropriate!):

. GraphX, part of Spark project.  This library is based on RDD and it is only accessible via
Scala.  It doesn’t look that this library has been enhanced recently.
. GraphFrames, independent (at the moment?) library for Spark.  This library is based on Spark
DataFrames and accessible by Scala & Python. Last commit on GitHub was 2 months ago.

GraphFrames cam about with the promise at some point to be integrated in Apache Spark.

I can see other projects coming up with interesting libraries and ideas (e.g. Graphulo on
Accumulo, a new project with the goal of implementing the GraphBlas building blocks for graph
algorithms on top of Accumulo).

Where is Apache Spark going?

Where are graph libraries in the roadmap?



Thanks for any clarity brought to this matter.

Thanks Enzo

> Begin forwarded message:
> 
> From: "Md. Rezaul Karim" <rezaul.karim@insight-centre.org>
> Subject: Re: Question on Spark's graph libraries
> Date: 10 March 2017 at 13:13:15 CET
> To: Robin East <robin.east@xense.co.uk>
> Cc: enzo <enzo@smartinsightsfromdata.com>, spark users <user@spark.apache.org>
> 
> +1
> 
> Regards,
> _________________________________
> Md. Rezaul Karim, BSc, MSc
> PhD Researcher, INSIGHT Centre for Data Analytics 
> National University of Ireland, Galway
> IDA Business Park, Dangan, Galway, Ireland
> Web: http://www.reza-analytics.eu/index.html <http://139.59.184.114/index.html>
> 
> On 10 March 2017 at 12:10, Robin East <robin.east@xense.co.uk <mailto:robin.east@xense.co.uk>>
wrote:
> I would love to know the answer to that too.
> -------------------------------------------------------------------------------
> Robin East
> Spark GraphX in Action Michael Malak and Robin East
> Manning Publications Co.
> http://www.manning.com/books/spark-graphx-in-action <http://www.manning.com/books/spark-graphx-in-action>
> 
> 
> 
> 
> 
>> On 9 Mar 2017, at 17:42, enzo <enzo@smartinsightsfromdata.com <mailto:enzo@smartinsightsfromdata.com>>
wrote:
>> 
>> I am a bit confused by the current roadmap for graph and graph analytics in Apache
Spark.
>> 
>> I understand that we have had for some time two libraries (the following is my understanding
- please amend as appropriate!):
>> 
>> . GraphX, part of Spark project.  This library is based on RDD and it is only accessible
via Scala.  It doesn’t look that this library has been enhanced recently.
>> . GraphFrames, independent (at the moment?) library for Spark.  This library is based
on Spark DataFrames and accessible by Scala & Python. Last commit on GitHub was 2 months
ago.
>> 
>> GraphFrames cam about with the promise at some point to be integrated in Apache Spark.
>> 
>> I can see other projects coming up with interesting libraries and ideas (e.g. Graphulo
on Accumulo, a new project with the goal of implementing the GraphBlas building blocks for
graph algorithms on top of Accumulo).
>> 
>> Where is Apache Spark going?
>> 
>> Where are graph libraries in the roadmap?
>> 
>> 
>> 
>> Thanks for any clarity brought to this matter.
>> 
>> Enzo
> 
> 


Mime
View raw message