ignite-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Manish Mishra (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (IGNITE-4526) Add Spark Shared RDD examples
Date Fri, 06 Jan 2017 11:41:58 GMT

    [ https://issues.apache.org/jira/browse/IGNITE-4526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15804364#comment-15804364
] 

Manish Mishra commented on IGNITE-4526:
---------------------------------------


Hey [~dmagda] Is it ok to start with  a Scala example first. If yes, here is the package structure
that will follow for the first example: 
ignite/examples/src/main/org/apache/ignite/spark/examples
Does it look ok ? 

> Add Spark Shared RDD examples
> -----------------------------
>
>                 Key: IGNITE-4526
>                 URL: https://issues.apache.org/jira/browse/IGNITE-4526
>             Project: Ignite
>          Issue Type: Task
>            Reporter: Denis Magda
>            Assignee: Manish Mishra
>             Fix For: 2.0
>
>
> Spark Shared RDD functionality doesn't have its own examples. We need to add an example
that will do the following:
> - First Spark Worker: creation of a shared RDD and filling it in with data.
> - First Spark Worker: performing some native spark transformation with the RDD.
> - Second Spark Worker: connecting to the same shared RDD.
> - Second Spark Worker: execution of SQL query using Spark API and Ignite API. Show that
Ignite's query executes faster.
> The reason why the example should consist of two workers is to showcase one of the main
benefits of Ignite's RDDs - ability to share the state (RDD) amid different Spark workers
and processes.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message