From: Valentin Kulichenko
Date: Wed, 11 Oct 2017 20:03:13 -0700
Subject: Re: Integration of Spark and Ignite. Prototype.
To: Николай Ижиков
Cc: dev@ignite.apache.org

Hi Nikolay,

Sorry for the delay on this, I got a little swamped lately. I will do my
best to review the code this week.

-Val

On Mon, Oct 9, 2017 at 11:48 AM, Николай Ижиков wrote:

> Hello, Valentin.
>
> Did you have a chance to look at my changes?
>
> I think I have now implemented almost all of the required features.
> I want to run some performance tests to ensure my implementation works
> properly with a significant amount of data.
> And I definitely need some feedback on my changes.
>
>
> 2017-10-09 18:45 GMT+03:00 Николай Ижиков :
>
>> Hello, guys.
>>
>> Which version of Spark do we want to use?
>>
>> 1. Currently, Ignite depends on Spark 2.1.0.
>>
>>    * Can be run on JDK 7.
>>    * Still supported: 2.1.2 will be released soon.
>>
>> 2. The latest Spark version is 2.2.0.
>>
>>    * Can be run only on JDK 8+.
>>    * Released Jul 11, 2017.
>>    * Already supported by major vendors (Amazon, for example).
>>
>> Note that in IGNITE-3084 I implement some internal Spark APIs,
>> so it will take some effort to switch between Spark 2.1 and 2.2.
>>
>>
>> 2017-09-27 2:20 GMT+03:00 Valentin Kulichenko <
>> valentin.kulichenko@gmail.com>:
>>
>>> I will review in the next few days.
>>>
>>> -Val
>>>
>>> On Tue, Sep 26, 2017 at 2:23 PM, Denis Magda wrote:
>>>
>>> > Hello Nikolay,
>>> >
>>> > This is good news. Finally this capability is coming to Ignite.
>>> >
>>> > Val, Vladimir, could you do a preliminary review?
>>> >
>>> > Answering your questions:
>>> >
>>> > 1. Yardstick should be enough for performance measurements. As a Spark
>>> > user, I will be curious to know what’s the point of this integration.
>>> > Probably we need to compare Spark + Ignite and Spark + Hive or Spark +
>>> > RDBMS cases.
>>> >
>>> > 2. If the Spark community is reluctant, let’s include the module in
>>> > the ignite-spark integration.
>>> >
>>> > --
>>> > Denis
>>> >
>>> > > On Sep 25, 2017, at 11:14 AM, Николай Ижиков wrote:
>>> > >
>>> > > Hello, guys.
>>> > >
>>> > > Currently, I’m working on integration between Spark and Ignite [1].
>>> > >
>>> > > So far, I have implemented the following:
>>> > > * Ignite DataSource implementation (IgniteRelationProvider).
>>> > > * DataFrame support for Ignite SQL tables.
>>> > > * IgniteCatalog implementation for transparent resolving of
>>> > >   Ignite’s SQL tables.
>>> > >
>>> > > The implementation can be found in PR [2].
>>> > > It would be great if someone could provide feedback on the prototype.
>>> > >
>>> > > I made some examples in the PR so you can see how the API is
>>> > > supposed to be used [3], [4].
>>> > >
>>> > > I need some advice. Can you help me?
>>> > >
>>> > > 1. How should this PR be tested?
>>> > >
>>> > > Of course, I need to provide some unit tests. But what about
>>> > > scalability tests, etc.?
>>> > > Maybe we need some Yardstick benchmark or similar?
>>> > > What are your thoughts?
>>> > > Which scenarios should I consider in the first place?
>>> > >
>>> > > 2. Should we provide the Spark Catalog implementation inside the
>>> > > Ignite codebase?
>>> > >
>>> > > The current implementation of the Spark Catalog is based on
>>> > > *internal Spark API*.
>>> > > The Spark community seems not interested in making the Catalog API
>>> > > public or including the Ignite Catalog in the Spark code base [5], [6].
>>> > >
>>> > > *Should we include the Spark internal API implementation inside the
>>> > > Ignite code base?*
>>> > >
>>> > > Or should we consider including the Catalog implementation in some
>>> > > external module that will be created and released outside Ignite?
>>> > > (We can still support and develop it inside the Ignite community.)
>>> > >
>>> > > [1] https://issues.apache.org/jira/browse/IGNITE-3084
>>> > > [2] https://github.com/apache/ignite/pull/2742
>>> > > [3] https://github.com/apache/ignite/pull/2742/files#diff-f4ff509cef3018e221394474775e0905
>>> > > [4] https://github.com/apache/ignite/pull/2742/files#diff-f2b670497d81e780dfd5098c5dd8a89c
>>> > > [5] http://apache-spark-developers-list.1001551.n3.nabble.com/Spark-Core-Custom-Catalog-Integration-between-Apache-Ignite-and-Apache-Spark-td22452.html
>>> > > [6] https://issues.apache.org/jira/browse/SPARK-17767
>>> > >
>>> > > --
>>> > > Nikolay Izhikov
>>> > > NIzhikov.dev@gmail.com
>>> >
>>
>> --
>> Nikolay Izhikov
>> NIzhikov.dev@gmail.com
>>
>
> --
> Nikolay Izhikov
> NIzhikov.dev@gmail.com