From issues-return-70538-archive-asf-public=cust-asf.ponee.io@ignite.apache.org Fri Jul 27 15:31:03 2018 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx-eu-01.ponee.io (Postfix) with SMTP id C91C6180657 for ; Fri, 27 Jul 2018 15:31:02 +0200 (CEST) Received: (qmail 6503 invoked by uid 500); 27 Jul 2018 13:31:01 -0000 Mailing-List: contact issues-help@ignite.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ignite.apache.org Delivered-To: mailing list issues@ignite.apache.org Received: (qmail 6494 invoked by uid 99); 27 Jul 2018 13:31:01 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 27 Jul 2018 13:31:01 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 8AC3418071F for ; Fri, 27 Jul 2018 13:31:01 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -110.301 X-Spam-Level: X-Spam-Status: No, score=-110.301 tagged_above=-999 required=6.31 tests=[ENV_AND_HDR_SPF_MATCH=-0.5, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, USER_IN_DEF_SPF_WL=-7.5, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id MnvOQj5m9KzD for ; Fri, 27 Jul 2018 13:31:01 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id BB0515F1F7 for ; Fri, 27 Jul 2018 13:31:00 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 4DC59E25C8 for ; Fri, 27 Jul 2018 13:31:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 0F4D92775E for ; Fri, 27 Jul 2018 13:31:00 +0000 (UTC) Date: Fri, 27 Jul 2018 13:31:00 +0000 (UTC) From: "Stuart Macdonald (JIRA)" To: issues@ignite.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (IGNITE-9108) Spark DataFrames With Cache Key and Value Objects MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Stuart Macdonald created IGNITE-9108: ---------------------------------------- Summary: Spark DataFrames With Cache Key and Value Objects Key: IGNITE-9108 URL: https://issues.apache.org/jira/browse/IGNITE-9108 Project: Ignite Issue Type: New Feature Components: spark Reporter: Stuart Macdonald Add support for _key and _val columns within Ignite-provided Spark DataFram= es, which represent the cache key and value objects similar to the current = _key/_val column semantics in Ignite SQL. =C2=A0 If the cache key or value objects are standard SQL types (eg. String, Int, = etc) they will be represented as such in the DataFrame schema, otherwise th= ey are represented as Binary types encoded as either: 1. Ignite BinaryObjec= ts, in which case we'd need to supply a Spark Encoder implementation for Bi= naryObjects, eg: =C2=A0 {code:java} IgniteSparkSession session =3D ... Dataset dataFrame =3D ... Dataset valDataSet =3D dataFrame.select("_val_).as(session.bina= ryObjectEncoder(MyValClass.class)) {code} Or 2. Kryo-serialised versions of the objects, eg: =C2=A0 {code:java} Dataset dataFrame =3D ... DataSet dataSet =3D dataFrame.select("_val_).as(Encoders.kryo(M= yValClass.class)) {code} Option 1 would probably be more efficient but option 2 would be more idioma= tic Spark. =C2=A0 The rationale behind this is the same as the Ignite SQL _key and _val colum= ns: to allow access to the full cache objects from a SQL context. -- This message was sent by Atlassian JIRA (v7.6.3#76005)