Return-Path: X-Original-To: apmail-ignite-user-archive@minotaur.apache.org Delivered-To: apmail-ignite-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 00B1F18BFF for ; Fri, 4 Mar 2016 04:00:16 +0000 (UTC) Received: (qmail 84300 invoked by uid 500); 4 Mar 2016 04:00:15 -0000 Delivered-To: apmail-ignite-user-archive@ignite.apache.org Received: (qmail 84255 invoked by uid 500); 4 Mar 2016 04:00:15 -0000 Mailing-List: contact user-help@ignite.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@ignite.apache.org Delivered-To: mailing list user@ignite.apache.org Received: (qmail 84245 invoked by uid 99); 4 Mar 2016 04:00:15 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Mar 2016 04:00:15 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 619291A028E for ; Fri, 4 Mar 2016 04:00:15 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.972 X-Spam-Level: X-Spam-Status: No, score=0.972 tagged_above=-999 required=6.31 tests=[RCVD_IN_DNSWL_NONE=-0.0001, SPF_SOFTFAIL=0.972] autolearn=disabled Received: from mx2-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id Emi2qM9YD2d2 for ; Fri, 4 Mar 2016 04:00:14 +0000 (UTC) Received: from mbob.nabble.com (mbob.nabble.com [162.253.133.15]) by mx2-lw-eu.apache.org (ASF Mail Server at mx2-lw-eu.apache.org) with ESMTP id A60C85FAC9 for ; Fri, 4 Mar 2016 04:00:12 +0000 (UTC) Received: from malf.nabble.com (unknown [162.253.133.59]) by mbob.nabble.com (Postfix) with ESMTP id 94AEF2223043 for ; Thu, 3 Mar 2016 19:51:44 -0800 (PST) Date: Thu, 3 Mar 2016 19:47:38 -0800 (PST) From: DmitryB To: user@ignite.apache.org Message-ID: <1457063258992-3363.post@n6.nabble.com> In-Reply-To: References: <1456936219107-3343.post@n6.nabble.com> Subject: Re: index and query org.apache.ignite.spark.IgniteRDD[String,org.apache.spark.sql.Row] MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Hi Andrey, Thanks a lots for your help. Unfortunately, i can not use case classes, because a schema information is only available at runtime; to make it more clear let me add more details. suppose that i have a very big data set (~500 Tb) which is stored in AWS s3 in a parquet format; Using spark, i can process (filter + join) it and reduce size down to ~200 -500 Gb; resulted dataset i would like to save in ignite cache using IgniteRdd and create indexes for a particular set of fields which will be used later for running queries (filter, join, aggregations); My assumption is that having this result dataset in ignite + indexes would help to improve the performance comparing to using spark DataFrame (persisted); Unfortunately, the resulted dataset schema can vary with great number of variations; So, it seems impossible to describe all of them with case classes; This is why an approach to store spark.sql.row + describe query fields and indexes using QueryEntity would be preferable; Thanks to your explanation, i see that this approach doesn't works; Another solutions that is spinning in my head is to generate case classes dynamically (at runtime) based on spark data frame schema, then map sql.rows to RDD[generated_case_class], describe ignite query and index fields using QueryEntity, create IgniteContext for generated case class; Im not sure that this approach is even possible, so i would like to ask for your opinion before i go deeper; Will be very grateful for advice Best regards, Dmitry -- View this message in context: http://apache-ignite-users.70518.x6.nabble.com/index-and-query-org-apache-ignite-spark-IgniteRDD-String-org-apache-spark-sql-Row-tp3343p3363.html Sent from the Apache Ignite Users mailing list archive at Nabble.com.