carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CARBONDATA-308) Use CarbonInputFormat in CarbonScanRDD compute
Date Thu, 27 Oct 2016 04:07:59 GMT

    [ https://issues.apache.org/jira/browse/CARBONDATA-308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15610586#comment-15610586
] 

ASF GitHub Bot commented on CARBONDATA-308:
-------------------------------------------

GitHub user jackylk opened a pull request:

    https://github.com/apache/incubator-carbondata/pull/262

    [CARBONDATA-308] [WIP] Use CarbonInputFormat in CarbonScanRDD compute

    Use CarbonInputFormat in CarbonScanRDD compute function
    
    1. In driver side, only getSplit is required, so only filter condition is required, no
need to create full QueryModel object, so creation of QueryModel is moved from driver side
to executor side.
    2. use CarbonInputFormat.createRecordReader in CarbonScanRDD.compute instead of use 
    QueryExecutor directly

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jackylk/incubator-carbondata scanrdd

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-carbondata/pull/262.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #262
    
----
commit ef4a889db9b86653c273794c9a810a9cd9683437
Author: jackylk <jacky.likun@huawei.com>
Date:   2016-10-22T18:43:53Z

    use CarbonInputFormat in executor

commit a5c17f523c7127b538cc2d384cbff4fa454a007a
Author: jackylk <jacky.likun@huawei.com>
Date:   2016-10-27T04:01:36Z

    modify getPartition

----


> Use CarbonInputFormat in CarbonScanRDD compute
> ----------------------------------------------
>
>                 Key: CARBONDATA-308
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-308
>             Project: CarbonData
>          Issue Type: Sub-task
>          Components: spark-integration
>            Reporter: Jacky Li
>             Fix For: 0.2.0-incubating
>
>
> Take CarbonScanRDD as the target RDD, modify as following:
> 1. In driver side, only getSplit is required, so only filter condition is required, no
need to create full QueryModel object, so we can move creation of QueryModel from driver side
to executor side.
> 2. use CarbonInputFormat.createRecordReader in CarbonScanRDD.compute instead of use QueryExecutor
directly



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message