carbondata-issues mailing list archives

From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CARBONDATA-253) Duplicate block loading when distribution is based on blocklet
Date Sat, 17 Sep 2016 21:25:20 GMT

    [ https://issues.apache.org/jira/browse/CARBONDATA-253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15499715#comment-15499715 ]

ASF GitHub Bot commented on CARBONDATA-253:
-------------------------------------------

GitHub user kumarvishal09 opened a pull request:

    https://github.com/apache/incubator-carbondata/pull/170

    [CARBONDATA-253] OOM issue if distribution is based on blocklet during query execution

    Problem: During query execution, when distribution is based on blocklet, the same blocks
are loaded multiple times because the hashCode and equals methods do not follow a consistent
contract. This can cause an OOM issue.
    Solution: The same class is used to identify unique blocks both during distribution and
during loading, so this change adds a wrapper class that implements hashCode and equals based
on file path, offset and length. This removes duplicate blocks, so only one copy of each
block's metadata is loaded in memory.
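
    For illustration only, a minimal sketch of such a wrapper. This is not the code from
the patch: the class names BlockKey and BlockDedupSketch, the field names, and the sample
file paths are all hypothetical. It only shows how equals and hashCode built from the same
three fields let a hash-based set collapse duplicate block entries.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Objects;
import java.util.Set;

// Hypothetical wrapper class; the name and fields are illustrative, not taken from the patch.
final class BlockKey {
    private final String filePath;
    private final long offset;
    private final long length;

    BlockKey(String filePath, long offset, long length) {
        this.filePath = Objects.requireNonNull(filePath, "filePath");
        this.offset = offset;
        this.length = length;
    }

    // Two keys are equal only when file path, offset and length all match.
    @Override
    public boolean equals(Object other) {
        if (this == other) {
            return true;
        }
        if (!(other instanceof BlockKey)) {
            return false;
        }
        BlockKey that = (BlockKey) other;
        return offset == that.offset
            && length == that.length
            && filePath.equals(that.filePath);
    }

    // Uses exactly the fields compared in equals, so equal keys share a hash code.
    @Override
    public int hashCode() {
        return Objects.hash(filePath, offset, length);
    }
}

// Hypothetical driver showing deduplication through a hash-based set.
final class BlockDedupSketch {
    // Keeps the first occurrence of each distinct (filePath, offset, length) triple.
    static Set<BlockKey> uniqueBlocks(List<BlockKey> candidates) {
        return new LinkedHashSet<>(candidates);
    }

    public static void main(String[] args) {
        List<BlockKey> candidates = Arrays.asList(
            new BlockKey("/store/part-0.carbondata", 0L, 1024L),
            new BlockKey("/store/part-0.carbondata", 0L, 1024L),   // duplicate of the first
            new BlockKey("/store/part-0.carbondata", 1024L, 1024L));
        System.out.println(uniqueBlocks(candidates).size());
    }
}
```

    If hashCode used fields other than those compared in equals, two entries for the same
block could land in different hash buckets and both would be kept, which is the kind of
duplicate loading described above.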

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/kumarvishal09/incubator-carbondata equalsAndHashCodeIssue

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-carbondata/pull/170.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #170
    
----
commit 49a76db0ff8e63fc95a309984d215086d888fc7b
Author: kumarvishal <kumarvishal.1802@gmail.com>
Date:   2016-09-17T13:25:36Z

    equalsAndHashCodeIssue

----


> Duplicate block loading when distribution is based on blocklet
> --------------------------------------------------------------
>
>                 Key: CARBONDATA-253
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-253
>             Project: CarbonData
>          Issue Type: Bug
>            Reporter: kumar vishal
>            Assignee: kumar vishal
>
> During query execution, when distribution is based on blocklet, the same blocks are
loaded multiple times because the hashCode and equals methods do not follow a consistent contract.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
