drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-6164) Heap memory leak during parquet scan and OOM
Date Fri, 16 Feb 2018 04:14:00 GMT

    [ https://issues.apache.org/jira/browse/DRILL-6164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16366593#comment-16366593

ASF GitHub Bot commented on DRILL-6164:

GitHub user vrozov opened a pull request:


    DRILL-6164: Heap memory leak during parquet scan and OOM

    @parthchandra Please take a look

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/vrozov/drill DRILL-6164

Alternatively you can review and apply these changes as the patch at:


To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #1122
commit ec6b6ae017fd22ab6cfe539430053c7caf5174c0
Author: Vlad Rozov <vrozov@...>
Date:   2018-02-16T03:25:21Z

    DRILL-6164: Heap memory leak during parquet scan and OOM


> Heap memory leak during parquet scan and OOM
> --------------------------------------------
>                 Key: DRILL-6164
>                 URL: https://issues.apache.org/jira/browse/DRILL-6164
>             Project: Apache Drill
>          Issue Type: Bug
>            Reporter: Vlad Rozov
>            Assignee: Vlad Rozov
>            Priority: Major
> During a scan of a large set of parquet files, Drill iterates over the set initializing
parquet readers. Such initialization may require a significant memory usage (both heap and
direct). When scan moves to the next parquet file in the set, it does not remove reference
to the reader from the set it iterates over and does not remove references created during

This message was sent by Atlassian JIRA

View raw message