drill-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Victoria Markman (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-2865) Drillbit runs out of memory on multiple consecutive CTAS
Date Thu, 23 Apr 2015 21:35:38 GMT
Victoria Markman created DRILL-2865:
---------------------------------------

             Summary: Drillbit runs out of memory on multiple consecutive CTAS
                 Key: DRILL-2865
                 URL: https://issues.apache.org/jira/browse/DRILL-2865
             Project: Apache Drill
          Issue Type: Bug
    Affects Versions: 0.9.0
            Reporter: Victoria Markman


Hardware configuration:
        - single node
        - 64GB RAM
Drill configuration
        DRILL_MAX_DIRECT_MEMORY="8G"
        DRILL_MAX_HEAP="4G"
        `planner.enable_multiphase_agg` = false;
        `store.parquet.block-size` = 134217728;
        `planner.enable_mux_exchange` = false;
        `exec.min_hash_table_size` = 67108864;
        `planner.enable_hashagg` = true; 
        `planner.width.max_per_node` = 23;


Aggregation query on TPCDS scale factor 1: 
        select 
                ss_sold_date_sk , 
                ss_sold_time_sk , 
                ss_item_sk , 
                ss_customer_sk , 
                ss_cdemo_sk, 
                count(*) from store_sales
        group by 
                ss_sold_date_sk , 
                ss_sold_time_sk , 
                ss_item_sk , 
                ss_customer_sk , 
                ss_cdemo_sk
;

1. Executing CTAS with this query and store.format = 'parquet' fails on iteration #9 with
this configuration consistently
2. Ran query by itself: 47 iterations successfully
3. Ran CTAS with this query and store.format = 'csv': - 30 iterations did not reproduce the
problem

Attached:
      - drillbit.log
      - scripts.tar (contains script that reproduces OOM)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message