drill-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Khurram Faraaz <kfar...@mapr.com>
Subject TPC-DS query 72 takes for ever, it appears to be hung!
Date Mon, 09 Jan 2017 19:46:01 GMT
Hi All,


TPC-DS query 72 appears to be in running state for ever (it appears to be hung). I am on Drill
1.10.0 on a 4 node CentOS cluster, can someone please take a look. This is seen over SF1 data.


Query plan for TPC-DS query 72

{noformat}
00-00    Screen : rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200) w_warehouse_name,
INTEGER d_week_seq, BIGINT no_promo, BIGINT promo, BIGINT total_cnt): rowcount = 100.0, cumulative
cost = {1.2742944455E8 rows, 1.4918508997879562E9 cpu, 0.0 io, 1.98207277056E11 network, 1.5168530696E8
memory}, id = 5366578
00-01      Project(i_item_desc=[$0], w_warehouse_name=[$1], d_week_seq=[$2], no_promo=[$3],
promo=[$4], total_cnt=[$5]) : rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200)
w_warehouse_name, INTEGER d_week_seq, BIGINT no_promo, BIGINT promo, BIGINT total_cnt): rowcount
= 100.0, cumulative cost = {1.2742943455E8 rows, 1.4918508897879562E9 cpu, 0.0 io, 1.98207277056E11
network, 1.5168530696E8 memory}, id = 5366577
00-02        SelectionVectorRemover : rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200)
w_warehouse_name, INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT total_cnt): rowcount
= 100.0, cumulative cost = {1.2742943455E8 rows, 1.4918508897879562E9 cpu, 0.0 io, 1.98207277056E11
network, 1.5168530696E8 memory}, id = 5366576
00-03          Limit(fetch=[100]) : rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200)
w_warehouse_name, INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT total_cnt): rowcount
= 100.0, cumulative cost = {1.2742933455E8 rows, 1.4918507897879562E9 cpu, 0.0 io, 1.98207277056E11
network, 1.5168530696E8 memory}, id = 5366575
00-04            SelectionVectorRemover : rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200)
w_warehouse_name, INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT total_cnt): rowcount
= 29362.5, cumulative cost = {1.2742923455E8 rows, 1.4918503897879562E9 cpu, 0.0 io, 1.98207277056E11
network, 1.5168530696E8 memory}, id = 5366574
00-05              TopN(limit=[100]) : rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200)
w_warehouse_name, INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT total_cnt): rowcount
= 29362.5, cumulative cost = {1.2739987205E8 rows, 1.4918210272879562E9 cpu, 0.0 io, 1.98207277056E11
network, 1.5168530696E8 memory}, id = 5366573
00-06                Project(i_item_desc=[$0], w_warehouse_name=[$1], d_week_seq=[$2], $f3=[$3],
$f4=[$4], total_cnt=[$5]) : rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200) w_warehouse_name,
INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT total_cnt): rowcount = 29362.5, cumulative
cost = {1.2737050955E8 rows, 1.48869974365E9 cpu, 0.0 io, 1.98207277056E11 network, 1.5168530696E8
memory}, id = 5366572
00-07                  HashToRandomExchange(dist0=[[$5]], dist1=[[$0]], dist2=[[$1]], dist3=[[$2]])
: rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200) w_warehouse_name, INTEGER d_week_seq,
BIGINT $f3, BIGINT $f4, BIGINT total_cnt, ANY E_X_P_R_H_A_S_H_F_I_E_L_D): rowcount = 29362.5,
cumulative cost = {1.2737050955E8 rows, 1.48869974365E9 cpu, 0.0 io, 1.98207277056E11 network,
1.5168530696E8 memory}, id = 5366571
01-01                    UnorderedMuxExchange : rowType = RecordType(VARCHAR(200) i_item_desc,
VARCHAR(200) w_warehouse_name, INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT total_cnt,
ANY E_X_P_R_H_A_S_H_F_I_E_L_D): rowcount = 29362.5, cumulative cost = {1.2734114705E8 rows,
1.48840611865E9 cpu, 0.0 io, 1.97365395456E11 network, 1.5168530696E8 memory}, id = 5366570
02-01                      Project(i_item_desc=[$0], w_warehouse_name=[$1], d_week_seq=[$2],
$f3=[$3], $f4=[$4], total_cnt=[$5], E_X_P_R_H_A_S_H_F_I_E_L_D=[hash32AsDouble($2, hash32AsDouble($1,
hash32AsDouble($0, hash32AsDouble($5))))]) : rowType = RecordType(VARCHAR(200) i_item_desc,
VARCHAR(200) w_warehouse_name, INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT total_cnt,
ANY E_X_P_R_H_A_S_H_F_I_E_L_D): rowcount = 29362.5, cumulative cost = {1.2731178455E8 rows,
1.48837675615E9 cpu, 0.0 io, 1.97365395456E11 network, 1.5168530696E8 memory}, id = 5366569
02-02                        HashAgg(group=[{0, 1, 2}], agg#0=[$SUM0($3)], agg#1=[$SUM0($4)],
total_cnt=[$SUM0($5)]) : rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200) w_warehouse_name,
INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT total_cnt): rowcount = 29362.5, cumulative
cost = {1.2728242205E8 rows, 1.48825930615E9 cpu, 0.0 io, 1.97365395456E11 network, 1.5168530696E8
memory}, id = 5366568
02-03                          Project(i_item_desc=[$0], w_warehouse_name=[$1], d_week_seq=[$2],
$f3=[$3], $f4=[$4], total_cnt=[$5]) : rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200)
w_warehouse_name, INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT total_cnt): rowcount
= 293625.0, cumulative cost = {1.2698879705E8 rows, 1.47064180615E9 cpu, 0.0 io, 1.97365395456E11
network, 1.4134970696E8 memory}, id = 5366567
02-04                            HashToRandomExchange(dist0=[[$0]], dist1=[[$1]], dist2=[[$2]])
: rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200) w_warehouse_name, INTEGER d_week_seq,
BIGINT $f3, BIGINT $f4, BIGINT total_cnt, ANY E_X_P_R_H_A_S_H_F_I_E_L_D): rowcount = 293625.0,
cumulative cost = {1.2698879705E8 rows, 1.47064180615E9 cpu, 0.0 io, 1.97365395456E11 network,
1.4134970696E8 memory}, id = 5366566
03-01                              UnorderedMuxExchange : rowType = RecordType(VARCHAR(200)
i_item_desc, VARCHAR(200) w_warehouse_name, INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT
total_cnt, ANY E_X_P_R_H_A_S_H_F_I_E_L_D): rowcount = 293625.0, cumulative cost = {1.2669517205E8
rows, 1.46750980615E9 cpu, 0.0 io, 1.88946579456E11 network, 1.4134970696E8 memory}, id =
5366565
04-01                                Project(i_item_desc=[$0], w_warehouse_name=[$1], d_week_seq=[$2],
$f3=[$3], $f4=[$4], total_cnt=[$5], E_X_P_R_H_A_S_H_F_I_E_L_D=[hash32AsDouble($2, hash32AsDouble($1,
hash32AsDouble($0)))]) : rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200) w_warehouse_name,
INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT total_cnt, ANY E_X_P_R_H_A_S_H_F_I_E_L_D):
rowcount = 293625.0, cumulative cost = {1.2640154705E8 rows, 1.46721618115E9 cpu, 0.0 io,
1.88946579456E11 network, 1.4134970696E8 memory}, id = 5366564
04-02                                  HashAgg(group=[{0, 1, 2}], agg#0=[$SUM0($3)], agg#1=[$SUM0($4)],
total_cnt=[COUNT()]) : rowType = RecordType(VARCHAR(200) i_item_desc, VARCHAR(200) w_warehouse_name,
INTEGER d_week_seq, BIGINT $f3, BIGINT $f4, BIGINT total_cnt): rowcount = 293625.0, cumulative
cost = {1.2610792205E8 rows, 1.46604168115E9 cpu, 0.0 io, 1.88946579456E11 network, 1.4134970696E8
memory}, id = 5366563
04-03                                    Project(i_item_desc=[$5], w_warehouse_name=[$4],
d_week_seq=[$9], $f3=[CASE(IS NULL($13), 1, 0)], $f4=[CASE(IS NOT NULL($13), 1, 0)]) : rowType
= RecordType(VARCHAR(200) i_item_desc, VARCHAR(200) w_warehouse_name, INTEGER d_week_seq,
INTEGER $f3, INTEGER $f4): rowcount = 2936250.0, cumulative cost = {1.2317167205E8 rows, 1.28986668115E9
cpu, 0.0 io, 1.88946579456E11 network, 3.799370696E7 memory}, id = 5366562
04-04                                      HashJoin(condition=[AND(=($0, $14), =($1, $15))],
joinType=[left]) : rowType = RecordType(INTEGER cs_item_sk, INTEGER cs_order_number, INTEGER
cs_quantity, INTEGER inv_quantity_on_hand, VARCHAR(200) w_warehouse_name, VARCHAR(200) i_item_desc,
VARCHAR(200) cd_marital_status, VARCHAR(200) hd_buy_potential, DATE d_date, INTEGER d_week_seq,
INTEGER d_year, INTEGER d_week_seq0, DATE d_date1, INTEGER p_promo_sk, INTEGER cr_item_sk,
INTEGER cr_order_number): rowcount = 2936250.0, cumulative cost = {1.2023542205E8 rows, 1.26637668115E9
cpu, 0.0 io, 1.88946579456E11 network, 3.799370696E7 memory}, id = 5366561
04-06                                        Project(cs_item_sk=[$0], cs_order_number=[$2],
cs_quantity=[$3], inv_quantity_on_hand=[$4], w_warehouse_name=[$5], i_item_desc=[$6], cd_marital_status=[$7],
hd_buy_potential=[$8], d_date=[$9], d_week_seq=[$10], d_year=[$11], d_week_seq0=[$12], d_date1=[$13],
p_promo_sk=[$14]) : rowType = RecordType(INTEGER cs_item_sk, INTEGER cs_order_number, INTEGER
cs_quantity, INTEGER inv_quantity_on_hand, VARCHAR(200) w_warehouse_name, VARCHAR(200) i_item_desc,
VARCHAR(200) cd_marital_status, VARCHAR(200) hd_buy_potential, DATE d_date, INTEGER d_week_seq,
INTEGER d_year, INTEGER d_week_seq0, DATE d_date1, INTEGER p_promo_sk): rowcount = 2936250.0,
cumulative cost = {1.1686697105E8 rows, 1.19216093915E9 cpu, 0.0 io, 1.88946579456E11 network,
3.4190338160000004E7 memory}, id = 5366558
04-08                                          HashJoin(condition=[=($1, $14)], joinType=[left])
: rowType = RecordType(INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number, INTEGER
cs_quantity, INTEGER inv_quantity_on_hand, VARCHAR(200) w_warehouse_name, VARCHAR(200) i_item_desc,
VARCHAR(200) cd_marital_status, VARCHAR(200) hd_buy_potential, DATE d_date, INTEGER d_week_seq,
INTEGER d_year, INTEGER d_week_seq0, DATE d_date1, INTEGER p_promo_sk): rowcount = 2936250.0,
cumulative cost = {1.1686697105E8 rows, 1.19216093915E9 cpu, 0.0 io, 1.88946579456E11 network,
3.4190338160000004E7 memory}, id = 5366557
04-10                                            SelectionVectorRemover : rowType = RecordType(INTEGER
cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number, INTEGER cs_quantity, INTEGER inv_quantity_on_hand,
VARCHAR(200) w_warehouse_name, VARCHAR(200) i_item_desc, VARCHAR(200) cd_marital_status, VARCHAR(200)
hd_buy_potential, DATE d_date, INTEGER d_week_seq, INTEGER d_year, INTEGER d_week_seq0, DATE
d_date1): rowcount = 2936250.0, cumulative cost = {1.1392982105E8 rows, 1.15692203915E9 cpu,
0.0 io, 1.88946579456E11 network, 3.4185058160000004E7 memory}, id = 5366554
04-12                                              Filter(condition=[>($13, DATETIME_PLUS($9,
432000000))]) : rowType = RecordType(INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number,
INTEGER cs_quantity, INTEGER inv_quantity_on_hand, VARCHAR(200) w_warehouse_name, VARCHAR(200)
i_item_desc, VARCHAR(200) cd_marital_status, VARCHAR(200) hd_buy_potential, DATE d_date, INTEGER
d_week_seq, INTEGER d_year, INTEGER d_week_seq0, DATE d_date1): rowcount = 2936250.0, cumulative
cost = {1.1099357105E8 rows, 1.15398578915E9 cpu, 0.0 io, 1.88946579456E11 network, 3.4185058160000004E7
memory}, id = 5366553
04-13                                                Project(cs_item_sk=[$1], cs_promo_sk=[$2],
cs_order_number=[$3], cs_quantity=[$4], inv_quantity_on_hand=[$5], w_warehouse_name=[$6],
i_item_desc=[$7], cd_marital_status=[$8], hd_buy_potential=[$9], d_date=[$10], d_week_seq=[$11],
d_year=[$12], d_week_seq0=[$13], d_date1=[$15]) : rowType = RecordType(INTEGER cs_item_sk,
INTEGER cs_promo_sk, INTEGER cs_order_number, INTEGER cs_quantity, INTEGER inv_quantity_on_hand,
VARCHAR(200) w_warehouse_name, VARCHAR(200) i_item_desc, VARCHAR(200) cd_marital_status, VARCHAR(200)
hd_buy_potential, DATE d_date, INTEGER d_week_seq, INTEGER d_year, INTEGER d_week_seq0, DATE
d_date1): rowcount = 5872500.0, cumulative cost = {1.0512107105E8 rows, 1.11875078915E9 cpu,
0.0 io, 1.88946579456E11 network, 3.4185058160000004E7 memory}, id = 5366552
04-14                                                  HashJoin(condition=[=($0, $14)], joinType=[inner])
: rowType = RecordType(INTEGER cs_ship_date_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER
cs_order_number, INTEGER cs_quantity, INTEGER inv_quantity_on_hand, VARCHAR(200) w_warehouse_name,
VARCHAR(200) i_item_desc, VARCHAR(200) cd_marital_status, VARCHAR(200) hd_buy_potential, DATE
d_date, INTEGER d_week_seq, INTEGER d_year, INTEGER d_week_seq0, INTEGER d_date_sk, DATE d_date0):
rowcount = 5872500.0, cumulative cost = {1.0512107105E8 rows, 1.11875078915E9 cpu, 0.0 io,
1.88946579456E11 network, 3.4185058160000004E7 memory}, id = 5366551
04-16                                                    Project(cs_ship_date_sk=[$0], cs_item_sk=[$1],
cs_promo_sk=[$2], cs_order_number=[$3], cs_quantity=[$4], inv_quantity_on_hand=[$6], w_warehouse_name=[$7],
i_item_desc=[$8], cd_marital_status=[$9], hd_buy_potential=[$10], d_date=[$11], d_week_seq=[$12],
d_year=[$13], d_week_seq0=[$15]) : rowType = RecordType(INTEGER cs_ship_date_sk, INTEGER cs_item_sk,
INTEGER cs_promo_sk, INTEGER cs_order_number, INTEGER cs_quantity, INTEGER inv_quantity_on_hand,
VARCHAR(200) w_warehouse_name, VARCHAR(200) i_item_desc, VARCHAR(200) cd_marital_status, VARCHAR(200)
hd_buy_potential, DATE d_date, INTEGER d_week_seq, INTEGER d_year, INTEGER d_week_seq0): rowcount
= 5872500.0, cumulative cost = {9.902942405E7 rows, 1.04696590715E9 cpu, 0.0 io, 1.88946579456E11
network, 3.289939576E7 memory}, id = 5366547
04-18                                                      HashJoin(condition=[AND(=($5, $14),
=($12, $15))], joinType=[inner]) : rowType = RecordType(INTEGER cs_ship_date_sk, INTEGER cs_item_sk,
INTEGER cs_promo_sk, INTEGER cs_order_number, INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER
inv_quantity_on_hand, VARCHAR(200) w_warehouse_name, VARCHAR(200) i_item_desc, VARCHAR(200)
cd_marital_status, VARCHAR(200) hd_buy_potential, DATE d_date, INTEGER d_week_seq, INTEGER
d_year, INTEGER d_date_sk, INTEGER d_week_seq0): rowcount = 5872500.0, cumulative cost = {9.902942405E7
rows, 1.04696590715E9 cpu, 0.0 io, 1.88946579456E11 network, 3.289939576E7 memory}, id = 5366546
04-21                                                        Project(cs_ship_date_sk=[$1],
cs_item_sk=[$2], cs_promo_sk=[$3], cs_order_number=[$4], cs_quantity=[$5], inv_date_sk=[$6],
inv_quantity_on_hand=[$7], w_warehouse_name=[$8], i_item_desc=[$9], cd_marital_status=[$10],
hd_buy_potential=[$11], d_date=[$13], d_week_seq=[$14], d_year=[$15]) : rowType = RecordType(INTEGER
cs_ship_date_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number, INTEGER
cs_quantity, INTEGER inv_date_sk, INTEGER inv_quantity_on_hand, VARCHAR(200) w_warehouse_name,
VARCHAR(200) i_item_desc, VARCHAR(200) cd_marital_status, VARCHAR(200) hd_buy_potential, DATE
d_date, INTEGER d_week_seq, INTEGER d_year): rowcount = 5872500.0, cumulative cost = {9.293777705E7
rows, 9.0412663315E8 cpu, 0.0 io, 1.88946579456E11 network, 3.097090216E7 memory}, id = 5366542
04-23                                                          HashJoin(condition=[=($0, $12)],
joinType=[inner]) : rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk,
INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number, INTEGER cs_quantity, INTEGER
inv_date_sk, INTEGER inv_quantity_on_hand, VARCHAR(200) w_warehouse_name, VARCHAR(200) i_item_desc,
VARCHAR(200) cd_marital_status, VARCHAR(200) hd_buy_potential, INTEGER d_date_sk, DATE d_date,
INTEGER d_week_seq, INTEGER d_year): rowcount = 5872500.0, cumulative cost = {9.293777705E7
rows, 9.0412663315E8 cpu, 0.0 io, 1.88946579456E11 network, 3.097090216E7 memory}, id = 5366541
04-26                                                            Project(cs_sold_date_sk=[$0],
cs_ship_date_sk=[$1], cs_item_sk=[$3], cs_promo_sk=[$4], cs_order_number=[$5], cs_quantity=[$6],
inv_date_sk=[$7], inv_quantity_on_hand=[$8], w_warehouse_name=[$9], i_item_desc=[$10], cd_marital_status=[$11],
hd_buy_potential=[$13]) : rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk,
INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number, INTEGER cs_quantity, INTEGER
inv_date_sk, INTEGER inv_quantity_on_hand, VARCHAR(200) w_warehouse_name, VARCHAR(200) i_item_desc,
VARCHAR(200) cd_marital_status, VARCHAR(200) hd_buy_potential): rowcount = 5872500.0, cumulative
cost = {8.6886307E7 rows, 8.32754478E8 cpu, 0.0 io, 1.88946579456E11 network, 3.07780528E7
memory}, id = 5366536
04-28                                                              HashJoin(condition=[=($2,
$12)], joinType=[inner]) : rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk,
INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number,
INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER inv_quantity_on_hand, VARCHAR(200) w_warehouse_name,
VARCHAR(200) i_item_desc, VARCHAR(200) cd_marital_status, INTEGER hd_demo_sk, VARCHAR(200)
hd_buy_potential): rowcount = 5872500.0, cumulative cost = {8.6886307E7 rows, 8.32754478E8
cpu, 0.0 io, 1.88946579456E11 network, 3.07780528E7 memory}, id = 5366535
04-31                                                                Project(cs_sold_date_sk=[$0],
cs_ship_date_sk=[$1], cs_bill_hdemo_sk=[$3], cs_item_sk=[$4], cs_promo_sk=[$5], cs_order_number=[$6],
cs_quantity=[$7], inv_date_sk=[$8], inv_quantity_on_hand=[$9], w_warehouse_name=[$10], i_item_desc=[$11],
cd_marital_status=[$13]) : rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk,
INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number,
INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER inv_quantity_on_hand, VARCHAR(200) w_warehouse_name,
VARCHAR(200) i_item_desc, VARCHAR(200) cd_marital_status): rowcount = 5872500.0, cumulative
cost = {8.0996167E7 rows, 7.62218598E8 cpu, 0.0 io, 1.88946579456E11 network, 3.07590448E7
memory}, id = 5366530
04-34                                                                  HashJoin(condition=[=($2,
$12)], joinType=[inner]) : rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk,
INTEGER cs_bill_cdemo_sk, INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk,
INTEGER cs_order_number, INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER inv_quantity_on_hand,
VARCHAR(200) w_warehouse_name, VARCHAR(200) i_item_desc, INTEGER cd_demo_sk, VARCHAR(200)
cd_marital_status): rowcount = 5872500.0, cumulative cost = {8.0996167E7 rows, 7.62218598E8
cpu, 0.0 io, 1.88946579456E11 network, 3.07590448E7 memory}, id = 5366529
04-37                                                                    Project(cs_sold_date_sk=[$0],
cs_ship_date_sk=[$1], cs_bill_cdemo_sk=[$2], cs_bill_hdemo_sk=[$3], cs_item_sk=[$4], cs_promo_sk=[$5],
cs_order_number=[$6], cs_quantity=[$7], inv_date_sk=[$8], inv_quantity_on_hand=[$9], w_warehouse_name=[$10],
i_item_desc=[$12]) : rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk,
INTEGER cs_bill_cdemo_sk, INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk,
INTEGER cs_order_number, INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER inv_quantity_on_hand,
VARCHAR(200) w_warehouse_name, VARCHAR(200) i_item_desc): rowcount = 5872500.0, cumulative
cost = {7.0417707E7 rows, 6.74173278E8 cpu, 0.0 io, 1.88946579456E11 network, 2.56881328E7
memory}, id = 5366524
04-40                                                                      HashJoin(condition=[=($4,
$11)], joinType=[inner]) : rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk,
INTEGER cs_bill_cdemo_sk, INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk,
INTEGER cs_order_number, INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER inv_quantity_on_hand,
VARCHAR(200) w_warehouse_name, INTEGER i_item_sk, VARCHAR(200) i_item_desc): rowcount = 5872500.0,
cumulative cost = {7.0417707E7 rows, 6.74173278E8 cpu, 0.0 io, 1.88946579456E11 network, 2.56881328E7
memory}, id = 5366523
04-43                                                                        Project(cs_sold_date_sk=[$0],
cs_ship_date_sk=[$1], cs_bill_cdemo_sk=[$2], cs_bill_hdemo_sk=[$3], cs_item_sk=[$4], cs_promo_sk=[$5],
cs_order_number=[$6], cs_quantity=[$7], inv_date_sk=[$8], inv_quantity_on_hand=[$10], w_warehouse_name=[$12])
: rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk, INTEGER cs_bill_cdemo_sk,
INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number,
INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER inv_quantity_on_hand, VARCHAR(200) w_warehouse_name):
rowcount = 5872500.0, cumulative cost = {6.4491207E7 rows, 6.03379278E8 cpu, 0.0 io, 1.88946579456E11
network, 2.53713328E7 memory}, id = 5366520
04-46                                                                          HashJoin(condition=[=($9,
$11)], joinType=[inner]) : rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk,
INTEGER cs_bill_cdemo_sk, INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk,
INTEGER cs_order_number, INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER inv_warehouse_sk,
INTEGER inv_quantity_on_hand, INTEGER w_warehouse_sk, VARCHAR(200) w_warehouse_name): rowcount
= 5872500.0, cumulative cost = {6.4491207E7 rows, 6.03379278E8 cpu, 0.0 io, 1.88946579456E11
network, 2.53713328E7 memory}, id = 5366519
04-48                                                                            SelectionVectorRemover
: rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk, INTEGER cs_bill_cdemo_sk,
INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number,
INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER inv_warehouse_sk, INTEGER inv_quantity_on_hand):
rowcount = 5872500.0, cumulative cost = {5.8618692E7 rows, 5.32909188E8 cpu, 0.0 io, 1.88946579456E11
network, 2.53712448E7 memory}, id = 5366516
04-50                                                                              Filter(condition=[<($10,
$7)]) : rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk, INTEGER cs_bill_cdemo_sk,
INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number,
INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER inv_warehouse_sk, INTEGER inv_quantity_on_hand):
rowcount = 5872500.0, cumulative cost = {5.2746192E7 rows, 5.27036688E8 cpu, 0.0 io, 1.88946579456E11
network, 2.53712448E7 memory}, id = 5366515
04-51                                                                                Project(cs_sold_date_sk=[$0],
cs_ship_date_sk=[$1], cs_bill_cdemo_sk=[$2], cs_bill_hdemo_sk=[$3], cs_item_sk=[$4], cs_promo_sk=[$5],
cs_order_number=[$6], cs_quantity=[$7], inv_date_sk=[$8], inv_warehouse_sk=[$10], inv_quantity_on_hand=[$11])
: rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk, INTEGER cs_bill_cdemo_sk,
INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number,
INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER inv_warehouse_sk, INTEGER inv_quantity_on_hand):
rowcount = 1.1745E7, cumulative cost = {4.1001192E7 rows, 4.56566688E8 cpu, 0.0 io, 1.88946579456E11
network, 2.53712448E7 memory}, id = 5366514
04-52                                                                                  Project(cs_sold_date_sk=[$4],
cs_ship_date_sk=[$5], cs_bill_cdemo_sk=[$6], cs_bill_hdemo_sk=[$7], cs_item_sk=[$8], cs_promo_sk=[$9],
cs_order_number=[$10], cs_quantity=[$11], inv_date_sk=[$0], inv_item_sk=[$1], inv_warehouse_sk=[$2],
inv_quantity_on_hand=[$3]) : rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk,
INTEGER cs_bill_cdemo_sk, INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk,
INTEGER cs_order_number, INTEGER cs_quantity, INTEGER inv_date_sk, INTEGER inv_item_sk, INTEGER
inv_warehouse_sk, INTEGER inv_quantity_on_hand): rowcount = 1.1745E7, cumulative cost = {4.1001192E7
rows, 4.56566688E8 cpu, 0.0 io, 1.88946579456E11 network, 2.53712448E7 memory}, id = 5366513
04-53                                                                                    HashJoin(condition=[=($8,
$1)], joinType=[inner]) : rowType = RecordType(INTEGER inv_date_sk, INTEGER inv_item_sk, INTEGER
inv_warehouse_sk, INTEGER inv_quantity_on_hand, INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk,
INTEGER cs_bill_cdemo_sk, INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk,
INTEGER cs_order_number, INTEGER cs_quantity): rowcount = 1.1745E7, cumulative cost = {4.1001192E7
rows, 4.56566688E8 cpu, 0.0 io, 1.88946579456E11 network, 2.53712448E7 memory}, id = 5366512
04-55                                                                                    
 Project(inv_date_sk=[CAST($0):INTEGER], inv_item_sk=[CAST($1):INTEGER], inv_warehouse_sk=[CAST($2):INTEGER],
inv_quantity_on_hand=[CAST($3):INTEGER]) : rowType = RecordType(INTEGER inv_date_sk, INTEGER
inv_item_sk, INTEGER inv_warehouse_sk, INTEGER inv_quantity_on_hand): rowcount = 1.1745E7,
cumulative cost = {2.349E7 rows, 2.349E8 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366508
04-56                                                                                    
   Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/tpcds_sf1/parquet/inventory]],
selectionRoot=maprfs:/drill/testdata/tpcds_sf1/parquet/inventory, numFiles=1, usedMetadataFile=false,
columns=[`inv_date_sk`, `inv_item_sk`, `inv_warehouse_sk`, `inv_quantity_on_hand`]]]) : rowType
= RecordType(ANY inv_date_sk, ANY inv_item_sk, ANY inv_warehouse_sk, ANY inv_quantity_on_hand):
rowcount = 1.1745E7, cumulative cost = {1.1745E7 rows, 4.698E7 cpu, 0.0 io, 0.0 network, 0.0
memory}, id = 5366507
04-54                                                                                    
 BroadcastExchange : rowType = RecordType(INTEGER cs_sold_date_sk, INTEGER cs_ship_date_sk,
INTEGER cs_bill_cdemo_sk, INTEGER cs_bill_hdemo_sk, INTEGER cs_item_sk, INTEGER cs_promo_sk,
INTEGER cs_order_number, INTEGER cs_quantity): rowcount = 1441548.0, cumulative cost = {4324644.0
rows, 6.9194304E7 cpu, 0.0 io, 1.88946579456E11 network, 0.0 memory}, id = 5366511
05-01                                                                                    
   Project(cs_sold_date_sk=[CAST($0):INTEGER], cs_ship_date_sk=[CAST($1):INTEGER], cs_bill_cdemo_sk=[CAST($2):INTEGER],
cs_bill_hdemo_sk=[CAST($3):INTEGER], cs_item_sk=[CAST($4):INTEGER], cs_promo_sk=[CAST($5):INTEGER],
cs_order_number=[CAST($6):INTEGER], cs_quantity=[CAST($7):INTEGER]) : rowType = RecordType(INTEGER
cs_sold_date_sk, INTEGER cs_ship_date_sk, INTEGER cs_bill_cdemo_sk, INTEGER cs_bill_hdemo_sk,
INTEGER cs_item_sk, INTEGER cs_promo_sk, INTEGER cs_order_number, INTEGER cs_quantity): rowcount
= 1441548.0, cumulative cost = {2883096.0 rows, 5.766192E7 cpu, 0.0 io, 0.0 network, 0.0 memory},
id = 5366510
05-02                                                                                    
     Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/tpcds_sf1/parquet/catalog_sales]],
selectionRoot=maprfs:/drill/testdata/tpcds_sf1/parquet/catalog_sales, numFiles=1, usedMetadataFile=false,
columns=[`cs_sold_date_sk`, `cs_ship_date_sk`, `cs_bill_cdemo_sk`, `cs_bill_hdemo_sk`, `cs_item_sk`,
`cs_promo_sk`, `cs_order_number`, `cs_quantity`]]]) : rowType = RecordType(ANY cs_sold_date_sk,
ANY cs_ship_date_sk, ANY cs_bill_cdemo_sk, ANY cs_bill_hdemo_sk, ANY cs_item_sk, ANY cs_promo_sk,
ANY cs_order_number, ANY cs_quantity): rowcount = 1441548.0, cumulative cost = {1441548.0
rows, 1.1532384E7 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366509
04-47                                                                            Project(w_warehouse_sk=[CAST($0):INTEGER],
w_warehouse_name=[CAST($1):VARCHAR(200) CHARACTER SET "ISO-8859-1" COLLATE "ISO-8859-1$en_US$primary"])
: rowType = RecordType(INTEGER w_warehouse_sk, VARCHAR(200) w_warehouse_name): rowcount =
5.0, cumulative cost = {10.0 rows, 50.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366518
04-49                                                                              Scan(groupscan=[ParquetGroupScan
[entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/tpcds_sf1/parquet/warehouse]],
selectionRoot=maprfs:/drill/testdata/tpcds_sf1/parquet/warehouse, numFiles=1, usedMetadataFile=false,
columns=[`w_warehouse_sk`, `w_warehouse_name`]]]) : rowType = RecordType(ANY w_warehouse_sk,
ANY w_warehouse_name): rowcount = 5.0, cumulative cost = {5.0 rows, 10.0 cpu, 0.0 io, 0.0
network, 0.0 memory}, id = 5366517
04-42                                                                        Project(i_item_sk=[CAST($0):INTEGER],
i_item_desc=[CAST($1):VARCHAR(200) CHARACTER SET "ISO-8859-1" COLLATE "ISO-8859-1$en_US$primary"])
: rowType = RecordType(INTEGER i_item_sk, VARCHAR(200) i_item_desc): rowcount = 18000.0, cumulative
cost = {36000.0 rows, 180000.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366522
04-45                                                                          Scan(groupscan=[ParquetGroupScan
[entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/tpcds_sf1/parquet/item]], selectionRoot=maprfs:/drill/testdata/tpcds_sf1/parquet/item,
numFiles=1, usedMetadataFile=false, columns=[`i_item_sk`, `i_item_desc`]]]) : rowType = RecordType(ANY
i_item_sk, ANY i_item_desc): rowcount = 18000.0, cumulative cost = {18000.0 rows, 36000.0
cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366521
04-36                                                                    Project(cd_demo_sk=[CAST($0):INTEGER],
cd_marital_status=[CAST($1):VARCHAR(200) CHARACTER SET "ISO-8859-1" COLLATE "ISO-8859-1$en_US$primary"])
: rowType = RecordType(INTEGER cd_demo_sk, VARCHAR(200) cd_marital_status): rowcount = 288120.0,
cumulative cost = {4417840.0 rows, 1.527036E7 cpu, 0.0 io, 0.0 network, 0.0 memory}, id =
5366528
04-39                                                                      SelectionVectorRemover
: rowType = RecordType(ANY cd_demo_sk, ANY cd_marital_status): rowcount = 288120.0, cumulative
cost = {4129720.0 rows, 1.29654E7 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366527
04-41                                                                        Filter(condition=[=(CAST($1):VARCHAR(200)
CHARACTER SET "ISO-8859-1" COLLATE "ISO-8859-1$en_US$primary", 'M')]) : rowType = RecordType(ANY
cd_demo_sk, ANY cd_marital_status): rowcount = 288120.0, cumulative cost = {3841600.0 rows,
1.267728E7 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366526
04-44                                                                          Scan(groupscan=[ParquetGroupScan
[entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/tpcds_sf1/parquet/customer_demographics]],
selectionRoot=maprfs:/drill/testdata/tpcds_sf1/parquet/customer_demographics, numFiles=1,
usedMetadataFile=false, columns=[`cd_demo_sk`, `cd_marital_status`]]]) : rowType = RecordType(ANY
cd_demo_sk, ANY cd_marital_status): rowcount = 1920800.0, cumulative cost = {1920800.0 rows,
3841600.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366525
04-30                                                                Project(hd_demo_sk=[CAST($0):INTEGER],
hd_buy_potential=[CAST($1):VARCHAR(200) CHARACTER SET "ISO-8859-1" COLLATE "ISO-8859-1$en_US$primary"])
: rowType = RecordType(INTEGER hd_demo_sk, VARCHAR(200) hd_buy_potential): rowcount = 1080.0,
cumulative cost = {16560.0 rows, 57240.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366534
04-33                                                                  SelectionVectorRemover
: rowType = RecordType(ANY hd_demo_sk, ANY hd_buy_potential): rowcount = 1080.0, cumulative
cost = {15480.0 rows, 48600.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366533
04-35                                                                    Filter(condition=[=(CAST($1):VARCHAR(200)
CHARACTER SET "ISO-8859-1" COLLATE "ISO-8859-1$en_US$primary", '501-1000')]) : rowType = RecordType(ANY
hd_demo_sk, ANY hd_buy_potential): rowcount = 1080.0, cumulative cost = {14400.0 rows, 47520.0
cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366532
04-38                                                                      Scan(groupscan=[ParquetGroupScan
[entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/tpcds_sf1/parquet/household_demographics]],
selectionRoot=maprfs:/drill/testdata/tpcds_sf1/parquet/household_demographics, numFiles=1,
usedMetadataFile=false, columns=[`hd_demo_sk`, `hd_buy_potential`]]]) : rowType = RecordType(ANY
hd_demo_sk, ANY hd_buy_potential): rowcount = 7200.0, cumulative cost = {7200.0 rows, 14400.0
cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366531
04-25                                                            Project(d_date_sk=[CAST($0):INTEGER],
d_date=[CAST($1):DATE], d_week_seq=[CAST($2):INTEGER], d_year=[CAST($3):INTEGER]) : rowType
= RecordType(INTEGER d_date_sk, DATE d_date, INTEGER d_week_seq, INTEGER d_year): rowcount
= 10957.35, cumulative cost = {168012.7 rows, 814496.35 cpu, 0.0 io, 0.0 network, 0.0 memory},
id = 5366540
04-27                                                              SelectionVectorRemover
: rowType = RecordType(ANY d_date_sk, ANY d_date, ANY d_week_seq, ANY d_year): rowcount =
10957.35, cumulative cost = {157055.35 rows, 639178.75 cpu, 0.0 io, 0.0 network, 0.0 memory},
id = 5366539
04-29                                                                Filter(condition=[=(CAST($3):INTEGER,
2002)]) : rowType = RecordType(ANY d_date_sk, ANY d_date, ANY d_week_seq, ANY d_year): rowcount
= 10957.35, cumulative cost = {146098.0 rows, 628221.4 cpu, 0.0 io, 0.0 network, 0.0 memory},
id = 5366538
04-32                                                                  Scan(groupscan=[ParquetGroupScan
[entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/tpcds_sf1/parquet/date_dim]], selectionRoot=maprfs:/drill/testdata/tpcds_sf1/parquet/date_dim,
numFiles=1, usedMetadataFile=false, columns=[`d_date_sk`, `d_date`, `d_week_seq`, `d_year`]]])
: rowType = RecordType(ANY d_date_sk, ANY d_date, ANY d_week_seq, ANY d_year): rowcount =
73049.0, cumulative cost = {73049.0 rows, 292196.0 cpu, 0.0 io, 0.0 network, 0.0 memory},
id = 5366537
04-20                                                        Project(d_date_sk=[$0], d_week_seq0=[$1])
: rowType = RecordType(INTEGER d_date_sk, INTEGER d_week_seq0): rowcount = 73049.0, cumulative
cost = {146098.0 rows, 730490.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366545
04-22                                                          Project(d_date_sk=[CAST($0):INTEGER],
d_week_seq=[CAST($1):INTEGER]) : rowType = RecordType(INTEGER d_date_sk, INTEGER d_week_seq):
rowcount = 73049.0, cumulative cost = {146098.0 rows, 730490.0 cpu, 0.0 io, 0.0 network, 0.0
memory}, id = 5366544
04-24                                                            Scan(groupscan=[ParquetGroupScan
[entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/tpcds_sf1/parquet/date_dim]], selectionRoot=maprfs:/drill/testdata/tpcds_sf1/parquet/date_dim,
numFiles=1, usedMetadataFile=false, columns=[`d_date_sk`, `d_week_seq`]]]) : rowType = RecordType(ANY
d_date_sk, ANY d_week_seq): rowcount = 73049.0, cumulative cost = {73049.0 rows, 146098.0
cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366543
04-15                                                    Project(d_date_sk=[$0], d_date0=[$1])
: rowType = RecordType(INTEGER d_date_sk, DATE d_date0): rowcount = 73049.0, cumulative cost
= {146098.0 rows, 730490.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366550
04-17                                                      Project(d_date_sk=[CAST($0):INTEGER],
d_date=[CAST($1):DATE]) : rowType = RecordType(INTEGER d_date_sk, DATE d_date): rowcount =
73049.0, cumulative cost = {146098.0 rows, 730490.0 cpu, 0.0 io, 0.0 network, 0.0 memory},
id = 5366549
04-19                                                        Scan(groupscan=[ParquetGroupScan
[entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/tpcds_sf1/parquet/date_dim]], selectionRoot=maprfs:/drill/testdata/tpcds_sf1/parquet/date_dim,
numFiles=1, usedMetadataFile=false, columns=[`d_date_sk`, `d_date`]]]) : rowType = RecordType(ANY
d_date_sk, ANY d_date): rowcount = 73049.0, cumulative cost = {73049.0 rows, 146098.0 cpu,
0.0 io, 0.0 network, 0.0 memory}, id = 5366548
04-09                                            Project(p_promo_sk=[CAST($0):INTEGER]) :
rowType = RecordType(INTEGER p_promo_sk): rowcount = 300.0, cumulative cost = {600.0 rows,
1500.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366556
04-11                                              Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath
[path=maprfs:///drill/testdata/tpcds_sf1/parquet/promotion]], selectionRoot=maprfs:/drill/testdata/tpcds_sf1/parquet/promotion,
numFiles=1, usedMetadataFile=false, columns=[`p_promo_sk`]]]) : rowType = RecordType(ANY p_promo_sk):
rowcount = 300.0, cumulative cost = {300.0 rows, 300.0 cpu, 0.0 io, 0.0 network, 0.0 memory},
id = 5366555
04-05                                        Project(cr_item_sk=[CAST($0):INTEGER], cr_order_number=[CAST($1):INTEGER])
: rowType = RecordType(INTEGER cr_item_sk, INTEGER cr_order_number): rowcount = 144067.0,
cumulative cost = {288134.0 rows, 1440670.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 5366560
{noformat}


Thanks

Khurram

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message