Return-Path: X-Original-To: apmail-hive-issues-archive@minotaur.apache.org Delivered-To: apmail-hive-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3037617B46 for ; Mon, 11 May 2015 03:53:01 +0000 (UTC) Received: (qmail 73179 invoked by uid 500); 11 May 2015 03:53:01 -0000 Delivered-To: apmail-hive-issues-archive@hive.apache.org Received: (qmail 73160 invoked by uid 500); 11 May 2015 03:53:01 -0000 Mailing-List: contact issues-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list issues@hive.apache.org Received: (qmail 73149 invoked by uid 99); 11 May 2015 03:53:01 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 11 May 2015 03:53:01 +0000 Date: Mon, 11 May 2015 03:53:00 +0000 (UTC) From: "Matt McCline (JIRA)" To: issues@hive.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HIVE-10609) Vectorization : Q64 fails with ClassCastException MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-10609?page=3Dcom.atlassian= .jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=3D1453= 7532#comment-14537532 ]=20 Matt McCline commented on HIVE-10609: ------------------------------------- Test failures are not related to the changes here. > Vectorization : Q64 fails with ClassCastException > ------------------------------------------------- > > Key: HIVE-10609 > URL: https://issues.apache.org/jira/browse/HIVE-10609 > Project: Hive > Issue Type: Bug > Components: Vectorization > Affects Versions: 1.2.0 > Reporter: Mostafa Mokhtar > Assignee: Matt McCline > Attachments: HIVE-10609.01.patch, HIVE-10609.02.patch > > > TPC-DS Q64 fails with ClassCastException. > Query > {code} > select cs1.product_name ,cs1.store_name ,cs1.store_zip ,cs1.b_street_numb= er ,cs1.b_streen_name ,cs1.b_city > ,cs1.b_zip ,cs1.c_street_number ,cs1.c_street_name ,cs1.c_city ,cs1.= c_zip ,cs1.syear ,cs1.cnt > ,cs1.s1 ,cs1.s2 ,cs1.s3 > ,cs2.s1 ,cs2.s2 ,cs2.s3 ,cs2.syear ,cs2.cnt > from > (select i_product_name as product_name ,i_item_sk as item_sk ,s_store_nam= e as store_name > ,s_zip as store_zip ,ad1.ca_street_number as b_street_number ,ad1.ca= _street_name as b_streen_name > ,ad1.ca_city as b_city ,ad1.ca_zip as b_zip ,ad2.ca_street_number as= c_street_number > ,ad2.ca_street_name as c_street_name ,ad2.ca_city as c_city ,ad2.ca_= zip as c_zip > ,d1.d_year as syear ,d2.d_year as fsyear ,d3.d_year as s2year ,count= (*) as cnt > ,sum(ss_wholesale_cost) as s1 ,sum(ss_list_price) as s2 ,sum(ss_coup= on_amt) as s3 > FROM store_sales > JOIN store_returns ON store_sales.ss_item_sk =3D store_returns.sr= _item_sk and store_sales.ss_ticket_number =3D store_returns.sr_ticket_numbe= r > JOIN customer ON store_sales.ss_customer_sk =3D customer.c_custom= er_sk > JOIN date_dim d1 ON store_sales.ss_sold_date_sk =3D d1.d_date_sk > JOIN date_dim d2 ON customer.c_first_sales_date_sk =3D d2.d_date_= sk=20 > JOIN date_dim d3 ON customer.c_first_shipto_date_sk =3D d3.d_date= _sk > JOIN store ON store_sales.ss_store_sk =3D store.s_store_sk > JOIN customer_demographics cd1 ON store_sales.ss_cdemo_sk=3D cd1.= cd_demo_sk > JOIN customer_demographics cd2 ON customer.c_current_cdemo_sk =3D= cd2.cd_demo_sk > JOIN promotion ON store_sales.ss_promo_sk =3D promotion.p_promo_s= k > JOIN household_demographics hd1 ON store_sales.ss_hdemo_sk =3D hd= 1.hd_demo_sk > JOIN household_demographics hd2 ON customer.c_current_hdemo_sk = =3D hd2.hd_demo_sk > JOIN customer_address ad1 ON store_sales.ss_addr_sk =3D ad1.ca_ad= dress_sk > JOIN customer_address ad2 ON customer.c_current_addr_sk =3D ad2.c= a_address_sk > JOIN income_band ib1 ON hd1.hd_income_band_sk =3D ib1.ib_income_b= and_sk > JOIN income_band ib2 ON hd2.hd_income_band_sk =3D ib2.ib_income_b= and_sk > JOIN item ON store_sales.ss_item_sk =3D item.i_item_sk > JOIN > (select cs_item_sk > ,sum(cs_ext_list_price) as sale,sum(cr_refunded_cash+cr_reversed_= charge+cr_store_credit) as refund > from catalog_sales JOIN catalog_returns > ON catalog_sales.cs_item_sk =3D catalog_returns.cr_item_sk > and catalog_sales.cs_order_number =3D catalog_returns.cr_order_number > group by cs_item_sk > having sum(cs_ext_list_price)>2*sum(cr_refunded_cash+cr_reversed_charge= +cr_store_credit)) cs_ui > ON store_sales.ss_item_sk =3D cs_ui.cs_item_sk > WHERE =20 > cd1.cd_marital_status <> cd2.cd_marital_status and > i_color in ('maroon','burnished','dim','steel','navajo','chocola= te') and > i_current_price between 35 and 35 + 10 and > i_current_price between 35 + 1 and 35 + 15 > group by i_product_name ,i_item_sk ,s_store_name ,s_zip ,ad1.ca_street_nu= mber > ,ad1.ca_street_name ,ad1.ca_city ,ad1.ca_zip ,ad2.ca_street_number > ,ad2.ca_street_name ,ad2.ca_city ,ad2.ca_zip ,d1.d_year ,d2.d_year= ,d3.d_year > ) cs1 > JOIN > (select i_product_name as product_name ,i_item_sk as item_sk ,s_store_nam= e as store_name > ,s_zip as store_zip ,ad1.ca_street_number as b_street_number ,ad1.ca= _street_name as b_streen_name > ,ad1.ca_city as b_city ,ad1.ca_zip as b_zip ,ad2.ca_street_number as= c_street_number > ,ad2.ca_street_name as c_street_name ,ad2.ca_city as c_city ,ad2.ca_= zip as c_zip > ,d1.d_year as syear ,d2.d_year as fsyear ,d3.d_year as s2year ,count= (*) as cnt > ,sum(ss_wholesale_cost) as s1 ,sum(ss_list_price) as s2 ,sum(ss_coup= on_amt) as s3 > FROM store_sales > JOIN store_returns ON store_sales.ss_item_sk =3D store_returns.sr= _item_sk and store_sales.ss_ticket_number =3D store_returns.sr_ticket_numbe= r > JOIN customer ON store_sales.ss_customer_sk =3D customer.c_custom= er_sk > JOIN date_dim d1 ON store_sales.ss_sold_date_sk =3D d1.d_date_sk > JOIN date_dim d2 ON customer.c_first_sales_date_sk =3D d2.d_date_= sk=20 > JOIN date_dim d3 ON customer.c_first_shipto_date_sk =3D d3.d_date= _sk > JOIN store ON store_sales.ss_store_sk =3D store.s_store_sk > JOIN customer_demographics cd1 ON store_sales.ss_cdemo_sk=3D cd1.= cd_demo_sk > JOIN customer_demographics cd2 ON customer.c_current_cdemo_sk =3D= cd2.cd_demo_sk > JOIN promotion ON store_sales.ss_promo_sk =3D promotion.p_promo_s= k > JOIN household_demographics hd1 ON store_sales.ss_hdemo_sk =3D hd= 1.hd_demo_sk > JOIN household_demographics hd2 ON customer.c_current_hdemo_sk = =3D hd2.hd_demo_sk > JOIN customer_address ad1 ON store_sales.ss_addr_sk =3D ad1.ca_ad= dress_sk > JOIN customer_address ad2 ON customer.c_current_addr_sk =3D ad2.c= a_address_sk > JOIN income_band ib1 ON hd1.hd_income_band_sk =3D ib1.ib_income_b= and_sk > JOIN income_band ib2 ON hd2.hd_income_band_sk =3D ib2.ib_income_b= and_sk > JOIN item ON store_sales.ss_item_sk =3D item.i_item_sk > JOIN > (select cs_item_sk > ,sum(cs_ext_list_price) as sale,sum(cr_refunded_cash+cr_reversed_= charge+cr_store_credit) as refund > from catalog_sales JOIN catalog_returns > ON catalog_sales.cs_item_sk =3D catalog_returns.cr_item_sk > and catalog_sales.cs_order_number =3D catalog_returns.cr_order_number > group by cs_item_sk > having sum(cs_ext_list_price)>2*sum(cr_refunded_cash+cr_reversed_charge= +cr_store_credit)) cs_ui > ON store_sales.ss_item_sk =3D cs_ui.cs_item_sk > WHERE =20 > cd1.cd_marital_status <> cd2.cd_marital_status and > i_color in ('maroon','burnished','dim','steel','navajo','chocola= te') and > i_current_price between 35 and 35 + 10 and > i_current_price between 35 + 1 and 35 + 15 > group by i_product_name ,i_item_sk ,s_store_name ,s_zip ,ad1.ca_street_nu= mber > ,ad1.ca_street_name ,ad1.ca_city ,ad1.ca_zip ,ad2.ca_street_number > ,ad2.ca_street_name ,ad2.ca_city ,ad2.ca_zip ,d1.d_year ,d2.d_year= ,d3.d_year > ) cs2 > ON cs1.item_sk=3Dcs2.item_sk > where=20 > cs1.syear =3D 2000 and > cs2.syear =3D 2000 + 1 and > cs2.cnt <=3D cs1.cnt and > cs1.store_name =3D cs2.store_name and > cs1.store_zip =3D cs2.store_zip > order by cs1.product_name ,cs1.store_name ,cs2.cnt > {code} > Exception=20 > {code} > ], TaskAttempt 3 failed, info=3D[Error: Failure while running task:java.l= ang.RuntimeException: java.lang.RuntimeException: org.apache.hadoop.hive.ql= .metadata.HiveException: Hive Runtime Error while processing row=20 > =09at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunPro= cessor(TezProcessor.java:171) > =09at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.ja= va:137) > =09at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOP= rocessorRuntimeTask.java:337) > =09at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(= TezTaskRunner.java:179) > =09at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(= TezTaskRunner.java:171) > =09at java.security.AccessController.doPrivileged(Native Method) > =09at javax.security.auth.Subject.doAs(Subject.java:415) > =09at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInfor= mation.java:1628) > =09at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callIn= ternal(TezTaskRunner.java:171) > =09at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callIn= ternal(TezTaskRunner.java:167) > =09at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > =09at java.util.concurrent.FutureTask.run(FutureTask.java:262) > =09at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecuto= r.java:1145) > =09at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecut= or.java:615) > =09at java.lang.Thread.run(Thread.java:745) > Caused by: java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata= .HiveException: Hive Runtime Error while processing row=20 > =09at org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRe= cordSource.java:91) > =09at org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRe= cordSource.java:68) > =09at org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecord= Processor.java:290) > =09at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunPro= cessor(TezProcessor.java:148) > =09... 14 more > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime= Error while processing row=20 > =09at org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(Vec= torMapOperator.java:52) > =09at org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRe= cordSource.java:83) > =09... 17 more > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Unexpected e= xception: org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector cannot be= cast to org.apache.hadoop.hive.ql.exec.vector.DoubleColumnVector > =09at org.apache.hadoop.hive.ql.exec.MapJoinOperator.process(MapJoinOpera= tor.java:426) > =09at org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.process= (VectorMapJoinOperator.java:290) > =09at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:837) > =09at org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.process(= VectorSelectOperator.java:138) > =09at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:837) > =09at org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.process(= VectorFilterOperator.java:114) > =09at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:837) > =09at org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanO= perator.java:97) > =09at org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOper= ator.java:162) > =09at org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(Vec= torMapOperator.java:45) > =09... 18 more > Caused by: java.lang.ClassCastException: org.apache.hadoop.hive.ql.exec.v= ector.BytesColumnVector cannot be cast to org.apache.hadoop.hive.ql.exec.ve= ctor.DoubleColumnVector > =09at org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionW= riterFactory$VectorExpressionWriterDouble.writeValue(VectorExpressionWriter= Factory.java:225) > =09at org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.getRowO= bject(VectorMapJoinOperator.java:352) > =09at org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.spillBi= gTableRow(VectorMapJoinOperator.java:309) > =09at org.apache.hadoop.hive.ql.exec.MapJoinOperator.process(MapJoinOpera= tor.java:404) > =09... 27 more > ]], Vertex failed as one or more tasks failed. failedTasks:1, Vertex vert= ex_1426707664723_4575_1_34 [Map 40] killed/failed due to:null]Vertex killed= , vertexName=3DReducer 6, vertexId=3Dvertex_1426707664723_4575_1_42, diagno= stics=3D[Vertex received Kill while in RUNNING state., Vertex killed as oth= er vertex failed. failedTasks:0, Vertex vertex_1426707664723_4575_1_42 [Red= ucer 6] killed/failed due to:null]Vertex killed, vertexName=3DReducer 27, v= ertexId=3Dvertex_1426707664723_4575_1_36, diagnostics=3D[Vertex received Ki= ll while in RUNNING state., Vertex killed as other vertex failed. failedTas= ks:0, Vertex vertex_1426707664723_4575_1_36 [Reducer 27] killed/failed due = to:null]Vertex killed, vertexName=3DReducer 5, vertexId=3Dvertex_1426707664= 723_4575_1_41, diagnostics=3D[Vertex received Kill while in RUNNING state.,= Vertex killed as other vertex failed. failedTasks:0, Vertex vertex_1426707= 664723_4575_1_41 [Reducer 5] killed/failed due to:null]Vertex killed, verte= xName=3DMap 26, vertexId=3Dvertex_1426707664723_4575_1_35, diagnostics=3D[V= ertex received Kill while in RUNNING state., Vertex killed as other vertex = failed. failedTasks:0, Vertex vertex_1426707664723_4575_1_35 [Map 26] kille= d/failed due to:null]Vertex killed, vertexName=3DMap 19, vertexId=3Dvertex_= 1426707664723_4575_1_39, diagnostics=3D[Vertex received Kill while in RUNNI= NG state., Vertex killed as other vertex failed. failedTasks:0, Vertex vert= ex_1426707664723_4575_1_39 [Map 19] killed/failed due to:null]Vertex killed= , vertexName=3DMap 4, vertexId=3Dvertex_1426707664723_4575_1_40, diagnostic= s=3D[Vertex received Kill while in RUNNING state., Vertex killed as other v= ertex failed. failedTasks:0, Vertex vertex_1426707664723_4575_1_40 [Map 4] = killed/failed due to:null]DAG failed due to vertex failure. failedVertices:= 1 killedVertices:6 > {code} > Plan=20 > {code} > STAGE PLANS: > Stage: Stage-1 > Tez > Edges: > Map 1 <- Map 2 (BROADCAST_EDGE) > Map 10 <- Map 11 (BROADCAST_EDGE), Map 12 (BROADCAST_EDGE), Map 1= 3 (BROADCAST_EDGE), Map 18 (BROADCAST_EDGE) > Map 14 <- Map 15 (BROADCAST_EDGE) > Map 15 <- Map 16 (BROADCAST_EDGE), Map 17 (BROADCAST_EDGE) > Map 18 <- Map 14 (BROADCAST_EDGE) > Map 19 <- Map 10 (BROADCAST_EDGE), Map 7 (BROADCAST_EDGE), Map 9 = (BROADCAST_EDGE), Reducer 21 (BROADCAST_EDGE) > Map 20 <- Map 22 (BROADCAST_EDGE) > Map 23 <- Map 24 (BROADCAST_EDGE) > Map 26 <- Map 23 (BROADCAST_EDGE), Map 25 (BROADCAST_EDGE), Map 4= 0 (BROADCAST_EDGE) > Map 28 <- Map 29 (BROADCAST_EDGE) > Map 31 <- Map 32 (BROADCAST_EDGE), Map 33 (BROADCAST_EDGE), Map 3= 4 (BROADCAST_EDGE), Map 39 (BROADCAST_EDGE) > Map 35 <- Map 36 (BROADCAST_EDGE) > Map 36 <- Map 37 (BROADCAST_EDGE), Map 38 (BROADCAST_EDGE) > Map 39 <- Map 35 (BROADCAST_EDGE) > Map 4 <- Map 1 (BROADCAST_EDGE), Map 19 (BROADCAST_EDGE), Map 3 (= BROADCAST_EDGE) > Map 40 <- Map 28 (BROADCAST_EDGE), Map 30 (BROADCAST_EDGE), Map 3= 1 (BROADCAST_EDGE), Reducer 42 (BROADCAST_EDGE) > Map 41 <- Map 43 (BROADCAST_EDGE) > Map 7 <- Map 8 (BROADCAST_EDGE) > Reducer 21 <- Map 20 (SIMPLE_EDGE) > Reducer 27 <- Map 26 (SIMPLE_EDGE) > Reducer 42 <- Map 41 (SIMPLE_EDGE) > Reducer 5 <- Map 4 (SIMPLE_EDGE), Reducer 27 (BROADCAST_EDGE) > Reducer 6 <- Reducer 5 (SIMPLE_EDGE) > DagName: jenkins_20150503192312_dad6086a-e3ca-4519-aab9-d1179786ba1= d:1 > Vertices: > Map 1=20 > Map Operator Tree: > TableScan > alias: hd1 > filterExpr: (hd_income_band_sk is not null and hd_demo_= sk is not null) (type: boolean) > Statistics: Num rows: 7200 Data size: 770400 Basic stat= s: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (hd_income_band_sk is not null and hd_demo= _sk is not null) (type: boolean) > Statistics: Num rows: 7200 Data size: 57600 Basic sta= ts: COMPLETE Column stats: COMPLETE > Select Operator > expressions: hd_demo_sk (type: int), hd_income_band= _sk (type: int) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 7200 Data size: 57600 Basic s= tats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col1 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col0 > input vertices: > 1 Map 2 > Statistics: Num rows: 7200 Data size: 28800 Basic= stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 7200 Data size: 28800 Bas= ic stats: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 10=20 > Map Operator Tree: > TableScan > alias: cd1 > filterExpr: cd_demo_sk is not null (type: boolean) > Statistics: Num rows: 1920800 Data size: 718379200 Basi= c stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: cd_demo_sk is not null (type: boolean) > Statistics: Num rows: 1920800 Data size: 170951200 Ba= sic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: cd_demo_sk (type: int), cd_marital_sta= tus (type: string) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 1920800 Data size: 170951200 = Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col1 (type: int) > outputColumnNames: _col1, _col2, _col4, _col5, _c= ol6, _col7 > input vertices: > 1 Map 11 > Statistics: Num rows: 1387730 Data size: 14571165= 0 Basic stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col7 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col2, _col4, _col5, = _col6, _col9 > input vertices: > 1 Map 12 > Statistics: Num rows: 1387730 Data size: 145711= 650 Basic stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col6 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col2, _col4, _col5= , _col9, _col11 > input vertices: > 1 Map 13 > Statistics: Num rows: 1387730 Data size: 1457= 11650 Basic stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col2 (type: int) > 1 _col4 (type: int) > outputColumnNames: _col1, _col4, _col5, _co= l9, _col11, _col13, _col15, _col18, _col19, _col21, _col23, _col24, _col25,= _col29, _col31, _col32 > input vertices: > 1 Map 18 > Statistics: Num rows: 68 Data size: 26860 B= asic stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Filter Operator > predicate: (_col13 <> _col1) (type: boole= an) > Statistics: Num rows: 68 Data size: 26860= Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col15 (type: int), _col24= (type: float), _col25 (type: float), _col4 (type: int), _col5 (type: int),= _col29 (type: int), _col9 (type: int), _col11 (type: int), _col31 (type: s= tring), _col32 (type: string), _col18 (type: int), _col19 (type: int), _col= 21 (type: int), _col23 (type: float) > outputColumnNames: _col1, _col10, _col1= 1, _col16, _col17, _col21, _col23, _col25, _col27, _col28, _col4, _col5, _c= ol7, _col9 > Statistics: Num rows: 68 Data size: 153= 00 Basic stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col1 (type: int) > sort order: + > Map-reduce partition columns: _col1 (= type: int) > Statistics: Num rows: 68 Data size: 1= 5300 Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col4 (type: int),= _col5 (type: int), _col7 (type: int), _col9 (type: float), _col10 (type: f= loat), _col11 (type: float), _col16 (type: int), _col17 (type: int), _col21= (type: int), _col23 (type: int), _col25 (type: int), _col27 (type: string)= , _col28 (type: string) > Execution mode: vectorized > Map 11=20 > Map Operator Tree: > TableScan > alias: customer > filterExpr: (((((c_current_cdemo_sk is not null and c_f= irst_sales_date_sk is not null) and c_first_shipto_date_sk is not null) and= c_customer_sk is not null) and c_current_hdemo_sk is not null) and c_curre= nt_addr_sk is not null) (type: boolean) > Statistics: Num rows: 1600000 Data size: 1241633212 Bas= ic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (((((c_current_cdemo_sk is not null and c_= first_sales_date_sk is not null) and c_first_shipto_date_sk is not null) an= d c_customer_sk is not null) and c_current_hdemo_sk is not null) and c_curr= ent_addr_sk is not null) (type: boolean) > Statistics: Num rows: 1387730 Data size: 32529324 Bas= ic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: c_customer_sk (type: int), c_current_c= demo_sk (type: int), c_current_hdemo_sk (type: int), c_current_addr_sk (typ= e: int), c_first_shipto_date_sk (type: int), c_first_sales_date_sk (type: i= nt) > outputColumnNames: _col0, _col1, _col2, _col3, _col= 4, _col5 > Statistics: Num rows: 1387730 Data size: 32529324 B= asic stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col1 (type: int) > sort order: + > Map-reduce partition columns: _col1 (type: int) > Statistics: Num rows: 1387730 Data size: 32529324= Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col0 (type: int), _col2 (type= : int), _col3 (type: int), _col4 (type: int), _col5 (type: int) > Execution mode: vectorized > Map 12=20 > Map Operator Tree: > TableScan > alias: d1 > filterExpr: d_date_sk is not null (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic s= tats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: d_date_sk is not null (type: boolean) > Statistics: Num rows: 73049 Data size: 584392 Basic s= tats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int), d_year (type: i= nt) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 73049 Data size: 584392 Basic= stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 73049 Data size: 584392 Bas= ic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: int) > Execution mode: vectorized > Map 13=20 > Map Operator Tree: > TableScan > alias: d1 > filterExpr: d_date_sk is not null (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic s= tats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: d_date_sk is not null (type: boolean) > Statistics: Num rows: 73049 Data size: 584392 Basic s= tats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int), d_year (type: i= nt) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 73049 Data size: 584392 Basic= stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 73049 Data size: 584392 Bas= ic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: int) > Execution mode: vectorized > Map 14=20 > Map Operator Tree: > TableScan > alias: cd1 > filterExpr: cd_demo_sk is not null (type: boolean) > Statistics: Num rows: 1920800 Data size: 718379200 Basi= c stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: cd_demo_sk is not null (type: boolean) > Statistics: Num rows: 1920800 Data size: 170951200 Ba= sic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: cd_demo_sk (type: int), cd_marital_sta= tus (type: string) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 1920800 Data size: 170951200 = Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col3 (type: int) > outputColumnNames: _col1, _col3, _col4, _col6, _c= ol7, _col8, _col9, _col11, _col12, _col13, _col17 > input vertices: > 1 Map 15 > Statistics: Num rows: 78 Data size: 9750 Basic st= ats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Reduce Output Operator > key expressions: _col8 (type: int) > sort order: + > Map-reduce partition columns: _col8 (type: int) > Statistics: Num rows: 78 Data size: 9750 Basic = stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string), _col3 = (type: int), _col4 (type: int), _col6 (type: int), _col7 (type: int), _col9= (type: int), _col11 (type: float), _col12 (type: float), _col13 (type: flo= at), _col17 (type: int) > Execution mode: vectorized > Map 15=20 > Map Operator Tree: > TableScan > alias: store_sales > filterExpr: ((((((((ss_item_sk is not null and ss_ticke= t_number is not null) and ss_sold_date_sk is not null) and ss_cdemo_sk is n= ot null) and ss_store_sk is not null) and ss_customer_sk is not null) and s= s_promo_sk is not null) and ss_addr_sk is not null) and ss_hdemo_sk is not = null) (type: boolean) > Statistics: Num rows: 550076554 Data size: 101277521188= Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ((((((((ss_item_sk is not null and ss_tick= et_number is not null) and ss_sold_date_sk is not null) and ss_cdemo_sk is = not null) and ss_store_sk is not null) and ss_customer_sk is not null) and = ss_promo_sk is not null) and ss_addr_sk is not null) and ss_hdemo_sk is not= null) (type: boolean) > Statistics: Num rows: 476766968 Data size: 2248047657= 2 Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ss_sold_date_sk (type: int), ss_item_s= k (type: int), ss_customer_sk (type: int), ss_cdemo_sk (type: int), ss_hdem= o_sk (type: int), ss_addr_sk (type: int), ss_store_sk (type: int), ss_promo= _sk (type: int), ss_ticket_number (type: int), ss_wholesale_cost (type: flo= at), ss_list_price (type: float), ss_coupon_amt (type: float) > outputColumnNames: _col0, _col1, _col2, _col3, _col= 4, _col5, _col6, _col7, _col8, _col9, _col10, _col11 > Statistics: Num rows: 476766968 Data size: 22480476= 572 Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col1 (type: int), _col8 (type: int) > 1 _col0 (type: int), _col1 (type: int) > outputColumnNames: _col0, _col1, _col2, _col3, _c= ol4, _col5, _col6, _col7, _col9, _col10, _col11 > input vertices: > 1 Map 16 > Statistics: Num rows: 8686 Data size: 382184 Basi= c stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col2, _col3, _col4, = _col5, _col6, _col7, _col9, _col10, _col11 > input vertices: > 1 Map 17 > Statistics: Num rows: 78 Data size: 3120 Basic = stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Select Operator > expressions: _col1 (type: int), _col10 (type:= float), _col11 (type: float), 2000 (type: int), _col2 (type: int), _col3 (= type: int), _col4 (type: int), _col5 (type: int), _col6 (type: int), _col7 = (type: int), _col9 (type: float) > outputColumnNames: _col1, _col10, _col11, _co= l15, _col2, _col3, _col4, _col5, _col6, _col7, _col9 > Statistics: Num rows: 78 Data size: 3432 Basi= c stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col3 (type: int) > sort order: + > Map-reduce partition columns: _col3 (type: = int) > Statistics: Num rows: 78 Data size: 3432 Ba= sic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: int), _col2= (type: int), _col4 (type: int), _col5 (type: int), _col6 (type: int), _col= 7 (type: int), _col9 (type: float), _col10 (type: float), _col11 (type: flo= at), _col15 (type: int) > Execution mode: vectorized > Map 16=20 > Map Operator Tree: > TableScan > alias: store_returns > filterExpr: (sr_item_sk is not null and sr_ticket_numbe= r is not null) (type: boolean) > Statistics: Num rows: 55578005 Data size: 9601960106 Ba= sic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (sr_item_sk is not null and sr_ticket_numb= er is not null) (type: boolean) > Statistics: Num rows: 55578005 Data size: 444624040 B= asic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: sr_item_sk (type: int), sr_ticket_numb= er (type: int) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 55578005 Data size: 444624040= Basic stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int), _col1 (type: = int) > sort order: ++ > Map-reduce partition columns: _col0 (type: int), = _col1 (type: int) > Statistics: Num rows: 55578005 Data size: 4446240= 40 Basic stats: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 17=20 > Map Operator Tree: > TableScan > alias: d1 > filterExpr: ((d_year =3D 2000) and d_date_sk is not nul= l) (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic s= tats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ((d_year =3D 2000) and d_date_sk is not nu= ll) (type: boolean) > Statistics: Num rows: 652 Data size: 5216 Basic stats= : COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 652 Data size: 2608 Basic sta= ts: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 652 Data size: 2608 Basic s= tats: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 18=20 > Map Operator Tree: > TableScan > alias: store > filterExpr: ((s_store_sk is not null and s_store_name i= s not null) and s_zip is not null) (type: boolean) > Statistics: Num rows: 212 Data size: 405680 Basic stats= : COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ((s_store_sk is not null and s_store_name = is not null) and s_zip is not null) (type: boolean) > Statistics: Num rows: 212 Data size: 38372 Basic stat= s: COMPLETE Column stats: COMPLETE > Select Operator > expressions: s_store_sk (type: int), s_store_name (= type: string), s_zip (type: string) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 212 Data size: 38372 Basic st= ats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col8 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col3, _col4, _col6, _c= ol7, _col9, _col11, _col12, _col13, _col17, _col19, _col20 > input vertices: > 0 Map 14 > Statistics: Num rows: 78 Data size: 23244 Basic s= tats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Select Operator > expressions: _col1 (type: string), _col11 (type= : float), _col12 (type: float), _col13 (type: float), _col17 (type: int), _= col19 (type: string), _col20 (type: string), _col3 (type: int), _col4 (type= : int), _col6 (type: int), _col7 (type: int), _col9 (type: int) > outputColumnNames: _col1, _col11, _col12, _col1= 3, _col17, _col19, _col20, _col3, _col4, _col6, _col7, _col9 > Statistics: Num rows: 78 Data size: 23244 Basic= stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col4 (type: int) > sort order: + > Map-reduce partition columns: _col4 (type: in= t) > Statistics: Num rows: 78 Data size: 23244 Bas= ic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string), _col= 3 (type: int), _col6 (type: int), _col7 (type: int), _col9 (type: int), _co= l11 (type: float), _col12 (type: float), _col13 (type: float), _col17 (type= : int), _col19 (type: string), _col20 (type: string) > Execution mode: vectorized > Map 19=20 > Map Operator Tree: > TableScan > alias: item > filterExpr: ((((i_color) IN ('maroon', 'burnished', 'di= m', 'steel', 'navajo', 'chocolate') and i_current_price BETWEEN 35 AND 45) = and i_current_price BETWEEN 36 AND 50) and i_item_sk is not null) (type: bo= olean) > Statistics: Num rows: 48000 Data size: 68732712 Basic s= tats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ((((i_color) IN ('maroon', 'burnished', 'd= im', 'steel', 'navajo', 'chocolate') and i_current_price BETWEEN 35 AND 45)= and i_current_price BETWEEN 36 AND 50) and i_item_sk is not null) (type: b= oolean) > Statistics: Num rows: 6000 Data size: 1199936 Basic s= tats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: i_item_sk (type: int), i_product_name = (type: string) > outputColumnNames: _col0, _col3 > Statistics: Num rows: 6000 Data size: 642000 Basic = stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > Inner Join 0 to 2 > keys: > 0 _col1 (type: int) > 1 _col0 (type: int) > 2 _col0 (type: int) > outputColumnNames: _col4, _col5, _col7, _col9, _c= ol10, _col11, _col16, _col17, _col21, _col23, _col25, _col27, _col28, _col3= 3, _col36 > input vertices: > 0 Map 10 > 2 Reducer 21 > Statistics: Num rows: 13200 Data size: 1412400 Ba= sic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col10 (type: float), _col11 (type= : float), _col16 (type: int), _col17 (type: int), _col21 (type: int), _col2= 3 (type: int), _col25 (type: int), _col27 (type: string), _col28 (type: str= ing), _col33 (type: int), _col36 (type: string), _col4 (type: int), _col5 (= type: int), _col7 (type: int), _col9 (type: float) > outputColumnNames: _col10, _col11, _col16, _col= 17, _col21, _col23, _col25, _col27, _col28, _col33, _col36, _col4, _col5, _= col7, _col9 > Statistics: Num rows: 13200 Data size: 1412400 = Basic stats: COMPLETE Column stats: NONE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col7 (type: int) > outputColumnNames: _col5, _col6, _col10, _col= 11, _col12, _col17, _col18, _col22, _col24, _col26, _col28, _col29, _col34,= _col37 > input vertices: > 0 Map 9 > Statistics: Num rows: 14520 Data size: 155364= 0 Basic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col10 (type: float), _col11 (= type: float), _col12 (type: float), _col17 (type: int), _col18 (type: int),= _col22 (type: int), _col24 (type: int), _col26 (type: int), _col28 (type: = string), _col29 (type: string), _col34 (type: int), _col37 (type: string), = _col5 (type: int), _col6 (type: int) > outputColumnNames: _col10, _col11, _col12, = _col17, _col18, _col22, _col24, _col26, _col28, _col29, _col34, _col37, _co= l5, _col6 > Statistics: Num rows: 14520 Data size: 1553= 640 Basic stats: COMPLETE Column stats: NONE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col17 (type: int) > outputColumnNames: _col8, _col9, _col13, = _col14, _col15, _col21, _col25, _col27, _col29, _col31, _col32, _col37, _co= l40 > input vertices: > 0 Map 7 > Statistics: Num rows: 15972 Data size: 17= 09004 Basic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col13 (type: float), _col= 14 (type: float), _col15 (type: float), _col21 (type: int), _col25 (type: i= nt), _col27 (type: int), _col29 (type: int), _col31 (type: string), _col32 = (type: string), _col37 (type: int), _col40 (type: string), _col8 (type: int= ), _col9 (type: int) > outputColumnNames: _col13, _col14, _col= 15, _col21, _col25, _col27, _col29, _col31, _col32, _col37, _col40, _col8, = _col9 > Statistics: Num rows: 15972 Data size: = 1709004 Basic stats: COMPLETE Column stats: NONE > Reduce Output Operator > key expressions: _col9 (type: int) > sort order: + > Map-reduce partition columns: _col9 (= type: int) > Statistics: Num rows: 15972 Data size= : 1709004 Basic stats: COMPLETE Column stats: NONE > value expressions: _col8 (type: int),= _col13 (type: float), _col14 (type: float), _col15 (type: float), _col21 (= type: int), _col25 (type: int), _col27 (type: int), _col29 (type: int), _co= l31 (type: string), _col32 (type: string), _col37 (type: int), _col40 (type= : string) > Execution mode: vectorized > Map 2=20 > Map Operator Tree: > TableScan > alias: ib1 > filterExpr: ib_income_band_sk is not null (type: boolea= n) > Statistics: Num rows: 20 Data size: 240 Basic stats: CO= MPLETE Column stats: COMPLETE > Filter Operator > predicate: ib_income_band_sk is not null (type: boole= an) > Statistics: Num rows: 20 Data size: 80 Basic stats: C= OMPLETE Column stats: COMPLETE > Select Operator > expressions: ib_income_band_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 20 Data size: 80 Basic stats:= COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 20 Data size: 80 Basic stat= s: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 20=20 > Map Operator Tree: > TableScan > alias: catalog_sales > filterExpr: (cs_item_sk is not null and cs_order_number= is not null) (type: boolean) > Statistics: Num rows: 286549727 Data size: 65825832570 = Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (cs_item_sk is not null and cs_order_numbe= r is not null) (type: boolean) > Statistics: Num rows: 286549727 Data size: 3435718732= Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: cs_item_sk (type: int), cs_order_numbe= r (type: int), cs_ext_list_price (type: float) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 286549727 Data size: 34357187= 32 Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int), _col1 (type: int) > 1 _col0 (type: int), _col1 (type: int) > outputColumnNames: _col0, _col2, _col5, _col6, _c= ol7 > input vertices: > 1 Map 22 > Statistics: Num rows: 2911 Data size: 58220 Basic= stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Select Operator > expressions: _col0 (type: int), _col2 (type: fl= oat), ((_col5 + _col6) + _col7) (type: float) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 2911 Data size: 58220 Bas= ic stats: COMPLETE Column stats: COMPLETE > Group By Operator > aggregations: sum(_col1), sum(_col2) > keys: _col0 (type: int) > mode: hash > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 1 Data size: 20 Basic s= tats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: = int) > Statistics: Num rows: 1 Data size: 20 Basic= stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: double), _c= ol2 (type: double) > Execution mode: vectorized > Map 22=20 > Map Operator Tree: > TableScan > alias: catalog_returns > filterExpr: (cr_item_sk is not null and cr_order_number= is not null) (type: boolean) > Statistics: Num rows: 28798881 Data size: 5764329494 Ba= sic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (cr_item_sk is not null and cr_order_numbe= r is not null) (type: boolean) > Statistics: Num rows: 28798881 Data size: 569059536 B= asic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: cr_item_sk (type: int), cr_order_numbe= r (type: int), cr_refunded_cash (type: float), cr_reversed_charge (type: fl= oat), cr_store_credit (type: float) > outputColumnNames: _col0, _col1, _col2, _col3, _col= 4 > Statistics: Num rows: 28798881 Data size: 569059536= Basic stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int), _col1 (type: = int) > sort order: ++ > Map-reduce partition columns: _col0 (type: int), = _col1 (type: int) > Statistics: Num rows: 28798881 Data size: 5690595= 36 Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col2 (type: float), _col3 (ty= pe: float), _col4 (type: float) > Execution mode: vectorized > Map 23=20 > Map Operator Tree: > TableScan > alias: hd1 > filterExpr: (hd_income_band_sk is not null and hd_demo_= sk is not null) (type: boolean) > Statistics: Num rows: 7200 Data size: 770400 Basic stat= s: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (hd_income_band_sk is not null and hd_demo= _sk is not null) (type: boolean) > Statistics: Num rows: 7200 Data size: 57600 Basic sta= ts: COMPLETE Column stats: COMPLETE > Select Operator > expressions: hd_demo_sk (type: int), hd_income_band= _sk (type: int) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 7200 Data size: 57600 Basic s= tats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col1 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col0 > input vertices: > 1 Map 24 > Statistics: Num rows: 7200 Data size: 28800 Basic= stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 7200 Data size: 28800 Bas= ic stats: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 24=20 > Map Operator Tree: > TableScan > alias: ib1 > filterExpr: ib_income_band_sk is not null (type: boolea= n) > Statistics: Num rows: 20 Data size: 240 Basic stats: CO= MPLETE Column stats: COMPLETE > Filter Operator > predicate: ib_income_band_sk is not null (type: boole= an) > Statistics: Num rows: 20 Data size: 80 Basic stats: C= OMPLETE Column stats: COMPLETE > Select Operator > expressions: ib_income_band_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 20 Data size: 80 Basic stats:= COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 20 Data size: 80 Basic stat= s: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 25=20 > Map Operator Tree: > TableScan > alias: ad1 > filterExpr: ca_address_sk is not null (type: boolean) > Statistics: Num rows: 800000 Data size: 811903688 Basic= stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ca_address_sk is not null (type: boolean) > Statistics: Num rows: 800000 Data size: 292000000 Bas= ic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ca_address_sk (type: int), ca_street_n= umber (type: string), ca_street_name (type: string), ca_city (type: string)= , ca_zip (type: string) > outputColumnNames: _col0, _col1, _col2, _col3, _col= 4 > Statistics: Num rows: 800000 Data size: 292000000 B= asic stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 800000 Data size: 292000000= Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string), _col2 (t= ype: string), _col3 (type: string), _col4 (type: string) > Execution mode: vectorized > Map 26=20 > Map Operator Tree: > TableScan > alias: ad1 > filterExpr: ca_address_sk is not null (type: boolean) > Statistics: Num rows: 800000 Data size: 811903688 Basic= stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ca_address_sk is not null (type: boolean) > Statistics: Num rows: 800000 Data size: 292000000 Bas= ic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ca_address_sk (type: int), ca_street_n= umber (type: string), ca_street_name (type: string), ca_city (type: string)= , ca_zip (type: string) > outputColumnNames: _col0, _col1, _col2, _col3, _col= 4 > Statistics: Num rows: 800000 Data size: 292000000 B= asic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col9 (type: int) > outputColumnNames: _col1, _col2, _col3, _col4, _c= ol13, _col18, _col19, _col20, _col26, _col30, _col32, _col34, _col36, _col3= 7, _col42, _col45 > input vertices: > 1 Map 40 > Statistics: Num rows: 880000 Data size: 321200006= Basic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col1 (type: string), _col13 (type= : int), _col18 (type: float), _col19 (type: float), _col2 (type: string), _= col20 (type: float), _col26 (type: int), _col3 (type: string), _col30 (type= : int), _col32 (type: int), _col34 (type: int), _col36 (type: string), _col= 37 (type: string), _col4 (type: string), _col42 (type: int), _col45 (type: = string) > outputColumnNames: _col1, _col13, _col18, _col1= 9, _col2, _col20, _col26, _col3, _col30, _col32, _col34, _col36, _col37, _c= ol4, _col42, _col45 > Statistics: Num rows: 880000 Data size: 3212000= 06 Basic stats: COMPLETE Column stats: NONE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col26 (type: int) > outputColumnNames: _col1, _col2, _col3, _col4= , _col6, _col7, _col8, _col9, _col18, _col23, _col24, _col25, _col35, _col3= 7, _col39, _col41, _col42, _col47, _col50 > input vertices: > 0 Map 25 > Statistics: Num rows: 968000 Data size: 35332= 0014 Basic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col1 (type: string), _col18 (= type: int), _col2 (type: string), _col23 (type: float), _col24 (type: float= ), _col25 (type: float), _col3 (type: string), _col35 (type: int), _col37 (= type: int), _col39 (type: int), _col4 (type: string), _col41 (type: string)= , _col42 (type: string), _col47 (type: int), _col50 (type: string), _col6 (= type: string), _col7 (type: string), _col8 (type: string), _col9 (type: str= ing) > outputColumnNames: _col1, _col18, _col2, _c= ol23, _col24, _col25, _col3, _col35, _col37, _col39, _col4, _col41, _col42,= _col47, _col50, _col6, _col7, _col8, _col9 > Statistics: Num rows: 968000 Data size: 353= 320014 Basic stats: COMPLETE Column stats: NONE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col18 (type: int) > outputColumnNames: _col4, _col5, _col6, _= col7, _col9, _col10, _col11, _col12, _col26, _col27, _col28, _col38, _col40= , _col42, _col44, _col45, _col50, _col53 > input vertices: > 0 Map 23 > Statistics: Num rows: 1064800 Data size: = 388652023 Basic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col53 (type: string), _co= l50 (type: int), _col44 (type: string), _col45 (type: string), _col9 (type:= string), _col10 (type: string), _col11 (type: string), _col12 (type: strin= g), _col4 (type: string), _col5 (type: string), _col6 (type: string), _col7= (type: string), _col38 (type: int), _col40 (type: int), _col42 (type: int)= , _col26 (type: float), _col27 (type: float), _col28 (type: float) > outputColumnNames: _col0, _col1, _col2,= _col3, _col4, _col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12, _= col13, _col14, _col15, _col16, _col17 > Statistics: Num rows: 1064800 Data size= : 388652023 Basic stats: COMPLETE Column stats: NONE > Group By Operator > aggregations: count(), sum(_col15), s= um(_col16), sum(_col17) > keys: _col0 (type: string), _col1 (ty= pe: int), _col2 (type: string), _col3 (type: string), _col4 (type: string),= _col5 (type: string), _col6 (type: string), _col7 (type: string), _col8 (t= ype: string), _col9 (type: string), _col10 (type: string), _col11 (type: st= ring), _col12 (type: int), _col13 (type: int), _col14 (type: int) > mode: hash > outputColumnNames: _col0, _col1, _col= 2, _col3, _col4, _col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12,= _col13, _col14, _col15, _col16, _col17, _col18 > Statistics: Num rows: 1064800 Data si= ze: 388652023 Basic stats: COMPLETE Column stats: NONE > Reduce Output Operator > key expressions: _col0 (type: strin= g), _col1 (type: int), _col2 (type: string), _col3 (type: string), _col4 (t= ype: string), _col5 (type: string), _col6 (type: string), _col7 (type: stri= ng), _col8 (type: string), _col9 (type: string), _col10 (type: string), _co= l11 (type: string), _col12 (type: int), _col13 (type: int), _col14 (type: i= nt) > sort order: +++++++++++++++ > Map-reduce partition columns: _col0= (type: string), _col1 (type: int), _col2 (type: string), _col3 (type: stri= ng), _col4 (type: string), _col5 (type: string), _col6 (type: string), _col= 7 (type: string), _col8 (type: string), _col9 (type: string), _col10 (type:= string), _col11 (type: string), _col12 (type: int), _col13 (type: int), _c= ol14 (type: int) > Statistics: Num rows: 1064800 Data = size: 388652023 Basic stats: COMPLETE Column stats: NONE > value expressions: _col15 (type: bi= gint), _col16 (type: double), _col17 (type: double), _col18 (type: double) > Execution mode: vectorized > Map 28=20 > Map Operator Tree: > TableScan > alias: hd1 > filterExpr: (hd_income_band_sk is not null and hd_demo_= sk is not null) (type: boolean) > Statistics: Num rows: 7200 Data size: 770400 Basic stat= s: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (hd_income_band_sk is not null and hd_demo= _sk is not null) (type: boolean) > Statistics: Num rows: 7200 Data size: 57600 Basic sta= ts: COMPLETE Column stats: COMPLETE > Select Operator > expressions: hd_demo_sk (type: int), hd_income_band= _sk (type: int) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 7200 Data size: 57600 Basic s= tats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col1 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col0 > input vertices: > 1 Map 29 > Statistics: Num rows: 7200 Data size: 28800 Basic= stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 7200 Data size: 28800 Bas= ic stats: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 29=20 > Map Operator Tree: > TableScan > alias: ib1 > filterExpr: ib_income_band_sk is not null (type: boolea= n) > Statistics: Num rows: 20 Data size: 240 Basic stats: CO= MPLETE Column stats: COMPLETE > Filter Operator > predicate: ib_income_band_sk is not null (type: boole= an) > Statistics: Num rows: 20 Data size: 80 Basic stats: C= OMPLETE Column stats: COMPLETE > Select Operator > expressions: ib_income_band_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 20 Data size: 80 Basic stats:= COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 20 Data size: 80 Basic stat= s: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 3=20 > Map Operator Tree: > TableScan > alias: ad1 > filterExpr: ca_address_sk is not null (type: boolean) > Statistics: Num rows: 800000 Data size: 811903688 Basic= stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ca_address_sk is not null (type: boolean) > Statistics: Num rows: 800000 Data size: 292000000 Bas= ic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ca_address_sk (type: int), ca_street_n= umber (type: string), ca_street_name (type: string), ca_city (type: string)= , ca_zip (type: string) > outputColumnNames: _col0, _col1, _col2, _col3, _col= 4 > Statistics: Num rows: 800000 Data size: 292000000 B= asic stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 800000 Data size: 292000000= Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string), _col2 (t= ype: string), _col3 (type: string), _col4 (type: string) > Execution mode: vectorized > Map 30=20 > Map Operator Tree: > TableScan > alias: promotion > filterExpr: p_promo_sk is not null (type: boolean) > Statistics: Num rows: 450 Data size: 530848 Basic stats= : COMPLETE Column stats: COMPLETE > Filter Operator > predicate: p_promo_sk is not null (type: boolean) > Statistics: Num rows: 450 Data size: 1800 Basic stats= : COMPLETE Column stats: COMPLETE > Select Operator > expressions: p_promo_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 450 Data size: 1800 Basic sta= ts: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 450 Data size: 1800 Basic s= tats: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 31=20 > Map Operator Tree: > TableScan > alias: cd1 > filterExpr: cd_demo_sk is not null (type: boolean) > Statistics: Num rows: 1920800 Data size: 718379200 Basi= c stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: cd_demo_sk is not null (type: boolean) > Statistics: Num rows: 1920800 Data size: 170951200 Ba= sic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: cd_demo_sk (type: int), cd_marital_sta= tus (type: string) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 1920800 Data size: 170951200 = Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col1 (type: int) > outputColumnNames: _col1, _col2, _col4, _col5, _c= ol6, _col7 > input vertices: > 1 Map 32 > Statistics: Num rows: 1387730 Data size: 14571165= 0 Basic stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col7 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col2, _col4, _col5, = _col6, _col9 > input vertices: > 1 Map 33 > Statistics: Num rows: 1387730 Data size: 145711= 650 Basic stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col6 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col2, _col4, _col5= , _col9, _col11 > input vertices: > 1 Map 34 > Statistics: Num rows: 1387730 Data size: 1457= 11650 Basic stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col2 (type: int) > 1 _col4 (type: int) > outputColumnNames: _col1, _col4, _col5, _co= l9, _col11, _col13, _col15, _col18, _col19, _col21, _col23, _col24, _col25,= _col29, _col31, _col32 > input vertices: > 1 Map 39 > Statistics: Num rows: 68 Data size: 26860 B= asic stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Filter Operator > predicate: (_col13 <> _col1) (type: boole= an) > Statistics: Num rows: 68 Data size: 26860= Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col15 (type: int), _col24= (type: float), _col25 (type: float), _col4 (type: int), _col5 (type: int),= _col29 (type: int), _col9 (type: int), _col11 (type: int), _col31 (type: s= tring), _col32 (type: string), _col18 (type: int), _col19 (type: int), _col= 21 (type: int), _col23 (type: float) > outputColumnNames: _col1, _col10, _col1= 1, _col16, _col17, _col21, _col23, _col25, _col27, _col28, _col4, _col5, _c= ol7, _col9 > Statistics: Num rows: 68 Data size: 153= 00 Basic stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col1 (type: int) > sort order: + > Map-reduce partition columns: _col1 (= type: int) > Statistics: Num rows: 68 Data size: 1= 5300 Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col4 (type: int),= _col5 (type: int), _col7 (type: int), _col9 (type: float), _col10 (type: f= loat), _col11 (type: float), _col16 (type: int), _col17 (type: int), _col21= (type: int), _col23 (type: int), _col25 (type: int), _col27 (type: string)= , _col28 (type: string) > Execution mode: vectorized > Map 32=20 > Map Operator Tree: > TableScan > alias: customer > filterExpr: (((((c_current_cdemo_sk is not null and c_f= irst_sales_date_sk is not null) and c_first_shipto_date_sk is not null) and= c_customer_sk is not null) and c_current_hdemo_sk is not null) and c_curre= nt_addr_sk is not null) (type: boolean) > Statistics: Num rows: 1600000 Data size: 1241633212 Bas= ic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (((((c_current_cdemo_sk is not null and c_= first_sales_date_sk is not null) and c_first_shipto_date_sk is not null) an= d c_customer_sk is not null) and c_current_hdemo_sk is not null) and c_curr= ent_addr_sk is not null) (type: boolean) > Statistics: Num rows: 1387730 Data size: 32529324 Bas= ic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: c_customer_sk (type: int), c_current_c= demo_sk (type: int), c_current_hdemo_sk (type: int), c_current_addr_sk (typ= e: int), c_first_shipto_date_sk (type: int), c_first_sales_date_sk (type: i= nt) > outputColumnNames: _col0, _col1, _col2, _col3, _col= 4, _col5 > Statistics: Num rows: 1387730 Data size: 32529324 B= asic stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col1 (type: int) > sort order: + > Map-reduce partition columns: _col1 (type: int) > Statistics: Num rows: 1387730 Data size: 32529324= Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col0 (type: int), _col2 (type= : int), _col3 (type: int), _col4 (type: int), _col5 (type: int) > Execution mode: vectorized > Map 33=20 > Map Operator Tree: > TableScan > alias: d1 > filterExpr: d_date_sk is not null (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic s= tats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: d_date_sk is not null (type: boolean) > Statistics: Num rows: 73049 Data size: 584392 Basic s= tats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int), d_year (type: i= nt) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 73049 Data size: 584392 Basic= stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 73049 Data size: 584392 Bas= ic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: int) > Execution mode: vectorized > Map 34=20 > Map Operator Tree: > TableScan > alias: d1 > filterExpr: d_date_sk is not null (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic s= tats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: d_date_sk is not null (type: boolean) > Statistics: Num rows: 73049 Data size: 584392 Basic s= tats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int), d_year (type: i= nt) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 73049 Data size: 584392 Basic= stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 73049 Data size: 584392 Bas= ic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: int) > Execution mode: vectorized > Map 35=20 > Map Operator Tree: > TableScan > alias: cd1 > filterExpr: cd_demo_sk is not null (type: boolean) > Statistics: Num rows: 1920800 Data size: 718379200 Basi= c stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: cd_demo_sk is not null (type: boolean) > Statistics: Num rows: 1920800 Data size: 170951200 Ba= sic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: cd_demo_sk (type: int), cd_marital_sta= tus (type: string) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 1920800 Data size: 170951200 = Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col3 (type: int) > outputColumnNames: _col1, _col3, _col4, _col6, _c= ol7, _col8, _col9, _col11, _col12, _col13, _col17 > input vertices: > 1 Map 36 > Statistics: Num rows: 78 Data size: 9750 Basic st= ats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Reduce Output Operator > key expressions: _col8 (type: int) > sort order: + > Map-reduce partition columns: _col8 (type: int) > Statistics: Num rows: 78 Data size: 9750 Basic = stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string), _col3 = (type: int), _col4 (type: int), _col6 (type: int), _col7 (type: int), _col9= (type: int), _col11 (type: float), _col12 (type: float), _col13 (type: flo= at), _col17 (type: int) > Execution mode: vectorized > Map 36=20 > Map Operator Tree: > TableScan > alias: store_sales > filterExpr: ((((((((ss_item_sk is not null and ss_ticke= t_number is not null) and ss_sold_date_sk is not null) and ss_cdemo_sk is n= ot null) and ss_store_sk is not null) and ss_customer_sk is not null) and s= s_promo_sk is not null) and ss_addr_sk is not null) and ss_hdemo_sk is not = null) (type: boolean) > Statistics: Num rows: 550076554 Data size: 101277521188= Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ((((((((ss_item_sk is not null and ss_tick= et_number is not null) and ss_sold_date_sk is not null) and ss_cdemo_sk is = not null) and ss_store_sk is not null) and ss_customer_sk is not null) and = ss_promo_sk is not null) and ss_addr_sk is not null) and ss_hdemo_sk is not= null) (type: boolean) > Statistics: Num rows: 476766968 Data size: 2248047657= 2 Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ss_sold_date_sk (type: int), ss_item_s= k (type: int), ss_customer_sk (type: int), ss_cdemo_sk (type: int), ss_hdem= o_sk (type: int), ss_addr_sk (type: int), ss_store_sk (type: int), ss_promo= _sk (type: int), ss_ticket_number (type: int), ss_wholesale_cost (type: flo= at), ss_list_price (type: float), ss_coupon_amt (type: float) > outputColumnNames: _col0, _col1, _col2, _col3, _col= 4, _col5, _col6, _col7, _col8, _col9, _col10, _col11 > Statistics: Num rows: 476766968 Data size: 22480476= 572 Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col1 (type: int), _col8 (type: int) > 1 _col0 (type: int), _col1 (type: int) > outputColumnNames: _col0, _col1, _col2, _col3, _c= ol4, _col5, _col6, _col7, _col9, _col10, _col11 > input vertices: > 1 Map 37 > Statistics: Num rows: 8686 Data size: 382184 Basi= c stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col2, _col3, _col4, = _col5, _col6, _col7, _col9, _col10, _col11 > input vertices: > 1 Map 38 > Statistics: Num rows: 78 Data size: 3120 Basic = stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Select Operator > expressions: _col1 (type: int), _col10 (type:= float), _col11 (type: float), 2001 (type: int), _col2 (type: int), _col3 (= type: int), _col4 (type: int), _col5 (type: int), _col6 (type: int), _col7 = (type: int), _col9 (type: float) > outputColumnNames: _col1, _col10, _col11, _co= l15, _col2, _col3, _col4, _col5, _col6, _col7, _col9 > Statistics: Num rows: 78 Data size: 3432 Basi= c stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col3 (type: int) > sort order: + > Map-reduce partition columns: _col3 (type: = int) > Statistics: Num rows: 78 Data size: 3432 Ba= sic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: int), _col2= (type: int), _col4 (type: int), _col5 (type: int), _col6 (type: int), _col= 7 (type: int), _col9 (type: float), _col10 (type: float), _col11 (type: flo= at), _col15 (type: int) > Execution mode: vectorized > Map 37=20 > Map Operator Tree: > TableScan > alias: store_returns > filterExpr: (sr_item_sk is not null and sr_ticket_numbe= r is not null) (type: boolean) > Statistics: Num rows: 55578005 Data size: 9601960106 Ba= sic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (sr_item_sk is not null and sr_ticket_numb= er is not null) (type: boolean) > Statistics: Num rows: 55578005 Data size: 444624040 B= asic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: sr_item_sk (type: int), sr_ticket_numb= er (type: int) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 55578005 Data size: 444624040= Basic stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int), _col1 (type: = int) > sort order: ++ > Map-reduce partition columns: _col0 (type: int), = _col1 (type: int) > Statistics: Num rows: 55578005 Data size: 4446240= 40 Basic stats: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 38=20 > Map Operator Tree: > TableScan > alias: d1 > filterExpr: ((d_year =3D 2001) and d_date_sk is not nul= l) (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic s= tats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ((d_year =3D 2001) and d_date_sk is not nu= ll) (type: boolean) > Statistics: Num rows: 652 Data size: 5216 Basic stats= : COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 652 Data size: 2608 Basic sta= ts: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 652 Data size: 2608 Basic s= tats: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 39=20 > Map Operator Tree: > TableScan > alias: store > filterExpr: ((s_store_sk is not null and s_store_name i= s not null) and s_zip is not null) (type: boolean) > Statistics: Num rows: 212 Data size: 405680 Basic stats= : COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ((s_store_sk is not null and s_store_name = is not null) and s_zip is not null) (type: boolean) > Statistics: Num rows: 212 Data size: 38372 Basic stat= s: COMPLETE Column stats: COMPLETE > Select Operator > expressions: s_store_sk (type: int), s_store_name (= type: string), s_zip (type: string) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 212 Data size: 38372 Basic st= ats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col8 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col3, _col4, _col6, _c= ol7, _col9, _col11, _col12, _col13, _col17, _col19, _col20 > input vertices: > 0 Map 35 > Statistics: Num rows: 78 Data size: 23244 Basic s= tats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Select Operator > expressions: _col1 (type: string), _col11 (type= : float), _col12 (type: float), _col13 (type: float), _col17 (type: int), _= col19 (type: string), _col20 (type: string), _col3 (type: int), _col4 (type= : int), _col6 (type: int), _col7 (type: int), _col9 (type: int) > outputColumnNames: _col1, _col11, _col12, _col1= 3, _col17, _col19, _col20, _col3, _col4, _col6, _col7, _col9 > Statistics: Num rows: 78 Data size: 23244 Basic= stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col4 (type: int) > sort order: + > Map-reduce partition columns: _col4 (type: in= t) > Statistics: Num rows: 78 Data size: 23244 Bas= ic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string), _col= 3 (type: int), _col6 (type: int), _col7 (type: int), _col9 (type: int), _co= l11 (type: float), _col12 (type: float), _col13 (type: float), _col17 (type= : int), _col19 (type: string), _col20 (type: string) > Execution mode: vectorized > Map 4=20 > Map Operator Tree: > TableScan > alias: ad1 > filterExpr: ca_address_sk is not null (type: boolean) > Statistics: Num rows: 800000 Data size: 811903688 Basic= stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ca_address_sk is not null (type: boolean) > Statistics: Num rows: 800000 Data size: 292000000 Bas= ic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ca_address_sk (type: int), ca_street_n= umber (type: string), ca_street_name (type: string), ca_city (type: string)= , ca_zip (type: string) > outputColumnNames: _col0, _col1, _col2, _col3, _col= 4 > Statistics: Num rows: 800000 Data size: 292000000 B= asic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col9 (type: int) > outputColumnNames: _col1, _col2, _col3, _col4, _c= ol13, _col18, _col19, _col20, _col26, _col30, _col32, _col34, _col36, _col3= 7, _col42, _col45 > input vertices: > 1 Map 19 > Statistics: Num rows: 880000 Data size: 321200006= Basic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col1 (type: string), _col13 (type= : int), _col18 (type: float), _col19 (type: float), _col2 (type: string), _= col20 (type: float), _col26 (type: int), _col3 (type: string), _col30 (type= : int), _col32 (type: int), _col34 (type: int), _col36 (type: string), _col= 37 (type: string), _col4 (type: string), _col42 (type: int), _col45 (type: = string) > outputColumnNames: _col1, _col13, _col18, _col1= 9, _col2, _col20, _col26, _col3, _col30, _col32, _col34, _col36, _col37, _c= ol4, _col42, _col45 > Statistics: Num rows: 880000 Data size: 3212000= 06 Basic stats: COMPLETE Column stats: NONE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col26 (type: int) > outputColumnNames: _col1, _col2, _col3, _col4= , _col6, _col7, _col8, _col9, _col18, _col23, _col24, _col25, _col35, _col3= 7, _col39, _col41, _col42, _col47, _col50 > input vertices: > 0 Map 3 > Statistics: Num rows: 968000 Data size: 35332= 0014 Basic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col1 (type: string), _col18 (= type: int), _col2 (type: string), _col23 (type: float), _col24 (type: float= ), _col25 (type: float), _col3 (type: string), _col35 (type: int), _col37 (= type: int), _col39 (type: int), _col4 (type: string), _col41 (type: string)= , _col42 (type: string), _col47 (type: int), _col50 (type: string), _col6 (= type: string), _col7 (type: string), _col8 (type: string), _col9 (type: str= ing) > outputColumnNames: _col1, _col18, _col2, _c= ol23, _col24, _col25, _col3, _col35, _col37, _col39, _col4, _col41, _col42,= _col47, _col50, _col6, _col7, _col8, _col9 > Statistics: Num rows: 968000 Data size: 353= 320014 Basic stats: COMPLETE Column stats: NONE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col18 (type: int) > outputColumnNames: _col4, _col5, _col6, _= col7, _col9, _col10, _col11, _col12, _col26, _col27, _col28, _col38, _col40= , _col42, _col44, _col45, _col50, _col53 > input vertices: > 0 Map 1 > Statistics: Num rows: 1064800 Data size: = 388652023 Basic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col53 (type: string), _co= l50 (type: int), _col44 (type: string), _col45 (type: string), _col9 (type:= string), _col10 (type: string), _col11 (type: string), _col12 (type: strin= g), _col4 (type: string), _col5 (type: string), _col6 (type: string), _col7= (type: string), _col38 (type: int), _col40 (type: int), _col42 (type: int)= , _col26 (type: float), _col27 (type: float), _col28 (type: float) > outputColumnNames: _col0, _col1, _col2,= _col3, _col4, _col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12, _= col13, _col14, _col15, _col16, _col17 > Statistics: Num rows: 1064800 Data size= : 388652023 Basic stats: COMPLETE Column stats: NONE > Group By Operator > aggregations: count(), sum(_col15), s= um(_col16), sum(_col17) > keys: _col0 (type: string), _col1 (ty= pe: int), _col2 (type: string), _col3 (type: string), _col4 (type: string),= _col5 (type: string), _col6 (type: string), _col7 (type: string), _col8 (t= ype: string), _col9 (type: string), _col10 (type: string), _col11 (type: st= ring), _col12 (type: int), _col13 (type: int), _col14 (type: int) > mode: hash > outputColumnNames: _col0, _col1, _col= 2, _col3, _col4, _col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12,= _col13, _col14, _col15, _col16, _col17, _col18 > Statistics: Num rows: 1064800 Data si= ze: 388652023 Basic stats: COMPLETE Column stats: NONE > Reduce Output Operator > key expressions: _col0 (type: strin= g), _col1 (type: int), _col2 (type: string), _col3 (type: string), _col4 (t= ype: string), _col5 (type: string), _col6 (type: string), _col7 (type: stri= ng), _col8 (type: string), _col9 (type: string), _col10 (type: string), _co= l11 (type: string), _col12 (type: int), _col13 (type: int), _col14 (type: i= nt) > sort order: +++++++++++++++ > Map-reduce partition columns: _col0= (type: string), _col1 (type: int), _col2 (type: string), _col3 (type: stri= ng), _col4 (type: string), _col5 (type: string), _col6 (type: string), _col= 7 (type: string), _col8 (type: string), _col9 (type: string), _col10 (type:= string), _col11 (type: string), _col12 (type: int), _col13 (type: int), _c= ol14 (type: int) > Statistics: Num rows: 1064800 Data = size: 388652023 Basic stats: COMPLETE Column stats: NONE > value expressions: _col15 (type: bi= gint), _col16 (type: double), _col17 (type: double), _col18 (type: double) > Execution mode: vectorized > Map 40=20 > Map Operator Tree: > TableScan > alias: item > filterExpr: ((((i_color) IN ('maroon', 'burnished', 'di= m', 'steel', 'navajo', 'chocolate') and i_current_price BETWEEN 35 AND 45) = and i_current_price BETWEEN 36 AND 50) and i_item_sk is not null) (type: bo= olean) > Statistics: Num rows: 48000 Data size: 68732712 Basic s= tats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ((((i_color) IN ('maroon', 'burnished', 'd= im', 'steel', 'navajo', 'chocolate') and i_current_price BETWEEN 35 AND 45)= and i_current_price BETWEEN 36 AND 50) and i_item_sk is not null) (type: b= oolean) > Statistics: Num rows: 6000 Data size: 1199936 Basic s= tats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: i_item_sk (type: int), i_product_name = (type: string) > outputColumnNames: _col0, _col3 > Statistics: Num rows: 6000 Data size: 642000 Basic = stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > Inner Join 0 to 2 > keys: > 0 _col1 (type: int) > 1 _col0 (type: int) > 2 _col0 (type: int) > outputColumnNames: _col4, _col5, _col7, _col9, _c= ol10, _col11, _col16, _col17, _col21, _col23, _col25, _col27, _col28, _col3= 3, _col36 > input vertices: > 0 Map 31 > 2 Reducer 42 > Statistics: Num rows: 13200 Data size: 1412400 Ba= sic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col10 (type: float), _col11 (type= : float), _col16 (type: int), _col17 (type: int), _col21 (type: int), _col2= 3 (type: int), _col25 (type: int), _col27 (type: string), _col28 (type: str= ing), _col33 (type: int), _col36 (type: string), _col4 (type: int), _col5 (= type: int), _col7 (type: int), _col9 (type: float) > outputColumnNames: _col10, _col11, _col16, _col= 17, _col21, _col23, _col25, _col27, _col28, _col33, _col36, _col4, _col5, _= col7, _col9 > Statistics: Num rows: 13200 Data size: 1412400 = Basic stats: COMPLETE Column stats: NONE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col7 (type: int) > outputColumnNames: _col5, _col6, _col10, _col= 11, _col12, _col17, _col18, _col22, _col24, _col26, _col28, _col29, _col34,= _col37 > input vertices: > 0 Map 30 > Statistics: Num rows: 14520 Data size: 155364= 0 Basic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col10 (type: float), _col11 (= type: float), _col12 (type: float), _col17 (type: int), _col18 (type: int),= _col22 (type: int), _col24 (type: int), _col26 (type: int), _col28 (type: = string), _col29 (type: string), _col34 (type: int), _col37 (type: string), = _col5 (type: int), _col6 (type: int) > outputColumnNames: _col10, _col11, _col12, = _col17, _col18, _col22, _col24, _col26, _col28, _col29, _col34, _col37, _co= l5, _col6 > Statistics: Num rows: 14520 Data size: 1553= 640 Basic stats: COMPLETE Column stats: NONE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int) > 1 _col17 (type: int) > outputColumnNames: _col8, _col9, _col13, = _col14, _col15, _col21, _col25, _col27, _col29, _col31, _col32, _col37, _co= l40 > input vertices: > 0 Map 28 > Statistics: Num rows: 15972 Data size: 17= 09004 Basic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Select Operator > expressions: _col13 (type: float), _col= 14 (type: float), _col15 (type: float), _col21 (type: int), _col25 (type: i= nt), _col27 (type: int), _col29 (type: int), _col31 (type: string), _col32 = (type: string), _col37 (type: int), _col40 (type: string), _col8 (type: int= ), _col9 (type: int) > outputColumnNames: _col13, _col14, _col= 15, _col21, _col25, _col27, _col29, _col31, _col32, _col37, _col40, _col8, = _col9 > Statistics: Num rows: 15972 Data size: = 1709004 Basic stats: COMPLETE Column stats: NONE > Reduce Output Operator > key expressions: _col9 (type: int) > sort order: + > Map-reduce partition columns: _col9 (= type: int) > Statistics: Num rows: 15972 Data size= : 1709004 Basic stats: COMPLETE Column stats: NONE > value expressions: _col8 (type: int),= _col13 (type: float), _col14 (type: float), _col15 (type: float), _col21 (= type: int), _col25 (type: int), _col27 (type: int), _col29 (type: int), _co= l31 (type: string), _col32 (type: string), _col37 (type: int), _col40 (type= : string) > Execution mode: vectorized > Map 41=20 > Map Operator Tree: > TableScan > alias: catalog_sales > filterExpr: (cs_item_sk is not null and cs_order_number= is not null) (type: boolean) > Statistics: Num rows: 286549727 Data size: 65825832570 = Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (cs_item_sk is not null and cs_order_numbe= r is not null) (type: boolean) > Statistics: Num rows: 286549727 Data size: 3435718732= Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: cs_item_sk (type: int), cs_order_numbe= r (type: int), cs_ext_list_price (type: float) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 286549727 Data size: 34357187= 32 Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col0 (type: int), _col1 (type: int) > 1 _col0 (type: int), _col1 (type: int) > outputColumnNames: _col0, _col2, _col5, _col6, _c= ol7 > input vertices: > 1 Map 43 > Statistics: Num rows: 2911 Data size: 58220 Basic= stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Select Operator > expressions: _col0 (type: int), _col2 (type: fl= oat), ((_col5 + _col6) + _col7) (type: float) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 2911 Data size: 58220 Bas= ic stats: COMPLETE Column stats: COMPLETE > Group By Operator > aggregations: sum(_col1), sum(_col2) > keys: _col0 (type: int) > mode: hash > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 1 Data size: 20 Basic s= tats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: = int) > Statistics: Num rows: 1 Data size: 20 Basic= stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: double), _c= ol2 (type: double) > Execution mode: vectorized > Map 43=20 > Map Operator Tree: > TableScan > alias: catalog_returns > filterExpr: (cr_item_sk is not null and cr_order_number= is not null) (type: boolean) > Statistics: Num rows: 28798881 Data size: 5764329494 Ba= sic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (cr_item_sk is not null and cr_order_numbe= r is not null) (type: boolean) > Statistics: Num rows: 28798881 Data size: 569059536 B= asic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: cr_item_sk (type: int), cr_order_numbe= r (type: int), cr_refunded_cash (type: float), cr_reversed_charge (type: fl= oat), cr_store_credit (type: float) > outputColumnNames: _col0, _col1, _col2, _col3, _col= 4 > Statistics: Num rows: 28798881 Data size: 569059536= Basic stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int), _col1 (type: = int) > sort order: ++ > Map-reduce partition columns: _col0 (type: int), = _col1 (type: int) > Statistics: Num rows: 28798881 Data size: 5690595= 36 Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col2 (type: float), _col3 (ty= pe: float), _col4 (type: float) > Execution mode: vectorized > Map 7=20 > Map Operator Tree: > TableScan > alias: hd1 > filterExpr: (hd_income_band_sk is not null and hd_demo_= sk is not null) (type: boolean) > Statistics: Num rows: 7200 Data size: 770400 Basic stat= s: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (hd_income_band_sk is not null and hd_demo= _sk is not null) (type: boolean) > Statistics: Num rows: 7200 Data size: 57600 Basic sta= ts: COMPLETE Column stats: COMPLETE > Select Operator > expressions: hd_demo_sk (type: int), hd_income_band= _sk (type: int) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 7200 Data size: 57600 Basic s= tats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col1 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col0 > input vertices: > 1 Map 8 > Statistics: Num rows: 7200 Data size: 28800 Basic= stats: COMPLETE Column stats: COMPLETE > HybridGraceHashJoin: true > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 7200 Data size: 28800 Bas= ic stats: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 8=20 > Map Operator Tree: > TableScan > alias: ib1 > filterExpr: ib_income_band_sk is not null (type: boolea= n) > Statistics: Num rows: 20 Data size: 240 Basic stats: CO= MPLETE Column stats: COMPLETE > Filter Operator > predicate: ib_income_band_sk is not null (type: boole= an) > Statistics: Num rows: 20 Data size: 80 Basic stats: C= OMPLETE Column stats: COMPLETE > Select Operator > expressions: ib_income_band_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 20 Data size: 80 Basic stats:= COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 20 Data size: 80 Basic stat= s: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Map 9=20 > Map Operator Tree: > TableScan > alias: promotion > filterExpr: p_promo_sk is not null (type: boolean) > Statistics: Num rows: 450 Data size: 530848 Basic stats= : COMPLETE Column stats: COMPLETE > Filter Operator > predicate: p_promo_sk is not null (type: boolean) > Statistics: Num rows: 450 Data size: 1800 Basic stats= : COMPLETE Column stats: COMPLETE > Select Operator > expressions: p_promo_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 450 Data size: 1800 Basic sta= ts: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 450 Data size: 1800 Basic s= tats: COMPLETE Column stats: COMPLETE > Execution mode: vectorized > Reducer 21=20 > Reduce Operator Tree: > Group By Operator > aggregations: sum(VALUE._col0), sum(VALUE._col1) > keys: KEY._col0 (type: int) > mode: mergepartial > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 1 Data size: 20 Basic stats: COMPLE= TE Column stats: COMPLETE > Filter Operator > predicate: (_col1 > (2.0 * _col2)) (type: boolean) > Statistics: Num rows: 0 Data size: 0 Basic stats: NONE = Column stats: COMPLETE > Select Operator > expressions: _col0 (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 0 Data size: 0 Basic stats: NON= E Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 0 Data size: 0 Basic stats: N= ONE Column stats: COMPLETE > Reducer 27=20 > Reduce Operator Tree: > Group By Operator > aggregations: count(VALUE._col0), sum(VALUE._col1), sum(V= ALUE._col2), sum(VALUE._col3) > keys: KEY._col0 (type: string), KEY._col1 (type: int), KE= Y._col2 (type: string), KEY._col3 (type: string), KEY._col4 (type: string),= KEY._col5 (type: string), KEY._col6 (type: string), KEY._col7 (type: strin= g), KEY._col8 (type: string), KEY._col9 (type: string), KEY._col10 (type: s= tring), KEY._col11 (type: string), KEY._col12 (type: int), KEY._col13 (type= : int), KEY._col14 (type: int) > mode: mergepartial > outputColumnNames: _col0, _col1, _col2, _col3, _col4, _co= l5, _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _col14, _co= l15, _col16, _col17, _col18 > Statistics: Num rows: 532400 Data size: 194326011 Basic s= tats: COMPLETE Column stats: NONE > Select Operator > expressions: _col1 (type: int), _col2 (type: string), _= col3 (type: string), _col12 (type: int), _col15 (type: bigint), _col16 (typ= e: double), _col17 (type: double), _col18 (type: double) > outputColumnNames: _col0, _col1, _col2, _col3, _col4, _= col5, _col6, _col7 > Statistics: Num rows: 532400 Data size: 194326011 Basic= stats: COMPLETE Column stats: NONE > Reduce Output Operator > key expressions: _col0 (type: int), _col1 (type: stri= ng), _col2 (type: string) > sort order: +++ > Map-reduce partition columns: _col0 (type: int), _col= 1 (type: string), _col2 (type: string) > Statistics: Num rows: 532400 Data size: 194326011 Bas= ic stats: COMPLETE Column stats: NONE > value expressions: _col3 (type: int), _col4 (type: bi= gint), _col5 (type: double), _col6 (type: double), _col7 (type: double) > Reducer 42=20 > Reduce Operator Tree: > Group By Operator > aggregations: sum(VALUE._col0), sum(VALUE._col1) > keys: KEY._col0 (type: int) > mode: mergepartial > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 1 Data size: 20 Basic stats: COMPLE= TE Column stats: COMPLETE > Filter Operator > predicate: (_col1 > (2.0 * _col2)) (type: boolean) > Statistics: Num rows: 0 Data size: 0 Basic stats: NONE = Column stats: COMPLETE > Select Operator > expressions: _col0 (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 0 Data size: 0 Basic stats: NON= E Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 0 Data size: 0 Basic stats: N= ONE Column stats: COMPLETE > Reducer 5=20 > Reduce Operator Tree: > Group By Operator > aggregations: count(VALUE._col0), sum(VALUE._col1), sum(V= ALUE._col2), sum(VALUE._col3) > keys: KEY._col0 (type: string), KEY._col1 (type: int), KE= Y._col2 (type: string), KEY._col3 (type: string), KEY._col4 (type: string),= KEY._col5 (type: string), KEY._col6 (type: string), KEY._col7 (type: strin= g), KEY._col8 (type: string), KEY._col9 (type: string), KEY._col10 (type: s= tring), KEY._col11 (type: string), KEY._col12 (type: int), KEY._col13 (type= : int), KEY._col14 (type: int) > mode: mergepartial > outputColumnNames: _col0, _col1, _col2, _col3, _col4, _co= l5, _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _col14, _co= l15, _col16, _col17, _col18 > Statistics: Num rows: 532400 Data size: 194326011 Basic s= tats: COMPLETE Column stats: NONE > Select Operator > expressions: _col0 (type: string), _col1 (type: int), _= col2 (type: string), _col3 (type: string), _col4 (type: string), _col5 (typ= e: string), _col6 (type: string), _col7 (type: string), _col8 (type: string= ), _col9 (type: string), _col10 (type: string), _col11 (type: string), _col= 12 (type: int), _col15 (type: bigint), _col16 (type: double), _col17 (type:= double), _col18 (type: double) > outputColumnNames: _col0, _col1, _col2, _col3, _col4, _= col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _col14, _= col15, _col16 > Statistics: Num rows: 532400 Data size: 194326011 Basic= stats: COMPLETE Column stats: NONE > Map Join Operator > condition map: > Inner Join 0 to 1 > keys: > 0 _col1 (type: int), _col2 (type: string), _col3 (t= ype: string) > 1 _col0 (type: int), _col1 (type: string), _col2 (t= ype: string) > outputColumnNames: _col0, _col2, _col3, _col4, _col5,= _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _col14, _col15= , _col16, _col20, _col21, _col22, _col23, _col24 > input vertices: > 1 Reducer 27 > Statistics: Num rows: 585640 Data size: 213758616 Bas= ic stats: COMPLETE Column stats: NONE > HybridGraceHashJoin: true > Filter Operator > predicate: (_col21 <=3D _col13) (type: boolean) > Statistics: Num rows: 195213 Data size: 71252750 Ba= sic stats: COMPLETE Column stats: NONE > Select Operator > expressions: _col0 (type: string), _col2 (type: s= tring), _col3 (type: string), _col4 (type: string), _col5 (type: string), _= col6 (type: string), _col7 (type: string), _col8 (type: string), _col9 (typ= e: string), _col10 (type: string), _col11 (type: string), _col12 (type: int= ), _col13 (type: bigint), _col14 (type: double), _col15 (type: double), _co= l16 (type: double), _col22 (type: double), _col23 (type: double), _col24 (t= ype: double), _col20 (type: int), _col21 (type: bigint) > outputColumnNames: _col0, _col1, _col2, _col3, _c= ol4, _col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _co= l14, _col15, _col16, _col17, _col18, _col19, _col20 > Statistics: Num rows: 195213 Data size: 71252750 = Basic stats: COMPLETE Column stats: NONE > Reduce Output Operator > key expressions: _col0 (type: string), _col1 (t= ype: string), _col20 (type: bigint) > sort order: +++ > Statistics: Num rows: 195213 Data size: 7125275= 0 Basic stats: COMPLETE Column stats: NONE > value expressions: _col2 (type: string), _col3 = (type: string), _col4 (type: string), _col5 (type: string), _col6 (type: st= ring), _col7 (type: string), _col8 (type: string), _col9 (type: string), _c= ol10 (type: string), _col11 (type: int), _col12 (type: bigint), _col13 (typ= e: double), _col14 (type: double), _col15 (type: double), _col16 (type: dou= ble), _col17 (type: double), _col18 (type: double), _col19 (type: int) > Reducer 6=20 > Reduce Operator Tree: > Select Operator > expressions: KEY.reducesinkkey0 (type: string), KEY.reduc= esinkkey1 (type: string), VALUE._col0 (type: string), VALUE._col1 (type: st= ring), VALUE._col2 (type: string), VALUE._col3 (type: string), VALUE._col4 = (type: string), VALUE._col5 (type: string), VALUE._col6 (type: string), VAL= UE._col7 (type: string), VALUE._col8 (type: string), VALUE._col9 (type: int= ), VALUE._col10 (type: bigint), VALUE._col11 (type: double), VALUE._col12 (= type: double), VALUE._col13 (type: double), VALUE._col14 (type: double), VA= LUE._col15 (type: double), VALUE._col16 (type: double), VALUE._col17 (type:= int), KEY.reducesinkkey2 (type: bigint) > outputColumnNames: _col0, _col1, _col2, _col3, _col4, _co= l5, _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _col14, _co= l15, _col16, _col17, _col18, _col19, _col20 > Statistics: Num rows: 195213 Data size: 71252750 Basic st= ats: COMPLETE Column stats: NONE > File Output Operator > compressed: false > Statistics: Num rows: 195213 Data size: 71252750 Basic = stats: COMPLETE Column stats: NONE > table: > input format: org.apache.hadoop.mapred.TextInputFor= mat > output format: org.apache.hadoop.hive.ql.io.HiveIgn= oreKeyTextOutputFormat > serde: org.apache.hadoop.hive.serde2.lazy.LazySimpl= eSerDe > Stage: Stage-0 > Fetch Operator > limit: -1 > Processor Tree: > ListSink > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)