drill-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aman Sinha (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-2900) Joins are not getting parallelized in TestTpchDistributed unit tests
Date Wed, 29 Apr 2015 05:20:06 GMT
Aman Sinha created DRILL-2900:
---------------------------------

             Summary: Joins are not getting parallelized in TestTpchDistributed unit tests
                 Key: DRILL-2900
                 URL: https://issues.apache.org/jira/browse/DRILL-2900
             Project: Apache Drill
          Issue Type: Bug
          Components: Query Planning & Optimization
    Affects Versions: 0.8.0
            Reporter: Aman Sinha
            Assignee: Jinfeng Ni


The suite of unit tests in org.apache.drill.TestTpchDistributed are supposed to have parallel
joins, parallel aggregations and sort.  In order to force parallelism, the slice_target value
is set to 10 (default is 100K).  However, it looks like the plans of at least a few queries
that I tried (query 03, 05, 14) did not have parallelism for the hash joins.  The aggregations
and sort do have Exchanges, so they are getting parallelized.  

There is a likelihood that many  plans have BroadcastExchange on the right input of the join
and since the left input of the join is typically a single file we may be dropping the exchange
on the right input.  This should be fixed. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message