drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jinfeng Ni (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DRILL-2900) Joins are not getting parallelized in TestTpchDistributed unit tests
Date Thu, 02 Jul 2015 20:35:04 GMT

     [ https://issues.apache.org/jira/browse/DRILL-2900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Jinfeng Ni updated DRILL-2900:
    Fix Version/s:     (was: 1.2.0)

> Joins are not getting parallelized in TestTpchDistributed unit tests
> --------------------------------------------------------------------
>                 Key: DRILL-2900
>                 URL: https://issues.apache.org/jira/browse/DRILL-2900
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Query Planning & Optimization
>    Affects Versions: 0.8.0
>            Reporter: Aman Sinha
>            Assignee: Jinfeng Ni
>             Fix For: 1.3.0
> The suite of unit tests in org.apache.drill.TestTpchDistributed are supposed to have
parallel joins, parallel aggregations and sort.  In order to force parallelism, the slice_target
value is set to 10 (default is 100K).  However, it looks like the plans of at least a few
queries that I tried (query 03, 05, 14) did not have parallelism for the hash joins.  The
aggregations and sort do have Exchanges, so they are getting parallelized.  
> There is a likelihood that many  plans have BroadcastExchange on the right input of the
join and since the left input of the join is typically a single file we may be dropping the
exchange on the right input.  This should be fixed. 

This message was sent by Atlassian JIRA

View raw message