drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jinfeng Ni (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-1500) Partition filtering might lead to an unnecessary column in the result set.
Date Tue, 07 Oct 2014 00:16:33 GMT
Jinfeng Ni created DRILL-1500:
---------------------------------

             Summary: Partition filtering might lead to an unnecessary column in the result
set. 
                 Key: DRILL-1500
                 URL: https://issues.apache.org/jira/browse/DRILL-1500
             Project: Apache Drill
          Issue Type: Bug
          Components: Query Planning & Optimization
            Reporter: Jinfeng Ni
            Priority: Minor


When partition filtering is used together with select * query, Drill might return the partitioning
column duplicately. 

Q1 : 

{code}
select * from dfs.`/Users/jni/work/incubator-drill/exec/java-exec/src/test/resources/multilevel/parquet`
where dir0=1994 and dir1='Q1' order by dir0 limit 1;

+------------+------------+------------+------------+------------+------------+-------------+------------+-----------------+---------------+----------------+--------------+
|   dir00    |    dir0    |    dir1    |  o_clerk   | o_comment  | o_custkey  | o_orderdate
| o_orderkey | o_orderpriority | o_orderstatus | o_shippriority | o_totalprice |
+------------+------------+------------+------------+------------+------------+-------------+------------+-----------------+---------------+----------------+--------------+
| 1994       | 1994       | Q1         | Clerk#000000743 | y pending requests integrate |
1292       | 1994-01-20  | 66         | 5-LOW           | F             | 0              |
104190.66    |
+------------+------------+------------+------------+------------+------------+-------------+------------+-----------------+---------------+----------------+--------------+
1 row selected (2.097 seconds)
{code}

We can see that column "dir0" appeared twice in the result set.  In comparison, here is the
query without partition filtering and the query result:

Q2:

{code}
select * from dfs.`/Users/jni/work/incubator-drill/exec/java-exec/src/test/resources/multilevel/parquet`
order by dir0 limit 1;
+------------+------------+------------+------------+------------+-------------+------------+-----------------+---------------+----------------+--------------+
|    dir0    |    dir1    |  o_clerk   | o_comment  | o_custkey  | o_orderdate | o_orderkey
| o_orderpriority | o_orderstatus | o_shippriority | o_totalprice |
+------------+------------+------------+------------+------------+-------------+------------+-----------------+---------------+----------------+--------------+
| 1994       | Q1         | Clerk#000000743 | y pending requests integrate | 1292       |
1994-01-20  | 66         | 5-LOW           | F             | 0              | 104190.66  
 |
+------------+------------+------------+------------+------------+-------------+------------+-----------------+---------------+----------------+--------------+
1 row selected (0.761 seconds)
{code}




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message