drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Deneche A. Hakim (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DRILL-2262) selecting columns of certain datatypes from a dictionary encoded parquet file created by drill fails
Date Mon, 23 Feb 2015 17:34:11 GMT

     [ https://issues.apache.org/jira/browse/DRILL-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Deneche A. Hakim updated DRILL-2262:
------------------------------------
    Attachment: DRILL-2262.1.patch.txt

This is a work in progress, the issue is fixed but the code needs improvement as other issues
may be related to this.

unit tests are passing along with functional tests.

> selecting columns of certain datatypes from a dictionary encoded parquet file created
by drill fails 
> -----------------------------------------------------------------------------------------------------
>
>                 Key: DRILL-2262
>                 URL: https://issues.apache.org/jira/browse/DRILL-2262
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Parquet
>            Reporter: Ramana Inukonda Nagaraj
>            Assignee: Deneche A. Hakim
>            Priority: Critical
>         Attachments: 0_0_0.parquet, DRILL-2262.1.patch.txt
>
>
> After creating a parquet file containing all datatypes using CTAS from drill certain
columns are not readable by doing a select in drill. 
> These datatypes are :
> DECIMAL18_col:       OPTIONAL INT64 O:DECIMAL R:0 D:1
> TIME_col:            OPTIONAL INT32 O:TIME R:0 D:1
> TIMESTAMP_col:       OPTIONAL INT64 O:TIMESTAMP R:0 D:1
> The select from these columns fails with a similar error
> {code}
> 0: jdbc:drill:> select DECIMAL18_col from parquet_all_default limit 1;
> Query failed: RemoteRpcException: Failure while running fragment., org.apache.drill.exec.vector.NullableDecimal18Vector
cannot be cast to org.apache.drill.exec.vector.NullableBigIntVector [ 26094858-356a-4128-ba2a-aa1473f74c93
on 10.10.30.167:31010 ]
> [ 26094858-356a-4128-ba2a-aa1473f74c93 on 10.10.30.167:31010 ]
> Error: exception while executing query: Failure while executing query. (state=,code=0)
> 0: jdbc:drill:> select TIME_col from parquet_all_default limit 1;
> Query failed: RemoteRpcException: Failure while running fragment., org.apache.drill.exec.vector.NullableTimeVector
cannot be cast to org.apache.drill.exec.vector.NullableIntVector [ 8938245b-a3a8-4bf2-8b6a-7e3860c90e8e
on 10.10.30.167:31010 ]
> [ 8938245b-a3a8-4bf2-8b6a-7e3860c90e8e on 10.10.30.167:31010 ]
> Error: exception while executing query: Failure while executing query. (state=,code=0)
> 0: jdbc:drill:> select TIMESTAMP_col from parquet_all_default limit 1;
> Query failed: RemoteRpcException: Failure while running fragment., org.apache.drill.exec.vector.NullableTimeStampVector
cannot be cast to org.apache.drill.exec.vector.NullableBigIntVector [ 5dff5b38-ece7-4159-acfe-78dacb563e20
on 10.10.30.167:31010 ]
> [ 5dff5b38-ece7-4159-acfe-78dacb563e20 on 10.10.30.167:31010 ]
> Error: exception while executing query: Failure while executing query. (state=,code=0)
> {code}
> Note: Dictionary encoding is turned on by default as of commit : https://github.com/apache/drill/commit/00c08eff2a5de6e4334fb57e952aac9a852c3d37
> After setting session options to turn off dictionary encoding the same queries succeed
successfully. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message