drill-dev mailing list archives

From "Hao Zhu (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-3578) UnsupportedOperationException: Unable to get value vector class for minor type [FIXEDBINARY] and mode [OPTIONAL]
Date Wed, 29 Jul 2015 18:50:04 GMT
Hao Zhu created DRILL-3578:
------------------------------

             Summary: UnsupportedOperationException: Unable to get value vector class for minor type [FIXEDBINARY] and mode [OPTIONAL]
                 Key: DRILL-3578
                 URL: https://issues.apache.org/jira/browse/DRILL-3578
             Project: Apache Drill
          Issue Type: Bug
          Components: Execution - Data Types
    Affects Versions: 1.1.0
            Reporter: Hao Zhu
            Assignee: Hanifi Gunes


Drill fails to read a "timestamp" column in a Parquet file generated by Hive.

How to reproduce:
1. Create an external Hive CSV table in Hive 1.0:
{code}
create external table type_test_csv
(
  id1 int,
  id2 string,
  id3 timestamp,
  id4 double
)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ','
STORED AS TEXTFILE
LOCATION '/xxx/testcsv';
{code}
2. Put sample data in the location of the external table above:
{code}
1,One,2015-01-01 00:01:00,1.0
2,Two,2015-01-02 00:02:00,2.0
{code}

3. Create a Parquet Hive table and populate it from the CSV table:
{code}
create external table type_test
(
  id1 int,
  id2 string,
  id3 timestamp,
  id4 double
)
STORED AS PARQUET
LOCATION '/xxx/type_test';

INSERT OVERWRITE TABLE type_test
  SELECT * FROM type_test_csv;
{code}
4. Query the Parquet file directly through the filesystem storage plugin:
{code}
> select * from dfs.`xxx/type_test`;
Error: SYSTEM ERROR: UnsupportedOperationException: Unable to get value vector class for minor
type [FIXEDBINARY] and mode [OPTIONAL]

Fragment 0:0

[Error Id: fccfe8b2-6427-46e5-8bfd-cac639e526e8 on h3.poc.com:31010] (state=,code=0)
{code}
5. If the sample data contains only one row:
{code}
1,One,2015-01-01 00:01:00,1.0
{code}
then the error message becomes:
{code}
> select * from dfs.`xxx/type_test`;
Error: SYSTEM ERROR: UnsupportedOperationException: Unsupported type:INT96


[Error Id: b52b5d46-63a8-4be6-a11d-999a1b46c7c2 on h3.poc.com:31010] (state=,code=0)
{code}

Querying the same table through the Hive storage plugin works fine. The issue only affects the filesystem storage plugin.
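
For context on the INT96 type named in the second error: Hive writes "timestamp" columns to Parquet as the 12-byte INT96 physical type. The sketch below shows how such a value is commonly decoded. It is not Drill's reader code; the function name is hypothetical, and it assumes the usual layout of 8 little-endian bytes of nanoseconds within the day followed by 4 little-endian bytes holding the Julian day number. Time-zone adjustment is ignored.

```python
import struct
from datetime import datetime, timedelta

# Julian day number of the Unix epoch (1970-01-01).
JULIAN_DAY_OF_UNIX_EPOCH = 2440588


def decode_int96_timestamp(raw: bytes) -> datetime:
    """Decode a 12-byte INT96 value into a naive datetime (hypothetical helper).

    Assumed layout: 8 bytes little-endian nanoseconds within the day,
    then 4 bytes little-endian Julian day number.
    """
    if len(raw) != 12:
        raise ValueError("an INT96 value must be exactly 12 bytes")
    nanos_of_day, julian_day = struct.unpack("<qi", raw)
    days_since_epoch = julian_day - JULIAN_DAY_OF_UNIX_EPOCH
    # datetime only has microsecond resolution, so finer digits are dropped.
    return datetime(1970, 1, 1) + timedelta(
        days=days_since_epoch, microseconds=nanos_of_day // 1000
    )


# Build the bytes for the first sample row's timestamp (2015-01-01 00:01:00):
# 60 seconds into Julian day 2457024.
sample = struct.pack("<qi", 60 * 1_000_000_000, 2457024)
decoded = decode_int96_timestamp(sample)
```

Here `decoded` corresponds to the `id3` value of the first sample row. A reader without a mapping for this 12-byte type has no way to turn it into a timestamp, which is consistent with the "Unsupported type:INT96" message above.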



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
