hive-user mailing list archives

From Gopal Vijayaraghavan
Subject Re: same query works with TEXTFILE and fails with ORC
Date Mon, 13 Apr 2015 21:46:18 GMT
> I'm getting an error in Hive when executing a query on a table in ORC

This is not an ORC bug, this looks like a vectorization issue.

Can you try comparing both query plans ("explain <query>") for the
"Execution mode: vectorized" markers?
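
As a rough sketch of that comparison, using the query from the report: the
ORC table name comes from the original message, while students_text is a
hypothetical name for the TEXTFILE copy. The exact plan layout varies
between Hive versions, so treat the comments as a guide only.

```sql
-- Plan for the ORC table: look for a line reading
-- "Execution mode: vectorized" under the map (or reduce) work.
EXPLAIN
SELECT CONCAT(TO_DATE(datetime), '-'), SUM(gpa)
FROM students_orc
GROUP BY CONCAT(TO_DATE(datetime), '-');

-- Plan for the TEXTFILE table (students_text is an assumed name):
-- the "Execution mode: vectorized" line should be absent here.
EXPLAIN
SELECT CONCAT(TO_DATE(datetime), '-'), SUM(gpa)
FROM students_text
GROUP BY CONCAT(TO_DATE(datetime), '-');
```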

TextFile queries are not vectorized today, since a row-major format gives
no way to tell whether a column is marked as isRepeating=true.

> SELECT CONCAT(TO_DATE(datetime), '-'), SUM(gpa) FROM students_orc
> GROUP BY CONCAT(TO_DATE(datetime), '-');

> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Unsuported
>vector output type: StringGroup
>        at 
>        at 
>        at 

The correct fix would be to handle this query pattern for vectorization
(or automatically disable vectorization, like it has to do for Unions).
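
Until then, a possible workaround, assuming the failure really does come
from the vectorized code path, is to turn vectorization off for the session
before running the query. This is only a sketch: it trades away the
vectorized speed-up for this session and does not fix the underlying issue.

```sql
-- Disable vectorized execution for the current session only.
SET hive.vectorized.execution.enabled=false;

-- Re-run the failing query from the report against the ORC table.
SELECT CONCAT(TO_DATE(datetime), '-'), SUM(gpa)
FROM students_orc
GROUP BY CONCAT(TO_DATE(datetime), '-');
```

If the query succeeds with the setting off, that supports the vectorization
diagnosis and is worth mentioning in the bug report.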

Can you log a bug on Apache JIRA against the version of Hive which
threw this error?

