hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Verhaeghe Philippe <Philippe.Verhae...@worldline.com>
Subject RE: same query works with TEXTFILE and fails with ORC
Date Mon, 13 Apr 2015 14:52:07 GMT
Bug created in JIRA as HIVE-10316

-----Message d'origine-----
De : Gopal Vijayaraghavan [mailto:gopal@hortonworks.com] De la part de Gopal Vijayaraghavan
Envoyé : Monday, April 13, 2015 11:46 PM
À : user@hive.apache.org
Objet : Re: same query works with TEXTFILE and fails with ORC

> I¹m getting an error in Hive when executing a query on a table in ORC
>format.

This is not an ORC bug, this looks like a vectorization issue.

Can you try comparing both query plans (³explain <query>²) for the Execution mode:
vectorized markers?

TextFile queries are not vectorized today, since you cannot find if any column is marked as
isRepeating=true in a row-major format.

> SELECT CONCAT(TO_DATE(datetime), '-'),   SUM(gpa)  FROM students_orc
>GROUP BY CONCAT(TO_DATE(datetime), '-Œ);

...
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException:
>Unsuported vector output type: StringGroup
>        at
>org.apache.hadoop.hive.ql.exec.vector.VectorColumnSetInfo.addKey(Vector
>Col
>umnSetInfo.java:139)
>        at
>org.apache.hadoop.hive.ql.exec.vector.VectorHashKeyWrapperBatch.compile
>Key
>WrapperBatch(VectorHashKeyWrapperBatch.java:521)
>        at
>org.apache.hadoop.hive.ql.exec.vector.VectorGroupByOperator.initializeO
>p(V
>ectorGroupByOperator.java:786)

The correct fix would be to handle this query pattern for vectorization (or automatically
disable vectorization, like it has to do for Unions).

Can you log a bug on Apache JIRA against the correct version of hive which threw this error
up?

Cheers,
Gopal



Ce message et les pièces jointes sont confidentiels et réservés à l'usage exclusif de
ses destinataires. Il peut également être protégé par le secret professionnel. Si vous
recevez ce message par erreur, merci d'en avertir immédiatement l'expéditeur et de le détruire.
L'intégrité du message ne pouvant être assurée sur Internet, la responsabilité de Worldline
ne pourra être recherchée quant au contenu de ce message. Bien que les meilleurs efforts
soient faits pour maintenir cette transmission exempte de tout virus, l'expéditeur ne donne
aucune garantie à cet égard et sa responsabilité ne saurait être recherchée pour tout
dommage résultant d'un virus transmis.

This e-mail and the documents attached are confidential and intended solely for the addressee;
it may also be privileged. If you receive this e-mail in error, please notify the sender immediately
and destroy it. As its integrity cannot be secured on the Internet, the Worldline liability
cannot be triggered for the message content. Although the sender endeavours to maintain a
computer virus-free network, the sender does not warrant that this transmission is virus-free
and will not be liable for any damages resulting from any virus transmitted.

Mime
View raw message