hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nicholas Hakobian <nicholas.hakob...@rallyhealth.com>
Subject Re: Hive parquet on EMR
Date Fri, 01 Apr 2016 19:34:34 GMT
Make sure your column names in the struct exactly matches the case in the
table create statement. We just decided to make everything lowercase, but
occasionally someone forgets and makes one of the characters upper case and
Hive fails.

There was a fix for this in Hive, but it only fixed querying with mixed
case in top level column names, not columns nested in structs.

Hope this helps,

Nick

Nicholas Szandor Hakobian
Data Scientist
Rally Health
nicholas.hakobian@rallyhealth.com

On Fri, Apr 1, 2016 at 12:09 PM, Jose Rozanec <jose.rozanec@mercadolibre.com
> wrote:

> Hello,
>
> We have a Hive (v 1.0.0) cluster at EMR and data stored in parquet files.
> When querying data, it fails to return result, causing a NPE. We think the
> error may be related with Hive deserialization, since can query data
> without problems when using other technologies (ex.: Presto).
>
> Here we provide the stacktrace of the exception we get
> <http://pastebin.com/6SfNCrzj>. We checked Hive Parquet support
> <https://cwiki.apache.org/confluence/display/Hive/Parquet> and release
> notes <https://github.com/apache/hive/blob/master/RELEASE_NOTES.txt>, and
> seems map and struct types are supported for this version (1.0.0) Our
> Parquet schema involves the following types: string, boolean, map, struct.
>
> Did anyone had a similar experience? Is there a workaround for this?
>
> Thank you in advance,
>
>
>

Mime
View raw message