hive-dev mailing list archives

From Frédéric TERRAZZONI (JIRA) <j...@apache.org>
Subject [jira] [Created] (HIVE-8359) Map containing null values are not correctly written in Parquet files
Date Mon, 06 Oct 2014 17:17:33 GMT
Frédéric TERRAZZONI created HIVE-8359:
-----------------------------------------

             Summary: Map containing null values are not correctly written in Parquet files
                 Key: HIVE-8359
                 URL: https://issues.apache.org/jira/browse/HIVE-8359
             Project: Hive
          Issue Type: Bug
    Affects Versions: 0.13.1
            Reporter: Frédéric TERRAZZONI


I tried to write a map<string,string> column to a Parquet file. The table should contain:
{code}
{"key3":"val3","key4":null}
{"key3":"val3","key4":null}
{"key1":null,"key2":"val2"}
{"key3":"val3","key4":null}
{"key3":"val3","key4":null}
{code}
... but a query such as {code}SELECT * from mytable{code}
shows that the table is corrupted:
{code}
{"key3":"val3"}
{"key4":"val3"}
{"key3":"val2"}
{"key4":"val3"}
{"key1":"val3"}
{code}

I have not been able to read the Parquet file in our software afterwards either, so I
suspect that the file itself is corrupted.
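
One reading that is consistent with the two listings above (an assumption drawn only from the reported data, not a confirmed diagnosis of the writer code) is that every key is kept, null values are skipped, and the remaining keys and values are paired up one entry per row. The short Python sketch below checks that reading against the reported rows; the helper name misaligned_rows is hypothetical and None stands in for NULL.

```python
# Rows the table should contain, as listed in the report (None stands in for NULL).
expected = [
    {"key3": "val3", "key4": None},
    {"key3": "val3", "key4": None},
    {"key1": None, "key2": "val2"},
    {"key3": "val3", "key4": None},
    {"key3": "val3", "key4": None},
]

# Rows actually returned by SELECT * in the report.
observed = [
    {"key3": "val3"},
    {"key4": "val3"},
    {"key3": "val2"},
    {"key4": "val3"},
    {"key1": "val3"},
]


def misaligned_rows(rows):
    """Hypothetical model of the symptom, not the actual Hive code path.

    All keys are kept in order, null values are dropped, and the two
    flattened streams are paired up with one key/value entry per row.
    """
    keys = [key for row in rows for key in row]
    values = [value for row in rows for value in row.values() if value is not None]
    pairs = [{key: value} for key, value in zip(keys, values)]
    return pairs[: len(rows)]


# Compare the model's output with the rows seen in the report.
print(misaligned_rows(expected) == observed)
```

If the comparison holds, it would point at null map values being dropped on write rather than at the reader, but that remains to be verified against the Parquet file itself.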

For those who are interested, I generated this Parquet table from an Avro file. I don't know
how to attach it here, though ... :)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
