Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 06F8FC842 for ; Tue, 18 Nov 2014 15:54:35 +0000 (UTC) Received: (qmail 23558 invoked by uid 500); 18 Nov 2014 15:54:34 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 23486 invoked by uid 500); 18 Nov 2014 15:54:34 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 23474 invoked by uid 500); 18 Nov 2014 15:54:34 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 23471 invoked by uid 99); 18 Nov 2014 15:54:34 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 18 Nov 2014 15:54:34 +0000 Date: Tue, 18 Nov 2014 15:54:34 +0000 (UTC) From: =?utf-8?Q?Sergio_Pe=C3=B1a_=28JIRA=29?= To: hive-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HIVE-8359) Map containing null values are not correctly written in Parquet files MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-8359?page=3Dcom.atlassian.= jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=3D14216= 336#comment-14216336 ]=20 Sergio Pe=C3=B1a commented on HIVE-8359: ----------------------------------- Thanks [~mickaellcr]. Sorry for the confusion. I did not see you uploaded another patch here.=20 I just added two extra lines to the patch you uploaded. I will integrate yo= ur fixes there, and upload the patch again. > Map containing null values are not correctly written in Parquet files > --------------------------------------------------------------------- > > Key: HIVE-8359 > URL: https://issues.apache.org/jira/browse/HIVE-8359 > Project: Hive > Issue Type: Bug > Components: File Formats > Affects Versions: 0.13.1 > Reporter: Fr=C3=A9d=C3=A9ric TERRAZZONI > Assignee: Sergio Pe=C3=B1a > Attachments: HIVE-8359.1.patch, HIVE-8359.2.patch, HIVE-8359.4.pa= tch, map_null_val.avro > > > Tried write a map column in a Parquet file. The table shou= ld contain : > {code} > {"key3":"val3","key4":null} > {"key3":"val3","key4":null} > {"key1":null,"key2":"val2"} > {"key3":"val3","key4":null} > {"key3":"val3","key4":null} > {code} > ... and when you do a query like {code}SELECT * from mytable{code} > We can see that the table is corrupted : > {code} > {"key3":"val3"} > {"key4":"val3"} > {"key3":"val2"} > {"key4":"val3"} > {"key1":"val3"} > {code} > I've not been able to read the Parquet file in our software afterwards, a= nd consequently I suspect it to be corrupted.=20 > For those who are interested, I generated this Parquet table from an Avro= file.=20 -- This message was sent by Atlassian JIRA (v6.3.4#6332)