hive-dev mailing list archives

From Sergio Peña (JIRA) <j...@apache.org>
Subject [jira] [Commented] (HIVE-6914) parquet-hive cannot write nested map (map value is map)
Date Wed, 19 Nov 2014 18:08:34 GMT

    [ https://issues.apache.org/jira/browse/HIVE-6914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14218250#comment-14218250 ]

Sergio Peña commented on HIVE-6914:
-----------------------------------

Hi [~mickaellcr],

Using the patch from HIVE-8359 for this bug sounds good. As for adding the qtests
to HIVE-8909, I think that ticket is meant to fix reading of the different nested-type
formats generated by the Thrift and Avro tools (it does not touch the writing part), so
it would be better to keep these write tests separate from the read tests.



> parquet-hive cannot write nested map (map value is map)
> -------------------------------------------------------
>
>                 Key: HIVE-6914
>                 URL: https://issues.apache.org/jira/browse/HIVE-6914
>             Project: Hive
>          Issue Type: Bug
>          Components: File Formats
>    Affects Versions: 0.12.0, 0.13.0
>            Reporter: Tongjie Chen
>              Labels: parquet, serialization
>         Attachments: HIVE-6914.1.patch, HIVE-6914.2.patch
>
>
> // table schema (identical for both plain text version and parquet version)
> hive> desc text_mmap;
> m map<string,map<string,string>>
> // sample nested map entry
> {"level1":{"level2_key1":"value1","level2_key2":"value2"}}
> The following query fails:
> insert overwrite table parquet_mmap select * from text_mmap;
> Caused by: parquet.io.ParquetEncodingException: This should be an ArrayWritable or MapWritable: org.apache.hadoop.hive.ql.io.parquet.writable.BinaryWritable@f2f8106
> at org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriter.writeData(DataWritableWriter.java:85)
> at org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriter.writeArray(DataWritableWriter.java:118)
> at org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriter.writeData(DataWritableWriter.java:80)
> at org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriter.writeData(DataWritableWriter.java:82)
> at org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriter.write(DataWritableWriter.java:55)
> at org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriteSupport.write(DataWritableWriteSupport.java:59)
> at org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriteSupport.write(DataWritableWriteSupport.java:31)
> at parquet.hadoop.InternalParquetRecordWriter.write(InternalParquetRecordWriter.java:115)
> at parquet.hadoop.ParquetRecordWriter.write(ParquetRecordWriter.java:81)
> at parquet.hadoop.ParquetRecordWriter.write(ParquetRecordWriter.java:37)
> at org.apache.hadoop.hive.ql.io.parquet.write.ParquetRecordWriterWrapper.write(ParquetRecordWriterWrapper.java:77)
> at org.apache.hadoop.hive.ql.io.parquet.write.ParquetRecordWriterWrapper.write(ParquetRecordWriterWrapper.java:90)
> at org.apache.hadoop.hive.ql.exec.FileSinkOperator.processOp(FileSinkOperator.java:622)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:793)
> at org.apache.hadoop.hive.ql.exec.SelectOperator.processOp(SelectOperator.java:87)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:793)
> at org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:92)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:793)
> at org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:540)
> ... 9 more



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
