drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Deneche A. Hakim (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-2253) Vectorized Parquet reader fails to read correctly against RLE Dictionary encoded DATE column
Date Tue, 17 Feb 2015 22:58:11 GMT

    [ https://issues.apache.org/jira/browse/DRILL-2253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14325068#comment-14325068
] 

Deneche A. Hakim commented on DRILL-2253:
-----------------------------------------

running {{parquet-tools meta}} on the generated parquet file gives the following informations:
{noformat}
creator:     parquet-mr 

file schema: root 
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
date:        OPTIONAL INT32 O:DATE R:0 D:1

row group 1: RC:600 TS:1885 
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
date:         INT32 SNAPPY DO:0 FPO:4 SZ:1895/1885/0.99 VC:600 ENC:PLAIN_DICTIONARY,BIT_PACKED,RLE
{noformat}

One important note is: the exception doesn't happen if the column's encoder is PLAIN instead
of PLAIN_DICTIONARY

> Vectorized Parquet reader fails to read correctly against RLE Dictionary encoded DATE
column
> --------------------------------------------------------------------------------------------
>
>                 Key: DRILL-2253
>                 URL: https://issues.apache.org/jira/browse/DRILL-2253
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Parquet
>            Reporter: Jacques Nadeau
>            Assignee: Deneche A. Hakim
>         Attachments: 2253.csv, 2253.parquet
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message