drill-issues mailing list archives

From "Jacques Nadeau (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DRILL-1556) Querying JSON-converted-Parquet file throws parquet.io.ParquetDecodingException (Intermittent)
Date Sun, 04 Jan 2015 23:58:08 GMT

     [ https://issues.apache.org/jira/browse/DRILL-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jacques Nadeau updated DRILL-1556:
----------------------------------
    Fix Version/s: 0.8.0
         Priority: Critical  (was: Major)

> Querying JSON-converted-Parquet file throws parquet.io.ParquetDecodingException (Intermittent)
> ----------------------------------------------------------------------------------------------
>
>                 Key: DRILL-1556
>                 URL: https://issues.apache.org/jira/browse/DRILL-1556
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Parquet
>            Reporter: Abhishek Girish
>            Assignee: Jason Altekruse
>            Priority: Critical
>             Fix For: 0.8.0
>
>         Attachments: drillbit.log
>
>
> Querying JSON data works at higher values for limit:
> > select * from `yelp_academic_dataset_review.json` limit 1125458;
> Querying Parquet data (converted from JSON) fails at higher values for limit:
> > create table yelp_academic_dataset_review as select * from `yelp_academic_dataset_review.json`;
> [success]
> > select * from yelp_academic_dataset_review limit 40000;
> [data]
> java.lang.RuntimeException: java.sql.SQLException: Failure while trying to get next result batch.
> Logs indicate an error decoding the Parquet file. drillbit.log is attached.
> 2014-10-20 15:21:22,739 [bf4a3f58-781b-4c89-b718-e1ef6eab6da4:frag:1:0] ERROR o.a.drill.exec.ops.FragmentContext - Fragment Context received failure.
> parquet.io.ParquetDecodingException: Can't read value in column [votes, funny] INT64 at value 61063 out of 61063, 61063 out of 61063 in currentPage. repetition level: 0, definition level: 2
> The failure is sometimes consistent and sometimes intermittent, depending on the value given to the limit clause.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
