drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Altekruse (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-945) Create table in parquet fails on repeated data
Date Thu, 03 Jul 2014 16:58:36 GMT

    [ https://issues.apache.org/jira/browse/DRILL-945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14051681#comment-14051681
] 

Jason Altekruse commented on DRILL-945:
---------------------------------------

This ended up opening up an issue on the parquet library side. The problem is that the current
implementation allows for a record to be broken up between two pages, which necessitates us
doing some extra work to cache the partial record before we move onto the next. For cases
that do not exhibit this behavior the repeated implementation works in a partial patch that
will be posted shortly.

> Create table in parquet fails on repeated data
> ----------------------------------------------
>
>                 Key: DRILL-945
>                 URL: https://issues.apache.org/jira/browse/DRILL-945
>             Project: Apache Drill
>          Issue Type: Sub-task
>          Components: Storage - Parquet
>            Reporter: Jason Altekruse
>            Assignee: Jason Altekruse
>             Fix For: 1.0.0-BETA1
>
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message