drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Julien Le Dem (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-3972) Vectorize Parquet Writer
Date Fri, 23 Oct 2015 18:47:27 GMT
Julien Le Dem created DRILL-3972:
------------------------------------

             Summary: Vectorize Parquet Writer
                 Key: DRILL-3972
                 URL: https://issues.apache.org/jira/browse/DRILL-3972
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Parquet
            Reporter: Julien Le Dem


Currently the [ParquetRecordWriter|https://github.com/apache/drill/blob/a98da39dd5a8fa368afd8765f4e981826bbfcc0f/exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/ParquetRecordWriter.java]
receives one record at a time and then turns that into columns.
Which means we convert from Drill columns to rows and then to Parquet columns.
Instead we could directly convert the Drill columns into Parquet columns in a vectorized manner.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message