drill-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nate Putnam (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-5292) Better Parquet handling of sparse columns
Date Wed, 22 Feb 2017 22:22:44 GMT
Nate Putnam created DRILL-5292:

             Summary: Better Parquet handling of sparse columns
                 Key: DRILL-5292
                 URL: https://issues.apache.org/jira/browse/DRILL-5292
             Project: Apache Drill
          Issue Type: Improvement
          Components: Storage - Parquet
            Reporter: Nate Putnam

It appears the current implantation of ParquetRecordReader will fill in missing columns between
files as a NullableIntVector. It would be better if the code could determine if that column
was defined in a different file (and didn't conflict) and use the defined data type. 

This message was sent by Atlassian JIRA

View raw message