drill-issues mailing list archives

From "Paul Rogers (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-5291) Parquet Reader produces low density batches - variable width fields
Date Wed, 22 Feb 2017 19:53:44 GMT
Paul Rogers created DRILL-5291:
----------------------------------

             Summary: Parquet Reader produces low density batches - variable width fields
                 Key: DRILL-5291
                 URL: https://issues.apache.org/jira/browse/DRILL-5291
             Project: Apache Drill
          Issue Type: Bug
            Reporter: Paul Rogers
             Fix For: 1.8.0


See DRILL-5266 for background. That JIRA analyzed why the Parquet reader produces "low
density" record batches. It focused on fixed-width fields: due to a bug, we overestimate
the space those fields take.

Even after that bug is fixed, the Parquet reader will continue to produce low density
batches for variable-width fields. DRILL-5266 explains why.

This ticket covers the variable-width case so that we don't lose sight of it once the fixed-width
case is fixed.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
