drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Miroslav Holubec (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-4849) Refresh table metadata performance: read only new/updated parquet files
Date Wed, 17 Aug 2016 08:08:21 GMT
Miroslav Holubec created DRILL-4849:
---------------------------------------

             Summary: Refresh table metadata performance: read only new/updated parquet files
                 Key: DRILL-4849
                 URL: https://issues.apache.org/jira/browse/DRILL-4849
             Project: Apache Drill
          Issue Type: Improvement
          Components: Query Planning & Optimization
    Affects Versions: 1.7.0
            Reporter: Miroslav Holubec


Currently REFRESH TABLE METADATA takes serious amount of time for many small parquet files.
We can actually only read these parquets, which are new or changed. This will require to add
modificationTime into file metadata.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message