drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-4980) Upgrading of the approach of parquet date correctness status detection
Date Wed, 09 Nov 2016 16:58:59 GMT

    [ https://issues.apache.org/jira/browse/DRILL-4980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651437#comment-15651437

ASF GitHub Bot commented on DRILL-4980:

Github user paul-rogers commented on a diff in the pull request:

    --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java
    @@ -185,7 +185,8 @@ private Metadata(FileSystem fs, ParquetFormatConfig formatConfig)
    -    ParquetTableMetadata_v3 parquetTableMetadata = new ParquetTableMetadata_v3(true);
    +    ParquetTableMetadata_v3 parquetTableMetadata = new ParquetTableMetadata_v3(DrillVersionInfo.getVersion(),
    +        ParquetWriter.WRITER_VERSION);
    --- End diff --
    I'm a bit confused. The writer version applies to the Parquet files which Drill writes.
(Or, at least, that was the intention.)
    Here, we're talking about metadata. There may well be a metadata writer, but that should
be a different writer, with a different version.
    Not sure we want to initialize the metadata object with the current writer version: there
seems to be no correlation between the metadata object and the writer version.
    On the other hand, the metadata can certainly hold the writer version, but it must be
the value read from the Parquet file itself; not a value set by the code. Else, we have the
difficult problem of making sure that the code-set version number agrees with the actual version
number in the file.

> Upgrading of the approach of parquet date correctness status detection
> ----------------------------------------------------------------------
>                 Key: DRILL-4980
>                 URL: https://issues.apache.org/jira/browse/DRILL-4980
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Storage - Parquet
>    Affects Versions: 1.8.0
>            Reporter: Vitalii Diravka
>            Assignee: Vitalii Diravka
>             Fix For: 1.9.0
> This jira is an addition for the [DRILL-4203|https://issues.apache.org/jira/browse/DRILL-4203].
> The date correctness label for the new generated parquet files should be upgraded. 

This message was sent by Atlassian JIRA

View raw message