hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Szehon Ho (JIRA)" <>
Subject [jira] [Updated] (HIVE-9482) Hive parquet timestamp compatibility
Date Fri, 30 Jan 2015 07:50:34 GMT


Szehon Ho updated HIVE-9482:
       Resolution: Fixed
    Fix Version/s:     (was: 0.15.0)
           Status: Resolved  (was: Patch Available)

Committed to trunk.  Thanks Brock for review.

> Hive parquet timestamp compatibility
> ------------------------------------
>                 Key: HIVE-9482
>                 URL:
>             Project: Hive
>          Issue Type: Bug
>          Components: File Formats
>    Affects Versions: 0.15.0
>            Reporter: Szehon Ho
>            Assignee: Szehon Ho
>              Labels: TODOC1.2
>             Fix For: 1.2.0
>         Attachments: HIVE-9482.2.patch, HIVE-9482.patch, HIVE-9482.patch, parquet_external_time.parq
> In current Hive implementation, timestamps are stored in UTC (converted from current
timezone), based on original parquet timestamp spec.
> However, we find this is not compatibility with other tools, and after some investigation
it is not the way of the other file formats, or even some databases (Hive Timestamp is more
equivalent of 'timestamp without timezone' datatype).
> This is the first part of the fix, which will restore compatibility with parquet-timestamp
files generated by external tools by skipping conversion on reading.
> Later fix will change the write path to not convert, and stop the read-conversion even
for files written by Hive itself.

This message was sent by Atlassian JIRA

View raw message