hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dong Chen (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-10016) Remove duplicated Hive table schema parsing in DataWritableReadSupport
Date Wed, 08 Apr 2015 05:02:12 GMT

     [ https://issues.apache.org/jira/browse/HIVE-10016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Dong Chen updated HIVE-10016:
-----------------------------
    Attachment: HIVE-10016.patch

Rebased to trunk. There are a little changes for the patch to resolve conflict.

> Remove duplicated Hive table schema parsing in DataWritableReadSupport
> ----------------------------------------------------------------------
>
>                 Key: HIVE-10016
>                 URL: https://issues.apache.org/jira/browse/HIVE-10016
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Dong Chen
>            Assignee: Dong Chen
>         Attachments: HIVE-10016-parquet.patch, HIVE-10016.1-parquet.patch, HIVE-10016.patch
>
>
> In {{DataWritableReadSupport.init()}}, the table schema is created and its string format
is set in conf. When construct the {{ParquetRecordReaderWrapper}} , the schema is fetched
from conf and parsed several times.
> We could remove these schema parsing, and improve the speed of getRecordReader  a bit.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message