hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sergio Peña (JIRA) <j...@apache.org>
Subject [jira] [Updated] (HIVE-9670) Avoid reading file footers in ParquetRecordReaderWrapper
Date Thu, 12 Feb 2015 16:52:14 GMT

     [ https://issues.apache.org/jira/browse/HIVE-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Sergio Peña updated HIVE-9670:
------------------------------
    Assignee:     (was: Sergio Peña)

> Avoid reading file footers in ParquetRecordReaderWrapper
> --------------------------------------------------------
>
>                 Key: HIVE-9670
>                 URL: https://issues.apache.org/jira/browse/HIVE-9670
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Sergio Peña
>
> ParquetRecordReaderWrapper is reading the file footer to create the splits, but then
when calling the realReader.initialize(), the file footer is read again by parquet.
> The issue PARQUET-139 did work to avoid reading the footers in parquet-avro. We should
implement the same idea in Hive, and update the parquet library to the latest stable version
from upstream.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message