hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rajesh Balamohan (Jira)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-22609) Reduce number of FS getFileStatus calls in AcidUtils::getHdfsDirSnapshots
Date Mon, 16 Dec 2019 02:40:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-22609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Rajesh Balamohan updated HIVE-22609:
------------------------------------
    Attachment: HIVE-22609.3.patch

> Reduce number of FS getFileStatus calls in AcidUtils::getHdfsDirSnapshots
> -------------------------------------------------------------------------
>
>                 Key: HIVE-22609
>                 URL: https://issues.apache.org/jira/browse/HIVE-22609
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Rajesh Balamohan
>            Priority: Major
>         Attachments: HIVE-22609.1.patch, HIVE-22609.2.patch, HIVE-22609.3.patch
>
>
> [https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/AcidUtils.java#L1380]
> ACID delta folder contains {{_orc_acid_version}} and {{bucket_00000}} files. For both
these files, parent dir is the same. Number of getFileStatus in such cases should be reduced
by 1/2.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Mime
View raw message