hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vineet Garg (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-21279) Avoid moving/rename operation in FileSink op for SELECT queries
Date Tue, 26 Feb 2019 21:51:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-21279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Vineet Garg updated HIVE-21279:
-------------------------------
    Status: Open  (was: Patch Available)

> Avoid moving/rename operation in FileSink op for SELECT queries
> ---------------------------------------------------------------
>
>                 Key: HIVE-21279
>                 URL: https://issues.apache.org/jira/browse/HIVE-21279
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Planning
>            Reporter: Vineet Garg
>            Assignee: Vineet Garg
>            Priority: Major
>             Fix For: 4.0.0
>
>         Attachments: HIVE-21279.1.patch, HIVE-21279.2.patch, HIVE-21279.3.patch, HIVE-21279.4.patch,
HIVE-21279.5.patch, HIVE-21279.6.patch, HIVE-21279.7.patch, HIVE-21279.8.patch, HIVE-21279.9.patch
>
>
> Currently at the end of a job FileSink operator moves/rename temp directory to another
directory from which FetchTask fetches result. This is done to avoid fetching potential partial/invalid
files by failed/runway tasks. This operation is expensive for cloud storage. It could be avoided
if FetchTask is passed on set of files to read from instead of whole directory.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message