hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Qiang.Kang (Jira)" <j...@apache.org>
Subject [jira] [Assigned] (HIVE-23685) Removing user's extra resources when executing File Merge Task
Date Sat, 13 Jun 2020 04:48:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-23685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Qiang.Kang reassigned HIVE-23685:
---------------------------------


> Removing user's extra resources when executing File Merge Task
> --------------------------------------------------------------
>
>                 Key: HIVE-23685
>                 URL: https://issues.apache.org/jira/browse/HIVE-23685
>             Project: Hive
>          Issue Type: Bug
>          Components: Physical Optimizer, Query Planning
>            Reporter: Qiang.Kang
>            Assignee: Qiang.Kang
>            Priority: Critical
>
> Hi, we find that MapReduce's file merge map containers will download user's extra resources(such
as: added jars, files, archives) before launching task. When these resources are large or
the network is busy, file merge jobs will be timeout, causing the query be failed. As we all
know, file merge task will run correctly just with hive-exec.jar and MapReduce framework.
Therefore, there is no need to download user's resources. The patch below prevents setting
`tmpjars` for FileMerge Task.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Mime
View raw message