hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xuefu Zhang (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-11363) Prewarm Hive on Spark containers [Spark Branch]
Date Tue, 28 Jul 2015 22:12:04 GMT

     [ https://issues.apache.org/jira/browse/HIVE-11363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Xuefu Zhang updated HIVE-11363:
-------------------------------
    Attachment: HIVE-11363.4-spark.patch

> Prewarm Hive on Spark containers [Spark Branch]
> -----------------------------------------------
>
>                 Key: HIVE-11363
>                 URL: https://issues.apache.org/jira/browse/HIVE-11363
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>    Affects Versions: 1.1.0
>            Reporter: Xuefu Zhang
>            Assignee: Xuefu Zhang
>         Attachments: HIVE-11363.1-spark.patch, HIVE-11363.2-spark.patch, HIVE-11363.3-spark.patch,
HIVE-11363.4-spark.patch
>
>
> When Hive job is launched by Oozie, a Hive session is created and job script is executed.
Session is closed when Hive job is completed. Thus, Hive session is not shared among Hive
jobs either in an Oozie workflow or across workflows. Since the parallelism of a Hive job
executed on Spark is impacted by the available executors, such Hive jobs will suffer the executor
ramp-up overhead. The idea here is to wait a bit so that enough executors can come up before
a job can be executed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message