hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jimmy Xiang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-8843) Release RDD cache when Hive query is done [Spark Branch]
Date Tue, 16 Dec 2014 17:31:14 GMT

    [ https://issues.apache.org/jira/browse/HIVE-8843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14248536#comment-14248536
] 

Jimmy Xiang commented on HIVE-8843:
-----------------------------------

Thought about it again. The current solution seems to be the simplest one.  Did I miss anything?

> Release RDD cache when Hive query is done [Spark Branch]
> --------------------------------------------------------
>
>                 Key: HIVE-8843
>                 URL: https://issues.apache.org/jira/browse/HIVE-8843
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Xuefu Zhang
>            Assignee: Jimmy Xiang
>         Attachments: HIVE-8843.1-spark.patch
>
>
> In some multi-inser cases, RDD.cache() is called to improve performance. RDD is SparkContext
specific, but the caching is useful only for the query. Thus, once the query is executed,
we need to release the cache used by calling RDD.uncache().



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message