Mailing-List: contact dev-help@hive.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@hive.apache.org
Date: Thu, 16 Oct 2014 23:57:33 +0000 (UTC)
From: "Chao (JIRA)" <jira@apache.org>
To: hive-dev@hadoop.apache.org
Message-ID: <JIRA.12747551.1413081771000.282819.1413503853954@Atlassian.JIRA>
In-Reply-To: <JIRA.12747551.1413081771000@Atlassian.JIRA>
References: <JIRA.12747551.1413081771000@Atlassian.JIRA>
 <JIRA.12747551.1413081771583@arcas>
Subject: [jira] [Commented] (HIVE-8436) Modify SparkWork to split works with
 multiple child works [Spark Branch]
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit


    [ https://issues.apache.org/jira/browse/HIVE-8436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14174494#comment-14174494 ] 

Chao commented on HIVE-8436:
----------------------------

Yeah, let me add such a test with this patch. Originally I was thinking to do it as a followup, but now I think it's better to do it together, to ensure correctness.
Also, my latest patch doesn't trigger the tests. Why is that?

> Modify SparkWork to split works with multiple child works [Spark Branch]
> ------------------------------------------------------------------------
>
>                 Key: HIVE-8436
>                 URL: https://issues.apache.org/jira/browse/HIVE-8436
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Xuefu Zhang
>            Assignee: Chao
>         Attachments: HIVE-8436.1-spark.patch, HIVE-8436.2-spark.patch, HIVE-8436.3-spark.patch
>
>
> Based on the design doc, we need to split the operator tree of a work in SparkWork if the work is connected to multiple child works. The way splitting the operator tree is performed by cloning the original work and removing unwanted branches in the operator tree. Please refer to the design doc for details.
> This process should be done right before we generate SparkPlan. We should have a utility method that takes the orignal SparkWork and return a modified SparkWork.
> This process should also keep the information about the original work and its clones. Such information will be needed during SparkPlan generation (HIVE-8437).


--
This message was sent by Atlassian JIRA
(v6.3.4#6332)