hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rui Li (JIRA)" <>
Subject [jira] [Commented] (HIVE-8793) Make sure multi-insert works with map join [Spark Branch]
Date Wed, 12 Nov 2014 12:30:34 GMT


Rui Li commented on HIVE-8793:

Now the result seems better.
Another thing I want to clarify is that the split spark work is done in a BFS manner. So if
we have a work graph like:
Map1 -> Reducer2 -> Reducer3
                 -> Reducer4
It will be split into something like:
Map1 -> Reducer5 -> Reducer3
     -> Reducer6 -> Reducer4
And Map1 still has two children.
Is this OK?

> Make sure multi-insert works with map join [Spark Branch]
> ---------------------------------------------------------
>                 Key: HIVE-8793
>                 URL:
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>    Affects Versions: spark-branch
>            Reporter: Chao
>            Assignee: Rui Li
>         Attachments: HIVE-8793.1-spark.patch, HIVE-8793.2-spark.patch
> Currently, HIVE-8622 is implemented based on an assumption, that for a map join query,
a BaseWork would not have multiple children. By testing through subquery_multiinsert.q did
reveal that's the case. But, we need to investigate on this, and make sure this won't happen
in general.

This message was sent by Atlassian JIRA

View raw message