hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xuefu Zhang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-7327) Refactoring: make Hive map side data processing reusable
Date Tue, 29 Jul 2014 13:13:39 GMT

    [ https://issues.apache.org/jira/browse/HIVE-7327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14077693#comment-14077693
] 

Xuefu Zhang commented on HIVE-7327:
-----------------------------------

It seems it's easier to use ExecMapper directly than any refactoring. Postpone this item for
now for later consideration.

> Refactoring: make Hive map side data processing reusable
> --------------------------------------------------------
>
>                 Key: HIVE-7327
>                 URL: https://issues.apache.org/jira/browse/HIVE-7327
>             Project: Hive
>          Issue Type: Sub-task
>    Affects Versions: 0.13.0
>            Reporter: Xuefu Zhang
>            Assignee: Xuefu Zhang
>
> ExecMapper is Hive's mapper implementation for MapReduce. Table rows are read by MR framework
and processed by ExecMapper.map() method, which invokes Hive's map-side operator tree starting
from MapOperator. This task is to extract the map-side data processing offered by the operator
tree so that it can be used by other execution engine such as Spark. This is purely refactoring
the existing code.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message