hive-dev mailing list archives

From "Xuefu Zhang (JIRA)" <>
Subject [jira] [Commented] (HIVE-7327) Refactoring: make Hive map side data processing reusable
Date Tue, 29 Jul 2014 13:13:39 GMT


Xuefu Zhang commented on HIVE-7327:

It seems easier to use ExecMapper directly than to do any refactoring. Postponing this item
for later consideration.

> Refactoring: make Hive map side data processing reusable
> --------------------------------------------------------
>                 Key: HIVE-7327
>                 URL:
>             Project: Hive
>          Issue Type: Sub-task
>    Affects Versions: 0.13.0
>            Reporter: Xuefu Zhang
>            Assignee: Xuefu Zhang
> ExecMapper is Hive's mapper implementation for MapReduce. Table rows are read by the MR framework
> and processed by method, which invokes Hive's map-side operator tree starting
> from MapOperator. This task is to extract the map-side data processing offered by the operator
> tree so that it can be used by other execution engines such as Spark. This is purely a refactoring
> of the existing code.
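
The idea in the description above can be illustrated with a minimal sketch. This is not Hive code: RowOperator, TransformOperator, CollectOperator, MapSideProcessor, MrStyleMapper and PartitionStyleFunction are hypothetical names invented for the example, and rows are simplified to plain strings. It only shows the shape of the proposed refactoring, in which the operator-tree processing sits behind an engine-neutral entry point and each execution engine supplies a thin adapter.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

// Hypothetical stand-in for one node of a map-side operator tree.
interface RowOperator {
    void process(String row);
}

// Hypothetical operator that transforms a row and forwards it to its child.
final class TransformOperator implements RowOperator {
    private final Function<String, String> fn;
    private final RowOperator child;

    TransformOperator(Function<String, String> fn, RowOperator child) {
        this.fn = fn;
        this.child = child;
    }

    @Override
    public void process(String row) {
        child.process(fn.apply(row));
    }
}

// Hypothetical terminal operator that collects the rows it receives.
final class CollectOperator implements RowOperator {
    final List<String> rows = new ArrayList<>();

    @Override
    public void process(String row) {
        rows.add(row);
    }
}

// Engine-neutral entry point: the part the task proposes to extract.
final class MapSideProcessor {
    private final RowOperator root;

    MapSideProcessor(RowOperator root) {
        this.root = root;
    }

    void processRow(String row) {
        root.process(row);
    }
}

// MapReduce-style adapter: the framework hands over one record per call.
final class MrStyleMapper {
    private final MapSideProcessor processor;

    MrStyleMapper(MapSideProcessor processor) {
        this.processor = processor;
    }

    void map(String key, String value) {
        processor.processRow(value); // the key is ignored in this sketch
    }
}

// Partition-style adapter (assumed shape for another engine): rows arrive as a batch.
final class PartitionStyleFunction {
    private final MapSideProcessor processor;

    PartitionStyleFunction(MapSideProcessor processor) {
        this.processor = processor;
    }

    void call(Iterable<String> partition) {
        for (String row : partition) {
            processor.processRow(row);
        }
    }
}

class MapSideDemo {
    // Feeds rows through both adapters into the same operator chain.
    static List<String> run() {
        CollectOperator sink = new CollectOperator();
        RowOperator root = new TransformOperator(String::toUpperCase, sink);
        MapSideProcessor processor = new MapSideProcessor(root);

        MrStyleMapper mapper = new MrStyleMapper(processor);
        mapper.map("k1", "a");
        mapper.map("k2", "b");

        new PartitionStyleFunction(processor).call(Arrays.asList("c", "d"));
        return sink.rows;
    }

    public static void main(String[] args) {
        System.out.println(run());
    }
}
```

Both adapters delegate to the same MapSideProcessor, so the row-processing logic is written once. As the comment at the top of this message notes, calling ExecMapper directly was judged easier in practice, so this sketch reflects only the proposal, not what was done.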

This message was sent by Atlassian JIRA
