hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xuefu Zhang (JIRA)" <>
Subject [jira] [Updated] (HIVE-9153) Perf enhancement on CombineHiveInputFormat and HiveInputFormat
Date Mon, 29 Dec 2014 17:46:13 GMT


Xuefu Zhang updated HIVE-9153:
       Resolution: Fixed
    Fix Version/s: 0.15.0
           Status: Resolved  (was: Patch Available)

Committed to trunk and merged to spark branch. Thanks, Rui.

> Perf enhancement on CombineHiveInputFormat and HiveInputFormat
> --------------------------------------------------------------
>                 Key: HIVE-9153
>                 URL:
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Brock Noland
>            Assignee: Rui Li
>             Fix For: spark-branch, 0.15.0
>         Attachments: HIVE-9153.1-spark.patch, HIVE-9153.1-spark.patch, HIVE-9153.2.patch,
HIVE-9153.3.patch, screenshot.PNG
> The default InputFormat is {{CombineHiveInputFormat}} and thus HOS uses this. However,
Tez uses {{HiveInputFormat}}. Since tasks are relatively cheap in Spark, it might make sense
for us to use {{HiveInputFormat}} as well. We should evaluate this on a query which has many
input splits such as {{select count(\*) from store_sales where something is not null}}.

This message was sent by Atlassian JIRA

View raw message