hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yuntao Jia (JIRA)" <>
Subject [jira] Updated: (HIVE-396) Hive performance benchmarks
Date Tue, 21 Jul 2009 23:13:15 GMT


Yuntao Jia updated HIVE-396:

    Attachment: hive_benchmark_2009-07-21.tar.gz

Updated the benchmark script to make it more automatic. Now it outputs all the timings to
a csv file which looks like:

Timings, grep select, rankings select, uservisits aggregation, uservisits-rankings join
Trial 1
Trial 2
Trial 3

The first line shows the queries, followed by query timings from different trials. Within
each trial, there are three lines showing the query timings on Hive, PIG and Hadoop, respectively.
The numbers here are for illustration purpose only.
The file can be directly opened in excel. User can then easily generate a performance graph
on top of it

> Hive performance benchmarks
> ---------------------------
>                 Key: HIVE-396
>                 URL:
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Zheng Shao
>            Assignee: Yuntao Jia
>         Attachments: hive_benchmark_2009-06-18.pdf, hive_benchmark_2009-06-18.tar.gz,
hive_benchmark_2009-07-12.pdf, hive_benchmark_2009-07-21.tar.gz
> We need some performance benchmark to measure and track the performance improvements
of Hive.
> Some references:
> PIG performance benchmarks PIG-200
> PigMix:

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message