hadoop-hive-dev mailing list archives

From "Alan Gates (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-396) Hive performance benchmarks
Date Mon, 10 Aug 2009 18:21:15 GMT

    [ https://issues.apache.org/jira/browse/HIVE-396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12741465#action_12741465 ]

Alan Gates commented on HIVE-396:

> How many mapper slots and reducer slots are there in the cluster?
There are 36 mapper and 36 reducer slots on the cluster.

> How many mappers and reducers did hadoop, hive and pig take?
Hadoop and Hive took 35 maps; Pig took 36.  I set all three to use 4 reducers.

> Are you using hive trunk? What is the hive svn revision number?
SVN revision 796069

> I am also interested in learning how you write the efficient hadoop code for the aggregation
> query. Can you attach your hadoop code?
Attached as AlansMRCode.tgz

I looked in hive-default.xml and didn't see a hive.merge.mapfiles property.  Should I add it to
hive-default.xml and set it to false?  Out of curiosity, why do you default to merging map files first?
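
For reference, the override being asked about would look roughly like the fragment below. This is only a sketch under two assumptions: that this revision reads hive.merge.mapfiles even though it is not listed in hive-default.xml, and that the setting goes in hive-site.xml, where site-specific overrides normally live, rather than in hive-default.xml itself.

```xml
<!-- Assumed location: conf/hive-site.xml, inside the <configuration> element. -->
<!-- Assumption: hive.merge.mapfiles is honored at this revision. -->
<property>
  <name>hive.merge.mapfiles</name>
  <value>false</value>
  <description>Do not merge the small files produced by a map-only job.</description>
</property>
```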

> Hive performance benchmarks
> ---------------------------
>                 Key: HIVE-396
>                 URL: https://issues.apache.org/jira/browse/HIVE-396
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Zheng Shao
>            Assignee: Yuntao Jia
>         Attachments: hive_benchmark_2009-06-18.pdf, hive_benchmark_2009-06-18.tar.gz,
>                      hive_benchmark_2009-07-12.pdf, hive_benchmark_2009-07-21.tar.gz
>
> We need some performance benchmark to measure and track the performance improvements
> of Hive.
> Some references:
> PIG performance benchmarks PIG-200
> PigMix: http://wiki.apache.org/pig/PigMix

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
