Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 43613 invoked from network); 18 Feb 2011 22:36:33 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 18 Feb 2011 22:36:33 -0000 Received: (qmail 76664 invoked by uid 500); 18 Feb 2011 22:36:31 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 76626 invoked by uid 500); 18 Feb 2011 22:36:30 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 76618 invoked by uid 99); 18 Feb 2011 22:36:30 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Feb 2011 22:36:30 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.214.176] (HELO mail-iw0-f176.google.com) (209.85.214.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Feb 2011 22:36:22 +0000 Received: by iwn2 with SMTP id 2so4243604iwn.35 for ; Fri, 18 Feb 2011 14:36:01 -0800 (PST) Received: by 10.42.167.129 with SMTP id s1mr1562761icy.231.1298068561073; Fri, 18 Feb 2011 14:36:01 -0800 (PST) MIME-Version: 1.0 Received: by 10.42.213.129 with HTTP; Fri, 18 Feb 2011 14:35:41 -0800 (PST) X-Originating-IP: [64.105.168.204] In-Reply-To: <12CB02B10103F9488CBF215549B77EA80CD5BC@PVSWMAIL2010.pervasive.com> References: <12CB02B10103F9488CBF215549B77EA80CD5BC@PVSWMAIL2010.pervasive.com> From: Ted Dunning Date: Fri, 18 Feb 2011 14:35:41 -0800 Message-ID: Subject: Re: benchmark choices To: common-user@hadoop.apache.org Cc: Jim Falgout Content-Type: multipart/alternative; boundary=90e6ba6e899e7a15d7049c9623e1 X-Virus-Checked: Checked by ClamAV on apache.org --90e6ba6e899e7a15d7049c9623e1 Content-Type: text/plain; charset=ISO-8859-1 I just read the malstone report. They report times for a Java version that is many (5x) times slower than for a streaming implementation. That single fact indicates that the Java code is so appallingly bad that this is a very bad benchmark. On Fri, Feb 18, 2011 at 2:27 PM, Jim Falgout wrote: > We use MalStone and TeraSort. For Hive, you can use TPC-H, at least the > data and the queries, if not the query generator. There is a Jira issue in > Hive that discusses the TPC-H "benchmark" if you're interested. Sorry, I > don't remember the issue number offhand. > > -----Original Message----- > From: Shrinivas Joshi [mailto:jshrinivas@gmail.com] > Sent: Friday, February 18, 2011 3:32 PM > To: common-user@hadoop.apache.org > Subject: benchmark choices > > Which workloads are used for serious benchmarking of Hadoop clusters? Do > you care about any of the following workloads : > TeraSort, GridMix v1, v2, or v3, MalStone, CloudBurst, MRBench, NNBench, > sample apps shipped with Hadoop distro like PiEstimator, dbcount etc. > > Thanks, > -Shrinivas > > --90e6ba6e899e7a15d7049c9623e1--