Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 57944 invoked from network); 13 Jan 2011 16:51:58 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 13 Jan 2011 16:51:58 -0000 Received: (qmail 79894 invoked by uid 500); 13 Jan 2011 16:51:55 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 79502 invoked by uid 500); 13 Jan 2011 16:51:52 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 79493 invoked by uid 99); 13 Jan 2011 16:51:51 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Jan 2011 16:51:51 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,RFC_ABUSE_POST,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [68.142.206.157] (HELO web33508.mail.mud.yahoo.com) (68.142.206.157) by apache.org (qpsmtpd/0.29) with SMTP; Thu, 13 Jan 2011 16:51:44 +0000 Received: (qmail 94471 invoked by uid 60001); 13 Jan 2011 16:51:23 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1294937482; bh=0mH1lJ5xZdQiFgFHoauqcSecNKaEMUbUsfTwzD80Wf8=; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Reply-To:Subject:To:MIME-Version:Content-Type; b=UbabPs94GbmggUBM3YWW8y1xUsE/nTvtcw6czXnBrC8AR/s69bM1VBqyIgUJfoQeZ8ekCoilPKOgOXYS5Kts+/yFNvOpVSuEzg/6GP5a02VWGxc1OAfJLAp3Nl3ldL+pU0Dh/Zmn8pYzcCMaG1bcwjEYw6k4dXL6489SgVkeYTQ= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Reply-To:Subject:To:MIME-Version:Content-Type; b=SJlJmQfb+U9XP3kJi3f4vUN5JnMYdB9URaCvfX2ZcmarqsGhPbs2I3T/98uaRfXs67NRNdh1NdzxSrFn9XfsKKtp29805vPPbQUd+c+TdnKDo5oDA7fuBRo+uV002PjwS51+ZqBfl6z2GiYMUJ7yV15Ah3JgU9Vgh6A/Z9e1K30=; Message-ID: <881015.94450.qm@web33508.mail.mud.yahoo.com> X-YMail-OSG: 3b9WgkMVM1mtiWv1hInvkqBkpo77drJydUhJ7Ig2Qo2HYB5 QIoOcK66PbZFc3F3_wHfbMinkoT2gWnBnws1Fhk_hCw.oK6L8X6Eq2xW.xh0 974fGwyea79bIna0IyDP8qewry.c9itZTJx3DpjskQsvCAsETx3pCzu1UzWI Cc1PHB5fzd4DlTD9STihqEbcUvKdCcAzJC8qSSJ4096flgcpNHrpTf8oYS56 bqaJcXq53C_vSMi__vayYq.ehRfP8P7841hlebB2j97BRPry7roLpOnMYl5D s1tJFdreS5rb3crHS9KU_8GniasfSUP7rYxnYRJHKgb8HOaRoeHEXYkA03x6 m7rqSbCUsO7dRwyhS1qcegt6yFaD7Or78_3311eJrMSH7Ge0anaVeN67p1h8 - Received: from [71.135.166.173] by web33508.mail.mud.yahoo.com via HTTP; Thu, 13 Jan 2011 08:51:22 PST X-Mailer: YahooMailWebService/0.8.107.285259 Date: Thu, 13 Jan 2011 08:51:22 -0800 (PST) From: Raj V Reply-To: Raj V Subject: Re: TeraSort question. To: "common-user@hadoop.apache.org" MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="0-756794436-1294937482=:94450" --0-756794436-1294937482=:94450 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable Steve=0A=0ALet me plot the graphs for all the nodes. I picked up 6 random n= odes out oif 480 and 2 of these were really busy and the otehr 4 were idle.= Either that makes me very lucky or the cluster was underutilized.=0A=0AI w= ould have found it acceptable if different nodes were utilized in different= ways, but in my case , 2 nodes had serious CPU , Network and Disk activity= and others=A0 were completely idle.=0A=0A=0A=0A=0A=0A=0A=0A=0AFrom: Steve = Loughran =0ATo: common-user@hadoop.apache.org=0ACc: =0AS= ent: Thursday, January 13, 2011 3:05 AM=0ASubject: Re: TeraSort question.= =0A=0AOn 11/01/11 16:40, Raj V wrote:=0A> Ted=0A> =0A> =0A> Thanks. I have = all the graphs I need that include, map reduce timeline, system activity fo= r all the nodes when the sort was running. I will publish them once I have = them in some presentable format.,=0A> =0A> For legal reasons, I really don'= t want to send the complete job histiory files.=0A> =0A> My question is sti= ll this. When running terasort, would the CPU, disk and network utilization= of all the nodes be more or less similar or completely different.=0A=0AThe= y can be different. The JT pushes out work to machines when they report in,= some may get more work than others, so generate more local data. This will= have follow-on consequences. In a live system things are different as the = work tends to follow the data, so machines with (or near) the data you need= get the work.=0A=0AIt's a really hard thing to say "is the cluster working= right", when bringing it up, everyone is really guessing about expected pe= rformance.=0A=0A-Steve --0-756794436-1294937482=:94450--