Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (nike.apache.org: domain of russell.jurney@gmail.com
 designates 209.85.216.41 as permitted sender)
References: 
 <CAMWNowE3vj3=Omc5MUm1S1qtF5x6OUst6bx1=9Su1zXKkeDDUQ@mail.gmail.com>
 <CAAu13zE+HFTRjH=6N=hB-jgPxSTanPA3RTVWk7EsCcV2yumCzg@mail.gmail.com>
 <4215577957656723663@unknownmsgid>
 <CAND0qzuPBjAi+5RfWoiNKdKF1wUX5mySOs6Lt7jgBJfyumtZ9w@mail.gmail.com>
From: Russell Jurney <russell.jurney@gmail.com>
In-Reply-To: 
 <CAND0qzuPBjAi+5RfWoiNKdKF1wUX5mySOs6Lt7jgBJfyumtZ9w@mail.gmail.com>
Mime-Version: 1.0 (1.0)
Date: Thu, 11 Oct 2012 12:03:40 -0700
Message-ID: <5658151512625454836@unknownmsgid>
Subject: Re: Why they recommend this (CPU) ?
To: "user@hadoop.apache.org" <user@hadoop.apache.org>
Content-Type: multipart/alternative; boundary=20cf3074d876b7cb7a04cbcd3ac5

--20cf3074d876b7cb7a04cbcd3ac5
Content-Type: text/plain; charset=ISO-8859-1

My own clusters are too temporary and virtual for me to notice. I haven't
thought of clock speed as mattering in a long time, so I'm curious what
kinds of use cases benefit from faster cores. Is there a category of
workloads where this sweet spot for faster cores occurs?

Russell Jurney http://datasyndrome.com

On Oct 11, 2012, at 11:39 AM, Ted Dunning <tdunning@maprtech.com> wrote:

You should measure your workload.  Your experience will vary dramatically
with different computations.

On Thu, Oct 11, 2012 at 10:56 AM, Russell Jurney
<russell.jurney@gmail.com> wrote:

> Anyone got data on this? This is interesting, and somewhat
> counter-intuitive.
>
> Russell Jurney http://datasyndrome.com
>
> On Oct 11, 2012, at 10:47 AM, Jay Vyas <jayunit100@gmail.com> wrote:
>
> > Presumably, if you have a reasonable number of cores - speeding the
> > cores up will be better than forking a task into smaller and smaller
> > chunks - because at some point the overhead of multiple processes would
> > be a bottleneck - maybe due to streaming reads and writes?  I'm sure
> > each and every problem has a different sweet spot.
>

--20cf3074d876b7cb7a04cbcd3ac5--