Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: neutral (nike.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: <CC9C6FC3.16A03%goldstone1@llnl.gov>
References: 
 <CANS822icmNqzXzBu3sWSGX34vKz0WSP58zW21EcwJ9YycMTo_A@mail.gmail.com>
 <CC9C6FC3.16A03%goldstone1@llnl.gov>
From: Ted Dunning <tdunning@maprtech.com>
Date: Thu, 11 Oct 2012 12:56:56 -0700
Message-ID: 
 <CAND0qzsQvU9R77yVcjpx1xrkhGBPe=0vqxTnW=hfFYAqLE+ZDg@mail.gmail.com>
Subject: Re: Why they recommend this (CPU) ?
To: user@hadoop.apache.org
Content-Type: multipart/alternative; boundary=f46d043be10e6bee8704cbcdfac6

--f46d043be10e6bee8704cbcdfac6
Content-Type: text/plain; charset=ISO-8859-1

Like I said, measure twice, cut once.

On Thu, Oct 11, 2012 at 12:47 PM, Goldstone, Robin J.
<goldstone1@llnl.gov>wrote:

>  Be sure you are comparing apples to apples.  The E5-2650 has a larger
> cache than the E5-2640, faster system bus and can support faster (1600Ghz
> vs 1333Ghz) DRAM resulting in greater potential memory bandwidth.
>
>  http://ark.intel.com/compare/64590,64591
>
>
>   From: Patrick Angeles <patrick@cloudera.com>
> Reply-To: "user@hadoop.apache.org" <user@hadoop.apache.org>
> Date: Thursday, October 11, 2012 12:36 PM
> To: "user@hadoop.apache.org" <user@hadoop.apache.org>
> Subject: Re: Why they recommend this (CPU) ?
>
>   If you look at comparable Intel parts:
>
>  Intel E5-2640
> 6 cores @ 2.5 Ghz
> 95W - $885
>
>  Intel E5-2650
> 8 cores @ 2.0 Ghz
> 95W - $1107
>
>  So, for $400 more on a dual proc system -- which really isn't much --
> you get 2 more cores for a 20% drop in speed. I can believe that for some
> scenarios, the faster cores would fare better. Gzip compression is one that
> comes to mind, where you are aggressively trading CPU for lower storage
> volume and IO. An HBase cluster is another example.
>
> On Thu, Oct 11, 2012 at 3:03 PM, Russell Jurney <russell.jurney@gmail.com>wrote:
>
>>  My own clusters are too temporary and virtual for me to notice. I
>> haven't thought of clock speed as having mattered in a long time, so I'm
>> curious what kind of use cases might benefit from faster cores. Is there a
>> category in some way where this sweet spot for faster cores occurs?
>>
>> Russell Jurney http://datasyndrome.com
>>
>> On Oct 11, 2012, at 11:39 AM, Ted Dunning <tdunning@maprtech.com> wrote:
>>
>>   You should measure your workload.  Your experience will vary
>> dramatically with different computations.
>>
>> On Thu, Oct 11, 2012 at 10:56 AM, Russell Jurney <
>> russell.jurney@gmail.com> wrote:
>>
>>> Anyone got data on this? This is interesting, and somewhat
>>> counter-intuitive.
>>>
>>> Russell Jurney http://datasyndrome.com
>>>
>>> On Oct 11, 2012, at 10:47 AM, Jay Vyas <jayunit100@gmail.com> wrote:
>>>
>>> > Presumably, if you have a reasonable number of cores - speeding the
>>> cores up will be better than forking a task into smaller and smaller chunks
>>> - because at some point the overhead of multiple processes would be a
>>> bottleneck - maybe due to streaming reads and writes?  I'm sure each and
>>> every problem has a different sweet spot.
>>>
>>
>>
>

--f46d043be10e6bee8704cbcdfac6
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Like I said, measure twice, cut once.<br><br><div class=3D"gmail_quote">On =
Thu, Oct 11, 2012 at 12:47 PM, Goldstone, Robin J. <span dir=3D"ltr">&lt;<a=
 href=3D"mailto:goldstone1@llnl.gov" target=3D"_blank">goldstone1@llnl.gov<=
/a>&gt;</span> wrote:<br>

<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">


<div style=3D"font-size:14px;font-family:Calibri,sans-serif;word-wrap:break=
-word">
<div>Be sure you are comparing apples to apples. =A0The E5-2650 has a large=
r cache than the E5-2640, faster system bus and can support faster (1600Ghz=
 vs 1333Ghz) DRAM resulting in greater potential memory bandwidth.</div>


<div><br>
</div>
<div><a href=3D"http://ark.intel.com/compare/64590,64591" target=3D"_blank"=
>http://ark.intel.com/compare/64590,64591</a></div>
<div><br>
</div>
<div><br>
</div>
<span>
<div style=3D"border-right:medium none;padding-right:0in;padding-left:0in;p=
adding-top:3pt;text-align:left;font-size:11pt;border-bottom:medium none;fon=
t-family:Calibri;border-top:#b5c4df 1pt solid;padding-bottom:0in;border-lef=
t:medium none">


<span style=3D"font-weight:bold">From: </span>Patrick Angeles &lt;<a href=
=3D"mailto:patrick@cloudera.com" target=3D"_blank">patrick@cloudera.com</a>=
&gt;<br>
<span style=3D"font-weight:bold">Reply-To: </span>&quot;<a href=3D"mailto:u=
ser@hadoop.apache.org" target=3D"_blank">user@hadoop.apache.org</a>&quot; &=
lt;<a href=3D"mailto:user@hadoop.apache.org" target=3D"_blank">user@hadoop.=
apache.org</a>&gt;<br>


<span style=3D"font-weight:bold">Date: </span>Thursday, October 11, 2012 12=
:36 PM<br>
<span style=3D"font-weight:bold">To: </span>&quot;<a href=3D"mailto:user@ha=
doop.apache.org" target=3D"_blank">user@hadoop.apache.org</a>&quot; &lt;<a =
href=3D"mailto:user@hadoop.apache.org" target=3D"_blank">user@hadoop.apache=
.org</a>&gt;<br>


<span style=3D"font-weight:bold">Subject: </span>Re: Why they recommend thi=
s (CPU) ?<br>
</div><div><div class=3D"h5">
<div><br>
</div>
<div>
<div>
<div>If you look at comparable Intel parts:</div>
<div><br>
</div>
<div>Intel E5-2640</div>
<div>6 cores @ 2.5 Ghz</div>
<div>95W - $885</div>
<div><br>
</div>
<div>Intel E5-2650</div>
<div>8 cores @ 2.0 Ghz</div>
<div>95W -=A0$1107</div>
<div><br>
</div>
<div>So, for $400 more on a dual proc system -- which really isn&#39;t much=
 -- you get 2 more cores for a 20% drop in speed. I can believe that for so=
me scenarios, the faster cores would fare better. Gzip compression is one t=
hat comes to mind, where you are aggressively
 trading CPU for lower storage volume and IO. An HBase cluster is another e=
xample.</div>
<div><br>
<div class=3D"gmail_quote">On Thu, Oct 11, 2012 at 3:03 PM, Russell Jurney =
<span dir=3D"ltr">
&lt;<a href=3D"mailto:russell.jurney@gmail.com" target=3D"_blank">russell.j=
urney@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">
<div bgcolor=3D"#FFFFFF">
<div>My own clusters are too temporary and virtual for me to notice. I have=
n&#39;t thought of clock speed as having mattered in a long time, so I&#39;=
m curious what kind of use cases might benefit from faster cores. Is there =
a category in some way where this sweet
 spot for faster cores occurs?<br>
<br>
Russell Jurney <a href=3D"http://datasyndrome.com" target=3D"_blank">http:/=
/datasyndrome.com</a></div>
<div>
<div>
<div><br>
On Oct 11, 2012, at 11:39 AM, Ted Dunning &lt;<a href=3D"mailto:tdunning@ma=
prtech.com" target=3D"_blank">tdunning@maprtech.com</a>&gt; wrote:<br>
<br>
</div>
<div></div>
<blockquote type=3D"cite">
<div>You should measure your workload. =A0Your experience will vary dramati=
cally with different computations.<br>
<br>
<div class=3D"gmail_quote">On Thu, Oct 11, 2012 at 10:56 AM, Russell Jurney=
 <span dir=3D"ltr">
&lt;<a href=3D"mailto:russell.jurney@gmail.com" target=3D"_blank">russell.j=
urney@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">
Anyone got data on this? This is interesting, and somewhat counter-intuitiv=
e.<br>
<br>
Russell Jurney <a href=3D"http://datasyndrome.com" target=3D"_blank">http:/=
/datasyndrome.com</a><br>
<div>
<div><br>
On Oct 11, 2012, at 10:47 AM, Jay Vyas &lt;<a href=3D"mailto:jayunit100@gma=
il.com" target=3D"_blank">jayunit100@gmail.com</a>&gt; wrote:<br>
<br>
&gt; Presumably, if you have a reasonable number of cores - speeding the co=
res up will be better than forking a task into smaller and smaller chunks -=
 because at some point the overhead of multiple processes would be a bottle=
neck - maybe due to streaming reads
 and writes? =A0I&#39;m sure each and every problem has a different sweet s=
pot.<br>
</div>
</div>
</blockquote>
</div>
<br>
</div>
</blockquote>
</div>
</div>
</div>
</blockquote>
</div>
<br>
</div>
</div>
</div>
</div></div></span>
</div>

</blockquote></div><br>

--f46d043be10e6bee8704cbcdfac6--