Subject: Re: One petabyte of data loading into HDFS within 10 min.
From: Michael Segel <michael_segel@hotmail.com>
Date: Mon, 10 Sep 2012 06:50:17 -0500
To: user@hadoop.apache.org

On Sep 10, 2012, at 2:40 AM, prabhu K wrote:

> Hi Users,
>
> Thanks for the response.
>
> We have loaded 100 GB of data into HDFS; it took 1 hr with the configuration below.
>
> Each node (1 machine is the master, 2 machines are slaves):
>
> 1. 500 GB hard disk.
> 2. 4 GB RAM.
> 3. 3 quad-core CPUs.
> 4. Speed 1333 MHz.
>
> Now we are planning to load 1 petabyte of data (a single file) into Hadoop HDFS and a Hive table within 10-20 minutes. For this we need clarification on the points below.

Ok...

Some say that I am sometimes too harsh in my criticisms, so take what I say with a grain of salt...

You loaded 100 GB in an hour using woefully underperforming hardware and are now saying you want to load 1 PB in 10 mins.

I would strongly suggest that you first learn more about Hadoop. No, really. Looking at your first machine, it's obvious that you don't really grok Hadoop and what it requires to achieve optimum performance. You couldn't even extrapolate any meaningful data from your current environment.

Secondly, I think you need to actually think about the problem. Did you mean PB or TB? Because your math seems to be off by a couple of orders of magnitude.

A single file measured in PBs? That is currently impossible using today's (2012) technology. In fact, a single file measured in PBs won't exist within the next 5 years, and most likely not within the next decade. [Moore's law is all about CPU power, not disk density.]

Also take a look at networking.
ToR switch designs differ, but with current technology the fabric tends to max out at around 40 Gb/s.
What's the widest fabric on a backplane?
That's your first bottleneck, because even if you had 1 PB of data, you couldn't feed it to the cluster fast enough.

Forget disk; look at PCIe-based memory. (Money is no object, right?)
You still couldn't populate it fast enough.

I guess Steve hit the nail on the head when he talked about this being a homework assignment.

High school, maybe?

> 1. What system configuration setup is required for all 3 machines?
>
> 2. Hard disk size.
>
> 3. RAM size.
>
> 4. Motherboard.
>
> 5. Network cable.
>
> 6. How much Gbps InfiniBand is required?
>
> For the same setup, do we need a cloud computing environment too?
>
> Please suggest and help me on this.
>
> Thanks,
>
> Prabhu.
>
> On Fri, Sep 7, 2012 at 7:30 PM, Michael Segel <michael_segel@hotmail.com> wrote:
> Sorry, but you didn't account for the network saturation.
>
> And why 1 GbE and not 10 GbE? Also, which version of Hadoop?
>
> Here MapR works well with bonding two 10 GbE ports, and with the right switch you could do ok.
> Also 2 ToR switches... per rack, etc...
>
> How many machines? 150? 300? More?
>
> Then you don't talk about how much memory, CPUs, what type of storage...
>
> Lots of factors.
>
> I'm sorry to interrupt this mental masturbation about how to load 1 PB in 10 min.
> There are a lot more questions that should be asked that weren't.
>
> Hey, but look. It's a Friday, so I suggest some pizza and beer, and then take it to a whiteboard.
>
> But what do I know? In a different thread, I'm talking about how to tame HR and Accounting so they let me play with my team Ninja!
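[A back-of-envelope sketch of the two bottlenecks argued above; these numbers are my own illustration, not from the thread, and use decimal units (1 PB = 10^15 bytes) with no protocol overhead:]

```python
# Two sanity checks on the "1 PB in 10 minutes" goal (illustrative only).

PB = 1e15   # bytes, decimal units
GB = 1e9    # bytes

# 1) Extrapolate the reported load rate: 100 GB took 1 hour.
rate_bytes_per_s = 100 * GB / 3600
time_for_1pb_days = (PB / rate_bytes_per_s) / 86400
print(f"At 100 GB/hr, loading 1 PB takes ~{time_for_1pb_days:.0f} days")  # ~417 days

# 2) Even if the data were staged and ready, a single 40 Gb/s switch
#    fabric cannot ingest 1 PB in anything close to 10 minutes.
fabric_bits_per_s = 40e9
time_through_fabric_hours = (PB * 8) / fabric_bits_per_s / 3600
print(f"1 PB through a 40 Gb/s fabric: ~{time_through_fabric_hours:.1f} hours")  # ~55.6 hours
```

Either number alone is roughly three to four orders of magnitude away from the 10-minute target, which is the point being made here.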
> :-P
>
> On Sep 5, 2012, at 9:56 AM, zGreenfelder <zgreenfelder@gmail.com> wrote:
>
> > On Wed, Sep 5, 2012 at 10:43 AM, Cosmin Lehene <clehene@adobe.com> wrote:
> >> Here's an extremely naïve ballpark estimation: at theoretical hardware
> >> speed, for 3 PB representing 1 PB with 3x replication.
> >>
> >> Over a single 1 Gbps connection (and I'm not sure you can actually reach
> >> 1 Gbps):
> >> (3 petabytes) / (1 Gbps) = 291.271111 days
> >>
> >> So you'd need at least 40,000 1 Gbps network cards to get that in 10 minutes
> >> :) - (3 PB / 1 Gbps) / 40,000
> >>
> >> The actual number of nodes would depend a lot on the actual network
> >> architecture, the type of storage you use (SSD, HDD), etc.
> >>
> >> Cosmin
> >
> > Ah, I went the other direction with the math, and assumed no
> > replication (completely unsafe and never reasonable for a real
> > production environment, but since we're all theory and just looking
> > for starting-point numbers):
> >
> > 1 PB in 10 min ==
> > 1,000,000 GB in 10 min ==
> > 8,000,000 Gb in 600 seconds
> >
> > 8,000,000 / 600 ~= 14k machines running at gigabit, or about 1.5k machines if you
> > get 10 Gb connected machines.
> >
> > All assuming there's no network or cluster sync overhead
> > (of course there would be).
> >
> > That seems like some pretty deep pockets to get to a < 10 minute load
> > time for that much data.
> >
> > I could also be off; I just threw some stuff together somewhat
> > quickly between conf calls.
> >
> > --
> > Even the Magic 8 ball has an opinion on email clients: Outlook not so good.
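[The two quoted estimates agree once the units line up; a sketch of mine, not from the thread, again using decimal units and ignoring replication-pipeline and protocol overhead:]

```python
# Reproducing the thread's ballpark NIC counts for "1 PB in 10 minutes".

PB_BITS = 1e15 * 8   # 1 PB in bits, decimal units
WINDOW_S = 10 * 60   # the 10-minute target

# Aggregate ingest rate the cluster edge must sustain.
required_gbps = PB_BITS / WINDOW_S / 1e9
print(f"Aggregate ingest rate needed: ~{required_gbps:,.0f} Gb/s")  # ~13,333 Gb/s

# Without replication (zGreenfelder's direction):
nics_1g = required_gbps / 1     # ~13,333 gigabit NICs -> "about 14k machines"
nics_10g = required_gbps / 10   # ~1,333 10 GbE NICs  -> "about 1.5k machines"

# With 3x replication you move 3 PB over the wire (Cosmin's direction):
nics_1g_repl = 3 * required_gbps
print(f"1 GbE NICs: ~{nics_1g:,.0f}; 10 GbE: ~{nics_10g:,.0f}; "
      f"with 3x replication: ~{nics_1g_repl:,.0f} 1 Gbps cards")
```

That reconciles the "14k machines" and "40,000 network cards" figures: they are the same arithmetic, with and without the 3x replication factor.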