Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (nike.apache.org: domain of daemeonr@gmail.com designates
 209.85.213.172 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <AF470DA13D70B740ACE9277A448409F3100044DD@SF1-EXMBX-2.ad.savagebeast.com>
References: 
 <CAJwfZdqtQ8qm+L57DWWnobusVcWD_XJebPwLeaeUv0_h_h_i_w@mail.gmail.com>
 <AF470DA13D70B740ACE9277A448409F3100044DD@SF1-EXMBX-2.ad.savagebeast.com>
From: daemeon reiydelle <daemeonr@gmail.com>
Date: Sun, 14 Dec 2014 14:28:14 -0800
Message-ID: 
 <CAOUOv0EORKJtLDwm0meSjtm3FLTENBcfZ2fu0xQX3kbGhY2qKQ@mail.gmail.com>
Subject: Re: What happens to data nodes when name node has failed for long
 time?
To: user@hadoop.apache.org
Content-Type: multipart/alternative; boundary=bcaec51d227a1abaa3050a34a688

--bcaec51d227a1abaa3050a34a688
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

I found the terminology of primary and secondary to be a bit confusing in
describing operation after a failure scenario. Perhaps it is helpful to
think that the Hadoop instance is guided to select a node as primary for
normal operation. If that node fails, then the backup becomes the new
primary. In analyzing traffic it appears that the restored node does not
become primary again until the whole instance restarts. I myself would
welcome clarification on this observed behavior.


*.......*


*=E2=80=9CLife should not be a journey to the grave with the intention of a=
rriving
safely in apretty and well preserved body, but rather to skid in broadside
in a cloud of smoke,thoroughly used up, totally worn out, and loudly
proclaiming =E2=80=9CWow! What a Ride!=E2=80=9D - Hunter ThompsonDaemeon C.=
M. ReiydelleUSA
(+1) 415.501.0198London (+44) (0) 20 8144 9872*

On Fri, Dec 12, 2014 at 7:56 AM, Rich Haase <rhaase@pandora.com> wrote:

>   The remaining cluster services will continue to run.  That way when the
> namenode (or other failed processes) is restored the cluster will resume
> healthy operation.  This is part of hadoop=E2=80=99s ability to handle ne=
twork
> partition events.
>
>  *Rich Haase* | Sr. Software Engineer | Pandora
> m 303.887.1146 | rhaase@pandora.com
>
>   From: Chandrashekhar Kotekar <shekhar.kotekar@gmail.com>
> Reply-To: "user@hadoop.apache.org" <user@hadoop.apache.org>
> Date: Friday, December 12, 2014 at 3:57 AM
> To: "user@hadoop.apache.org" <user@hadoop.apache.org>
> Subject: What happens to data nodes when name node has failed for long
> time?
>
>   Hi,
>
>  What happens if name node has crashed for more than one hour but
> secondary name node, all the data nodes, job tracker, task trackers are
> running fine? Do those daemon services also automatically shutdown after
> some time? Or those services keep running hoping for namenode to come bac=
k?
>
> Regards,
> Chandrash3khar Kotekar
> Mobile - +91 8600011455
>

--bcaec51d227a1abaa3050a34a688
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div class=3D"gmail_default" style=3D"font-family:comic sa=
ns ms,sans-serif;color:rgb(7,55,99)">I found the terminology of primary and=
 secondary to be a bit confusing in describing operation after a failure sc=
enario. Perhaps it is helpful to think that the Hadoop instance is guided t=
o select a node as primary for normal operation. If that node fails, then t=
he backup becomes the new primary. In analyzing traffic it appears that the=
 restored node does not become primary again until the whole instance resta=
rts. I myself would welcome clarification on this observed behavior.<br></d=
iv></div><div class=3D"gmail_extra"><br clear=3D"all"><div><div class=3D"gm=
ail_signature"><div dir=3D"ltr"><div><div dir=3D"ltr"><span style=3D"color:=
rgb(56,118,29)"><span style=3D"background-color:rgb(255,255,255)"><b><span =
style=3D"font-family:comic sans ms,sans-serif"></span></b></span></span><sp=
an style=3D"color:rgb(56,118,29)"><span style=3D"background-color:rgb(255,2=
55,255)"><b><span style=3D"font-family:comic sans ms,sans-serif"><br>......=
.<br></span></b></span></span><span style=3D"color:rgb(56,118,29)"><span st=
yle=3D"background-color:rgb(255,255,255)"><b><span style=3D"font-family:com=
ic sans ms,sans-serif"><strong>=E2=80=9CLife should not be a journey to the=
 grave with the intention of
 arriving safely in a<br>pretty and well preserved body, but rather to skid
 in broadside in a cloud of smoke,<br>thoroughly used up, totally worn out,
 and loudly proclaiming =E2=80=9CWow! What a Ride!=E2=80=9D</strong> <br>- =
Hunter Thompson<br><br>Daemeon C.M. Reiydelle<br>USA (+1) 415.501.0198<br>L=
ondon (+44) (0) 20 8144 9872</span></b></span></span><font size=3D"1"><i><b=
r></i></font></div></div></div></div></div>
<br><div class=3D"gmail_quote">On Fri, Dec 12, 2014 at 7:56 AM, Rich Haase =
<span dir=3D"ltr">&lt;<a href=3D"mailto:rhaase@pandora.com" target=3D"_blan=
k">rhaase@pandora.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_q=
uote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1e=
x">


<div style=3D"word-wrap:break-word;color:rgb(0,0,0);font-size:14px;font-fam=
ily:Calibri,sans-serif">
<div>
<div>
<div>The remaining cluster services will continue to run.=C2=A0 That way wh=
en the namenode (or other failed processes) is restored the cluster will re=
sume healthy operation.=C2=A0 This is part of hadoop=E2=80=99s ability to h=
andle network partition events.</div>
<div>=C2=A0</div>
<div>
<div style=3D"font-family:&#39;Microsoft Sans Serif&#39;,sans-serif"><font =
face=3D"Arial"><span style=3D"font-size:12px"><b>Rich Haase</b>=C2=A0| Sr. =
Software Engineer | Pandora</span></font></div>
<div style=3D"font-family:&#39;Microsoft Sans Serif&#39;,sans-serif"><span =
style=3D"font-size:12px">m <a href=3D"tel:303.887.1146" value=3D"+130388711=
46" target=3D"_blank">303.887.1146</a> | <a href=3D"mailto:rhaase@pandora.c=
om" target=3D"_blank">rhaase@pandora.com</a></span></div>
</div>
</div>
</div>
<div><br>
</div>
<span>
<div style=3D"font-family:Calibri;font-size:11pt;text-align:left;color:blac=
k;BORDER-BOTTOM:medium none;BORDER-LEFT:medium none;PADDING-BOTTOM:0in;PADD=
ING-LEFT:0in;PADDING-RIGHT:0in;BORDER-TOP:#b5c4df 1pt solid;BORDER-RIGHT:me=
dium none;PADDING-TOP:3pt">
<span style=3D"font-weight:bold">From: </span>Chandrashekhar Kotekar &lt;<a=
 href=3D"mailto:shekhar.kotekar@gmail.com" target=3D"_blank">shekhar.koteka=
r@gmail.com</a>&gt;<br>
<span style=3D"font-weight:bold">Reply-To: </span>&quot;<a href=3D"mailto:u=
ser@hadoop.apache.org" target=3D"_blank">user@hadoop.apache.org</a>&quot; &=
lt;<a href=3D"mailto:user@hadoop.apache.org" target=3D"_blank">user@hadoop.=
apache.org</a>&gt;<br>
<span style=3D"font-weight:bold">Date: </span>Friday, December 12, 2014 at =
3:57 AM<br>
<span style=3D"font-weight:bold">To: </span>&quot;<a href=3D"mailto:user@ha=
doop.apache.org" target=3D"_blank">user@hadoop.apache.org</a>&quot; &lt;<a =
href=3D"mailto:user@hadoop.apache.org" target=3D"_blank">user@hadoop.apache=
.org</a>&gt;<br>
<span style=3D"font-weight:bold">Subject: </span>What happens to data nodes=
 when name node has failed for long time?<br>
</div><div><div class=3D"h5">
<div><br>
</div>
<div>
<div>
<div dir=3D"ltr">Hi,
<div><br>
</div>
<div><span style=3D"font-size:13px">What happens if name node has crashed f=
or more than one hour but secondary name node, all the data nodes, job trac=
ker, task trackers are running fine? Do those daemon services also automati=
cally shutdown after some time? Or
 those services keep running hoping for namenode to come back?</span><br st=
yle=3D"font-size:13px" clear=3D"all">
<div style=3D"font-size:13px">
<div></div>
</div>
</div>
<div>
<div>
<div dir=3D"ltr"><br>
<span style=3D"color:rgb(102,0,0)">Regards,</span><br>
Chandrash3khar Kotekar<br>
Mobile - <a href=3D"tel:%2B91%208600011455" value=3D"+918600011455" target=
=3D"_blank">+91 8600011455</a><br>
</div>
</div>
</div>
</div>
</div>
</div>
</div></div></span>
</div>

</blockquote></div><br></div>

--bcaec51d227a1abaa3050a34a688--