From: shashwat shriparv <dwivedishashwat@gmail.com>
Date: Thu, 4 Apr 2013 00:19:23 +0530
Subject: Re: NameNode failure and recovery!
To: user@hadoop.apache.org

If you are not in a position to go for HA, just keep your checkpoint period short so that recent data is recoverable from the SNN. You also always have the option of

  hadoop namenode -recover

Try this on a test cluster first and get versed in it. And take a backup of the image on some solid-state storage.

∞
Shashwat Shriparv

On Wed, Apr 3, 2013 at 9:56 PM, Harsh J wrote:
> There is a 3rd, most excellent way: Use HDFS's own HA, see
> http://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/HDFSHighAvailabilityWithQJM.html
> :)
>
> On Wed, Apr 3, 2013 at 8:10 PM, Rahul Bhattacharjee wrote:
> > Hi all,
> >
> > I was reading about Hadoop and learned that there are two ways to
> > protect against NameNode failures:
> >
> > 1) Write to an NFS mount along with the usual local disk.
> >  -or-
> > 2) Use a secondary NameNode (SNN). In case of failure of the NN, the SNN can
> > take charge.
> >
> > My questions:
> >
> > 1) The SNN is always lagging, so when the SNN becomes primary in the event of
> > an NN failure, the edits which have not yet been merged into the image file
> > would be lost, so the state of the SNN would not be consistent with the NN
> > before its failure.
> >
> > 2) I have also read that the other purpose of the SNN is to periodically
> > merge the edit logs with the image file. If a setup goes with option #1
> > (writing to NFS, no SNN), then who does this merging?
> >
> > Thanks,
> > Rahul
>
> --
> Harsh J
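The two suggestions above (a shorter checkpoint period, then `hadoop namenode -recover` as a last resort) can be sketched roughly as below. This is a hedged illustration, not a recipe: the property name `dfs.namenode.checkpoint.period` is the Hadoop 2.x name (older releases used `fs.checkpoint.period`), the file path is a stand-in, and the real `-recover` run must be done against an actual (test) cluster's metadata directories.

```shell
# Sketch: a shorter checkpoint interval means the SNN's merged fsimage
# lags the live NameNode by less, so less edit history is at risk.
# Snippet that would go into hdfs-site.xml (value is in seconds;
# 600 = checkpoint every 10 minutes instead of the 1-hour default).
cat > /tmp/checkpoint-snippet.xml <<'EOF'
<property>
  <name>dfs.namenode.checkpoint.period</name>
  <value>600</value>
</property>
EOF

# Last-resort metadata recovery (interactive; practice on a test cluster):
#   hadoop namenode -recover
# We only echo it here rather than run it, since it needs real NN dirs.
echo "hadoop namenode -recover"

grep -c 'dfs.namenode.checkpoint.period' /tmp/checkpoint-snippet.xml
```

The trade-off is that more frequent checkpoints cost extra NameNode I/O and SNN transfer bandwidth, so "shorter" should still be sized to the cluster.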
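Shashwat's closing advice, backing up the image to separate solid-state storage, can be sketched as a simple dated archive of the NameNode metadata directory. The paths here are assumptions for illustration; the real directory is whatever `dfs.namenode.name.dir` (or `dfs.name.dir` on older releases) points at, and the backup should be taken from a consistent checkpoint, not mid-write.

```shell
# Sketch: archive the NameNode metadata dir (paths are stand-ins).
NAME_DIR=/tmp/demo-name-dir        # pretend dfs.namenode.name.dir
BACKUP_DIR=/tmp/demo-backup        # pretend SSD-backed backup target
mkdir -p "$NAME_DIR/current" "$BACKUP_DIR"
touch "$NAME_DIR/current/fsimage"  # stand-in for the real image file

# One dated tarball per backup run, taken from outside the name dir.
tar -czf "$BACKUP_DIR/fsimage-$(date +%Y%m%d).tar.gz" -C "$NAME_DIR" current

ls "$BACKUP_DIR"
```

A cron entry invoking something like this gives a coarse but cheap safety net even without HA or an SNN.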