Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (nike.apache.org: domain of craig.munro@gmail.com
 designates 209.85.214.45 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CANYAmxkzf4==HGt6RjK9KCbbvHPyUc74VHRsinwQHQ6n8kLJ4w@mail.gmail.com>
References: 
 <CANYAmxn83hKkkVC8ReGrB7U5anJG=mM969i9HMfF8t0NOs_Czw@mail.gmail.com>
	<CAOcnVr0tNt_4rzYeSNYXSbak14oqQYsDepCCgjsBxB6nsVHMCA@mail.gmail.com>
	<CANYAmxkzf4==HGt6RjK9KCbbvHPyUc74VHRsinwQHQ6n8kLJ4w@mail.gmail.com>
Date: Fri, 28 Dec 2012 10:08:46 +0000
Message-ID: 
 <CAJ9Pg5dyFBi2Yw11TLgbZHr+3E79anCFr8bbA0WmeCPcx4uU4w@mail.gmail.com>
Subject: Re: question about ZKFC daemon
From: Craig Munro <craig.munro@gmail.com>
To: user@hadoop.apache.org
Content-Type: multipart/alternative; boundary=0015175907a46839e404d1e6d9a8

--0015175907a46839e404d1e6d9a8
Content-Type: text/plain; charset=ISO-8859-1

You need the following:

- active namenode + zkfc
- standby namenode + zkfc
- pool of journal nodes (odd number, 3 or more)
- pool of zookeeper nodes (odd number, 3 or more)

As the journal nodes hold the namesystem transactions they should not be
co-located with the namenodes in case of failure.  I distribute the journal
and zookeeper nodes across the hosts running datanodes or as Harsh says you
could co-locate them on dedicated hosts.

ZKFC does not monitor the JobTracker.

Regards,
Craig
On Dec 28, 2012 9:25 AM, "ESGLinux" <esggrupos@gmail.com> wrote:

> Hi,
>
> well, If I have understand you I can configure my NN HA cluster this way:
>
> - Active NameNode + 1 ZKFC daemon + Journal Node
> - Standby NameNode + 1 ZKFC daemon + Journal Node
> - JobTracker node + 1 ZKFC daemon + Journal Node,
>
> Is this right?
>
> Thanks in advance,
>
> ESGLinux,
>
> 2012/12/27 Harsh J <harsh@cloudera.com>
>
>> Hi,
>>
>> There are two different things here: Automatic Failover and Quorum
>> Journal Manager. The former, used via a ZooKeeper Failover Controller,
>> is to manage failovers automatically (based on health checks of NNs).
>> The latter, used via a set of Journal Nodes, is a medium of shared
>> storage for namesystem transactions that helps enable HA.
>>
>> In a typical deployment, you want 3 or more (odd) JournalNodes for
>> reliable HA, preferably on nodes of their own if possible (like you
>> would for typical ZooKeepers, and you may co-locate with those as
>> well) and one ZKFC for each NameNode (connected to the same ZK
>> quorum).
>>
>> On Thu, Dec 27, 2012 at 5:33 PM, ESGLinux <esggrupos@gmail.com> wrote:
>> > Hi all,
>> >
>> > I have a doubt about how to deploy the Zookeeper in a NN HA  cluster,
>> >
>> > As far as I know, I need at least three nodes to run three ZooKeeper
>> > FailOver Controller (ZKFC). I plan to put these 3 daemons this way:
>> >
>> > - Active NameNode + 1 ZKFC daemon
>> > - Standby NameNode + 1 ZKFC daemon
>> > - JobTracker node + 1 ZKFC daemon, (is this right?)
>> >
>> > so the quorum is formed with these three nodes. The nodes that runs a
>> > namenode are right because the ZKFC monitors it, but what does the third
>> > daemon?
>> >
>> > as I read from this url:
>> >
>> https://ccp.cloudera.com/display/CDH4DOC/Software+Configuration+for+Quorum-based+Storage#SoftwareConfigurationforQuorum-basedStorage-AutomaticFailoverConfiguration
>> >
>> > this daemons are only related with NameNodes, (Health monitoring - the
>> ZKFC
>> > pings its local NameNode on a periodic basis with a health-check
>> command.)
>> > so what does the third ZKFC? I used the jobtracker node but I could use
>> > another node without any daemon on it...
>> >
>> > Thanks in advance,
>> >
>> > ESGLInux,
>> >
>> >
>> >
>>
>>
>>
>> --
>> Harsh J
>>
>
>

--0015175907a46839e404d1e6d9a8
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<p>You need the following:</p>
<p>- active namenode + zkfc<br>
- standby namenode + zkfc<br>
- pool of journal nodes (odd number, 3 or more)<br>
- pool of zookeeper nodes (odd number, 3 or more)</p>
<p>As the journal nodes hold the namesystem transactions they should not be=
 co-located with the namenodes in case of failure.=A0 I distribute the jour=
nal and zookeeper nodes across the hosts running datanodes or as Harsh says=
 you could co-locate them on dedicated hosts.</p>

<p>ZKFC does not monitor the JobTracker.</p>
<p>Regards,<br>
Craig</p>
<div class=3D"gmail_quote">On Dec 28, 2012 9:25 AM, &quot;ESGLinux&quot; &l=
t;<a href=3D"mailto:esggrupos@gmail.com">esggrupos@gmail.com</a>&gt; wrote:=
<br type=3D"attribution"><blockquote class=3D"gmail_quote" style=3D"margin:=
0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Hi,=A0<div><br></div><div>well, If I have understand you I can configure my=
 NN HA cluster this way:</div><div><br></div><div><span style=3D"border-col=
lapse:collapse;color:rgb(34,34,34);font-family:arial,sans-serif;font-size:1=
3px"><div>

- Active NameNode + 1 ZKFC daemon + Journal Node=A0</div><div>- Standby Nam=
eNode + 1 ZKFC daemon + Journal Node</div><div>- JobTracker node + 1 ZKFC d=
aemon + Journal Node,=A0</div><div><br></div><div>Is this right?</div><div>

<br></div><div>Thanks in advance,=A0</div><div><br></div><div>ESGLinux,=A0<=
/div></span><br><div class=3D"gmail_quote">2012/12/27 Harsh J <span dir=3D"=
ltr">&lt;<a href=3D"mailto:harsh@cloudera.com" target=3D"_blank">harsh@clou=
dera.com</a>&gt;</span><br>

<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">Hi,<br>
<br>
There are two different things here: Automatic Failover and Quorum<br>
Journal Manager. The former, used via a ZooKeeper Failover Controller,<br>
is to manage failovers automatically (based on health checks of NNs).<br>
The latter, used via a set of Journal Nodes, is a medium of shared<br>
storage for namesystem transactions that helps enable HA.<br>
<br>
In a typical deployment, you want 3 or more (odd) JournalNodes for<br>
reliable HA, preferably on nodes of their own if possible (like you<br>
would for typical ZooKeepers, and you may co-locate with those as<br>
well) and one ZKFC for each NameNode (connected to the same ZK<br>
quorum).<br>
<div><div><br>
On Thu, Dec 27, 2012 at 5:33 PM, ESGLinux &lt;<a href=3D"mailto:esggrupos@g=
mail.com" target=3D"_blank">esggrupos@gmail.com</a>&gt; wrote:<br>
&gt; Hi all,<br>
&gt;<br>
&gt; I have a doubt about how to deploy the Zookeeper in a NN HA =A0cluster=
,<br>
&gt;<br>
&gt; As far as I know, I need at least three nodes to run three ZooKeeper<b=
r>
&gt; FailOver Controller (ZKFC). I plan to put these 3 daemons this way:<br=
>
&gt;<br>
&gt; - Active NameNode + 1 ZKFC daemon<br>
&gt; - Standby NameNode + 1 ZKFC daemon<br>
&gt; - JobTracker node + 1 ZKFC daemon, (is this right?)<br>
&gt;<br>
&gt; so the quorum is formed with these three nodes. The nodes that runs a<=
br>
&gt; namenode are right because the ZKFC monitors it, but what does the thi=
rd<br>
&gt; daemon?<br>
&gt;<br>
&gt; as I read from this url:<br>
&gt; <a href=3D"https://ccp.cloudera.com/display/CDH4DOC/Software+Configura=
tion+for+Quorum-based+Storage#SoftwareConfigurationforQuorum-basedStorage-A=
utomaticFailoverConfiguration" target=3D"_blank">https://ccp.cloudera.com/d=
isplay/CDH4DOC/Software+Configuration+for+Quorum-based+Storage#SoftwareConf=
igurationforQuorum-basedStorage-AutomaticFailoverConfiguration</a><br>


&gt;<br>
&gt; this daemons are only related with NameNodes, (Health monitoring - the=
 ZKFC<br>
&gt; pings its local NameNode on a periodic basis with a health-check comma=
nd.)<br>
&gt; so what does the third ZKFC? I used the jobtracker node but I could us=
e<br>
&gt; another node without any daemon on it...<br>
&gt;<br>
&gt; Thanks in advance,<br>
&gt;<br>
&gt; ESGLInux,<br>
&gt;<br>
&gt;<br>
&gt;<br>
<br>
<br>
<br>
</div></div><span><font color=3D"#888888">--<br>
Harsh J<br>
</font></span></blockquote></div><br></div>
</blockquote></div>

--0015175907a46839e404d1e6d9a8--