Subject: Re: Journal Nodes in a multi-site environment
From: Olivier Renault <orenault@hortonworks.com>
To: user@ambari.apache.org
Cc: Suresh Srinivas, Rohit Bakhshi
Date: Wed, 11 Dec 2013 22:30:31 +0100

To get a quorum you need a majority of JournalNodes: N / 2 + 1, rounding
down. So with 3 JournalNodes, 2 of them must be up. If you've got 3
JournalNodes on each side (6 total), then after losing a DC you will never
reach quorum, since that would require 4 nodes. With a 2/1 split, if you
lose the DC holding the 2 JournalNodes, you could manually bring Hadoop up
in the second DC.
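To make the arithmetic concrete, here is a minimal Python sketch. The
two-site placements in it are only illustrative examples, not a
recommendation:

    # Quorum arithmetic for HDFS JournalNodes spread across two sites.
    def quorum(total_jns):
        """A majority of JournalNodes must ack each edit: N // 2 + 1."""
        return total_jns // 2 + 1

    def survives_dc_loss(jns_site_a, jns_site_b):
        """After losing one site, can the other still reach quorum?"""
        need = quorum(jns_site_a + jns_site_b)
        return jns_site_b >= need, jns_site_a >= need

    for a, b in [(2, 1), (3, 3), (2, 2)]:
        lost_a, lost_b = survives_dc_loss(a, b)
        print("%d+%d JNs: quorum=%d, survives loss of site A: %s, "
              "of site B: %s" % (a, b, quorum(a + b), lost_a, lost_b))

Whichever split you pick, at most one of the two sites can hold a majority,
so there is no two-site placement that survives the loss of either site.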
As a side note, Hadoop is not yet recommended as a multi-DC solution.

Hope it helps.

Olivier


On 11 December 2013 21:49, Jeff Sposetti <jeff@hortonworks.com> wrote:

> Adding in some Hadoop folks to chime in here.
>
> On Wed, Dec 11, 2013 at 5:35 AM, Chadwick Banning
> <chadwickbanning@gmail.com> wrote:
>
>> Hi all,
>>
>> I have an Ambari 1.4/HDP 2.0.6 environment that is split between two
>> data centers -- a main site and a recovery site. We have NameNode HA
>> enabled with automatic failover, and the problem we are facing is how
>> to divide the JournalNodes across both sites so that failover happens
>> appropriately.
>>
>> It seems that one site will always have a majority of the JournalNodes,
>> and if that site's NameNode were to go down, the other site's NameNode
>> would no longer be able to start, as it couldn't reach a majority of
>> the JournalNodes.
>>
>> Is there any way around this? I know an odd number of JournalNodes is
>> recommended, but what would happen if we placed an even number of
>> JournalNodes at each site?
>>
>> Thanks for any input!