From: Marcin Mejran <marcin.mejran@hooklogic.com>
To: ""
CC: "user@hadoop.apache.org"
Subject: Re: Hadoop efficient resource isolation
Date: Tue, 26 Feb 2013 04:27:35 +0000
Message-ID: <4839F0AF-25D4-4141-87B7-195CE6BB89B6@hooklogic.com>
In-Reply-To: <2AF7313F-73BC-44B2-9E3F-D9183F9D0BAB@hortonworks.com>
Reply-To: user@hadoop.apache.org
Content-Type: text/plain; charset="us-ascii"
That won't stop a bad job (say, a fork bomb or a massive memory leak in a streaming script) from taking out a node, which is what I believe Dhanasekaran was asking about. He wants to physically isolate certain jobs to certain "non-critical" nodes. I don't believe this is possible, and data would still be spread to those nodes (assuming they're data nodes), which could still cause cluster-wide issues. And if the data is isolated, why not run two separate clusters?

I've read references in the docs to some type of memory-based constraints in Hadoop, but I don't know the details. Does anyone know how they work?

Also, I believe there are tools in Linux that can kill processes when memory runs out and otherwise restrict what a given user can do. These seem like a more flexible solution, although they won't cover all potential issues.

-Marcin

On Feb 25, 2013, at 7:20 PM, "Arun C Murthy" <acm@hortonworks.com> wrote:

CapacityScheduler is what you want...

On Feb 21, 2013, at 5:16 AM, Dhanasekaran Anbalagan wrote:

Hi Guys,

Is it possible to isolate job submissions in a Hadoop cluster? We currently run a 48-machine cluster, and from our monitoring, Hadoop does not provide efficient resource isolation. In our case we run tech and research pools. When a tech job had a memory leak, it occupied the whole cluster. By the time we figured out that the issue was with the tech job, it had screwed up the whole Hadoop cluster and 10 data nodes were dead.

Is there a way to allocate resources so that when something goes wrong in a particular job, it only affects that job's pool and not other jobs? How can we achieve this?

Please guide me, guys.

My idea is: when a tech user submits a job, it should only run on (in our case) 24 machines; the other machines would be reserved for research users. That would prevent the memory leak problem.

-Dhanasekaran.
Did I learn something today? If not, I wasted it.

--
Arun C. Murthy
Hortonworks Inc.
http://hortonworks.com/
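Arun's CapacityScheduler pointer maps Dhanasekaran's 24-of-48-machine idea onto per-queue capacity shares rather than dedicated machines. A minimal sketch for a Hadoop 1.x setup — the queue names `tech` and `research` and the 50/50 split are assumptions drawn from the scenario in the thread, not tested configuration:

```xml
<!-- mapred-site.xml: declare the two queues (sketch; values illustrative) -->
<property>
  <name>mapred.queue.names</name>
  <value>tech,research</value>
</property>

<!-- capacity-scheduler.xml: give each pool half the cluster's task slots -->
<property>
  <name>mapred.capacity-scheduler.queue.tech.capacity</name>
  <value>50</value>
</property>
<property>
  <name>mapred.capacity-scheduler.queue.research.capacity</name>
  <value>50</value>
</property>
```

A job would then target a queue with `-Dmapred.job.queue.name=tech`. As Marcin notes, this caps a pool's share of slots rather than physically fencing off nodes, so a misbehaving task can still hurt whichever node it lands on.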
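The memory-based constraints Marcin mentions may be the per-task limits available in 1.x-era Hadoop; a hedged sketch (property names from that era, values illustrative rather than recommended):

```xml
<!-- mapred-site.xml sketch: bound each task JVM so one leaking job
     fails its own tasks instead of exhausting the node's memory -->
<property>
  <name>mapred.child.java.opts</name>
  <value>-Xmx1024m</value>   <!-- max heap per map/reduce child JVM -->
</property>
<property>
  <name>mapred.child.ulimit</name>
  <value>2097152</value>     <!-- ulimit -v for task children, in KB (~2 GB) -->
</property>
```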
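The Linux-level tools Marcin alludes to include per-process address-space limits (`ulimit -v`, or per-user entries in `limits.conf`) and, more coarsely, the kernel OOM killer. A small demonstration of the mechanism, assuming a Linux host with `python3` on the PATH:

```shell
# Run a deliberately memory-hungry process under a 1 GiB virtual-memory cap.
# The oversized allocation fails cleanly instead of dragging down the node.
(
  ulimit -v 1048576   # cap this subshell's address space (value is in KB)
  python3 -c 'x = bytearray(2 * 1024**3)' 2>/dev/null \
    && echo "allocation succeeded" \
    || echo "allocation blocked"
)
```

On a cluster this would be applied per user in `/etc/security/limits.conf`; the subshell above just shows the effect of the limit.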