Subject: Re: Fair Scheduler of Hadoop
From: Lin Ma <linlma@gmail.com>
To: Joep Rottinghuis <jrottinghuis@gmail.com>
Cc: user@hadoop.apache.org
Date: Sat, 26 Jan 2013 23:38:19 +0800

Thanks Joep, smart answer! All of my confusions are gone. Have a good weekend.

regards,
Lin

On Tue, Jan 22, 2013 at 2:00 AM, Joep Rottinghuis wrote:

> You could configure it like that if you wanted. Keep in mind that it would
> waste some resources. Imagine a 10-minute task that has been running for 9
> minutes. If you have that task killed immediately, it would have to be
> re-scheduled and redo all 10 minutes.
> Give it another minute and the task is complete and out of the way.
>
> So, consider how busy your cluster is overall and how much you are willing
> to wait for fairness, trading this off against a certain amount of waste.
>
> Cheers,
>
> Joep
>
> Sent from my iPhone
>
> On Jan 21, 2013, at 9:30 AM, Lin Ma <linlma@gmail.com> wrote:
>
> Hi Joep,
>
> Excellent answer! I think you have answered my confusions. One remaining
> issue after reading this document again, even though it is old. :-)
>
> It is mentioned, "which will allow you to set how long each pool will
> wait before preempting other jobs' tasks to reach its guaranteed capacity";
> my question is: why does each pool need to wait here?
> If a pool cannot get its
> guaranteed capacity because jobs in other pools over-use the capacity,
> shouldn't we kill such jobs immediately? I would appreciate it if you could
> elaborate a bit more on why we need to wait even for the guaranteed capacity.
>
> regards,
> Lin
>
> On Mon, Jan 21, 2013 at 8:24 AM, Joep Rottinghuis wrote:
>
>> Lin,
>>
>> The article you are reading is old.
>> Fair Scheduler does have preemption.
>> Tasks get killed and rerun later, potentially on a different node.
>>
>> You can set a minimum / guaranteed capacity. The sum of those across
>> pools would typically equal the total capacity of your cluster, or less.
>> Then you can configure each pool to go beyond that capacity. That would
>> happen if the cluster is temporarily not used to its full capacity.
>> Then, when the demand for capacity increases and jobs are queued in other
>> pools that are not running at their minimum guaranteed capacity, some
>> long-running tasks from jobs in the pool that is using more than its
>> minimum capacity get killed (to be run again later).
>>
>> Does that make sense?
>>
>> Cheers,
>>
>> Joep
>>
>> Sent from my iPhone
>>
>> On Jan 20, 2013, at 6:25 AM, Lin Ma <linlma@gmail.com> wrote:
>>
>> Hi guys,
>>
>> I have a quick question regarding the Fair Scheduler of Hadoop. I am
>> reading this article:
>> http://blog.cloudera.com/blog/2008/11/job-scheduling-in-hadoop/
>> My question is about the following statement: "There is currently no
>> support for preemption of long tasks, but this is being added in
>> HADOOP-4665, which will allow you to set how long each pool will wait
>> before preempting other jobs' tasks to reach its guaranteed capacity."
>>
>> My questions are:
>>
>> 1. What does "preemption of long tasks" mean? Killing long-running tasks,
>> pausing long-running tasks to give resources to other tasks, or something
>> else?
>> 2. I am also confused about "set how long each pool will wait before
>> preempting other jobs' tasks to reach its guaranteed capacity". What does
>> "reach its guaranteed capacity" mean? I think when using the Fair
>> Scheduler, each pool has predefined resource-allocation settings (and the
>> settings guarantee each pool the resources as configured); is that true?
>> In what situations would a pool not have its guaranteed (or configured)
>> capacity?
>>
>> regards,
>> Lin
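[Editor's note] The per-pool preemption wait that the thread and HADOOP-4665 discuss is configured in the Fair Scheduler's allocation file. Below is a minimal sketch of such a file for the MR1-era Fair Scheduler; the pool name and all numeric values are illustrative (not from the thread), and exact element names may vary by Hadoop version.

```xml
<?xml version="1.0"?>
<!-- Fair Scheduler allocation file (e.g. fair-scheduler.xml).
     Pool name and values are illustrative. -->
<allocations>
  <pool name="production">
    <!-- Minimum (guaranteed) share of map and reduce slots -->
    <minMaps>20</minMaps>
    <minReduces>10</minReduces>
    <!-- Seconds this pool stays below its minimum share before it may
         preempt (kill) tasks from pools running over their share -->
    <minSharePreemptionTimeout>300</minSharePreemptionTimeout>
  </pool>
  <!-- Cluster-wide default: seconds a pool stays below half of its fair
       share before preempting tasks to reach that fair share -->
  <fairSharePreemptionTimeout>600</fairSharePreemptionTimeout>
</allocations>
```

A pool falls below its guaranteed capacity exactly in the situation Joep describes: other pools were allowed to grow past their minimums while the cluster was underused, and their tasks still occupy the slots.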
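[Editor's note] Joep's 10-minute-task example can be quantified with a small back-of-the-envelope sketch (the function and numbers are illustrative, not from the thread): killing a nearly finished task immediately throws away all its progress, while a short grace period can let it finish at no cost.

```python
# Hypothetical illustration of the preemption trade-off Joep describes:
# work wasted when an over-share task is killed, as a function of the
# grace period granted before the kill.

def wasted_minutes(task_length, progress, grace_period):
    """Minutes of work thrown away if a task with `progress` minutes done
    is preempted, given a kill grace period of `grace_period` minutes."""
    remaining = task_length - progress
    if remaining <= grace_period:
        return 0.0  # task finishes within the grace period; nothing is lost
    # Task is killed after the grace period; all work done so far is redone.
    return progress + grace_period

# Joep's example: a 10-minute task that is 9 minutes in.
print(wasted_minutes(10, 9, 0))  # kill immediately: 9 minutes redone
print(wasted_minutes(10, 9, 1))  # wait one more minute: task completes, 0 wasted
```

The trade-off is the one Joep names: a longer grace period wastes less completed work but makes starved pools wait longer for their guaranteed capacity.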