Subject: Re: Dynamic Resource Allocation with Spark Streaming (Standalone Cluster, Spark 1.5.1)
From: Deenar Toraskar
To: Ted Yu <yuzhihong@gmail.com>, user@spark.apache.org
Date: Tue, 27 Oct 2015 07:23:32 +0000

Until Spark Streaming supports dynamic allocation, you could use a StreamingListener to monitor batch execution times and, based on those, call sparkContext.requestExecutors() and sparkContext.killExecutors() to add and remove executors explicitly.
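A minimal sketch of the idea in Scala (against the 1.5 developer API; the listener name, the delay threshold, and the scale-by-one policy below are illustrative assumptions, not a tested implementation):

import org.apache.spark.streaming.StreamingContext
import org.apache.spark.streaming.scheduler.{StreamingListener, StreamingListenerBatchCompleted}

// Illustrative listener: request one extra executor whenever a batch's
// processing delay exceeds maxDelayMs. The threshold is an assumption to tune.
class BatchDelayScaler(ssc: StreamingContext, maxDelayMs: Long) extends StreamingListener {
  override def onBatchCompleted(batch: StreamingListenerBatchCompleted): Unit = {
    // processingDelay is how long the batch took to process, in milliseconds.
    val processingDelay = batch.batchInfo.processingDelay.getOrElse(0L)
    if (processingDelay > maxDelayMs) {
      // Developer API on SparkContext: asks the cluster manager for more executors.
      ssc.sparkContext.requestExecutors(numAdditionalExecutors = 1)
    }
    // Scaling down is symmetric via ssc.sparkContext.killExecutors(Seq(executorId)),
    // with executor IDs tracked e.g. from SparkListenerExecutorAdded events.
  }
}

// Register before ssc.start():
// ssc.addStreamingListener(new BatchDelayScaler(ssc, maxDelayMs = 4000L))

Note that requestExecutors/killExecutors are @DeveloperApi methods and only work on coarse-grained scheduler backends, so treat this as a starting point rather than a drop-in solution.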
On 26 October 2015 at 21:37, Ted Yu <yuzhihong@gmail.com> wrote:
> This is related:
> SPARK-10955 Warn if dynamic allocation is enabled for Streaming jobs
>
> which went into 1.6.0 as well.
>
> FYI
>
> On Mon, Oct 26, 2015 at 2:26 PM, Silvio Fiorito <silvio.fiorito@granturing.com> wrote:
>> Hi Matthias,
>>
>> Unless there was a change in 1.5, I'm afraid dynamic resource allocation is not yet supported in streaming apps.
>>
>> Thanks,
>> Silvio
>>
>> Sent from my Lumia 930
>> ------------------------------
>> From: Matthias Niehoff
>> Sent: 10/26/2015 4:00 PM
>> To: user@spark.apache.org
>> Subject: Dynamic Resource Allocation with Spark Streaming (Standalone Cluster, Spark 1.5.1)
>>
>> Hello everybody,
>>
>> I have a few (~15) Spark Streaming jobs which have load peaks as well as long periods of low load. So I thought the new Dynamic Resource Allocation for Standalone Clusters (SPARK-4751) might be helpful.
>>
>> I have a test "cluster" with 1 worker consisting of 4 executors with 2 cores each, so 8 cores in total.
>>
>> I started a simple streaming application without limiting the max cores for this app. As expected, the app occupied every core of the cluster. Then I started a second app, also without limiting the maximum cores. As the first app did not get any input through the stream, my naive expectation was that the second app would get at least 2 cores (1 receiver, 1 processing), but that's not what happened. The cores are still assigned to the first app. When I look at the application UI of the first app, every executor is still running. That explains why no executor is available for the second app.
>>
>> I end up with two questions:
>> - When does an executor become idle in a Spark Streaming application (and so could be reassigned to another app)?
>> - Is there another way to cope with uncertain load when using Spark Streaming applications? I already combine multiple jobs into one Spark application using different threads, but this approach has reached its limit for me, because the applications get too big to manage.
>>
>> Thank you!