From: Akhil Das <akhil@sigmoidanalytics.com>
To: Ji ZHANG <zhangji87@gmail.com>
Cc: user@spark.apache.org
Date: Thu, 28 May 2015 12:37:45 +0530
Subject: Re: Spark Streaming yarn-cluster Mode Off-heap Memory Is Constantly Growing

Hi Zhang,

Could you paste your code in a gist? Not sure what you are doing inside the code to fill up memory.

Thanks
Best Regards

On Thu, May 28, 2015 at 10:08 AM, Ji ZHANG wrote:

> Hi,
>
> Yes, I'm using createStream, but the storageLevel param is by default
> MEMORY_AND_DISK_SER_2. Besides, the driver's memory is also growing. I
> don't think Kafka messages will be cached in the driver.
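For reference, a minimal sketch of the receiver-based createStream call being discussed, with an explicit StorageLevel (Spark 1.3 Scala API; this is not the original job's code, and the ZooKeeper quorum, group id, and topic name below are placeholders):

import org.apache.spark.SparkConf
import org.apache.spark.storage.StorageLevel
import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.kafka.KafkaUtils

object ReceiverSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("kafka-receiver-sketch")
    val ssc = new StreamingContext(conf, Seconds(1)) // placeholder batch interval

    // Receiver-based (high-level consumer) stream; the last argument overrides
    // the default StorageLevel.MEMORY_AND_DISK_SER_2 mentioned above.
    val stream = KafkaUtils.createStream(
      ssc,
      "zk1:2181,zk2:2181",          // placeholder ZooKeeper quorum
      "my-consumer-group",          // placeholder consumer group id
      Map("events" -> 1),           // placeholder topic -> receiver thread count
      StorageLevel.MEMORY_AND_DISK)

    stream.map(_._2).count().print() // values only; just to give the job an output

    ssc.start()
    ssc.awaitTermination()
  }
}

With createStream, the receiver stores incoming blocks according to this StorageLevel, which is why MEMORY_AND_DISK can relieve memory pressure at the cost of speed, as noted below.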
>
> On Thu, May 28, 2015 at 12:24 AM, Akhil Das wrote:
>
>> Are you using the createStream or createDirectStream API? If it's the
>> former, you can try setting the StorageLevel to MEMORY_AND_DISK (it might
>> slow things down though). Another way would be to try the latter one.
>>
>> Thanks
>> Best Regards
>>
>> On Wed, May 27, 2015 at 1:00 PM, Ji ZHANG wrote:
>>
>>> Hi Akhil,
>>>
>>> Thanks for your reply. According to the Streaming tab of the Web UI, the
>>> Processing Time is around 400 ms and there's no Scheduling Delay, so I
>>> suppose it's not the Kafka messages that eat up the off-heap memory. Or
>>> maybe it is, but how can I tell?
>>>
>>> I googled how to check off-heap memory usage; there's a tool called
>>> pmap, but I don't know how to interpret the results.
>>>
>>> On Wed, May 27, 2015 at 3:08 PM, Akhil Das wrote:
>>>
>>>> After submitting the job, if you do a ps aux | grep spark-submit you
>>>> can see all the JVM params. Are you using the high-level consumer
>>>> (receiver based) for receiving data from Kafka? In that case, if your
>>>> throughput is high and the processing delay exceeds the batch interval,
>>>> you will hit these memory issues, as data will keep arriving and be
>>>> dumped to memory. You can set the StorageLevel to MEMORY_AND_DISK (but
>>>> it slows things down). Another alternative would be to use the low-level
>>>> Kafka consumer, or the non-receiver-based directStream that comes with
>>>> Spark.
>>>>
>>>> Thanks
>>>> Best Regards
>>>>
>>>> On Wed, May 27, 2015 at 11:51 AM, Ji ZHANG wrote:
>>>>
>>>>> Hi,
>>>>>
>>>>> I'm using Spark Streaming 1.3 on CDH5.1 in yarn-cluster mode. I found
>>>>> that YARN is killing the driver and executor processes because of
>>>>> excessive memory use. Here's what I have tried:
>>>>>
>>>>> 1. Xmx is set to 512M and the GC looks fine (one ygc per 10s), so the
>>>>> extra memory is not used by the heap.
>>>>> 2. I set the two memoryOverhead params to 1024 (default is 384), but
>>>>> the memory just keeps growing and then hits the limit.
>>>>> 3. The problem does not show up in low-throughput jobs, nor in
>>>>> standalone mode.
>>>>> 4. The test job just receives messages from Kafka with a batch interval
>>>>> of 1, does some filtering and aggregation, and then prints to the
>>>>> executor logs, so it's not some 3rd-party library that causes the 'leak'.
>>>>>
>>>>> I built Spark 1.3 myself, with the correct Hadoop versions.
>>>>>
>>>>> Any ideas would be appreciated.
>>>>>
>>>>> Thanks.
>>>>>
>>>>> --
>>>>> Jerry
>>>>
>>>
>>> --
>>> Jerry
>>
>
> --
> Jerry
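For reference, a minimal sketch of the non-receiver-based direct approach suggested in the thread, together with the two memoryOverhead settings mentioned above (Spark 1.3 Scala API; not the original job's code; the broker list, topic name, and overhead values are placeholders):

import kafka.serializer.StringDecoder
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.kafka.KafkaUtils

object DirectStreamSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("kafka-direct-sketch")
      // The two YARN overhead settings referred to above (values in MB, placeholders).
      .set("spark.yarn.driver.memoryOverhead", "1024")
      .set("spark.yarn.executor.memoryOverhead", "1024")
    val ssc = new StreamingContext(conf, Seconds(1)) // placeholder batch interval

    // Direct (receiver-less) stream: no long-running receiver and no blocks
    // stored via a StorageLevel; offsets are tracked by Spark itself.
    val kafkaParams = Map("metadata.broker.list" -> "broker1:9092,broker2:9092")
    val topics = Set("events") // placeholder topic

    val stream = KafkaUtils.createDirectStream[String, String, StringDecoder, StringDecoder](
      ssc, kafkaParams, topics)

    stream.map(_._2).count().print()

    ssc.start()
    ssc.awaitTermination()
  }
}

In yarn-cluster mode the overhead values are normally passed on the spark-submit command line (e.g. via --conf) rather than set in code; the keys are shown here only to document which settings the thread refers to.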