From: gang li <portl4t.cn@gmail.com>
To: users@trafficserver.apache.org
Date: Mon, 12 Jan 2015 11:49:59 +0800
Subject: Re: Interim cache - High CPU usage

I don't think it is a good idea to use the interim cache when the cached
objects are very large. Since the SSD is only 120 GB, objects on the HDD
will be migrated to the SSD frequently; the SSD storage becomes almost
meaningless because it is overwritten so quickly, and the churn increases
both CPU and I/O consumption.
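
To put rough numbers on it (a back-of-envelope sketch, using the figures
quoted below):

    120 GB SSD / 200 MB average object -> only ~600 objects fit at a time
    120 GB * 8 / 250 Mbps              -> ~3840 s, so at full line rate the
                                          whole SSD could be rewritten in
                                          about an hour

So even if only a fraction of the traffic is promoted to the interim cache,
the device turns over very quickly.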

Can you give more information from perf, such as a call graph?
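
Something like this should capture one (a rough sketch; the exact flags
depend on your perf version, and the pidof lookup assumes a single
traffic_server process):

    # sample the running traffic_server with call-graph recording for ~30s
    perf record -g -p $(pidof traffic_server) -- sleep 30
    # then inspect the recorded call graph
    perf report -g

The mangled symbol names from perf top can also be made readable with
c++filt, e.g.:

    $ echo _Z15write_to_net_ioP10NetHandlerP18UnixNetVConnectionP7EThread | c++filt
    write_to_net_io(NetHandler*, UnixNetVConnection*, EThread*)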


On Mon, Jan 12, 2015 at 4:52 AM, Daniel Biazus <daniel.biazus@azion.com> wrote:

> Hi Guys,
>
> We've been using ATS as a reverse proxy, and a few weeks ago we started to
> use the interim cache feature in a more intensive way, caching objects with
> an average size of 200 MB and a max size of 1 GB.
>
> We have a ~1 TB HDD as the default storage:
>
> cat /etc/trafficserver/storage.config
>
> # ATS - Storage
> /dev/sda6 volume=1

> And also a 120 GB SSD as the interim cache:
>
> LOCAL proxy.config.cache.interim.storage STRING /dev/sdc1
>
> After 20-30 minutes in production with this configuration, we noticed a
> sudden spike in CPU usage, up to 65%, whereas our regular usage is about
> 10%. The throughput, however, stayed stable at 250 Mbps per box.
>
> We've found the following behavior using the perf top tool:
>
>   88.18%  traffic_server   [.] _Z15write_to_net_ioP10NetHandlerP18UnixNetVConnectionP7EThread
>    0.32%  traffic_server   [.] _ZN10NetHandler12mainNetEventEiP5Event
>    0.30%  [kernel]         [k] update_sd_lb_stats
>    0.29%  [e1000e]         [k] e1000e_check_ltr_demote
>    0.25%  [kernel]         [k] __ticket_spin_lock
>    0.24%  traffic_server   [.] _ZN7EThread13process_eventEP5Eventi
>    0.21%  [kernel]         [k] timerqueue_add
>    0.17%  libc-2.12.so     [.] epoll_wait
>    0.17%  libpcre.so.0.0.  [.] 0x00000000000100dd
>    0.14%  [kernel]         [k] __schedule

> 1) This behavior is easily reproduced when caching large objects with the
>    interim cache active.
> 2) With the interim cache disabled, the behavior is not reproduced.
>
> As you can see in the perf top output, the write_to_net_io function is
> responsible for this heavy CPU usage. We would like to hear from you guys
> whether anyone has faced an issue like this, or whether you have any clues
> about this possible bug.
>
> Thanks & Regards,
>
> --
> Daniel Biazus
> Infrastructure Engineering
> Azion Technologies
> Porto Alegre, Brasil +55 51 3012 3005 | +55 51 82279032
> Miami, USA +1 305 704 8816
>
> Any information in this e-mail and its attachments may be confidential and
> privileged, protected by legal confidentiality. The use of this document
> requires authorization by the issuer, subject to the applicable penalties.