Mailing-List: contact user-help@mesos.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@mesos.apache.org
MIME-Version: 1.0
In-Reply-To: <CAFp_NivJ7cCYtTVUY5bfduSC27A6HOSGhcmkwn1aOeBy-=6=sQ@mail.gmail.com>
References: <7A6B773BF2F721479D668347BF48E8A7575714C9@SZXEMI501-MBS.china.huawei.com>
 <CAFp_NiuouoXFKO+QjBX06rriZ0aqWzrGdrqWqEDxCCR79-0RHw@mail.gmail.com>
 <7A6B773BF2F721479D668347BF48E8A757571608@SZXEMI501-MBS.china.huawei.com> <CAFp_NivJ7cCYtTVUY5bfduSC27A6HOSGhcmkwn1aOeBy-=6=sQ@mail.gmail.com>
From: tommy xiao <xiaods@gmail.com>
Date: Tue, 10 Jan 2017 23:05:45 +0800
Message-ID: <CAJA-XLOhk5gvG9i5CDQ4yRAwWi4vkqux-dd6PYPQDNtCpc7_qQ@mail.gmail.com>
Subject: =?UTF-8?Q?Re=3A_=E7=AD=94=E5=A4=8D=3A_Optimize_libprocess_performance?=
To: dev <dev@mesos.apache.org>
Cc: "user@mesos.apache.org" <user@mesos.apache.org>
Content-Type: multipart/alternative; boundary=001a114950dc97a7050545bed185
archived-at: Tue, 10 Jan 2017 15:06:03 -0000

--001a114950dc97a7050545bed185
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Looking forward to seeing the benchmarks.

2017-01-06 5:52 GMT+08:00 Benjamin Mahler <bmahler@apache.org>:

> Ok great, thanks for sharing. Looking forward to seeing the benchmarks.
>
> On Wed, Jan 4, 2017 at 10:31 PM, pangbingqiang <pangbingqiang@huawei.com>
> wrote:
>
> > We write a light k-v database ,use for metadata store and nameservice
> like
> > etcd, but we test its TPS just 1200+(one client), the network is not th=
e
> > bottleneck, so the RPC layer is too heavy.
> >
> > -----=E9=82=AE=E4=BB=B6=E5=8E=9F=E4=BB=B6-----
> > =E5=8F=91=E4=BB=B6=E4=BA=BA: Benjamin Mahler [mailto:bmahler@apache.org=
]
> > =E5=8F=91=E9=80=81=E6=97=B6=E9=97=B4: 2017=E5=B9=B41=E6=9C=885=E6=97=A5=
 9:26
> > =E6=94=B6=E4=BB=B6=E4=BA=BA: dev
> > =E6=8A=84=E9=80=81: user@mesos.apache.org
> > =E4=B8=BB=E9=A2=98: Re: Optimize libprocess performance
> >
> > Which areas does the performance not meet your needs? There are a lot o=
f
> > aspects to libprocess that can be optimized, so it would be good to foc=
us
> > on each of your particular use cases via benchmarks, this allows us to
> have
> > a shared way to profile and measure improvements.
> >
> > Copy elimination is one area where a lot of improvement can be made
> across
> > libprocess, note that libprocess was implemented before we had C++11 mo=
ve
> > support available. We've recently made some improvements to update the
> HTTP
> > serving path towards zero-copies but it's not completely done. Can you
> > submit patches for the ProcessBase::send() path copy elimination? We ca=
n
> > have a move overload for ProcessBase::send and have
> ProtobufProcess::send()
> > and encode() perform moves instead of a copy.
> >
> > With respect to the MessageEncoder, since it's less trivial, you can
> > submit a benchmark that captures the use case you care about and we can
> > drive improvements using it. I have some suggestions here as well but w=
e
> > can discuss once we have the benchmarks committed.
> >
> > How does that sound to start?
> >
> > On Tue, Jan 3, 2017 at 7:31 PM, pangbingqiang <pangbingqiang@huawei.com=
>
> > wrote:
> >
> > > Hi All:
> > >
> > >   We use libprocess as our underlying communication library, but we
> > > find it=E2=80=99s performance don=E2=80=99t meet, we want to optimize=
 it, for example:
> > >
> > > *  =E2=80=98send=E2=80=99 function *implementation one metadata has f=
our times memory
> > > copy,
> > >
> > > *1. ProtobufMessage SerializeToString then processbase =E2=80=98encod=
e=E2=80=99
> > > construct string once;*
> > >
> > > *2. In =E2=80=98encode=E2=80=99 function Message body copy again;*
> > >
> > > *3. In MessageEncoder in order to construct HTTP Request, copy again;=
*
> > >
> > > *4.       **MessageEncoder return copy again;*
> > >
> > >   How to optimize this scenario may be useful.
> > >
> > >   Also , in libprocess it has so many lock:
> > >
> > > *1.       **SocketManager:   std::recursive_mutex mutex;*
> > >
> > > *2.       **ProcessManager:  std::recursive_mutex processes_mutex;*
> > *std::recursive_mutex
> > > runq_mutex; std::recursive_mutex firewall_mutex;*
> > >
> > > In particular, everytime event enqueue/dequeue both need to get lock,
> > > maybe use lookfree struct is better.
> > >
> > >
> > >
> > > If have any optimize suggestion or discussion, please let me know,
> > thanks.
> > >
> > >
> > >
> > > [image: cid:image001.png@01D0E8C5.8D08F440]
> > >
> > >
> > >
> > > Bingqiang Pang(=E5=BA=9E=E5=85=B5=E5=BC=BA)
> > >
> > >
> > >
> > > Distributed and Parallel Software Lab
> > >
> > > Huawei Technologies Co., Ltd.
> > >
> > > Email:pangbingqiang@huawei.com <suteng@huawei.com>
> > >
> > >
> > >
> > >
> > >
> >
>


--=20
Deshi Xiao
Twitter: xds2000
E-mail: xiaods(AT)gmail.com

--001a114950dc97a7050545bed185
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><span style=3D"font-size:14px">Looking forward to seeing t=
he benchmarks.</span><br></div><div class=3D"gmail_extra"><br><div class=3D=
"gmail_quote">2017-01-06 5:52 GMT+08:00 Benjamin Mahler <span dir=3D"ltr">&=
lt;<a href=3D"mailto:bmahler@apache.org" target=3D"_blank">bmahler@apache.o=
rg</a>&gt;</span>:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0=
 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Ok great, thanks for s=
haring. Looking forward to seeing the benchmarks.<br>
<br>
On Wed, Jan 4, 2017 at 10:31 PM, pangbingqiang &lt;<a href=3D"mailto:pangbi=
ngqiang@huawei.com">pangbingqiang@huawei.com</a>&gt;<br>
<div class=3D"HOEnZb"><div class=3D"h5">wrote:<br>
<br>
&gt; We write a light k-v database ,use for metadata store and nameservice =
like<br>
&gt; etcd, but we test its TPS just 1200+(one client), the network is not t=
he<br>
&gt; bottleneck, so the RPC layer is too heavy.<br>
&gt;<br>
&gt; -----=E9=82=AE=E4=BB=B6=E5=8E=9F=E4=BB=B6-----<br>
&gt; =E5=8F=91=E4=BB=B6=E4=BA=BA: Benjamin Mahler [mailto:<a href=3D"mailto=
:bmahler@apache.org">bmahler@apache.org</a>]<br>
&gt; =E5=8F=91=E9=80=81=E6=97=B6=E9=97=B4: 2017=E5=B9=B41=E6=9C=885=E6=97=
=A5 9:26<br>
&gt; =E6=94=B6=E4=BB=B6=E4=BA=BA: dev<br>
&gt; =E6=8A=84=E9=80=81: <a href=3D"mailto:user@mesos.apache.org">user@meso=
s.apache.org</a><br>
&gt; =E4=B8=BB=E9=A2=98: Re: Optimize libprocess performance<br>
&gt;<br>
&gt; Which areas does the performance not meet your needs? There are a lot =
of<br>
&gt; aspects to libprocess that can be optimized, so it would be good to fo=
cus<br>
&gt; on each of your particular use cases via benchmarks, this allows us to=
 have<br>
&gt; a shared way to profile and measure improvements.<br>
&gt;<br>
&gt; Copy elimination is one area where a lot of improvement can be made ac=
ross<br>
&gt; libprocess, note that libprocess was implemented before we had C++11 m=
ove<br>
&gt; support available. We&#39;ve recently made some improvements to update=
 the HTTP<br>
&gt; serving path towards zero-copies but it&#39;s not completely done. Can=
 you<br>
&gt; submit patches for the ProcessBase::send() path copy elimination? We c=
an<br>
&gt; have a move overload for ProcessBase::send and have ProtobufProcess::s=
end()<br>
&gt; and encode() perform moves instead of a copy.<br>
&gt;<br>
&gt; With respect to the MessageEncoder, since it&#39;s less trivial, you c=
an<br>
&gt; submit a benchmark that captures the use case you care about and we ca=
n<br>
&gt; drive improvements using it. I have some suggestions here as well but =
we<br>
&gt; can discuss once we have the benchmarks committed.<br>
&gt;<br>
&gt; How does that sound to start?<br>
&gt;<br>
&gt; On Tue, Jan 3, 2017 at 7:31 PM, pangbingqiang &lt;<a href=3D"mailto:pa=
ngbingqiang@huawei.com">pangbingqiang@huawei.com</a>&gt;<br>
&gt; wrote:<br>
&gt;<br>
&gt; &gt; Hi All:<br>
&gt; &gt;<br>
&gt; &gt;=C2=A0 =C2=A0We use libprocess as our underlying communication lib=
rary, but we<br>
&gt; &gt; find it=E2=80=99s performance don=E2=80=99t meet, we want to opti=
mize it, for example:<br>
&gt; &gt;<br>
&gt; &gt; *=C2=A0 =E2=80=98send=E2=80=99 function *implementation one metad=
ata has four times memory<br>
&gt; &gt; copy,<br>
&gt; &gt;<br>
&gt; &gt; *1. ProtobufMessage SerializeToString then processbase =E2=80=98e=
ncode=E2=80=99<br>
&gt; &gt; construct string once;*<br>
&gt; &gt;<br>
&gt; &gt; *2. In =E2=80=98encode=E2=80=99 function Message body copy again;=
*<br>
&gt; &gt;<br>
&gt; &gt; *3. In MessageEncoder in order to construct HTTP Request, copy ag=
ain;*<br>
&gt; &gt;<br>
&gt; &gt; *4.=C2=A0 =C2=A0 =C2=A0 =C2=A0**MessageEncoder return copy again;=
*<br>
&gt; &gt;<br>
&gt; &gt;=C2=A0 =C2=A0How to optimize this scenario may be useful.<br>
&gt; &gt;<br>
&gt; &gt;=C2=A0 =C2=A0Also , in libprocess it has so many lock:<br>
&gt; &gt;<br>
&gt; &gt; *1.=C2=A0 =C2=A0 =C2=A0 =C2=A0**SocketManager:=C2=A0 =C2=A0std::r=
ecursive_mutex mutex;*<br>
&gt; &gt;<br>
&gt; &gt; *2.=C2=A0 =C2=A0 =C2=A0 =C2=A0**ProcessManager:=C2=A0 std::recurs=
ive_mutex processes_mutex;*<br>
&gt; *std::recursive_mutex<br>
&gt; &gt; runq_mutex; std::recursive_mutex firewall_mutex;*<br>
&gt; &gt;<br>
&gt; &gt; In particular, everytime event enqueue/dequeue both need to get l=
ock,<br>
&gt; &gt; maybe use lookfree struct is better.<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt; If have any optimize suggestion or discussion, please let me know=
,<br>
&gt; thanks.<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt; [image: cid:image001.png@01D0E8C5.<wbr>8D08F440]<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt; Bingqiang Pang(=E5=BA=9E=E5=85=B5=E5=BC=BA)<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt; Distributed and Parallel Software Lab<br>
&gt; &gt;<br>
&gt; &gt; Huawei Technologies Co., Ltd.<br>
&gt; &gt;<br>
&gt; &gt; <a href=3D"mailto:Email%3Apangbingqiang@huawei.com">Email:pangbin=
gqiang@huawei.com</a> &lt;<a href=3D"mailto:suteng@huawei.com">suteng@huawe=
i.com</a>&gt;<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt;<br>
</div></div></blockquote></div><br><br clear=3D"all"><div><br></div>-- <br>=
<div class=3D"gmail_signature" data-smartmail=3D"gmail_signature">Deshi Xia=
o<br>Twitter: xds2000<br>E-mail: xiaods(AT)<a href=3D"http://gmail.com" tar=
get=3D"_blank">gmail.com</a></div>
</div>

--001a114950dc97a7050545bed185--