Mailing-List: contact user-help@kudu.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@kudu.apache.org
MIME-Version: 1.0
In-Reply-To: <CANbMB4yrWyYaFG8xP7kpK04m4tJsEfTU6gh1sgHBeXBU7hTKuw@mail.gmail.com>
References: <CAO3rhrQO-ha-h4z-bYw9E-cbEeAQ8g9pLHeBeFFfT13vc2f4WA@mail.gmail.com>
 <CANbMB4yrWyYaFG8xP7kpK04m4tJsEfTU6gh1sgHBeXBU7hTKuw@mail.gmail.com>
From: Jason Heo <jason.heo.sde@gmail.com>
Date: Wed, 15 Mar 2017 10:10:58 +0900
Message-ID: <CAO3rhrRA4Zb3D41YZo090hUHvw4zUAtUT1gsR6xKjRBba+9r2g@mail.gmail.com>
Subject: Re: What does RowSet Compaction Duration means?
To: user@kudu.apache.org
Content-Type: multipart/alternative; boundary=94eb2c0b7320078562054aba9eb5
archived-at: Wed, 15 Mar 2017 01:11:08 -0000

--94eb2c0b7320078562054aba9eb5
Content-Type: text/plain; charset=UTF-8

Hi Alexey.

Thank you for your reply.

With your help, now I can understand what 'compact_rs_duration` means. But
the `default_num_replicas` is just 3 not 5 :(

It seems compaction on tableB affects huge on bulk loading on tableA. Is
there a way to minimize compaction activities? (something like changing
configuration of Kudu)

The FAQ says that "Since compactions are so predictable, the only tuning
knob available is the number of threads dedicated to flushes and
compactions in the *maintenance manager*."

my `maintenance_manager_num_threads` is already 1.

Thanks.

2017-03-15 3:48 GMT+09:00 Alexey Serbin <aserbin@cloudera.com>:

> Hi Jason,
>
> As I understand, that 'milliseconds / second' cryptic unit means 'number
> of units / for sampling (or averaging) interval'.
>
> I.e., they capture that metric reading (expressed in milliseconds) every
> second, subtract previous value from the current value, and declare the
> result as the result measurement at current time.  If not capturing every
> second, then it's about measuring every X seconds, do the subtraction of
> the previous from the current measurement, and then divide by X.
>
> For a single tablet, the 'compact_rs_duration' metric stands for 'Time
> spent compacting RowSets'.  As I understand, that 'total_kudu_compact_rs_
> duration_sum_rate_across_kudu_replicas' is sum/accumulation of those
> measurements for all existing replicas of the specified tablet across Kudu
> cluster.
>
> I suspect you have the replication factor of 5 for that tablet, and at
> some point all replicas become busy with rowset compaction all the time.
>
> Compactions on tables are run in the background.  Compactions on different
> tables run independently.  So, if you have some other activity doing
> inserts/updates on tableB, then it's natural to see compaction happen on
> tabletB as well.
>
>
> Best regards,
>
> Alexey
>
> On Tue, Mar 14, 2017 at 12:50 AM, Jason Heo <jason.heo.sde@gmail.com>
> wrote:
>
>> Hi.
>>
>> I'm stuck with performance degradation on compaction happens.
>>
>> My Duration is "4956.71 milliseconds / second" What does this mean? I
>> can't figure it out.
>>
>> Here is the captured image: http://imgur.com/WU9sRRq
>>
>> When I'm doing bulk indexing on tableA, sometimes compaction happens over
>> tableB. Is this situation is natural?
>>
>> Thanks.
>>
>
>

--94eb2c0b7320078562054aba9eb5
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Hi Alexey.<div><br></div><div>Thank you for your reply.</d=
iv><div><br></div><div>With your help, now I can understand what &#39;compa=
ct_rs_duration` means. But the `default_num_replicas` is just 3 not 5 :(</d=
iv><div><br></div><div>It seems compaction on tableB affects huge on bulk l=
oading on tableA. Is there a way to minimize compaction activities? (someth=
ing like changing configuration of Kudu)</div><div><br></div><div>The FAQ s=
ays that &quot;<span style=3D"color:rgb(51,51,51);font-family:&quot;helveti=
ca neue&quot;,helvetica,arial,sans-serif;font-size:14px">Since compactions =
are so predictable, the only tuning knob available is the number of threads=
 dedicated to flushes and compactions in the=C2=A0</span><em style=3D"box-s=
izing:border-box;color:rgb(51,51,51);font-family:&quot;helvetica neue&quot;=
,helvetica,arial,sans-serif;font-size:14px">maintenance manager</em><span s=
tyle=3D"color:rgb(51,51,51);font-family:&quot;helvetica neue&quot;,helvetic=
a,arial,sans-serif;font-size:14px">.&quot;</span></div><div><span style=3D"=
color:rgb(51,51,51);font-family:&quot;helvetica neue&quot;,helvetica,arial,=
sans-serif;font-size:14px"><br></span></div><div><span style=3D"color:rgb(5=
1,51,51);font-family:&quot;helvetica neue&quot;,helvetica,arial,sans-serif;=
font-size:14px">my `</span><font color=3D"#333333" face=3D"helvetica neue, =
helvetica, arial, sans-serif"><span style=3D"font-size:14px">maintenance_ma=
nager_num_threads` is already 1.</span></font></div><div><br></div><div><fo=
nt color=3D"#333333" face=3D"helvetica neue, helvetica, arial, sans-serif">=
<span style=3D"font-size:14px">Thanks.</span></font></div></div><div class=
=3D"gmail_extra"><br><div class=3D"gmail_quote">2017-03-15 3:48 GMT+09:00 A=
lexey Serbin <span dir=3D"ltr">&lt;<a href=3D"mailto:aserbin@cloudera.com" =
target=3D"_blank">aserbin@cloudera.com</a>&gt;</span>:<br><blockquote class=
=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padd=
ing-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_default" style=3D"font-f=
amily:arial,helvetica,sans-serif">Hi Jason,</div><div class=3D"gmail_defaul=
t" style=3D"font-family:arial,helvetica,sans-serif"><br></div><div class=3D=
"gmail_default" style=3D"font-family:arial,helvetica,sans-serif">As I under=
stand, that &#39;milliseconds / second&#39; cryptic unit means &#39;number =
of units / for sampling (or averaging) interval&#39;.</div><div class=3D"gm=
ail_default" style=3D"font-family:arial,helvetica,sans-serif"><br></div><di=
v class=3D"gmail_default" style=3D"font-family:arial,helvetica,sans-serif">=
I.e., they capture that metric reading (expressed in milliseconds) every se=
cond, subtract previous value from the current value, and declare the resul=
t as the result measurement at current time.=C2=A0 If not capturing every s=
econd, then it&#39;s about measuring every X seconds, do the subtraction of=
 the previous from the current measurement, and then divide by X.</div><div=
 class=3D"gmail_default" style=3D"font-family:arial,helvetica,sans-serif"><=
br></div><div class=3D"gmail_default" style=3D"font-family:arial,helvetica,=
sans-serif">For a single tablet, the &#39;compact_rs_duration&#39; metric s=
tands for &#39;Time spent compacting RowSets&#39;.=C2=A0 As I understand, t=
hat &#39;total_kudu_compact_rs_<wbr>duration_sum_rate_across_kudu_<wbr>repl=
icas&#39; is sum/accumulation of those measurements for all existing replic=
as of the specified tablet across Kudu cluster.</div><div class=3D"gmail_de=
fault" style=3D"font-family:arial,helvetica,sans-serif"><br></div><div clas=
s=3D"gmail_default" style=3D"font-family:arial,helvetica,sans-serif">I susp=
ect you have the replication factor of 5 for that tablet, and at some point=
 all replicas become busy with rowset compaction all the time.</div><div cl=
ass=3D"gmail_default" style=3D"font-family:arial,helvetica,sans-serif"><br>=
</div><div class=3D"gmail_default" style=3D"font-family:arial,helvetica,san=
s-serif">Compactions on tables are run in the background.=C2=A0 Compactions=
 on different tables run independently.=C2=A0 So, if you have some other ac=
tivity doing inserts/updates on tableB, then it&#39;s natural to see compac=
tion happen on tabletB as well.</div><div class=3D"gmail_default" style=3D"=
font-family:arial,helvetica,sans-serif"><br></div><div class=3D"gmail_defau=
lt" style=3D"font-family:arial,helvetica,sans-serif"><br></div><div class=
=3D"gmail_default" style=3D"font-family:arial,helvetica,sans-serif">Best re=
gards,</div><div class=3D"gmail_default" style=3D"font-family:arial,helveti=
ca,sans-serif"><br></div><div class=3D"gmail_default" style=3D"font-family:=
arial,helvetica,sans-serif">Alexey</div></div><div class=3D"HOEnZb"><div cl=
ass=3D"h5"><div class=3D"gmail_extra"><br><div class=3D"gmail_quote">On Tue=
, Mar 14, 2017 at 12:50 AM, Jason Heo <span dir=3D"ltr">&lt;<a href=3D"mail=
to:jason.heo.sde@gmail.com" target=3D"_blank">jason.heo.sde@gmail.com</a>&g=
t;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0=
 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr">Hi.<div=
><br></div><div>I&#39;m stuck with performance degradation on compaction ha=
ppens.</div><div><br></div><div>My Duration is &quot;4956.71 milliseconds /=
 second&quot; What does this mean? I can&#39;t figure it out.</div><div><br=
></div><div>Here is the captured image:=C2=A0<a href=3D"http://imgur.com/WU=
9sRRq" target=3D"_blank">http://imgur.com/WU9sRR<wbr>q</a></div><div><br></=
div><div>When I&#39;m doing bulk indexing on tableA, sometimes compaction h=
appens over tableB. Is this situation is natural?</div><div><br></div><div>=
Thanks.</div></div>
</blockquote></div><br></div>
</div></div></blockquote></div><br></div>

--94eb2c0b7320078562054aba9eb5--