Mailing-List: contact user-help@kudu.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@kudu.apache.org
MIME-Version: 1.0
In-Reply-To: <CANRT7T2vpNtDS0xFtyHYW9UeMB=f9_xaPW1qWjDigAyQA+dpGQ@mail.gmail.com>
References: <CANRT7T2kZkoa-XaTDy24ycVfwRhmzz0-acn7_q5OM2XQSVcmqw@mail.gmail.com>
 <CAGpTDNeWM3GRsT-tZ9n6DmJk=GmOkeL-L7=jHcfoea5+U-bbEQ@mail.gmail.com>
 <CANRT7T2bp3KeHTGKogFGe629TP1tTHZ_5fpoMtrVLuGeiMJdZg@mail.gmail.com>
 <CAGpTDNfegj89jR9=4tQ_wNSGfkgsrmrLjYCe89-WeWEFXREjkA@mail.gmail.com>
 <CANRT7T2L5N0xBZjLO=vWpERi+PrKQLu1piurdABUOuFtpx3oWw@mail.gmail.com>
 <CANRT7T3iC38WU8=5G8aai-zJ5ko4S2EZZNhTXqRKgF26LBPCYg@mail.gmail.com>
 <CAGpTDNevnDYZfFp4aOzRzwezAehHGESWuCzySKqQCL8=5cX3aQ@mail.gmail.com>
 <CANRT7T27D=mAeORYMucW5fF6OGytwvDdYS7Si=FHsuc+rK9yfw@mail.gmail.com>
 <CAGpTDNforhswoV7UCGRnkhgrKo630gE9ZX598sTwTrT+2Ru27Q@mail.gmail.com> <CANRT7T2vpNtDS0xFtyHYW9UeMB=f9_xaPW1qWjDigAyQA+dpGQ@mail.gmail.com>
From: Jean-Daniel Cryans <jdcryans@apache.org>
Date: Wed, 3 Jan 2018 10:45:09 -0800
Message-ID: <CAGpTDNdo1bV438V=VF1PwWRb5igTm2R1rSMuzK8Mj8f2qBE0TQ@mail.gmail.com>
Subject: Re: first and second run 2x query time difference
To: user@kudu.apache.org
Content-Type: multipart/alternative; boundary="f403043a2e206c534b0561e39d51"
archived-at: Wed, 03 Jan 2018 18:45:14 -0000

--f403043a2e206c534b0561e39d51
Content-Type: text/plain; charset="UTF-8"

Hey Boris,

Thanks for reporting back with results!

On Wed, Jan 3, 2018 at 10:38 AM, Boris Tyukin <boris@boristyukin.com> wrote:

> so it was the page cache that makes this difference. we did a series of
> tests either restarting Kudu only, Impala only or both and resetting or not
> touching page cache.
>
> as for Kudu failures after restart, it was a sequence of services that
> need to be started before Kudu. If we start Kudu after HDFS, everything is
> fine. Data is intact
>

Is it possible that Kudu is sharing disks with ZK?


>
> thanks again for your help, J-D
>
> On Sat, Dec 16, 2017 at 4:05 PM, Jean-Daniel Cryans <jdcryans@apache.org>
> wrote:
>
>> I'm more thinking in terms of the startup IO having some impact on the
>> co-located services, but we really need to know what "went down" means.
>>
>> On Sat, Dec 16, 2017 at 12:50 PM, Boris Tyukin <boris@boristyukin.com>
>> wrote:
>>
>>> yep it is really weird since Kudu does not use neither one. I'll get
>>> with him on Monday to gather more details
>>>
>>> On Sat, Dec 16, 2017 at 3:28 PM, Jean-Daniel Cryans <jdcryans@apache.org
>>> > wrote:
>>>
>>>> Hi Boris,
>>>>
>>>> How exactly did HDFS and ZK go down? A Kudu restart is fairly
>>>> IO-intensive but I don't know how that can cause things like DataNodes to
>>>> fail.
>>>>
>>>> J-D
>>>>
>>>> On Sat, Dec 16, 2017 at 11:45 AM, Boris Tyukin <boris@boristyukin.com>
>>>> wrote:
>>>>
>>>>> well our admin had fun two days - it was the first time we restarted
>>>>> Kudu on our DEV cluster and it did not go well. He is still troubleshooting
>>>>> what happened but after Kudu restart zookeeper and HDFS went down after 3-4
>>>>> minutes. If we disable Kudu, all is well. No error in Kudu logs...I will
>>>>> have more details next week so not asking for help as I do not know all the
>>>>> details. What is obvious thought is that it has to do something with Kudu :)
>>>>>
>>>>> On Thu, Dec 14, 2017 at 9:40 AM, Boris Tyukin <boris@boristyukin.com>
>>>>> wrote:
>>>>>
>>>>>> thanks for your suggestions, J-D, I am sure you are right more often
>>>>>> than that! :))
>>>>>>
>>>>>> I will report back with our results. So far I am really impressed
>>>>>> with Kudu - we have been benchmarking ingest and egress throughput and our
>>>>>> typical queries runtime. The biggest pain so far is lack of support for
>>>>>> decimals
>>>>>>
>>>>>> On Wed, Dec 13, 2017 at 5:07 PM, Jean-Daniel Cryans <
>>>>>> jdcryans@apache.org> wrote:
>>>>>>
>>>>>>> On Wed, Dec 13, 2017 at 11:30 AM, Boris Tyukin <
>>>>>>> boris@boristyukin.com> wrote:
>>>>>>>
>>>>>>>> thanks J-D! we are going to try that and see how it impacts the
>>>>>>>> runtime.
>>>>>>>>
>>>>>>>> is there any way to load this metadata upfront? a lot of our
>>>>>>>> queries are adhoc in nature but they will be hitting the same tables with
>>>>>>>> different predicates and join patterns though.
>>>>>>>>
>>>>>>>
>>>>>>> You could use Impala to compute all the stats of all the tables
>>>>>>> after each Kudu restart. Actually, do try that, restart Kudu then compute
>>>>>>> stats and see how fast it scans.
>>>>>>>
>>>>>>>
>>>>>>>>
>>>>>>>> I am curious why this metadata does not survive restarts though. We
>>>>>>>> are going to run our benchmarks again and this time restart Kudu and Impala.
>>>>>>>>
>>>>>>>
>>>>>>> It's in the tserver memory, it can't survive a restart.
>>>>>>>
>>>>>>>
>>>>>>>>
>>>>>>>> I just ran another query first time which hits 2 large tables and
>>>>>>>> these tables have been scanned by the previous query and this time I do not
>>>>>>>> see any difference in query time before the first and second time - I guess
>>>>>>>> this confirms your statement about " first time ever scanning the
>>>>>>>> table since a Kudu restart" and collecting metadata.
>>>>>>>>
>>>>>>>
>>>>>>> Maybe, I've been known to be right once or twice a year :)
>>>>>>>
>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On Wed, Dec 13, 2017 at 11:18 AM, Jean-Daniel Cryans <
>>>>>>>> jdcryans@apache.org> wrote:
>>>>>>>>
>>>>>>>>> Hi Boris,
>>>>>>>>>
>>>>>>>>> Given that we don't have much data we can use here, I'll have to
>>>>>>>>> extrapolate. As an aside though, this is yet another example where we need
>>>>>>>>> more Kudu-side metrics in the query profile.
>>>>>>>>>
>>>>>>>>> So, Kudu lazily loads a bunch of metadata and that can really
>>>>>>>>> affect scan times. If this was your first time ever scanning the table
>>>>>>>>> since a Kudu restart, it's very possible that that's where that time was
>>>>>>>>> spent. There's also the page cache in the OS that might now be populated.
>>>>>>>>> You could do something like "sync; echo 3 > /proc/sys/vm/drop_caches" on
>>>>>>>>> all the machines and run the query 2 times again, without restarting Kudu,
>>>>>>>>> to understand the effect of the page cache itself. There's currently now
>>>>>>>>> way to purge the cached metadata in Kudu though.
>>>>>>>>>
>>>>>>>>> Hope this helps a bit,
>>>>>>>>>
>>>>>>>>> J-D
>>>>>>>>>
>>>>>>>>> On Wed, Dec 13, 2017 at 8:07 AM, Boris Tyukin <
>>>>>>>>> boris@boristyukin.com> wrote:
>>>>>>>>>
>>>>>>>>>> Hi guys,
>>>>>>>>>>
>>>>>>>>>> I am doing some benchmarks with Kudu and Impala/Parquet and hope
>>>>>>>>>> to share it soon but there is one thing that bugs me. This is perhaps
>>>>>>>>>> Impala question but since I am using Kudu with Impala I am going to try and
>>>>>>>>>> ask anyway.
>>>>>>>>>>
>>>>>>>>>> One of my queries takes 120 seconds to run the very first time.
>>>>>>>>>> It joins one large 5B row table with a bunch of smaller tables and then
>>>>>>>>>> stores result in Impala/parquet (not Kudu).
>>>>>>>>>>
>>>>>>>>>> Now if I run it second and third time, it only takes 60 seconds.
>>>>>>>>>> Can someone explain why? Is there any settings to decrease this gap?
>>>>>>>>>>
>>>>>>>>>> I've compared query profiles in CM and the only thing that was
>>>>>>>>>> very different is scan against Kudu table (the large one):
>>>>>>>>>>
>>>>>>>>>> ***************************
>>>>>>>>>> first time:
>>>>>>>>>> ***************************
>>>>>>>>>> KUDU_SCAN_NODE (id=0) (47.68s)
>>>>>>>>>> <https://lkmaorabd103.multihosp.net:7183/cmf/impala/queryDetails?queryId=5143f7165be82819%3Ae00a103500000000&serviceName=impala#>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>    - BytesRead: *0 B*
>>>>>>>>>>    - InactiveTotalTime: *0ns*
>>>>>>>>>>    - KuduRemoteScanTokens: *0*
>>>>>>>>>>    - NumScannerThreadsStarted: *20*
>>>>>>>>>>    - PeakMemoryUsage: *35.8 MiB*
>>>>>>>>>>    - RowsRead: *693,502,241*
>>>>>>>>>>    - RowsReturned: *693,502,241*
>>>>>>>>>>    - RowsReturnedRate: *14643448 per second*
>>>>>>>>>>    - ScanRangesComplete: *20*
>>>>>>>>>>    - ScannerThreadsInvoluntaryContextSwitches: *1,341*
>>>>>>>>>>    - ScannerThreadsTotalWallClockTime: *36.2m*
>>>>>>>>>>       - MaterializeTupleTime(*): *47.57s*
>>>>>>>>>>       - ScannerThreadsSysTime: *31.42s*
>>>>>>>>>>       - ScannerThreadsUserTime: *1.7m*
>>>>>>>>>>    - ScannerThreadsVoluntaryContextSwitches: *96,855*
>>>>>>>>>>    - TotalKuduScanRoundTrips: *52,308*
>>>>>>>>>>    - TotalReadThroughput: *0 B/s*
>>>>>>>>>>    - TotalTime: *47.68s*
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> ***************************
>>>>>>>>>> second time:
>>>>>>>>>> ***************************
>>>>>>>>>> KUDU_SCAN_NODE (id=0) (4.28s)
>>>>>>>>>> <https://lkmaorabd103.multihosp.net:7183/cmf/impala/queryDetails?queryId=53497a308f860837%3A243772e000000000&serviceName=impala#>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>    - BytesRead: *0 B*
>>>>>>>>>>    - InactiveTotalTime: *0ns*
>>>>>>>>>>    - KuduRemoteScanTokens: *0*
>>>>>>>>>>    - NumScannerThreadsStarted: *20*
>>>>>>>>>>    - PeakMemoryUsage: *37.9 MiB*
>>>>>>>>>>    - RowsRead: *693,502,241*
>>>>>>>>>>    - RowsReturned: *693,502,241*
>>>>>>>>>>    - RowsReturnedRate: *173481534 per second*
>>>>>>>>>>    - ScanRangesComplete: *20*
>>>>>>>>>>    - ScannerThreadsInvoluntaryContextSwitches: *1,451*
>>>>>>>>>>    - ScannerThreadsTotalWallClockTime: *19.5m*
>>>>>>>>>>       - MaterializeTupleTime(*): *4.20s*
>>>>>>>>>>       - ScannerThreadsSysTime: *38.22s*
>>>>>>>>>>       - ScannerThreadsUserTime: *1.7m*
>>>>>>>>>>    - ScannerThreadsVoluntaryContextSwitches: *480,870*
>>>>>>>>>>    - TotalKuduScanRoundTrips: *52,142*
>>>>>>>>>>    - TotalReadThroughput: *0 B/s*
>>>>>>>>>>    - TotalTime: *4.28s*
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>
>>>>>
>>>>
>>>
>>
>

--f403043a2e206c534b0561e39d51
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Hey Boris,<div><br></div><div>Thanks for reporting back wi=
th results!</div><div class=3D"gmail_extra"><br><div class=3D"gmail_quote">=
On Wed, Jan 3, 2018 at 10:38 AM, Boris Tyukin <span dir=3D"ltr">&lt;<a href=
=3D"mailto:boris@boristyukin.com" target=3D"_blank">boris@boristyukin.com</=
a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0=
 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr">so =
it was the page cache that makes this difference. we did a series of tests =
either restarting Kudu only, Impala only or both and resetting or not touch=
ing page cache.<div><br></div><div>as for Kudu failures after restart, it w=
as a sequence of services that need to be started before Kudu. If we start =
Kudu after HDFS, everything is fine. Data is intact</div></div></blockquote=
><div><br></div><div>Is it possible that Kudu is sharing disks with ZK?</di=
v><div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div><br=
></div><div>thanks again for your help, J-D</div></div><div class=3D"HOEnZb=
"><div class=3D"h5"><div class=3D"gmail_extra"><br><div class=3D"gmail_quot=
e">On Sat, Dec 16, 2017 at 4:05 PM, Jean-Daniel Cryans <span dir=3D"ltr">&l=
t;<a href=3D"mailto:jdcryans@apache.org" target=3D"_blank">jdcryans@apache.=
org</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"mar=
gin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr=
">I&#39;m more thinking in terms of the startup IO having some impact on th=
e co-located services, but we really need to know what &quot;went down&quot=
; means.</div><div class=3D"m_3927375710438075436HOEnZb"><div class=3D"m_39=
27375710438075436h5"><div class=3D"gmail_extra"><br><div class=3D"gmail_quo=
te">On Sat, Dec 16, 2017 at 12:50 PM, Boris Tyukin <span dir=3D"ltr">&lt;<a=
 href=3D"mailto:boris@boristyukin.com" target=3D"_blank">boris@boristyukin.=
com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"mar=
gin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr=
">yep it is really weird since Kudu does not use neither one. I&#39;ll get =
with him on Monday to gather more details</div><div class=3D"m_392737571043=
8075436m_-2154994384276007142HOEnZb"><div class=3D"m_3927375710438075436m_-=
2154994384276007142h5"><div class=3D"gmail_extra"><br><div class=3D"gmail_q=
uote">On Sat, Dec 16, 2017 at 3:28 PM, Jean-Daniel Cryans <span dir=3D"ltr"=
>&lt;<a href=3D"mailto:jdcryans@apache.org" target=3D"_blank">jdcryans@apac=
he.org</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"=
margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"=
ltr">Hi Boris,<div><br></div><div>How exactly did HDFS and ZK go down? A Ku=
du restart is fairly IO-intensive but I don&#39;t know how that can cause t=
hings like DataNodes to fail.</div><span class=3D"m_3927375710438075436m_-2=
154994384276007142m_-4821731801410890228HOEnZb"><font color=3D"#888888"><di=
v><br></div><div>J-D</div></font></span></div><div class=3D"m_3927375710438=
075436m_-2154994384276007142m_-4821731801410890228HOEnZb"><div class=3D"m_3=
927375710438075436m_-2154994384276007142m_-4821731801410890228h5"><div clas=
s=3D"gmail_extra"><br><div class=3D"gmail_quote">On Sat, Dec 16, 2017 at 11=
:45 AM, Boris Tyukin <span dir=3D"ltr">&lt;<a href=3D"mailto:boris@boristyu=
kin.com" target=3D"_blank">boris@boristyukin.com</a>&gt;</span> wrote:<br><=
blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px=
 #ccc solid;padding-left:1ex"><div dir=3D"ltr">well our admin had fun two d=
ays - it was the first time we restarted Kudu on our DEV cluster and it did=
 not go well. He is still troubleshooting what happened but after Kudu rest=
art zookeeper and HDFS went down after 3-4 minutes. If we disable Kudu, all=
 is well. No error in Kudu logs...I will have more details next week so not=
 asking for help as I do not know all the details. What is obvious thought =
is that it has to do something with Kudu :)</div><div class=3D"m_3927375710=
438075436m_-2154994384276007142m_-4821731801410890228m_2402479110709336902H=
OEnZb"><div class=3D"m_3927375710438075436m_-2154994384276007142m_-48217318=
01410890228m_2402479110709336902h5"><div class=3D"gmail_extra"><br><div cla=
ss=3D"gmail_quote">On Thu, Dec 14, 2017 at 9:40 AM, Boris Tyukin <span dir=
=3D"ltr">&lt;<a href=3D"mailto:boris@boristyukin.com" target=3D"_blank">bor=
is@boristyukin.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quot=
e" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">=
<div dir=3D"ltr">thanks for your suggestions, J-D, I am sure you are right =
more often than that! :))<div><br></div><div>I will report back with our re=
sults. So far I am really impressed with Kudu - we have been benchmarking i=
ngest and egress throughput and our typical queries runtime. The biggest pa=
in so far is lack of support for decimals</div></div><div class=3D"m_392737=
5710438075436m_-2154994384276007142m_-4821731801410890228m_2402479110709336=
902m_-749833381122581308HOEnZb"><div class=3D"m_3927375710438075436m_-21549=
94384276007142m_-4821731801410890228m_2402479110709336902m_-749833381122581=
308h5"><div class=3D"gmail_extra"><br><div class=3D"gmail_quote">On Wed, De=
c 13, 2017 at 5:07 PM, Jean-Daniel Cryans <span dir=3D"ltr">&lt;<a href=3D"=
mailto:jdcryans@apache.org" target=3D"_blank">jdcryans@apache.org</a>&gt;</=
span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8e=
x;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div class=
=3D"gmail_extra"><div class=3D"gmail_quote"><span>On Wed, Dec 13, 2017 at 1=
1:30 AM, Boris Tyukin <span dir=3D"ltr">&lt;<a href=3D"mailto:boris@boristy=
ukin.com" target=3D"_blank">boris@boristyukin.com</a>&gt;</span> wrote:<br>=
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr">thanks=C2=A0<span style=3D"=
font-size:12.8px">J-D! we are going to try that and see how it impacts the =
runtime.=C2=A0</span><div><span style=3D"font-size:12.8px"><br></span></div=
><div><span style=3D"font-size:12.8px">is there any way to load this metada=
ta upfront? a lot of our queries are adhoc=C2=A0in nature but they will be =
hitting the same tables with different predicates and join patterns though.=
=C2=A0</span></div></div></blockquote><div><br></div></span><div>You could =
use Impala to compute all the stats of all the tables after each Kudu resta=
rt. Actually, do try that, restart Kudu then compute stats and see how fast=
 it scans.</div><span><div>=C2=A0</div><blockquote class=3D"gmail_quote" st=
yle=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div =
dir=3D"ltr"><div><span style=3D"font-size:12.8px"><br></span></div><div><sp=
an style=3D"font-size:12.8px">I am curious why this metadata does not survi=
ve restarts though. We are going to run our benchmarks again and this time =
restart Kudu and Impala.</span></div></div></blockquote><div><br></div></sp=
an><div>It&#39;s in the tserver memory, it can&#39;t survive a restart.</di=
v><span><div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:=
0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"><d=
iv><span style=3D"font-size:12.8px"><br></span></div><div><span style=3D"fo=
nt-size:12.8px">I just ran another query first time which hits 2 large tabl=
es and these tables have been scanned by the previous query and this time I=
 do not see any difference in query time before the first and second time -=
 I guess this confirms your statement about &quot;</span><span style=3D"fon=
t-size:12.8px">=C2=A0</span><span style=3D"font-size:12.8px">first time eve=
r scanning the table since a Kudu restart&quot; and collecting metadata.</s=
pan></div></div></blockquote><div><br></div></span><div>Maybe, I&#39;ve bee=
n known to be right once or twice a year :)</div><div><div class=3D"m_39273=
75710438075436m_-2154994384276007142m_-4821731801410890228m_240247911070933=
6902m_-749833381122581308m_3585700471761779649h5"><div>=C2=A0</div><blockqu=
ote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc s=
olid;padding-left:1ex"><div dir=3D"ltr"><div><br></div></div><div class=3D"=
m_3927375710438075436m_-2154994384276007142m_-4821731801410890228m_24024791=
10709336902m_-749833381122581308m_3585700471761779649m_8210277688020734929H=
OEnZb"><div class=3D"m_3927375710438075436m_-2154994384276007142m_-48217318=
01410890228m_2402479110709336902m_-749833381122581308m_3585700471761779649m=
_8210277688020734929h5"><div class=3D"gmail_extra"><br><div class=3D"gmail_=
quote">On Wed, Dec 13, 2017 at 11:18 AM, Jean-Daniel Cryans <span dir=3D"lt=
r">&lt;<a href=3D"mailto:jdcryans@apache.org" target=3D"_blank">jdcryans@ap=
ache.org</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=
=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=
=3D"ltr">Hi Boris,<div><br></div><div>Given that we don&#39;t have much dat=
a we can use here, I&#39;ll have to extrapolate. As an aside though, this i=
s yet another example where we need more Kudu-side metrics in the query pro=
file.</div><div><br></div><div>So, Kudu lazily loads a bunch of metadata an=
d that can really affect scan times. If this was your first time ever scann=
ing the table since a Kudu restart, it&#39;s very possible that that&#39;s =
where that time was spent. There&#39;s also the page cache in the OS that m=
ight now be populated. You could do something like &quot;sync; echo 3 &gt; =
/proc/sys/vm/drop_caches&quot; on all the machines and run the query 2 time=
s again, without restarting Kudu, to understand the effect of the page cach=
e itself. There&#39;s currently now way to purge the cached metadata in Kud=
u though.</div><div><br></div><div>Hope this helps a bit,</div><div><br></d=
iv><div>J-D</div></div><div class=3D"m_3927375710438075436m_-21549943842760=
07142m_-4821731801410890228m_2402479110709336902m_-749833381122581308m_3585=
700471761779649m_8210277688020734929m_7423841599431524121HOEnZb"><div class=
=3D"m_3927375710438075436m_-2154994384276007142m_-4821731801410890228m_2402=
479110709336902m_-749833381122581308m_3585700471761779649m_8210277688020734=
929m_7423841599431524121h5"><div class=3D"gmail_extra"><br><div class=3D"gm=
ail_quote">On Wed, Dec 13, 2017 at 8:07 AM, Boris Tyukin <span dir=3D"ltr">=
&lt;<a href=3D"mailto:boris@boristyukin.com" target=3D"_blank">boris@borist=
yukin.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=
=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=
=3D"ltr">Hi guys,<div><br></div><div>I am doing some benchmarks with Kudu a=
nd Impala/Parquet and hope to share it soon but there is one thing that bug=
s me. This is perhaps Impala question but since I am using Kudu with Impala=
 I am going to try and ask anyway.</div><div><br></div><div>One of my queri=
es takes 120 seconds to run the very first time. It joins one large 5B row =
table with a bunch of smaller tables and then stores result in Impala/parqu=
et (not Kudu).=C2=A0</div><div><br></div><div>Now if I run it second and th=
ird time, it only takes 60 seconds. Can someone explain why? Is there any s=
ettings to decrease this gap?</div><div><br></div><div>I&#39;ve compared qu=
ery profiles in CM and the only thing that was very different is scan again=
st Kudu table (the large one):</div><div><br></div><div>*******************=
********</div><div>first time:</div><div>***************************</div><=
div><a href=3D"https://lkmaorabd103.multihosp.net:7183/cmf/impala/queryDeta=
ils?queryId=3D5143f7165be82819%3Ae00a103500000000&amp;serviceName=3Dimpala#=
" class=3D"m_3927375710438075436m_-2154994384276007142m_-482173180141089022=
8m_2402479110709336902m_-749833381122581308m_3585700471761779649m_821027768=
8020734929m_7423841599431524121m_-7911394718303036144m_637588162729263090gm=
ail-Toggler" style=3D"color:rgb(11,127,173);text-decoration-line:none;font-=
family:Roboto,&quot;Helvetica Neue&quot;,Helvetica,Arial,sans-serif;font-si=
ze:14px" target=3D"_blank"><h4 style=3D"margin:0px 0px 8px;font-family:inhe=
rit;font-weight:400;line-height:20px;color:rgb(66,66,66);font-size:16px">KU=
DU_SCAN_NODE (id=3D0) (47.68s)</h4></a><span style=3D"color:rgb(66,66,66);f=
ont-family:Roboto,&quot;Helvetica Neue&quot;,Helvetica,Arial,sans-serif;fon=
t-size:14px"></span><div class=3D"m_3927375710438075436m_-21549943842760071=
42m_-4821731801410890228m_2402479110709336902m_-749833381122581308m_3585700=
471761779649m_8210277688020734929m_7423841599431524121m_-791139471830303614=
4m_637588162729263090gmail-plan-details" style=3D"color:rgb(66,66,66);font-=
family:Roboto,&quot;Helvetica Neue&quot;,Helvetica,Arial,sans-serif;font-si=
ze:14px"><ul class=3D"m_3927375710438075436m_-2154994384276007142m_-4821731=
801410890228m_2402479110709336902m_-749833381122581308m_3585700471761779649=
m_8210277688020734929m_7423841599431524121m_-7911394718303036144m_637588162=
729263090gmail-impala-info-strings m_3927375710438075436m_-2154994384276007=
142m_-4821731801410890228m_2402479110709336902m_-749833381122581308m_358570=
0471761779649m_8210277688020734929m_7423841599431524121m_-79113947183030361=
44m_637588162729263090gmail-unstyled" style=3D"padding:0px;margin:0px 0px 0=
px 20px;list-style:none"></ul><ul class=3D"m_3927375710438075436m_-21549943=
84276007142m_-4821731801410890228m_2402479110709336902m_-749833381122581308=
m_3585700471761779649m_8210277688020734929m_7423841599431524121m_-791139471=
8303036144m_637588162729263090gmail-unstyled" style=3D"padding:0px;margin:0=
px 0px 0px 20px;list-style:none"><li style=3D"line-height:20px">BytesRead:=
=C2=A0<strong>0 B</strong></li><li style=3D"line-height:20px">InactiveTotal=
Time:=C2=A0<strong>0ns</strong></li><li style=3D"line-height:20px">KuduRemo=
teScanTokens:=C2=A0<strong>0</strong></li><li style=3D"line-height:20px">Nu=
mScannerThreadsStarted:=C2=A0<strong>20</strong></li><li style=3D"line-heig=
ht:20px">PeakMemoryUsage:=C2=A0<strong>35.8 MiB</strong></li><li style=3D"l=
ine-height:20px">RowsRead:=C2=A0<strong>693,502,241</strong></li><li style=
=3D"line-height:20px">RowsReturned:=C2=A0<strong>693,502,241</strong></li><=
li style=3D"line-height:20px">RowsReturnedRate:=C2=A0<strong>14643448 per s=
econd</strong></li><li style=3D"line-height:20px">ScanRangesComplete:=C2=A0=
<strong>20</strong></li><li style=3D"line-height:20px">ScannerThreadsInvolu=
ntaryConte<wbr>xtSwitches:=C2=A0<strong>1,341</strong></li><li style=3D"lin=
e-height:20px">ScannerThreadsTotalWallClockTi<wbr>me:=C2=A0<strong>36.2m</s=
trong><ul class=3D"m_3927375710438075436m_-2154994384276007142m_-4821731801=
410890228m_2402479110709336902m_-749833381122581308m_3585700471761779649m_8=
210277688020734929m_7423841599431524121m_-7911394718303036144m_637588162729=
263090gmail-unstyled" style=3D"padding:0px;margin:0px 0px 0px 20px;list-sty=
le:none"><li style=3D"line-height:20px">MaterializeTupleTime(*):=C2=A0<stro=
ng>47.57<wbr>s</strong></li><li style=3D"line-height:20px">ScannerThreadsSy=
sTime:=C2=A0<strong>31.42s</strong></li><li style=3D"line-height:20px">Scan=
nerThreadsUserTime:=C2=A0<strong>1.7m</strong></li></ul></li><li style=3D"l=
ine-height:20px">ScannerThreadsVoluntaryContext<wbr>Switches:=C2=A0<strong>=
96,855</strong></li><li style=3D"line-height:20px">TotalKuduScanRoundTrips:=
=C2=A0<strong>52,30<wbr>8</strong></li><li style=3D"line-height:20px">Total=
ReadThroughput:=C2=A0<strong>0 B/s</strong></li><li style=3D"line-height:20=
px">TotalTime:=C2=A0<strong>47.68s</strong></li></ul><div><b><br></b></div>=
<div><div style=3D"color:rgb(34,34,34);font-family:arial,sans-serif;font-si=
ze:small">***************************</div><div style=3D"color:rgb(34,34,34=
);font-family:arial,sans-serif;font-size:small">second time:</div><div styl=
e=3D"color:rgb(34,34,34);font-family:arial,sans-serif;font-size:small">****=
***********************</div></div><div><a href=3D"https://lkmaorabd103.mul=
tihosp.net:7183/cmf/impala/queryDetails?queryId=3D53497a308f860837%3A243772=
e000000000&amp;serviceName=3Dimpala#" class=3D"m_3927375710438075436m_-2154=
994384276007142m_-4821731801410890228m_2402479110709336902m_-74983338112258=
1308m_3585700471761779649m_8210277688020734929m_7423841599431524121m_-79113=
94718303036144m_637588162729263090gmail-Toggler" style=3D"color:rgb(11,127,=
173);text-decoration-line:none" target=3D"_blank"><h4 style=3D"margin:0px 0=
px 8px;font-family:inherit;font-weight:400;line-height:20px;color:rgb(66,66=
,66);font-size:16px">KUDU_SCAN_NODE (id=3D0) (4.28s)</h4></a><div class=3D"=
m_3927375710438075436m_-2154994384276007142m_-4821731801410890228m_24024791=
10709336902m_-749833381122581308m_3585700471761779649m_8210277688020734929m=
_7423841599431524121m_-7911394718303036144m_637588162729263090gmail-plan-de=
tails"><ul class=3D"m_3927375710438075436m_-2154994384276007142m_-482173180=
1410890228m_2402479110709336902m_-749833381122581308m_3585700471761779649m_=
8210277688020734929m_7423841599431524121m_-7911394718303036144m_63758816272=
9263090gmail-impala-info-strings m_3927375710438075436m_-215499438427600714=
2m_-4821731801410890228m_2402479110709336902m_-749833381122581308m_35857004=
71761779649m_8210277688020734929m_7423841599431524121m_-7911394718303036144=
m_637588162729263090gmail-unstyled" style=3D"padding:0px;margin:0px 0px 0px=
 20px;list-style:none"></ul><ul class=3D"m_3927375710438075436m_-2154994384=
276007142m_-4821731801410890228m_2402479110709336902m_-749833381122581308m_=
3585700471761779649m_8210277688020734929m_7423841599431524121m_-79113947183=
03036144m_637588162729263090gmail-unstyled" style=3D"padding:0px;margin:0px=
 0px 0px 20px;list-style:none"><li style=3D"line-height:20px">BytesRead:=C2=
=A0<strong>0 B</strong></li><li style=3D"line-height:20px">InactiveTotalTim=
e:=C2=A0<strong>0ns</strong></li><li style=3D"line-height:20px">KuduRemoteS=
canTokens:=C2=A0<strong>0</strong></li><li style=3D"line-height:20px">NumSc=
annerThreadsStarted:=C2=A0<strong>20</strong></li><li style=3D"line-height:=
20px">PeakMemoryUsage:=C2=A0<strong>37.9 MiB</strong></li><li style=3D"line=
-height:20px">RowsRead:=C2=A0<strong>693,502,241</strong></li><li style=3D"=
line-height:20px">RowsReturned:=C2=A0<strong>693,502,241</strong></li><li s=
tyle=3D"line-height:20px">RowsReturnedRate:=C2=A0<strong>173481534 per seco=
nd</strong></li><li style=3D"line-height:20px">ScanRangesComplete:=C2=A0<st=
rong>20</strong></li><li style=3D"line-height:20px">ScannerThreadsInvolunta=
ryConte<wbr>xtSwitches:=C2=A0<strong>1,451</strong></li><li style=3D"line-h=
eight:20px">ScannerThreadsTotalWallClockTi<wbr>me:=C2=A0<strong>19.5m</stro=
ng><ul class=3D"m_3927375710438075436m_-2154994384276007142m_-4821731801410=
890228m_2402479110709336902m_-749833381122581308m_3585700471761779649m_8210=
277688020734929m_7423841599431524121m_-7911394718303036144m_637588162729263=
090gmail-unstyled" style=3D"padding:0px;margin:0px 0px 0px 20px;list-style:=
none"><li style=3D"line-height:20px">MaterializeTupleTime(*):=C2=A0<strong>=
4.20s</strong></li><li style=3D"line-height:20px">ScannerThreadsSysTime:=C2=
=A0<strong>38.22s</strong></li><li style=3D"line-height:20px">ScannerThread=
sUserTime:=C2=A0<strong>1.7m</strong></li></ul></li><li style=3D"line-heigh=
t:20px">ScannerThreadsVoluntaryContext<wbr>Switches:=C2=A0<strong>480,870</=
strong></li><li style=3D"line-height:20px">TotalKuduScanRoundTrips:=C2=A0<s=
trong>52,14<wbr>2</strong></li><li style=3D"line-height:20px">TotalReadThro=
ughput:=C2=A0<strong>0 B/s</strong></li><li style=3D"line-height:20px">Tota=
lTime:=C2=A0<strong>4.28s</strong></li></ul></div></div><div><b><br></b></d=
iv></div></div><div><br></div><div><br></div></div>
</blockquote></div><br></div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div></div></div><br></div></div>
</blockquote></div><br></div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br></div></div>

--f403043a2e206c534b0561e39d51--