Subject: Re: Spark locality issue
From: Jean-Daniel Cryans <jdcryans@apache.org>
Date: Mon, 26 Jun 2017 09:18:08 -0700
To: user@kudu.apache.org

On Mon, Jun 26, 2017 at 8:53 AM, Jean-Daniel Cryans <jdcryans@apache.org> wrote:

> Hi Pavel,
>
> I think the whole Kudu/Spark story needs more attention. For example, Spark
> SQL query plans don't have access to any Kudu stats, so you can end up with
> some really bad join decisions.
>
> It feels like KUDU-1454 should be really easy to solve at this point. What
> we need is to get the RDD to use CLOSEST_REPLICA and to set a propagated
> timestamp, like Todd says in the jira. This is all stuff that's done in
> Impala's integration with Kudu. If you wanted to see if that solves your
> problem, you could add the following code on this line:
> http://github.mtv.cloudera.com/CDH/kudu/blob/cdh5-trunk/java/kudu-client/src/main/java/org/apache/kudu/client/KuduScanToken.java#L226

Of course I meant a link more like this:
https://github.com/apache/kudu/blob/master/java/kudu-client/src/main/java/org/apache/kudu/client/KuduScanToken.java#L226

> builder.replicaSelection(ReplicaSelection.CLOSEST_REPLICA);
>
> The propagated timestamp part is also needed, but only for consistency
> purposes; it won't affect the locality.
>
> J-D
>
> On Mon, Jun 26, 2017 at 12:59 AM, Pavel Martynov <mr.xkurt@gmail.com> wrote:
>
>> Hi, guys!
>>
>> I'm working on replacing the proprietary analytics platform Microsoft PDW
>> (aka Microsoft APS) at my company with an open-source alternative.
>> Currently I'm experimenting with a Mesos/Spark/Kudu stack, and it looks
>> promising.
>>
>> Recently I discovered some very strange behavior. The situation: I have a
>> table on a 5-server cluster with 50 tablets, and I run a simple Spark
>> rdd.count() against it. If the table has no replication, all is fine:
>> every server runs the count aggregation on local data. But if the table
>> has replication > 1, I can see (with the iftop utility) that Spark scans
>> remote tablets, while the Spark UI still shows the tasks with locality
>> NODE_LOCAL, which is not true.
>>
>> I found the issue https://issues.apache.org/jira/browse/KUDU-1454, "Spark
>> and MR jobs running without scan locality", which looks like my problem.
>>
>> IMHO, Kudu-Spark can't be considered production-ready with such an issue.
>> Are there fundamental problems with fixing it?
>>
>> --
>> with best regards, Pavel Martynov
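
For anyone who wants to try the CLOSEST_REPLICA behaviour without patching
KuduScanToken, the Kudu Java client exposes the same knob on the regular
scanner builder. The following is a minimal sketch, not a fix for KUDU-1454
itself: the master address "kudu-master:7051" and table name "my_table" are
placeholders, and it assumes the public scanner builder offers the same
replicaSelection() setter that the scan-token path uses. It simply counts rows
while asking the client to read from the closest replica:

    import org.apache.kudu.client.KuduClient;
    import org.apache.kudu.client.KuduException;
    import org.apache.kudu.client.KuduScanner;
    import org.apache.kudu.client.KuduTable;
    import org.apache.kudu.client.ReplicaSelection;
    import org.apache.kudu.client.RowResultIterator;

    public class ClosestReplicaCount {
        public static void main(String[] args) throws KuduException {
            // Placeholder master address and table name -- substitute your own.
            KuduClient client = new KuduClient.KuduClientBuilder("kudu-master:7051").build();
            try {
                KuduTable table = client.openTable("my_table");

                // Ask the client to read from the closest replica instead of
                // the leader; this is the same ReplicaSelection value J-D
                // suggests wiring into KuduScanToken for the Spark RDD.
                KuduScanner scanner = client.newScannerBuilder(table)
                        .replicaSelection(ReplicaSelection.CLOSEST_REPLICA)
                        .build();

                long count = 0;
                while (scanner.hasMoreRows()) {
                    RowResultIterator rows = scanner.nextRows();
                    count += rows.getNumRows();
                }
                scanner.close();
                System.out.println("row count: " + count);
            } finally {
                client.shutdown();
            }
        }
    }

Making the kudu-spark RDD apply this selection automatically (and propagate the
scan timestamp for consistency) is what the change J-D describes above would do.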