Mailing-List: contact user-help@kylin.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@kylin.apache.org
MIME-Version: 1.0
In-Reply-To: <CA+e75usKX_n1Cqdm25R_=LhURqPn3m=ZvJyrLVAW+Zi=5c73Bw@mail.gmail.com>
References: <CA+e75uu9S133vnqopjWN+Zd28ZhEojKuFLUcP6fR5-zX3eBF+A@mail.gmail.com>
 <CAHRce1MrgdwNbDLs3JsMU4LvbhFy4wpw3SR-fp9qvGaCCwh7Nw@mail.gmail.com> <CA+e75usKX_n1Cqdm25R_=LhURqPn3m=ZvJyrLVAW+Zi=5c73Bw@mail.gmail.com>
From: Li Yang <liyang@apache.org>
Date: Sat, 25 Mar 2017 07:31:24 +0800
Message-ID: <CAHRce1N2KfJfOMrQpzotE0xvG1izVjxYLFSLN8uOniYeKKskpw@mail.gmail.com>
Subject: Re: Kylin + SparkSQL integration
To: user@kylin.apache.org
Cc: liyang@apache.org
Content-Type: multipart/alternative; boundary=001a1141b4b058bcfe054b826422
archived-at: Fri, 24 Mar 2017 23:31:28 -0000

--001a1141b4b058bcfe054b826422
Content-Type: text/plain; charset=UTF-8

> taking advantage of underlying datasource capabilities (predicate
> pushdown, projection, etc.) is important to improve query performance.

That is very true. There was a previous discussion about replacing HBase
with Cassandra
<http://apache-kylin.74782.x6.nabble.com/Cassandra-instead-of-HBase-in-Kylin-td2688.html>.
The worry there was that the lack of coprocessors would prevent predicate
and aggregation pushdown. A similar concern exists for Kudu.
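
To make the concern concrete, here is a minimal sketch in plain Python. It
is only an illustration under my own assumptions: the toy rows and the
function names are hypothetical, and none of this is a Kylin, HBase,
Cassandra, or Kudu API. It compares how many rows leave the storage layer
when the filter and aggregation run in the query engine versus inside the
storage, which is the role a coprocessor plays.

```python
# Hypothetical toy model of storage-side pushdown (not a real Kylin/HBase API).

ROWS = [
    {"region": "US", "year": 2016, "sales": 10},
    {"region": "US", "year": 2017, "sales": 20},
    {"region": "EU", "year": 2017, "sales": 5},
    {"region": "EU", "year": 2017, "sales": 7},
]


def scan_without_pushdown(rows):
    """Storage returns every row; the query engine filters and aggregates."""
    shipped = list(rows)  # every row crosses the network
    total = sum(r["sales"] for r in shipped if r["year"] == 2017)
    return total, len(shipped)


def scan_with_pushdown(rows, predicate, column):
    """Storage applies the predicate and the aggregation itself,
    then ships back a single aggregated value."""
    partial = sum(r[column] for r in rows if predicate(r))
    return partial, 1  # only one value crosses the network


total_a, shipped_a = scan_without_pushdown(ROWS)
total_b, shipped_b = scan_with_pushdown(ROWS, lambda r: r["year"] == 2017, "sales")
print(total_a, shipped_a)
print(total_b, shipped_b)
```

Both paths return the same total; the difference is how much data has to
leave the storage layer, and that is what we would lose without something
like a coprocessor.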

Cheers
Yang

On Fri, Mar 24, 2017 at 12:50 AM, Nirav Patel <npatel@xactlycorp.com> wrote:

> Thanks for logging those improvements. I think the decision about replacing
> HBase or using any other NoSQL datastore for storing cubes would be based
> on many factors, but one important factor I can think of is the query
> engine/optimizer of each of those datasources. I think taking advantage of
> underlying datasource capabilities (predicate pushdown, projection, etc.) is
> important to improve query performance.
>
> Cheers,
> Nirav
>
> On Mon, Mar 20, 2017 at 12:23 PM, Li Yang <liyang@apache.org> wrote:
>
>> Hi Nirav,
>>
>> Glad to see you on the mailing list!!
>>
>> Yes, this is a great idea and it is on the roadmap. (This reminds me, I
>> should update the roadmap on the Kylin website soon.)
>>
>> However, there are many moving parts that affect how we approach it. E.g.
>>
>> - If the coprocessor is retired, do we still need HBase?
>> - If HBase is retired, what is the alternative storage? What about
>> metadata?
>> - There are other ways to integrate SparkSQL (KYLIN-2515); how do they
>> fit in?
>>
>> There is a lot of work to do in this direction, I would say.
>>
>> Cheers
>> Yang
>>
>> On Tue, Mar 21, 2017 at 2:05 AM, Nirav Patel <npatel@xactlycorp.com>
>> wrote:
>>
>>> Hi,
>>>
>>> At the recent Strata conference I asked whether Kylin can support
>>> SparkSQL as a query engine, or have a Kylin query result set converted
>>> into a Spark Dataset (DataFrame) on which users can perform further
>>> distributed computation.
>>> The reasons are:
>>> 1) Some flavors of HBase don't support coprocessors
>>> 2) A SparkSQL UDF is much easier to develop than an HBase coprocessor
>>> 3) Users can write their own Spark UDFs and run any custom aggregation
>>>
>>> Is this on the roadmap?
>>>
>>> Thanks,
>>> Nirav
>>>
>>>
>>>
>>> [image: What's New with Xactly] <http://www.xactlycorp.com/email-click/>
>>>
>>> <https://www.nyse.com/quote/XNYS:XTLY>  [image: LinkedIn]
>>> <https://www.linkedin.com/company/xactly-corporation>  [image: Twitter]
>>> <https://twitter.com/Xactly>  [image: Facebook]
>>> <https://www.facebook.com/XactlyCorp>  [image: YouTube]
>>> <http://www.youtube.com/xactlycorporation>
>>
>>
>>

--001a1141b4b058bcfe054b826422--