From: DuyHai Doan <doanduyhai@gmail.com>
Date: Tue, 6 Jun 2017 20:38:43 +0200
Subject: Re: Order by for aggregated values
To:
"Roger Fischer (CW)" Cc: "user@cassandra.apache.org" Content-Type: multipart/alternative; boundary="001a1145a5b61516b305514eef13" archived-at: Tue, 06 Jun 2017 18:39:10 -0000 --001a1145a5b61516b305514eef13 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable The problem is not that it's not feasible from Cassandra side, it is The problem is when doing arbitrary ORDER BY, Cassandra needs to resort to in-memory sorting of a potentially huge amout of data --> more pressure on heap --> impact on cluster stability Whereas delegating this kind of job to Spark which has appropriate data structure to lower heap pressure (Dataframe, project tungsten) is a better idea. "but in the Top N use case, far more data has to be transferred to the client when the client has to do the sorting" --> It is not true if you co-located your Spark worker with Cassandra nodes. In this case, Spark reading data out of Cassandra nodes are always node-local On Tue, Jun 6, 2017 at 6:20 PM, Roger Fischer (CW) wrote: > Hi DuyHai, > > > > this is in response to the other points in your response. > > > > My application is a real-time application. It monitors devices in the > network and displays the top N devices for various parameters averaged ov= er > a time period. A query may involve anywhere from 10 to 50k devices, and > anywhere from 5 to 2000 intervals. We expect a query to take less than 2 > seconds. > > > > My impression was that Spark is aimed at larger scale analytics. > > > > I am ok with the limitation on =E2=80=9Cgroup by=E2=80=9D. I am intending= to use async > queries and token-aware load balancing to partition the query and execute > it in parallel on each node. 
>
> Thanks…
>
> Roger
>
> *From:* DuyHai Doan [mailto:doanduyhai@gmail.com]
> *Sent:* Tuesday, June 06, 2017 12:31 AM
> *To:* Roger Fischer (CW)
> *Cc:* user@cassandra.apache.org
> *Subject:* Re: Order by for aggregated values
>
> First, GROUP BY is only allowed on partition keys and clustering columns,
> not on arbitrary columns. The internal implementation of GROUP BY tries to
> fetch data in clustering order to avoid having to "re-sort" it in memory,
> which would be very expensive.
>
> Second, GROUP BY works best when restricted to a single partition;
> otherwise it forces Cassandra to do a range scan, with poor performance.
>
> For all of those reasons, I don't expect an "order by" on aggregated
> values to be available any time soon.
>
> Furthermore, Cassandra is optimised for real-time transactional scenarios,
> while group by/order by/limit is typically a classical analytics scenario.
> I would recommend the appropriate tool, like Spark, for that.
>
> On 6 Jun 2017 04:00, "Roger Fischer (CW)" wrote:
>
> Hello,
>
> is there any intent to support "order by" and "limit" on aggregated values?
>
> For time series data, top N queries are quite common. Group-by was the
> first step towards supporting such queries, but ordering by value and
> limiting the results are also required.
>
> Thanks…
>
> Roger
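Roger's plan — partition the query by node, aggregate in parallel, merge on the client — only yields correct averages if the per-node partial results are mergeable, i.e. (sum, count) pairs rather than averages. A minimal sketch in Python (hypothetical in-memory data and helper names; no real Cassandra driver — in practice each worker would run a token-restricted CQL query asynchronously):

```python
import heapq
from concurrent.futures import ThreadPoolExecutor

# Hypothetical per-node samples: device -> value readings.
# Stands in for the rows each node would return for its token ranges.
NODE_DATA = {
    "node1": [("dev1", 10.0), ("dev2", 30.0), ("dev1", 20.0)],
    "node2": [("dev2", 50.0), ("dev3", 5.0)],
    "node3": [("dev1", 60.0), ("dev3", 15.0)],
}

def partial_aggregate(samples):
    """Per-node partial result: device -> (sum, count).
    Averages are not mergeable across nodes, but (sum, count) pairs are."""
    acc = {}
    for device, value in samples:
        s, c = acc.get(device, (0.0, 0))
        acc[device] = (s + value, c + 1)
    return acc

def top_n_devices(node_data, n):
    # Scatter: one partial aggregation per node, run in parallel,
    # standing in for async token-aware queries.
    with ThreadPoolExecutor() as pool:
        partials = pool.map(partial_aggregate, node_data.values())
    # Gather: merge (sum, count) pairs, then compute averages.
    merged = {}
    for part in partials:
        for device, (s, c) in part.items():
            ms, mc = merged.get(device, (0.0, 0))
            merged[device] = (ms + s, mc + c)
    averages = {d: s / c for d, (s, c) in merged.items()}
    # Only the final top-N selection happens on the client.
    return heapq.nlargest(n, averages.items(), key=lambda kv: kv[1])

print(top_n_devices(NODE_DATA, 2))  # -> [('dev2', 40.0), ('dev1', 30.0)]
```

The point of the (sum, count) representation is that the client only merges one small pair per device per node and sorts N candidate averages, rather than sorting raw samples — the work Cassandra would otherwise have to do in memory.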
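DuyHai's point about the GROUP BY implementation — rows are read in clustering order, so each group can be emitted as soon as its key changes, with no in-memory re-sort — can be illustrated with a streaming group-by sketch in Python (illustrative only; the rows below are hypothetical and stand in for a partition's clustering-ordered data):

```python
from itertools import groupby

# Rows as (partition_key, clustering_key, value), already sorted by
# clustering key within the partition -- the order Cassandra reads them in.
rows = [
    ("sensor1", "2017-06-06T10:00", 1.0),
    ("sensor1", "2017-06-06T10:00", 3.0),
    ("sensor1", "2017-06-06T11:00", 5.0),
    ("sensor1", "2017-06-06T11:00", 7.0),
]

def grouped_averages(sorted_rows):
    # Because the input arrives sorted by the grouping key, each group is
    # complete as soon as the key changes: constant memory, no re-sort.
    for (pk, ck), group in groupby(sorted_rows, key=lambda r: (r[0], r[1])):
        values = [v for _, _, v in group]
        yield (pk, ck, sum(values) / len(values))

for pk, ck, avg in grouped_averages(rows):
    print(pk, ck, avg)
```

An ORDER BY on the aggregated average would break exactly this property: every group would have to be materialized and sorted after aggregation, which is the heap pressure described above.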