Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Date: Wed, 09 Nov 2016 05:11:26 -0500
From: Vladimir Yudovin <vladyu@winguzone.com>
To: "user" <user@cassandra.apache.org>
Message-Id: <158489220c0.bda6a2eb120886.4737303318592947827@winguzone.com>
In-Reply-To: <TY1PR0201MB0910C12A37B431F5DB15C5AFFFB90@TY1PR0201MB0910.apcprd02.prod.outlook.com>
References: <TY1PR0201MB0910D5251BA7FB4DC9CA0169FFA60@TY1PR0201MB0910.apcprd02.prod.outlook.com>,<158445dcfc8.ca493bb728302.6473184930597507436@winguzone.com> <TY1PR0201MB0910C12A37B431F5DB15C5AFFFB90@TY1PR0201MB0910.apcprd02.prod.outlook.com>
Subject: =?UTF-8?Q?Re:_=E7=AD=94=E5=A4=8D:_A_difficult_data_model_with_C*?=
MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----=_Part_383316_294014251.1478686286024"
User-Agent: Zoho Mail
archived-at: Wed, 09 Nov 2016 10:11:46 -0000

------=_Part_383316_294014251.1478686286024
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

You are welcome! )


&gt;recent ten movies watched by the user within 30 days.

In this case you can't use PRIMARY KEY (user_name, video_id), as video_id i=
s demanded to fetch row, so all this stuff may be

CREATE TYPE play (video_id text, position int, last_time timestamp);

CREATE TABLE recent (user_name text PRIMARY KEY, play_list LIST&lt;frozen&l=
t;play&gt;&gt;);


You can easily retrieve play list for specific user by his ID. Instead of L=
IST you can use MAP, I don't think that for ten entries it matters.


Best regards, Vladimir Yudovin,=20

Winguzone - Hosted Cloud Cassandra
Launch your cluster in minutes.


---- On Tue, 08 Nov 2016 22:29:48 -0500ben ben &lt;diamond.ben@outlook.com&=
gt; wrote ----


Hi Vladimir Yudovin,


    Thank you very much for your detailed explaining. Maybe I didn't descri=
be the requirement clearly. The use cases should be:

1. a user login our app.

2. show the recent ten movies watched by the user within 30 days.

3. the user can click any one of the ten movie and continue to watch from t=
he last position she/he did. BTW, a movie can be watched several times by a=
 user and the last positon is needed indeed.


BRs,

BEN


=E5=8F=91=E4=BB=B6=E4=BA=BA: Vladimir Yudovin &lt;vladyu@winguzone.com&gt;
 =E5=8F=91=E9=80=81=E6=97=B6=E9=97=B4: 2016=E5=B9=B411=E6=9C=888=E6=97=A5 2=
2:35:48
 =E6=94=B6=E4=BB=B6=E4=BA=BA: user
 =E4=B8=BB=E9=A2=98: Re: A difficult data model with C*=20
=20


Hi Ben,


if need very limited number of positions (as you said ten) may be you can s=
tore them in LIST of UDT? Or just as JSON string?

So you'll have one row per each pair user-video.=20


It can be something like this:


CREATE TYPE play (position int, last_time timestamp);

CREATE TABLE recent (user_name text, video_id text, review LIST&lt;frozen&l=
t;play&gt;&gt;, PRIMARY KEY (user_name, video_id));


UPDATE recent set review =3D review + [(1234,12345)] where user_name=3D'som=
e user' AND video_id=3D'great video';

UPDATE recent set review =3D review + [(1234,123456)] where user_name=3D'so=
me user' AND video_id=3D'great video';

UPDATE recent set review =3D review + [(1234,1234567)] where user_name=3D's=
ome user' AND video_id=3D'great video';


You can delete the oldest entry by index:

DELETE review[0] FROM recent WHERE user_name=3D'some user' AND video_id=3D'=
great video';


or by value, if you know the oldest entry:


UPDATE recent SET review =3D review - [(1234,12345)]  WHERE user_name=3D'so=
me user' AND video_id=3D'great video';


Best regards, Vladimir Yudovin,=20

Winguzone - Hosted Cloud Cassandra
 Launch your cluster in minutes.


---- On Mon, 07 Nov 2016 21:54:08 -0500ben ben &lt;diamond.ben@outlook.com&=
gt; wrote ----


Hi guys,


We are maintaining a system for an on-line video service. ALL users' viewin=
g records of every movie are stored in C*. So she/he can continue to enjoy =
the movie from the last point next time. The table is designed as below:

CREATE TABLE recent (

user_name text,

vedio_id text,

position int,

last_time timestamp,

PRIMARY KEY (user_name, vedio_id)

)


It worked well before. However, the records increase every day and the last=
 ten items may be adequate for the business. The current model use vedio_id=
 as cluster key to keep a row for a movie, but as you know, the business pr=
efer to order by the last_time desc. If we use last_time as cluster key, th=
ere will be many records for a singe movie and the recent one is actually d=
esired. So how to model that? Do you have any suggestions?=20

Thanks!


BRs,

BEN


------=_Part_383316_294014251.1478686286024
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"><html><head>=
<meta content=3D"text/html;charset=3DUTF-8" http-equiv=3D"Content-Type"></h=
ead><body ><div style=3D'font-size:10pt;font-family:Verdana,Arial,Helvetica=
,sans-serif;'><div>You are welcome! )<br></div><div><br></div><div><span cl=
ass=3D"highlight" style=3D"background-color:#ffffcc">&gt;recent ten movies =
watched by the user&nbsp;within 30 days.</span><br></div><div>In this case =
you can't use&nbsp;PRIMARY KEY (user_name, video_id), as video_id is demand=
ed to fetch row, so all this stuff may be<br></div><blockquote style=3D"bor=
der: 1px solid rgb(204, 204, 204); padding: 7px; background-color: rgb(245,=
 245, 245);"><div><div>CREATE TYPE play (video_id text, position int, last_=
time timestamp);<br></div><div>CREATE TABLE recent (user_name text PRIMARY =
KEY, play_list LIST&lt;frozen&lt;play&gt;&gt;);<br></div></div></blockquote=
><div>You can easily retrieve play list for specific user by his ID. Instea=
d of LIST you can use MAP, I don't think that for ten entries it matters.</=
div><div><br></div><div><br></div><div id=3D""><div>Best regards, Vladimir =
Yudovin, <br></div><div><i><a target=3D"_blank" href=3D"https://winguzone.c=
om?from=3Dlist">Winguzone</a> - Hosted Cloud Cassandra<br>Launch your clust=
er in minutes.</i></div></div><div><br></div><div class=3D"zmail_extra"><di=
v id=3D"1"><div><br></div><div>---- On Tue, 08 Nov 2016 22:29:48 -0500<b>be=
n ben &lt;diamond.ben@outlook.com&gt;</b> wrote ----<br></div></div><div><b=
r></div><blockquote style=3D"border-left: 1px solid #cccccc; padding-left: =
6px; margin:0 0 0 5px"><div><div style=3D"font-size: 12.0pt;color: rgb(0,0,=
0);font-family: Calibri , Arial , Helvetica , sans-serif;" dir=3D"ltr"><p>H=
i <span class=3D"font" style=3D"font-family:Verdana,Arial,Helvetica,sans-se=
rif"><span class=3D"size" style=3D"font-size:13px"><span class=3D"size" sty=
le=3D"font-size:10pt">Vladimir Yudovin</span></span></span>,<br></p><p><br>=
</p><p>&nbsp;&nbsp;&nbsp; Thank you very much for your detailed explaining.=
 Maybe I didn't describe the requirement clearly. The use cases should be:<=
br></p><p>1. a user login our app.<br></p><p>2. show the recent ten movies =
watched by the user&nbsp;<span>within 30 days.</span><br></p><p>3. the user=
 can click&nbsp;<span>any one of the ten movie </span>and continue to watch=
 from the last position she/he did. BTW, a movie can be watched several tim=
es by a user and the last positon is needed indeed.<br></p><div><br></div><=
p>BRs,<br></p><p>BEN<br></p></div><div><hr style=3D"width: 98.0%;"><br></di=
v><div dir=3D"ltr"><div><span class=3D"font" style=3D"font-family:Calibri, =
sans-serif"><span class=3D"colour" style=3D"color:#000000"><b>=E5=8F=91=E4=
=BB=B6=E4=BA=BA:</b> Vladimir Yudovin &lt;<a href=3D"mailto:vladyu@winguzon=
e.com" target=3D"_blank">vladyu@winguzone.com</a>&gt;<br> <b>=E5=8F=91=E9=
=80=81=E6=97=B6=E9=97=B4:</b> 2016=E5=B9=B411=E6=9C=888=E6=97=A5 22:35:48<b=
r> <b>=E6=94=B6=E4=BB=B6=E4=BA=BA:</b> user<br> <b>=E4=B8=BB=E9=A2=98:</b> =
Re: A difficult data model with C*</span></span> </div><div>&nbsp;<br></div=
></div><div><div style=3D"font-size: 10.0pt;font-family: Verdana , Arial , =
Helvetica , sans-serif;"><div>Hi Ben,<br></div><div><br></div><div>if need =
very limited number of positions (as you said ten) may be you can store the=
m in LIST of UDT? Or just as JSON string?<br></div><div>So you'll have one =
row per each pair user-video. <br></div><div><br></div><div>It can be somet=
hing like this:<br></div><div><br></div><div>CREATE TYPE play (position int=
, last_time timestamp);<br></div><div>CREATE TABLE recent (user_name text, =
video_id text, review LIST&lt;frozen&lt;play&gt;&gt;, PRIMARY KEY (user_nam=
e, video_id));<br></div><div><br></div><div>UPDATE recent set review =3D re=
view + [(1234,12345)] where user_name=3D'some user' AND video_id=3D'great v=
ideo';<br></div><div>UPDATE recent set review =3D review + [(1234,123456)] =
where user_name=3D'some user' AND video_id=3D'great video';<br></div><div>U=
PDATE recent set review =3D review + [(1234,1234567)] where user_name=3D'so=
me user' AND video_id=3D'great video';<br></div><div><br></div><div>You can=
 delete the oldest entry by index:<br></div><div>DELETE review[0] FROM rece=
nt WHERE user_name=3D'some user' AND video_id=3D'great video';<br></div><di=
v><br></div><div>or by value, if you know the oldest entry:<br></div><div><=
br></div><div>UPDATE recent SET review =3D review - [(1234,12345)]&nbsp; WH=
ERE user_name=3D'some user' AND video_id=3D'great video';<br></div><div><br=
></div><div><div>Best regards, Vladimir Yudovin, <br></div><div><i><a targe=
t=3D"_blank" href=3D"https://winguzone.com?from=3Dlist">Winguzone</a> - Hos=
ted Cloud Cassandra<br> Launch your cluster in minutes.</i></div></div><div=
><br></div><div class=3D"zmail_extra"><div><div><br></div><div>---- On Mon,=
 07 Nov 2016 21:54:08 -0500<b>ben ben &lt;<a href=3D"mailto:diamond.ben@out=
look.com" target=3D"_blank">diamond.ben@outlook.com</a>&gt;</b> wrote ----<=
br></div></div><div><br></div><blockquote style=3D"border-left: 1.0px solid=
 rgb(204,204,204);padding-left: 6.0px;margin: 0 0 0 5.0px;"><div><div style=
=3D"font-size: 12.0pt;color: rgb(0,0,0);font-family: Calibri , Arial , Helv=
etica , sans-serif;" dir=3D"ltr"><p><br></p><div><div>Hi guys,<br></div><di=
v><br></div><div>We are maintaining a system for an on-line video service. =
ALL users' viewing records of every movie are stored in C*. So she/he can c=
ontinue to enjoy the movie from the last point next time. The table is desi=
gned as below:<br></div><div>CREATE TABLE recent (<br></div><div>user_name =
text,<br></div><div>vedio_id text,<br></div><div>position int,<br></div><di=
v>last_time timestamp,<br></div><div>PRIMARY KEY (user_name, vedio_id)<br><=
/div><div>)<br></div><div><br></div><div>It worked well before. However, th=
e records increase every day and the last ten items may be adequate for the=
 business. The current model use vedio_id as cluster key to keep a row for =
a movie, but as you know, the business prefer to order by the last_time  de=
sc. If we use last_time as cluster key, there will be many records for a si=
nge movie and the recent one is actually desired. So how to model that? Do =
you have any suggestions? <br></div><div>Thanks!<br></div><div><br></div><d=
iv><br></div><div>BRs,<br></div><div>BEN<br></div></div><p><br></p></div></=
div></blockquote></div><div><br></div></div></div></div></blockquote></div>=
<div><br></div></div></body></html>
------=_Part_383316_294014251.1478686286024--