From: Carlos Alonso
To: user@cassandra.apache.org
Date: Fri, 30 Oct 2015 12:04:34 +0000
Subject: Re: Cassandra Data Model with Narrow partition

Hi Chandra,

Narrow partition is probably your best choice, but you need to bucket the data somehow; otherwise your partitions will soon become unmanageable and you'll have trouble reading them, both because the partitions will grow very large and because of the tombstones that your expired records will generate.

In general, a partition that can grow indefinitely is a bad idea, so I'd advise you to use time-based artificial bucketing to keep the maximum size of your partitions as close as possible to the recommendations.
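For example, something along these lines. This is only a sketch: I'm guessing at your table and column names (event_log, seq_no, day_bucket, payload and so on), and the one-day bucket is just an illustration; you'd size the bucket so that a single partition stays well below the ~100MB you mention.

CREATE TABLE event_log (
    seq_no     varint,
    day_bucket int,           -- artificial time bucket, e.g. 20151030 for one-day buckets
    event_date timestamp,
    payload    text,          -- placeholder for your 120+ columns
    PRIMARY KEY ((seq_no, day_bucket), event_date)
) WITH default_time_to_live = 5184000;  -- 60 days, matching your retention period

The trade-off is that a read by sequence number alone then has to query every bucket in the range you care about, so the bucket size is a balance between partition size and read fan-out.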
Also, 120+ columns sounds like quite a lot. Is there a way you can split them across different column families, or maybe use collections? I'd advise doing some benchmarking here: http://mrcalonso.com/benchmarking-cassandra-models/. That post is a bit outdated, as nowadays you can use cassandra-stress with your own models, but the idea is the same.
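If it helps to picture the collections idea: the handful of frequently read columns could stay top level, and the long tail of rarely read attributes could be folded into a map. Again, the table and column names below are hypothetical, and bear in mind that a collection is always read back as a whole.

ALTER TABLE event_log ADD attributes map<text, text>;

INSERT INTO event_log (seq_no, day_bucket, event_date, payload, attributes)
VALUES (123456789012345678901234567, 20151030, '2015-10-30 12:00:00+0000', 'core payload',
        {'source_system': 'billing', 'region': 'EU'});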
About compaction, I'd use DTCS or LCS; given that you will have a large number of tombstones due to the TTLs, I'd never go with STCS.
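On the sketched table from above, that could look roughly like this (DTCS shown; the value is a starting point to tune, not a recommendation):

ALTER TABLE event_log
WITH compaction = {
    'class': 'DateTieredCompactionStrategy',
    'max_sstable_age_days': '60'  -- stop re-compacting SSTables once they're older than the 60-day TTL
};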

Hope it helps!

Carlos Alonso | Software Engineer | @calonso

On 30 October 2015 at 10:55, <chandrasekar.krc@wipro.com> wrote:

> Hi,

> Could you please suggest if a narrow partition is a good choice for the use case below.
> 1) Write-heavy event log table with 50M inserts per day and a peak load of 20K transactions per second. There aren't any updates/deletes to the inserted records. Records are inserted with a TTL of 60 days (the retention period).

> 2) The table has a single primary key, which is a 27-digit sequence number generated by the source application.

> 3) There are only two access patterns: one using the sequence number, the other using the sequence number + event date (range scans are also possible).

> 4) My target data model in Cassandra is partitioned with the sequence number as the primary key and the event date as a clustering column, to enable range scans on date.

> 5) The table has 120+ columns and the average row size is close to 32 KB.

> 6) Reads are very light, accounting for <5% of traffic, while inserts can be close to 95%.

> 7) From a functional standpoint, I do not see any other columns that could be part of the primary key to keep the partition reasonable (<100 MB).
> Questions:

> 1) Is a narrow partition an ideal choice for the above use case?

> 2) Is artificial bucketing an alternative choice to keep the partition size reasonable?

> 3) We are using varint as the data type for the sequence number, which is 27 digits long. Would the DECIMAL data type be a better choice?

> 4) Any suggestions on performance impacts during compaction?

> Regards, Chandra Sekar KR

