Subject: Re: Ultra wide row anti pattern
From: Yogi Nerella <ynerella999@gmail.com>
To: user@cassandra.apache.org
Date: Tue, 4 Feb 2014 09:02:06 -0800

Sorry, I am not understanding the problem; I am new to Cassandra and want
to understand this issue.

Why do we need to use a wide row for this situation? Why not a simple
table in Cassandra?

todolist (user, state)  ==> is there any other information in this table
that is needed for processing a todo?
processedlist (user, state)
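
Something like this is what I have in mind (a rough, untested pycassa
sketch; the keyspace, column family names, and user key are just
placeholders, not anything from your design):

    import pycassa
    from pycassa.columnfamily import ColumnFamily

    pool = pycassa.ConnectionPool('mykeyspace', ['localhost:9160'])
    todolist = ColumnFamily(pool, 'todolist')
    processedlist = ColumnFamily(pool, 'processedlist')

    # The loader marks a user as pending.
    todolist.insert('user123', {'state': 'TODO'})

    # The worker processes the user, records the result, and clears
    # the pending entry.
    processedlist.insert('user123', {'state': 'PROCESSED'})
    todolist.remove('user123')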

On Tue, Feb 4, 2014 at 7:50 AM, Edward Capriolo <edlinuxguru@gmail.com> wrote:

> I have actually been building something similar in my spare time. You can
> hang around and wait for it or build your own. Here are the basics -- not
> perfect, but it will work.
>
> Create a column family "queue" with gc_grace_period = [1 day]
>
> set queue [timeuuid()] ["z" + timeuuid()] = [work to do]
>
> The producer can decide how it wants to roll over the row key and the
> column key; it does not matter.
>
> Suppose there are N consumers. We need a way for the consumers not to do
> the same work. We can use something like the bakery algorithm. Remember:
> at QUORUM a reader sees writes.
>
> A consumer needs an identifier (it could be another uuid or an IP address).
> A consumer calls get_range_slice on the queue; the slice is from new
> byte[] to new byte[] with limit 100.
>
> The consumer sees data like this:
>
> [1234] [z-$timeuuid] = data
>
> Now we register that this consumer wants to consume this queue:
>
> set [1234] [a-$ip] at QUORUM
>
> Now we do a slice:
>
> get_slice [1234] from new byte[] to 'b'
>
> There are a few possible returns.
>
> 1) 1 bidder:
> [1234] [a-$myip]
> You won; start consuming.
>
> 2) 2 bidders:
> [1234] [a-$myip]
> [1234] [a-$otherip]
> Compare $myip vs $otherip; the higher one wins.
>
> Whoever wins can then start consuming the columns in the queue and delete
> them when done.
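>
> In Python the bid-and-elect step might look roughly like this (an
> untested sketch using pycassa-style calls; 'mykeyspace', the row key,
> and process() are placeholders):
>
>     import socket
>     import pycassa
>     from pycassa import ConsistencyLevel
>     from pycassa.columnfamily import ColumnFamily
>
>     pool = pycassa.ConnectionPool('mykeyspace', ['localhost:9160'])
>     queue = ColumnFamily(pool, 'queue',
>                          read_consistency_level=ConsistencyLevel.QUORUM,
>                          write_consistency_level=ConsistencyLevel.QUORUM)
>     my_id = socket.gethostbyname(socket.gethostname())
>
>     def try_claim(row_key):
>         # Bid by writing an 'a-' column; work columns start with 'z',
>         # so all bids sort before the work items.
>         queue.insert(row_key, {'a-' + my_id: ''})
>         # Read every bid back at QUORUM; the highest bidder wins.
>         bids = queue.get(row_key, column_start='a', column_finish='b')
>         return max(bids) == 'a-' + my_id
>
>     if try_claim('1234'):
>         # We won the row: consume its work columns and delete them.
>         work = queue.get('1234', column_start='z', column_finish='')
>         for col, data in work.items():
>             process(data)                  # application-specific handler
>             queue.remove('1234', columns=[col])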
>
> On Friday, January 31, 2014, DuyHai Doan <doanduyhai@gmail.com> wrote:
> > Thanks Nat for your ideas.
> >
> >> This could be as simple as adding year and month to the primary key
> >> (in the form 'yyyymm'). Alternatively, you could add this in the
> >> partition definition. Either way, it then becomes pretty easy to
> >> re-generate these based on the query parameters.
> >
> > The thing is that it's not that simple. My customer has a very BAD
> > idea: using Cassandra as a queue (the perfect anti-pattern).
> >
> > Before trying to tell them to redesign their entire architecture and
> > put in some queueing system like ActiveMQ or something similar, I
> > would like to see how I can use wide rows to meet the requirements.
> >
> > The functional need is quite simple:
> >
> > 1) A process A loads users into Cassandra and sets the status on each
> > user to 'TODO'. When using the bucketing technique, we can limit a row
> > width to, let's say, 100,000 columns. So at the end of the current
> > row, process A knows that it should move to the next bucket. The
> > bucket is coded into a composite partition key; in our example it
> > would be 'TODO:1', 'TODO:2', etc.
> >
> > 2) A process B reads the wide row for the 'TODO' status. It starts at
> > bucket 1, so it reads the row with partition key 'TODO:1'. The users
> > are processed and inserted in a new row, 'PROCESSED:1' for example, to
> > keep track of the status. After retrieving 100,000 columns, it
> > switches automatically to the next bucket. Simple. Fair enough.
> >
> > 3) Now what sucks is that sometimes process B does not have enough
> > data to perform the functional logic on the users it fetched from the
> > wide row, so it has to put some users back into the 'TODO' status
> > rather than transitioning them to 'PROCESSED'. That is exactly queue
> > behavior.
> >
> > A simplistic idea would be to insert those m users again as 'TODO:n',
> > with n higher than the current bucket number, so they can be processed
> > later. But then it screws up the whole counting system. Process A,
> > which inserts data, will not know that there are already m users in
> > row n, so it will happily add 100,000 columns, making the row grow to
> > 100,000 + m columns. When process B reads this row back, it will stop
> > at the first 100,000 columns and skip the trailing m elements.
> >
> > That's the main reason I dropped the idea of bucketing (which is quite
> > smart in the normal case) in favor of an ultra wide row.
> >
> > Anyway, I'll follow your advice and play around with the parameters of
> > SizeTiered.
> >
> > Regards
> > Duy Hai DOAN
> >
> > On Fri, Jan 31, 2014 at 9:23 PM, Nate McCall <nate@thelastpickle.com> wrote:
> >>>
> >>> The only drawback for the ultra wide row I can see is point 1). But
> >>> if I use leveled compaction with a sufficiently large value for
> >>> "sstable_size_in_mb" (let's say 200 MB), will my read performance be
> >>> impacted as the row grows?
> >>
> >> For this use case, you would want to use SizeTieredCompaction and
> >> play around with the configuration a bit to keep a small number of
> >> large SSTables. Specifically: keep min|max_threshold really low, set
> >> bucket_low and bucket_high closer together (maybe even both to 1.0),
> >> and maybe use a larger min_sstable_size.
> >> YMMV though - per Rob's suggestion, take the time to run some tests
> >> tweaking these options.
> >>
> >>> Of course, splitting a wide row into several rows using the
> >>> bucketing technique is one solution, but it forces us to keep track
> >>> of the bucket number, and that's not convenient. We have one process
> >>> (JVM) that inserts data and another process (JVM) that reads data.
> >>> Using bucketing, we need to synchronize the bucket number between
> >>> the two processes.
> >>
> >> This could be as simple as adding year and month to the primary key
> >> (in the form 'yyyymm'). Alternatively, you could add this in the
> >> partition definition. Either way, it then becomes pretty easy to
> >> re-generate these based on the query parameters.
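> >>
> >> As a rough illustration (untested; bucket_key and the 'TODO' status
> >> value are made up), both JVMs can derive the same bucket from the
> >> clock instead of synchronizing a counter:
> >>
> >>     from datetime import datetime
> >>
> >>     def bucket_key(status, when=None):
> >>         # Writer and reader both compute 'TODO:201402'-style keys
> >>         # from the timestamp, so no shared counter is required.
> >>         when = when or datetime.utcnow()
> >>         return '%s:%s' % (status, when.strftime('%Y%m'))
> >>
> >>     row = bucket_key('TODO')    # e.g. 'TODO:201402'
> >>
> >> The consumer re-generates the same key for the current month, or
> >> walks backwards month by month to drain older buckets.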
> >>
> >> --
> >> -----------------
> >> Nate McCall
> >> Austin, TX
> >> @zznate
> >>
> >> Co-Founder & Sr. Technical Consultant
> >> Apache Cassandra Consulting
> >> http://www.thelastpickle.com