Subject: Re: Queuing System
From: DuyHai Doan <doanduyhai@gmail.com>
To: user@cassandra.apache.org
Date: Sat, 22 Feb 2014 18:18:19 +0100

Jagan,

Some time ago I dealt with a similar queuing design for a customer. *If you
never delete messages from the queue*, it is possible to use wide rows with
bucketing and a monotonically increasing column name to store the messages:

CREATE TABLE read_only_queue (
    bucket_number int,
    insertion_time timeuuid,
    message text,
    PRIMARY KEY(bucket_number, insertion_time)
);

Let's say you allow at most 100 000 messages per partition (physical row), to
avoid rows that are too wide. Inserting into and reading from the table
*read_only_queue* is then easy.

For the message producer:

   1) Start at bucket_number = 1
   2) Insert each message with column name = a generated timeUUID (use
      micro-second precision if the insertion rate is high)
   3) When the message count reaches 100 000, increment bucket_number by one
      and go back to 2)
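Here is a minimal sketch of the producer loop. It assumes the DataStax Python
driver (cassandra-driver), a single local node and a keyspace named "demo";
those names are illustrative, not part of the original design:

import uuid
from cassandra.cluster import Cluster

MESSAGES_PER_BUCKET = 100000  # the 100 000-message cap discussed above

# Assumed contact point and keyspace name:
session = Cluster(['127.0.0.1']).connect('demo')

insert = session.prepare(
    "INSERT INTO read_only_queue (bucket_number, insertion_time, message) "
    "VALUES (?, ?, ?)")

def produce(messages):
    bucket, count = 1, 0                   # step 1: start at bucket 1
    for msg in messages:
        # step 2: uuid1() is a time-based UUID, so within one producer the
        # column name increases monotonically (100 ns resolution, which
        # covers the micro-second precision mentioned above)
        session.execute(insert, (bucket, uuid.uuid1(), msg))
        count += 1
        if count == MESSAGES_PER_BUCKET:   # step 3: roll over to the next
            bucket, count = bucket + 1, 0  # bucket and keep going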
For the message reader:

   1) Start at bucket_number = 1
   2) Read messages in slices of N and save the insertion_time of the last
      message read
   3) Use the saved insertion_time to perform the next slice query
   4) When the count of read messages reaches 100 000, increment bucket_number
      and go back to 2). Keep the saved insertion_time, do not reset it: its
      value increases monotonically
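A matching sketch of the reader loop, under the same assumptions as the
producer sketch (the sentinel start value and the slice size of 100 stand in
for the "N" above and are illustrative):

from cassandra.util import uuid_from_time

SLICE_SIZE = 100  # the slice size "N" of step 2

select = session.prepare(
    "SELECT insertion_time, message FROM read_only_queue "
    "WHERE bucket_number = ? AND insertion_time > ? "
    "LIMIT %d" % SLICE_SIZE)

def consume(handle):
    bucket = 1                        # step 1: start at bucket 1
    last_seen = uuid_from_time(0)     # sentinel older than any real message
    read_in_bucket = 0
    while True:
        rows = list(session.execute(select, (bucket, last_seen)))
        if not rows:
            break  # caught up; a real consumer would poll or sleep instead
        for row in rows:
            handle(row.message)
            last_seen = row.insertion_time  # steps 2-3: advance the cursor
        read_in_bucket += len(rows)
        if read_in_bucket == MESSAGES_PER_BUCKET:
            # step 4: move to the next bucket but keep last_seen as-is
            bucket, read_in_bucket = bucket + 1, 0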
For multiple concurrent producers and consumers there is a trick. Let's
assume you have *P* concurrent producers and *C* concurrent consumers. Assign
a numerical ID to each producer and each consumer: first producer ID = 1 ...
last producer ID = *P*, and the same for consumers. Then:

  - re-use the above algorithm
  - each producer/consumer starts at bucket_number = its own ID
  - at the end of a row:
      - next bucket_number = current bucket_number + *P* for producers
      - next bucket_number = current bucket_number + *C* for consumers
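With P = 3, for example, producer 2 fills buckets 2, 5, 8, and so on, so no
two producers (or consumers) ever share a partition. A tiny illustrative
helper (mine, not from the original design) makes the stepping rule explicit:

def bucket_sequence(worker_id, stride):
    # Buckets visited by one worker: its ID, then ID + stride,
    # ID + 2*stride, ... where stride = P for producers, C for consumers.
    bucket = worker_id
    while True:
        yield bucket
        bucket += stride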
The last thing to take care of is the compaction configuration, to reduce the
number of SSTables on disk.

If you manage to avoid accumulation effects, i.e. the reading rate is at
least as fast as the writing rate, messages are likely to be consumed while
they are still in memory (in the memtable) on the server side. In that
particular case, you can optimize further by deactivating compaction for the
table.
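As an assumption on my part (the exact syntax depends on your Cassandra
version, so please check the documentation for your release), recent versions
let you switch off automatic minor compaction per table through the
compaction sub-options, along these lines:

ALTER TABLE read_only_queue
  WITH compaction = {'class': 'SizeTieredCompactionStrategy',
                     'enabled': 'false'};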
Regards

Duy Hai
On Sat, Feb 22, 2014 at 5:56 PM, Jagan Ranganathan <jagan@zohocorp.com> wrote:

> Hi,
>
> Thanks for the pointer.
>
> Following are some of the options given there:
>
>   - If you know where your live data begins, hint Cassandra with a start
>     column, to reduce the scan times and the amount of tombstones to
>     collect.
>   - A broker will usually have some notion of what's next in the sequence
>     and thus be able to do much more targeted queries, down to a single
>     record if the storage strategy were to choose monotonic sequence
>     numbers.
>
> So what we need to do is use the system with some intelligence and avoid
> tombstones, either by using the suggested column name or, if a slice query
> is used, a proper start column.
>
> Is that right, or am I missing something here?
>
> Regards,
> Jagan
>
> ---- On Sat, 22 Feb 2014 20:55:39 +0530 DuyHai Doan
> <doanduyhai@gmail.com> wrote ----
>
> Jagan
>
> Queue-like data structures are known to be one of the worst anti-patterns
> for Cassandra:
> http://www.datastax.com/dev/blog/cassandra-anti-patterns-queues-and-queue-like-datasets
>
> On Sat, Feb 22, 2014 at 4:03 PM, Jagan Ranganathan <jagan@zohocorp.com>
> wrote:
>
> Hi,
>
> I need to decouple some of the work being processed from the user thread
> to provide a better user experience. For that I need a queuing system with
> the following needs:
>
>   - High availability
>   - No data loss
>   - Better performance
>
> Following are some libraries that were considered, along with the
> limitations I see:
>
>   - Redis - data loss
>   - ZooKeeper - not advised for a queue system
>   - TokyoCabinet/SQLite/LevelDB - of these, LevelDB seems to perform best;
>     with the replication requirement I would probably have to look at
>     Apache ActiveMQ + LevelDB
>
> After checking on the third option above, I wonder whether Cassandra with
> leveled compaction offers a similar system. Do you see any issues with
> such a usage, or are there better solutions available?
>
> Will be great to get insights on this.
>
> Regards,
> Jagan