Subject: Re: Why Cassandra secondary indexes are so slow on just 350k rows?
From: Tyler Hobbs <tyler@datastax.com>
To: user@cassandra.apache.org
Date: Thu, 30 Aug 2012 13:45:08 -0500

pycassa already breaks up the query into smaller chunks, but you should
try playing with the buffer_size kwarg for get_indexed_slices, perhaps
lowering it to ~300, as Aaron suggests:
http://pycassa.github.com/pycassa/api/pycassa/columnfamily.html#pycassa.columnfamily.ColumnFamily.get_indexed_slices
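For example, a minimal sketch (the keyspace, host, and handler names here
are illustrative, not from the thread; only buffer_size is the point):

    from pycassa import ConnectionPool, ColumnFamily
    from pycassa.index import create_index_clause, create_index_expression

    pool = ConnectionPool('MyKeyspace', ['localhost:9160'])  # illustrative
    column_family = ColumnFamily(pool, 'my_column_family')   # illustrative

    is_exported_expr = create_index_expression('is_exported', 'false')
    clause = create_index_clause([is_exported_expr], count=5000)

    # get_indexed_slices() returns a generator and pages through results
    # internally; buffer_size is how many rows it fetches per Thrift call,
    # so each call materialises a few hundred rows instead of thousands.
    rows = column_family.get_indexed_slices(clause, buffer_size=300)
    for row_key, columns in rows:
        handle_row(row_key, columns)  # hypothetical handler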
On Wed, Aug 29, 2012 at 11:40 PM, aaron morton <aaron@thelastpickle.com> wrote:

> *from 12 to 20 seconds (!!!) to find 5000 rows*
>
> More is not always better.
>
> Cassandra must materialise the full 5000 rows and send them all over the
> wire to be materialised on the other side. Try asking for a few hundred
> at a time and see how it goes.
>
> Cheers
>
> -----------------
> Aaron Morton
> Freelance Developer
> @aaronmorton
> http://www.thelastpickle.com
>
> On 29/08/2012, at 6:46 PM, Robin Verlangen <robin@us2.nl> wrote:
>
> @Edward: I think you should consider a queue for exporting the new rows.
> Just store the rowkey in a queue (you might want to look at
> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Distributed-work-queues-td5226248.html )
> and process that row every couple of minutes, then manually delete the
> columns from that queue-row.
>
> With kind regards,
>
> Robin Verlangen
> Software engineer
> W http://www.robinverlangen.nl
> E robin@us2.nl
>
> 2012/8/29 Robin Verlangen <robin@us2.nl>
>
>> "What this means is that eventually you will have 1 row in the
>> secondary index table with 350K columns"
>>
>> Is this really true? I would have expected Cassandra to use internal
>> index sharding/bucketing.
>>
>> With kind regards,
>>
>> Robin Verlangen
>> Software engineer
>>
>> 2012/8/29 Dave Brosius <dbrosius@mebigfatguy.com>
>>
>>> If I understand you correctly, you are only ever querying for the
>>> rows where is_exported = false, and turning them into trues. What
>>> this means is that eventually you will have 1 row in the secondary
>>> index table with 350K columns that you will never look at.
>>>
>>> It seems to me that perhaps you should just keep your own "manual
>>> index" CF that points to non-exported rows, and delete those columns
>>> when they are exported.
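A minimal sketch of the manual-index pattern Dave describes (the
'PendingExport' column family, queue row key, and export callback are
illustrative assumptions, not anything specified in the thread):

    import pycassa
    from pycassa import ConnectionPool, ColumnFamily

    pool = ConnectionPool('MyKeyspace', ['localhost:9160'])  # illustrative
    pending = ColumnFamily(pool, 'PendingExport')
    QUEUE_ROW = 'pending'  # one wide row acting as the queue

    def mark_pending(row_key):
        # At write time, record the new row's key as a column in the
        # queue row (the column value is unused).
        pending.insert(QUEUE_ROW, {row_key: ''})

    def export_pending(export, batch_size=300):
        # Read a batch of pending keys, export them, then delete those
        # columns so the queue row never accumulates exported keys.
        try:
            keys = pending.get(QUEUE_ROW, column_count=batch_size)
        except pycassa.NotFoundException:
            return  # nothing pending
        for row_key in keys:
            export(row_key)
        pending.remove(QUEUE_ROW, columns=list(keys))

A single queue row has the same one-wide-row growth problem as the index,
so in practice you would bucket it (e.g. one queue row per time window),
along the lines of the distributed-work-queues thread Robin links above.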
>>> On 08/28/2012 05:23 PM, Edward Kibardin wrote:
>>>
>>>> I have a column family with a secondary index. The indexed field is
>>>> basically a binary flag, but I'm using a string for it. The field is
>>>> called *is_exported* and can be *'true'* or *'false'*. After each
>>>> request, all loaded rows are updated with *is_exported = 'true'*.
>>>>
>>>> I'm polling this column family every ten minutes and exporting new
>>>> rows as they appear.
>>>>
>>>> But here's the problem: the time for this query grows roughly
>>>> linearly with the amount of data in the column family, and currently
>>>> it takes *from 12 to 20 seconds (!!!) to find 5000 rows*. As I
>>>> understand it, an indexed query should depend not on the number of
>>>> rows in the CF but on the number of rows per index value (the
>>>> cardinality), since the index is just another hidden CF like:
>>>>
>>>>     "true" : rowKey1 rowKey2 rowKey3 ...
>>>>     "false": rowKey1 rowKey2 rowKey3 ...
>>>>
>>>> I'm using pycassa to query the data; here is the code I'm using:
>>>>
>>>>     column_family = pycassa.ColumnFamily(cassandra_pool,
>>>>         column_family_name, read_consistency_level=2)
>>>>     is_exported_expr = create_index_expression('is_exported', 'false')
>>>>     clause = create_index_clause([is_exported_expr], count=5000)
>>>>     column_family.get_indexed_slices(clause)
>>>>
>>>> Am I doing something wrong? I expect this operation to work MUCH
>>>> faster.
>>>>
>>>> Any ideas or suggestions?
>>>>
>>>> Some config info:
>>>>  - Cassandra 1.1.0
>>>>  - RandomPartitioner
>>>>  - 2 nodes with replication_factor = 2 (each server has a full data
>>>>    copy)
>>>>  - AWS EC2 large instances
>>>>  - Software RAID0 on ephemeral drives
>>>>
>>>> Thanks in advance!

--
Tyler Hobbs
DataStax