Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of michael.laing@nytimes.com
 designates 209.85.216.182 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CAAX2xq6sG-vg1JBheyXiaESC87aSVSnz3Jse2gdZyZF2nChSAg@mail.gmail.com>
References: 
 <CAAX2xq7NnUFg6P12JYUKPsZT8KVHb=tH5DJ675DAb68Vi0YYYA@mail.gmail.com>
	<CACUnPaA=ekr6yaY=2xL1ZmzDnJFU3R43VyZ8_iMaW=CTsW_a2g@mail.gmail.com>
	<CAAX2xq7nju82SLPsNvdCXcbF8fGsuxN5hsL47krkvjPAXrybBQ@mail.gmail.com>
	<CACUnPaDnHvogW0sCbHkMm8TMwdQDLz1peNA7sB7-BURUVuFFrw@mail.gmail.com>
	<CAAX2xq7+GO23DRaN7foktpRBsU4Apa4JD+AZuXmAVv__DPLy+w@mail.gmail.com>
	<CACUnPaCyUZm2mJqOX=omb0o9QsDkpHSb6J3i0=Hxtv_PR654tw@mail.gmail.com>
	<CAAX2xq6sG-vg1JBheyXiaESC87aSVSnz3Jse2gdZyZF2nChSAg@mail.gmail.com>
Date: Fri, 20 Jun 2014 06:59:39 -0400
Message-ID: 
 <CAKgmDnH+cGJTqprhG+=VRfep4DyVbE=ATFDNOe4+53ajYnZYJA@mail.gmail.com>
Subject: Re: Best way to do a multi_get using CQL
From: "Laing, Michael" <michael.laing@nytimes.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=001a11c15750d91db104fc426351

--001a11c15750d91db104fc426351
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

However, my extensive benchmarking this week of the Python driver from
master shows a performance *decrease* when using 'token_aware'.

This is on a 12-node, 2-datacenter, RF-3 cluster in AWS.

Also, why do the work the coordinator will do for you: send all the queries,
wait for everything to come back in whatever order, and sort the results?
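
To make that concrete, here is a rough sketch of the client-side work (send
everything, wait, merge, sort), written against the entity_lookup table from
earlier in the thread. It is only an illustration: lookup_entity_ids is a
made-up helper name, and `session` is assumed to behave like the Python
driver's Session, where execute_async returns a future with a result() method.

```python
def lookup_entity_ids(session, identifiers):
    """Fan out one equality query per (name, value) pair and merge the rows.

    Assumption: `session.execute_async(query, params)` returns a future whose
    `result()` yields rows exposing an `entity_id` attribute.
    """
    query = "SELECT entity_id FROM entity_lookup WHERE name=%s AND value=%s"

    # Step 1: send all the queries without waiting on any of them.
    futures = [
        session.execute_async(query, (name, value))
        for name, values in identifiers.items()
        for value in values
    ]

    # Step 2: wait for everything to come back, in whatever order.
    entity_ids = set()
    for future in futures:
        for row in future.result():
            entity_ids.add(row.entity_id)

    # Step 3: sort the merged result so callers see a stable order.
    return sorted(entity_ids)
```

With a single IN query the coordinator does the same three steps on the
server side, which is the trade-off being weighed here.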

I would rather keep my app code simple.

But the real point is that you should benchmark in your own environment.
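
If it helps, a minimal timing harness along these lines is enough to compare
strategies. This is just a sketch: benchmark and compare are hypothetical
helper names, and each strategy is assumed to be a zero-argument callable that
runs your real workload once (for example, one that queries through a
token-aware session and one that does not).

```python
import statistics
import time


def benchmark(label, run_once, repeats=20, warmup=3):
    """Time `run_once` and return (label, median seconds, worst seconds)."""
    for _ in range(warmup):
        run_once()  # warm up connections and caches before measuring
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        run_once()
        samples.append(time.perf_counter() - start)
    return label, statistics.median(samples), max(samples)


def compare(strategies, repeats=20):
    """Benchmark every strategy; return results sorted fastest-first by median."""
    results = [benchmark(label, fn, repeats) for label, fn in strategies.items()]
    return sorted(results, key=lambda r: r[1])
```

Reporting the median and the worst case rather than a single run makes it
harder for one slow request to decide the outcome.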

ml


On Fri, Jun 20, 2014 at 3:29 AM, Marcelo Elias Del Valle <
marcelo@s1mbi0se.com.br> wrote:

> Yes, I am using the DataStax CQL drivers.
> It was good advice, thanks a lot, Jonathan.
> []s
>
>
> 2014-06-20 0:28 GMT-03:00 Jonathan Haddad <jon@jonhaddad.com>:
>
>> The only case in which it might be better to use an IN clause is if
>> the entire query can be satisfied from that machine.  Otherwise, go
>> async.
>>
>> The native driver reuses connections and intelligently manages the
>> pool for you.  It can also multiplex queries over a single connection.
>>
>> I am assuming you're using one of the datastax drivers for CQL, btw.
>>
>> Jon
>>
>> On Thu, Jun 19, 2014 at 7:37 PM, Marcelo Elias Del Valle
>> <marcelo@s1mbi0se.com.br> wrote:
>> > This is interesting, I didn't know that!
>> > It might make sense then to use select = + async + token aware, I will
>> > try to change my code.
>> >
>> > But would it be a "recommended solution" for these cases? Any other
>> > options?
>> >
>> > I still wonder if this is the right use case for Cassandra, to look for
>> > random keys in a huge cluster. After all, the number of connections to
>> > Cassandra will still be huge, right? Wouldn't that be a problem?
>> > Or does the driver reuse the connection when you use async?
>> >
>> > []s
>> >
>> >
>> > 2014-06-19 22:16 GMT-03:00 Jonathan Haddad <jon@jonhaddad.com>:
>> >
>> >> If you use async and your driver is token aware, it will go to the
>> >> proper node, rather than requiring the coordinator to do so.
>> >>
>> >> Realistically you're going to have a connection open to every server
>> >> anyways.  It's the difference between you querying for the data
>> >> directly and using a coordinator as a proxy.  It's faster to just ask
>> >> the node with the data.
>> >>
>> >> On Thu, Jun 19, 2014 at 6:11 PM, Marcelo Elias Del Valle
>> >> <marcelo@s1mbi0se.com.br> wrote:
>> >> > But wouldn't using async queries be even worse than using SELECT IN?
>> >> > The justification in the docs is that I could query many nodes, but I
>> >> > would still do that.
>> >> >
>> >> > Today, I use both async queries AND SELECT IN:
>> >> >
>> >> > SELECT_ENTITY_LOOKUP = ("SELECT entity_id FROM " + ENTITY_LOOKUP +
>> >> >                         " WHERE name=%s and value in(%s)")
>> >> >
>> >> > for name, values in identifiers.items():
>> >> >    query = self.SELECT_ENTITY_LOOKUP % ('%s', ','.join(['%s']*len(values)))
>> >> >    args = [name] + values
>> >> >    query_msg = query % tuple(args)
>> >> >    futures.append((query_msg, self.session.execute_async(query, args)))
>> >> >
>> >> > for query_msg, future in futures:
>> >> >    try:
>> >> >       rows = future.result(timeout=100000)
>> >> >       for row in rows:
>> >> >         entity_ids.add(row.entity_id)
>> >> >    except:
>> >> >       logging.error("Query '%s' returned ERROR " % (query_msg))
>> >> >       raise
>> >> >
>> >> > Using async just with select = would mean that instead of 1 async query
>> >> > (example: in (0, 1, 2)), I would do several, one for each value of the
>> >> > "values" array above.
>> >> > In my head, this would mean more connections to Cassandra and the same
>> >> > amount of work, right? What would be the advantage?
>> >> >
>> >> > []s
>> >> >
>> >> >
>> >> >
>> >> >
>> >> > 2014-06-19 22:01 GMT-03:00 Jonathan Haddad <jon@jonhaddad.com>:
>> >> >
>> >> >> Your other option is to fire off async queries.  It's pretty
>> >> >> straightforward w/ the java or python drivers.
>> >> >>
>> >> >> On Thu, Jun 19, 2014 at 5:56 PM, Marcelo Elias Del Valle
>> >> >> <marcelo@s1mbi0se.com.br> wrote:
>> >> >> > I was taking a look at Cassandra anti-patterns list:
>> >> >> >
>> >> >> >
>> >> >> > http://www.datastax.com/documentation/cassandra/2.0/cassandra/architecture/architecturePlanningAntiPatterns_c.html
>> >> >> >
>> >> >> > Among them is:
>> >> >> >
>> >> >> > "SELECT ... IN or index lookups
>> >> >> >
>> >> >> > SELECT ... IN and index lookups (formerly secondary indexes) should be
>> >> >> > avoided except for specific scenarios. See When not to use IN in SELECT
>> >> >> > and When not to use an index in Indexing in CQL for Cassandra 2.0"
>> >> >> >
>> >> >> > And looking at the SELECT doc, I saw:
>> >> >> >
>> >> >> > "When not to use IN
>> >> >> >
>> >> >> > The recommendations about when not to use an index apply to using IN in
>> >> >> > the WHERE clause. Under most conditions, using IN in the WHERE clause is
>> >> >> > not recommended. Using IN can degrade performance because usually many
>> >> >> > nodes must be queried. For example, in a single, local data center
>> >> >> > cluster having 30 nodes, a replication factor of 3, and a consistency
>> >> >> > level of LOCAL_QUORUM, a single key query goes out to two nodes, but if
>> >> >> > the query uses the IN condition, the number of nodes being queried are
>> >> >> > most likely even higher, up to 20 nodes depending on where the keys fall
>> >> >> > in the token range."
>> >> >> >
>> >> >> > In my system, I have a column family called "entity_lookup":
>> >> >> >
>> >> >> > CREATE KEYSPACE IF NOT EXISTS Identification1
>> >> >> >   WITH REPLICATION = { 'class' : 'NetworkTopologyStrategy',
>> >> >> >   'DC1' : 3 };
>> >> >> > USE Identification1;
>> >> >> >
>> >> >> > CREATE TABLE IF NOT EXISTS entity_lookup (
>> >> >> >   name varchar,
>> >> >> >   value varchar,
>> >> >> >   entity_id uuid,
>> >> >> >   PRIMARY KEY ((name, value), entity_id));
>> >> >> >
>> >> >> > And I use the following select to query it:
>> >> >> >
>> >> >> > SELECT entity_id FROM entity_lookup WHERE name=%s and value in(%s)
>> >> >> >
>> >> >> > Is this an anti-pattern?
>> >> >> >
>> >> >> > If not using SELECT IN, which other way would you recommend for lookups
>> >> >> > like that? I have several values I would like to search for in Cassandra
>> >> >> > and they might not be in the same partition, as above.
>> >> >> >
>> >> >> > Is Cassandra the wrong tool for lookups like that?
>> >> >> >
>> >> >> > Best regards,
>> >> >> > Marcelo Valle.
>> >> >> >
>> >> >>
>> >> >>
>> >> >>
>> >> >> --
>> >> >> Jon Haddad
>> >> >> http://www.rustyrazorblade.com
>> >> >> skype: rustyrazorblade
>> >> >
>> >> >
>> >>
>> >
>> >
>>
>>
>
>

--001a11c15750d91db104fc426351--