Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of stuhood@gmail.com designates
 74.125.82.172 as permitted sender)
DomainKey-Signature: a=rsa-sha1; c=nofws;
        d=gmail.com; s=gamma;
        h=mime-version:in-reply-to:references:date:message-id:subject:from:to
         :content-type;
        b=tixgmoUf8+G87inwygcdK1JFqxTPLC9VZdUhw7gm9Sktou/TFg2fgKsC/BJwI4XYks
         cr8ROnqZT9I82CI7H2fTD6ADSc3xyjX7OsCfs+5DjWPiCoo74s5Kz+hxLVcM0iYCPCH0
         uyOiM6lIywxhxNwS7F1fitdSBZWY9YBxP/qX8=
MIME-Version: 1.0
In-Reply-To: 
 <59640.150.140.193.14.1297247703.squirrel@webmail.ceid.upatras.gr>
References: <ae8691d2-6de2-ce08-8a3b-f2cf194cb046@me.com>
	<59640.150.140.193.14.1297247703.squirrel@webmail.ceid.upatras.gr>
Date: Wed, 9 Feb 2011 03:09:44 -0800
Message-ID: <AANLkTim5-4V+4xmYZOhTnwQQ6YPkgiKPMkEp5yNKH7Zq@mail.gmail.com>
Subject: Re: How do secondary indices work
From: Stu Hood <stuhood@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=0016e65c88729c8407049bd780f0

--0016e65c88729c8407049bd780f0
Content-Type: text/plain; charset=ISO-8859-1

Alexander:

The secondary indexes in 0.7.0 (type KEYS) are stored internally in a column
family, and are kept synchronized with the base data via locking on a local
node, meaning they are always consistent on the local node. Eventual
consistency still applies between nodes, but a returned result will always
match your query.

This index column family stores a mapping from index values to a sorted list
of matching row keys. When you query for rows between x and y matching a
value z (via the get_indexed_slices call), Cassandra performs a lookup to
the index column family for the slice of columns in row z between x and y.
If any matches are found in the index, they are row keys that match the
index clause, and we query the base data to return you those rows.

Iterating through all of the rows matching an index clause on your cluster
is guaranteed to touch N/RF of the nodes in your cluster, because each node
only knows about data that is indexed locally.

Some portions of the indexing implementation are not fully baked yet: for
instance, although the API allows you to specify multiple columns, only one
index will actually be used per query, and the rest of the clauses will be
brute forced.

A second secondary index implementation has been on the back burner for a
while: it provides an identical API, but does not use a column family to
store the index, and should be more efficient for append only data. See
https://issues.apache.org/jira/browse/CASSANDRA-1472

Thanks,
Stu

On Wed, Feb 9, 2011 at 2:35 AM, <altanis@ceid.upatras.gr> wrote:

> Thank you for the links, I did read a bit in the comments of the ticket,
> but I couldn't get much out of it.
>
> I am mainly interested in how the index is stored and partitioned, not how
> it is used. I think the people in the dev list will probably be better
> qualified to answer that. My questions always seem to get moved to the
> user list, and usually with good cause, but I think this time it should be
> in the dev list :) Please move it back, if you can.
>
> Alexander
>
> > AFAIK this was the ticket the original work was done under
> > https://issues.apache.org/jira/browse/CASSANDRA-1415
> >
> > also  http://www.datastax.com/docs/0.7/data_model/secondary_indexes
> > and  http://pycassa.githubcom/pycassa/tutorial.html#indexes may help
> >
> > (sorry on reflection the email prob did not need to be moved from dev, my
> > bad)
> > Aaron
> >
> > On 09 Feb, 2011,at 09:16 AM, Aaron Morton <aaron@thelastpickle.com>
> wrote:
> >
> > Moving to the user group.
> >
> >
> >
> > On 08 Feb, 2011,at 11:39 PM, altanis@ceid.upatras.gr wrote:
> >
> > Hello,
> >
> > I'd like some information about how secondary indices work under the
> hood.
> >
> > 1) Is data stored in some external data structure, or is it stored in an
> > actual Cassandra table, as columns within column families?
> > 2) Is data stored sorted or not? How is it partitioned?
> > 3) How can I access index data?
> >
> > Thanks in a advance,
> >
> > Alexander Altanis
> >
>

--0016e65c88729c8407049bd780f0
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Alexander:<div><br></div><div>The secondary indexes in 0.7.0=A0(type KEYS)=
=A0are stored internally in a column family, and are kept synchronized with=
 the base data via locking on a local node, meaning they are always consist=
ent on the local node. Eventual consistency still applies between nodes, bu=
t a returned result will always match your query.</div>
<div><br></div><div>This index column family stores a mapping from index va=
lues to a sorted list of matching row keys. When you query for rows between=
 x and y matching a value z (via the get_indexed_slices call), Cassandra pe=
rforms a lookup to the index column family for the slice of columns in row =
z between x and y. If any matches are found in the index, they are row keys=
 that match the index clause, and we query the base data to return you thos=
e rows.</div>
<meta charset=3D"utf-8"><div><br></div><div>Iterating through all of the ro=
ws matching an index clause on your cluster is guaranteed to touch N/RF of =
the nodes in your cluster, because each node only knows about data that is =
indexed locally.</div>
<div><br></div><div>Some portions of the indexing implementation are not fu=
lly baked yet: for instance, although the API allows you to specify multipl=
e columns, only one index will actually be used per query, and the rest of =
the clauses will be brute forced.</div>
<div><br></div><div>A second secondary index implementation has been on the=
 back burner for a while: it provides an identical API, but does not use a =
column family to store the index, and should be more efficient for append o=
nly data. See=A0<a href=3D"https://issues.apache.org/jira/browse/CASSANDRA-=
1472">https://issues.apache.org/jira/browse/CASSANDRA-1472</a></div>
<div><br></div><div>Thanks,</div><div>Stu<br><br><div class=3D"gmail_quote"=
>On Wed, Feb 9, 2011 at 2:35 AM,  <span dir=3D"ltr">&lt;<a href=3D"mailto:a=
ltanis@ceid.upatras.gr">altanis@ceid.upatras.gr</a>&gt;</span> wrote:<br><b=
lockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px =
#ccc solid;padding-left:1ex;">
Thank you for the links, I did read a bit in the comments of the ticket,<br=
>
but I couldn&#39;t get much out of it.<br>
<br>
I am mainly interested in how the index is stored and partitioned, not how<=
br>
it is used. I think the people in the dev list will probably be better<br>
qualified to answer that. My questions always seem to get moved to the<br>
user list, and usually with good cause, but I think this time it should be<=
br>
in the dev list :) Please move it back, if you can.<br>
<br>
Alexander<br>
<div class=3D"im"><br>
&gt; AFAIK this was the ticket the original work was done under=A0<br>
&gt; <a href=3D"https://issues.apache.org/jira/browse/CASSANDRA-1415" targe=
t=3D"_blank">https://issues.apache.org/jira/browse/CASSANDRA-1415</a><br>
&gt;<br>
&gt; also =A0<a href=3D"http://www.datastax.com/docs/0.7/data_model/seconda=
ry_indexes" target=3D"_blank">http://www.datastax.com/docs/0.7/data_model/s=
econdary_indexes</a><br>
</div>&gt; and =A0<a href=3D"http://pycassa.githubcom/pycassa/tutorial.html=
#indexes" target=3D"_blank">http://pycassa.githubcom/pycassa/tutorial.html#=
indexes</a>=A0may help<br>
<div><div></div><div class=3D"h5">&gt;<br>
&gt; (sorry on reflection the email prob did not need to be moved from dev,=
 my<br>
&gt; bad)<br>
&gt; Aaron<br>
&gt;<br>
&gt; On 09 Feb, 2011,at 09:16 AM, Aaron Morton &lt;<a href=3D"mailto:aaron@=
thelastpickle.com">aaron@thelastpickle.com</a>&gt; wrote:<br>
&gt;<br>
&gt; Moving to the user group.<br>
&gt;<br>
&gt;<br>
&gt;<br>
&gt; On 08 Feb, 2011,at 11:39 PM, <a href=3D"mailto:altanis@ceid.upatras.gr=
">altanis@ceid.upatras.gr</a> wrote:<br>
&gt;<br>
&gt; Hello,<br>
&gt;<br>
&gt; I&#39;d like some information about how secondary indices work under t=
he hood.<br>
&gt;<br>
&gt; 1) Is data stored in some external data structure, or is it stored in =
an<br>
&gt; actual Cassandra table, as columns within column families?<br>
&gt; 2) Is data stored sorted or not? How is it partitioned?<br>
&gt; 3) How can I access index data?<br>
&gt;<br>
&gt; Thanks in a advance,<br>
&gt;<br>
&gt; Alexander Altanis<br>
&gt;<br>
</div></div></blockquote></div><br></div>

--0016e65c88729c8407049bd780f0--