Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of mishra.vivs@gmail.com
 designates 209.85.210.44 as permitted sender)
MIME-Version: 1.0
In-Reply-To: <8845493D-8CAF-4DA9-A0B9-B7C96797D2E2@thelastpickle.com>
References: <E1Sa3EP-0002xW-Qh.nrejepov-mail-ru@f83.mail.ru>
	<8845493D-8CAF-4DA9-A0B9-B7C96797D2E2@thelastpickle.com>
Date: Fri, 1 Jun 2012 16:50:26 +0530
Message-ID: 
 <CANJo1uD-RKRK9EbuEM8X9sHyDaqKnpN0onZjtEzv8ZQUuQroVg@mail.gmail.com>
Subject: Re: How can we use composite indexes and secondary indexes together
From: Vivek Mishra <mishra.vivs@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=047d7b33d92402a3d904c1675fed

--047d7b33d92402a3d904c1675fed
Content-Type: text/plain; charset=ISO-8859-1

Have a look at Kundera (https://github.com/impetus-opensource/Kundera). It
offers some support for this (using Lucene) and lets you handle
associations the JPA way.

-Vivek

On Fri, Jun 1, 2012 at 6:54 AM, aaron morton <aaron@thelastpickle.com> wrote:

> If you want to do arbitrary complex online / realtime queries, look at
> DataStax Enterprise, https://github.com/tjake/Solandra, or straight Solr.
>
> Alternatively, denormalise the model to materialise the results when you
> insert, so your query is a straight lookup. Or do some client-side
> filtering / aggregation.
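
A rough Python sketch of the denormalise-then-filter approach described
above, in case it helps. It uses a plain dict as a stand-in for a column
family keyed by the mandatory fields, so no Cassandra client calls are
shown; the names users_by_name_age, insert_user and find_users are made up
for illustration.

```python
from collections import defaultdict

# Stand-in for a column family whose row key is the mandatory fields.
# A real setup would write to Cassandra here; this dict is an assumption.
users_by_name_age = defaultdict(list)


def insert_user(user):
    """Materialise the lookup at insert time, keyed by the mandatory fields."""
    key = (user["Firstname"], user["Lastname"], user["Age"])
    users_by_name_age[key].append(user)


def find_users(firstname, lastname, age, country=None, child_count=None):
    """Straight lookup on the mandatory fields, then client-side filtering."""
    matches = users_by_name_age.get((firstname, lastname, age), [])
    return [
        u for u in matches
        if (country is None or u["Country"] == country)
        and (child_count is None or u["ChildCount"] == child_count)
    ]


# Hypothetical sample rows using the columns from the Users table.
insert_user({"RandomId": 1, "Firstname": "Ann", "Lastname": "Lee",
             "Age": 30, "Country": 7, "ChildCount": 2})
insert_user({"RandomId": 2, "Firstname": "Ann", "Lastname": "Lee",
             "Age": 30, "Country": 9, "ChildCount": 0})
insert_user({"RandomId": 3, "Firstname": "Bob", "Lastname": "Ray",
             "Age": 41, "Country": 7, "ChildCount": 2})

# Mandatory fields only, then with one optional filter.
print([u["RandomId"] for u in find_users("Ann", "Lee", 30)])
print([u["RandomId"] for u in find_users("Ann", "Lee", 30, country=7)])
```

The optional criteria are applied only when they are supplied, so the same
lookup serves both the mandatory-only and the extended searches.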
>
> If you want to do the queries offline, you can use Pig or Hive with Hadoop
> over Cassandra. The Apache Cassandra distro includes the Pig support, Hive
> is coming (I think), and there are Hadoop interfaces. You can also look at
> DataStax Enterprise.
>
>
> Cheers
>
> -----------------
> Aaron Morton
> Freelance Developer
> @aaronmorton
> http://www.thelastpickle.com
>
> On 31/05/2012, at 11:07 PM, Nury Redjepow wrote:
>
> We want to use Cassandra to store complex data, but we can't figure out
> how to organize the indexes.
>
> Our table (column family) looks like this:
>
> Users = { RandomId int, Firstname varchar, Lastname varchar, Age int,
> Country int, ChildCount int }
>
> In our queries we have mandatory fields (Firstname,Lastname,Age) and extra
> search options (Country,ChildCount). How do we organize the indexes to make
> these kinds of queries fast?
>
> At first I thought it would be natural to create a composite index on
> (Firstname,Lastname,Age) and add separate secondary indexes on the
> remaining fields (Country and ChildCount). But after creating the secondary
> indexes I can't insert rows into the table, and I can't query it either.
>
> I'm using Cassandra 1.1.0 and cqlsh with the --cql3 option.
>
> Any other suggestions for solving our problem (complex queries with
> mandatory and additional options) are welcome.
> The main point is: how can we join data in Cassandra? If I make a few index
> column families, do I need to intersect the values to get the rows that
> pass all search criteria? Or should I use something based on Hadoop (Pig,
> Hive) to make such queries?
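
On the intersection question: if you do keep one index column family per
searchable column, the client-side step is a set intersection of the row
keys each index returns. A minimal Python sketch, assuming each index is
modelled as a dict from value to a set of row keys (the data and the names
build_index and matching_keys are hypothetical, and no Cassandra calls are
shown):

```python
def build_index(users, column):
    """Map each value of `column` to the set of row keys holding that value."""
    index = {}
    for row_key, user in users.items():
        index.setdefault(user[column], set()).add(row_key)
    return index


def matching_keys(indexes, criteria):
    """Intersect the key sets returned by each index named in `criteria`."""
    key_sets = [indexes[col].get(value, set()) for col, value in criteria.items()]
    if not key_sets:
        return set()
    return set.intersection(*key_sets)


# Hypothetical rows keyed by RandomId.
users = {
    1: {"Country": 7, "ChildCount": 2},
    2: {"Country": 7, "ChildCount": 0},
    3: {"Country": 9, "ChildCount": 2},
}
indexes = {col: build_index(users, col) for col in ("Country", "ChildCount")}

# Row keys that satisfy both criteria at once.
print(matching_keys(indexes, {"Country": 7, "ChildCount": 2}))
```

Intersecting on the client means fetching every key from each index first,
so it only works well when the individual result sets are small.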
>
> Respectfully, Nury

--047d7b33d92402a3d904c1675fed--