Mailing-List: contact cassandra-user-help@incubator.apache.org; run by ezmlm
Precedence: bulk
Reply-To: cassandra-user@incubator.apache.org
Received-SPF: pass (athena.apache.org: domain of erikholstad@gmail.com
 designates 209.85.216.174 as permitted sender)
DomainKey-Signature: a=rsa-sha1; c=nofws;
        d=gmail.com; s=gamma;
        h=mime-version:in-reply-to:references:date:message-id:subject:from:to
         :content-type;
        b=bO2tvWzdSUxMBm/7pA7YJ5+nNKs/0pNQta76nuTqZGS1ESAxr5b+xNbW6mjSnUuou1
         dgzA7Mim3t2j0tBv2ZdkaqhuHu9lL921WCHtBolcRpM7NeXdpLeqtttPxDCB9FfueefQ
         STqU4zGOkk08soV1/TgX+jkC4P537Apb5edq8=
MIME-Version: 1.0
In-Reply-To: <a43524d01002021439k40e8de9x9d6935d8b2542f95@mail.gmail.com>
References: <74f4d40b1002020950p66e6f9a1hdea1b5cb4b4e1aa2@mail.gmail.com>
	 <a17e13e71002021141m6d3c04c5nf204f4302e8bca48@mail.gmail.com>
	 <74f4d40b1002021200m5a91b21es3e3a12cd3b80ff0c@mail.gmail.com>
	 <a17e13e71002021430j2ec56dc9ka59d8360b5736a78@mail.gmail.com>
	 <a43524d01002021439k40e8de9x9d6935d8b2542f95@mail.gmail.com>
Date: Tue, 2 Feb 2010 15:02:39 -0800
Message-ID: <74f4d40b1002021502q3b8034dfu97aef578d39fbc74@mail.gmail.com>
Subject: Re: Using column plus value or only column?
From: Erik Holstad <erikholstad@gmail.com>
To: cassandra-user@incubator.apache.org
Content-Type: multipart/alternative; boundary=0016e64cbaec3cf2ac047ea619f5

--0016e64cbaec3cf2ac047ea619f5
Content-Type: text/plain; charset=ISO-8859-1

@Nathan
So what I'm planning to do is to store multiple sort orders for the same
data, where they all use the same data table and just fetch it in different
orders, so to speak. I want to be able to read the different sort orders from
the front and from the back to get both regular and reverse sort order.
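
To make that concrete, here is a minimal in-memory sketch in Python of what I
mean. It is only a model, not Cassandra client code: the names data,
build_sort_row, slice_columns and fetch are made up for illustration, and it
assumes sort keys contain no "_" so the composite column name can be split
back into sort key and row key.

```python
from bisect import bisect_left, bisect_right

# Hypothetical in-memory stand-in for the shared "Data" family.
data = {
    "datarow1": {"col1": "val1"},
    "datarow2": {"col1": "val2"},
    "datarow3": {"col1": "val3"},
}

def build_sort_row(rows, sort_key_of):
    """Return sorted column names of the form '<sortKey>_<rowKey>'."""
    return sorted(f"{sort_key_of(row)}_{key}" for key, row in rows.items())

def slice_columns(columns, start="", count=100, reverse=False):
    """Return up to `count` column names beginning at `start`.

    An empty `start` means the first column, or the last one when reversed.
    """
    if reverse:
        end = bisect_right(columns, start) if start else len(columns)
        return columns[:end][::-1][:count]
    begin = bisect_left(columns, start) if start else 0
    return columns[begin:begin + count]

def fetch(columns):
    """Resolve index columns back to rows in the shared data table."""
    return [data[name.split("_", 1)[1]] for name in columns]

# One sort order over the same data, read from the front and from the back.
by_col1 = build_sort_row(data, lambda row: row["col1"])
front = fetch(slice_columns(by_col1, count=2))
back = fetch(slice_columns(by_col1, count=2, reverse=True))
```

Each additional sort order would be one more sort row built with a different
sort_key_of, while the data table itself is stored only once.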

With your approach using super columns you would need to replicate all data,
right?

And if I understand
http://issues.apache.org/jira/browse/CASSANDRA-598 correctly, you would
need to read the whole thing before you can limit the results handed back
to you.

Regarding the two calls get_slice and get_range_slice, the way I understand
it is that you hand the second one an optional start and stop key plus a
limit, to get a range of keys/rows. I was planning to use this call together
with the OPP, but am thinking about not using it since there is no way to do
an inverse scan, right?
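
A toy Python model of the concern, as I understand the call. This is not the
Thrift signature: range_slice and last_n_forward_only are hypothetical names,
and empty strings standing for "unbounded" is an assumption of the model. It
also assumes unique, non-empty keys and a page size of at least 2.

```python
from bisect import bisect_left, bisect_right

def range_slice(sorted_keys, start="", stop="", limit=100):
    """Forward-only scan: keys in [start, stop], at most `limit` of them."""
    lo = bisect_left(sorted_keys, start) if start else 0
    hi = bisect_right(sorted_keys, stop) if stop else len(sorted_keys)
    return sorted_keys[lo:hi][:limit]

def last_n_forward_only(sorted_keys, n, page=2):
    """Emulate a reverse scan with forward pages only; visits every key."""
    seen = []
    start = ""
    while True:
        batch = range_slice(sorted_keys, start=start, limit=page)
        if start:
            batch = batch[1:]  # start is inclusive, so skip the key already seen
        if not batch:
            break
        seen.extend(batch)
        start = batch[-1]
    return seen[-n:][::-1]

keys = ["a", "b", "c", "d", "e"]
forward = range_slice(keys, start="b", stop="d", limit=2)
newest_two = last_n_forward_only(keys, 2)
```

Getting the last two keys this way walks the entire key range, which is the
cost I would like to avoid.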

Thanks a lot
Erik


On Tue, Feb 2, 2010 at 2:39 PM, Jesse McConnell
<jesse.mcconnell@gmail.com> wrote:

> infinite is a bit of a bold claim....
>
> by my understanding you are bound by the memory of the jvm as all of
> the content of a key/row currently needs to fit in memory for
> compaction, which includes columns and supercolumns for given key/row.
>
> if you are going to run into those scenarios then some sort of
> sharding on the keys is required, afaict
>
> cheers,
> jesse
>
> --
> jesse mcconnell
> jesse.mcconnell@gmail.com
>
>
>
> On Tue, Feb 2, 2010 at 16:30, Nathan McCall <nate@vervewireless.com>
> wrote:
> > Erik,
> > Sure, you could and depending on the workload, that might be quite
> > efficient for small pieces of data. However, this also sounds like
> > something that might be better addressed with the addition of a
> > SuperColumn on "Sorts" and getting rid of "Data" altogether:
> >
> > Sorts : {
> >   sort_row_1 : {
> >        sortKey1 : { col1:val1, col2:val2 },
> >        sortKey2 : { col1:val3, col2:val4 }
> >   }
> > }
> >
> > You can have an infinite number of SuperColumns for a key, but make
> > sure you understand get_slice vs. get_range_slice before you commit to
> > a design. Hopefully I understood your example correctly, if not, do
> > you have anything more concrete?
> >
> > Cheers,
> > -Nate
> >
> >
> > On Tue, Feb 2, 2010 at 12:00 PM, Erik Holstad <erikholstad@gmail.com>
> wrote:
> >> Thanks Nate for the example.
> >>
> >> I was thinking more along the lines of something like:
> >>
> >> If you have a family
> >>
> >> Data : {
> >>   row1 : {
> >>     col1:val1,
> >>   },
> >>   row2 : {
> >>     col1:val2,
> >>     ...
> >>   }
> >> }
> >>
> >>
> >> Using
> >> Sorts : {
> >>   sort_row : {
> >>     sortKey1_datarow1: [],
> >>     sortKey2_datarow2: []
> >>   }
> >> }
> >>
> >> Instead of
> >> Sorts : {
> >>   sort_row : {
> >>     sortKey1: datarow1,
> >>     sortKey2: datarow2
> >>   }
> >> }
> >>
> >> If that makes any sense?
> >>
> >> --
> >> Regards Erik
> >>
> >
>


-- 
Regards Erik

--0016e64cbaec3cf2ac047ea619f5--