Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: neutral (nike.apache.org: local policy)
MIME-Version: 1.0
From: Larry Root <larry@armorgames.com>
Date: Fri, 23 Apr 2010 13:33:14 -0700
Message-ID: <q2vd5c761911004231333wd3abbec4kdf4866302bb82838@mail.gmail.com>
Subject: Trying To Understand get_range_slices Results When Using
	RandomPartitioner
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=000e0cd137be587fb70484ed57ab

--000e0cd137be587fb70484ed57ab
Content-Type: text/plain; charset=ISO-8859-1

I trying to better understand how using the RandomPartitioner will affect my
ability to select ranges of keys. Consider my simple example where we have
many online games across different game genres (GameType). These games need
to store data for each one of their users. With that in mind consider the
following data model:

enum GameType {'RPG', 'FPS', 'ARCADE'}

{
    "GameData": {                         // Super Column Family

        *GameType+"1234"*: {                // Row (concat gametype with a
game id for example)
            *"user-data:5678"*:{            // Super column (user data)
                *"user_prop_name"*: "value",// Subcolumn (arbitrary user
properties and values)
*                "another_prop_name"*: "value",
                 ...
            },
            *"user-data:9012"*:{
                *"**user_prop_name**"*: "value",
                 ...
            }
        },

        * GameType+"3456"*: {...},
        *GameType+"7890"*: {...},
        ...
    }
}

Assume we have a multi node cluster running Cassandra 0.6.1. In that
scenario could some one help me understand what the result would be in the
following cases:

   1. We use a range slice to grab keys for all 'RPG' games (range slice at
   the ROW level). Would we be able to get all games back in a single query or
   would that not be guaranteed?

   2. For a given game we use a range slice to grab all user-data keys in
   which the ID starts with '5' (range slice at the COLUMN level). Again, would
   we be able to get all keys in one call (assuming number of keys in the
   result was not an issue)?

   3. Finally for a given game and a given user we do a range slice to grab
   all user properties that start with 'a' (range slice at the SUBCOLUMN level
   of a SUPERCOLUMN). Is that possible in one call?

I'm trying to understand at what level the RandomPartioner affects my
example data model. Is it at a fixed level like just ROWS (the sub data is
fixed to the same node) or is all data at every level *randomized* across
all nodes.

Are there any tricks to doing these sort of range slices using RP? For
example if I set my consistency level to 'ALL' when doing a range slice
would that effectively compile a complete result set for me?

Thanks for the help!

larry

--000e0cd137be587fb70484ed57ab
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

I trying to better understand how using the RandomPartitioner will affect m=
y ability to select ranges of keys. Consider my simple example where we hav=
e many online games across different game genres (GameType). These games ne=
ed to store data for each one of their users. With that in mind consider th=
e following data model: <br>

<br><font size=3D"2"><span style=3D"font-family: courier new,monospace;">en=
um GameType {&#39;RPG&#39;, &#39;FPS&#39;, &#39;ARCADE&#39;}</span><br styl=
e=3D"font-family: courier new,monospace;"><br style=3D"font-family: courier=
 new,monospace;">

<span style=3D"font-family: courier new,monospace;">{</span><br style=3D"fo=
nt-family: courier new,monospace;"><span style=3D"font-family: courier new,=
monospace;">=A0=A0=A0 &quot;GameData&quot;: {=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 // Super Column Family</span><br=
 style=3D"font-family: courier new,monospace;">

<span style=3D"font-family: courier new,monospace;"><br>=A0=A0=A0 =A0=A0=A0=
 <b>GameType+&quot;1234&quot;</b>: {=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0 // Row (concat gametype with a game id for example)</span><br style=
=3D"font-family: courier new,monospace;"><span style=3D"font-family: courie=
r new,monospace;">=A0=A0=A0 =A0=A0=A0 =A0=A0=A0 <b>&quot;user-data:5678&quo=
t;</b>:{=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 // Super column (user data)</span=
><br style=3D"font-family: courier new,monospace;">

<span style=3D"font-family: courier new,monospace;">=A0=A0=A0 =A0=A0=A0 =A0=
=A0=A0 =A0=A0=A0 <b>&quot;user_prop_name&quot;</b>: &quot;value&quot;,</spa=
n></font><font size=3D"2"><span style=3D"font-family: courier new,monospace=
;">// Subcolumn (arbitrary user properties and values)</span></font><br>

<font size=3D"2"><span style=3D"font-family: courier new,monospace;"><b>=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 &quot;another_prop_name&quot;</b=
>:
 &quot;value&quot;,</span></font><font size=3D"2"><span style=3D"font-famil=
y: courier new,monospace;">=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
 <br>=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 ...<br>=A0=A0=A0 =A0=
=A0=A0 =A0=A0=A0 },</span><br style=3D"font-family: courier new,monospace;"=
><span style=3D"font-family: courier new,monospace;">=A0=A0=A0 =A0=A0=A0 =
=A0=A0=A0 <b>&quot;user-data:9012&quot;</b>:{</span><br style=3D"font-famil=
y: courier new,monospace;">

<span style=3D"font-family: courier new,monospace;">=A0=A0=A0 =A0=A0=A0 =A0=
=A0=A0 =A0=A0=A0 <b>&quot;</b></span></font><b><font size=3D"2"><span style=
=3D"font-family: courier new,monospace;">user_prop_name</span></font></b><f=
ont size=3D"2"><span style=3D"font-family: courier new,monospace;"><b>&quot=
;</b>: &quot;value&quot;,</span><br style=3D"font-family: courier new,monos=
pace;">

<span style=3D"font-family: courier new,monospace;">=A0=A0=A0 =A0=A0=A0 =A0=
=A0=A0 =A0=A0=A0=A0 ...</span><br style=3D"font-family: courier new,monospa=
ce;"><span style=3D"font-family: courier new,monospace;">=A0=A0=A0 =A0=A0=
=A0 =A0=A0=A0 }</span><br style=3D"font-family: courier new,monospace;">

<span style=3D"font-family: courier new,monospace;">=A0=A0=A0 =A0=A0=A0 },<=
br><br style=3D"font-family: courier new,monospace;"></span><span style=3D"=
font-family: courier new,monospace;">=A0=A0=A0 =A0=A0=A0 <b>
GameType+&quot;3456&quot;</b>:
 {...},</span><br style=3D"font-family: courier new,monospace;"><span style=
=3D"font-family: courier new,monospace;">=A0=A0=A0=A0=A0=A0=A0 <b>GameType+=
&quot;7890&quot;</b>:
 {...},</span><br style=3D"font-family: courier new,monospace;"><span style=
=3D"font-family: courier new,monospace;">=A0=A0=A0=A0=A0=A0=A0 ...</span><b=
r style=3D"font-family: courier new,monospace;"><span style=3D"font-family:=
 courier new,monospace;">=A0=A0=A0 }</span><br style=3D"font-family: courie=
r new,monospace;">

<span style=3D"font-family: courier new,monospace;">}<br><br></span></font>=
Assume we have a multi node cluster running Cassandra 0.6.1. In that scenar=
io could some one help me understand what the result would be in the follow=
ing cases:<br>

<ol><li>We use a range slice to grab keys for all &#39;RPG&#39; games (rang=
e slice at the ROW level). Would we be able to get all games back in a sing=
le query or would that not be guaranteed?<br><br></li><li>For a given game =
we use a range slice to grab all user-data keys in which the ID starts with=
 &#39;5&#39; (range slice at the COLUMN level). Again, would we be able to =
get all keys in one call (assuming number of keys in the result was not an =
issue)?<br>

<br></li><li>Finally for a given game and a given user we do a range slice =
to grab all user properties that start with &#39;a&#39; (range slice at the=
 SUBCOLUMN level of a SUPERCOLUMN). Is that possible in one call?</li>
</ol>
I&#39;m trying to understand at what level the RandomPartioner affects my e=
xample data model. Is it at a fixed level like just ROWS (the sub data is f=
ixed to the same node) or is all data at every level *randomized* across al=
l nodes.<br>

<br>Are there any tricks to doing these sort of range slices using RP? For =
example if I set my consistency level to &#39;ALL&#39; when doing a range s=
lice would that effectively compile a complete result set for me?<br><br>

Thanks for the help!<br><br>larry

--000e0cd137be587fb70484ed57ab--