Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of rajkumar.w93@gmail.com
 designates 209.85.214.44 as permitted sender)
MIME-Version: 1.0
Sender: rajkumar.w93@gmail.com
In-Reply-To: <lhwfunxipl295erem757viyq.1321373406766@email.android.com>
References: <lhwfunxipl295erem757viyq.1321373406766@email.android.com>
Date: Wed, 16 Nov 2011 01:41:54 +0530
Message-ID: 
 <CANGD+iq1eRxPT_eqVPieB=OpHSeZ5GEPEgQ2Z7QP+q3A58GNJw@mail.gmail.com>
Subject: Re: Seeking advice on Schema and Caching
From: Aditya Narayan <adynnn@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=000e0cdfcbc64aa6a104b1cb9952

--000e0cdfcbc64aa6a104b1cb9952
Content-Type: text/plain; charset=ISO-8859-1

Any insights on this ?

On Tue, Nov 15, 2011 at 9:40 PM, Quintero <quinteros8888@gmail.com> wrote:

>
>
> Aditya Narayan <adynnn@gmail.com> wrote:
>
> >Hi
> >
> >I need to add 'search users' functionality to my application. (The trigger
> >for fetching searched items(like google instant search) is made when 3
> >letters have been typed in).
> >
> >For this, I make a CF with String type keys. Each such key is made of
> first
> >3 letters of a user's name.
> >
> >Thus all names starting with 'Mar-' are stored in single row (with
> >key="Mar").
> >The column names are framed as remaining letters of the names. Thus, a
> name
> >'Marcos' will be stored within rowkey "Mar" & col name "cos". The id will
> >be stored as column value. Since there could be many users with same name.
> >Thus I would have multple userIds(of users named "Marcos") to be stored
> >inside columnname "cos" under key "Mar". Thus,
> >
> >1. Supercolumn seems to be a better fit for my use case(so that ids of
> >users with same name may fit as sub-columns inside a super-column) but
> >since supercolumns are not encouraged thus I want to use an alternative
> >schema for this usecase if possible. Could you suggest some ideas on this
> ?
> >
> >2. Another thing, I would like to row cache this CF so that when the user
> >types in the next character & the query is made consequently, then this
> row
> >be retrieved from the cache without touching DB. It is expected while
> >searching for a single username, the query(as a part of making
> >instantaneous suggestions) will be made at least 2-3 times. One may also
> >suggest to fetch all the columns starting with queried string to be
> >retrieved & then filter out at application level but what about just
> >fecthing the exact no of columns(ids/names of users) I need to show to the
> >user. Thus instead of keeping all the hundreds of cols in the application
> >layer what about keeping it within the DB cache.!?
> >The space alloted for the cache will be very small so that row remains in
> >cache for a very short time(enough to serve only for the time duration
> >while user is making a single search!?) ?
>

--000e0cdfcbc64aa6a104b1cb9952
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Any insights on this ?<br><br><div class=3D"gmail_quote">On Tue, Nov 15, 20=
11 at 9:40 PM, Quintero <span dir=3D"ltr">&lt;<a href=3D"mailto:quinteros88=
88@gmail.com">quinteros8888@gmail.com</a>&gt;</span> wrote:<br><blockquote =
class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid=
;padding-left:1ex;">
<div><div></div><div class=3D"h5"><br>
<br>
Aditya Narayan &lt;<a href=3D"mailto:adynnn@gmail.com">adynnn@gmail.com</a>=
&gt; wrote:<br>
<br>
&gt;Hi<br>
&gt;<br>
&gt;I need to add &#39;search users&#39; functionality to my application. (=
The trigger<br>
&gt;for fetching searched items(like google instant search) is made when 3<=
br>
&gt;letters have been typed in).<br>
&gt;<br>
&gt;For this, I make a CF with String type keys. Each such key is made of f=
irst<br>
&gt;3 letters of a user&#39;s name.<br>
&gt;<br>
&gt;Thus all names starting with &#39;Mar-&#39; are stored in single row (w=
ith<br>
&gt;key=3D&quot;Mar&quot;).<br>
&gt;The column names are framed as remaining letters of the names. Thus, a =
name<br>
&gt;&#39;Marcos&#39; will be stored within rowkey &quot;Mar&quot; &amp; col=
 name &quot;cos&quot;. The id will<br>
&gt;be stored as column value. Since there could be many users with same na=
me.<br>
&gt;Thus I would have multple userIds(of users named &quot;Marcos&quot;) to=
 be stored<br>
&gt;inside columnname &quot;cos&quot; under key &quot;Mar&quot;. Thus,<br>
&gt;<br>
&gt;1. Supercolumn seems to be a better fit for my use case(so that ids of<=
br>
&gt;users with same name may fit as sub-columns inside a super-column) but<=
br>
&gt;since supercolumns are not encouraged thus I want to use an alternative=
<br>
&gt;schema for this usecase if possible. Could you suggest some ideas on th=
is ?<br>
&gt;<br>
&gt;2. Another thing, I would like to row cache this CF so that when the us=
er<br>
&gt;types in the next character &amp; the query is made consequently, then =
this row<br>
&gt;be retrieved from the cache without touching DB. It is expected while<b=
r>
&gt;searching for a single username, the query(as a part of making<br>
&gt;instantaneous suggestions) will be made at least 2-3 times. One may als=
o<br>
&gt;suggest to fetch all the columns starting with queried string to be<br>
&gt;retrieved &amp; then filter out at application level but what about jus=
t<br>
&gt;fecthing the exact no of columns(ids/names of users) I need to show to =
the<br>
&gt;user. Thus instead of keeping all the hundreds of cols in the applicati=
on<br>
&gt;layer what about keeping it within the DB cache.!?<br>
&gt;The space alloted for the cache will be very small so that row remains =
in<br>
&gt;cache for a very short time(enough to serve only for the time duration<=
br>
&gt;while user is making a single search!?) ?<br>
</div></div></blockquote></div><br>

--000e0cdfcbc64aa6a104b1cb9952--