incubator-cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tyler Hobbs <ty...@datastax.com>
Subject Re: Read Performance / Schema Design
Date Thu, 27 Oct 2011 05:01:11 GMT
On Wed, Oct 26, 2011 at 9:35 PM, Ben Gambley <ben.gambley@intoscience.com>wrote:

>
> Hi Everyone
>
> I have a question with regards read performance and schema design if
> someone could help please.
>
>
> Our requirement is to store per user, many unique results (which is
> basically an attempt at some questions ..) so I had thought of having the
> userid as the row key and the result id as columns.
>
> The keys  for the result ids are maintained in a separate location so are
> known without having to perform any additional lookups.
>
> My concern is that over time reading a single result would incur the
> overhead of reading the entire row from disk so gradually slow things down.
>
>
> So I was considering if changing the row key to *userid + result id* would
> be a better solution ?
>
>

Do you regularly need to read all of the results for a given userid?  If
not, go with the user_id + result_id approach. It will be more efficient for
single-result lookups.

-- 
Tyler Hobbs
DataStax <http://datastax.com/>

Mime
View raw message