incubator-cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sasha Dolgy <sdo...@gmail.com>
Subject design that mimics twitter tweet search
Date Sun, 18 Mar 2012 14:54:30 GMT
Hi All,

With twitter, when I search for words like:  "cassandra is the bestest", 4
tweets will appear, including one i just did.  My understand that the
internals of twitter work in that each word in a tweet is allocated,
irrespective of the presence of a  # hash tag, and the tweet id is assigned
to a row for that word.  What is puzzling to me, and hopeful that some
smart people on here can shed some light on -- is how would this work with
Cassandra?

row [ cassandra ]: key -> tweetid  / timestamp
row [ bestest ]: key -> tweetid / timestamp

I had thought that I could simply pull a list of all column names from each
row (representing each word) and flag all occurrences (tweet id's) that
exist in each row ... however, these rows would get quite long over time.

Am I missing an easier way to get a list of all "tweetid's" that exist in
multiple rows?

-- 
Sasha Dolgy
sasha.dolgy@gmail.com

Mime
View raw message