hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mark <static.void....@gmail.com>
Subject Re: Generic Schema Question
Date Sun, 14 Aug 2011 00:16:10 GMT
Ok so something like this?

row                                   cf:qual           value
----------------------------------------- data:page      "/foo/bar" data:referrer  "google.com" data:session  "f306e5af69b48568323fdc3018e40e7e"

----------------------------------------- data:page "/foo/baz" data:page "" data:page "f306e5af69b48568323fdc3018e40e7e"

Will this allow me to query the last 100 rows for ip "". If 
so, how? Will it be efficient? Also, would you mind explaining an 
alternative way of accomplishing this as I'm still trying to figure out 
all the possibilities.

Thanks again

On 8/13/11 4:53 PM, Blake Lemoine wrote:
> You need to have the ip address followed by a slash followed by the time as
> the row key.  Or some other such a way of getting multiple rows per ip.
> Then you could scan for the ip prefix.  Of course that's just one possible
> solution.
> On Aug 13, 2011 1:01 PM, "Mark"<static.void.dev@gmail.com>  wrote:
>> Hi all, I'm trying to wrap my head around HBase schema design and I am
>> having trouble modeling the following use case:
>> We store all our use behavior (clicks, searches, page views) in Hadoop
>> and we would like to add this into HBase so we can interactively
>> "explore" what our users are doing. For example we would like, given an
>> IP address get back a list of all searches, page views, clicks etc that
>> this user has attempted.
>> My initial thought for something like this would be to create a table
>> "Logs" with a CF "Data" that have qualifiers of "Search", "Click" and
>> "View". Each column would have a row with the IP as its key.
>> Is this along the right lines or am I missing something... sure feels
>> like I am. Would anyone please explain how I would accomplish what I am
>> looking for.
>> Thanks

View raw message