hbase-user mailing list archives

From Doug Meil <doug.m...@explorysmedical.com>
Subject Re: Generic Schema Question
Date Sun, 14 Aug 2011 01:19:50 GMT

See this section in the HBase book:

11.6.3. Close ResultScanners


It has a snippet showing how to use a Scan, which is what you'd want for that.


I just realized that there should be a better Scan example in the Data
Model chapter.  I'll add it.
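
In the meantime, here is a rough sketch of why the ip + "/" + timestamp row
key from this thread works with a prefix scan. Rows are kept sorted by row
key, so all rows for one ip sit in one contiguous range. The sketch below is
not the HBase client API: it uses a java.util.TreeMap as a stand-in for the
sorted table, and the names PrefixScanSketch, stopKeyFor and lastRows are
made up for illustration. It also assumes plain ASCII keys and timestamps of
equal width (10-digit epoch seconds), so that string order matches time order.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.NavigableMap;
import java.util.TreeMap;

class PrefixScanSketch {
    // Stand-in for the table: row key -> value, kept in sorted key order.
    static final NavigableMap<String, String> TABLE = new TreeMap<>();

    // Smallest key that sorts after every key starting with the prefix:
    // the prefix with its last character incremented by one.
    static String stopKeyFor(String prefix) {
        char last = prefix.charAt(prefix.length() - 1);
        return prefix.substring(0, prefix.length() - 1) + (char) (last + 1);
    }

    // Up to `limit` row keys for `ip`, newest first.
    static List<String> lastRows(String ip, int limit) {
        // Include the "/" so that 192.168.1.2 does not also match 192.168.1.20.
        String start = ip + "/";
        String stop = stopKeyFor(start);
        List<String> out = new ArrayList<>();
        // Walk the [start, stop) range from the highest key downwards.
        for (String key : TABLE.subMap(start, true, stop, false).descendingKeySet()) {
            if (out.size() == limit) {
                break;
            }
            out.add(key);
        }
        return out;
    }

    public static void main(String[] args) {
        TABLE.put("192.168.1.2/1313280451", "/foo/bar");
        TABLE.put("192.168.1.2/1313281242", "/foo/baz");
        TABLE.put("192.168.1.20/1313280000", "/other");
        TABLE.put("192.168.1.1/1313280000", "/other");
        // prints [192.168.1.2/1313281242, 192.168.1.2/1313280451]
        System.out.println(lastRows("192.168.1.2", 100));
    }
}
```

With a real table the same start and stop keys would be handed to a Scan.
Note that this sketch walks the range backwards to get the newest rows; a
forward-only scan over ascending timestamps would reach the oldest rows for
the ip first.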

Doug Meil
Chief Software Architect, Explorys
doug.meil@explorys.com

On 8/13/11 8:16 PM, "Mark" <static.void.dev@gmail.com> wrote:

>Ok so something like this?
>
>row                     cf:qual        value
>-----------------------------------------
>192.168.1.2/1313280451  data:page      "/foo/bar"
>192.168.1.2/1313280451  data:referrer  "google.com"
>192.168.1.2/1313280451  data:session   "f306e5af69b48568323fdc3018e40e7e"
>
>-----------------------------------------
>192.168.1.2/1313281242  data:page      "/foo/baz"
>192.168.1.2/1313281242  data:referrer  ""
>192.168.1.2/1313281242  data:session   "f306e5af69b48568323fdc3018e40e7e"
>....
>
>Will this allow me to query the last 100 rows for ip "192.168.1.2"? If
>so, how? Will it be efficient? Also, would you mind explaining an
>alternative way of accomplishing this as I'm still trying to figure out
>all the possibilities.
>
>Thanks again
>
>
>On 8/13/11 4:53 PM, Blake Lemoine wrote:
>> You need to have the ip address followed by a slash followed by the time
>> as the row key, or some other way of getting multiple rows per ip.
>> Then you could scan for the ip prefix.  Of course that's just one possible
>> solution.
>> On Aug 13, 2011 1:01 PM, "Mark"<static.void.dev@gmail.com>  wrote:
>>> Hi all, I'm trying to wrap my head around HBase schema design and I am
>>> having trouble modeling the following use case:
>>>
>>> We store all our user behavior (clicks, searches, page views) in Hadoop
>>> and we would like to add this into HBase so we can interactively
>>> "explore" what our users are doing. For example we would like, given an
>>> IP address get back a list of all searches, page views, clicks etc that
>>> this user has attempted.
>>>
>>> My initial thought for something like this would be to create a table
>>> "Logs" with a CF "Data" that have qualifiers of "Search", "Click" and
>>> "View". Each column would have a row with the IP as its key.
>>>
>>> Is this along the right lines or am I missing something... sure feels
>>> like I am. Would anyone please explain how I would accomplish what I am
>>> looking for.
>>>
>>> Thanks

