hive-user mailing list archives

From Cam Bazz <>
Subject counting impressions strategy
Date Tue, 01 Mar 2011 12:33:26 GMT

I would like to count impressions per item. To do this, I wrote a
logger: whenever a user opens a category or search page and some items
are listed, I log a line such as:

CATPAGE   CAT1    1,2,3,4,5
CATPAGE   CAT2    6,7,8,9,10
SEARCH     keyword 1,6

Basically, I am logging all the displayed items as a comma-separated list.

I need to calculate and store daily impressions from this, for example
for items 1 and 6:

1, 2
6, 2

(the first number is the item sid, the second is the number of
impressions, totalled across the different impression types)
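
To make the target concrete, here is a minimal sketch of the aggregation
outside Hive, in plain Python. It is not a Hive query. It assumes the
fields are separated by whitespace and that the item list is always the
last field; the function name count_impressions is only illustrative.

```python
from collections import Counter

# The three sample log lines from above.
LOG_LINES = [
    "CATPAGE   CAT1    1,2,3,4,5",
    "CATPAGE   CAT2    6,7,8,9,10",
    "SEARCH     keyword 1,6",
]

def count_impressions(lines):
    """Return a Counter mapping item sid -> impressions over all event types."""
    counts = Counter()
    for line in lines:
        # Assumption: the comma-separated item list is the last
        # whitespace-delimited field on the line.
        parts = line.rsplit(None, 1)
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        items = parts[1]
        # Each listed sid counts as one impression for that item.
        counts.update(sid for sid in items.split(",") if sid)
    return counts

counts = count_impressions(LOG_LINES)
for sid in ("1", "6"):
    print(sid, counts[sid])
```

Items 1 and 6 appear on two lines each, so they get 2 impressions; every
other item in the sample appears once.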

Now I have a couple of questions:

First question: considering that the system will produce at least one
line per item per day, what kind of table should I store this in?
Previously, I have been using text files for everything; I never needed
to query Hive, only to export results from it. Now I will probably need
to run queries like "select * from myimpression where sid = xx", giving
me a timeline of impressions per item.
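
The lookup I have in mind can be sketched as follows, with an in-memory
SQLite database standing in for Hive purely to show the shape of the
data and the query. The column names (day, sid, impressions) and the
sample rows, including the dates, are made up for illustration and are
not a proposed Hive schema.

```python
import sqlite3

# SQLite stands in for Hive here; this only illustrates the query shape.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE myimpression (day TEXT, sid INTEGER, impressions INTEGER)"
)

# Hypothetical daily totals: one row per item per day.
rows = [
    ("2011-03-01", 1, 2),
    ("2011-03-01", 6, 2),
    ("2011-03-02", 1, 5),
]
conn.executemany("INSERT INTO myimpression VALUES (?, ?, ?)", rows)

def timeline(conn, sid):
    """Return (day, impressions) pairs for one item, oldest day first."""
    cur = conn.execute(
        "SELECT day, impressions FROM myimpression WHERE sid = ? ORDER BY day",
        (sid,),
    )
    return cur.fetchall()

timeline_for_1 = timeline(conn, 1)
for day, impressions in timeline_for_1:
    print(day, impressions)
```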

Second question:

What kind of query do I need in order to count impressions as above?

Thank you very much,
