hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Oleg Ruchovets <oruchov...@gmail.com>
Subject aggregation by time window
Date Mon, 28 Jan 2013 12:56:29 GMT
Hi ,
    I have such row data structure:

event_id  |   time
==============
event1     |  10:07
event2     |  10:10
event3     |  10:12

event4     |   10:20
event5     |   10:23
event6     |   10:25

Numbers of records is  50-100 million.

Question:
   I need to get events that was during time T.

For example: if T=7 munutes.
     event1 , event2 , event3 were detected durint 7 minutes.
     event4 , event5 , event6 were detected during 7 minutes.

How can I implement such aggregation using map/reduce.

Thanks
Oleg.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message