hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Parker <michael.g.par...@gmail.com>
Subject Side-loading output from one MR into another?
Date Thu, 23 Aug 2012 04:42:46 GMT
Hi all,

Is it possible to take a collection of sorted key-value pairs,
generated from one MapReduce, and side-load them into another
MapReduce, i.e. as it runs, the second MapReduce can look up the value
for a given key computed by the first MapReduce?

I need this for a cohort study -- one MR puts users into cohorts, and
the second MR needs that user-to-cohort mapping to see how cohorts
behave over time.

Any help would be greatly appreciated. Thanks!

- Mike

View raw message