hadoop-general mailing list archives

From abc xyz <fabc_xyz...@yahoo.com>
Subject Partitioned Datasets Map/Reduce
Date Sat, 03 Jul 2010 16:33:39 GMT
Hello everyone,

I have written a custom partitioner for partitioning datasets. I want to
partition two datasets using the same partitioner and then, in the next
MapReduce job, have each mapper handle the same partition from both
sources and perform some operation on them, such as a join. How can I
ensure that one mapper gets the splits that correspond to the same
partition from both sources?
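
To make the setup concrete, here is a minimal sketch of the behaviour I am
after, written in plain Java without any Hadoop classes. Everything in it is
an assumption for illustration only: the class name CoPartitionSketch, the
hash-based partitionFor (a stand-in for my actual partitioner), and the
in-memory maps standing in for the two datasets. The point is that partition
i of the first dataset should be processed together with partition i of the
second.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class CoPartitionSketch {

    // Hypothetical stand-in for the custom partitioner:
    // the same key always maps to the same partition index.
    static int partitionFor(String key, int numPartitions) {
        return (key.hashCode() & Integer.MAX_VALUE) % numPartitions;
    }

    // Split one dataset into numPartitions buckets with the shared partitioner.
    static List<Map<String, String>> partition(Map<String, String> dataset,
                                               int numPartitions) {
        List<Map<String, String>> parts = new ArrayList<>();
        for (int i = 0; i < numPartitions; i++) {
            parts.add(new TreeMap<>());
        }
        for (Map.Entry<String, String> e : dataset.entrySet()) {
            int p = partitionFor(e.getKey(), numPartitions);
            parts.get(p).put(e.getKey(), e.getValue());
        }
        return parts;
    }

    // What a single "mapper" would do when handed partition i of both sources:
    // an inner join on the key.
    static Map<String, String> joinPartition(Map<String, String> left,
                                             Map<String, String> right) {
        Map<String, String> joined = new TreeMap<>();
        for (Map.Entry<String, String> e : left.entrySet()) {
            String other = right.get(e.getKey());
            if (other != null) {
                joined.put(e.getKey(), e.getValue() + "," + other);
            }
        }
        return joined;
    }

    // Pair up partition i of dataset a with partition i of dataset b.
    // Both sides must use the same partitioner and the same partition count.
    static Map<String, String> joinAll(Map<String, String> a,
                                       Map<String, String> b,
                                       int numPartitions) {
        List<Map<String, String>> partsA = partition(a, numPartitions);
        List<Map<String, String>> partsB = partition(b, numPartitions);
        Map<String, String> result = new TreeMap<>();
        for (int i = 0; i < numPartitions; i++) {
            result.putAll(joinPartition(partsA.get(i), partsB.get(i)));
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> users = new TreeMap<>();
        users.put("u1", "alice");
        users.put("u2", "bob");
        users.put("u3", "carol");

        Map<String, String> orders = new TreeMap<>();
        orders.put("u1", "book");
        orders.put("u3", "pen");
        orders.put("u4", "ink");

        // Only keys present in both datasets (u1 and u3) survive the join.
        System.out.println(joinAll(users, orders, 4));
    }
}
```

In this sketch the per-partition join is only correct because both datasets
were split by the same function into the same number of partitions, so a key
can never land in partition i on one side and partition j on the other. That
pairing of partition i with partition i is the guarantee I would like to get
from the second MapReduce job.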


Any help would be highly appreciated.


      