hadoop-mapreduce-user mailing list archives

From Shahab Yunus <shahab.yu...@gmail.com>
Subject Re: Mapreduce outputs to a different cluster?
Date Thu, 24 Oct 2013 22:42:48 GMT
As far as I know, you can use distcp to transfer the results of the job
from one cluster to another once the job is done. You can write a simple
script to do that. Simple and tested. Some pointers below:
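
For instance, a minimal sketch of such a script in Python, to be run after
the job on cluster A has finished. The namenode hosts, the port 8020 and the
paths are hypothetical placeholders, not values from this thread; it also
assumes the `hadoop` command is on the PATH and that both clusters run
compatible HDFS versions. The `-update` flag only copies files that are
missing or different at the destination.

```python
import subprocess


def build_distcp_command(src_uri, dst_uri, update=True):
    """Build the argument list for a `hadoop distcp` invocation."""
    cmd = ["hadoop", "distcp"]
    if update:
        # -update skips files that already exist unchanged at the destination.
        cmd.append("-update")
    cmd.extend([src_uri, dst_uri])
    return cmd


def copy_job_output(src_uri, dst_uri, dry_run=False):
    """Copy the job output from one cluster to another with distcp.

    With dry_run=True the command is only returned, not executed.
    """
    cmd = build_distcp_command(src_uri, dst_uri)
    if not dry_run:
        # check=True raises CalledProcessError if distcp exits non-zero.
        subprocess.run(cmd, check=True)
    return cmd


if __name__ == "__main__":
    # Hypothetical cluster addresses and paths; replace with your own.
    src = "hdfs://namenode-a:8020/user/example/job-output"
    dst = "hdfs://namenode-b:8020/user/example/job-output"
    print(" ".join(copy_job_output(src, dst, dry_run=True)))
```

Drop `dry_run=True` to actually launch the copy; the script then fails loudly
if distcp returns an error, so it can be chained after the job in a wrapper.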

You might be able to do this through the job as well by changing the
output paths of the generated files, but I wouldn't suggest that, as there
can be latency and performance issues.

Maybe others have a better idea.


On Thu, Oct 24, 2013 at 6:28 PM, S. Zhou <myxjtu@yahoo.com> wrote:

> The scenario is: I run mapreduce job on cluster A (all source data is in
> cluster A) but I want the output of the job to cluster B. Is it possible?
> If yes, please let me know how to do it.
> Here are some notes of my mapreduce job:
> 1. the data source is an HBase table
> 2. It only has mapper no reducer.
> Thanks
> Senqiang
