hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ravi teja ch n v <raviteja.c...@huawei.com>
Subject RE: Calling one MR job within another MR job
Date Wed, 04 Apr 2012 11:05:03 GMT
Hi Stuti,

If you are looking for MRjob2 to run after MRjob1, ie the job dependency,

you can use JobControl API, where you can manage the dependencies.

Calling another Job from a Mapper is not a good idea.


Ravi Teja

From: Stuti Awasthi [stutiawasthi@hcl.com]
Sent: 04 April 2012 16:04:19
To: mapreduce-user@hadoop.apache.org
Subject: Calling one MR job within another MR job

Hi all,

We have a usecase in which I start with first MR1 job with input file as File1.txt, and from
this job, call another MR2 job with input as File2.txt
So :


My queries are is this kind of approach is possible and how much are the implications from
the performance perspective.

Stuti Awasthi
HCL Comnet Systems and Services Ltd
F-8/9 Basement, Sec-3,Noida.


The contents of this e-mail and any attachment(s) are confidential and intended for the named
recipient(s) only.
It shall not attach any liability on the originator or HCL or its affiliates. Any views or
opinions presented in
this email are solely those of the author and may not necessarily reflect the opinions of
HCL or its affiliates.
Any form of reproduction, dissemination, copying, disclosure, modification, distribution and
/ or publication of
this message without the prior written consent of the author of this e-mail is strictly prohibited.
If you have
received this email in error please delete it and notify the sender immediately. Before opening
any mail and
attachments please check them for viruses and defect.


View raw message