From: Robert Evans <evans@yahoo-inc.com>
To: mapreduce-user@hadoop.apache.org
Date: Thu, 22 Sep 2011 10:08:12 -0700
Subject: Re: How do I set the intermediate output path when I use 2 mapreduce jobs?

Sorry about the confusion then. Look at the code that Swathi sent. Your problem is probably with timing somewhere. You may be launching the second job before the first one has completely finished.
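
To illustrate the ordering, here is a rough sketch in plain Java with no Hadoop on the classpath. `BlockingJob` and `SequentialDriver` are made-up names for illustration only, and the temp directory stands in for the intermediate output path. In a real driver each step would be a blocking call such as `job.waitForCompletion(true)`, which returns only after the job has finished and reports whether it succeeded. `Job.submit()`, by contrast, returns immediately. I have not seen your driver, so treat the cause as a guess.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

/** Hypothetical stand-in for a job whose call blocks until the job is done. */
interface BlockingJob {
    boolean runToCompletion() throws IOException;
}

final class SequentialDriver {

    /**
     * Runs the jobs one after another and returns how many succeeded.
     * A job is started only after the previous one has returned true.
     */
    static int runInOrder(List<BlockingJob> jobs) throws IOException {
        int succeeded = 0;
        for (BlockingJob job : jobs) {
            if (!job.runToCompletion()) {
                break; // do not start later jobs on top of missing output
            }
            succeeded++;
        }
        return succeeded;
    }

    /** Chains two toy jobs through an intermediate file and returns the final result. */
    static String runTwoStageDemo() throws IOException {
        Path tempDir = Files.createTempDirectory("chain-demo");
        Path intermediate = tempDir.resolve("part-00000");
        Path finalOutput = tempDir.resolve("result");

        // Job 1 writes the intermediate result.
        BlockingJob first = () -> {
            Files.write(intermediate, "a b c".getBytes(StandardCharsets.UTF_8));
            return true;
        };

        // Job 2 reads what job 1 wrote; it fails if the file is not there yet.
        BlockingJob second = () -> {
            if (!Files.exists(intermediate)) {
                return false;
            }
            String text = new String(Files.readAllBytes(intermediate), StandardCharsets.UTF_8);
            Files.write(finalOutput, text.toUpperCase().getBytes(StandardCharsets.UTF_8));
            return true;
        };

        if (runInOrder(List.of(first, second)) != 2) {
            throw new IOException("chain did not complete");
        }
        return new String(Files.readAllBytes(finalOutput), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        System.out.println(runTwoStageDemo());
    }
}
```

The point is only that the second step starts after the first has returned successfully, so the intermediate file is guaranteed to exist by then. If the first step fails, the loop stops instead of launching the second job against missing input.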

--Bobby Evans

On 9/21/11 7:27 PM, "谭军" <tanjun_2525@163.com> wrote:

Bobby Evans,
Temp files are on HDFS, not on the local file system.

--
Regards!

Jun Tan


At 2011-09-21 22:57:44,"Robert Evans" <evans@yahoo-inc.com> wrote:
Jun Tan,

So you want to have the temp file on the local file system, not on HDFS? That is not going to work, because other parts of the code (e.g. the splitter) assume that they can see the file, which they cannot if it is only on the local file system of a remote host. It has to be stored in HDFS, or some other globally viewable file system.

--Bobby Evans


On 9/21/11 9:54 AM, "谭军" <tanjun_2525@163.com> wrote:

Hi,
I want to run 2 MR jobs sequentially.
The first job writes its intermediate result to a temp file.
The second job reads the result from the temp file, not from the FileInputPath.
I tried this, but a FileNotFoundException was reported.
Then I checked the datanodes, and the temp file had been created.
The first job executed correctly.
Why can't the second job find the file? The file was created before the second job was executed.
Thanks!

--
Regards!

Jun Tan