Return-Path: Delivered-To: apmail-hadoop-common-dev-archive@www.apache.org Received: (qmail 37343 invoked from network); 26 Sep 2009 06:35:35 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 26 Sep 2009 06:35:35 -0000 Received: (qmail 48586 invoked by uid 500); 26 Sep 2009 06:35:32 -0000 Delivered-To: apmail-hadoop-common-dev-archive@hadoop.apache.org Received: (qmail 48503 invoked by uid 500); 26 Sep 2009 06:35:31 -0000 Mailing-List: contact common-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-dev@hadoop.apache.org Delivered-To: mailing list common-dev@hadoop.apache.org Received: (qmail 48484 invoked by uid 99); 26 Sep 2009 06:35:31 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 26 Sep 2009 06:35:31 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of starrysl@gmail.com designates 209.85.223.171 as permitted sender) Received: from [209.85.223.171] (HELO mail-iw0-f171.google.com) (209.85.223.171) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 26 Sep 2009 06:35:23 +0000 Received: by iwn1 with SMTP id 1so1588784iwn.2 for ; Fri, 25 Sep 2009 23:35:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:from:date:message-id :subject:to:content-type; bh=ffWgusbCT5y+kZEFhWQrxVsZSSnt1iCBu9x4odA0k3M=; b=dWex3dBV5AXULQxD5kKbaBqfujn9ZDSNfiTd0CFOaWjYY/JRGgwb7LJYB1Hskw3d8u XTeIfQIJPmEg02NdA9iiLTT1idQ2oxAOw+FdirnuXdi2Ec/zcw5OhSg2v7h++TcNdiJR 9yojqVJyvfktKMq7HJBrevepzjryJQ9U1Xwvg= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:from:date:message-id:subject:to:content-type; b=xMM7Lbh/vqv7XJQ0DDeGGGBoKNazklPCwyxNtf3Y7cHMooEhakykRTUjeQrbDMZAMu fXwWhuUO7yLfH64lAZ47koH/BB7ksjCCYVv1aKd8Cul9pGh3VAHIf0bDSjBNO20D0zQC fNham1gW3vW9OEDx1KrY8K1xGMySgwnMjQwCM= MIME-Version: 1.0 Received: by 10.231.25.199 with SMTP id a7mr1825126ibc.51.1253946903109; Fri, 25 Sep 2009 23:35:03 -0700 (PDT) From: Starry SHI Date: Sat, 26 Sep 2009 14:34:43 +0800 Message-ID: Subject: Where are temp files stored? To: common-dev@hadoop.apache.org, common-user@hadoop.apache.org Content-Type: multipart/alternative; boundary=001517741218ba1fd40474754360 X-Virus-Checked: Checked by ClamAV on apache.org --001517741218ba1fd40474754360 Content-Type: text/plain; charset=UTF-8 Hi. I am wondering where the temp files (intermediate files) are stored. They should be located in the hadoop.tmp.dir by default, right? why I cannot find them in either the local file system and hdfs? I was doing a two table join using hadoop. before the job is completed, the intermidiate files should be stored in the tmp folder, however, I cannot find the trace of them. Can somebody tell me how to get access to the intermediate files in hadoop? Another question is about the replication of the intermediate files. By default, will the intermediate (tmp) files be written to HDFS? If yes, will they have replicas? I am thinking if the tmp files also have replica, they should cause a great overhead on the performance. Is there a way to specify which files should have replica and which need not? Looking forward to your reply! Best regards, Starry /* Tomorrow is another day. So is today. */ --001517741218ba1fd40474754360--