Message-ID: <400093.14505.qm@web15904.mail.cnb.yahoo.com>
References: <346926.55544.qm@web15904.mail.cnb.yahoo.com> <875057.75594.qm@web15903.mail.cnb.yahoo.com>
Date: Wed, 25 Aug 2010 23:12:25 +0800 (CST)
From: Gang Luo
Subject: Re: where distributed cache start working
To: common-dev@hadoop.apache.org

Hi Jeff,
I realize the profiling runs within each JVM, while the distributed cache
seems to start before the JVM starts. That is probably why I couldn't trace it.

Thanks,
-Gang


----- Original Message ----
From: Jeff Zhang
To: common-dev@hadoop.apache.org
Sent: 2010/8/23 (Mon) 12:47:31 AM
Subject: Re: where distributed cache start working

Are you debugging it with LocalJobRunner? In local mode, TaskRunner won't
be called. In local mode, the mapper task runs in a thread rather than in a
forked JVM. TaskRunner is only called in distributed mode.


2010/8/22 Gang Luo :
> Thanks Jeff.
>
> However, are you sure TaskRunner.run() is also used in the new API? I used
> btrace to trace the function calls but didn't find this function being
> called anywhere.
>
> One more question about the distributed cache. After I call
> DistributedCache.purgeCache, I think the locally cached files should be
> deleted or invalidated. However, when I run the same job with the purge
> operation at the end multiple times, I find the local files have never been
> deleted and the modification time is still that of the first job run. How
> can I ask my job to redistribute the cache anyway?
>
> Thanks,
> -Gang
>
>
> ----- Original Message ----
> From: Jeff Zhang
> To: common-dev@hadoop.apache.org
> Sent: 2010/8/20 (Fri) 11:22:49 AM
> Subject: Re: where distributed cache start working
>
> Hi Gang,
>
> In TaskRunner's run() method, Hadoop downloads the cache files that you
> set on the client side to the local disk; the forked child JVM can then
> use these cache files locally.
>
>
> On Fri, Aug 20, 2010 at 8:08 AM, Gang Luo wrote:
>> Hi all,
>> I went through the code, but couldn't find the place where the distributed
>> cache starts working. I want to know, between DistributedCache.addCacheFile
>> at the master node and DistributedCache.getLocalCacheFiles at the client
>> side, when and where do the files get distributed?
>>
>> Thanks,
>> -Gang
>>
>
> --
> Best Regards
>
> Jeff Zhang
>

-- 
Best Regards

Jeff Zhang
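
The ordering discussed in this thread (the client registers the file, the parent process localizes it, and only then does the task read a local path) can be illustrated with a small stand-alone sketch. This is a toy model under stated assumptions, not Hadoop code: the class CacheFlowSketch, the ToyJobConf holder, and the methods addCacheFile, localize, runTask, and demo are all hypothetical names invented for illustration, and two temporary paths stand in for the shared file system and the task-local directory. It only shows why a tracer attached to the task alone would never see the copy step.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.ArrayList;
import java.util.List;

/** Toy model of the flow described in the thread. Hypothetical; not Hadoop code. */
class CacheFlowSketch {

    /** Hypothetical stand-in for the job configuration passed from client to task. */
    static final class ToyJobConf {
        final List<Path> cacheFiles = new ArrayList<>();      // registered on the client side
        final List<Path> localCacheFiles = new ArrayList<>(); // filled in before the task starts
    }

    /** Client side: only records the file; nothing is copied yet. */
    static void addCacheFile(Path source, ToyJobConf conf) {
        conf.cacheFiles.add(source);
    }

    /** Parent side: copies every registered file into localDir before the task runs. */
    static void localize(ToyJobConf conf, Path localDir) {
        for (Path source : conf.cacheFiles) {
            Path target = localDir.resolve(source.getFileName());
            try {
                Files.copy(source, target, StandardCopyOption.REPLACE_EXISTING);
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
            conf.localCacheFiles.add(target);
        }
    }

    /** Task side: reads the first already-local copy and never touches the source. */
    static String runTask(ToyJobConf conf) {
        try {
            byte[] bytes = Files.readAllBytes(conf.localCacheFiles.get(0));
            return new String(bytes, StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Runs the three phases in order and returns what the task read. */
    static String demo() {
        try {
            // A temp file stands in for a file on the shared file system.
            Path shared = Files.createTempFile("shared-", ".txt");
            Files.write(shared, "lookup-table".getBytes(StandardCharsets.UTF_8));
            Path localDir = Files.createTempDirectory("task-local-");

            ToyJobConf conf = new ToyJobConf();
            addCacheFile(shared, conf);   // phase 1: client registers the file
            localize(conf, localDir);     // phase 2: parent copies it locally
            return runTask(conf);         // phase 3: task reads the local copy
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(demo()); // prints "lookup-table"
    }
}
```

In this model, instrumenting only runTask shows a read of a local file and nothing else; the copy happens in localize, which runs earlier and outside the task, matching the observation that profiling inside the task JVM does not capture the cache distribution.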