Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4EE8B10CC0 for ; Mon, 27 Jan 2014 19:01:06 +0000 (UTC) Received: (qmail 50769 invoked by uid 500); 27 Jan 2014 19:00:56 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 50674 invoked by uid 500); 27 Jan 2014 19:00:56 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 50667 invoked by uid 99); 27 Jan 2014 19:00:56 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 27 Jan 2014 19:00:56 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,WEIRD_PORT X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of vinodkv@hortonworks.com designates 209.85.220.53 as permitted sender) Received: from [209.85.220.53] (HELO mail-pa0-f53.google.com) (209.85.220.53) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 27 Jan 2014 19:00:49 +0000 Received: by mail-pa0-f53.google.com with SMTP id lj1so6225157pab.26 for ; Mon, 27 Jan 2014 11:00:28 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:sender:from:message-id:mime-version:subject:date :references:to:in-reply-to:content-type; bh=udVlTBzau8spuW+2hnkbuUMZ0ldkuo362eHEzWVppvo=; b=CtesssKco0vbMcNW8Rc9pDgVUhnJA13EehVuc5ZuIMZJ0EaprTKiA9oVufiAfl/70F WDERyCOMaJhCq4+UireM3XRXJ9WSn0Htoyv1lWrV166iJAEG/lOCSNuPitzaJSWpkJqv 6KcfCeV8jZP7w9YnLLj+9KKMb5z4Xuh46dn4P7DCKJcZT7mrLA3WACleIJqscQGr0Gf8 Az4STz1tqe8tpcsxWmP51lzbOMXW6/7CauxooDJlCCFM7TJzHX7pje9uwWOJ4zpHhZq7 mMOhiPxKHZGlnXFiH4SPYTMwPWjKA5Mg56IBYLT46PQlyGm++Y4dEuIIjK7KY4ur5ZVI G3RQ== X-Gm-Message-State: ALoCoQl3gNzeopjLUokLU7r19f9xq7NX/oHwPjzw1+7DplaVj6UkMfxq2txW7ZzUtatOfqoOX7G6MKoa9eUQfWlkynqMruj4pqAbRARVNrJeNSCTQBtcYHs= X-Received: by 10.68.204.161 with SMTP id kz1mr4460038pbc.156.1390849228405; Mon, 27 Jan 2014 11:00:28 -0800 (PST) Received: from [10.11.2.113] ([192.175.27.2]) by mx.google.com with ESMTPSA id yi8sm91146117pab.16.2014.01.27.11.00.13 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Mon, 27 Jan 2014 11:00:17 -0800 (PST) Sender: Vinod Kumar Vavilapalli From: Vinod Kumar Vavilapalli Message-Id: Mime-Version: 1.0 (Mac OS X Mail 7.1 \(1827\)) Subject: Re: Invalide URI in job start Date: Mon, 27 Jan 2014 11:00:10 -0800 References: <52E62F51.6030308@gmail.com> To: user@hadoop.apache.org In-Reply-To: <52E62F51.6030308@gmail.com> X-Mailer: Apple Mail (2.1827) Content-Type: multipart/mixed; boundary=047d7b2e48d833a6a004f0f8527f X-Virus-Checked: Checked by ClamAV on apache.org --047d7b2e48d833a6a004f0f8527f Content-Type: multipart/alternative; boundary="Apple-Mail=_DE9BB03F-039B-4DD1-8BC0-703189D2FED3" --Apple-Mail=_DE9BB03F-039B-4DD1-8BC0-703189D2FED3 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=ISO-8859-1 Need your help to debug this. Seems like the scheme is getting lost somewhe= re along the way. Clearly as you say if job.jar is on the file-system, then= JobClient is properly uploading it. There are multilple things that you'll= need to check - Check the NodeManager logs for the URL. It does print what URL it is try= ing to download from. Check if the scheme is getting there or not. - If that doesn't tell you something, change JobClient to print the URL be= fore it constructs the ContainerLaunchContext for the ApplicationMaster. Yo= u'll need to do this in YarnRunner.java. Specifically the method createAppl= icationResource. Thanks, +Vinod Kumar Vavilapalli Hortonworks Inc. http://hortonworks.com/ On Jan 27, 2014, at 2:05 AM, Lukas Kairies = wrote: > Hello, >=20 > I try to use XtreemFS as an alternative file system for Hadoop 2.x. There= is an existing FileSystem implementation for Hadoop 1.x that works fine. F= irst think I did was to implement a DelegateToFileSystem subclass to provid= e an AbstractFileSystem implementation for XtreemFS (just constructors that= use the FileSystem implementation). When I start the wordcount example app= lication I get the following Exception on the NodeManager: >=20 > 2014-01-20 14:18:19,349 WARN org.apache.hadoop.yarn.server.nodemanager.co= ntainermanager.container.Container: Failed to parse resource-request > java.net.URISyntaxException: Expected scheme name at index 0: :///tmp/had= oop-yarn/staging/lkairies/.staging/job_1390223418764_0004/job.jar > at java.net.URI$Parser.fail(URI.java:2829) > at java.net.URI$Parser.failExpecting(URI.java:2835) > at java.net.URI$Parser.parse(URI.java:3027) > at java.net.URI.(URI.java:753) > at org.apache.hadoop.yarn.util.ConverterUtils.getPathFromYarnURL(C= onverterUtils.java:80) > at org.apache.hadoop.yarn.server.nodemanager.containermanager.loca= lizer.LocalResourceRequest.(LocalResourceRequest.java:46) > at org.apache.hadoop.yarn.server.nodemanager.containermanager.cont= ainer.ContainerImpl$RequestResourcesTransition.transition(ContainerImpl.jav= a:529) > at org.apache.hadoop.yarn.server.nodemanager.containermanager.cont= ainer.ContainerImpl$RequestResourcesTransition.transition(ContainerImpl.jav= a:497) > at org.apache.hadoop.yarn.state.StateMachineFactory$MultipleIntern= alArc.doTransition(StateMachineFactory.java:385) > at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(S= tateMachineFactory.java:302) > at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(Sta= teMachineFactory.java:46) > at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateM= achine.doTransition(StateMachineFactory.java:448) > at org.apache.hadoop.yarn.server.nodemanager.containermanager.cont= ainer.ContainerImpl.handle(ContainerImpl.java:864) > at org.apache.hadoop.yarn.server.nodemanager.containermanager.cont= ainer.ContainerImpl.handle(ContainerImpl.java:73) > at org.apache.hadoop.yarn.server.nodemanager.containermanager.Cont= ainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:= 815) > at org.apache.hadoop.yarn.server.nodemanager.containermanager.Cont= ainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:= 808) > at org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDisp= atcher.java:134) > at org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatc= her.java:81) > at java.lang.Thread.run(Thread.java:724) >=20 > Additionally the following is printed on the console: >=20 > 14/01/27 11:02:14 INFO input.FileInputFormat: Total input paths to proces= s : 1 > 14/01/27 11:02:14 INFO mapreduce.JobSubmitter: number of splits:1 > 14/01/27 11:02:15 INFO Configuration.deprecation: user.name is deprecated= . Instead, use mapreduce.job.user.name > 14/01/27 11:02:15 INFO Configuration.deprecation: mapred.jar is deprecate= d. Instead, use mapreduce.job.jar > 14/01/27 11:02:15 INFO Configuration.deprecation: mapred.output.value.cla= ss is deprecated. Instead, use mapreduce.job.output.value.class > 14/01/27 11:02:15 INFO Configuration.deprecation: mapreduce.combine.class= is deprecated. Instead, use mapreduce.job.combine.class > 14/01/27 11:02:15 INFO Configuration.deprecation: mapreduce.map.class is = deprecated. Instead, use mapreduce.job.map.class > 14/01/27 11:02:15 INFO Configuration.deprecation: mapred.job.name is depr= ecated. Instead, use mapreduce.job.name > 14/01/27 11:02:15 INFO Configuration.deprecation: mapreduce.reduce.class = is deprecated. Instead, use mapreduce.job.reduce.class > 14/01/27 11:02:15 INFO Configuration.deprecation: mapred.input.dir is dep= recated. Instead, use mapreduce.input.fileinputformat.inputdir > 14/01/27 11:02:15 INFO Configuration.deprecation: mapred.output.dir is de= precated. Instead, use mapreduce.output.fileoutputformat.outputdir > 14/01/27 11:02:15 INFO Configuration.deprecation: mapred.map.tasks is dep= recated. Instead, use mapreduce.job.maps > 14/01/27 11:02:15 INFO Configuration.deprecation: mapred.output.key.class= is deprecated. Instead, use mapreduce.job.output.key.class > 14/01/27 11:02:15 INFO Configuration.deprecation: mapred.working.dir is d= eprecated. Instead, use mapreduce.job.working.dir > 14/01/27 11:02:15 INFO mapreduce.JobSubmitter: Submitting tokens for job:= job_1390816735288_0001 > 14/01/27 11:02:15 INFO impl.YarnClientImpl: Submitted application applica= tion_1390816735288_0001 to ResourceManager at /0.0.0.0:8032 > 14/01/27 11:02:15 INFO mapreduce.Job: The url to track the job: http://lu= diwg:8088/proxy/application_1390816735288_0001/ > 14/01/27 11:02:15 INFO mapreduce.Job: Running job: job_1390816735288_0001 > 14/01/27 11:02:19 INFO mapreduce.Job: Job job_1390816735288_0001 running = in uber mode : false > 14/01/27 11:02:19 INFO mapreduce.Job: map 0% reduce 0% > 14/01/27 11:02:19 INFO mapreduce.Job: Job job_1390816735288_0001 failed w= ith state FAILED due to: Application application_1390816735288_0001 failed = 2 times due to AM Container for appattempt_1390816735288_0001_000002 exited= with exitCode: -1000 due to: .Failing this attempt.. Failing the applicat= ion. > 14/01/27 11:02:19 INFO mapreduce.Job: Counters: 0 >=20 >=20 > The job files are created in XtreemFS. After a lot of debugging I still d= id not find the problem. >=20 > Any idea how to fix this? >=20 > Regards, > Lukas --=20 CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to= =20 which it is addressed and may contain information that is confidential,=20 privileged and exempt from disclosure under applicable law. If the reader= =20 of this message is not the intended recipient, you are hereby notified that= =20 any printing, copying, dissemination, distribution, disclosure or=20 forwarding of this communication is strictly prohibited. If you have=20 received this communication in error, please contact the sender immediately= =20 and delete it from your system. Thank You. --Apple-Mail=_DE9BB03F-039B-4DD1-8BC0-703189D2FED3 Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=ISO-8859-1 Need your help to debug th= is. Seems like the scheme is getting lost somewhere along the way. Clearly = as you say if job.jar is on the file-system, then JobClient is properly upl= oading it. There are multilple things that you'll need to check
 -= Check the NodeManager logs for the URL. It does print what URL it is tryin= g to download from. Check if the scheme is getting there or not.
&n= bsp;- If that doesn't tell you something, change JobClient to print the URL= before it constructs the ContainerLaunchContext for the ApplicationMaster.= You'll need to do this in YarnRunner.java. Specifically the method cr= eateApplicationResource.

Thanks,
+Vinod Kumar Vavi= lapalli
Hortonworks Inc.
http://hortonworks.com/

On Jan 27, 2014, at 2:05 AM, Lukas Kairies <lukas.xtreemfs@googlemail.com> wr= ote:

Hello,

I try to use XtreemFS as an alternative file system for Hado= op 2.x. There is an existing FileSystem implementation for Hadoop 1.x that = works fine. First think I did was to implement a DelegateToFileSystem subcl= ass to provide an AbstractFileSystem implementation for XtreemFS (just cons= tructors that use the FileSystem implementation). When I start the wordcoun= t example application I get the following Exception on the NodeManager:
=
2014-01-20 14:18:19,349 WARN org.apache.hadoop.yarn.server.nodemanager.= containermanager.container.Container: Failed to parse resource-request
j= ava.net.URISyntaxException: Expected scheme name at index 0: :///tmp/hadoop= -yarn/staging/lkairies/.staging/job_1390223418764_0004/job.jar
 &n= bsp;     at java.net.URI$Parser.fail(URI.java:2829= )
       at java.net.URI$Parser.fail= Expecting(URI.java:2835)
       at j= ava.net.URI$Parser.parse(URI.java:3027)
     &= nbsp; at java.net.URI.<init>(URI.java:753)
   = ;    at org.apache.hadoop.yarn.util.ConverterUtils.getP= athFromYarnURL(ConverterUtils.java:80)
     &n= bsp; at org.apache.hadoop.yarn.server.nodemanager.containermanager.loc= alizer.LocalResourceRequest.<init>(LocalResourceRequest.java:46)
=        at org.apache.hadoop.yarn.server.= nodemanager.containermanager.container.ContainerImpl$RequestResourcesTransi= tion.transition(ContainerImpl.java:529)
     &= nbsp; at org.apache.hadoop.yarn.server.nodemanager.containermanager.co= ntainer.ContainerImpl$RequestResourcesTransition.transition(ContainerImpl.j= ava:497)
       at org.apache.hadoop= .yarn.state.StateMachineFactory$MultipleInternalArc.doTransition(StateMachi= neFactory.java:385)
       at org.ap= ache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory= .java:302)
       at org.apache.hado= op.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)        at org.apache.hadoop.yarn.stat= e.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory= .java:448)
       at org.apache.hado= op.yarn.server.nodemanager.containermanager.container.ContainerImpl.handle(= ContainerImpl.java:864)
       at or= g.apache.hadoop.yarn.server.nodemanager.containermanager.container.Containe= rImpl.handle(ContainerImpl.java:73)
      = ; at org.apache.hadoop.yarn.server.nodemanager.containermanager.Contai= nerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:81= 5)
       at org.apache.hadoop.yarn.= server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDisp= atcher.handle(ContainerManagerImpl.java:808)
    &n= bsp;  at org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(As= yncDispatcher.java:134)
       at or= g.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:81)        at java.lang.Thread.run(Thread= .java:724)

Additionally the following is printed on the console:
=
14/01/27 11:02:14 INFO input.FileInputFormat: Total input paths to proc= ess : 1
14/01/27 11:02:14 INFO mapreduce.JobSubmitter: number of splits:= 1
14/01/27 11:02:15 INFO Configuration.deprecation: user.name is depreca= ted. Instead, use mapreduce.job.user.name
14/01/27 11:02:15 INFO Configu= ration.deprecation: mapred.jar is deprecated. Instead, use mapreduce.job.ja= r
14/01/27 11:02:15 INFO Configuration.deprecation: mapred.output.value.= class is deprecated. Instead, use mapreduce.job.output.value.class
14/01= /27 11:02:15 INFO Configuration.deprecation: mapreduce.combine.class is dep= recated. Instead, use mapreduce.job.combine.class
14/01/27 11:02:15 INFO= Configuration.deprecation: mapreduce.map.class is deprecated. Instead, use= mapreduce.job.map.class
14/01/27 11:02:15 INFO Configuration.deprecatio= n: mapred.job.name is deprecated. Instead, use mapreduce.job.name
14/01/= 27 11:02:15 INFO Configuration.deprecation: mapreduce.reduce.class is depre= cated. Instead, use mapreduce.job.reduce.class
14/01/27 11:02:15 INFO Co= nfiguration.deprecation: mapred.input.dir is deprecated. Instead, use mapre= duce.input.fileinputformat.inputdir
14/01/27 11:02:15 INFO Configuration= .deprecation: mapred.output.dir is deprecated. Instead, use mapreduce.outpu= t.fileoutputformat.outputdir
14/01/27 11:02:15 INFO Configuration.deprec= ation: mapred.map.tasks is deprecated. Instead, use mapreduce.job.maps
1= 4/01/27 11:02:15 INFO Configuration.deprecation: mapred.output.key.class is= deprecated. Instead, use mapreduce.job.output.key.class
14/01/27 11:02:= 15 INFO Configuration.deprecation: mapred.working.dir is deprecated. Instea= d, use mapreduce.job.working.dir
14/01/27 11:02:15 INFO mapreduce.JobSub= mitter: Submitting tokens for job: job_1390816735288_0001
14/01/27 11:02= :15 INFO impl.YarnClientImpl: Submitted application application_13908167352= 88_0001 to ResourceManager at /0.0.0.0:8032
14/01/27 11:02:15 INFO mapre= duce.Job: The url to track the job: http://ludiwg:8088/proxy/application_13908167= 35288_0001/
14/01/27 11:02:15 INFO mapreduce.Job: Running job: job_1= 390816735288_0001
14/01/27 11:02:19 INFO mapreduce.Job: Job job_13908167= 35288_0001 running in uber mode : false
14/01/27 11:02:19 INFO mapreduce= .Job:  map 0% reduce 0%
14/01/27 11:02:19 INFO mapreduce.Job: Job j= ob_1390816735288_0001 failed with state FAILED due to: Application applicat= ion_1390816735288_0001 failed 2 times due to AM Container for appattempt_13= 90816735288_0001_000002 exited with  exitCode: -1000 due to: .Failing = this attempt.. Failing the application.
14/01/27 11:02:19 INFO mapreduce= .Job: Counters: 0


The job files are created in XtreemFS. After a= lot of debugging I still did not find the problem.

Any idea how to = fix this?

Regards,
Lukas


CONFIDENTIALITY NOTICE
NOTICE: This message is = intended for the use of the individual or entity to which it is addressed a= nd may contain information that is confidential, privileged and exempt from= disclosure under applicable law. If the reader of this message is not the = intended recipient, you are hereby notified that any printing, copying, dis= semination, distribution, disclosure or forwarding of this communication is= strictly prohibited. If you have received this communication in error, ple= ase contact the sender immediately and delete it from your system. Thank Yo= u. --Apple-Mail=_DE9BB03F-039B-4DD1-8BC0-703189D2FED3-- --047d7b2e48d833a6a004f0f8527f Content-Transfer-Encoding: 7bit Content-Disposition: attachment; filename=signature.asc Content-Type: application/pgp-signature; name=signature.asc Content-Description: Message signed with OpenPGP using GPGMail -----BEGIN PGP SIGNATURE----- Comment: GPGTools - http://gpgtools.org iQEcBAEBAgAGBQJS5qy6AAoJECVa31bDbF8PR08H/RrTzK7E8resNm5+FZT0/ytz oZRw/PD1pb94iGbVyVEcZ/tVO6XaMhTJ8SXPaW1uTB85ScrgLxPLO1n7GUsX0UyJ QIHF9Zhcl4ikef7L24AtDYjDLGBBY+DHe4dNpB6cmK+09PUEkvZN8MebocyfkeGP qTjgSfq00TKsGMr5/4C57UJsNh1p0sadxvJurCFHg3GtPl+HucbzH/VUt0YP6O9l FBBsIERt/Icg/iZT/i3aSaO52NtyxtDMLfRgj76SvKk26w1RZacdOWh/NA0ctHwU ukeGwdCOqWFBmIGSnZtUGcnSYxoLSa2wd13g71pmcjWclrSkFPVRgcMp0EiFisA= =VHxG -----END PGP SIGNATURE----- --047d7b2e48d833a6a004f0f8527f--