From: Tom Graves <tgraves_cs@yahoo.com>
Reply-To: user@spark.incubator.apache.org
To: user@spark.incubator.apache.org
Date: Tue, 19 Nov 2013 07:55:41 -0800 (PST)
Subject: Re: App master failed to find application jar in the master branch on YARN

The property is deprecated but will still work. Either one is fine.
Launching the job from the namenode is fine.

I brought up a cluster with 2.0.5-alpha and built the latest spark master branch, and it runs fine for me. It looks like namenode 2.0.5-alpha won't even start with the default defaultFs of file:///. Please make sure your namenode is actually up and running and that you are pointing to it; you can run some jobs successfully without it on a single-node cluster, but not with a multinode cluster. Here is the error I get when I run without a namenode up; it looks very similar to your error message:

        appDiagnostics: Application application_1384876319080_0001 failed 1 times due to AM Container for appattempt_1384876319080_0001_000001 exited with exitCode: -1000 due to: java.io.FileNotFoundException: File file:/home/tgravescs/spark-master/assembly/target/scala-2.9.3/spark-assembly-0.9.0-incubating-SNAPSHOT-hadoop2.0.5-alpha.jar does not exist

When you changed the default fs config, did you restart the cluster?
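
A quick sanity check from the submission host might look like this (untested sketch; it assumes the hadoop and hdfs commands are on your PATH):

# Is a NameNode JVM actually running on the namenode host?
jps | grep -i NameNode
# Does the client resolve / against HDFS rather than the local disk?
hadoop fs -ls /
# Are datanodes reporting in?
hdfs dfsadmin -report | head -n 20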

Can you try just running the examples jar:

SPARK_JAR=assembly/target/scala-2.9.3/spark-assembly-0.9.0-incubating-SNAPSHOT-hadoop2.0.5-alpha.jar

./spark-class org.apache.spark.deploy.yarn.Client --jar examples/target/scala-2.9.3/spark-examples-assembly-0.9.0-incubating-SNAPSHOT.jar --class org.apache.spark.examples.SparkPi --args yarn-standalone --num-workers 2 --master-memory 2g --worker-memory 2g --worker-cores 1

On the client side you should see messages like this:
13/11/19 15:41:30 INFO yarn.Client: Uploading file:/home/tgravescs/spark-master/examples/target/scala-2.9.3/spark-examples-assembly-0.9.0-incubating-SNAPSHOT.jar to hdfs://namenode.host.com:9000/user/tgravescs/.sparkStaging/application_1384874528558_0003/spark-examples-assembly-0.9.0-incubating-SNAPSHOT.jar
13/11/19 15:41:31 INFO yarn.Client: Uploading file:/home/tgravescs/spark-master/assembly/target/scala-2.9.3/spark-assembly-0.9.0-incubating-SNAPSHOT-hadoop2.0.5-alpha.jar to hdfs://namenode.host.com:9000/user/tgravescs/.sparkStaging/application_1384874528558_0003/spark-assembly-0.9.0-incubating-SNAPSHOT-hadoop2.0.5-alpha.jar
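
If the uploads succeeded, both jars should then be visible in the staging directory on hdfs, e.g. (host, user, and application id taken from the log lines above; substitute your own):

hadoop fs -ls hdfs://namenode.host.com:9000/user/tgravescs/.sparkStaging/application_1384874528558_0003/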

Tom


On Tuesday, November 19, 2013 5:35 AM, guojc <guojc03@gmail.com> wrote:
Hi Tom,
   Thank you for your response. I have double checked that I uploaded both jars to the same folder on hdfs. I think the fs.default.name you pointed out is the old deprecated name for the fs.defaultFS config, according to http://hadoop.apache.org/docs/r2.0.2-alpha/hadoop-project-dist/hadoop-common/DeprecatedProperties.html. Anyway, we have tried setting both fs.default.name and fs.defaultFS to the hdfs namenode, and the situation remained the same. We have also removed the SPARK_HOME env variable on the worker nodes. One additional piece of information that might be related: job submission is done on the same machine as the hdfs namenode. But I'm not sure whether this causes the problem.

Thanks,
Jiacheng Guo


On Tue, Nov 19, 2013 at 11:50 AM, Tom Graves <tgraves_cs@yahoo.com> wrote:

Sorry for the delay. What is the default filesystem on your HDFS setup? It looks like it's set to file: rather than hdfs://. That is the only reason I can think of that it would be listing the directory as file:/home/work/.sparkStaging/application_1384588058297_0056. It's basically just copying the jar locally rather than uploading it to hdfs, and then trying to use the local file:/home/work/guojiacheng/spark/assembly/target/scala-2.9.3/spark-assembly-0.9.0-incubating-SNAPSHOT-hadoop2.0.5-alpha.jar. It would generally create that in hdfs so it is accessible on all the nodes. Is your /home/work nfs-mounted on all the nodes?

You can find the default fs by looking at the Hadoop config files, generally in core-site.xml. It's specified by: <name>fs.default.name</name>
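
For example, a quick way to inspect it from the command line (assuming the config lives under $HADOOP_CONF_DIR, commonly /etc/hadoop/conf):

# Print the default-filesystem property and its value; expect something
# like hdfs://namenode.host.com:9000 here rather than file:///
grep -A 1 -E 'fs\.default(\.name|FS)' "$HADOOP_CONF_DIR"/core-site.xml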

It's pretty odd that it's erroring with file:// when you specified hdfs://. When you tried hdfs://, did you upload both the spark jar and your client jar (SparkAUC-assembly-0.1.jar)? If not, try that, and make sure to put hdfs:// on them when you export SPARK_JAR and specify the --jar option.
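
Something like this might do it (untested sketch; the host and the jars directory are placeholders matching your launch command):

# Upload both jars to HDFS first
hadoop fs -mkdir hdfs://{hdfs_host}:9000/user/work/jars
hadoop fs -put assembly/target/scala-2.9.3/spark-assembly-0.9.0-incubating-SNAPSHOT-hadoop2.0.5-alpha.jar hdfs://{hdfs_host}:9000/user/work/jars/
hadoop fs -put SparkAUC-assembly-0.1.jar hdfs://{hdfs_host}:9000/user/work/jars/

# Then point SPARK_JAR and --jar at the hdfs:// URIs
export SPARK_JAR=hdfs://{hdfs_host}:9000/user/work/jars/spark-assembly-0.9.0-incubating-SNAPSHOT-hadoop2.0.5-alpha.jar
./spark-class org.apache.spark.deploy.yarn.Client --jar hdfs://{hdfs_host}:9000/user/work/jars/SparkAUC-assembly-0.1.jar --class myClass.SparkAUC ...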

I'll try to reproduce the error tomorrow to see if a bug was introduced when I added the feature to run spark from HDFS.

Tom


On Monday, November 18, 2013 11:13 AM, guojc <guojc03@gmail.com> wrote:
Hi Tom,
   I'm on Hadoop 2.0.5. I can launch applications with the spark 0.8 release normally. However, when I switch to the git master branch version, with my application built against it, I get the jar-not-found exception, and the same happens with the example application. I have tried both the file:// protocol and the hdfs:// protocol, with the jar in the local file system and on hdfs respectively, and even tried the jar list parameter when creating a new spark context. The exception is slightly different for the hdfs protocol and the local file path. My application launch command is:

 SPARK_JAR=/home/work/guojiacheng/spark/assembly/target/scala-2.9.3/spark-assembly-0.9.0-incubating-SNAPSHOT-hadoop2.0.5-alpha.jar /home/work/guojiacheng/spark/spark-class org.apache.spark.deploy.yarn.Client --jar /home/work/guojiacheng/spark-auc/target/scala-2.9.3/SparkAUC-assembly-0.1.jar --class myClass.SparkAUC --args -c --args yarn-standalone --args -i --args hdfs://{hdfs_host}:9000/user/work/guojiacheng/data --args -m --args hdfs://{hdfs_host}:9000/user/work/guojiacheng/model_large --args -o --args hdfs://{hdfs_host}:9000/user/work/guojiacheng/score --num-workers 60 --master-memory 6g --worker-memory 7g --worker-cores 1

And my build command is SPARK_HADOOP_VERSION=2.0.5-alpha SPARK_YARN=true sbt/sbt assembly

The only thing I can think of that might be related is that each cluster node has an env variable SPARK_HOME pointing to a copy of the 0.8 version, and its bin folder is in the PATH environment variable; the 0.9 version is not there. It was something left over from when the cluster was set up. But I don't know whether it is related, as my understanding is that the yarn version tries to distribute spark through yarn.
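
To check what is actually set on each node, something like this could work (sketch only; the worker hostnames are placeholders):

# Hypothetical loop over worker hosts; adjust the host list to your cluster
for h in worker1 worker2; do
  ssh "$h" 'env | grep -E "SPARK_(HOME|JAR|EXAMPLES_JAR)"'
done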
hdfs version error message:

        appDiagnostics: Application application_1384588058297_0056 failed 1 times due to AM Container for appattempt_1384588058297_0056_000001 exited with exitCode: -1000 due to: RemoteTrace:
java.io.FileNotFoundException: File file:/home/work/.sparkStaging/application_1384588058297_0056/SparkAUC-assembly-0.1.jar does not exist
   
local version error message:
appDiagnostics: Application application_1384588058297_0066 failed 1 times due to AM Container for appattempt_1384588058297_0066_000001 exited with exitCode: -1000 due to: java.io.FileNotFoundException: File file:/home/work/guojiacheng/spark/assembly/target/scala-2.9.3/spark-assembly-0.9.0-incubating-SNAPSHOT-hadoop2.0.5-alpha.jar does not exist

Best Regards,
Jiacheng Guo



On Mon, Nov 18, 2013 at 10:34 PM, Tom Graves <tgraves_cs@yahoo.com> wrote:

Hey Jiacheng Guo,

Do you have the SPARK_EXAMPLES_JAR env variable set? If you do, you have to add the --addJars parameter to the yarn client and point it to the spark examples jar. Or just unset the SPARK_EXAMPLES_JAR env variable.

You should only have to set the SPARK_JAR env variable.
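
For instance (sketch only; the examples-jar path is the one from the build tree above):

# Option 1: clear the leftover variable before launching
unset SPARK_EXAMPLES_JAR

# Option 2: ship the examples jar explicitly through the yarn client
./spark-class org.apache.spark.deploy.yarn.Client --addJars examples/target/scala-2.9.3/spark-examples-assembly-0.9.0-incubating-SNAPSHOT.jar ...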

If that isn't the issue, let me know the build command you used, your hadoop version, and your hadoop defaultFs.
Tom


On Saturday, November 16, 2013 2:32 AM, guojc <guojc03@gmail.com> wrote:
hi,
   After reading about the exciting progress in consolidating shuffle, I'm eager to try out the latest master branch. However, upon launching the example application, the job failed, reporting that the app master failed to find the target jar: appDiagnostics: Application application_1384588058297_0017 failed 1 times due to AM Container for appattempt_1384588058297_0017_000001 exited with exitCode: -1000 due to: java.io.FileNotFoundException: File file:/${my_work_dir}/spark/examples/target/scala-2.9.3/spark-examples-assembly-0.9.0-incubating-SNAPSHOT.jar does not exist.

  Is there any change in how to launch a yarn job now?

Best Regards,
Jiacheng Guo

