Date: Wed, 16 Apr 2014 13:30:55 -0700
Subject: Re: using "-libjars" in Hadoop 2.2.1
From: Abdelrahman Shettia <ashettia@hortonworks.com>
To: user@hadoop.apache.org

Hi Kim,

You can try to grep for the RM java process by running the following command:

ps aux | grep
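For example, a minimal sketch assuming the stock YARN ResourceManager main class (the class name below is the Hadoop 2.x default; adjust the pattern to your install):

    ps aux | grep org.apache.hadoop.yarn.server.resourcemanager.ResourceManager

Run it on the host you expect to be the RM; if the daemon is up, this shows its JVM process and the user it runs as.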
On Wed, Apr 16, 2014 at 10:31 AM, Kim Chew <kchew534@gmail.com> wrote:

> Thanks Rahman, I had mixed things up a little bit in my mapred-site.xml,
> so it tried to run the job locally. Now I am running into the problem that
> Rahul has: I am unable to connect to the ResourceManager.
>
> The setup of my targeted cluster runs MR1 instead of YARN, hence
> "mapreduce.framework.name" is set to "classic".
>
> Here are my settings in my mapred-site.xml on the client side.
>
>     <property>
>         <!-- Pointed to the remote JobTracker -->
>         <name>mapreduce.job.tracker.address</name>
>         <value>172.31.3.150:8021</value>
>     </property>
>     <property>
>         <name>mapreduce.framework.name</name>
>         <value>yarn</value>
>     </property>
>
> and my yarn-site.xml
>
>     <property>
>         <description>The hostname of the RM.</description>
>         <name>yarn.resourcemanager.hostname</name>
>         <value>172.31.3.150</value>
>     </property>
>     <property>
>         <description>The address of the applications manager interface in the RM.</description>
>         <name>yarn.resourcemanager.address</name>
>         <value>${yarn.resourcemanager.hostname}:8032</value>
>     </property>
>
> 14/04/16 10:23:02 INFO client.RMProxy: Connecting to ResourceManager at /172.31.3.150:8032
> 14/04/16 10:23:10 INFO ipc.Client: Retrying connect to server:
> hadoop-host1.eng.narus.com/172.31.3.150:8032. Already tried 0 time(s);
> retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)
>
> Therefore, the question is: how do I figure out where the ResourceManager
> is running?
>
> TIA
>
> Kim
>
> On Wed, Apr 16, 2014 at 8:43 AM, Abdelrahman Shettia
> <ashettia@hortonworks.com> wrote:
>
>> Hi Kim,
>>
>> It looks like it is pointing to an hdfs location. Can you create the hdfs
>> dir and put the jar there? Hope this helps.
>> Thanks,
>> Rahman
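For example, following Rahman's suggestion, a sketch assuming a target directory of /user/hduser/lib (the directory is illustrative; any HDFS path you can write to works):

    hdfs dfs -mkdir -p /user/hduser/lib
    hdfs dfs -put /home/hduser/Documents/Lib/json-simple-1.1.1.jar /user/hduser/lib/

After that, -libjars can be pointed at hdfs:///user/hduser/lib/json-simple-1.1.1.jar instead of the local path.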
>>
>> On Apr 16, 2014, at 8:39 AM, Rahul Singh <smart.rahul.iiit@gmail.com>
>> wrote:
>>
>> any help...all are welcome?
>>
>> On Wed, Apr 16, 2014 at 1:13 PM, Rahul Singh wrote:
>>
>>> Hi,
>>> I am running with the following command, but the jar is still not
>>> available to the mappers and reducers.
>>>
>>> hadoop jar /home/hduser/workspace/Minerva.jar my.search.Minerva
>>> /user/hduser/input_minerva_actual /user/hduser/output_merva_actual3
>>> -libjars /home/hduser/Documents/Lib/json-simple-1.1.1.jar
>>> -Dmapreduce.user.classpath.first=true
>>>
>>> Error Log
>>>
>>> 14/04/16 13:08:37 INFO client.RMProxy: Connecting to ResourceManager at /0.0.0.0:8032
>>> 14/04/16 13:08:37 INFO client.RMProxy: Connecting to ResourceManager at /0.0.0.0:8032
>>> 14/04/16 13:08:37 WARN mapreduce.JobSubmitter: Hadoop command-line option parsing not performed. Implement the Tool interface and execute your application with ToolRunner to remedy this.
>>> 14/04/16 13:08:37 INFO mapred.FileInputFormat: Total input paths to process : 1
>>> 14/04/16 13:08:37 INFO mapreduce.JobSubmitter: number of splits:10
>>> 14/04/16 13:08:37 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1397534064728_0028
>>> 14/04/16 13:08:38 INFO impl.YarnClientImpl: Submitted application application_1397534064728_0028
>>> 14/04/16 13:08:38 INFO mapreduce.Job: The url to track the job: http://L-Rahul-Tech:8088/proxy/application_1397534064728_0028/
>>> 14/04/16 13:08:38 INFO mapreduce.Job: Running job: job_1397534064728_0028
>>> 14/04/16 13:08:47 INFO mapreduce.Job: Job job_1397534064728_0028 running in uber mode : false
>>> 14/04/16 13:08:47 INFO mapreduce.Job:  map 0% reduce 0%
>>> 14/04/16 13:08:58 INFO mapreduce.Job: Task Id : attempt_1397534064728_0028_m_000005_0, Status : FAILED
>>> Error: java.lang.RuntimeException: Error in configuring object
>>>     at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
>>>     at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
>>>     at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
>>>     at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:426)
>>>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
>>>     at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
>>>     at java.security.AccessController.doPrivileged(Native Method)
>>>     at javax.security.auth.Subject.doAs(Subject.java:416)
>>>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
>>>     at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
>>> Caused by: java.lang.reflect.InvocationTargetException
>>>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>>>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>>>     at java.lang.reflect.Method.invoke(Method.java:622)
>>>     at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
>>>     ... 9 more
>>> Caused by: java.lang.NoClassDefFoundError: org/json/simple/parser/ParseException
>>>     at java.lang.Class.forName0(Native Method)
>>>     at java.lang.Class.forName(Class.java:270)
>>>     at org.apache.hadoop.conf.Configuration.getClassByNameOrNull(Configuration.java:1821)
>>>     at org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:1786)
>>>     at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:1880)
>>>     at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:1906)
>>>     at org.apache.hadoop.mapred.JobConf.getMapperClass(JobConf.java:1107)
>>>     at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38)
>>>     ... 14 more
>>> Caused by: java.lang.ClassNotFoundException: org.json.simple.parser.ParseException
>>>     at java.net.URLClassLoader$1.run(URLClassLoader.java:217)
>>>     at java.security.AccessController.doPrivileged(Native Method)
>>>     at java.net.URLClassLoader.findClass(URLClassLoader.java:205)
>>>     at java.lang.ClassLoader.loadClass(ClassLoader.java:323)
>>>     at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:294)
>>>     at java.lang.ClassLoader.loadClass(ClassLoader.java:268)
>>>     ... 22 more
>>>
>>> When I analyzed the logs, they say:
>>> "14/04/16 13:08:37 WARN mapreduce.JobSubmitter: Hadoop command-line
>>> option parsing not performed. Implement the Tool interface and execute your
>>> application with ToolRunner to remedy this."
>>>
>>> But I have implemented the Tool interface as described below:
>>>
>>> package my.search;
>>>
>>> import org.apache.hadoop.conf.Configured;
>>> import org.apache.hadoop.fs.Path;
>>> import org.apache.hadoop.io.Text;
>>> import org.apache.hadoop.mapred.FileInputFormat;
>>> import org.apache.hadoop.mapred.FileOutputFormat;
>>> import org.apache.hadoop.mapred.JobClient;
>>> import org.apache.hadoop.mapred.JobConf;
>>> import org.apache.hadoop.mapred.TextInputFormat;
>>> import org.apache.hadoop.mapred.TextOutputFormat;
>>> import org.apache.hadoop.util.Tool;
>>> import org.apache.hadoop.util.ToolRunner;
>>>
>>> public class Minerva extends Configured implements Tool
>>> {
>>>     public int run(String[] args) throws Exception {
>>>         JobConf conf = new JobConf(Minerva.class);
>>>         conf.setJobName("minerva sample job");
>>>
>>>         conf.setMapOutputKeyClass(Text.class);
>>>         conf.setMapOutputValueClass(TextArrayWritable.class);
>>>
>>>         conf.setOutputKeyClass(Text.class);
>>>         conf.setOutputValueClass(Text.class);
>>>
>>>         conf.setMapperClass(Map.class);
>>>         // conf.setCombinerClass(Reduce.class);
>>>         conf.setReducerClass(Reduce.class);
>>>
>>>         conf.setInputFormat(TextInputFormat.class);
>>>         conf.setOutputFormat(TextOutputFormat.class);
>>>
>>>         FileInputFormat.setInputPaths(conf, new Path(args[0]));
>>>         FileOutputFormat.setOutputPath(conf, new Path(args[1]));
>>>
>>>         JobClient.runJob(conf);
>>>
>>>         return 0;
>>>     }
>>>
>>>     public static void main(String[] args) throws Exception {
>>>         int res = ToolRunner.run(new Minerva(), args);
>>>         System.exit(res);
>>>     }
>>> }
>>>
>>> Please let me know if you see any issues?
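A side note on the code above, offered as a guess at the root cause: run() builds its JobConf from scratch with "new JobConf(Minerva.class)", which throws away the Configuration that ToolRunner and GenericOptionsParser populated from the -libjars and -D options, so the warning persists and the extra jar never reaches the tasks. A minimal sketch of the fix, leaving everything else as posted:

    JobConf conf = new JobConf(getConf(), Minerva.class);

Since Minerva extends Configured, getConf() returns the Configuration that ToolRunner passed in, and the generic options then flow through to job submission.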
>>>
>>> On Thu, Apr 10, 2014 at 9:29 AM, Shengjun Xin <sxin@gopivotal.com> wrote:
>>>
>>>> add '-Dmapreduce.user.classpath.first=true' to your command and try
>>>> again
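One more thing worth checking, since it is a common trip-up: GenericOptionsParser only picks up -D and -libjars when they appear before the application arguments, and in the command above they come after the input/output paths. A sketch of the reordered command, using the same paths:

    hadoop jar /home/hduser/workspace/Minerva.jar my.search.Minerva \
        -Dmapreduce.user.classpath.first=true \
        -libjars /home/hduser/Documents/Lib/json-simple-1.1.1.jar \
        /user/hduser/input_minerva_actual /user/hduser/output_merva_actual3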
>>>>
>>>> On Wed, Apr 9, 2014 at 6:27 AM, Kim Chew <kchew534@gmail.com> wrote:
>>>>
>>>>> It seems to me that in Hadoop 2.2.1, using the "-libjars" option does
>>>>> not search for the jars in the local file system but in HDFS. For
>>>>> example,
>>>>>
>>>>> hadoop jar target/myJar.jar Foo -libjars /home/kchew/test-libs/testJar.jar
>>>>> /user/kchew/inputs/raw.vector /user/kchew/outputs hdfs://remoteNN:8020 remoteJT:8021
>>>>>
>>>>> 14/04/08 15:11:02 INFO jvm.JvmMetrics: Initializing JVM Metrics with processName=JobTracker, sessionId=
>>>>> 14/04/08 15:11:02 INFO mapreduce.JobSubmitter: Cleaning up the staging area file:/tmp/hadoop-kchew/mapred/staging/kchew202924688/.staging/job_local202924688_0001
>>>>> 14/04/08 15:11:02 ERROR security.UserGroupInformation: PriviledgedActionException as:kchew (auth:SIMPLE) cause:java.io.FileNotFoundException: File does not exist: hdfs://remoteNN:8020/home/kchew/test-libs/testJar.jar
>>>>> java.io.FileNotFoundException: File does not exist: hdfs:/remoteNN:8020/home/kchew/test-libs/testJar.jar
>>>>>     at org.apache.hadoop.hdfs.DistributedFileSystem$17.doCall(DistributedFileSystem.java:1110)
>>>>>     at org.apache.hadoop.hdfs.DistributedFileSystem$17.doCall(DistributedFileSystem.java:1102)
>>>>>     at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
>>>>>     at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1102)
>>>>>     at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.getFileStatus(ClientDistributedCacheManager.java:288)
>>>>>     at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.getFileStatus(ClientDistributedCacheManager.java:224)
>>>>>     at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.determineTimestamps(ClientDistributedCacheManager.java:93)
>>>>>     at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.determineTimestampsAndCacheVisibilities(ClientDistributedCacheManager.java:57)
>>>>>     at org.apache.hadoop.mapreduce.JobSubmitter.copyAndConfigureFiles(JobSubmitter.java:264)
>>>>>
>>>>> So under Hadoop 2.2.1, do I have to explicitly set some configuration
>>>>> so that, when using the "-libjars" option, it will copy the file to HDFS
>>>>> from the local fs?
>>>>>
>>>>> TIA
>>>>>
>>>>> Kim
>>>>
>>>> --
>>>> Regards
>>>> Shengjun
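For the original question above, one workaround worth trying, offered as a guess rather than a confirmed fix: give -libjars an explicit file:// URI so the client resolves the jar against the local file system instead of the default hdfs:// file system, e.g.

    hadoop jar target/myJar.jar Foo \
        -libjars file:///home/kchew/test-libs/testJar.jar \
        /user/kchew/inputs/raw.vector /user/kchew/outputs hdfs://remoteNN:8020 remoteJT:8021

The staging path in the log (file:/tmp/hadoop-kchew/...) also shows the submission ran with the local job runner, which matches Kim's later note that a mapred-site.xml mixup made the job run locally.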