From: "Vjeran Marcinko" <vjeran.marcinko@email.t-com.hr>
To: user@hadoop.apache.org
Subject: Best Hadoop dev environment [WAS: RE: Few noob MR questions]
Date: Sun, 14 Apr 2013 07:18:35 +0200
Hi again,

You actually touched on what I'm trying to do here - setting up the best Hadoop development environment.

Moreover, don't ask me why, but my development machine runs Windows, so I don't have Hadoop on it; instead I run Hadoop inside a Linux virtual machine. I would like to develop my job code in my favourite IDE, deploy my jobs from there, and watch them run on this "remote" virtual Hadoop platform. Build scripts can help a lot: each time I change some job code, the scripts could package it and transfer it to the Hadoop machine, where I can deploy it via the "hadoop jar ..." command. I will certainly do that *in production*, but *in development* I would like to avoid it. Besides, when I say "Run" in the IDE, it uses "java -classpath ...", not even "java -jar ...", so the job class is not available in any packaged form (at least by default - any proper IDE can add extra build steps).

So, are there any more hints for setting up this environment?

Hadoop can really be intimidating for a newbie - there are so many versions out there, so many examples using different APIs, and so many ways to deploy a job, that I don't know where to start. And my Windows OS brings even more problems in the beginning, when I don't know much.
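[One common way to run jobs from an IDE against a remote VM is to put client-side configuration on the IDE's classpath, so the Hadoop client in your project talks to the VM instead of starting a local runner. A minimal sketch for a 2013-era (Hadoop 1.x) setup follows; the hostname "hadoop-vm" and both ports are placeholders for whatever your VM actually exposes:]

```
<!-- core-site.xml, placed on the IDE's classpath.
     "hadoop-vm" and the ports are placeholders for your VM. -->
<configuration>
  <property>
    <name>fs.default.name</name>
    <value>hdfs://hadoop-vm:9000</value>
  </property>
</configuration>

<!-- mapred-site.xml, also on the IDE's classpath. -->
<configuration>
  <property>
    <name>mapred.job.tracker</name>
    <value>hadoop-vm:9001</value>
  </property>
</configuration>
```

[With these in place, a plain "Run" from the IDE submits to the VM's JobTracker rather than running locally; note the job classes still need to reach the cluster, e.g. via job.setJarByClass() on a built jar.]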
Regards,
Vjeran

From: Bjorn Jonsson [mailto:bjornjon@gmail.com]
Sent: Sunday, April 14, 2013 5:27 AM
To: user@hadoop.apache.org
Subject: Re: Few noob MR questions

Correct, you can use java -jar to submit a job, with the "driver" code in a plain static main method. I do it all the time. You can of course also run a Job straight from your IDE Java code. You can check out the RunJar class in the Hadoop API Javadoc to see, essentially, what the hadoop command does, I think.

Cheers,
Bj

On Sat, Apr 13, 2013 at 3:59 PM, Jens Scheidtmann <jens.scheidtmann@gmail.com> wrote:

Dear Vjeran,

your own jobs should implement the Tool interface and use ToolRunner. This gives you additional standard options on the command line.

Also have a look at the class ProgramDriver as used here: https://svn.apache.org/repos/asf/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-examples/src/main/java/org/apache/hadoop/examples/ExampleDriver.java which further simplifies executing your MR jobs.

Best regards,
Jens
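[The Tool/ToolRunner pattern Jens mentions can be sketched roughly as below. The class and job names are made up for illustration, and the mapper/reducer setters are left out so the sketch stands alone (the new-API defaults are identity classes); in a real job you would call job.setMapperClass()/setReducerClass() with your own classes. The API shown is the org.apache.hadoop.mapreduce one common in Hadoop 1.x:]

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.conf.Configured;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.util.Tool;
import org.apache.hadoop.util.ToolRunner;

// Hypothetical driver class, for illustration only.
public class MyJobDriver extends Configured implements Tool {

    @Override
    public int run(String[] args) throws Exception {
        // getConf() already contains any generic options that
        // ToolRunner parsed, e.g. -D key=value, -fs, or -jt.
        Job job = new Job(getConf(), "my-job");
        job.setJarByClass(MyJobDriver.class);
        // setMapperClass()/setReducerClass() would go here.
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        return job.waitForCompletion(true) ? 0 : 1;
    }

    public static void main(String[] args) throws Exception {
        // ToolRunner strips the generic options before run() sees args.
        System.exit(ToolRunner.run(new Configuration(), new MyJobDriver(), args));
    }
}
```

[Because ToolRunner uses GenericOptionsParser, the same driver accepts overrides such as -fs and -jt on the command line, which is handy for pointing a run at the VM without touching code.]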