From: "Agarwal, Nikhil" <Nikhil.Agarwal@netapp.com>
To: user@hadoop.apache.org
Subject: RE: How to add another file system in Hadoop
Date: Fri, 22 Feb 2013 05:05:17 +0000

Hi All,

Thanks a lot for taking the time to answer my question.

Ling, thank you for directing me to GlusterFS. I can surely take a lot of help from that, but what I wanted to know is this: the README.txt mentions:

>> # ./bin/start-mapred.sh
   If the map/reduce job/task trackers are up, all I/O will be done to GlusterFS.

So, suppose my input files are scattered across different nodes (GlusterFS servers). How do I (a Hadoop client with GlusterFS plugged in) issue a MapReduce job?

Moreover, after issuing a MapReduce job, would my Hadoop client fetch all the data from the different servers to my local machine and then do the MapReduce, or would it start the TaskTracker daemons on the machine(s) where the input file(s) are located and perform the MapReduce there?

Please correct me if I am wrong, but I believe the locations of the input files for MapReduce are returned by the function getFileBlockLocations(FileStatus file, long start, long len).

Thank you very much for your time and for helping me out.

Regards,
Nikhil

From: Agarwal, Nikhil
Sent: Thursday, February 21, 2013 4:19 PM
To: 'user@hadoop.apache.org'
Subject: How to add another file system in Hadoop

Hi,

I am planning to add a file system called CDMI under org.apache.hadoop.fs in Hadoop, something similar to KFS or S3, which are already there under org.apache.hadoop.fs. What I wanted to ask is: say I write my file system for CDMI and add the package under fs, how do I then tell core-site.xml or the other configuration files to use the CDMI file system? Where all do I need to make changes for the CDMI file system to become a part of Hadoop?

Thanks a lot in advance.

Regards,
Nikhil
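The assumption about getFileBlockLocations is essentially right: during split computation the framework (e.g. FileInputFormat) calls FileSystem.getFileBlockLocations, and the scheduler then tries to place each map task on a host holding that block, so the computation moves to the data rather than the data being pulled to the client. A minimal sketch of querying those locations follows; it assumes the Hadoop client libraries are on the classpath, and `LocalityProbe` and the path argument are hypothetical names, not anything from the Hadoop codebase:

```java
import java.util.Arrays;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.BlockLocation;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class LocalityProbe {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Resolves the FileSystem implementation from the default URI scheme
        // configured in core-site.xml (fs.default.name in Hadoop 1.x).
        FileSystem fs = FileSystem.get(conf);
        FileStatus status = fs.getFileStatus(new Path(args[0]));

        // This is the same call the framework makes while computing input
        // splits: each BlockLocation carries the hostnames that store that
        // byte range of the file, which the scheduler uses as locality hints.
        BlockLocation[] locations =
                fs.getFileBlockLocations(status, 0, status.getLen());
        for (BlockLocation loc : locations) {
            System.out.println(loc.getOffset() + "+" + loc.getLength()
                    + " -> " + Arrays.toString(loc.getHosts()));
        }
    }
}
```

A custom FileSystem that returns meaningful host lists from getFileBlockLocations is what lets the job tracker schedule tasks next to the data; if it returns a single dummy location, every split looks equally remote and locality is lost.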
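For the registration question: in Hadoop of that era (1.x), a FileSystem implementation is wired in through an fs.<scheme>.impl property in core-site.xml, the same way KFS (fs.kfs.impl) and S3 (fs.s3.impl) are registered, and the file system is then selected by URI scheme (e.g. cdmi://host/path). A hedged sketch, assuming a hypothetical implementation class name org.apache.hadoop.fs.cdmi.CDMIFileSystem:

```xml
<!-- core-site.xml: map the "cdmi" URI scheme to the implementation class.
     The class name here is an assumption for illustration. -->
<property>
  <name>fs.cdmi.impl</name>
  <value>org.apache.hadoop.fs.cdmi.CDMIFileSystem</value>
</property>

<!-- Optionally make it the default file system, so plain paths resolve
     against CDMI instead of HDFS. -->
<property>
  <name>fs.default.name</name>
  <value>cdmi://namenode-host:port/</value>
</property>
```

With the implementation class on the classpath and this mapping in place, FileSystem.get() resolves cdmi:// URIs to the custom class; no changes to Hadoop core source should be needed beyond shipping the jar.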