From: Daniel Savard <daniel.savard@gmail.com>
Date: Tue, 3 Dec 2013 21:10:08 -0500
Subject: Re: Hadoop 2.2.0 from source configuration
To: user@hadoop.apache.org

Adam and others,

I solved my problem by increasing the filesystem holding the data by 3 GB. I didn't try to increase it in smaller steps, so I don't know exactly at which point I had enough space for HDFS to work properly. Is there anywhere in the documentation a list of guidelines and requirements for the filesystem(s)? And I suppose it is possible to use much less space provided some parameter(s) is/are properly configured to use less space (namenode?). Are there any worksheets to plan the disk space capacity for a given configuration (standalone single node or complete cluster)?
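For example, I am guessing the relevant knobs are along these lines; I have not verified this, so take it as a sketch of what I am asking about rather than as advice:

    # guessing these are the properties that matter for space planning
    hdfs getconf -confKey dfs.datanode.du.reserved   # space each datanode keeps free for non-DFS use, in bytes
    hdfs getconf -confKey dfs.blocksize              # block size also drives how much raw space a file consumes
    hdfs getconf -confKey dfs.replication            # replication factor multiplies the raw space needed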
-----------------
Daniel Savard


2013/12/3 Daniel Savard

Adam,

here is the link:
http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/SingleCluster.html

Then, since it didn't work, I tried a number of things, but my configuration files are really skinny and there isn't much stuff in them.

-----------------
Daniel Savard


2013/12/3 Adam Kawa

Could you please send me a link to the documentation that you followed to set up your single-node cluster? I will go through it and do it step by step, so hopefully at the end your issue will be solved and the documentation will be improved.

If you have any non-standard settings in core-site.xml, hdfs-site.xml and hadoop-env.sh (that were not suggested by the documentation that you followed), then please share them.


2013/12/3 Daniel Savard

Adam,

that's not the issue, I did substitute the name in the first report. The actual hostname is feynman.cids.ca.

-----------------
Daniel Savard


2013/12/3 Adam Kawa

Daniel,

I see that in the previous hdfs report you had hosta.subdom1.tld1, but now you have feynman.cids.ca. What is the content of your /etc/hosts file, and the output of the $hostname command?


2013/12/3 Daniel Savard

I did that more than once, I just retried it from the beginning. I zapped the directories and recreated them with hdfs namenode -format and restarted HDFS, and I am still getting the very same error.

I have posted the report previously. Is there anything in this report that indicates I do not have enough free space somewhere? That's the only thing I can see that may cause this problem after everything I read on the subject. I am new to Hadoop and I just want to set up a standalone node to experiment with for a while before going ahead with a complete cluster.

I repost the report for convenience:

Configured Capacity: 2939899904 (2.74 GB)
Present Capacity: 534421504 (509.66 MB)
DFS Remaining: 534417408 (509.66 MB)
DFS Used: 4096 (4 KB)
DFS Used%: 0.00%
Under replicated blocks: 0
Blocks with corrupt replicas: 0
Missing blocks: 0

-------------------------------------------------
Datanodes available: 1 (1 total, 0 dead)

Live datanodes:
Name: 127.0.0.1:50010 (feynman.cids.ca)
Hostname: feynman.cids.ca
Decommission Status : Normal
Configured Capacity: 2939899904 (2.74 GB)
DFS Used: 4096 (4 KB)
Non DFS Used: 2405478400 (2.24 GB)
DFS Remaining: 534417408 (509.66 MB)
DFS Used%: 0.00%
DFS Remaining%: 18.18%
Last contact: Tue Dec 03 13:37:02 EST 2013
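For reference, the report above is just the output of hdfs dfsadmin -report, and the reset sequence I ran before it was roughly the following (the rm paths below only stand in for my own dfs.namenode.name.dir and dfs.datanode.data.dir directories):

    stop-dfs.sh
    rm -rf /path/to/dfs/name/* /path/to/dfs/data/*   # placeholder paths, not my real ones
    hdfs namenode -format
    start-dfs.sh
    hdfs dfsadmin -report                            # produces the report quoted above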
-----------------
Daniel Savard


2013/12/3 Adam Kawa

Daniel,

It looks like you can only communicate with the NameNode to do "metadata-only" operations (e.g. listing, creating a dir, or an empty file)...

Did you format the NameNode correctly?
A quite similar issue is described here:
http://www.manning-sandbox.com/thread.jspa?messageID=126741. The last reply says: "The most common is that you have reformatted the namenode leaving it in an inconsistent state. The most common solution is to stop dfs, remove the contents of the dfs directories on all the machines, run “hadoop namenode -format” on the controller, then restart dfs. That consistently fixes the problem for me. This may be serious overkill but it works."
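In other words, the pattern to check for is roughly this (the file and directory names below are only examples): metadata-only operations go through the NameNode alone, while writing actual bytes needs a DataNode willing to accept a block:

    hdfs dfs -mkdir -p /tmp/probe         # metadata only: should work (example path)
    hdfs dfs -touchz /tmp/probe/empty     # zero-length file, still metadata only: should work
    echo hello > probe.txt
    hdfs dfs -put probe.txt /tmp/probe/   # needs a datanode to accept a block: the step that fails here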
2013/12/3 Daniel Savard

Thanks Arun,

I already read and did everything recommended at the referred URL. There isn't any error message in the logfiles. The only error message appears when I try to put a non-zero file on the HDFS, as posted above. Beside that, absolutely nothing in the logs is telling me something is wrong with the configuration so far.

Is there some sort of diagnostic tool that can query/ping each server to make sure it responds properly to requests? When trying to put my file, I see nothing in the datanode log; the message appears in the namenode log. Is this the expected behavior, or should I see at least some kind of request message in the datanode logfile?

-----------------
Daniel Savard


2013/12/2 Arun C Murthy

Daniel,

Apologies if you had a bad experience. If you can point the problems out to us, we'd be more than happy to fix them - alternately, we'd *love* it if you could help us improve the docs too.

Now, for the problem at hand: http://wiki.apache.org/hadoop/CouldOnlyBeReplicatedTo is one place to look. Basically the NN cannot find any datanodes. Anything in your NN logs to indicate trouble?

Also, please feel free to open JIRAs with issues you find and we'll help.

thanks,
Arun
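A few quick checks along those lines, assuming the default tarball layout where the daemon logs land under $HADOOP_HOME/logs:

    jps                                              # NameNode, DataNode and SecondaryNameNode should all be listed
    tail -n 100 $HADOOP_HOME/logs/hadoop-*-namenode-*.log   # assumes default log dir and naming
    tail -n 100 $HADOOP_HOME/logs/hadoop-*-datanode-*.log
    # the NameNode web UI (http://localhost:50070 by default) also lists the live datanodes and their remaining capacity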
On Dec 2, 2013, at 8:44 AM, Daniel Savard wrote:

André,

good for you that the greedy instructions on the reference page were enough to set up your cluster. However, read them again and see how many assumptions they make about what you are supposed to already know, without anything more being said about it.

I did try the single node setup; it is worse than the cluster setup as far as the instructions go. You are supposed to already have a near-working system, as far as I understand the instructions. It is assumed that HDFS is already set up and working properly. Try to find the instructions to set up HDFS for version 2.2.0 and you will end up with a lot of inappropriate instructions about previous versions (some properties were renamed).

It may sound harsh to people to say this is toxic, but it is. The first place a newcomer will go is the single node setup. This will be his starting point and he will be left with a bunch of a priori assumptions and no clue.

To go back to my very problem at this point:

13/12/02 11:34:07 WARN hdfs.DFSClient: DataStreamer Exception
org.apache.hadoop.ipc.RemoteException(java.io.IOException): File /test._COPYING_ could only be replicated to 0 nodes instead of minReplication (=1). There are 1 datanode(s) running and no node(s) are excluded in this operation.
    at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.chooseTarget(BlockManager.java:1384)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:2477)
    at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.addBlock(NameNodeRpcServer.java:555)
    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.addBlock(ClientNamenodeProtocolServerSideTranslatorPB.java:387)
    at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java:59582)
    at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:585)
    at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:928)
    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2048)
    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2044)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2042)

    at org.apache.hadoop.ipc.Client.call(Client.java:1347)
    at org.apache.hadoop.ipc.Client.call(Client.java:1300)
    at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:206)
    at com.sun.proxy.$Proxy9.addBlock(Unknown Source)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:186)
    at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:102)
    at com.sun.proxy.$Proxy9.addBlock(Unknown Source)
    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolTranslatorPB.addBlock(ClientNamenodeProtocolTranslatorPB.java:330)
    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.locateFollowingBlock(DFSOutputStream.java:1226)
    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.nextBlockOutputStream(DFSOutputStream.java:1078)
    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.run(DFSOutputStream.java:514)

I can copy an empty file, but as soon as its content is non-zero I am getting this message. Searching on the message has been of no help so far.

And I skimmed through the cluster instructions and found nothing there that could help in any way either.

-----------------
Daniel Savard


2013/12/2 Andre Kelpe

Hi Daniel,

first of all, before posting to a mailing list, take a deep breath and let your frustrations out. Then write the email. Using words like "crappy", "toxicware", "nightmare" is not going to help you get useful responses.

While I agree that the docs can be confusing, we should try to stay constructive. You haven't mentioned which documentation you are using. I found the cluster tutorial sufficient to get me started:

http://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-common/ClusterSetup.html

If you are looking for an easy way to spin up a small cluster with hadoop 2.2, try the hadoop2 branch of this vagrant setup:

https://github.com/fs111/vagrant-hadoop-cluster/tree/hadoop2

- André

On Mon, Dec 2, 2013 at 5:34 AM, Daniel Savard <daniel.savard@gmail.com> wrote:
> I am trying to configure hadoop 2.2.0 from source code and I found the
> instructions really crappy and incomplete. It is as if they were written
> so that no one can do the job himself and must contract someone else to
> do it or buy a packaged version.
>
> I have been struggling with this stuff for about three days, with partial
> success. The documentation is less than clear, and most of the material
> out there applies to earlier versions and hasn't been updated for
> version 2.2.0.
>
> I was able to set up HDFS, however I am still unable to use it. I am
> doing a single-node installation and the instruction page doesn't explain
> anything beside telling you to do this and that, without documenting what
> each thing does, what choices are available and what guidelines you
> should follow. There are even environment variables you are told to set,
> but nothing is said about what they mean or what values they should be
> set to. It seems to assume prior knowledge of everything about hadoop.
>
> Does anyone know a site with proper documentation about hadoop, or is it
> hopeless and this whole thing just a piece of toxicware?
>
> I am already looking for alternative solutions to hadoop, which for sure
> will be a nightmare to manage and install each time a new version or
> release becomes available.
>
> TIA
> -----------------
> Daniel Savard
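For what it's worth, the environment variables the single-node guide has in mind boil down to something like the following; the paths are only examples, so adjust them to your own install:

    # in etc/hadoop/hadoop-env.sh
    export JAVA_HOME=/usr/lib/jvm/java-7-openjdk   # example path to your JDK
    export HADOOP_PREFIX=/opt/hadoop-2.2.0         # example path to the unpacked tarball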
--
André Kelpe
andre@concurrentinc.com
http://concurrentinc.com


--
Arun C. Murthy
Hortonworks Inc.
http://hortonworks.com/


CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.