Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (nike.apache.org: domain of dechouxb@gmail.com designates
 209.85.216.41 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CAE636z8jq5YdPj45jkJXLJcZ0bUL_Yc7sMjrhJQ1zkNCe69ocA@mail.gmail.com>
References: 
 <CAE636z9nOxmjS_7C_LTH5yGBnPJZur0ujYxzXOj_sHZeMbSHUw@mail.gmail.com>
	<CAHJVWbVmK0t=kzFVrrvTy6AEPX1HrG1HEicRx6SmMh=c58QoTw@mail.gmail.com>
	<CA+4kjVvnXJ_-j-=uLVPtOKrUn+AgU+J7=1ER8fu+YqF5_9PkEA@mail.gmail.com>
	<CAE636z8jq5YdPj45jkJXLJcZ0bUL_Yc7sMjrhJQ1zkNCe69ocA@mail.gmail.com>
Date: Mon, 24 Sep 2012 17:54:25 +0200
Message-ID: 
 <CAO6W-2fjLfmOWnVfp2kTdXzrjoPTKP6pbdAsSOA97TB31g0R0Q@mail.gmail.com>
Subject: Re: Not able to place enough replicas in Reduce
From: Bertrand Dechoux <dechouxb@gmail.com>
To: user@hadoop.apache.org
Content-Type: multipart/alternative; boundary=20cf3074b5ce9b894904ca749a84

--20cf3074b5ce9b894904ca749a84
Content-Type: text/plain; charset=ISO-8859-1

Do you have any space left in your HDFS? (Or is a quota set? The error
message would be different in that case, I guess.)
What metrics does the namenode report? Are all datanodes (you have only
one?) live?

http://localhost:50070/dfshealth.jsp

As long as you only consume data (map) you don't need much space in HDFS,
but when you produce data (reduce) you definitely need some.
As Ted pointed out, your error is the standard one when Hadoop is unable to
replicate a block. It should not be related to the reduce itself, and even
less to your specific logic.
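
If you prefer the command line to the web page, "hadoop dfsadmin -report"
prints a similar summary. Below is a rough sketch of how you could check that
output for free space. The function names and the sample text are made up for
illustration, the "Key: value" layout is what I assume the report looks like,
and the 64 MB block size is only the usual default, so adjust it to your
dfs.block.size. It is a sanity check, not what the namenode does internally.

```python
import re


def parse_dfs_report(report_text):
    """Collect 'Key: value' lines of a dfsadmin-style report into a dict.

    The first occurrence of a key wins, so the cluster-wide summary at the
    top takes precedence over any per-datanode section that repeats the key.
    """
    summary = {}
    for line in report_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() and value.strip():
            summary.setdefault(key.strip(), value.strip())
    return summary


def leading_int(value):
    """Return the integer at the start of a value such as '1048576 (1 MB)'."""
    match = re.match(r"\d+", value)
    return int(match.group()) if match else None


def can_hold_block(summary, block_size_bytes=64 * 1024 * 1024):
    """Rough check: is the remaining DFS space at least one block?"""
    remaining = leading_int(summary.get("DFS Remaining", ""))
    return remaining is not None and remaining >= block_size_bytes


# Hypothetical report text, written by hand for this example.
SAMPLE_REPORT = """\
Configured Capacity: 20000000000 (18.63 GB)
DFS Remaining: 1048576 (1 MB)
DFS Used: 15000000000 (13.97 GB)
Datanodes available: 1 (1 total, 0 dead)
"""

if __name__ == "__main__":
    report = parse_dfs_report(SAMPLE_REPORT)
    print("DFS Remaining:", report.get("DFS Remaining"))
    print("Room for one block:", can_hold_block(report))
```

With the sample above, the remaining space is smaller than one block, which is
the kind of situation where a reduce can start fine and then fail once it has
written some output.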

Regards

Bertrand

On Mon, Sep 24, 2012 at 5:41 PM, Jason Yang <lin.yang.jason@gmail.com> wrote:

> Hi, Ted
>
> here is the result of jps:
> yanglin@ubuntu:~$ jps
> 3286 TaskTracker
> 14053 Jps
> 2623 DataNode
> 2996 JobTracker
> 2329 NameNode
> 2925 SecondaryNameNode
> ---
> It seems that the DN is working.
>
> And it does not fail immediately on entering the reduce phase; it actually
> always fails after processing some data.
>
>
> 2012/9/24 Steve Loughran <stevel@hortonworks.com>
>
>>
>>
>> On 24 September 2012 15:47, Ted Reynolds <tedr@hortonworks.com> wrote:
>>
>>> Jason,
>>>
>>> The line in the JobTracker log - "Could only be replicated to 0 nodes,
>>> instead of 1" points to a problem with your data node.  It generally means
>>> that your DataNode is either down or not functioning correctly.  What is
>>> the output of the "jps" command?  ("jps" is found in <JAVA_HOME>/bin).
>>>
>>>
>>
>> see also: http://wiki.apache.org/hadoop/CouldOnlyBeReplicatedTo
>>
>> -steve
>>
>
>
>
> --
> YANG, Lin
>
>


-- 
Bertrand Dechoux

--20cf3074b5ce9b894904ca749a84--