hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From ChaoChun Liang <cclian...@gmail.com>
Subject RE: secondary namenode errors
Date Wed, 05 Sep 2007 05:47:00 GMT

This patch solved the problems from the secondary name node, 
which also provided more clear information to debug the reduce hanged
problem
in my environment. The hanged problem comes from the firewall, which blocks
the connection, and all tasks back to normal now. 

Thanks.
ChaoChun


Dhruba Borthakur wrote:
> 
> I have already uploaded a patch for HADOOP-1076. If somebody can review
> this
> patch, it can be committed.
> 
> Thanks,
> dhruba
> 
> -----Original Message-----
> From: ChaoChun Liang [mailto:ccliangnn@gmail.com] 
> Sent: Tuesday, September 04, 2007 1:12 AM
> To: hadoop-user@lucene.apache.org
> Subject: Re: secondary namenode errors
> 
> 
> 
> Koji Noguchi wrote:
>> 
>> Joydeep,
>> 
>> I think you're hitting this bug.
>> http://issues.apache.org/jira/browse/HADOOP-1076
>> 
>> In any case, as Raghu suggested, please use 0.13.1 and not 0.13.
>> 
>> Koji
>> 
> 
> Hi,
>    I met the similar problem under 0.14.0 when I increase the slave nodes
> from 2 to 3.
>    All tasks work fine in 2-nodes cluster, but will hang in reduce process
> in the 3-nodes cluster.
>    My logs show as following for each *.secondarynamenode.* logs. 
> 
> 2007-09-04 16:02:33,390 ERROR org.apache.hadoop.dfs.NameNode.Secondary:
> Exception in doCheckpoint:
> 2007-09-04 16:02:33,390 ERROR org.apache.hadoop.dfs.NameNode.Secondary:
> org.apache.hadoop.ipc.RemoteException: java.io.IOException: Attempt to
> roll
> edit log but edits.new exists
> 	at org.apache.hadoop.dfs.FSEditLog.rollEditLog(FSEditLog.java:582)
> 	at
> org.apache.hadoop.dfs.FSNamesystem.rollEditLog(FSNamesystem.java:3510)
> 	at org.apache.hadoop.dfs.NameNode.rollEditLog(NameNode.java:553)
> 	at sun.reflect.GeneratedMethodAccessor14.invoke(Unknown Source)
> 	at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl
> .java:25)
> 	at java.lang.reflect.Method.invoke(Method.java:597)
> 	at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:340)
> 	at org.apache.hadoop.ipc.Server$Handler.run(Server.java:566)
> 
> 	at org.apache.hadoop.ipc.Client.call(Client.java:470)
> 	at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:165)
> 	at org.apache.hadoop.dfs.$Proxy0.rollEditLog(Unknown Source)
> 	at
> org.apache.hadoop.dfs.SecondaryNameNode.doCheckpoint(SecondaryNameNode.java:
> 261)
> 	at
> org.apache.hadoop.dfs.SecondaryNameNode.run(SecondaryNameNode.java:190)
> 	at java.lang.Thread.run(Thread.java:619)
> 
>   Any suggestion to solve?
> 
> ChaoChun
> -- 
> View this message in context:
> http://www.nabble.com/secondary-namenode-errors-tf4321013.html#a12473770
> Sent from the Hadoop Users mailing list archive at Nabble.com.
> 
> 
> 
> 

-- 
View this message in context: http://www.nabble.com/secondary-namenode-errors-tf4321013.html#a12492019
Sent from the Hadoop Users mailing list archive at Nabble.com.


Mime
View raw message