hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Oscar Gothberg <oscar.gothb...@gmail.com>
Subject job executions fail with NotReplicatedYetException
Date Mon, 10 May 2010 18:23:32 GMT
Hi,

I keep having jobs fail at the very end, with 100% complete "map",
100% complete "reduce",
due to NotReplicatedYetException w.r.t the _temporary subdirectory of
the job output directory.

It doesn't happen 100% of the time, so it's not trivially
reproducible, but it happens enough
(10-20% of runs) to make it a real pain.

Any ideas, has anyone seen something similar? Part of the stack trace:

NotReplicatedYetException: Not replicated
yet:/test/out/dayperiod=14731/_temporary/_attempt_201005052338_0194_r_000001_0/part-00001
at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:1253)
at org.apache.hadoop.hdfs.server.namenode.NameNode.addBlock(NameNode.java:422)
at sun.reflect.GeneratedMethodAccessor13.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:508)
at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:959)
...

Thanks,
/ Oscar

Mime
View raw message