giraph-user mailing list archives

From Avery Ching <ach...@apache.org>
Subject Re: Running Example Error: Child Error
Date Tue, 06 Aug 2013 04:05:20 GMT
Does the master log show ZooKeeper starting okay?
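
If the log is not conclusive, one way to check from a worker node whether a ZooKeeper server is reachable is to send it the four-letter command `ruok`; a healthy server replies `imok`. The snippet below is only a minimal sketch of that check. The function name `zookeeper_is_ok` is made up for illustration, and the host and port are placeholders that you would replace with whatever your job's ZooKeeper actually uses. On ZooKeeper 3.5 and later the command also has to be allowed through `4lw.commands.whitelist`.

```python
import socket


def zookeeper_is_ok(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if the server at host:port answers 'ruok' with 'imok'.

    Hypothetical helper for illustration; host and port are assumptions
    and must match the ZooKeeper instance the Giraph job talks to.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            # 'ruok' is one of ZooKeeper's four-letter admin commands.
            sock.sendall(b"ruok")
            reply = b""
            # Read until we have the 4-byte answer or the peer closes.
            while len(reply) < 4:
                chunk = sock.recv(4 - len(reply))
                if not chunk:
                    break
                reply += chunk
            return reply == b"imok"
    except OSError:
        # Connection refused, unreachable host, or timeout.
        return False
```

For example, `zookeeper_is_ok("master-host", 2181)` (placeholder values) returns `False` when nothing is listening there, which would be consistent with the workers failing their ZooKeeper `exists` checks.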

On 8/5/13 6:06 AM, Zeyuan Xiang wrote:
> I am using Hadoop 1.1 on a real cluster with four slave nodes, and the
> build succeeded.
> When I followed the "Shortest Paths" reference step by step, I got the
> following error:
> java.lang.Throwable: Child Error
>        at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:242)
>        Caused by: java.io.IOException: Task process exit with nonzero status of 1.
>        at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:229)
> What's more, all the examples produce this same error.
> I also found that someone else has encountered this problem, but it does not
> seem to have been solved.
> The command:
> ----------------------------------------------------------------------------------------------------------
> hadoop jar /usr/giraph/giraph-examples/target/giraph-examples-1.1.0-SNAPSHOT-for-hadoop-1.0.2-jar-with-dependencies.jar \
>   org.apache.giraph.GiraphRunner \
>   org.apache.giraph.examples.SimpleShortestPathsComputation \
>   -vif org.apache.giraph.io.formats.JsonLongDoubleFloatDoubleVertexInputFormat \
>   -vip shortest/shortest.json \
>   -of org.apache.giraph.io.formats.IdWithValueTextOutputFormat \
>   -op outShort -w 1
> ----------------------------------------------------------------------------------------------------------
>
> Running info:
> ----------------------------------------------------------------------------------------------------------
> 13/08/02 10:10:27 INFO utils.ConfigurationUtils: No edge input format 
> specified. Ensure your InputFormat does not require one.
> 13/08/02 10:10:27 INFO job.GiraphJob: run: Since checkpointing is 
> disabled (default), do not allow any task retries (setting 
> mapred.map.max.attempts = 0, old value = 4)
> 13/08/02 10:10:33 INFO mapred.JobClient: Running job: 
> job_201308020915_0002
> 13/08/02 10:10:34 INFO mapred.JobClient:  map 0% reduce 0%
> 13/08/02 10:10:51 INFO mapred.JobClient:  map 50% reduce 0%
> 13/08/02 10:10:54 INFO mapred.JobClient:  map 100% reduce 0%
> 13/08/02 10:10:55 INFO mapred.JobClient:  map 50% reduce 0%
> 13/08/02 10:21:42 INFO mapred.JobClient:  map 0% reduce 0%
> 13/08/02 10:21:42 INFO mapred.JobClient: Job complete: 
> job_201308020915_0002
> 13/08/02 10:21:42 INFO mapred.JobClient: Counters: 6
> 13/08/02 10:21:42 INFO mapred.JobClient:   Job Counters
> 13/08/02 10:21:42 INFO mapred.JobClient:     SLOTS_MILLIS_MAPS=658594
> 13/08/02 10:21:42 INFO mapred.JobClient:     Total time spent by all 
> reduces waiting after reserving slots (ms)=0
> 13/08/02 10:21:42 INFO mapred.JobClient:     Total time spent by all 
> maps waiting after reserving slots (ms)=0
> 13/08/02 10:21:42 INFO mapred.JobClient:     Launched map tasks=2
> 13/08/02 10:21:42 INFO mapred.JobClient:     SLOTS_MILLIS_REDUCES=0
> 13/08/02 10:21:42 INFO mapred.JobClient:     Failed map tasks=1
> ----------------------------------------------------------------------------------------------------------
>
> Logs:
> ----------------------------------------------------------------------------------------------------------
> 2013-08-02 10:10:50,721 INFO org.apache.hadoop.mapred.TaskInProgress: 
> Error from attempt_201308020915_0002_m_000000_0: java.lang.Throwable: 
> Child Error
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:271)
> Caused by: java.io.IOException: Task process exit with nonzero status 
> of 1.
> at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:258)
>
> 2013-08-02 10:10:53,765 INFO org.apache.hadoop.mapred.TaskInProgress: 
> TaskInProgress task_201308020915_0002_m_000000 has failed 0 times.
> 2013-08-02 10:10:53,765 INFO org.apache.hadoop.mapred.JobTracker: 
> Removing task 'attempt_201308020915_0002_m_000000_0'
> 2013-08-02 10:10:53,765 INFO org.apache.hadoop.mapred.JobTracker: 
> Adding task (TASK_CLEANUP) 'attempt_201308020915_0002_m_000000_0' to 
> tip task_201308020915_0002_m_000000, for tracker 
> 'tracker_Slave3.Hadoop:Slave3.Hadoop/127.0.0.1:49172'
> 2013-08-02 10:10:54,070 INFO org.apache.hadoop.mapred.TaskInProgress: 
> TaskInProgress task_201308020915_0002_m_000000 has failed 1 times.
> 2013-08-02 10:10:54,071 INFO org.apache.hadoop.mapred.JobTracker: 
> Removing task 'attempt_201308020915_0002_m_000000_0'
> 2013-08-02 10:11:40,700 INFO org.apache.hadoop.mapred.TaskInProgress: 
> Error from attempt_201308020915_0002_m_000001_0: 
> java.lang.IllegalStateException: run: Caught an unrecoverable 
> exception exists: Failed to check 
> /_hadoopBsp/job_201308020915_0002/_applicationAttemptsDir/0/_superstepDir/-1/_addressesAndPartitions
> after 3 tries!
> at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:101)
> at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
> at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
> at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1136)
> at org.apache.hadoop.mapred.Child.main(Child.java:249)
> Caused by: java.lang.IllegalStateException: exists: Failed to check
> /_hadoopBsp/job_201308020915_0002/_applicationAttemptsDir/0/_superstepDir/-1/_addressesAndPartitions
> after 3 tries!
> at org.apache.giraph.zk.ZooKeeperExt.exists(ZooKeeperExt.java:369)
> at org.apache.giraph.worker.BspServiceWorker.startSuperstep(BspServiceWorker.java:686)
> at org.apache.giraph.worker.BspServiceWorker.setup(BspServiceWorker.java:482)
> at org.apache.giraph.graph.GraphTaskManager.execute(GraphTaskManager.java:246)
> at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:91)
> ... 7 more
>
> 2013-08-02 10:21:41,001 INFO org.apache.hadoop.mapred.TaskInProgress: 
> Error from attempt_201308020915_0002_m_000001_0: Task 
> attempt_201308020915_0002_m_000001_0 failed to report status for 600 
> seconds. Killing!
> -----------------------------------------------------------------------------------------------------
>
> I also tried building with "mvn -Phadoop_1.0 clean package" and "mvn
> -pHadoop=non_secure", but neither made a difference. Can anybody give me
> some advice?
> Thank you very much!
>
> Also, is ZooKeeper required by Giraph? Do I need to install it?
>
> Best regards,
> Yours sincerely, Allan Xiang.

