giraph-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Avery Ching <ach...@apache.org>
Subject Re: Benchmark runs but tests fail
Date Sat, 26 Nov 2011 22:41:34 GMT
Hi Oana,

Thanks for your questions.  The fault tolerance should work if there is 
a viable checkpoint and there is a master and ZooKeeper process 
available to coordinate the application.  The only reason I believe that 
the fault tolerance won't work is if the number of task failures is 
exceeded (Hadoop configurable variable - map.max.attempts).  Can you 
show me the log of the master task?  It would be really helpful.

As far as the unittests failing, do you actually have a Hadoop instance 
running at localhost:50030?  The unittests can be run two different ways:

- Against an actual Hadoop instance (i.e. mvn test 
-Dprop.mapred.job.tracker=<jobtracker hostname>:<jobtracker port>)

- Using something called LocalJobRunner that simulates a Hadoop instance 
with a single map task at a time (i.e mvn test).

Hope that helps, let me know if you have other questions.

Avery

On 11/26/11 3:09 PM, Oana Theogarajan wrote:
> Hi,
> I've been testing Giraph on a hadoop custer set up on Amazon EC2 and I 
> encounter some issues. I can successfully run the PageRankBenchmark, 
> however if I am trying to test the fault tolerance by killing a 
> tasktracker the job eventually dies after trying repeatedly. I have 
> checkpoints enabled (the default every 2 supersteps - and I can see 
> them written in the checkpointing directory)
> I then tried to run the unit tests using
> mvn test -Dprop.mapred.job.tracker=localhost:50030
> and a lot of them fail. The output is quoted below. The surefire logs 
> show the following error. I am pretty new to both hadoop and Giraph 
> and I can't tell what could cause this error. I am puzzled since can 
> run Giraph PageRankBenchmark jobs but the tests fail.
>
> Thanks in advance for your help figuring this out.
> Best,
>     Oana
>
> Tests run: 9, Failures: 0, Errors: 7, Skipped: 0, Time elapsed: 0.5 
> sec <<< FAILURE!
> testBspFail(org.apache.giraph.TestBspBasic)  Time elapsed: 0.054 sec 
> <<< ERROR!
> java.io.IOException: Call to localhost/127.0.0.1:50030 failed on local 
> exception: java.io.EOFException
>     at org.apache.hadoop.ipc.Client.wrapException(Client.java:1065)
>     at org.apache.hadoop.ipc.Client.call(Client.java:1033)
>     at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:224)
>     at org.apache.hadoop.mapred.$Proxy2.getProtocolVersion(Unknown 
> Source)
>     at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:364)
>     at 
> org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:460)
>     at org.apache.hadoop.mapred.JobClient.init(JobClient.java:454)
>     at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:437)
>     at org.apache.hadoop.mapreduce.Job$1.run(Job.java:477)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:416)
>     at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
>     at org.apache.hadoop.mapreduce.Job.connect(Job.java:475)
>     at org.apache.hadoop.mapreduce.Job.submit(Job.java:464)
>     at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:494)
>     at org.apache.giraph.graph.GiraphJob.run(GiraphJob.java:524)
>     at org.apache.giraph.TestBspBasic.testBspFail(TestBspBasic.java:180)
> Caused by: java.io.EOFException
>     at java.io.DataInputStream.readInt(DataInputStream.java:392)
>     at 
> org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:774)
>     at org.apache.hadoop.ipc.Client$Connection.run(Client.java:712)
>
>
> -------------------------------------------------------
>  T E S T S
> -------------------------------------------------------
> Running org.apache.giraph.TestManualCheckpoint
> Setting tasks to 3 for testBspCheckpoint since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testBspCheckpoint
> Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.664 
> sec <<< FAILURE!
> Running org.apache.giraph.TestAutoCheckpoint
> Setting tasks to 3 for testSingleFault since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testSingleFault
> Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.069 
> sec <<< FAILURE!
> Running org.apache.giraph.TestBspBasic
> Setting tasks to 3 for testInstantiateVertex since JobTracker exists...
> testInstantiateVertex: 
> java.class.path=/home/ubuntu/giraph/trunk/target/test-classes:/home/ubuntu/giraph/trunk/target/classes:/home/ubuntu/.m2/repository/junit/junit/3.8.1/junit-3.8.1.jar:/home/ubuntu/.m2/repository/org/apache/hadoop/hadoop-core/0.20.203.0/hadoop-core-0.20.203.0.jar:/home/ubuntu/.m2/repository/xmlenc/xmlenc/0.52/xmlenc-0.52.jar:/home/ubuntu/.m2/repository/commons-httpclient/commons-httpclient/3.0.1/commons-httpclient-3.0.1.jar:/home/ubuntu/.m2/repository/commons-logging/commons-logging/1.0.3/commons-logging-1.0.3.jar:/home/ubuntu/.m2/repository/commons-codec/commons-codec/1.4/commons-codec-1.4.jar:/home/ubuntu/.m2/repository/org/apache/commons/commons-math/2.1/commons-math-2.1.jar:/home/ubuntu/.m2/repository/commons-configuration/commons-configuration/1.6/commons-configuration-1.6.jar:/home/ubuntu/.m2/repository/commons-collections/commons-collections/3.2.1/commons-collections-3.2.1.jar:/home/ubuntu/.m2/repository/commons-lang/commons-lang/2.4/commons-lang-2.4.jar:/home/ubuntu/.m2/repository/commons-digester/commons-digester/1.8/commons-digester-1.8.jar:/home/ubuntu/.m2/repository/commons-beanutils/commons-beanutils/1.7.0/commons-beanutils-1.7.0.jar:/home/ubuntu/.m2/repository/commons-beanutils/commons-beanutils-core/1.8.0/commons-beanutils-core-1.8.0.jar:/home/ubuntu/.m2/repository/commons-net/commons-net/1.4.1/commons-net-1.4.1.jar:/home/ubuntu/.m2/repository/org/mortbay/jetty/jetty/6.1.26/jetty-6.1.26.jar:/home/ubuntu/.m2/repository/org/mortbay/jetty/servlet-api/2.5-20081211/servlet-api-2.5-20081211.jar:/home/ubuntu/.m2/repository/org/mortbay/jetty/jetty-util/6.1.26/jetty-util-6.1.26.jar:/home/ubuntu/.m2/repository/tomcat/jasper-runtime/5.5.12/jasper-runtime-5.5.12.jar:/home/ubuntu/.m2/repository/tomcat/jasper-compiler/5.5.12/jasper-compiler-5.5.12.jar:/home/ubuntu/.m2/repository/org/mortbay/jetty/jsp-api-2.1/6.1.14/jsp-api-2.1-6.1.14.jar:/home/ubuntu/.m2/repository/org/mortbay/jetty/servlet-api-2.5/6.1.14/servlet-api-2.5-6.1.14.jar:/home/ubuntu/.m2/repository/org/mortbay/jetty/jsp-2.1/6.1.14/jsp-2.1-6.1.14.jar:/home/ubuntu/.m2/repository/ant/ant/1.6.5/ant-1.6.5.jar:/home/ubuntu/.m2/repository/commons-el/commons-el/1.0/commons-el-1.0.jar:/home/ubuntu/.m2/repository/net/java/dev/jets3t/jets3t/0.7.1/jets3t-0.7.1.jar:/home/ubuntu/.m2/repository/net/sf/kosmosfs/kfs/0.3/kfs-0.3.jar:/home/ubuntu/.m2/repository/hsqldb/hsqldb/1.8.0.10/hsqldb-1.8.0.10.jar:/home/ubuntu/.m2/repository/oro/oro/2.0.8/oro-2.0.8.jar:/home/ubuntu/.m2/repository/org/eclipse/jdt/core/3.1.1/core-3.1.1.jar:/home/ubuntu/.m2/repository/org/codehaus/jackson/jackson-core-asl/1.8.0/jackson-core-asl-1.8.0.jar:/home/ubuntu/.m2/repository/org/apache/mahout/mahout-collections/1.0/mahout-collections-1.0.jar:/home/ubuntu/.m2/repository/com/google/guava/guava/r09/guava-r09.jar:/home/ubuntu/.m2/repository/org/codehaus/jackson/jackson-mapper-asl/1.8.0/jackson-mapper-asl-1.8.0.jar:/home/ubuntu/.m2/repository/org/apache/zookeeper/zookeeper/3.3.3/zookeeper-3.3.3.jar:/home/ubuntu/.m2/repository/log4j/log4j/1.2.15/log4j-1.2.15.jar:/home/ubuntu/.m2/repository/javax/mail/mail/1.4/mail-1.4.jar:/home/ubuntu/.m2/repository/javax/activation/activation/1.1/activation-1.1.jar:/home/ubuntu/.m2/repository/jline/jline/0.9.94/jline-0.9.94.jar:/home/ubuntu/.m2/repository/org/apache/commons/commons-io/1.3.2/commons-io-1.3.2.jar:/home/ubuntu/.m2/repository/commons-cli/commons-cli/1.2/commons-cli-1.2.jar:/home/ubuntu/.m2/repository/net/iharder/base64/2.3.8/base64-2.3.8.jar:/home/ubuntu/.m2/repository/org/json/json/20090211/json-20090211.jar:/home/ubuntu/.m2/repository/org/mockito/mockito-all/1.8.5/mockito-all-1.8.5.jar:
> testInstantiateVertex: Got vertex Vertex(id=null,value=null,#edges=0), 
> graphStateorg.apache.giraph.graph.GraphState@877ef83
> testInstantiateVertex: Example output split =
> Setting tasks to 3 for testLocalJobRunnerConfig since JobTracker 
> exists...
> testLocalJobRunnerConfig: Skipping for non-local
> Setting tasks to 3 for testBspFail since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testBspFail
> Setting tasks to 3 for testBspSuperStep since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testBspSuperStep
> Setting tasks to 3 for testBspMsg since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testBspMsg
> Setting tasks to 3 for testEmptyVertexInputFormat since JobTracker 
> exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for 
> testEmptyVertexInputFormat
> Setting tasks to 3 for testBspCombiner since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testBspCombiner
> Setting tasks to 3 for testBspPageRank since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testBspPageRank
> Setting tasks to 3 for testBspShortestPaths since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testBspShortestPaths
> Tests run: 9, Failures: 0, Errors: 7, Skipped: 0, Time elapsed: 0.501 
> sec <<< FAILURE!
> Running 
> org.apache.giraph.lib.TestTextDoubleDoubleAdjacencyListVertexInputFormat
> Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.321 sec
> Running org.apache.giraph.TestGraphPartitioner
> Setting tasks to 3 for testPartitioners since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testPartitioners
> Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.025 
> sec <<< FAILURE!
> Running org.apache.giraph.TestVertexTypes
> 11/11/26 21:29:55 WARN graph.GraphMapper: Input format vertex index 
> type is not known
> 11/11/26 21:29:55 WARN graph.GraphMapper: Input format vertex value 
> type is not known
> 11/11/26 21:29:55 WARN graph.GraphMapper: Input format edge value type 
> is not known
> 11/11/26 21:29:55 WARN graph.GraphMapper: Output format vertex index 
> type is not known
> 11/11/26 21:29:55 WARN graph.GraphMapper: Output format vertex value 
> type is not known
> 11/11/26 21:29:55 WARN graph.GraphMapper: Output format edge value 
> type is not known
> Tests run: 6, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.062 sec
> Running 
> org.apache.giraph.lib.TestLongDoubleDoubleAdjacencyListVertexInputFormat
> Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.043 sec
> Running org.apache.giraph.TestZooKeeperExt
> testCreateExt: No prop.zookeeper.list set, skipping test
> testDeleteExt: No prop.zookeeper.list set, skipping test
> testGetChildrenExt: No prop.zookeeper.list set, skipping test
> Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.004 sec
> Running org.apache.giraph.lib.TestAdjacencyListTextVertexOutputFormat
> Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.082 sec
> Running org.apache.giraph.TestNotEnoughMapTasks
> Setting tasks to 3 for testNotEnoughMapTasks since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testNotEnoughMapTasks
> Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.028 
> sec <<< FAILURE!
> Running org.apache.giraph.TestMutateGraphVertex
> Setting tasks to 3 for testMutateGraph since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testMutateGraph
> Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.031 
> sec <<< FAILURE!
> Running org.apache.giraph.TestJsonBase64Format
> Setting tasks to 3 for testContinue since JobTracker exists...
> setup: Sending job to job tracker localhost:50030 with jar path 
> target/giraph-0.70-jar-with-dependencies.jar for testContinue
> Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.016 
> sec <<< FAILURE!
> Running org.apache.giraph.TestPredicateLock
> testWaitMsecs:
> Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.576 sec
>
> Results :
>
> Tests in error:
>   testBspCheckpoint(org.apache.giraph.TestManualCheckpoint)
>   testSingleFault(org.apache.giraph.TestAutoCheckpoint)
>   testBspFail(org.apache.giraph.TestBspBasic)
>   testBspSuperStep(org.apache.giraph.TestBspBasic)
>   testBspMsg(org.apache.giraph.TestBspBasic)
>   testEmptyVertexInputFormat(org.apache.giraph.TestBspBasic)
>   testBspCombiner(org.apache.giraph.TestBspBasic)
>   testBspPageRank(org.apache.giraph.TestBspBasic)
>   testBspShortestPaths(org.apache.giraph.TestBspBasic)
>   testPartitioners(org.apache.giraph.TestGraphPartitioner)
>   testNotEnoughMapTasks(org.apache.giraph.TestNotEnoughMapTasks)
>   testMutateGraph(org.apache.giraph.TestMutateGraphVertex)
>   testContinue(org.apache.giraph.TestJsonBase64Format)
>
> Tests run: 39, Failures: 0, Errors: 13, Skipped: 0
>
> [INFO] 
> ------------------------------------------------------------------------
> [INFO] BUILD FAILURE
> [INFO] 
> ------------------------------------------------------------------------
> [INFO] Total time: 11.267s
> [INFO] Finished at: Sat Nov 26 21:29:55 UTC 2011
> [INFO] Final Memory: 11M/324M
> [INFO] 
> ------------------------------------------------------------------------
> [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-surefire-plugin:2.6:test (default-test) 
> on project giraph: There are test failures.
> [ERROR]
>



Mime
View raw message