Return-Path: X-Original-To: apmail-spark-user-archive@minotaur.apache.org Delivered-To: apmail-spark-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1EEA410A63 for ; Sat, 28 Feb 2015 19:12:22 +0000 (UTC) Received: (qmail 42368 invoked by uid 500); 28 Feb 2015 19:12:19 -0000 Delivered-To: apmail-spark-user-archive@spark.apache.org Received: (qmail 42292 invoked by uid 500); 28 Feb 2015 19:12:19 -0000 Mailing-List: contact user-help@spark.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list user@spark.apache.org Received: (qmail 42281 invoked by uid 99); 28 Feb 2015 19:12:19 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 28 Feb 2015 19:12:19 +0000 X-ASF-Spam-Status: No, hits=1.3 required=5.0 tests=URI_HEX X-Spam-Check-By: apache.org Received-SPF: error (nike.apache.org: local policy) Received: from [162.253.133.43] (HELO mwork.nabble.com) (162.253.133.43) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 28 Feb 2015 19:11:54 +0000 Received: from mben.nabble.com (unknown [162.253.133.72]) by mwork.nabble.com (Postfix) with ESMTP id B82CD15609FA for ; Sat, 28 Feb 2015 11:11:34 -0800 (PST) Date: Sat, 28 Feb 2015 12:11:31 -0700 (MST) From: shahid To: user@spark.apache.org Message-ID: <1425150691245-21860.post@n3.nabble.com> Subject: getting this error while runing MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org conf = SparkConf().setAppName("spark_calc3merged").setMaster("spark://ec2-54-145-68-13.compute-1.amazonaws.com:7077") sc = SparkContext(conf=conf,pyFiles=["/root/platinum.py","/root/collections2.py"]) 15/02/28 19:06:38 WARN scheduler.TaskSetManager: Lost task 5.0 in stage 3.0 (TID 38, ip-10-80-15-145.ec2.internal): com.esotericsoftware.kryo.KryoException: Buffer overflow. Available: 0, required: 73065 com.esotericsoftware.kryo.io.Output.require(Output.java:138) com.esotericsoftware.kryo.io.Output.writeBytes(Output.java:220) com.esotericsoftware.kryo.io.Output.writeBytes(Output.java:206) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ByteArraySerializer.write(DefaultArraySerializers.java:29) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ByteArraySerializer.write(DefaultArraySerializers.java:18) com.esotericsoftware.kryo.Kryo.writeObjectOrNull(Kryo.java:549) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ObjectArraySerializer.write(DefaultArraySerializers.java:312) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ObjectArraySerializer.write(DefaultArraySerializers.java:293) com.esotericsoftware.kryo.Kryo.writeClassAndObject(Kryo.java:568) org.apache.spark.serializer.KryoSerializerInstance.serialize(KryoSerializer.scala:156) org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:187) java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) java.lang.Thread.run(Thread.java:745) 15/02/28 19:06:38 INFO scheduler.TaskSetManager: Starting task 5.1 in stage 3.0 (TID 44, ip-10-80-15-145.ec2.internal, NODE_LOCAL, 1502 bytes) 15/02/28 19:06:38 INFO scheduler.TaskSetManager: Finished task 8.0 in stage 3.0 (TID 41) in 7040 ms on ip-10-80-98-118.ec2.internal (9/11) 15/02/28 19:06:38 INFO scheduler.TaskSetManager: Finished task 9.0 in stage 3.0 (TID 42) in 7847 ms on ip-10-80-15-145.ec2.internal (10/11) 15/02/28 19:06:50 WARN scheduler.TaskSetManager: Lost task 5.1 in stage 3.0 (TID 44, ip-10-80-15-145.ec2.internal): com.esotericsoftware.kryo.KryoException: Buffer overflow. Available: 0, required: 73065 com.esotericsoftware.kryo.io.Output.require(Output.java:138) com.esotericsoftware.kryo.io.Output.writeBytes(Output.java:220) com.esotericsoftware.kryo.io.Output.writeBytes(Output.java:206) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ByteArraySerializer.write(DefaultArraySerializers.java:29) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ByteArraySerializer.write(DefaultArraySerializers.java:18) com.esotericsoftware.kryo.Kryo.writeObjectOrNull(Kryo.java:549) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ObjectArraySerializer.write(DefaultArraySerializers.java:312) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ObjectArraySerializer.write(DefaultArraySerializers.java:293) com.esotericsoftware.kryo.Kryo.writeClassAndObject(Kryo.java:568) org.apache.spark.serializer.KryoSerializerInstance.serialize(KryoSerializer.scala:156) org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:187) java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) java.lang.Thread.run(Thread.java:745) 15/02/28 19:06:50 INFO scheduler.TaskSetManager: Starting task 5.2 in stage 3.0 (TID 45, ip-10-80-98-118.ec2.internal, NODE_LOCAL, 1502 bytes) 15/02/28 19:07:01 WARN scheduler.TaskSetManager: Lost task 5.2 in stage 3.0 (TID 45, ip-10-80-98-118.ec2.internal): com.esotericsoftware.kryo.KryoException: Buffer overflow. Available: 0, required: 73065 com.esotericsoftware.kryo.io.Output.require(Output.java:138) com.esotericsoftware.kryo.io.Output.writeBytes(Output.java:220) com.esotericsoftware.kryo.io.Output.writeBytes(Output.java:206) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ByteArraySerializer.write(DefaultArraySerializers.java:29) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ByteArraySerializer.write(DefaultArraySerializers.java:18) com.esotericsoftware.kryo.Kryo.writeObjectOrNull(Kryo.java:549) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ObjectArraySerializer.write(DefaultArraySerializers.java:312) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ObjectArraySerializer.write(DefaultArraySerializers.java:293) com.esotericsoftware.kryo.Kryo.writeClassAndObject(Kryo.java:568) org.apache.spark.serializer.KryoSerializerInstance.serialize(KryoSerializer.scala:156) org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:187) java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) java.lang.Thread.run(Thread.java:745) 15/02/28 19:07:01 INFO scheduler.TaskSetManager: Starting task 5.3 in stage 3.0 (TID 46, ip-10-80-15-145.ec2.internal, NODE_LOCAL, 1502 bytes) 15/02/28 19:07:13 WARN scheduler.TaskSetManager: Lost task 5.3 in stage 3.0 (TID 46, ip-10-80-15-145.ec2.internal): com.esotericsoftware.kryo.KryoException: Buffer overflow. Available: 0, required: 73065 com.esotericsoftware.kryo.io.Output.require(Output.java:138) com.esotericsoftware.kryo.io.Output.writeBytes(Output.java:220) com.esotericsoftware.kryo.io.Output.writeBytes(Output.java:206) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ByteArraySerializer.write(DefaultArraySerializers.java:29) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ByteArraySerializer.write(DefaultArraySerializers.java:18) com.esotericsoftware.kryo.Kryo.writeObjectOrNull(Kryo.java:549) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ObjectArraySerializer.write(DefaultArraySerializers.java:312) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ObjectArraySerializer.write(DefaultArraySerializers.java:293) com.esotericsoftware.kryo.Kryo.writeClassAndObject(Kryo.java:568) org.apache.spark.serializer.KryoSerializerInstance.serialize(KryoSerializer.scala:156) org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:187) java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) java.lang.Thread.run(Thread.java:745) 15/02/28 19:07:13 ERROR scheduler.TaskSetManager: Task 5 in stage 3.0 failed 4 times; aborting job 15/02/28 19:07:13 INFO scheduler.TaskSchedulerImpl: Removed TaskSet 3.0, whose tasks have all completed, from pool 15/02/28 19:07:13 INFO scheduler.TaskSchedulerImpl: Cancelling stage 3 15/02/28 19:07:13 INFO scheduler.DAGScheduler: Failed to run collect at /root/spark_calculations.py:152 Traceback (most recent call last): File "/root/spark_calculations.py", line 348, in CTGov_5year_all = C.get_count_five_year(ctgov_data) File "/root/spark_calculations.py", line 152, in get_count_five_year for year,data in year_data.collect(): File "/root/spark/python/pyspark/rdd.py", line 723, in collect bytesInJava = self._jrdd.collect().iterator() File "/root/spark/python/lib/py4j-0.8.2.1-src.zip/py4j/java_gateway.py", line 538, in __call__ File "/root/spark/python/lib/py4j-0.8.2.1-src.zip/py4j/protocol.py", line 300, in get_return_value py4j.protocol.Py4JJavaError: An error occurred while calling o59.collect. : org.apache.spark.SparkException: Job aborted due to stage failure: Task 5 in stage 3.0 failed 4 times, most recent failure: Lost task 5.3 in stage 3.0 (TID 46, ip-10-80-15-145.ec2.internal): com.esotericsoftware.kryo.KryoException: Buffer overflow. Available: 0, required: 73065 com.esotericsoftware.kryo.io.Output.require(Output.java:138) com.esotericsoftware.kryo.io.Output.writeBytes(Output.java:220) com.esotericsoftware.kryo.io.Output.writeBytes(Output.java:206) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ByteArraySerializer.write(DefaultArraySerializers.java:29) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ByteArraySerializer.write(DefaultArraySerializers.java:18) com.esotericsoftware.kryo.Kryo.writeObjectOrNull(Kryo.java:549) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ObjectArraySerializer.write(DefaultArraySerializers.java:312) com.esotericsoftware.kryo.serializers.DefaultArraySerializers$ObjectArraySerializer.write(DefaultArraySerializers.java:293) com.esotericsoftware.kryo.Kryo.writeClassAndObject(Kryo.java:568) org.apache.spark.serializer.KryoSerializerInstance.serialize(KryoSerializer.scala:156) org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:187) java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) java.lang.Thread.run(Thread.java:745) Driver stacktrace: -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/getting-this-error-while-runing-tp21860.html Sent from the Apache Spark User List mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscribe@spark.apache.org For additional commands, e-mail: user-help@spark.apache.org