spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ram Sriharsha (JIRA)" <>
Subject [jira] [Commented] (SPARK-10251) Some internal spark classes are not registered with kryo
Date Wed, 26 Aug 2015 15:31:45 GMT


Ram Sriharsha commented on SPARK-10251:

as far as I can see, this happens from spark 1.2 onward. haven't gone back yet to see if this
was present before spark 1.2
a temporary workaround is to register the necessary classes manually by setting the following
conf property:
--conf "spark.kryo.classesToRegister = [Lscala.Tuple2;"

I'm looking into a better solution now

> Some internal spark classes are not registered with kryo
> --------------------------------------------------------
>                 Key: SPARK-10251
>                 URL:
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>    Affects Versions: 1.4.1
>            Reporter: Soren Macbeth
>            Assignee: Ram Sriharsha
> When running a job using kryo serialization and setting `spark.kryo.registrationRequired=true`
some internal classes are not registered, causing the job to die. This is still a problem
when this setting is false (which is the default) because it makes the space required to store
serialized objects in memory or disk much much more expensive in terms of runtime and storage
> {code}
> 15/08/25 20:28:21 WARN spark.scheduler.TaskSetManager: Lost task 0.0 in stage 0.0 (TID
0, a.b.c.d): java.lang.IllegalArgumentException: Class is not registered: scala.Tuple2[]
> Note: To register this class use: kryo.register(scala.Tuple2[].class);
>         at com.esotericsoftware.kryo.Kryo.getRegistration(
>         at com.esotericsoftware.kryo.util.DefaultClassResolver.writeClass(
>         at com.esotericsoftware.kryo.Kryo.writeClass(
>         at com.esotericsoftware.kryo.Kryo.writeClassAndObject(
>         at org.apache.spark.serializer.KryoSerializerInstance.serialize(KryoSerializer.scala:250)
>         at org.apache.spark.executor.Executor$
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(
>         at java.util.concurrent.ThreadPoolExecutor$
>         at
> {code}

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message