From: LiJunjie
To: user@predictionio.incubator.apache.org
Subject: Failed to persist models with java.lang.NegativeArraySizeException exception in kryo
Date: Wed, 6 Dec 2017 16:39:59 +0800
Message-Id: <204D5E99-F4D6-4315-96EB-9385F9B0D5D9@gmail.com>
archived-at: Wed, 06 Dec 2017 08:40:08 -0000

Hi all,

Training completed successfully, but persisting the models failed. I notice that PredictionIO serializes models with Kryo into an Array[Byte]; what happens if the trained models are very big? I also noticed that the driver JVM heap grew rapidly at that point.

I use the official e-commerce recommender template, and the model to be serialized looks like this:

  class ECommModel(
    val rank: Int,
    val userFeatures: Map[Int, Array[Double]],
    val productModels: Map[Int, ProductModel],
    val userStringIntMap: BiMap[String, Int],
    val itemStringIntMap: BiMap[String, Int]
  ) extends Serializable

But there are several million records in userFeatures and productModels!
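
To put "very big" in numbers, here is a rough back-of-the-envelope sketch of how many bytes the raw contents of userFeatures would take and whether that still fits in a single Array[Byte]. The entry count (5 million) and rank (10) are hypothetical placeholders, not figures from my data, and the estimate ignores whatever overhead the serializer adds.

```scala
object ModelSizeEstimate {
  // JVM arrays are indexed by Int, so an Array[Byte] holds at most
  // Int.MaxValue elements (some VMs allow slightly fewer).
  val MaxArrayLength: Long = Int.MaxValue.toLong

  // Rough raw payload of a Map[Int, Array[Double]]: a 4-byte Int key
  // plus `rank` 8-byte Doubles per entry. Serializer overhead
  // (class ids, lengths, reference ids) is deliberately ignored.
  def rawFeatureBytes(entries: Long, rank: Int): Long =
    entries * (4L + rank.toLong * 8L)

  def fitsInByteArray(bytes: Long): Boolean = bytes <= MaxArrayLength

  def main(args: Array[String]): Unit = {
    val entries = 5000000L // hypothetical number of users
    val rank = 10          // hypothetical ALS rank
    val bytes = rawFeatureBytes(entries, rank)
    println(s"raw payload: $bytes bytes, fits in one Array[Byte]: ${fitsInByteArray(bytes)}")
  }
}
```

With these placeholder values the raw payload stays well under the array limit, so the byte array size alone would not necessarily explain a failure at this scale.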

Here is the backtrace:

[INFO] [Engine$] EngineWorkflow.train completed
[INFO] [Engine] engineInstanceId=9f31aafc-2d66-4b76-9995-ad3e9fb84b6b
[INFO] [CoreWorkflow$] Inserting persistent model
[INFO] [AbstractConnector] Stopped Spark@5d1f640c{HTTP/1.1,[http/1.1]}{0.0.0.0:4040}
Exception in thread "main" com.esotericsoftware.kryo.KryoException: java.lang.NegativeArraySizeException
Serialization trace:
userFeatures (org.example.ecommercerecommendation.ECommModel)
    at com.esotericsoftware.kryo.serializers.ObjectField.write(ObjectField.java:101)
    at com.esotericsoftware.kryo.serializers.FieldSerializer.write(FieldSerializer.java:518)
    at com.esotericsoftware.kryo.Kryo.writeClassAndObject(Kryo.java:628)
    at com.twitter.chill.TraversableSerializer$$anonfun$write$1.apply(Traversable.scala:29)
    at com.twitter.chill.TraversableSerializer$$anonfun$write$1.apply(Traversable.scala:27)
    at scala.collection.Iterator$class.foreach(Iterator.scala:893)
    at scala.collection.AbstractIterator.foreach(Iterator.scala:1336)
    at scala.collection.IterableLike$class.foreach(IterableLike.scala:72)
    at scala.collection.AbstractIterable.foreach(Iterable.scala:54)
    at com.twitter.chill.TraversableSerializer.write(Traversable.scala:27)
    at com.twitter.chill.TraversableSerializer.write(Traversable.scala:21)
    at com.esotericsoftware.kryo.Kryo.writeClassAndObject(Kryo.java:628)
    at com.twitter.chill.SerDeState.writeClassAndObject(SerDeState.java:64)
    at com.twitter.chill.KryoPool.toBytesWithClass(KryoPool.java:116)
    at com.twitter.chill.KryoInjectionInstance.apply(KryoInjection.scala:64)
    at com.twitter.chill.KryoInjectionInstance.apply(KryoInjection.scala:55)
    at org.apache.predictionio.workflow.CoreWorkflow$.runTrain(CoreWorkflow.scala:81)
    at org.apache.predictionio.workflow.CreateWorkflow$.main(CreateWorkflow.scala:251)
    at org.apache.predictionio.workflow.CreateWorkflow.main(CreateWorkflow.scala)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:498)
    at org.apache.spark.deploy.SparkSubmit$.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:755)
    at org.apache.spark.deploy.SparkSubmit$.doRunMain$1(SparkSubmit.scala:180)
    at org.apache.spark.deploy.SparkSubmit$.submit(SparkSubmit.scala:205)
    at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:119)
    at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
Caused by: java.lang.NegativeArraySizeException
    at com.esotericsoftware.kryo.util.IdentityObjectIntMap.resize(IdentityObjectIntMap.java:447)
    at com.esotericsoftware.kryo.util.IdentityObjectIntMap.putStash(IdentityObjectIntMap.java:245)
    at com.esotericsoftware.kryo.util.IdentityObjectIntMap.push(IdentityObjectIntMap.java:239)
    at com.esotericsoftware.kryo.util.IdentityObjectIntMap.put(IdentityObjectIntMap.java:135)
    at com.esotericsoftware.kryo.util.IdentityObjectIntMap.putStash(IdentityObjectIntMap.java:246)
    at com.esotericsoftware.kryo.util.IdentityObjectIntMap.push(IdentityObjectIntMap.java:239)
    at com.esotericsoftware.kryo.util.IdentityObjectIntMap.put(IdentityObjectIntMap.java:135)
    at com.esotericsoftware.kryo.util.MapReferenceResolver.addWrittenObject(MapReferenceResolver.java:41)
    at com.esotericsoftware.kryo.Kryo.writeReferenceOrNull(Kryo.java:658)
    at com.esotericsoftware.kryo.Kryo.writeClassAndObject(Kryo.java:623)
    at com.twitter.chill.Tuple2Serializer.write(TupleSerializers.scala:37)
    at com.twitter.chill.Tuple2Serializer.write(TupleSerializers.scala:33)
    at com.esotericsoftware.kryo.Kryo.writeClassAndObject(Kryo.java:628)
    at com.twitter.chill.TraversableSerializer$$anonfun$write$1.apply(Traversable.scala:29)
    at com.twitter.chill.TraversableSerializer$$anonfun$write$1.apply(Traversable.scala:27)
    at scala.collection.immutable.HashMap$HashMap1.foreach(HashMap.scala:221)
    at scala.collection.immutable.HashMap$HashTrieMap.foreach(HashMap.scala:428)
    at scala.collection.immutable.HashMap$HashTrieMap.foreach(HashMap.scala:428)
    at scala.collection.immutable.HashMap$HashTrieMap.foreach(HashMap.scala:428)
    at scala.collection.immutable.HashMap$HashTrieMap.foreach(HashMap.scala:428)
    at scala.collection.immutable.HashMap$HashTrieMap.foreach(HashMap.scala:428)
    at com.twitter.chill.TraversableSerializer.write(Traversable.scala:27)
    at com.twitter.chill.TraversableSerializer.write(Traversable.scala:21)
    at com.esotericsoftware.kryo.Kryo.writeObject(Kryo.java:552)
    at com.esotericsoftware.kryo.serializers.ObjectField.write(ObjectField.java:80)
    ... 27 more
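
The innermost frames of the trace are in IdentityObjectIntMap.resize, reached from MapReferenceResolver.addWrittenObject. One possible reading (an assumption on my part, not checked against the Kryo source) is that the table capacity is kept as an Int and doubled on each resize until it overflows. The sketch below is not Kryo code; `CapacityOverflow` and its methods are hypothetical names that only illustrate how Int doubling turns negative and how a negative length produces this exception on the JVM.

```scala
object CapacityOverflow {
  // Double a table capacity with Int arithmetic, as a growing
  // hash table might (assumption: this mirrors the failing resize).
  def doubled(capacity: Int): Int = capacity << 1

  // Count the doublings from `initial` until the Int result
  // is no longer positive.
  def doublingsUntilOverflow(initial: Int): Int = {
    require(initial > 0, "initial capacity must be positive")
    var capacity = initial
    var steps = 0
    while (capacity > 0) {
      capacity = doubled(capacity)
      steps += 1
    }
    steps
  }

  // Allocating an array with a negative length raises
  // java.lang.NegativeArraySizeException.
  def allocationFails(capacity: Int): Boolean =
    try { new Array[Int](capacity); false }
    catch { case _: NegativeArraySizeException => true }

  def main(args: Array[String]): Unit = {
    val overflowed = doubled(1 << 30)
    println(s"doubling 2^30 gives $overflowed")
    println(s"allocation fails: ${allocationFails(overflowed)}")
  }
}
```

If this reading is right, the failure would depend on how many objects the reference resolver has to track rather than on the size of the final byte array.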