Date: Sun, 12 Nov 2017 06:10:00 +0000 (UTC)
From: "Matthias Boehm (JIRA)"
To: issues@systemml.apache.org
Reply-To: dev@systemml.apache.org
Subject: [jira] [Resolved] (SYSTEMML-2013) Perftest genStratStatsData failed for 80GB

     [ https://issues.apache.org/jira/browse/SYSTEMML-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matthias Boehm resolved SYSTEMML-2013.
--------------------------------------
       Resolution: Fixed
         Assignee: Matthias Boehm
    Fix Version/s: SystemML 1.0

> Perftest genStratStatsData failed for 80GB
> ------------------------------------------
>
>                 Key: SYSTEMML-2013
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-2013
>             Project: SystemML
>          Issue Type: Bug
>            Reporter: Matthias Boehm
>            Assignee: Matthias Boehm
>             Fix For: SystemML 1.0
>
>
> {code}
> Caused by: org.apache.sysml.runtime.DMLRuntimeException: ERROR: Runtime error in program block generated from statement block between lines 107 and 0 -- Error evaluating instruction: SPARK°write°_mVar123·MATRIX·DOUBLE°mbperftest/stratstats/A_10M/data·SCALAR·STRING·true°binaryblock·SCALAR·STRING·true°·SCALAR·STRING·true
> 	at org.apache.sysml.runtime.controlprogram.ProgramBlock.executeSingleInstruction(ProgramBlock.java:294)
> 	at org.apache.sysml.runtime.controlprogram.ProgramBlock.executeInstructions(ProgramBlock.java:218)
> 	at org.apache.sysml.runtime.controlprogram.ProgramBlock.execute(ProgramBlock.java:163)
> 	at org.apache.sysml.runtime.controlprogram.Program.execute(Program.java:118)
> 	... 13 more
> Caused by: org.apache.spark.SparkException: Job aborted due to stage failure: Serialized task 15:2 was 323397641 bytes, which exceeds max allowed: spark.rpc.message.maxSize (134217728 bytes). Consider increasing spark.rpc.message.maxSize or using broadcast variables for large values.
> 	at org.apache.spark.scheduler.DAGScheduler.org$apache$spark$scheduler$DAGScheduler$$failJobAndIndependentStages(DAGScheduler.scala:1435)
> 	at org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1423)
> 	at org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1422)
> 	at scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59)
> 	at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:48)
> 	at org.apache.spark.scheduler.DAGScheduler.abortStage(DAGScheduler.scala:1422)
> 	at org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:802)
> 	at org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:802)
> 	at scala.Option.foreach(Option.scala:257)
> 	at org.apache.spark.scheduler.DAGScheduler.handleTaskSetFailed(DAGScheduler.scala:802)
> 	at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.doOnReceive(DAGScheduler.scala:1650)
> 	at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1605)
> 	at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1594)
> 	at org.apache.spark.util.EventLoop$$anon$1.run(EventLoop.scala:48)
> 	at org.apache.spark.scheduler.DAGScheduler.runJob(DAGScheduler.scala:628)
> 	at org.apache.spark.SparkContext.runJob(SparkContext.scala:1918)
> 	at org.apache.spark.SparkContext.runJob(SparkContext.scala:1931)
> 	at org.apache.spark.SparkContext.runJob(SparkContext.scala:1951)
> 	at org.apache.spark.rdd.PairRDDFunctions$$anonfun$saveAsHadoopDataset$1.apply$mcV$sp(PairRDDFunctions.scala:1226)
> 	at org.apache.spark.rdd.PairRDDFunctions$$anonfun$saveAsHadoopDataset$1.apply(PairRDDFunctions.scala:1168)
> 	at org.apache.spark.rdd.PairRDDFunctions$$anonfun$saveAsHadoopDataset$1.apply(PairRDDFunctions.scala:1168)
> 	at org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:151)
> 	at org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:112)
> 	at org.apache.spark.rdd.RDD.withScope(RDD.scala:362)
> 	at org.apache.spark.rdd.PairRDDFunctions.saveAsHadoopDataset(PairRDDFunctions.scala:1168)
> 	at org.apache.spark.rdd.PairRDDFunctions$$anonfun$saveAsHadoopFile$4.apply$mcV$sp(PairRDDFunctions.scala:1071)
> 	at org.apache.spark.rdd.PairRDDFunctions$$anonfun$saveAsHadoopFile$4.apply(PairRDDFunctions.scala:1037)
> 	at org.apache.spark.rdd.PairRDDFunctions$$anonfun$saveAsHadoopFile$4.apply(PairRDDFunctions.scala:1037)
> 	at org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:151)
> 	at org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:112)
> 	at org.apache.spark.rdd.RDD.withScope(RDD.scala:362)
> 	at org.apache.spark.rdd.PairRDDFunctions.saveAsHadoopFile(PairRDDFunctions.scala:1037)
> 	at org.apache.spark.api.java.JavaPairRDD.saveAsHadoopFile(JavaPairRDD.scala:803)
> 	at org.apache.sysml.runtime.instructions.spark.WriteSPInstruction.processMatrixWriteInstruction(WriteSPInstruction.java:218)
> 	at org.apache.sysml.runtime.instructions.spark.WriteSPInstruction.processInstruction(WriteSPInstruction.java:144)
> 	at org.apache.sysml.runtime.controlprogram.ProgramBlock.executeSingleInstruction(ProgramBlock.java:264)
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
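
The byte counts in the SparkException can be put in perspective with a little arithmetic. The sketch below is only a reading aid, not the fix applied for SYSTEMML-2013 (the notification does not say what the fix was). It assumes that spark.rpc.message.maxSize is configured in whole MiB, as described in the Spark configuration documentation, and the helper name min_rpc_max_size_mib is hypothetical.

```python
MIB = 1024 * 1024

# Values copied from the SparkException message in the stack trace.
task_bytes = 323397641   # size of serialized task 15:2
limit_bytes = 134217728  # reported spark.rpc.message.maxSize, in bytes


def min_rpc_max_size_mib(serialized_task_bytes: int) -> int:
    """Smallest whole-MiB limit strictly larger than the serialized task.

    Hypothetical helper; it requires the limit to be strictly larger
    than the task, which is a conservative assumption.
    """
    return serialized_task_bytes // MIB + 1


if __name__ == "__main__":
    # The reported limit expressed in MiB.
    print(limit_bytes // MIB)                # 128
    # How many times larger the task is than the limit.
    print(task_bytes / limit_bytes)
    # Smallest MiB setting under which this task would have fit.
    print(min_rpc_max_size_mib(task_bytes))  # 309
```

So the task was roughly 2.4 times the 128 MiB limit, and the setting would have needed to be at least 309 MiB for this particular task to pass. The error message itself also suggests the alternative of using broadcast variables for large values rather than raising the limit.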