systemml-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mike Dusenberry (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (SYSTEMML-1566) Possible regression from 0.13 -> 0.14 for MNIST LeNet script
Date Tue, 23 May 2017 19:01:04 GMT

    [ https://issues.apache.org/jira/browse/SYSTEMML-1566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16021668#comment-16021668
] 

Mike Dusenberry edited comment on SYSTEMML-1566 at 5/23/17 7:00 PM:
--------------------------------------------------------------------

Based on improvements made in SYSTEMML-1554 & SYSTEMML-1561, we are now actually faster
than we were in 0.14, despite the significant regressions.  Therefore, this can be resolved
for now.


was (Author: mwdusenb@us.ibm.com):
Based on improvements made in SYSTEMML-1554 & SYSTEMML-1561, we are now actually faster
than we were in 0.14, despite the significant regressions (which may have been due to correctness).
 Therefore, this can be resolved for now.

> Possible regression from 0.13 -> 0.14 for MNIST LeNet script
> ------------------------------------------------------------
>
>                 Key: SYSTEMML-1566
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1566
>             Project: SystemML
>          Issue Type: Bug
>            Reporter: Mike Dusenberry
>            Assignee: Mike Dusenberry
>             Fix For: SystemML 1.0
>
>         Attachments: explain.txt, stats.txt
>
>
> For the 0.14 release testing, I tried out the [MNIST LeNet example | https://github.com/apache/incubator-systemml/blob/master/scripts/nn/examples/mnist_lenet-train.dml]
on both 0.13 and 0.14 and noticed a possible regression.  Basically, on 0.14 the script took
longer to run and had 2513 Spark instructions executed, while on 0.13 only 864 Spark instructions
were executed.  This was run locally on a laptop using the 2 instructions at the top of the
script (and copied below).  I've also attached the stats and runtime explain logs.
> 1. Download data
> {code}
> nn/examples/get_mnist_data.sh
> {code}
> 2. Execute from the {{scripts}} directory.
> {code}
> spark-submit --master local[*] --driver-memory 10G --conf spark.driver.maxResultSize=0
--conf spark.rpc.message.maxSize=128 SystemML.jar -f nn/examples/mnist_lenet-train.dml -stats
-explain -nvargs train=nn/examples/data/mnist/mnist_train.csv test=nn/examples/data/mnist/mnist_test.csv
C=1 Hin=28 Win=28 epochs=1 out_dir=nn/examples/model/mnist_lenet
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message