From: Aishwarya Chaurasia
Date: Sat, 15 Apr 2017 17:08:19 +0530
Subject: Re: Regarding incubator systemml/breast_cancer project
To: dev@systemml.incubator.apache.org

Hello sir,

Can you please elaborate on what output we should expect? We tried executing the preprocess.py file using spark-submit, and it keeps adding tiles to the RDD. When we then run the visualisation.py file, it does not show any output. Can you please help us out as soon as possible by stating the output we should get and the sequence in which the files should be executed?

Thank you.

On 07-Apr-2017 5:54 AM, wrote:

> Hi Aishwarya,
>
> Thanks for sharing more info on the issue!
>
> To facilitate easier usage, I've updated the preprocessing code by pulling
> out most of the logic into a `breastcancer/preprocessing.py` module,
> leaving just the execution in the `Preprocessing.ipynb` notebook. There is
> also a `preprocess.py` script with the same contents as the notebook for
> use with `spark-submit`. The choice of the notebook or the script is just
> a matter of convenience, as they both import from the same
> `breastcancer/preprocessing.py` module.
>
> As part of the updates, I've added an explicit SparkSession parameter
> (`spark`) to the `preprocess(...)` function, and updated the body to use
> this SparkSession object rather than the older SparkContext `sc` object.
> Previously, the `preprocess(...)` function accessed the `sc` object that
> was pulled in from the enclosing scope, which would work while all of the
> code was colocated within the notebook, but not if the code was extracted
> and imported. The explicit parameter now allows for the code to be
> imported.
>
> Can you please try again with the latest updates? We are currently using
> Spark 2.x with Python 3.
> If you use the notebook, the pyspark kernel
> should have a `spark` object available that can be supplied to the
> functions (as is done now in the notebook), and if you use the
> `preprocess.py` script with `spark-submit`, the `spark` object will be
> created explicitly by the script.
>
> For a bit of context to others, Aishwarya initially reached out to find
> out if our breast cancer project could be applied to TIFF images, rather
> than the SVS images we are currently using (the answer is "yes" so long as
> they are "generic tiled TIFF" images, according to the OpenSlide
> documentation), and then followed up with Spark issues related to the
> preprocessing code. This conversation has been promptly moved to the
> mailing list so that others in the community can benefit.
>
> Thanks!
>
> -Mike
>
> --
>
> Mike Dusenberry
> GitHub: github.com/dusenberrymw
> LinkedIn: linkedin.com/in/mikedusenberry
>
> Sent from my iPhone.
>
> > On Apr 6, 2017, at 5:09 AM, Aishwarya Chaurasia wrote:
> >
> > Hey,
> >
> > The object sc is already defined in pyspark and yet this name error keeps
> > occurring. We are using spark 2.*
> >
> > Here is the link to error that we are getting :
> > https://paste.fedoraproject.org/paste/89iQODxzpNZVbSfgwocH8l5M1UNdIGYhyRLivL9gydE=
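
The scoping problem described in the quoted reply (a function that relies on a global `sc` works while all the code lives in one notebook, but raises a NameError once it is moved into an imported module) can be reproduced without Spark. The sketch below is only an illustration under stated assumptions: `FakeSession`, `preprocess_implicit`, and `preprocess_explicit` are hypothetical names, not part of the breast cancer project or of Spark, and the "imported" module is built in memory so that the example stays self-contained.

```python
import types

# Hypothetical stand-in for an imported module such as
# `breastcancer/preprocessing.py`. Executing the source inside a fresh
# module object gives its functions their own global namespace, which is
# what happens when a file is imported.
MODULE_SOURCE = '''
def preprocess_implicit(paths):
    # Looks up `sc` in this module's globals, where it was never defined.
    return [sc.load(p) for p in paths]

def preprocess_explicit(spark, paths):
    # Receives the session object as an explicit argument instead.
    return [spark.load(p) for p in paths]
'''

preprocessing = types.ModuleType("preprocessing")
exec(MODULE_SOURCE, preprocessing.__dict__)


class FakeSession:
    """Hypothetical stand-in for a Spark session; not a real Spark API."""

    def load(self, path):
        return "loaded " + path


# Defined in the caller's namespace, much like `sc` in a pyspark shell or
# notebook, but not visible to functions defined in the other module.
sc = FakeSession()


def call_implicit():
    try:
        return preprocessing.preprocess_implicit(["a.svs"])
    except NameError as err:
        return "NameError: " + str(err)


implicit_result = call_implicit()
explicit_result = preprocessing.preprocess_explicit(sc, ["a.svs"])

print(implicit_result)
print(explicit_result)
```

Passing the session as an argument keeps the function independent of whatever names happen to exist in the caller's environment, which is why the same code can then be used from both a notebook and a `spark-submit` script.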