From: arijit chakraborty
To: "dev@systemml.apache.org"
Subject: RE: Update Spark Configuration to improve SystemML performance
Date: Tue, 25 Jul 2017 18:14:34 +0000
Mailing-List: contact dev-help@systemml.apache.org; run by ezmlm
Reply-To: dev@systemml.apache.org
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="_000_MAXPR01MB02202437141AF12019524575A3B80MAXPR01MB0220INDP_"

--_000_MAXPR01MB02202437141AF12019524575A3B80MAXPR01MB0220INDP_
Content-Type: text/plain; charset="Windows-1252"

I'll also pick one issue and try to solve it. And it is really a very friendly group!

Thanks!
Arijit

Sent from Mail for Windows 10

From: Janardhan Pulivarthi
Sent: Tuesday, July 25, 2017 10:53 PM
To: himanshu.mohan78@gmail.com; dev@systemml.apache.org
Subject: Re: Update Spark Configuration to improve SystemML performance

Hi Himanshu!

We feel great you are here. SystemML is a very friendly community.

To get started:

1. See the components of SystemML at this link:
   https://issues.apache.org/jira/projects/SYSTEMML?selectedItem=com.atlassian.jira.jira-projects-plugin:components-page
2. Through this link you can select any component that interests you, browse the corresponding issues, and find the ones that fascinate you.
3. If you need any guidance on how to proceed on a particular issue, notify the other committers on the mailing list or in the JIRA itself.

Thanks a lot,
Cheers, Janardhan

On Tue, Jul 25, 2017 at 8:36 PM, Himanshu Mohan wrote:

> I am also interested in doing some real-life, hands-on work in SystemML.
>
> Thanks and Regards
> Himanshu
>
>
> On 25-Jul-2017, at 2:57 PM, arijit chakraborty wrote:
> >
> > Hi Matthias,
> >
> >
> > Thanks for your mail. I'm attaching the server configurations again. I'm
> > also adding your personal email id, just to be double sure you can see the
> > images. Pardon me for that. I was able to improve the setup further, so that
> > I can now run the code at the same speed as R (around 40 mins). However, the
> > setup I'm sharing is the older one. So most probably the performance of my
> > code depended on the Spark configuration. It would be great if you could
> > help me with that.
> >
> >
> > Also, currently I'm mainly working on CNN work, and I have decent
> > programming experience in Python & R. But I would request you to share with
> > me a project that is among the lowest-priority ones. This will help me get
> > accustomed to the project setup without being bothered about timelines.
> >
> >
> > Thank you!
> >
> > Arijit
> >
> >
> > From: Matthias Boehm
> > Sent: Tuesday, July 25, 2017 2:10:52 PM
> > To: dev@systemml.apache.org
> > Subject: Re: Update Spark Configuration to improve SystemML performance
> >
> > Great to hear that - we welcome additional contributions. Just let us know
> > which area you're most interested in (e.g., algorithms, APIs, optimizer,
> > runtime, etc.) and we could identify a couple of tasks to get you started.
> >
> > Regarding the performance numbers, I am not able to see the details. Also,
> > could you share which operation was causing the large GC overhead - maybe
> > we can improve the runtime for the specific scenario. Thanks.
> >
> > Regards,
> > Matthias
> >
> > On Mon, Jul 24, 2017 at 12:17 PM, arijit chakraborty wrote:
> >
> > > Hi,
> > >
> > >
> > > I tried tuning the Spark configuration file to improve SystemML
> > > performance. Even after much tuning, the R code runs in 40 mins, but
> > > SystemML takes 2.2 hours. Please find the Spark configuration
> > > screenshots. Please let me know if I'm making some mistake in tuning
> > > the Spark configuration. One problem we could rectify is the garbage
> > > collection time error. Now it is completely gone. That was one major
> > > bottleneck which was making the code extremely slow.
> > >
> > >
> > > I'm working on a local system and created a standalone version of Spark,
> > > with master and workers. The following are the details:
> > >
> > >
> > >
> > > I also want to know whether it is possible to get involved with SystemML
> > > development. My project is almost on the verge of completion, and I have
> > > learned a lot from all of you. And I really liked this project. So I want
> > > to contribute more fruitfully to it.
> > >
> > >
> > > Thank you!
> > >
> > > Arijit
> >

--_000_MAXPR01MB02202437141AF12019524575A3B80MAXPR01MB0220INDP_--