Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 1E2E8200D31 for ; Sat, 21 Oct 2017 01:38:10 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 1C884160BED; Fri, 20 Oct 2017 23:38:10 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 3B5A0160BCB for ; Sat, 21 Oct 2017 01:38:09 +0200 (CEST) Received: (qmail 25167 invoked by uid 500); 20 Oct 2017 23:38:08 -0000 Mailing-List: contact dev-help@mxnet.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@mxnet.incubator.apache.org Delivered-To: mailing list dev@mxnet.incubator.apache.org Received: (qmail 25155 invoked by uid 99); 20 Oct 2017 23:38:08 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 20 Oct 2017 23:38:08 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 518681807DC for ; Fri, 20 Oct 2017 23:38:07 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.63 X-Spam-Level: ** X-Spam-Status: No, score=2.63 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, HTML_MESSAGE=2, KAM_SHORT=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id RnvLH-etFK3O for ; Fri, 20 Oct 2017 23:38:04 +0000 (UTC) Received: from mail-it0-f49.google.com (mail-it0-f49.google.com [209.85.214.49]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id A122C5F5CC for ; Fri, 20 Oct 2017 23:38:03 +0000 (UTC) Received: by mail-it0-f49.google.com with SMTP id n195so444738itg.0 for ; Fri, 20 Oct 2017 16:38:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=Rc/QOae1Cq8OV5vIhMpa5Ey3XoOwyI1zPr4NjB8heWI=; b=NdeYS8v9+OnJ7AGuZdb9X7QVYSG34wL8+WFKVxPMOxWYAUenegbjTLtXWebyOKwdxs GAANXofcDSyiGLs+1j8+OSZ86HTRvzTuFPTqMDRjLJN3EPP9rGzX3dYIWrR60moI6Ksg xV9PLz79eggTKhuorvMv10k/MZmo0XAd/B0cRu2nuuJQuefy8dyTxperRWmdwqZs/Ij7 ECmYpvvynFvLr8bVp5q4OUWj+sAWH6Gf8NFp3Q+g+NbBAqAeA1Oamnw/DvD1fU26b5cw 4PCDe3dFKzIwEAU8eaNs6l3+DRdnDAQjLCGA+PmVGOjV9Ar2MJCubzkoiXyaNtQkKlsP 9igw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=Rc/QOae1Cq8OV5vIhMpa5Ey3XoOwyI1zPr4NjB8heWI=; b=m/fV6n+u1YteTN5ga5hu+2e/PrrCSGVEjs18wAzZYqfLChJCX6kVMIFLCw/CCKViu1 fja4m3dsh2KgQCHxWGK9pJ5zKfzjRmlnD7NYRDfFqGx0ras8exTQ2TsR25OkPaoSjLSJ N3Qz7tcnOOQFGTRk2tYsBE35RGUzwqRsfwA88FdsKD1ZarLyim9Y9GQn8tAhPSJ0X9qA +A3kh4oShzjr7h+PYGNtIHtcwfg5M6pDf7DtNaukDHNLFOJ78unnjvVsuYxnVeyjMcPk llEmAjibSkmBkt/USxuOvct906ZocNbul4vBeY3JT7iyij5xllgFmLujjfUUoao1I519 fJYw== X-Gm-Message-State: AMCzsaV+9U9KURjvAInwiN9YK2hYNmZiRmc/VZ8IMpwR6SqxkwajiMe2 FDtbNXeSJAVjdxDcI8u8OOp+Sgk6SlToIa4f7Cb9KA== X-Google-Smtp-Source: ABhQp+RwcyCvwt5hg0zJlGeHEIfjkN5p8YmuQ3YUMICU+MpvFfLl0jJJmAwLnSz8GSwBm5BXKPPjUoFYnP1dLNkR5uM= X-Received: by 10.36.76.1 with SMTP id a1mr670705itb.94.1508542682177; Fri, 20 Oct 2017 16:38:02 -0700 (PDT) MIME-Version: 1.0 Received: by 10.107.7.161 with HTTP; Fri, 20 Oct 2017 16:38:01 -0700 (PDT) In-Reply-To: References: From: Chris Olivier Date: Fri, 20 Oct 2017 16:38:01 -0700 Message-ID: Subject: Re: [Proposal] Stabilizing Apache MXNet CI build system To: dev@mxnet.incubator.apache.org Content-Type: multipart/alternative; boundary="001a11447da4b326be055c02f681" archived-at: Fri, 20 Oct 2017 23:38:10 -0000 --001a11447da4b326be055c02f681 Content-Type: text/plain; charset="UTF-8" I believe that Mu already started that discussion about using old mxnet.io Jenkins server. I expect deciding whether to replace would hinge in large part upon what it would be replaced with. On Fri, Oct 20, 2017 at 4:30 PM, sandeep krishnamurthy < sandeep.krishna98@gmail.com> wrote: > Chris: If the community decides to go with separate setup, then there will > be a tech design discussion and CodeCommit / Jenkins / Travis such > proposals will be covered and discussed. > > Thanks, > Sandeep > > On Fri, Oct 20, 2017 at 4:22 PM, Seb Kiureghian > wrote: > > > But the feather can definitely be added once MXNet graduates. > > > > On Fri, Oct 20, 2017 at 4:21 PM, Seb Kiureghian > > wrote: > > > > > The feather can only be used by Top Level Projects. > > > > > > On Fri, Oct 20, 2017 at 4:19 PM, Chris Olivier > > > wrote: > > > > > >> When the word Apache is in the Hadoop logo (not always), it includes > the > > >> feather and color scheme. > > >> > > >> On Fri, Oct 20, 2017 at 4:18 PM, Chris Olivier > > > >> wrote: > > >> > > >>> Thanks. > > >>> > > >>> Is there any way to work the feather into it? > > >>> > > >>> i.e. https://goo.gl/images/BU4dnG > > >>> > > >>> On Fri, Oct 20, 2017 at 4:11 PM, Seb Kiureghian > > >>> wrote: > > >>> > > >>>> https://imgur.com/a/aADkA > > >>>> > > >>>> On Fri, Oct 20, 2017 at 4:07 PM, Chris Olivier < > cjolivier01@gmail.com > > > > > >>>> wrote: > > >>>> > > >>>> > Why don;t we look into fully managed AWS CodeBuild? It maintains > > >>>> > everything. It's also compatible with Jenkins. > > >>>> > > > >>>> > On Fri, Oct 20, 2017 at 1:51 PM, Tianqi Chen < > > >>>> tqchen@cs.washington.edu> > > >>>> > wrote: > > >>>> > > > >>>> > > +1 > > >>>> > > > > >>>> > > Tianqi > > >>>> > > On Fri, Oct 20, 2017 at 1:39 PM Mu Li > wrote: > > >>>> > > > > >>>> > > > +1 > > >>>> > > > > > >>>> > > > > > >>>> > > > It seems that the Apache CI is quite overloaded these days, > and > > >>>> MXNet's > > >>>> > > CI > > >>>> > > > pipeline is too complex to run there. In addition, we may need > > to > > >>>> add > > >>>> > > more > > >>>> > > > devices, e.g. macpro and rasbperry pi, into the server, and > more > > >>>> tasks > > >>>> > > such > > >>>> > > > as pip build. It means a lot of requests to the Infra team. > > >>>> > > > > > >>>> > > > We can reuse our previous Jenkins server at > http://ci.mxnet.io/ > > . > > >>>> But > > >>>> > we > > >>>> > > > probably need a dedicate developer to maintain it. > > >>>> > > > > > >>>> > > > > > >>>> > > > > > >>>> > > > On Fri, Oct 20, 2017 at 1:01 PM, sandeep krishnamurthy < > > >>>> > > > sandeep.krishna98@gmail.com> wrote: > > >>>> > > > > > >>>> > > > > Hello all, > > >>>> > > > > > > >>>> > > > > I am hereby opening up a discussion thread on how we can > > >>>> stabilize > > >>>> > > Apache > > >>>> > > > > MXNet CI build system. > > >>>> > > > > > > >>>> > > > > Problems: > > >>>> > > > > > > >>>> > > > > ======== > > >>>> > > > > > > >>>> > > > > Recently, we have seen following issues with Apache MXNet CI > > >>>> build > > >>>> > > > systems: > > >>>> > > > > > > >>>> > > > > 1. Apache Jenkins master is overloaded and we see issues > > >>>> like - > > >>>> > > unable > > >>>> > > > > to trigger builds, difficult to load and view the blue > > ocean > > >>>> and > > >>>> > > other > > >>>> > > > > Jenkins build status page. > > >>>> > > > > 2. We are generating too many request/interaction on > Apache > > >>>> Infra > > >>>> > > > team. > > >>>> > > > > 1. Addition/deletion of new slave: Caused from scaling > > >>>> > activity, > > >>>> > > > > recycling, troubleshooting or any actions leading to > > >>>> change of > > >>>> > > > slave > > >>>> > > > > machines. > > >>>> > > > > 2. Plugins / other Jenkins Master configurations. > > >>>> > > > > 3. Experimentation on CI pipelines. > > >>>> > > > > 3. Harder to debug and resolve issues - Since access to > > >>>> master and > > >>>> > > > slave > > >>>> > > > > is not with the same community, it requires Infra and > > >>>> community to > > >>>> > > > dive > > >>>> > > > > deep together on all action items. > > >>>> > > > > > > >>>> > > > > Possible Solutions: > > >>>> > > > > > > >>>> > > > > ============== > > >>>> > > > > > > >>>> > > > > 1. Can we set up a separate Jenkins CI build system for > > >>>> Apache > > >>>> > MXNet > > >>>> > > > > outside Apache Infra? > > >>>> > > > > 2. Can we have a separate Jenkins Master in Apache Infra > > for > > >>>> > MXNet? > > >>>> > > > > 3. Review design of current setup, refine and fill the > > gaps. > > >>>> > > > > > > >>>> > > > > @ Mentors/Infra team/Community: > > >>>> > > > > > > >>>> > > > > ========================== > > >>>> > > > > > > >>>> > > > > Please provide your suggestions on how we can proceed > further > > >>>> and > > >>>> > work > > >>>> > > on > > >>>> > > > > stabilizing the CI build systems for MXNet. > > >>>> > > > > > > >>>> > > > > Also, if the community decides on separate Jenkins CI build > > >>>> system, > > >>>> > > what > > >>>> > > > > important points should be taken care of apart from the > below: > > >>>> > > > > > > >>>> > > > > 1. Community being able to access the build page for > build > > >>>> > statuses. > > >>>> > > > > 2. Committers being able to login with apache > credentials. > > >>>> > > > > 3. Hook setup from apache/incubator-mxnet repo to Jenkins > > >>>> master. > > >>>> > > > > > > >>>> > > > > > > >>>> > > > > Irrespective of the solution we come up, I think we should > > >>>> initiate a > > >>>> > > > > technical design discussion on how to setup the CI build > > system. > > >>>> > > > Probably 1 > > >>>> > > > > or 2 pager documents with the architecture and review with > > >>>> Infra and > > >>>> > > > > community members. > > >>>> > > > > > > >>>> > > > > ***There were few proposal and discussion on the slack > > channel, > > >>>> to > > >>>> > > reach > > >>>> > > > > wider community members, moving that discussion formally to > > this > > >>>> > list. > > >>>> > > > > > > >>>> > > > > > > >>>> > > > > My Proposal: Option 1 - Set up separate Jenkins CI build > > system. > > >>>> > > > > > > >>>> > > > > Thanks, > > >>>> > > > > > > >>>> > > > > Sandeep > > >>>> > > > > > > >>>> > > > > > > >>>> > > > > > > >>>> > > > > -- > > >>>> > > > > Sandeep Krishnamurthy > > >>>> > > > > > > >>>> > > > > > >>>> > > > > >>>> > > > >>>> > > >>> > > >>> > > >> > > > > > > > > > -- > Sandeep Krishnamurthy > --001a11447da4b326be055c02f681--