From dev-return-7333-archive-asf-public=cust-asf.ponee.io@mxnet.incubator.apache.org Thu Feb 13 17:53:00 2020 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [207.244.88.153]) by mx-eu-01.ponee.io (Postfix) with SMTP id 15A301802C7 for ; Thu, 13 Feb 2020 18:52:59 +0100 (CET) Received: (qmail 53270 invoked by uid 500); 13 Feb 2020 17:52:59 -0000 Mailing-List: contact dev-help@mxnet.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@mxnet.incubator.apache.org Delivered-To: mailing list dev@mxnet.incubator.apache.org Received: (qmail 53257 invoked by uid 99); 13 Feb 2020 17:52:59 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Feb 2020 17:52:59 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 83585C005E for ; Thu, 13 Feb 2020 17:52:58 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.196 X-Spam-Level: X-Spam-Status: No, score=-0.196 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, MIME_QP_LONG_LINE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-ec2-va.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id w3jdinBQnB6R for ; Thu, 13 Feb 2020 17:52:57 +0000 (UTC) Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=209.85.160.177; helo=mail-qt1-f177.google.com; envelope-from=dzianis.davydzenka@gmail.com; receiver= Received: from mail-qt1-f177.google.com (mail-qt1-f177.google.com [209.85.160.177]) by mx1-ec2-va.apache.org (ASF Mail Server at mx1-ec2-va.apache.org) with ESMTPS id 219B6BB8C3 for ; Thu, 13 Feb 2020 17:52:57 +0000 (UTC) Received: by mail-qt1-f177.google.com with SMTP id v25so5029610qto.7 for ; Thu, 13 Feb 2020 09:52:57 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=user-agent:date:subject:from:to:message-id:thread-topic:references :in-reply-to:mime-version:content-transfer-encoding; bh=jaHw8sIIq2CzfZFqjT/HBoBXzIzquMpTvVy6Cot47IQ=; b=YW5KuusJWoesQHBZ/muogd8VOiF+ry8Vqo98CgQmPWwgw4/jqrRkZ8/Hn4CQNrk4ez fHe90naFPyMCX5J50JAlO6d3p7FO5QMUINR0Gkn0HwuPP4TOHrc5AHE2uX4K+UYPMFsm FK8/YDuQZAHNa7nivPLPqfeF6/Y9Hy7xr8v9UQ+8RF4iB9+GHwpLEk0qd9b52YG8kmaF RW7EMLtf2qLLaB47gpUlK0cQtprsk5e/QH+HzQj4Kd8YKJLxsqkE/tZDe8cgaDPsuwJa VgHLBsTaLDlWdulWFeFm5FMN/Zl8+aZTAZEBZE1jU480jkMWx1ifK5I9qqx0XcxGtoBY GLWw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:user-agent:date:subject:from:to:message-id :thread-topic:references:in-reply-to:mime-version :content-transfer-encoding; bh=jaHw8sIIq2CzfZFqjT/HBoBXzIzquMpTvVy6Cot47IQ=; b=X31v/SfgBCeLW63p3MYHTf3vygDw1/zSWhNZU6KnMBQGe7G6PkE3ivs0s+gVRW8x1N 944H7u0I+q1GOYMyzrZrQLAF5HjAaI/rbDEDooUY8pjFtjvvRgi35fOmvcQkvjZamJTX hudfF3HLeWTQvUNIu/9J6K+aKlPq7fQgKsyLP9u7ROyi3ho808os6UISNpYqPaZuhH8v t9Xr/0WOH0jzMiTbqkkPJ+IX3TIPqrCvdVSz0x05ZHIGt1gPUMBpXj9yyOum15U8z4Jq mB4cedFQCLjH/MisV5Y+iLNtbncwb5i32PCYzdcL9b2VwtOZ2tKJop9r7SUJ3MrPd4qd ZHhQ== X-Gm-Message-State: APjAAAUG8FElTqt3VqmJf71Bfvi774I02oXyo2Ow1OsD/WWpgDC/czRU PhHGRKkPN2QU+rBPrx8nVpFlnK+/5m8= X-Google-Smtp-Source: APXvYqyrPYI0Ci+wVZ54mFfeCsr/VjHijWdpkqRfIghL9b8vBbZWT0+qB6tJ6jdmzax4UUtlqAuVgw== X-Received: by 2002:ac8:540f:: with SMTP id b15mr12916357qtq.237.1581616376340; Thu, 13 Feb 2020 09:52:56 -0800 (PST) Received: from [10.95.182.191] (72-21-196-67.amazon.com. [72.21.196.67]) by smtp.gmail.com with ESMTPSA id w53sm1882471qtb.91.2020.02.13.09.52.55 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Thu, 13 Feb 2020 09:52:55 -0800 (PST) User-Agent: Microsoft-MacOutlook/10.1f.0.191110 Date: Thu, 13 Feb 2020 09:52:54 -0800 Subject: Re: Update on upcoming changes to the MXNet CI: Jenkins From: "Davydenko, Denis" To: Message-ID: <6F22B7CE-B534-4131-B5C1-E08DBB2AA107@amazon.com> Thread-Topic: Update on upcoming changes to the MXNet CI: Jenkins References: <9067BC28-2D66-49A8-8AA0-D62F052D13AD@gmail.com> In-Reply-To: Mime-version: 1.0 Content-type: text/plain; charset="UTF-8" Content-transfer-encoding: quoted-printable This makes total sense, Aaron. We can probably spend some time on these mod= ifications once we complete originally mentioned changes __ =EF=BB=BFOn 2/13/20, 9:21 AM, "Aaron Markham" wrote: +1 These are good action items that should help alleviate part of the CI issues. =20 The following comments are not to take away from your proposal. Move forward, assuming the community agrees. I'd really like to see particular tests run only when the PR is touching a related part. While this is more effort, it would really make a major difference. Light research shows that projects have been doing this for quite some time, so it wouldn't be a new invention and deep exploration. =20 I realize there are a lot of interdependencies and it would probably not work for everything. But, what if we start small? --> Docs pages (*.rst, *.md, *.html, *.js, *.css): don't trigger most tests, especially GPU and cross-platform tests. --> Tutorials that have GPU requirements run their own validation tests, and tutorials that don't have GPU requirement don't get tested on GPUs. =20 Cheers, Aaron =20 =20 =20 On Wed, Feb 12, 2020 at 10:12 AM Davydenko, Denis wrote: > > Hello, MXNet dev community, > As you all know, the experience with CI infrastructure isn=E2=80=99t ideal = in spite of its high cost. For this reason, we=E2=80=99re proposing the following = changes to improve stability, reduce cost, and grant more control to contrib= utors. As we work in a refresh of CI, we believe these changes will reduce t= he pain we all suffer when we try to push a PR through the system. > > Following is the list of changes: > Fix missing status reports between GH and Jenkins > Update Jenkins permission groups to re-trigger builds > Introduce per-PR CI bot > Details: > > - Fix missing status reports > Currently, once commit gets added to PR - the CI is run on that added= commit. Sometimes, CI run status is missing from the commit in Github despi= te having completed in Jenkins. Example: CI run: http://jenkins.mxnet-ci.ama= zon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/PR-= 17376/17/pipeline, commit status in github (missing unix-cpu, unix-gpu and w= indows-gpu statuses): https://github.com/apache/incubator-mxnet/pull/17376#p= artial-pull-merging. > Problem: There seems to be a bug where some status reports are missin= g on Github. The hypothesis is that there is some issue with Github Hooks. > > - Update Jenkins permission groups to re-trigger builds > Problem: Currently, only MXNet Committers and selected people from AW= S have the ability to re-trigger CI runs on PRs. This leaves the PR Authors = waiting for authorized users to re-trigger their PRs for them. > Solution : Allow these membership categories Jenkins Admins, MXNet Co= mmitters, and PR Authors to re-trigger PR builds. > > - Introduce per-PR CI bot > Problem: As of date, MXNet CI is automated. It runs every time a comm= it is pushed onto your Github PR. This results in lot of unnecessary CI runs= apart from added costs. > Solution: Switch to Manual Trigger. Users from authorized groups (1 o= f the 3 categories mentioned above) can trigger CI run by adding a simple co= mment to PR: =E2=80=9C[mxnet-ci] run=E2=80=9D. > > -- > Thank you, > > AWS MXNet team > > > =20