From dev-return-4357-archive-asf-public=cust-asf.ponee.io@mxnet.incubator.apache.org Tue Oct 2 05:16:36 2018
Mailing-List: contact dev-help@mxnet.incubator.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@mxnet.incubator.apache.org
Delivered-To: mailing list dev@mxnet.incubator.apache.org
MIME-Version: 1.0
From: Lin Yuan
Date: Mon, 1 Oct 2018 20:16:16 -0700
Subject: Re: CUDNN algorithm selection failure
To: dev@mxnet.incubator.apache.org
Content-Type: multipart/alternative; boundary="0000000000000fe8c50577365983"

--0000000000000fe8c50577365983
Content-Type: text/plain; charset="UTF-8"

Hi
Pedro,

I also got this failure in my PR:
http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/incubator-mxnet/detail/PR-11742/27/pipeline

I was not able to identify the root cause from the changelist. Are you
suggesting there is some flakiness in the master branch too?

Thanks,
Lin

On Mon, Oct 1, 2018 at 4:55 PM Pedro Larroy wrote:

> Hi
>
> I saw this failure on CI:
>
> http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/incubator-mxnet/detail/master/1697/pipeline
>
> Have you seen other cases where we fail to select the best CUDNN algorithm?
> In which circumstances this could happen, and do you think is a good idea
> to have one selected by default as a last resort?
>
>
> Pedro.
>

--0000000000000fe8c50577365983--