From dev-return-5491-archive-asf-public=cust-asf.ponee.io@singa.apache.org Thu May 28 09:04:00 2020 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [207.244.88.153]) by mx-eu-01.ponee.io (Postfix) with SMTP id 23E2B180643 for ; Thu, 28 May 2020 11:04:00 +0200 (CEST) Received: (qmail 39668 invoked by uid 500); 28 May 2020 09:03:59 -0000 Mailing-List: contact dev-help@singa.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@singa.apache.org Delivered-To: mailing list dev@singa.apache.org Received: (qmail 39658 invoked by uid 99); 28 May 2020 09:03:59 -0000 Received: from ec2-52-202-80-70.compute-1.amazonaws.com (HELO gitbox.apache.org) (52.202.80.70) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 28 May 2020 09:03:59 +0000 From: =?utf-8?q?GitBox?= To: dev@singa.apache.org Subject: =?utf-8?q?=5BGitHub=5D_=5Bsinga=5D_chrishkchris_edited_a_comment_on_pull_req?= =?utf-8?q?uest_=23709=3A_Upgrade_CUDNN_version_from_7=2E3=2E1_to_7=2E6=2E5?= Message-ID: <159065663938.10999.14220979966452530207.asfpy@gitbox.apache.org> Date: Thu, 28 May 2020 09:03:59 -0000 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit In-Reply-To: References: chrishkchris edited a comment on pull request #709: URL: https://github.com/apache/singa/pull/709#issuecomment-635215622 Concerning my conda build test in our server: Firstly, the cuda runtime library version at each node is: panda 1-4 cuda 9.0 panda 5-8 cuda 10.0 panda 5-16 cuda 10.1 panda 17-18 cuda 10.2 1. When I use the nvidia docker nvidia/cuda:10.0-devel-ubuntu16.04 to build the conda package on panda 18 using cudatoolkit 10.2 and cudnn 7.6.5. It can build the package and pass the test case. However, when I install the conda package and run, it returns error message: "ImportError: libcudart.so.10.0: cannot open shared object file: No such file or directory" 2. When I use the nvidia docker nvidia/cuda:10.2-devel-ubuntu16.04 to build conda package on panda 18 using cudatoolkit 10.2 and cudnn 7.6.5. It can not compile with the error message: "/root/miniconda/conda-bld/singa_1590645399908/work/include/singa/utils/cuda_utils.h:24:23: fatal error: cublas_v2.h: No such file or directory" 3. Last time I built the conda package using cuda 10.0 cudnn 7.6.5, the package can be used sucessfully at panda 7 and panda 13 (seems that runtime cuda library installed at every nodes are backward compatiable) Therefore, in this PR I cannot update to cuda 10.2 (because I did not success to make it run after building it) ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org