From dev-return-6902-archive-asf-public=cust-asf.ponee.io@mxnet.incubator.apache.org Tue Nov 12 20:00:51 2019 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [207.244.88.153]) by mx-eu-01.ponee.io (Postfix) with SMTP id 013A6180656 for ; Tue, 12 Nov 2019 21:00:50 +0100 (CET) Received: (qmail 34063 invoked by uid 500); 12 Nov 2019 20:00:50 -0000 Mailing-List: contact dev-help@mxnet.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@mxnet.incubator.apache.org Delivered-To: mailing list dev@mxnet.incubator.apache.org Received: (qmail 34049 invoked by uid 99); 12 Nov 2019 20:00:49 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Nov 2019 20:00:49 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 46B74180C51 for ; Tue, 12 Nov 2019 20:00:49 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.631 X-Spam-Level: X-Spam-Status: No, score=0.631 tagged_above=-999 required=6.31 tests=[DKIMWL_WL_HIGH=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HEADER_FROM_DIFFERENT_DOMAINS=0.25, HTML_IMAGE_ONLY_24=1.282, HTML_MESSAGE=0.2, MAILING_LIST_MULTI=-1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (1024-bit key) header.d=github.com Received: from mx1-he-de.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id BAYcDVzJdaBq for ; Tue, 12 Nov 2019 20:00:48 +0000 (UTC) Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=2a00:1450:4864:20::229; helo=mail-lj1-x229.google.com; envelope-from=dmlc.notification+caf_=dev=mxnet.apache.org@gmail.com; receiver= Received: from mail-lj1-x229.google.com (mail-lj1-x229.google.com [IPv6:2a00:1450:4864:20::229]) by mx1-he-de.apache.org (ASF Mail Server at mx1-he-de.apache.org) with ESMTPS id 7F54A7DD69 for ; Tue, 12 Nov 2019 20:00:47 +0000 (UTC) Received: by mail-lj1-x229.google.com with SMTP id g3so19201662ljl.11 for ; Tue, 12 Nov 2019 12:00:47 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:delivered-to:dkim-signature:date:from:reply-to :to:cc:message-id:in-reply-to:references:subject:mime-version :content-transfer-encoding:precedence:list-id:list-archive:list-post :list-unsubscribe; bh=X39usLpqgN9I31py89UsXOKOV3n4vl/2CKRXIu9gxcI=; b=SDYjDMnzj0UUUGRFtSB+8eM6Bt4o3w4WWitWssusjLoBGcbKWVr6hcLUVFeiLE52D4 ElWXKBmm1vWkhe44QfspJjS7fV/SCUje8cP/nLXKJZag96WvJZN5FP3s+dpFzl+RAB8c s3aFZ8i3mWxfwtCO8l92S7avHskwZ51yxTeYH5cycl8BmqwKyXF8xpwsAWDnwDVubSmV YsiUUSDXXG2Xyx8u+NtZy08s7dp89B/hFEgwTVGEFdczHHt7U1H2/R0oUwZqrj8VkOof A74MdLeShVpJMXExACY74i6dO2EWFLFK2Q0RfQsr7t8emFgJU1eA2m5//xdI3bNbzg/h co9w== X-Gm-Message-State: APjAAAXGsSxQPSbS0pt65875vsUSvm7sIP+GU/owqXD8aRvhOePwcR8w YzI6FWvE3Gf0lJEmIZ01gT71Y9/qlKeYViku8wHLEmmsrMU/TJ8= X-Received: by 2002:a2e:95c5:: with SMTP id y5mr21884630ljh.184.1573588846911; Tue, 12 Nov 2019 12:00:46 -0800 (PST) X-Forwarded-To: dev@mxnet.apache.org X-Forwarded-For: dmlc.notification@gmail.com dev@mxnet.apache.org Delivered-To: dmlc.notification@gmail.com Received: by 2002:a05:6504:1343:0:0:0:0 with SMTP id m3csp8505129ltp; Tue, 12 Nov 2019 12:00:45 -0800 (PST) X-Google-Smtp-Source: APXvYqzUJA64zZf44tlZBxHYiO7KzY7aA9hr9Hp7xVkU5Fg+3Upn7nRYsil6QXCJvr0FE/cQ6hqj X-Received: by 2002:ae9:ef05:: with SMTP id d5mr15493256qkg.242.1573588845511; Tue, 12 Nov 2019 12:00:45 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1573588845; cv=none; d=google.com; s=arc-20160816; b=XOTqOBihqTr6Xwx2pb+gEuSQ+s6yvOXq4R/1TdGVmslnGCl9G92WaWTDDayEP0y40b zTTwyEs4kpGb7bz0h52ecGzlYrZCC/vYkA6MFioz3qhgKTGGjpaoiL4Pp77ZqrBJqHyW t/wW5nIDXS1UYID9Ryz9nHlyWBaEirB0FzgjMvPl52H0jsDdYzoUHPYqga0vrIZ+pfHp 5rVRuxXWqvvqoiDnCb7lEXwVguVQe41WRg95xppmCuiUY3h38v6rYfZThq2/MFG83G2d xNF9UKEbVpvKR/4yRKNl7AQa+j3UkVR6rLrJvO80jl6SsHXEGkBwV629naQLklf7To1u kZjA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-post:list-archive:list-id:precedence :content-transfer-encoding:mime-version:subject:references :in-reply-to:message-id:cc:to:reply-to:from:date:dkim-signature; bh=X39usLpqgN9I31py89UsXOKOV3n4vl/2CKRXIu9gxcI=; b=ig/1xMgNFp+6To9dJAUizt5PxKj2GgCgMXW+fWm2rWNLbpbAxWXLZy4wotMW2aEeH9 C8zKjtImu/DQ7wVzlg8NUUXIJ9EbEPjzjB/et/cAv+7bXAohrSXOZrpUT4P9fyJYSqFS ga648HHAGYaq1MGOqf/3mnhCEwHhb7kVC7VCEVEXx7KRRy3uq8xb1Bn3sQE4D6dloRdT C/pOsSrRkbzmnZ8lcnYLhjGQJcuEEIuBD5IsG4uRjvaQdkiA7RsmTEWWR0amhDEzmLb+ sxiGRWHgZC2zCpbcp1h8apFKn55d8Ipge31u7mvlxpL5YY9zoO47BhTINzMGLSGPYnJ9 9bAg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass (test mode) header.i=@github.com header.s=pf2014 header.b=bkAdnPhg; spf=pass (google.com: domain of noreply@github.com designates 192.30.252.201 as permitted sender) smtp.mailfrom=noreply@github.com; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=github.com Received: from out-18.smtp.github.com (out-18.smtp.github.com. [192.30.252.201]) by mx.google.com with ESMTPS id p79si6718063qke.341.2019.11.12.12.00.45 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 12 Nov 2019 12:00:45 -0800 (PST) Received-SPF: pass (google.com: domain of noreply@github.com designates 192.30.252.201 as permitted sender) client-ip=192.30.252.201; Authentication-Results: mx.google.com; dkim=pass (test mode) header.i=@github.com header.s=pf2014 header.b=bkAdnPhg; spf=pass (google.com: domain of noreply@github.com designates 192.30.252.201 as permitted sender) smtp.mailfrom=noreply@github.com; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=github.com Received: from github-lowworker-275fa97.va3-iad.github.net (github-lowworker-275fa97.va3-iad.github.net [10.48.17.64]) by smtp.github.com (Postfix) with ESMTP id 343FD6E018A for ; Tue, 12 Nov 2019 12:00:45 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=github.com; s=pf2014; t=1573588845; bh=X39usLpqgN9I31py89UsXOKOV3n4vl/2CKRXIu9gxcI=; h=Date:From:Reply-To:To:Cc:In-Reply-To:References:Subject:List-ID: List-Archive:List-Post:List-Unsubscribe:From; b=bkAdnPhgCK5RI3pdMR/Vjutx9d7D5/V2ul6ZIqU2bVnelkqfGZzwt1GEtn3fEyZCu Gu+1yLWktMvI2UBBOfmngmZPT/8OZv1wkaASbgdEViiNKJeFUXoODNYDSagHlocWT4 GEbD24R5P0CwWafjccIlN5BPQr8dRe5VrAtsfhhs= Date: Tue, 12 Nov 2019 12:00:45 -0800 From: Haibin Lin Reply-To: apache/incubator-mxnet To: apache/incubator-mxnet Cc: Subscribed Message-ID: In-Reply-To: References: Subject: Re: [apache/incubator-mxnet] [RFC] Unified API for Distributed Data Parallel Training (#16795) Mime-Version: 1.0 Content-Type: multipart/alternative; boundary="--==_mimepart_5dcb0f6d26e51_4333fb36eccd95c629783"; charset=UTF-8 Content-Transfer-Encoding: 7bit X-GitHub-Sender: eric-haibin-lin X-GitHub-Recipient: szha X-GitHub-Reason: subscribed List-Archive: https://github.com/apache/incubator-mxnet X-Auto-Response-Suppress: All X-GitHub-Recipient-Address: dmlc.notification@gmail.com ----==_mimepart_5dcb0f6d26e51_4333fb36eccd95c629783 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit I did mean use case 2,3,4. Initialization is done in the constructor `kv.__init__()`, and for horovod it could be simply a `hvd.init()` call. I have not discussed problem 1 for too much details. horovod uses mpirun to setup connection and launch processes, while byteps/p3 and native kvstore currently use the `dmlc/launcher` script. I do see that `dmlc/launcher` has mpi support, but I need to play more with it to see if it fits existing use cases. But I don't see fundamental blockers for (1). -- You are receiving this because you are subscribed to this thread. Reply to this email directly or view it on GitHub: https://github.com/apache/incubator-mxnet/issues/16795#issuecomment-553089601 ----==_mimepart_5dcb0f6d26e51_4333fb36eccd95c629783--