mxnet-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Qin, Zhennan" <zhennan....@intel.com>
Subject Re: BytePS-MXNet Integration
Date Wed, 06 Nov 2019 06:59:39 GMT
Hi Yimin,

Welcome to make contribution to MXNet project!

From <https://github.com/bytedance/byteps/blob/master/README.md> https://github.com/bytedance/byteps/blob/master/README.md
I found another limitation that isn't shown in your proposal:

BytePS does not support pure CPU training for now. One reason is that the cheap PS assumption<https://github.com/bytedance/byteps/blob/master/docs/rationale.md>
of BytePS do not hold for CPU training. Consequently, you need CUDA and NCCL to build and
run BytePS.

I have a couple of question for this: How's the status of CPU training support? If CPU training
isn't supported yet, what's the challenge to support it? Do you have a plan to support it?

Thanks,
Zhennan

On Wed, 2019-11-06 at 12:14 +0800, Yimin Jiang wrote:

Hi MXNet Community,


BytePS (https://github.com/bytedance/byteps) is a high-performance,

cross-framework architecture for distributed training. BytePS developers

are planning to integrate a part of BytePS into MXNet. The link below is

the proposal. Feedbacks are welcome.


https://cwiki.apache.org/confluence/display/MXNET/BytePS-MXNet+Integration



Thank you,

Yimin Jiang

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message