mxnet-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Yimin Jiang <jym...@gmail.com>
Subject Re: BytePS-MXNet Integration
Date Wed, 06 Nov 2019 16:13:35 GMT
Hi Zhennan,

Thanks for your interest. To be honest, our team currently do not have a
plan for CPU training. That said, the notion of BytePS is not GPU-specific
and should also apply to CPU. I do not see a fundamental challenge yet. And
we welcome contributions on this.

Thank you,
Yimin

On Wed, Nov 6, 2019 at 2:59 PM Qin, Zhennan <zhennan.qin@intel.com> wrote:

> Hi Yimin,
>
> Welcome to make contribution to MXNet project!
>
> From <https://github.com/bytedance/byteps/blob/master/README.md>
> https://github.com/bytedance/byteps/blob/master/README.md I found another
> limitation that isn't shown in your proposal:
>
> BytePS does not support pure CPU training for now. One reason is that the
> cheap PS assumption<
> https://github.com/bytedance/byteps/blob/master/docs/rationale.md> of
> BytePS do not hold for CPU training. Consequently, you need CUDA and NCCL
> to build and run BytePS.
>
> I have a couple of question for this: How's the status of CPU training
> support? If CPU training isn't supported yet, what's the challenge to
> support it? Do you have a plan to support it?
>
> Thanks,
> Zhennan
>
> On Wed, 2019-11-06 at 12:14 +0800, Yimin Jiang wrote:
>
> Hi MXNet Community,
>
>
> BytePS (https://github.com/bytedance/byteps) is a high-performance,
>
> cross-framework architecture for distributed training. BytePS developers
>
> are planning to integrate a part of BytePS into MXNet. The link below is
>
> the proposal. Feedbacks are welcome.
>
>
> https://cwiki.apache.org/confluence/display/MXNET/BytePS-MXNet+Integration
>
>
>
> Thank you,
>
> Yimin Jiang
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message