tvm-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Zhao Wu <>
Subject Re: [dmlc/tvm] [RFC] Frontend layout transformation (#2519)
Date Fri, 19 Apr 2019 02:20:42 GMT
@srkreddy1238 @yzhliu Thanks comments!

If all of you agree, I will make TFLite frontend support from NCHW to NHWC.

@yzhliu Yes. quantization part support is not been upstreamed yet. It has many changes. I
plan to upstream it in dev 0.6. My original plan is to support TFLite NHWC the reason is we
could leverage auto tuning of NCHW and see the performance of quantization model. The initial
work is we could faster than FP32 30% in Mobilenet V1 using spatial pack. We also find this
is the limit of quantization model, we could tensorize `q_conv2d` to get better performance.
However, if we change the layout from NCHW to NHWC, we should have some additional work to
do, for example auto tuning of NHWC support (including conv2d and depthwise convolution).
Alright, I could start to do this work firstly to support TFLite NHWC and upstream it before
quantization part, because this work is much easy than quantization part.

You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
  • Unnamed multipart/alternative (inline, 7-Bit, 0 bytes)
View raw message