Optimization for BERT Inference Performance on CPU

Source: Deep Learning on Medium

Author: Shufan Wu, Tao Lv, Pengxin Yuan, Patric Zhao, Jason Ye

Continue reading on Apache MXNet »