Intro — BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Source: Deep Learning on Medium

References:
https://spaces.ac.cn/archives/4122
https://www.jiqizhixin.com/articles/2018-12-10-8
http://jalammar.github.io/illustrated-bert/
