"Language Modeling with Deep Transformers."

Kazuki Irie et al. (2019)