"Megatron-LM: Training Multi-Billion Parameter Language Models Using Model ..."

Mohammad Shoeybi et al. (2019)