"Large Batch Optimization for Deep Learning: Training BERT in 76 minutes."

Yang You et al. (2020)

Details and statistics

DOI:

access: open

type: Conference or Workshop Paper

metadata version: 2022-12-17

a service of Schloss Dagstuhl - Leibniz Center for Informatics