"DeepSpeed-Inference: Enabling Efficient Inference of Transformer Models ..."

Reza Yazdani Aminabadi et al. (2022)

Details and statistics

DOI: 10.1109/SC41404.2022.00051

access: closed

type: Conference or Workshop Paper

metadata version: 2023-05-24
