"GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy ..."

Ali Hadi Zadeh et al. (2020)


DOI: 10.1109/MICRO50266.2020.00071

access: closed

type: Conference or Workshop Paper

metadata version: 2022-04-09
