


default search action
53rd MICRO 2020: Athens, Greece
- 53rd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2020, Athens, Greece, October 17-21, 2020. IEEE 2020, ISBN 978-1-7281-7383-2

Session 1A: Security and Privacy I
- Yeonhong Park, Woosuk Kwon, Eojin Lee, Tae Jun Ham, Jung Ho Ahn

, Jae W. Lee:
Graphene: Strong yet Lightweight Row Hammer Protection. 1-13 - Alexander Freij, Shougang Yuan, Huiyang Zhou, Yan Solihin:

Persist Level Parallelism: Streamlining Integrity Tree Updates for Secure Persistent Memory. 14-27 - Zhi Zhang, Yueqiang Cheng, Dongxi Liu, Surya Nepal, Zhi Wang, Yuval Yarom:

PThammer: Cross-User-Kernel-Boundary Rowhammer through Implicit Accesses. 28-41 - Dimitrios Skarlatos, Qingrong Chen, Jianyan Chen, Tianyin Xu, Josep Torrellas:

Draco: Architectural and Operating System Support for System Call Security. 42-57
Session 1B: Machine Learning Accelerators with New Technologies
- Koki Ishida, Ilkwon Byun

, Ikki Nagaoka, Kosuke Fukumitsu, Masamitsu Tanaka
, Satoshi Kawakami
, Teruo Tanimoto
, Takatsugu Ono, Jangwoo Kim, Koji Inoue:
SuperNPU: An Extremely Fast Neural Processing Unit Using Superconducting Logic Devices. 58-72 - Muhammad Husnain Mubarik, Dennis D. Weller, Nathaniel Bleier

, Matthew Tomei, Jasmin Aghassi-Hagmann
, Mehdi B. Tahoori, Rakesh Kumar:
Printed Machine Learning Classifiers. 73-87 - Akshay Krishna Ramanathan, Gurpreet S. Kalsi, Srivatsa Srinivasa, Tarun Makesh Chandran, Kamlesh R. Pillai, Om Ji Omer

, Vijaykrishnan Narayanan
, Sreenivas Subramoney
:
Look-Up Table based Energy Efficient Processing in Cache Support for Neural Network Acceleration. 88-101 - Ashutosh Dhar, Xiaohao Wang, Hubertus Franke, Jinjun Xiong

, Jian Huang, Wen-Mei W. Hwu, Nam Sung Kim, Deming Chen:
FReaC Cache: Folded-logic Reconfigurable Computing in the Last Level Cache. 102-117
Session 1C: Microarchitecture I
- Siavash Zangeneh, Stephen Pruett, Sangkug Lym, Yale N. Patt:

BranchNet: A Convolutional Neural Network to Predict Hard-To-Predict Branches. 118-130 - Samira Mirbagher Ajorpaz

, Elba Garza, Gilles Pokam, Daniel A. Jiménez
:
CHiRP: Control-Flow History Reuse Prediction. 131-145 - Tanvir Ahmed Khan

, Akshitha Sriraman, Joseph Devietti
, Gilles Pokam, Heiner Litz, Baris Kasikci
:
I-SPY: Context-Driven Conditional Instruction Prefetching with Coalescing. 146-159 - Jagadish B. Kotra, John Kalamatianos:

Improving the Utilization of Micro-operation Caches in x86 Processors. 160-172
Session 2A: Quantum Computing
- Casey Duckering, Jonathan M. Baker

, David I. Schuster
, Frederic T. Chong
:
Virtualized Logical Qubits: A 2.5D Architecture for Error-Corrected Quantum Computing. 173-185 - Pranav Gokhale, Ali Javadi-Abhari, Nathan Earnest, Yunong Shi, Frederic T. Chong

:
Optimized Quantum Compilation for Near-Term Algorithms with OpenPulse. 186-200 - Yongshan Ding

, Pranav Gokhale, Sophia Fuhui Lin, Richard Rines
, Thomas Propson
, Frederic T. Chong
:
Systematic Crosstalk Mitigation for Superconducting Qubits via Frequency-Aware Compilation. 201-214 - Mahabubul Alam, Abdullah Ash-Saki, Swaroop Ghosh:

Circuit Compilation Methodologies for Quantum Approximate Optimization Algorithm. 215-228
Session 2B: Robust Machine Learning
- Qiyu Wan, Xin Fu:

Fast-BCNN: Massive Neuron Skipping in Bayesian Convolutional Neural Networks. 229-240 - Yiming Gan, Yuxian Qiu, Jingwen Leng, Minyi Guo, Yuhao Zhu:

Ptolemy: Architecture Support for Robust Deep Learning. 241-255 - Gil Shomron, Uri C. Weiser:

Non-Blocking Simultaneous Multithreading: Embracing the Resiliency of Deep Neural Networks. 256-269 - Yi He

, Prasanna Balaprakash
, Yanjing Li:
FIdelity: Efficient Resilience Analysis Framework for Deep Learning Accelerators. 270-281
Session 2C: Memory I
- Minesh Patel, Jeremie S. Kim, Taha Shahroodi, Hasan Hassan, Onur Mutlu

:
Bit-Exact ECC Recovery (BEER): Determining DRAM On-Die ECC Functions by Exploiting DRAM Data Retention Characteristics. 282-297 - Lev Mukhanov, Dimitrios S. Nikolopoulos, Georgios Karakonstantis:

DStress: Automatic Synthesis of DRAM Reliability Stress Viruses using Genetic Algorithms. 298-312 - Yaohua Wang, Lois Orosa, Xiangjun Peng

, Yang Guo, Saugata Ghose, Minesh Patel, Jeremie S. Kim, Juan Gómez-Luna, Mohammad Sadrosadati, Nika Mansouri-Ghiasi, Onur Mutlu
:
FIGARO: Improving System Performance via Fine-Grained In-DRAM Data Relocation and Caching. 313-328 - Themis Melissaris, Markos Markakis

, Kelly A. Shaw, Margaret Martonosi:
PerpLE: Improving the Speed and Effectiveness of Memory Consistency Testing. 329-341
Session 3A: Near/In-Memory Computing
- Dibei Chen, Zhaoshi Li, Tianzhu Xiong, Zhiwei Liu, Jun Yang, Shouyi Yin, Shaojun Wei, Leibo Liu:

CATCAM: Constant-time Alteration Ternary CAM with Scalable In-Memory Architecture. 342-355 - Mohsen Imani, Saikishan Pampana, Saransh Gupta, Minxuan Zhou, Yeseong Kim, Tajana Rosing:

DUAL: Acceleration of Clustering Algorithms using Digital-based Processing In-Memory. 356-371 - Mingxuan He, Choungki Song, Ilkon Kim, Chunseok Jeong, Seho Kim, Il Park, Mithuna Thottethodi

, T. N. Vijaykumar:
Newton: A DRAM-maker's Accelerator-in-Memory (AiM) Architecture for Machine Learning. 372-385 - Shuotao Xu

, Thomas Bourgeat
, Tianhao Huang, Hojun Kim, Sungjin Lee, Arvind:
AQUOMAN: An Analytic-Query Offloading Machine. 386-399 - Salonik Resch, S. Karen Khatamifard, Zamshed I. Chowdhury, Masoud Zabihi, Zhengyang Zhao, M. Hüsrev Cilasun

, Jianping Wang, Sachin S. Sapatnekar
, Ulya R. Karpuzcu:
MOUSE: Inference In Non-volatile Memory for Energy Harvesting Applications. 400-414
Session 3B: Compilation, Modeling, and Simulation
- Jinhu Jiang

, Rongchao Dong, Zhongjun Zhou, Changheng Song, Wenwen Wang, Pen-Chung Yew
, Weihua Zhang:
More with Less - Deriving More Translation Rules with Less Training Data for DBTs Using Parameterization. 415-426 - Jie Zhao

, Peng Di
:
Optimizing the Memory Hierarchy by Compositing Automatic Transformations on Computations and Data. 427-441 - Alex Renda, Yishen Chen, Charith Mendis, Michael Carbin:

DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates. 442-455 - Mohammad Agbarya

, Idan Yaniv
, Jayneel Gandhi
, Dan Tsafrir:
Predicting Execution Times With Partial Simulations in Virtual Memory Research: Why and How. 456-470 - Samuel Rogers, Joshua Slycord, Mohammadreza Baharani

, Hamed Tabkhi:
gem5-SALAM: A System Architecture for LLVM-based Accelerator Modeling. 471-482
Session 3C: Non-volatile Memories
- Qiao Li

, Min Ye
, Yufei Cui
, Liang Shi, Xiaoqiang Li, Tei-Wei Kuo
, Chun Jason Xue:
Shaving Retries with Sentinels for Fast Read over High-Density 3D Flash. 483-495 - Zixuan Wang

, Xiao Liu, Jian Yang, Theodore Michailidis
, Steven Swanson
, Jishen Zhao:
Characterizing and Modeling Non-Volatile Memory Systems. 496-508 - Apostolos Kokolis

, Thomas Shull, Jian Huang, Josep Torrellas:
P-INSPECT: Architectural Support for Programmable Non-Volatile Memory Frameworks. 509-524 - Jungi Jeong, Jaewan Hong

, Seungryoul Maeng, Changhee Jung, Youngjin Kwon:
Unbounded Hardware Transactional Memory for a Hybrid DRAM/NVM Memory System. 525-538 - Sara Mahdizadeh-Shahri, Seyed Armin Vakil-Ghahani

, Aasheesh Kolli:
(Almost) Fence-less Persist Ordering. 539-554
Session 4A: Microarchitecture II
- Alberto Ros

, Stefanos Kaxiras:
Speculative Enforcement of Store Atomicity. 555-567 - Juan M. Cebrian

, Stefanos Kaxiras, Alberto Ros
:
Boosting Store Buffer Efficiency with Store-Prefetch Bursts. 568-580 - Minli Julie Liao, Jack Sampson:

D-SOAP: Dynamic Spatial Orientation Affinity Prediction for Caching in Multi-Orientation Memory Systems. 581-595 - Quan M. Nguyen, Daniel Sánchez:

Pipette: Improving Core Utilization on Irregular Applications through Intra-Core Pipeline Parallelism. 596-608 - Chao Zhang, Yuan Zeng, John Shalf

, Xiaochen Guo:
RnR: A Software-Assisted Record-and-Replay Hardware Prefetcher. 609-621
Session 4B: Resource Management
- Sheng-Chun Kao, Geonhwa Jeong, Tushar Krishna:

ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning. 622-636 - Liang Zhou, Laxmi N. Bhuyan, K. K. Ramakrishnan

:
Gemini: Learning to Manage CPU Power for Latency-Critical Search Engines. 637-349 - Neeraj Kulkarni, Gonzalo Gonzalez-Pumariega, Amulya Khurana, Christine A. Shoemaker

, Christina Delimitrou, David H. Albonesi:
CuttleSys: Data-Driven Resource Management for Interactive Services on Reconfigurable Multicores. 650-664 - Brian C. Schwedock

, Nathan Beckmann:
Jumanji: The Case for Dynamic NUCA in the Datacenter. 665-680 - Soroush Ghodrati, Byung Hoon Ahn, Joon Kyung Kim, Sean Kinzer, Brahmendra Reddy Yatham, Navateja Alla, Hardik Sharma, Mohammad Alian

, Eiman Ebrahimi, Nam Sung Kim, Cliff Young, Hadi Esmaeilzadeh:
Planaria: Dynamic Architecture Fission for Spatial Multi-Tenant Acceleration of Deep Neural Networks. 681-697
Session 4C: Machine Learning Accelerators I
- Zhuoran Song, Feiyang Wu, Xueyuan Liu, Jing Ke, Naifeng Jing, Xiaoyao Liang:

VR-DANN: Real-Time Video Recognition via Decoder-Assisted Neural Network Acceleration. 698-710 - Dingqing Yang

, Amin Ghasemazar, Xiaowei Ren, Maximilian Golub, Guy Lemieux, Mieszko Lis:
Procrustes: a Dataflow and Accelerator for Sparse Deep Neural Network Training. 711-724 - Hyeonjin Kim

, Sungwoo Ahn, Yunho Oh
, Bogil Kim, Won Woo Ro, William J. Song
:
Duplo: Lifting Redundant Memory Accesses of Deep Neural Networks for GPU Tensor Cores. 725-737 - Liu Liu

, Zheng Qu, Lei Deng
, Fengbin Tu
, Shuangchen Li, Xing Hu, Zhenyu Gu, Yufei Ding, Yuan Xie:
DUET: Boosting Deep Neural Network Efficiency on Dual-Module Architecture. 738-750
Session 5A: Machine Learning Accelerators II
- Huiyu Mo, Leibo Liu, Wenjing Hu, Wenping Zhu, Qiang Li, Ang Li

, Shouyi Yin, Jian Chen, Xiaowei Jiang, Shaojun Wei:
TFE: Energy-efficient Transferred Filter-based Engine to Compress and Accelerate Convolutional Neural Networks. 751-765 - Nitish Kumar Srivastava, Hanchen Jin, Jie Liu

, David H. Albonesi, Zhiru Zhang
:
MatRaptor: A Sparse-Sparse Matrix Multiplication Accelerator Based on Row-Wise Product. 766-780 - Mostafa Mahmoud, Isak Edo, Ali Hadi Zadeh

, Omar Mohamed Awad, Gennady Pekhimenko, Jorge Albericio, Andreas Moshovos:
TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training. 781-795 - Zhangxiaowen Gong, Houxiang Ji

, Christopher W. Fletcher, Christopher J. Hughes
, Sara S. Baghsorkhi, Josep Torrellas:
SAVE: Sparsity-Aware Vector Engine for Accelerating DNN Training and Inference on CPUs. 796-810 - Ali Hadi Zadeh

, Isak Edo, Omar Mohamed Awad, Andreas Moshovos:
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference. 811-824
Session 5B: Cloud and Datacenter
- Pyeongsu Park, Heetaek Jeong, Jangwoo Kim:

TrainBox: An Extreme-Scale Neural Network Training Server Architecture by Systematically Balancing Operations. 825-838 - Sulav Malla, Qingyuan Deng, Zoh Ebrahimzadeh, Joe Gasperetti, Sajal Jain, Parimala Kondety, Thiara Ortiz, Debra Vieira:

Coordinated Priority-aware Charging of Distributed Batteries in Oversubscribed Data Centers. 839-851 - Amirhossein Mirhosseini, Hossein Golestani

, Thomas F. Wenisch:
HyperPlane: A Scalable Low-Latency Notification Accelerator for Software Data Planes. 852-867 - Christian Pinto, Dimitris Syrivelis, Michele Gazzetti, Panos K. Koutsovasilis, Andrea Reale, Kostas Katrinis, H. Peter Hofstee:

ThymesisFlow: A Software-Defined, HW/SW co-Designed Interconnect Stack for Rack-Scale Memory Disaggregation. 868-880 - Tianyi Liu, Sen He

, Sunzhou Huang
, Danny H. K. Tsang, Lingjia Tang, Jason Mars, Wei Wang
:
A Benchmarking Framework for Interactive 3D Applications in the Cloud. 881-894
Session 5C: Domain-Specific Architecture
- Pengcheng Yao

, Long Zheng, Zhen Zeng, Yu Huang, Chuangyi Gui, Xiaofei Liao, Hai Jin, Jingling Xue:
A Locality-Aware Energy-Efficient Accelerator for Graph Mining Applications. 895-907 - Shafiur Rahman, Nael B. Abu-Ghazaleh, Rajiv Gupta

:
GraphPulse: An Event-Driven Hardware Accelerator for Asynchronous Graph Processing. 908-921 - Tong Geng, Ang Li, Runbin Shi, Chunshu Wu, Tianqi Wang, Yanfei Li, Pouya Haghi

, Antonino Tumeo, Shuai Che, Steven K. Reinhardt, Martin C. Herbordt:
AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing. 922-936 - Daichi Fujiki

, Shunhao Wu, Nathan Ozog, Kush Goliya, David T. Blaauw, Satish Narayanasamy
, Reetuparna Das
:
SeedEx: A Genome Sequencing Accelerator for Optimal Alignments in Subminimal Space. 937-950 - Damla Senol Cali

, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gómez-Luna, Amirali Boroumand, Anant Nori, Allison Scibisz, Sreenivas Subramoney
, Can Alkan
, Saugata Ghose, Onur Mutlu
:
GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis. 951-966
Session 6A: GPGPU
- Xia Zhao, Magnus Jahre

, Lieven Eeckhout:
Selective Replication in Memory-Side GPU Caches. 967-980 - Yuan-Hsi Chou

, Christopher Ng, Shaylin Cattell, Jeremy Intan, Matthew D. Sinclair, Joseph Devietti
, Timothy G. Rogers
, Tor M. Aamodt:
Deterministic Atomic Buffering. 981-995 - Hodjat Asghari Esfeden, AmirAli Abdolrashidi, Shafiur Rahman, Daniel Wong, Nael B. Abu-Ghazaleh:

BOW: Breathing Operand Windows to Exploit Bypassing in GPUs. 996-1008 - Lu Wang, Magnus Jahre

, Almutaz Adileh, Lieven Eeckhout:
MDM: The GPU Memory Divergence Model. 1009-1021 - Mahmoud Khairy, Vadim Nikiforov, David W. Nellans, Timothy G. Rogers

:
Locality-Centric Data and Threadblock Management for Massive GPUs. 1022-1036
Session 6B: Mobile and Embedded Architecture
- Yu Feng, Boyuan Tian, Tiancheng Xu

, Paul N. Whatmough, Yuhao Zhu:
Mesorasi: Architecture Support for Point Cloud Analytics via Delayed-Aggregation. 1037-1050 - Jawad Haj-Yahya, Mohammed Alser, Jeremie S. Kim, Lois Orosa

, Efraim Rotem, Avi Mendelson, Anupam Chattopadhyay, Onur Mutlu
:
FlexWatts: A Power- and Workload-Aware Hybrid Power Delivery Network for Energy-Efficient Microprocessors. 1051-1066 - Bo Yu, Wei Hu, Leimeng Xu, Jie Tang, Shaoshan Liu, Yuhao Zhu:

Building the Computing System for Autonomous Micromobility Vehicles: Design Constraints and Architectural Optimizations. 1067-1081 - Young Geun Kim, Carole-Jean Wu:

AutoScale: Energy Efficiency Optimization for Stochastic Edge Inference Using Reinforcement Learning. 1082-1096 - Tianyu Jia, Yuhao Ju, Russ Joseph, Jie Gu:

NCPU: An Embedded Neural CPU Architecture on Resource-Constrained Low Power Devices for Real-time End-to-End Performance. 1097-1109
Session 6C: Security and Privacy II
- Thomas Bourgeat

, Jules Drean, Yuheng Yang
, Lillian Tsai, Joel S. Emer, Mengjia Yan
:
CaSA: End-to-end Quantitative Security Analysis of Randomly Mapped Caches. 1110-1123 - Samira Mirbagher Ajorpaz

, Gilles Pokam, Esmaeil Mohammadian Koruyeh, Elba Garza, Nael B. Abu-Ghazaleh, Daniel A. Jiménez
:
PerSpectron: Detecting Invariant Footprints of Microarchitectural Attacks with Perceptron. 1124-1137 - Zirui Neil Zhao, Houxiang Ji

, Mengjia Yan
, Jiyong Yu
, Christopher W. Fletcher, Adam Morrison, Darko Marinov, Josep Torrellas:
Speculation Invariance (InvarSpec): Faster Safe Execution Through Program Analysis. 1138-1152 - Yonghae Kim, Jaekyu Lee

, Hyesoon Kim:
Hardware-based Always-On Heap Memory Safety. 1153-1166

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














