default search action
ASPLOS 2021: Virtual Event, USA
- Tim Sherwood, Emery D. Berger, Christos Kozyrakis:
ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Virtual Event, USA, April 19-23, 2021. ACM 2021, ISBN 978-1-4503-8317-2
Session 1: Packet Up
- Alireza Farshin, Tom Barbette, Amir Roozbeh, Gerald Q. Maguire Jr., Dejan Kostic:
PacketMill: toward per-Core 100-Gbps networking. 1-17 - Boris Pismenny, Haggai Eran, Aviad Yehezkel, Liran Liss, Adam Morrison, Dan Tsafrir:
Autonomous NIC offloads. 18-35 - Nikita Lazarev, Shaojie Xiang, Neil Adit, Zhiru Zhang, Christina Delimitrou:
Dagger: efficient and fast RPCs in cloud microservices with near-memory reconfigurable NICs. 36-51
Session 2: Memory Systems
- Sungbo Park, Ingab Kang, Yaebin Moon, Jung Ho Ahn, G. Edward Suh:
BCD deduplication: effective memory compression using partial cache-line deduplication. 52-64 - Sudarsun Kannan, Yujie Ren, Abhishek Bhattacharjee:
KLOCs: kernel-level object contexts for heterogeneous memory systems. 65-78 - Irina Calciu, M. Talha Imran, Ivan Puddu, Sanidhya Kashyap, Hasan Al Maruf, Onur Mutlu, Aasheesh Kolli:
Rethinking software runtimes for disaggregated memory. 79-92
Session 3: Flow
- Dong Kai Wang, Nam Sung Kim:
DiAG: a dataflow-inspired architecture for general-purpose processors. 93-106 - Anand Jayarajan, Kimberly Hau, Andrew Goodwin, Gennady Pekhimenko:
LifeStream: a high-performance stream processing engine for periodic streams. 107-122 - Yue Zha, Jing Li:
When application-specific ISA meets FPGAs: a multi-layer virtualization framework for heterogeneous cloud FPGAs. 123-134
Session 4: Microservices
- Yu Gan, Mingyu Liang, Sundar Dev, David Lo, Christina Delimitrou:
Sage: practical and scalable ML-driven performance debugging in microservices. 135-151 - Zhipeng Jia, Emmett Witchel:
Nightcore: efficient and scalable serverless computing for latency-sensitive, interactive microservices. 152-166 - Yanqi Zhang, Weizhe Hua, Zhuangzhuang Zhou, G. Edward Suh, Christina Delimitrou:
Sinan: ML-based and QoS-aware resource management for cloud microservices. 167-181
Session 5: Pages and Machine Architecture
- Ali Hajiabadi, Andreas Diavastos, Trevor E. Carlson:
NOREBA: a compiler-informed non-speculative out-of-order commit processor. 182-193 - Ashish Panwar, Reto Achermann, Arkaprava Basu, Abhishek Bhattacharjee, K. Gopinath, Jayneel Gandhi:
Fast local page-tables for virtualized NUMA servers with vMitosis. 194-210 - Artemiy Margaritov, Dmitrii Ustiugov, Amna Shahab, Boris Grot:
PTEMagnet: fine-grained physical memory reservation for faster page walks in public clouds. 211-223
Session 6: Languages and Systems I
- Shengjie Xu, Wei Huang, David Lie:
In-fat pointer: hardware-assisted tagged-pointer spatial memory safety defense with subobject granularity protection. 224-240 - Mengchi Zhang, Ahmad Alawneh, Timothy G. Rogers:
Judging a type by its pointer: optimizing GPU virtual functions. 241-254 - Adrien Ghosn, Marios Kogias, Mathias Payer, James R. Larus, Edouard Bugnion:
Enclosure: language-based restriction of untrusted libraries. 255-267
Session 7: Towards Improved Throughputs
- Marcel Blöcher, Lin Wang, Patrick Eugster, Max Schmidt:
Switches for HIRE: resource scheduling for data center in-network computing. 268-285 - Qiao Kang, Jiarong Xing, Yiming Qiu, Ang Chen:
Probabilistic profiling of stateful data planes for adversarial testing. 286-301 - Yejin Lee, Seong Hoon Seo, Hyunji Choi, Hyoung Uk Sul, Soosung Kim, Jae W. Lee, Tae Jun Ham:
MERCI: efficient embedding reduction on commodity hardware via sub-query memoization. 302-313
Session 8: Tools and Frameworks
- Guangpu Li, Dongjie Chen, Shan Lu, Madanlal Musuvathi, Suman Nath:
SherLock: unsupervised synchronization-operation inference. 314-328 - Nastaran Hajinazar, Geraldo F. Oliveira, Sven Gregorio, João Dinis Ferreira, Nika Mansouri-Ghiasi, Minesh Patel, Mohammed Alser, Saugata Ghose, Juan Gómez-Luna, Onur Mutlu:
SIMDRAM: a framework for bit-serial SIMD processing using DRAM. 329-345 - Yi Xu, Joseph Izraelevitz, Steven Swanson:
Clobber-NVM: log less, re-execute more. 346-359
Session 9: Mapping and Management of Quantum and Cloud
- Chi Zhang, Ari B. Hayes, Longfei Qiu, Yuwei Jin, Yan-Hao Chen, Eddy Z. Zhang:
Time-optimal Qubit mapping. 360-374 - Casey Duckering, Jonathan M. Baker, Andrew Litteken, Frederic T. Chong:
Orchestrated trios: compiling for efficient communication in Quantum programs with 3-Qubit gates. 375-385 - Alexander Fuerst, Prateek Sharma:
FaasCache: keeping serverless computing alive with greedy-dual caching. 386-400
Session 10: Persistence I
- Ian Neal, Andrew Quinn, Baris Kasikci:
Hippocrates: healing persistent memory bugs without doing any harm. 401-414 - Hamed Gorjiara, Guoqing Harry Xu, Brian Demsky:
Jaaru: efficiently model checking persistent memory programs. 415-428 - Morteza Hoseinzadeh, Steven Swanson:
Corundum: statically-enforced persistent memory safety. 429-442
Session 11: Quantum Abstractions
- Tirthak Patel, Devesh Tiwari:
Qraft: reverse your Quantum circuit and know the correct program output. 443-455 - Yipeng Huang, Steven Holtzen, Todd D. Millstein, Guy Van den Broeck, Margaret Martonosi:
Logical abstractions for noisy variational Quantum algorithm simulation. 456-472 - Wei Tang, Teague Tomesh, Martin Suchara, Jeffrey Larson, Margaret Martonosi:
CutQC: using small Quantum computers for large Quantum circuit evaluations. 473-486
Session 12: Persistence II
- Sihang Liu, Suyash Mahar, Baishakhi Ray, Samira Manabi Khan:
PMFuzz: test case generation for persistent memory programs. 487-502 - Bang Di, Jiawen Liu, Hao Chen, Dong Li:
Fast, flexible, and comprehensive bug detection for persistent memory programs. 503-516 - Jungi Jeong, Changhee Jung:
PMEM-spec: persistent memory speculation (strict persistency can trump relaxed persistency). 517-529
Session 13: Systems Software
- Jonas Oberhauser, Rafael Lourenco de Lima Chehab, Diogo Behrens, Ming Fu, Antonio Paolillo, Lilith Oberhauser, Koustubha Bhat, Yuzhong Wen, Haibo Chen, Jaeho Kim, Viktor Vafeiadis:
VSync: push-button verification and optimization for synchronization primitives on weak memory models. 530-545 - Vasily A. Sartakov, Lluís Vilanova, Peter R. Pietzuch:
CubicleOS: a library OS with software componentisation for practical isolation. 546-558 - Dmitrii Ustiugov, Plamen Petrov, Marios Kogias, Edouard Bugnion, Boris Grot:
Benchmarking, analysis, and optimization of serverless function snapshots. 559-572
Session 14: Beyond the Pixels
- Venkatesh Kodukula, Alexander Shearer, Van Nguyen, Srinivas Lingutla, Yifei Liu, Robert LiKamWa:
Rhythmic pixel regions: multi-resolution visual sensing system towards high-precision visual computing at low power. 573-586 - Chenhao Xie, Xie Li, Yang Hu, Huwan Peng, Michael B. Taylor, Shuaiwen Leon Song:
Q-VR: system-level design for future mobile collaborative virtual reality. 587-599 - Parthasarathy Ranganathan, Daniel Stodolsky, Jeff Calow, Jeremy Dorfman, Marisabel Guevara, Clinton Wills Smullen IV, Aki Kuusela, Raghu Balasubramanian, Sandeep Bhatia, Prakash Chauhan, Anna Cheung, In Suk Chong, Niranjani Dasharathi, Jia Feng, Brian Fosco, Samuel Foss, Ben Gelb, Sara J. Gwin, Yoshiaki Hase, Da-ke He, C. Richard Ho, Roy W. Huffman Jr., Elisha Indupalli, Indira Jayaram, Poonacha Kongetira, Cho Mon Kyaw, Aaron Laursen, Yuan Li, Fong Lou, Kyle A. Lucke, J. P. Maaninen, Ramon Macias, Maire Mahony, David Alexander Munday, Srikanth Muroor, Narayana Penukonda, Eric Perkins-Argueta, Devin Persaud, Alex Ramírez, Ville-Mikko Rautio, Yolanda Ripley, Amir Salek, Sathish Sekar, Sergey N. Sokolov, Rob Springer, Don Stark, Mercedes Tan, Mark S. Wachsler, Andrew C. Walton, David A. Wickeraad, Alvin Wijaya, Hon Kwan Wu:
Warehouse-scale video acceleration: co-design and deployment in the wild. 600-615
Session 15: Races and Concurrency
- Ziheng Liu, Shuofei Zhu, Boqin Qin, Hao Chen, Linhai Song:
Automatically detecting and fixing concurrency bugs in go software systems. 616-629 - Weiyu Luo, Brian Demsky:
C11Tester: a race detector for C/C++ atomics. 630-646 - Adil Ahmad, Sangho Lee, Pedro Fonseca, Byoungyoung Lee:
Kard: lightweight data race detection with per-thread memory protection. 647-660
Session 16: Robots, Optimization, and Robo-Optimization
- Ramyad Hadidi, Bahar Asgari, Sam Jijina, Adriana Amyette, Nima Shoghi, Hyesoon Kim:
Quantifying the design-space tradeoffs in autonomous drones. 661-673 - Sabrina M. Neuman, Brian Plancher, Thomas Bourgeat, Thierry Tambe, Srinivas Devadas, Vijay Janapa Reddi:
Robomorphic computing: a design methodology for domain-specific accelerators parameterized by robot morphology. 674-686 - Guowei Zhang, Nithya Attaluri, Joel S. Emer, Daniel Sánchez:
Gamma: leveraging Gustavson's algorithm to accelerate sparse matrix multiplication. 687-701
Session 17: Solid State Drives
- Jisung Park, Myungsuk Kim, Myoungjun Chun, Lois Orosa, Jihong Kim, Onur Mutlu:
Reducing solid-state drive read latency by optimizing read-retry. 702-716 - Mark Wilkening, Udit Gupta, Samuel Hsia, Caroline Trippel, Carole-Jean Wu, David Brooks, Gu-Yeon Wei:
RecSSD: near data processing for solid state drive based recommendation inference. 717-729 - Chun-Yi Liu, Yunju Lee, Myoungsoo Jung, Mahmut Taylan Kandemir, Wonil Choi:
Prolonging 3D NAND SSD lifetime via read latency relaxation. 730-742
Session 18: Security I
- Victor Duta, Cristiano Giuffrida, Herbert Bos, Erik van der Kouwe:
PIBE: practical kernel control-flow hardening with profile-guided indirect branch elimination. 743-757 - Dmitry Evtyushkin, Thomas Benjamin, Jesse Elwell, Jeffrey A. Eitel, Angelo Sapello, Abhrajit Ghosh:
Computing with time: microarchitectural weird machines. 758-772 - Daming D. Chen, Wen Shih Lim, Mohammad Bakhshalipour, Phillip B. Gibbons, James C. Hoe, Bryan Parno:
HerQules: securing programs via hardware-enforced message queues. 773-788
Session 19: Better Hardware through Compilers
- Clément Pit-Claudel, Thomas Bourgeat, Stella Lau, Arvind, Adam Chlipala:
Effective simulation and debugging for a high-level hardware language using software compilers. 789-803 - Rachit Nigam, Samuel Thomas, Zhijing Li, Adrian Sampson:
A compiler infrastructure for accelerator generators. 804-817 - Joshua Landgraf, Tiffany Yang, Will Lin, Christopher J. Rossbach, Eric Schkufza:
Compiler-driven FPGA virtualization with SYNERGY. 818-831
Session 20: Data Driven Optimization
- Subho S. Banerjee, Saurabh Jha, Zbigniew Kalbarczyk, Ravishankar K. Iyer:
BayesPerf: minimizing performance monitoring errors using Bayesian statistics. 832-844 - Sai Qian Zhang, Bradley McDanel, H. T. Kung, Xin Dong:
Training for multi-resolution inference using reusable quantization terms. 845-860 - Zhan Shi, Akanksha Jain, Kevin Swersky, Milad Hashemi, Parthasarathy Ranganathan, Calvin Lin:
A hierarchical neural model of data prefetching. 861-873
Session 21: Supporting Hardware Parallelism
- Alexa VanHattum, Rachit Nigam, Vincent T. Lee, James Bornholt, Adrian Sampson:
Vectorization for digital signal processors via equality saturation. 874-886 - Junqiao Qiu, Xiaofan Sun, Amir Hossein Nodehi Sabet, Zhijia Zhao:
Scalable FSM parallelization via path fusion and higher-order speculation. 887-901 - Yishen Chen, Charith Mendis, Michael Carbin, Saman P. Amarasinghe:
VeGen: a vectorizer generator for SIMD and beyond. 902-914
Session 22: Neural Net Optimization
- Jack Turner, Elliot J. Crowley, Michael F. P. O'Boyle:
Neural architecture search as program transformation exploration. 915-927 - Rui Li, Yufan Xu, Aravind Sukumaran-Rajam, Atanas Rountev, P. Sadayappan:
Analytical characterization and design space exploration for optimization of CNNs. 928-942 - Kartik Hegde, Po-An Tsai, Sitao Huang, Vikas Chandra, Angshuman Parashar, Christopher W. Fletcher:
Mind mappings: enabling efficient algorithm-accelerator mapping space search. 943-958
Session 23: Beyond Neural Nets
- Xiangyu Zhang, Ramin Bashizade, Yicheng Wang, Sayan Mukherjee, Alvin R. Lebeck:
Statistical robustness of Markov chain Monte Carlo accelerators. 959-974 - Hunjun Lee, Chanmyeong Kim, Yujin Chung, Jangwoo Kim:
NeuroEngine: a hardware-based event-driven simulation system for advanced brain-inspired computing. 975-989 - Amira Guesmi, Ihsen Alouani, Khaled N. Khasawneh, Mouna Baklouti, Tarek Frikha, Mohamed Abid, Nael B. Abu-Ghazaleh:
Defensive approximation: securing CNNs using approximate computing. 990-1003
Session 24: Languages and Systems II
- Theodoros Kasampalis, Daejun Park, Zhengyao Lin, Vikram S. Adve, Grigore Rosu:
Language-parametric compiler validation with application to LLVM. 1004-1019 - Xiaozhu Meng, Weijie Liu:
Incremental CFG patching for binary rewriting. 1020-1033 - Giuseppe Antonio Di Luna, Davide Italiano, Luca Massarelli, Sebastian Österlund, Cristiano Giuffrida, Leonardo Querzoni:
Who's debugging the debuggers? exposing debug information bugs in optimized binaries. 1034-1045
Session 25: Security II
- Mohammad Behnia, Prateek Sahu, Riccardo Paccagnella, Jiyong Yu, Zirui Neil Zhao, Xiang Zou, Thomas Unterluggauer, Josep Torrellas, Carlos V. Rozas, Adam Morrison, Frank McKeen, Fangfei Liu, Ron Gabor, Christopher W. Fletcher, Abhishek Basak, Alaa R. Alameldeen:
Speculative interference attacks: breaking invisible speculation schemes. 1046-1060 - Dimitrios Skarlatos, Zirui Neil Zhao, Riccardo Paccagnella, Christopher W. Fletcher, Josep Torrellas:
Jamais vu: thwarting microarchitectural replay attacks. 1061-1076 - Gururaj Saileshwar, Christopher W. Fletcher, Moinuddin K. Qureshi:
Streamline: a fast, flushless cache covert-channel attack by enabling asynchronous collusion. 1077-1090
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.