


default search action
ACM Transactions on Architecture and Code Optimization, Volume 19
Volume 19, Number 1, March 2022
- Aditya Ukarande

, Suryakant Patidar, Ram Rangan:
Locality-Aware CTA Scheduling for Gaming Applications. 1:1-1:26 - Hongzhi Liu

, Jie Luo, Ying Li, Zhonghai Wu
:
Iterative Compilation Optimization Based on Metric Learning and Collaborative Filtering. 2:1-2:25 - Muhammad Aditya Sasongko, Milind Chabbi, Mandana Bagheri-Marzijarani, Didem Unat

:
ReuseTracker: Fast Yet Accurate Multicore Reuse Distance Analyzer. 3:1-3:25 - Yaosheng Fu, Evgeny Bolotin, Niladrish Chatterjee, David W. Nellans, Stephen W. Keckler:

GPU Domain Specialization via Composable On-Package Architecture. 4:1-4:23 - Daeyeal Lee, Bill Lin, Chung-Kuan Cheng:

SMT-Based Contention-Free Task Mapping and Scheduling on 2D/3D SMART NoC with Mixed Dimension-Order Routing. 5:1-5:21 - Prasanth Chatarasi, Hyoukjun Kwon

, Angshuman Parashar, Michael Pellauer, Tushar Krishna, Vivek Sarkar:
Marvel: A Data-Centric Approach for Mapping Deep Learning Operators on Spatial Accelerators. 6:1-6:26 - Dennis Rieber, Axel Acosta, Holger Fröning:

Joint Program and Layout Transformations to Enable Convolutional Operators on Specialized Hardware Based on Constraint Programming. 7:1-7:26 - Mengya Lei

, Fan Li, Fang Wang, Dan Feng, Xiaomin Zou, Renzhi Xiao:
SecNVM: An Efficient and Write-Friendly Metadata Crash Consistency Scheme for Secure NVM. 8:1-8:26 - Bang Di, Daokun Hu

, Zhen Xie, Jianhua Sun, Hao Chen, Jinkui Ren, Dong Li:
TLB-pilot: Mitigating TLB Contention Attack on GPUs with Microarchitecture-Aware Scheduling. 9:1-9:23 - Gururaj Saileshwar

, Rick Boivie, Tong Chen, Benjamin Segal, Alper Buyuktosunoglu:
HeapCheck: Low-cost Hardware Support for Memory Safety. 10:1-10:24 - Muhammad Waqar Azhar

, Miquel Pericàs, Per Stenström:
Task-RM: A Resource Manager for Energy Reduction in Task-Parallel Applications under Quality of Service Constraints. 11:1-11:26 - Cesar Gomes

, Maziar Amiraski, Mark Hempstead
:
CASHT: Contention Analysis in Shared Hierarchies with Thefts. 12:1-12:27 - Yufei Wang

, Xiaoshe Dong
, Longxiang Wang, Weiduo Chen, Xingjun Zhang
:
Optimizing Small-Sample Disk Fault Detection Based on LSTM-GAN Model. 13:1-13:24 - Franyell Silfa

, José-María Arnau, Antonio González:
E-BATCH: Energy-Efficient and High-Throughput RNN Batching. 14:1-14:23 - Chen Ding

, Dong Chen
, Fangzhou Liu
, Benjamin Reber
, Wesley Smith
:
CARL: Compiler Assigned Reference Leasing. 15:1-15:28
Volume 19, Number 2, June 2022
- Christof Schlaak

, Tzung-Han Juang
, Christophe Dubach
:
Memory-Aware Functional IR for Higher-Level Synthesis of Accelerators. 16:1-16:26 - Kartik Lakshminarasimhan

, Ajeya Naithani
, Josué Feliu
, Lieven Eeckhout:
The Forward Slice Core: A High-Performance, Yet Low-Complexity Microarchitecture. 17:1-17:25 - Sharanyan Srikanthan, Sayak Chakraborti

, Princeton Ferro, Sandhya Dwarkadas
:
MAPPER: Managing Application Performance via Parallel Efficiency Regulation*. 18:1-18:26 - Athanasios Tziouvaras

, Georgios Dimitriou, Georgios I. Stamoulis:
Low-power Near-data Instruction Execution Leveraging Opcode-based Timing Analysis. 19:1-19:26 - Xingguo Jia

, Jin Zhang, Boshi Yu, Xingyue Qian, Zhengwei Qi, Haibing Guan:
GiantVM: A Novel Distributed Hypervisor for Resource Aggregation with DSM-aware Optimizations. 20:1-20:27 - Mehrzad Nejat

, Madhavan Manivannan, Miquel Pericàs, Per Stenström:
Cooperative Slack Management: Saving Energy of Multicore Processors by Trading Performance Slack Between QoS-Constrained Applications. 21:1-21:27 - Hugo Pompougnac, Ulysse Beaugnon, Albert Cohen, Dumitru Potop-Butucaru:

Weaving Synchronous Reactions into the Fabric of SSA-form Compilers. 22:1-22:25 - Ghassan Shobaki

, Vahl Scott Gordon, Paul McHugh, Theodore Dubois, Austin Kerbow:
Register-Pressure-Aware Instruction Scheduling Using Ant Colony Optimization. 23:1-23:23 - Qihan Wang

, Zhen Peng, Bin Ren, Jie Chen, Robert G. Edwards:
MemHC: An Optimized GPU Memory Management Framework for Accelerating Many-body Correlation. 24:1-24:26 - Rakesh Kumar

, Mehdi Alipour, David Black-Schaffer:
Dependence-aware Slice Execution to Boost MLP in Slice-out-of-order Cores. 25:1-25:28 - Nandita Vijaykumar, Ataberk Olgun

, Konstantinos Kanellopoulos, F. Nisa Bostanci, Hasan Hassan, Mehrshad Lotfi, Phillip B. Gibbons
, Onur Mutlu
:
MetaSys: A Practical Open-source Metadata Management System to Implement and Evaluate Cross-layer Optimizations. 26:1-26:29 - Jing Chen

, Madhavan Manivannan, Mustafa Abduljabbar
, Miquel Pericàs:
ERASE: Energy Efficient Task Mapping and Resource Management for Work Stealing Runtimes. 27:1-27:29 - Chencheng Ye

, Yuanchao Xu, Xipeng Shen
, Hai Jin, Xiaofei Liao, Yan Solihin:
Preserving Addressability Upon GC-Triggered Data Movements on Non-Volatile Memory. 28:1-28:26 - George Michelogiannakis

, Benjamin Klenk
, Brandon Cook
, Min Yee Teh
, Madeleine Glick
, Larry Dennison
, Keren Bergman
, John Shalf
:
A Case For Intra-rack Resource Disaggregation in HPC. 29:1-29:26
Volume 19, Number 3, September 2022
- Ping Wang

, Fei Wen
, Paul V. Gratz
, Alex Sprintson
:
SIMD-Matcher: A SIMD-based Arbitrary Matching Framework. 30:1-30:20 - Marcel Mettler

, Martin Rapp, Heba Khdr
, Daniel Mueller-Gritschneder
, Jörg Henkel, Ulf Schlichtmann:
An FPGA-based Approach to Evaluate Thermal and Resource Management Strategies of Many-core Processors. 31:1-31:24 - Paschalis Mpeis

, Pavlos Petoumenos
, Kim M. Hazelwood, Hugh Leather:
Object Intersection Captures on Interactive Apps to Drive a Crowd-sourced Replay-based Compiler Optimization. 32:1-32:25 - Cunlu Li

, Dezun Dong, Xiangke Liao:
MUA-Router: Maximizing the Utility-of-Allocation for On-chip Pipelining Routers. 33:1-33:23 - Ziaul Choudhury

, Shashwat Shrivastava, Lavanya Ramapantulu, Suresh Purini:
An FPGA Overlay for CNN Inference with Fine-grained Flexible Parallelism. 34:1-34:26 - Diksha Moolchandani

, Anshul Kumar, Smruti R. Sarangi:
Performance and Power Prediction for Concurrent Execution on GPUs. 35:1-35:27 - Ali Jahanshahi

, Nanpeng Yu
, Daniel Wong:
PowerMorph: QoS-Aware Server Power Reshaping for Data Center Regulation Service. 36:1-36:27 - Peng Xu

, Nannan Zhao, Jiguang Wan, Wei Liu, Shuning Chen
, Yuanhui Zhou, Hadeel Albahar
, Hanyang Liu, Liu Tang, Zhi-hu Tan:
Building a Fast and Efficient LSM-tree Store by Integrating Local Storage with Cloud Storage. 37:1-37:26 - Horng-Ruey Huang

, Ding-Yong Hong
, Jan-Jan Wu, Kung-Fu Chen, Pangfeng Liu, Wei-Chung Hsu:
Accelerating Video Captioning on Heterogeneous System Architectures. 38:1-38:25 - Shivam Kundan

, Theodoros Marinakis, Iraklis Anagnostopoulos
, Dimitri Kagaris:
A Pressure-Aware Policy for Contention Minimization on Multicore Systems. 40:1-40:26 - Johnathan Alsop

, Weon Taek Na
, Matthew D. Sinclair
, Samuel Grayson
, Sarita V. Adve
:
A Case for Fine-grain Coherence Specialization in Heterogeneous Systems. 41:1-41:26 - Mohammadreza Soltaniyeh

, Richard P. Martin
, Santosh Nagarakatte
:
An Accelerator for Sparse Convolutional Neural Networks Leveraging Systolic General Matrix-matrix Multiplication. 42:1-42:26 - Dharanidhar Dang

, Bill Lin
, Debashis Sahoo
:
LiteCON: An All-photonic Neuromorphic Accelerator for Energy-efficient Deep Learning. 43:1-43:22 - Lokesh Siddhu

, Rajesh Kedia
, Shailja Pandey
, Martin Rapp
, Anuj Pathania
, Jörg Henkel
, Preeti Ranjan Panda
:
CoMeT: An Integrated Interval Thermal Simulation Toolchain for 2D, 2.5D, and 3D Processor-Memory Systems. 44:1-44:25 - Matthew Benjamin Olson

, Brandon Kammerdiener
, Michael R. Jantz
, Kshitij A. Doshi
, Terry R. Jones
:
Online Application Guidance for Heterogeneous Memory Systems. 45:1-45:27 - Bruno Chinelato Honorio

, João P. L. de Carvalho
, Catalina Munoz Morales
, Alexandro Baldassin
, Guido Araujo
:
Using Barrier Elision to Improve Transactional Code Generation. 46:1-46:23
Volume 19, Number 4, December 2022
- Jiansong Li

, Xueying Wang
, Xiaobing Chen, Guangli Li, Xiao Dong
, Peng Zhao
, Xianzhi Yu, Yongxin Yang, Wei Cao
, Lei Liu, Xiaobing Feng:
An Application-oblivious Memory Scheduling System for DNN Accelerators. 47:1-47:26 - Aditya Narayan

, Yvain Thonnart
, Pascal Vivet
, Ayse K. Coskun
, Ajay Joshi
:
Architecting Optically Controlled Phase Change Memory. 48:1-48:26 - Chao Zhang

, Maximilian H. Bremer
, Cy P. Chan
, John Shalf
, Xiaochen Guo
:
ASA: Accelerating Sparse Accumulation in Column-wise SpGEMM. 49:1-49:24 - Aart J. C. Bik, Penporn Koanantakool

, Tatiana Shpeisman
, Nicolas Vasilache
, Bixia Zheng
, Fredrik Kjolstad
:
Compiler Support for Sparse Tensor Computations in MLIR. 50:1-50:25 - Pierre Michaud

, Anis Peysieux
:
HAIR: Halving the Area of the Integer Register File with Odd/Even Banking. 51:1-51:25 - Amirreza Yousefzadeh

, Jan Stuijt
, Martijn Hijdra
, Hsiao-Hsuan Liu
, Anteneh Gebregiorgis
, Abhairaj Singh
, Said Hamdioui
, Francky Catthoor
:
Energy-efficient In-Memory Address Calculation. 52:1-52:16 - Hwisoo So

, Moslem Didehban
, Yohan Ko
, Aviral Shrivastava
, Kyoungwoo Lee
:
EXPERTISE: An Effective Software-level Redundant Multithreading Scheme against Hardware Faults. 53:1-53:26 - Tim Hartley

, Foivos S. Zakkak
, Andy Nisbet
, Christos Kotselidis
, Mikel Luján
:
Just-In-Time Compilation on ARM - A Closer Look at Call-Site Code Consistency. 54:1-54:23 - Erling Rennemo Jellum

, Milica Orlandic
, Edmund Brekke
, Tor Arne Johansen, Torleiv H. Bryne
:
Solving Sparse Assignment Problems on FPGAs. 55:1-55:20 - Yuhao Li

, Benjamin C. Lee
:
Phronesis: Efficient Performance Modeling for High-dimensional Configuration Tuning. 56:1-56:26 - Chandrahas Tirumalasetty

, Chih-Chieh Chou
, A. L. Narasimha Reddy
, Paul Gratz
, Ayman Abouelwafa
:
Reducing Minor Page Fault Overheads through Enhanced Page Walker. 57:1-57:26 - Lan Gao

, Jing Wang
, Weigong Zhang:
Adaptive Contention Management for Fine-Grained Synchronization on Commodity GPUs. 58:1-58:21 - Ruobing Han

, Jaewon Lee
, Jaewoong Sim
, Hyesoon Kim
:
COX : Exposing CUDA Warp-level Functions to CPUs. 59:1-59:25 - Yiding Liu

, Xingyao Zhang
, Donglin Zhuang
, Xin Fu
, Shuaiwen Song
:
DynamAP: Architectural Support for Dynamic Graph Traversal on the Automata Processor. 60:1-60:26 - Changwei Zou

, Yaoqing Gao
, Jingling Xue
:
Practical Software-Based Shadow Stacks on x86-64. 61:1-61:26

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














