


19th CGO 2021: Seoul, South Korea
- Jae W. Lee, Mary Lou Soffa, Ayal Zaks:
  IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2021, Seoul, South Korea, February 27 - March 3, 2021. IEEE 2021, ISBN 978-1-7281-8613-9
Frontmatter
- Jae W. Lee:
  Message from the General Chair. iii-iv
- Mary Lou Soffa, Ayal Zaks:
  Message from the Program Chairs. v
- Jubi Taneja, Michel Steuwer:
  Report from the Artifact Evaluation Committee. x-xi
Keynote
- Mary W. Hall:
  Data Layout and Data Representation Optimizations to Reduce Data Movement Keynote. 1
Compiler Infrastructure
- Chris Lattner, Mehdi Amini, Uday Bondhugula, Albert Cohen, Andy Davis, Jacques A. Pienaar, River Riddle, Tatiana Shpeisman, Nicolas Vasilache, Oleksandr Zinenko:
  MLIR: Scaling Compiler Infrastructure for Domain Specific Computation. 2-14
- Lorenzo Chelini, Andi Drebes, Oleksandr Zinenko, Albert Cohen, Nicolas Vasilache, Tobias Grosser, Henk Corporaal:
  Progressive Raising in Multi-level IR. 15-26
- Thomas Koehler, Michel Steuwer:
  Towards a Domain-Extensible Compiler: Optimizing an Image Processing Pipeline on Mobile CPUs. 27-38
- Ajay Brahmakshatriya, Saman P. Amarasinghe:
  BuildIt: A Type-Based Multi-stage Programming Framework for Code Generation in C++. 39-51
Dealing with Precision
- Joao Rivera, Franz Franchetti, Markus Püschel:
  An Interval Compiler for Sound Floating-Point Computations. 52-64
- Tiago Trevisan Jost, Yves Durand, Christian Fabre, Albert Cohen, Frédéric Pétrot:
  Seamless Compiler Integration of Variable Precision Floating-Point Arithmetic. 65-76
- Jian Weng, Animesh Jain, Jie Wang, Leyuan Wang, Yida Wang, Tony Nowatzki:
  UNIT: Unifying Tensorized Instruction Compilation. 77-89
- Guangli Li, Jingling Xue, Lei Liu, Xueying Wang, Xiu Ma, Xiao Dong, Jiansong Li, Xiaobing Feng:
  Unleashing the Low-Precision Computation Potential of Tensor Cores on GPUs. 90-102
Binary Profiling, Tracing, Sampling
- Mahwish Arif, Ruoyu Zhou, Hsi-Ming Ho, Timothy M. Jones:
  Cinnamon: A Domain-Specific Language for Binary Profiling and Monitoring. 103-114
- Keren Zhou, Xiaozhu Meng, Ryuichi Sai, John M. Mellor-Crummey:
  GPA: A GPU Performance Advisor Based on Instruction Sampling. 115-125
- Harish Patil, Alexander Isaev, Wim Heirman, Alen Sabu, Ali Hajiabadi, Trevor E. Carlson:
  ELFies: Executable Region Checkpoints for Performance Analysis and Simulation. 126-136
- David Pankratz, Tyler Nowicki, Ahmed Eltantawy, José Nelson Amaral:
  Vulkan Vision: Ray Tracing Workload Characterization using Automatic Graphics Instrumentation. 137-149
Parallelism - Optimizing, Modeling, Testing
- Christos Vasiladiotis, Roberto Castañeda Lozano, Murray Cole, Björn Franke:
  Loop Parallelization using Dynamic Commutativity Analysis. 150-161
- Seungbin Song, Heelim Choi, Hanjun Kim:
  Fine-Grained Pipeline Parallelization for Network Function Programs. 162-173
- Christie L. Alappat, Johannes Seiferth, Georg Hager, Matthias Korch, Thomas Rauber, Gerhard Wellein:
  YaskSite: Stencil Optimization Techniques Applied to Explicit ODE Methods on Modern Architectures. 174-186
- Ting Yuan, Guangwei Li, Jie Lu, Chen Liu, Lian Li, Jingling Xue:
  GoBench: A Benchmark Suite of Real-World Go Concurrency Bugs. 187-199
Memory Optimization and Safeness
- Luigi D. C. Soares, Fernando Magno Quintão Pereira:
  Memory-Safe Elimination of Side Channels. 200-210
- Naveen Namashivavam, Sanyam Mehta, Pen-Chung Yew:
  Variable-Sized Blocks for Locality-Aware SpMV. 211-221
- Mohamad Barbar, Yulei Sui, Shiping Chen:
  Object Versioning for Flow-Sensitive Pointer Analysis. 222-235
- Haofeng Li, Haining Meng, Hengjie Zheng, Liqing Cao, Jie Lu, Lian Li, Lin Gao:
  Scaling Up the IFDS Algorithm with Efficient Disk-Assisted Computing. 236-247
Compiling Graph Algorithms, Compiling for GPUs
- Ajay Brahmakshatriya, Yunming Zhang, Changwan Hong, Shoaib Kamil, Julian Shun, Saman P. Amarasinghe:
  Compiling Graph Applications for GPUs with GraphIt. 248-261
- Ruohuang Zheng, Sreepathi Pai:
  Efficient Execution of Graph Algorithms on CPU with SIMD Extensions. 262-276
- Alexander Krolik, Clark Verbrugge, Laurie J. Hendren:
  r3d3: Optimized Query Compilation on GPUs. 277-288
- Guei-Yuan Lueh, Kaiyu Chen, Gang Chen, Joel Fuentes, Wei-Yu Chen, Fangwen Fu, Hong Jiang, Hongzheng Li, Daniel Rhee:
  C-for-Metal: High Performance SIMD Programming on Intel GPUs. 289-300
Compiling for Spatial, Quantum, and Embedded Devices
- Ji Liu, Luciano Bello, Huiyang Zhou:
  Relaxed Peephole Optimization: A Novel Compiler Optimization for Quantum Circuits. 301-314
- Johannes de Fine Licht, Andreas Kuster, Tiziano De Matteis, Tal Ben-Nun, Dominic Hofer, Torsten Hoefler:
  StencilFlow: Mapping Large Stencil Programs to Distributed Spatial Computing Systems. 315-326
- Changsu Kim, Shinnung Jeong, Sungjun Cho, Yongwoo Lee, William Song, Youngsok Kim, Hanjun Kim:
  Thread-Aware Area-Efficient High-Level Synthesis Compiler for Embedded Devices. 327-339
JIT and Binary Translation
- Guilherme Ottoni, Bin Liu:
  HHVM Jump-Start: Boosting Both Warmup and Steady-State Performance at Scale. 340-350
- Ziyi Zhao, Zhang Jiang, Ying Chen, Xiaoli Gong, Wenwen Wang, Pen-Chung Yew:
  Enhancing Atomic Instruction Emulation for Cross-ISA Dynamic Binary Translation. 351-362
- Milind Chabbi, Jin Lin, Raj Barik:
  An Experience with Code-Size Optimization for Production iOS Mobile Applications. 363-377
- Anderson Faustino da Silva, Bruno Conde Kind, José Wesley de Souza Magalhães, Jerônimo Nunes Rocha, Breno Campos Ferreira Guimarães, Fernando Magno Quintão Pereira:
  ANGHABENCH: A Suite with One Million Compilable C Benchmarks for Code-Size Reduction. 378-390















