


default search action
28th PACT 2019: Seattle, WA, USA
- 28th International Conference on Parallel Architectures and Compilation Techniques, PACT 2019, Seattle, WA, USA, September 23-26, 2019. IEEE 2019, ISBN 978-1-7281-3613-4

Session 1: Best-Papers
- Udit Gupta

, Brandon Reagen
, Lillian Pentecost
, Marco Donato, Thierry Tambe
, Alexander M. Rush
, Gu-Yeon Wei, David Brooks:
MASR: A Modular Accelerator for Sparse RNNs. 1-14 - Roshan Dathathri, Gurbinder Gill, Loc Hoang, Vishwesh Jatala, Keshav Pingali, V. Krishna Nandivada, Hoang-Vu Dang, Marc Snir:

Gluon-Async: A Bulk-Asynchronous System for Distributed and Heterogeneous Graph Analytics. 15-28 - Shintaro Iwasaki

, Abdelhalim Amer, Kenjiro Taura
, Sangmin Seo, Pavan Balaji:
BOLT: Optimizing OpenMP Parallel Regions with User-Level Threads. 29-42 - Daniel Townley, Dmitry Ponomarev:

SMT-COP: Defeating Side-Channel Attacks on Execution Units in SMT Processors. 43-54
Session 2A: Compiler Optimization and Code Generation 1
- Bruce Collie

, Philip Ginsbach, Michael F. P. O'Boyle:
Type-Directed Program Synthesis and Constraint Generation for Library Portability. 55-67 - Apala Guha, Naveen Vedula, Arrvindh Shriraman:

Deepframe: A Profile-Driven Compiler for Spatial Hardware Accelerators. 68-81 - Patrick Nappa, David Zhao

, Pavle Subotic, Bernhard Scholz:
Fast Parallel Equivalence Relations in a Datalog Compiler. 82-96
Session 2B: Memory/Storage Systems 1
- Jongwook Chung, Yuhwan Ro, Joonsung Kim, Jaehyung Ahn, Jangwoo Kim, John Kim

, Jae W. Lee, Jung Ho Ahn
:
Enforcing Last-Level Cache Partitioning through Memory Virtual Channels. 97-109 - Richard Afoakwa, Lejie Lu, Hui Wu, Michael C. Huang

:
To Stack or Not To Stack. 110-123 - Soklong Lim, Zaixin Lu, Bin Ren, Xuechen Zhang:

Enforcing Crash Consistency of Evolving Network Analytics in Non-Volatile Main Memory Systems. 124-137
Session 3A: Hardware/Software for Security
- Biswabandan Panda

:
Fooling the Sense of Cross-Core Last-Level Cache Eviction Based Attacker by Prefetching Common Sense. 138-150 - Kristin Barber, Anys Bacha, Li Zhou, Yinqian Zhang, Radu Teodorescu:

SpecShield: Shielding Speculative Data from Microarchitectural Covert Channels. 151-164
Session 3B: Hardware/Software for Machine Learning
- Myeonggyun Han, Jihoon Hyun

, Seongbeom Park, Jinsu Park, Woongki Baek:
MOSAIC: Heterogeneity-, Communication-, and Constraint-Aware Model Slicing and Execution for Accurate and Efficient Inference. 165-177 - Xiao Dong

, Lei Liu, Peng Zhao, Guangli Li, Jiansong Li
, Xueying Wang, Xiaobing Feng:
Acorns: A Framework for Accelerating Deep Neural Networks with Input Sparsity. 178-191
Session 4A: Concurrency Management
- Sunjae Park, Christopher J. Hughes

, Milos Prvulovic:
Forgive-TM: Supporting Lazy Conflict Detection In Eager Hardware Transactional Memory. 192-204 - Naama Ben-David, Ziv Scully

, Guy E. Blelloch:
Unfair Scheduling Patterns in NUMA Architectures. 205-218 - Pantea Zardoshti, Tingzhe Zhou, Yujie Liu, Michael F. Spear

:
Optimizing Persistent Memory Transactions. 219-231
Session 4B: Heterogeneous Systems and Accelerators 1
- Daniel Castro

, Paolo Romano
, Aleksandar Ilic
, Amin M. Khan:
HeTM: Transactional Memory for Heterogeneous Systems. 232-244 - Amir Watad, Alexander Libov, Ohad Shacham, Edward Bortnikov, Mark Silberstein:

Achieving Scalability in a k-NN Multi-GPU Network Service with Centaur. 245-257 - Mohamed Assem Ibrahim

, Hongyuan Liu
, Onur Kayiran, Adwait Jog:
Analyzing and Leveraging Remote-Core Bandwidth for Enhanced Performance in GPUs. 258-271
Session 5A: Domain/Application-Specific Hardware/Software
- Lewis Crawford, Michael F. P. O'Boyle:

Specialization Opportunities in Graphical Workloads. 272-283 - Farzaneh Zokaee, Mingzhe Zhang, Lei Jiang:

FindeR: Accelerating FM-Index-Based Exact Pattern Matching in Genomic Sequences through ReRAM Technology. 284-295 - Yan Pei, Swarnendu Biswas, Donald S. Fussell, Keshav Pingali:

SLAMBooster: An Application-Aware Online Controller for Approximation in Dense SLAM. 296-310
Session 5B: Heterogeneous Systems and Accelerators 2
- Zhen Lin

, Mohammad A. Alshboul
, Yan Solihin, Huiyang Zhou
:
Exploring Memory Persistency Models for GPUs. 311-323 - Ahmed E. Helal, Ashwin M. Aji, Michael L. Chu, Bradford M. Beckmann, Wu-chun Feng:

Adaptive Task Aggregation for High-Performance Sparse Solvers on GPUs. 324-336 - Tayler Hicklin Hetherington, Maria Lubeznov, Deval Shah, Tor M. Aamodt:

EDGE: Event-Driven GPU Execution. 337-353
Session 6A: Compiler Optimization and Code Generation 2
- Ari Rasch, Richard Schulze, Sergei Gorlatch:

Generating Portable High-Performance Code via Multi-Dimensional Homomorphisms. 354-369 - Tobias Gysi, Tobias Grosser

, Torsten Hoefler:
Absinthe: Learning an Analytical Performance Model to Fuse and Tile Stencil Codes in One Shot. 370-382
Session 6B: Memory/Storage Systems 2
- Jiajun Wang, Prakash Ramrakhyani, Wendy Elsasser

, Lizy Kurian John:
Reducing Data Movement and Energy in Multilevel Cache Hierarchies without Losing Performance: Can you have it all? 383-394 - Ziqi Wang

, Michael A. Kozuch, Todd C. Mowry, Vivek Seshadri:
Multiversioned Page Overlays: Enabling Faster Serializable Hardware Transactional Memory. 395-408
Session 7: Parallel Algorithms and Applications
- Zhenghai Chen, Tiow Seng Tan:

Computing Three-Dimensional Constrained Delaunay Refinement Using the GPU. 409-420 - Jesun Sahariar Firoz, Marcin Zalewski, Andrew Lumsdaine

:
A Synchronization-Avoiding Distance-1 Grundy Coloring Algorithm for Power-Law Graphs. 421-432 - Giovanni Balduzzi, Arghya Chatterjee

, Ying Wai Li, Peter W. Doak, Urs R. Hähner, Eduardo F. D'Azevedo, Thomas A. Maier
, Thomas C. Schulthess:
Accelerating DCA++ (Dynamical Cluster Approximation) Scientific Application on the Summit Supercomputer. 433-444 - Gangyi Zhu, Peng Jiang

, Gagan Agrawal:
A Methodology for Characterizing Sparse Datasets and Its Application to SIMD Performance Prediction. 445-456
Posters
- Ningxin Zheng, Quan Chen, Yong Yang, Jin Li, Wenli Zheng, Minyi Guo:

POSTER: Precise Capacity Planning for Database Public Clouds. 457-458 - Roman Kaplan, Leonid Yavits, Ran Ginosar:

POSTER: BioSEAL: In-Memory Biological Sequence Alignment Accelerator for Large-Scale Genomic Data. 459-460 - Jinsu Park, Seongbeom Park, Myeonggyun Han, Woongki Baek:

POSTER: The Performance Impact of Thread Packing on Synchronization-Intensive Applications. 461-462 - Reza Yazdani, José-María Arnau, Antonio González:

POSTER: Leveraging Run-Time Feedback for Efficient ASR Acceleration. 463-464 - Charles Jin, Muthu Manikandan Baskaran, Benoît Meister:

POSTER: Automatic Parallelization Targeting Asynchronous Task-Based Runtimes. 465-466 - Xi Wang, Jie Li, Antonino Tumeo, John D. Leidel, Yong Chen:

POSTER: Memory Hotspot Optimization for Data-Intensive Applications. 467-468 - Jungwoo Choi, Boyeal Kim, Ji-Ye Jeon, Hyuk-Jae Lee, Euicheol Lim, Chae-Eun Rhee:

POSTER: GPU Based Near Data Processing for Image Processing with Pattern Aware Data Allocation and Prefetching. 469-470 - Sayantan Ray, Madhu Mutyam:

POSTER: Variable Sized Cache-Block Compaction. 471-472 - Priyank Faldu

, Jeff Diamond, Boris Grot
:
POSTER: Domain-Specialized Cache Management for Graph Analytics. 473-474 - Abdullah Khalufa, Graham D. Riley, Mikel Luján:

POSTER: Runtime Adaptations for Energy-Efficient VSLAM. 475-476 - Leonid Yavits, Roman Kaplan, Ran Ginosar:

POSTER: GIRAF: General Purpose In-Storage Resistive Associative Framework. 477-478 - Adrián Barredo

, Juan M. Cebrian
, Miquel Moretó
, Marc Casas
, Mateo Valero:
POSTER: An Optimized Predication Execution for SIMD Extensions. 479-480 - Blaise-Pascal Tine, Sudhakar Yalamanchili, Hyesoon Kim, Jeffrey S. Vetter:

POSTER: Tango: An Optimizing Compiler for Just-In-Time RTL Simulation. 481-482 - Adrián Barredo

, Jonathan C. Beard, Miquel Moretó
:
POSTER: SPiDRE: Accelerating Sparse Memory Access Patterns. 483-484 - Hyojin Sung, Tong Chen, Alexandre E. Eichenberger, Kevin K. O'Brien:

POSTER: CogR: Exploiting Program Structures for Machine-Learning Based Runtime Solutions. 485-486 - Teng Yu, Pavlos Petoumenos

, Vladimir Janjic, Mingcan Zhu, Hugh Leather, John Thomson:
POSTER: A Collaborative Multi-Factor Scheduler for Asymmetric Multicore Processors. 487-488 - Yuan Wen, Andrew Anderson, Valentin Radu

, Michael F. P. O'Boyle, David Gregg:
POSTER: Space and Time Optimal DNN Primitive Selection with Integer Linear Programming. 489-490 - Zhenwei Wu, Kai Lu, Wenzhe Zhang, Andrew Nisbet

, Mikel Luján:
POSTER: Quiescent and Versioned Shadow Copies for NVM. 491-492 - Satoshi Imamura

, Eiji Yoshida
:
POSTER: AR-MMAP: Write Performance Improvement of Memory-Mapped File. 493-494 - Masab Ahmad, Mohsin Shan, Akif Rehman, Omer Khan:

POSTER: Exploiting Multi-Level Task Dependencies to Prune Redundant Work in Relax-Ordered Task-Parallel Algorithms. 495-496 - Mengchi Zhang

, Roland N. Green, Timothy G. Rogers:
POSTER: Quantifying the Direct Overhead of Virtual Function Calls on Massively Parallel Architectures. 497-498 - Eddie C. Davis, Catherine Olschanowsky:

POSTER: A Polyhedral+Dataflow Intermediate Language for Performance Exploration. 499-500 - Babak Zamirai, Salar Latifi, Scott A. Mahlke:

POSTER: Pairing Up CNNs for High Throughput Deep Learning. 501-502 - Xiaojia Song, Tao Xie, Stephen Fischer:

POSTER: A Memory-Access-Efficient Adaptive Implementation of kNN on FPGA through HLS. 503-504

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














