


default search action
32nd SBAC-PAD 2020: Porto, Portugal
- 32nd IEEE International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2020, Porto, Portugal, September 9-11, 2020. IEEE 2020, ISBN 978-1-7281-9924-5

Conference Papers
Computer Architecture
- Francisco Mendes, Pedro Tomás

, Nuno Roma
:
Exploiting Non-conventional DVFS on GPUs: Application to Deep Learning. 1-9 - Nicolas Bohm Agostini

, Shi Dong, Elmira Karimi, Marti Torrents Lapuerta, José Cano
, José L. Abellán, David R. Kaeli:
Design Space Exploration of Accelerators and End-to-End DNN Evaluation with TFLITE-SOC. 10-19 - Rico Amslinger, Christian Piatka, Florian Haas, Sebastian Weis, Theo Ungerer, Sebastian Altmeyer:

Hardware Multiversioning for Fail-Operational Multithreaded Applications. 20-27 - Syed Ali Hasnain, Rabi N. Mahapatra:

On-chip Parallel Photonic Reservoir Computing using Multiple Delay Lines. 28-34 - Douglas Pereira Pasqualin

, Matthias Diener, André Rauber Du Bois, Maurício Lima Pilla:
Online Sharing-Aware Thread Mapping in Software Transactional Memory. 35-42 - Jorge González, Alexander Gazman, Maarten Hattink, Mauricio G. Palma

, Meisam Bahadori, Ruth Rubio-Noriega
, Lois Orosa
, Madeleine Glick, Onur Mutlu
, Keren Bergman, Rodolfo Azevedo:
Optically Connected Memory for Disaggregated Data Centers. 43-50
Networking and Distributed Systems
- Vinu E. Venugopal

, Martin Theobald
, Samira Chaychi, Amal Tawakuli:
AIR: A Light-Weight Yet High-Performance Dataflow Engine based on Asynchronous Iterative Routing. 51-58 - Felipe Rodrigo de Souza, Marcos Dias de Assunção, Eddy Caron, Alexandre da Silva Veith

:
An Optimal Model for Optimizing the Placement and Parallelism of Data Stream Processing Applications on Cloud-Edge Computing. 59-66 - Anderson Andrei Da Silva, Clément Mommessin, Pierre Neyron, Denis Trystram, Adwait Bauskar, Adrien Lebre, Alexandre van Kempen, Yanik Ngoko, Yoann Ricordel:

Evaluating Computation and Data Placements in Edge Infrastructures through a Common Simulator. 67-74 - Adrien Gougeon, Benjamin Camus, Anne-Cécile Orgerie:

Optimizing Green Energy Consumption of Fog Computing Architectures. 75-82
Parallel Applications and Algorithms
- Ivan Fernandez, Ricardo Quislant

, Eladio Gutiérrez, Oscar G. Plata:
Energy-Efficient Time Series Analysis Using Transprecision Computing. 83-90 - Pablo San Juan, Adrián Castelló

, Manuel F. Dolz
, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí:
High Performance and Portable Convolution Operators for Multicore Processors. 91-98 - Andrew Anderson, Aravind Vasudevan, Cormac Keane, David Gregg:

High-Performance Low-Memory Lowering: GEMM-based Algorithms for DNN Convolution. 99-106 - Christina L. Peterson, Amalee Wilson, Peter Pirkelbauer, Damian Dechev:

Optimized Transactional Data Structure Approach to Concurrency Control for In-Memory Databases. 107-115 - Changjiang Gou, Anne Benoit

, Mingsong Chen, Loris Marchal, Tongquan Wei:
Reliable and Energy-aware Mapping of Streaming Series-parallel Applications onto Hierarchical Platforms. 116-123 - Guilherme Andrade, George Teodoro, Renato Ferreira:

Scalable and Efficient Spatial-Aware Parallelization Strategies for Multimedia Retrieval. 124-131 - Pawel Zuk, Krzysztof Rzadca:

Scheduling Methods to Reduce Response Latency of Function as a Service. 132-140 - Hongyang Sun

, Ana Gainaru, Manu Shantharam, Padma Raghavan:
Selective Protection for Sparse Iterative Solvers to Reduce the Resilience Overhead. 141-148 - Steven Wei Der Chien

, Jonas Nylund, Gabriel Bengtsson, Ivy Bo Peng
, Artur Podobas, Stefano Markidis:
sputniPIC: An Implicit Particle-in-Cell Code for Multi-GPU Systems. 149-156 - Samuel Thomas, Roxana Hayne, Jonad Pulaj, Hammurabi Mendes:

Using Skip Graphs for Increased NUMA Locality. 157-166
Performance Evaluation
- Martin Johnson, Daniel P. Playne:

A Fast and Concise Parallel Implementation of the 8x8 2D IDCT using Halide. 167-174 - David Quaresma, Daniel Fireman, Thiago Emmanuel Pereira:

Controlling Garbage Collection and Request Admission to Improve Performance of FaaS Applications. 175-182 - Ivy Bo Peng

, Roger Pearce, Maya B. Gokhale:
On the Memory Underutilization: Exploring Disaggregated Memory on HPC Systems. 183-190 - Dorra Boughzala, Laurent Lefèvre, Anne-Cécile Orgerie:

Predicting the Energy Consumption of CUDA Kernels using SimGrid. 191-198 - Yuan Wen, Andrew Anderson, Valentin Radu

, Michael F. P. O'Boyle, David Gregg:
TASO: Time and Space Optimization for Memory-Constrained DNN Inference. 199-208 - Riccardo Mancini, Antonio Ritacco, Giacomo Lanciano

, Tommaso Cucinotta:
XPySom: High-Performance Self-Organizing Maps. 209-216
System Software
- Wei Liu, Hao Wu, Ziyue Jiang

, Yifan Gong, Jiangming Jin:
A Robotic Communication Middleware Combining High Performance and High Reliability. 217-224 - Rafael A. Lopes, Samuel Thibault, Alba C. M. A. Melo

:
MASA-StarPU: Parallel Sequence Comparison with Multiple Scheduling Policies and Pruning. 225-232 - Marcus Karpoff, José Nelson Amaral, Kai-Ting Amy Wang, Rayson Ho, Brice Dobry:

PSU: A Framework for Dynamic Software Updates in Multi-threaded C-Language Programs. 233-240 - Ioannis Vardas

, Manolis Ploumidis, Manolis Marazakis:
Towards Communication Profile, Topology and Node Failure Aware Process Placement. 241-248
WAMCA Workshop Papers
- Vitoria Pinho

, Hervé Yviquel
, Márcio Machado Pereira, Guido Araujo:
OmpTracing: Easy Profiling of OpenMP Programs. 249-256 - Diana A. Barros, Cristiana Bentes:

Analyzing the Loop Scheduling Mechanisms on Julia Multithreading. 257-264 - Alexandre Azevedo, Cristiana Bentes, Maria Clicia Stelling de Castro, Claude Tadonki:

Performance Analysis and Optimization of the Vector-Kronecker Product Multiplication. 265-272 - Daniel Di Domenico, Gerson G. H. Cavalheiro:

JAMPI: A C++ Parallel Programming Interface Allowing the Implementation of Custom and Generic Scheduling Mechanisms. 273-280 - Christophe Cérin, Nicolas Grenèche, Tarek Menouer

:
Towards Pervasive Containerization of HPC Job Schedulers. 281-288 - Stefan Sydow

, Mohannad Nabelsee, Sabine Glesner, Paula Herber:
Towards Profile-Guided Optimization for Safe and Efficient Parallel Stream Processing in Rust. 289-296 - Xi Zhang, Xu Sun, Xiaohu Guo

, Yunfei Du
, Yutong Lu, Yang Liu:
Re-evaluation of Atomic Operations and Graph Coloring for Unstructured Finite Volume GPU Simulations. 297-304 - Rui Alves

, José Rufino
:
Extending Heterogeneous Applications to Remote Co-processors with rOpenCL. 305-312 - Maron Schlemon, Jamin Naghmouchi:

FFT Optimizations and Performance Assessment Targeted towards Satellite and Airborne Radar Processing. 313-320 - Suyash Bakshi, S. Lennart Johnsson:

A Highly Efficient SGEMM Implementation using DMA on the Intel/Movidius Myriad-2. 321-328

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














