


default search action
27th Euro-Par 2021: Lisbon, Portugal
- Leonel Sousa

, Nuno Roma
, Pedro Tomás
:
Euro-Par 2021: Parallel Processing - 27th International Conference on Parallel and Distributed Computing, Lisbon, Portugal, September 1-3, 2021, Proceedings. Lecture Notes in Computer Science 12820, Springer 2021, ISBN 978-3-030-85664-9
Compilers, Tools and Environments
- Daniel Maier, Biagio Cosenza

, Ben H. H. Juurlink:
ALONA: Automatic Loop Nest Approximation with Reconstruction and Space Pruning. 3-18 - Peter Arzt

, Yannic Fischler
, Jan-Patrick Lehr
, Christian H. Bischof
:
Automatic Low-Overhead Load-Imbalance Detection in MPI Applications. 19-34
Performance and Power Modeling, Prediction and Evaluation
- Yannis Sfakianakis, Eleni Kanellou, Manolis Marazakis, Angelos Bilas

:
Trace-Based Workload Generation and Execution. 37-54 - Anne Benoit, Louis-Claude Canon, Redouane Elghazi, Pierre-Cyrille Héam:

Update on the Asymptotic Optimality of LPT. 55-69 - Burak Aksar

, Benjamin Schwaller, Omar Aaziz, Vitus J. Leung, Jim M. Brandt, Manuel Egele, Ayse K. Coskun:
E2EWatch: An End-to-End Anomaly Diagnosis Framework for Production HPC Systems. 70-85
Scheduling and Load Balancing
- Zhuoran Ji

, Cho-Li Wang:
Collaborative GPU Preemption via Spatial Multitasking for Efficient GPU Sharing. 89-104 - Ning Tang

, Alix Munier Kordon
:
A Fixed-Parameter Algorithm for Scheduling Unit Dependent Tasks with Unit Communication Delays. 105-119 - Jan Kopanski

, Krzysztof Rzadca
:
Plan-Based Job Scheduling for Supercomputers with Shared Burst Buffers. 120-135 - Sonia Ben Mokhtar, Louis-Claude Canon, Anthony Dugois

, Loris Marchal, Etienne Rivière:
Taming Tail Latency in Key-Value Stores: A Scheduling Perspective. 136-150 - Adrian Naruszko

, Bartlomiej Przybylski
, Krzysztof Rzadca
:
A Log-Linear (2 +5/6)-Approximation Algorithm for Parallel Machine Scheduling with a Single Orthogonal Resource. 151-166 - Maria Predari, Charilaos Tzovas, Christian Schulz, Henning Meyerhenke:

An MPI-based Algorithm for Mapping Complex Networks onto Hierarchical Architectures. 167-182 - Olivier Beaumont

, Lionel Eyraud-Dubois
, Alena Shilova
:
Pipelined Model Parallelism: Complexity Results and Memory Considerations. 183-198
Data Management, Analytics and Machine Learning
- Haoran Wang

, Chong Li
, Thibaut Tachon, Hongxing Wang, Sheng Yang, Sébastien Limet, Sophie Robert:
Efficient and Systematic Partitioning of Large and Deep Neural Networks for Parallelization. 201-216 - Kyusik Choi

, Hoeseok Yang
:
A GPU Architecture Aware Fine-Grain Pruning Technique for Deep Neural Networks. 217-231 - Zhongyi Lin

, Evangelos Georganas, John D. Owens
:
Towards Flexible and Compiler-Friendly Layer Fusion for CNNs on Multicore CPUs. 232-248 - Tiago Lopes, Miguel E. Coimbra

, Luís Veiga
:
Smart Distributed DataSets for Stream Processing. 249-265
Cluster, Cloud and Edge Computing
- Francesc Lordan

, Daniele Lezzi
, Rosa M. Badia
:
Colony: Parallel Functions as a Service on the Cloud-Edge Continuum. 269-284 - David Delande

, Patricia Stolf
, Raphaël Féraud
, Jean-Marc Pierson
, André Bottaro
:
Horizontal Scaling in Cloud Using Contextual Bandits. 285-300 - Ronan-Alexandre Cherrueau, Marie Delavergne, Adrien Lèbre:

Geo-distribute Cloud Applications at the Edge. 301-316 - Rafaela C. Brum, Walisson P. Sousa, Alba C. M. A. Melo

, Cristiana Bentes, Maria Clicia Stelling de Castro, Lúcia Maria de A. Drummond:
A Fault Tolerant and Deadline Constrained Sequence Alignment Application on Cloud-Based Spot GPU Instances. 317-333 - Sophie Cerf

, Raphaël Bleuse
, Valentin Reis, Swann Perarnau
, Éric Rutten
:
Sustaining Performance While Reducing Energy Consumption: A Control Theory Approach. 334-349
Theory and Algorithms for Parallel and Distributed Processing
- Rezaul Chowdhury, Francesco Silvestri

, Flavio Vella
:
Algorithm Design for Tensor Units. 353-367 - Jeremy Buhler

, Thomas Lavastida
, Kefu Lu, Benjamin Moseley:
A Scalable Approximation Algorithm for Weighted Longest Common Subsequence. 368-384 - Adones Rukundo, Philippas Tsigas

:
TSLQueue: An Efficient Lock-Free Design for Priority Queues. 385-401 - Bryan Rowe, Rajiv Gupta

:
G-Morph: Induced Subgraph Isomorphism Search of Labeled Graphs on a GPU. 402-417
Parallel and Distributed Programming, Interfaces, and Languages
- Catalina Munoz Morales, Rafael Murari, Joao P. L. de Carvalho

, Bruno Chinelato Honorio, Alexandro Baldassin
, Guido Araujo:
Accelerating Graph Applications Using Phased Transactional Memory. 421-434 - Dian-Lun Lin

, Tsung-Wei Huang:
Efficient GPU Computation Using Task Graph Parallelism. 435-450 - Nicolas M. Morales

, Keita Teranishi, Bogdan Nicolae, Christian Trott, Franck Cappello:
Towards High Performance Resilience Using Performance Portable Abstractions. 451-465 - Thomas Dionisi, Stéphane Bouhrour, Julien Jaeger, Patrick Carribault, Marc Pérache:

Enhancing Load-Balancing of MPI Applications with Workshare. 466-481 - Nicolas L. Guidotti

, Pedro Ceyrat, João Barreto
, José Monteiro, Rodrigo Rodrigues, Ricardo Fonseca, Xavier Martorell, Antonio J. Peña:
Particle-In-Cell Simulation Using Asynchronous Tasking. 482-498
Multicore and Manycore Parallelism
- Raúl Nozal, José Luis Bosque

:
Exploiting Co-execution with OneAPI: Heterogeneity from a Modern Perspective. 501-516
Parallel Numerical Methods and Applications
- Yuankun Fu

, Fengguang Song
:
Designing a 3D Parallel Memory-Aware Lattice Boltzmann Algorithm on Manycore Systems. 519-535 - Camille Coti, Laure Petrucci

, Daniel Alberto Torres González:
Fault-Tolerant LU Factorization Is Low Cost. 536-549 - Fritz Göbel, Thomas Grützmacher

, Tobias Ribizel
, Hartwig Anzt
:
Mixed Precision Incomplete and Factorized Sparse Approximate Inverse Preconditioning on GPUs. 550-564 - Yuxi Hong

, El Houcine Bergou, Nicolas Doucet, Hao Zhang, Jesse Cranney
, Hatem Ltaief
, Damien Gratadour
, François Rigaut
, David E. Keyes:
Outsmarting the Atmospheric Turbulence for Ground-Based Telescopes Using the Stochastic Levenberg-Marquardt Method. 565-579 - Adam Smelko

, Miroslav Kratochvíl
, Martin Krulis
, Tomás Sieger
:
GPU-Accelerated Mahalanobis-Average Hierarchical Clustering Analysis. 580-595
High Performance Architectures and Accelerators
- Vladimir Dimic

, Miquel Moretó
, Marc Casas
, Mateo Valero
:
PrioRAT: Criticality-Driven Prioritization Inside the On-Chip Memory Hierarchy. 599-615 - Alberto Zeni

, Kenneth O'Brien, Michaela Blott, Marco D. Santambrogio:
Optimized Implementation of the HPCG Benchmark on Reconfigurable Hardware. 616-630

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














