


22nd Euro-Par 2016: Grenoble, France
- Pierre-François Dutot, Denis Trystram: Euro-Par 2016: Parallel Processing - 22nd International Conference on Parallel and Distributed Computing, Grenoble, France, August 24-26, 2016, Proceedings. Lecture Notes in Computer Science 9833, Springer 2016, ISBN 978-3-319-43658-6
Invited Papers
- Dror G. Feitelson: Resampling with Feedback - A New Paradigm of Using Workload Data for Performance Evaluation. 3-21
- Arnold L. Rosenberg: Scheduling DAGs Opportunistically: The Dream and the Reality Circa 2016. 22-33
Support Tools and Environments
- Olaf Krzikalla, Ralph Müller-Pfefferkorn, Wolfgang E. Nagel: Synchronization Debugging of Hybrid Parallel Programs. 37-50
- Roger Kowalewski, Karl Fürlinger: Nasty-MPI: Debugging Synchronization Errors in MPI-3 One-Sided Applications. 51-62
- Alexis Martin, Vania Marangozova-Martin: Automatic Benchmark Profiling Through Advanced Trace Analysis. 63-74
Performance and Power Modeling, Prediction and Evaluation
- Paul F. Baumeister, Marcel Bornemann, Markus Bühler, Thorsten Hater, Benjamin Krill, Dirk Pleiter, Rudolf Zeller: Addressing Materials Science Challenges Using GPU-accelerated POWER8 Nodes. 77-89
- Christoph Lehnert, Rudolf Berrendorf, Jan P. Ecker, Florian Mannuss: Performance Prediction and Ranking of SpMV Kernels on GPU Architectures. 90-102
- Sandra Catalán, A. Cristiano I. Malossi, Costas Bekas, Enrique S. Quintana-Ortí: The Impact of Voltage-Frequency Scaling for the Matrix-Vector Product on the IBM POWER8. 103-116
- Alina Sîrbu, Özalp Babaoglu: Power Consumption Modeling and Prediction in a Hybrid CPU-GPU-MIC Supercomputer. 117-130
Scheduling and Load Balancing
- Louis-Claude Canon, Pierre-Cyrille Héam, Laurent Philippe: Controlling and Assessing Correlations of Cost Matrices in Heterogeneous Scheduling. 133-145
- Tim Kiefer, Dirk Habich, Wolfgang Lehner: Penalized Graph Partitioning for Static and Dynamic Load Balancing. 146-158
- Klaus Jansen, Felix Land: Non-preemptive Scheduling with Setup Times: A PTAS. 159-170
- Olivier Beaumont, Lionel Eyraud-Dubois, Thomas Lambert: Cuboid Partitioning for Parallel Matrix Multiplication on Heterogeneous Platforms. 171-182
- Antón Rey, Francisco D. Igual, Manuel Prieto-Matías: HeSP: A Simulation Framework for Solving the Task Scheduling-Partitioning Problem on Heterogeneous Architectures. 183-195
- Eric Angel, Cédric Chevalier, Franck Ledoux, Sébastien Morais, Damien Regnault: FPT Approximation Algorithm for Scheduling with Memory Constraints. 196-208
- Dimitris Fotakis, Ioannis Milis, Orestis Papadigenopoulos, Vasilis Vassalos, Georgios Zois: Scheduling MapReduce Jobs Under Multi-round Precedences. 209-222
High Performance Architectures and Compilers
- Juan Manuel Martinez Caamaño, Willy Wolff, Philippe Clauss: Code Bones: Fast and Flexible Code Generation for Dynamic and Speculative Polyhedral Optimization. 225-237
- Mihail Popov, Chadi Akel, William Jalby, Pablo de Oliveira Castro: Piecewise Holistic Autotuning of Compiler and Runtime Parameters. 238-250
- Ricardo Quislant, Eladio Gutiérrez, Emilio L. Zapata, Oscar G. Plata: Insights into the Fallback Path of Best-Effort Hardware Transactional Memory Systems. 251-263
- Florian Wende, Matthias Noack, Thomas Steinke, Michael Klemm, Chris J. Newburn, Georg Zitzlsberger: Portable SIMD Performance with OpenMP* 4.x Compiler Directives. 264-277
Parallel and Distributed Data Management and Analytics
- Luca Salucci, Daniele Bonetta, Walter Binder: Lightweight Multi-language Bindings for Apache Spark. 281-292
- Jianwei Liao, Balazs Gerofi, Guo-Yuan Lien, Seiya Nishizawa, Takemasa Miyoshi, Hirofumi Tomita, Yutaka Ishikawa: Toward a General I/O Arbitration Framework for netCDF Based Big Data Processing. 293-305
- Angelos Papatriantafyllou, Dimitris Sacharidis: High Performance Parallel Summed-Area Table Kernels for Multi-core and Many-core Systems. 306-318
- Dipanjan Sengupta, Narayanan Sundaram, Xia Zhu, Theodore L. Willke, Jeffrey S. Young, Matthew Wolf, Karsten Schwan: GraphIn: An Online High Performance Incremental Graph Processing Framework. 319-333
- Long Cheng, Spyros Kotoulas: Efficient Large Outer Joins over MapReduce. 334-346
Cluster and Cloud Computing
- Jie Zhang, Xiaoyi Lu, Sourav Chakraborty, Dhabaleswar K. Panda: Slurm-V: Extending Slurm for Building Efficient HPC Cloud with SR-IOV and IVShmem. 349-362
- Fernanda G. O. Passos, Vinod E. F. Rebello: An Autonomic Parallel Strategy for the Projection of Ecological Niche Models in Heterogeneous Computational Environments. 363-375
- Mennan Selimi, Davide Vega, Felix Freitag, Luís Veiga: Towards Network-Aware Service Placement in Community Network Micro-Clouds. 376-388
- Yanik Ngoko: Heating as a Cloud-Service, A Position Paper (Industrial Presentation). 389-401
Distributed Systems and Algorithms
- Karthik Murthy, Sri Raj Paul, Kuldeep S. Meel, Tiago Cogumbreiro, John M. Mellor-Crummey: Design and Verification of Distributed Phasers. 405-418
- Eduardo Berrocal, Leonardo Bautista-Gomez, Sheng Di, Zhiling Lan, Franck Cappello: Exploring Partial Replication to Improve Lightweight Silent Data Corruption Detection for HPC Applications. 419-430
Parallel and Distributed Programming, Interfaces, Language
- Sascha Hunold, Alexandra Carpen-Amarie, Felix Donatus Lübbe, Jesper Larsson Träff: Automatic Verification of Self-consistent MPI Performance Guidelines. 433-446
- Guilherme Andrade, Wilson de Carvalho, Renato Utsch, Pedro Caldeira, Alberto Albuquerque, Fabricio Ferracioli, Leonardo Rocha, Michael Frank, Dorgival O. Guedes, Renato Ferreira: ParallelME: A Parallel Mobile Engine to Explore Heterogeneity in Mobile Computing Architectures. 447-459
- Anastasia Braginsky, Nachshon Cohen, Erez Petrank: CBPQ: High Performance Lock-Free Priority Queue. 460-474
Multicore and Manycore Parallelism
- Ali Charara, Hatem Ltaief, David E. Keyes: Redesigning Triangular Dense Matrix Computations on GPUs. 477-489
- Eduardo H. M. Cruz, Matthias Diener, Laércio Lima Pilla, Philippe O. A. Navaux: A Sharing-Aware Memory Management Unit for Online Mapping in Multi-core Architectures. 490-501
- Ibrahim Umar, Otto J. Anshus, Phuong Hoai Ha: GreenBST: Energy-Efficient Concurrent Search Tree. 502-517
- Jinsu Park, Woongki Baek: HAP: A Heterogeneity-Conscious Runtime System for Adaptive Pipeline Parallelism. 518-530
- Philippe Virouleau, François Broquedis, Thierry Gautier, Fabrice Rastello: Using Data Dependencies to Improve Task-Based Scheduling Strategies on NUMA Architectures. 531-544
- Martin Groen, Vincent Gramoli: Multicore vs Manycore: The Energy Cost of Concurrency. 545-557
Theory and Algorithms for Parallel Computation and Networking
- Natcha Simsiri, Kanat Tangwongsan, Srikanta Tirthapura, Kun-Lung Wu: Work-Efficient Parallel Union-Find with Applications to Incremental Graph Connectivity. 561-573
- Rezaul Chowdhury, Pramod Ganapathi, Vivek Pradhan, Jesmin Jahan Tithi, Yunpeng Xiao: An Efficient Cache-oblivious Parallel Viterbi Algorithm. 574-587
- Karine Altisen, Stéphane Devismes, Anaïs Durand, Franck Petit: Gradual Stabilization Under τ-Dynamics. 588-602
Parallel Numerical Methods and Applications
- Dalal Sukkari, Hatem Ltaief, David E. Keyes: High Performance Polar Decomposition on Distributed Memory Systems. 605-616
- Weifeng Liu, Ang Li, Jonathan D. Hogg, Iain S. Duff, Brian Vinter: A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves. 617-630
- José Ignacio Aliaga, Maria Barreda, Matthias Bollhöfer, Enrique S. Quintana-Ortí: Exploiting Task-Parallelism in Message-Passing Sparse Linear System Solvers Using OmpSs. 631-643
- Pierre-Louis Guhur, Hong Zhang, Tom Peterka, Emil M. Constantinescu, Franck Cappello: Lightweight and Accurate Silent Data Corruption Detection in Ordinary Differential Equation Solvers. 644-656
Accelerator Computing
- Ian Masliah, Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov, Marc Baboulin, Joël Falcou, Jack J. Dongarra: High-Performance Matrix-Matrix Multiplications of Very Small Matrices. 659-671
- Anshul Gupta, Natalia Gimelshein, Seid Koric, Steven C. Rennich: Effective Minimally-Invasive GPU Acceleration of Distributed Sparse Matrix Factorization. 672-683
- Pierre Huchant, Marie Christine Counilh, Denis Barthou: Automatic OpenCL Task Adaptation for Heterogeneous Architectures. 684-696















