


default search action
31. ISC 2016: Frankfurt, Germany
- Julian M. Kunkel, Pavan Balaji, Jack J. Dongarra:

High Performance Computing - 31st International Conference, ISC High Performance 2016, Frankfurt, Germany, June 19-23, 2016, Proceedings. Lecture Notes in Computer Science 9697, Springer 2016, ISBN 978-3-319-41320-4
Autotuning and Thread Mapping
- Rengan Xu, Sunita Chandrasekaran, Xiaonan Tian, Barbara M. Chapman:

An Analytical Model-Based Auto-tuning Framework for Locality-Aware Loop Scheduling. 3-20 - Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov

, Jack J. Dongarra:
Performance, Design, and Autotuning of Batched GEMM for GPUs. 21-38 - Ravi Kumar Pujari, Thomas Wild, Andreas Herkersdorf:

TCU: A Multi-Objective Hardware Thread Mapping Unit for HPC Clusters. 39-58
Data Locality and Decomposition
- James King, Thomas Gilray, Robert M. Kirby

, Matthew Might:
Dynamic Sparse-Matrix Allocation on GPUs. 61-80 - Bruno R. C. Magalhães, Farhan Tauheed, Thomas Heinis, Anastasia Ailamaki, Felix Schürmann:

An Efficient Parallel Load-Balancing Framework for Orthogonal Decomposition of Geometrical Data. 81-97 - Diana Palsetia, William Hendrix, Sunwoo Lee

, Ankit Agrawal
, Wei-keng Liao
, Alok N. Choudhary:
Parallel Community Detection Algorithm Using a Data Partitioning Strategy with Pairwise Subdomain Duplication. 98-115 - Didem Unat

, Tan Nguyen, Weiqun Zhang, Muhammed Nufail Farooqi, Burak Bastem, George Michelogiannakis
, Ann S. Almgren
, John Shalf
:
TiDA: High-Level Programming Abstractions for Data Locality Management. 116-135
Scalable Applications
- Nikhil Jain, Eric J. Bohm, Eric Mikida, Subhasish Mandal, Minjung Kim, Prateek Jindal

, Qi Li, Sohrab Ismail-Beigi, Glenn J. Martyna, Laxmikant V. Kalé:
OpenAtom: Scalable Ab-Initio Molecular Dynamics with Diverse Capabilities. 139-158 - Vasudevan Rengasamy, Kamesh Madduri

:
SPRITE: A Fast Parallel SNP Detection Pipeline. 159-177
Machine Learning
- Andrea Borghesi

, Andrea Bartolini
, Michele Lombardi, Michela Milano, Luca Benini
:
Predictive Modeling for Job Power Consumption in HPC Systems. 181-199 - Tommy Tracy II, Yao Fu, Indranil Roy, Eric Jonas, Paul Glendenning:

Towards Machine Learning on the Automata Processor. 200-218 - Prasanna Balaprakash

, Ananta Tiwari, Stefan M. Wild
, Laura Carrington, Paul D. Hovland
:
AutoMOMML: Automatic Multi-objective Modeling with Machine Learning. 219-239
Datacenters and Cloud
- Tapasya Patki

, Natalie J. Bates, Girish Ghatikar, Anders Clausen
, Sonja Klingert
, Ghaleb Abdulla, Mehdi Sheikhalishahi:
Supercomputing Centers and Electricity Service Providers: A Geographically Distributed Perspective on Demand Management in Europe and the United States. 243-260 - Stephen Herbein, Ayush Dusia

, Aaron Myles Landwehr, Sean McDaniel, José Monsalve Diaz, Yang Yang, Seetharami R. Seelam, Michela Taufer
:
Resource Management for Running HPC Applications in Container Clouds. 261-278
Communication Runtime
- Mario Flajslik, James Dinan, Keith D. Underwood

:
Mitigating MPI Message Matching Misery. 281-299 - Hari Subramoni, Albert Mathews Augustine, Mark Daniel Arnold, Jonathan L. Perkins, Xiaoyi Lu, Khaled Hamidouche, Dhabaleswar K. Panda:

INAM2: InfiniBand Network Analysis and Monitoring with MPI. 300-320 - Rob F. Van der Wijngaart, Abdullah Kayi, Jeff R. Hammond, Gabriele Jost, Tom St. John, Srinivas Sridharan, Timothy G. Mattson, John Abercrombie, Jacob Nelson:

Comparing Runtime Systems with Exascale Ambitions Using the Parallel Research Kernels. 321-339
Intel Xeon Phi
- Alexander Heinecke, Alexander Breuer, Michael Bader, Pradeep Dubey:

High Order Seismic Simulations on the Intel Xeon Phi Processor (Knights Landing). 343-362 - Pramod S. Kumbhar, Michael L. Hines, Aleksandr Ovcharenko, Damián A. Mallón, James Gonzalo King, Florentino Sainz, Felix Schürmann, Fabien Delalondre:

Leveraging a Cluster-Booster Architecture for Brain-Scale Simulations. 363-380
Manycore Architectures
- Karthik Yagna, Onkar Patil, Frank Mueller:

Efficient and Predictable Group Communication for Manycore NoCs. 383-403 - Subramanian Ramachandran, Frank Mueller:

Distributed Job Allocation for Large-Scale Manycores. 404-425
Extreme-Scale Computations
- Tom Deakin

, Simon McIntosh-Smith
, Wayne P. Gaudin:
Many-Core Acceleration of a Discrete Ordinates Transport Mini-App at Extreme Scale. 429-448 - Maxwell Hutchinson, Alexander Heinecke, Hans Pabst, Greg Henry, Matteo Parsani

, David E. Keyes
:
Efficiency of High Order Spectral Element Methods on Petascale Architectures. 449-466
Resilience
- Karla Morris, Francesco Rizzi, Khachik Sargsyan, Kathryn Dahlgren, Paul Mycek

, Cosmin Safta
, Olivier P. Le Maître
, Omar M. Knio, Bert J. Debusschere
:
Scalability of Partial Differential Equations Preconditioner Resilient to Soft and Hard Faults. 469-485 - Nan Dun, Dirk Pleiter, Aiman Fang, Nicolas Vandenbergen, Andrew A. Chien:

Multi-versioning Performance Opportunities in BGAS System for Resilience. 486-504

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














