


default search action
Jack J. Dongarra
Person information
- affiliation: University of Tennessee, Knoxville, TN, USA
- affiliation: Oak Ridge National Laboratory, TN, USA
- affiliation: University of Manchester, Manchester, UK
- award (2021): Turing Award
- award (2020): Computer Pioneer Award
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j345]Heike Jagode, Anthony Danalis, Giuseppe Congiu, Daniel Barry, Anthony Castaldo, Jack J. Dongarra:
Advancements of PAPI for the exascale generation. Int. J. High Perform. Comput. Appl. 39(2): 251-268 (2025) - [j344]Lijing Luo, Klavdiya Bochenina, Tesfamariam M. Abuhay, Nachyn Dorzhu, George Kampis, Sergey V. Kovalchuk, Valeria V. Krzhizhanovskaya, Maciej Paszynski, Clélia de Mulatier, Jack J. Dongarra, Peter M. A. Sloot:
Evolution of the computational science community: The dynamics of topics and collaborations in 24 years of ICCS and JoCS publications. J. Comput. Sci. 89: 102609 (2025) - [e142]Roman Wyrzykowski
, Jack J. Dongarra
, Ewa Deelman
, Konrad Karczewski:
Parallel Processing and Applied Mathematics - 15th International Conference, PPAM 2024, Ostrava, Czech Republic, September 8-11, 2024, Revised Selected Papers, Part I. Lecture Notes in Computer Science 15579, Springer 2025, ISBN 978-3-031-85696-9 [contents] - [e141]Roman Wyrzykowski
, Jack J. Dongarra
, Ewa Deelman
, Konrad Karczewski:
Parallel Processing and Applied Mathematics - 15th International Conference, PPAM 2024, Ostrava, Czech Republic, September 8-11, 2024, Revised Selected Papers, Part II. Lecture Notes in Computer Science 15580, Springer 2025, ISBN 978-3-031-85699-0 [contents] - [e140]Roman Wyrzykowski
, Jack J. Dongarra
, Ewa Deelman
, Konrad Karczewski:
Parallel Processing and Applied Mathematics - 15th International Conference, PPAM 2024, Ostrava, Czech Republic, September 8-11, 2024, Revised Selected Papers, Part III. Lecture Notes in Computer Science 15581, Springer 2025, ISBN 978-3-031-85702-7 [contents] - 2024
- [j343]Torsten Hoefler
, Marcin Copik
, Pete Beckman
, Andrew Jones
, Ian T. Foster
, Manish Parashar
, Daniel A. Reed
, Matthias Troyer
, Thomas C. Schulthess
, Dan Ernst
, Jack J. Dongarra
:
XaaS: Acceleration as a Service to Enable Productive High-Performance Cloud Computing. Comput. Sci. Eng. 26(3): 40-51 (2024) - [j342]Piotr Luszczek
, Anthony Castaldo, Yaohung M. Tsai, Daniel Mishler, Jack J. Dongarra:
Numerical eigen-spectrum slicing, accurate orthogonal eigen-basis, and mixed-precision eigenvalue refinement using OpenMP data-dependent tasks and accelerator offload. Int. J. High Perform. Comput. Appl. 38(6): 671-691 (2024) - [j341]Sergey V. Kovalchuk, Clélia de Mulatier, Valeria V. Krzhizhanovskaya
, Jirí Mikyska, Maciej Paszynski, Jack J. Dongarra, Peter M. A. Sloot:
Computation at the Cutting Edge of Science. J. Comput. Sci. 81: 102379 (2024) - [j340]Neil Lindquist
, Piotr Luszczek
, Jack J. Dongarra
:
Generalizing Random Butterfly Transforms to Arbitrary Matrix Sizes. ACM Trans. Math. Softw. 50(4): 26:1-26:23 (2024) - [c451]Lijing Luo, Sergey V. Kovalchuk, Valeria V. Krzhizhanovskaya
, Maciej Paszynski
, Clélia de Mulatier, Jack J. Dongarra, Peter M. A. Sloot:
Trends in Computational Science: Natural Language Processing and Network Analysis of 23 Years of ICCS Publications. ICCS (2) 2024: 19-33 - [c450]Daniel Barry, Anthony Danalis
, Jack J. Dongarra:
Automated Data Analysis for Defining Performance Metrics from Raw Hardware Events. IPDPS (Workshops) 2024: 716-725 - [e139]Leonardo Franco
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2024 - 24th International Conference, Malaga, Spain, July 2-4, 2024, Proceedings, Part I. Lecture Notes in Computer Science 14832, Springer 2024, ISBN 978-3-031-63748-3 [contents] - [e138]Leonardo Franco
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2024 - 24th International Conference, Malaga, Spain, July 2-4, 2024, Proceedings, Part II. Lecture Notes in Computer Science 14833, Springer 2024, ISBN 978-3-031-63753-7 [contents] - [e137]Leonardo Franco
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2024 - 24th International Conference, Malaga, Spain, July 2-4, 2024, Proceedings, Part III. Lecture Notes in Computer Science 14834, Springer 2024, ISBN 978-3-031-63758-2 [contents] - [e136]Leonardo Franco
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2024 - 24th International Conference, Malaga, Spain, July 2-4, 2024, Proceedings, Part IV. Lecture Notes in Computer Science 14835, Springer 2024, ISBN 978-3-031-63771-1 [contents] - [e135]Leonardo Franco
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2024 - 24th International Conference, Malaga, Spain, July 2-4, 2024, Proceedings, Part V. Lecture Notes in Computer Science 14836, Springer 2024, ISBN 978-3-031-63774-2 [contents] - [e134]Leonardo Franco
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2024 - 24th International Conference, Malaga, Spain, July 2-4, 2024, Proceedings, Part VI. Lecture Notes in Computer Science 14837, Springer 2024, ISBN 978-3-031-63777-3 [contents] - [e133]Leonardo Franco
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2024 - 24th International Conference, Malaga, Spain, July 2-4, 2024, Proceedings, Part VII. Lecture Notes in Computer Science 14838, Springer 2024, ISBN 978-3-031-63785-8 [contents] - [i27]Torsten Hoefler, Marcin Copik
, Pete Beckman, Andrew Jones
, Ian T. Foster, Manish Parashar, Daniel A. Reed, Matthias Troyer, Thomas C. Schulthess, Dan Ernst, Jack J. Dongarra:
XaaS: Acceleration as a Service to Enable Productive High-Performance Cloud Computing. CoRR abs/2401.04552 (2024) - [i26]Jack J. Dongarra, John A. Gunnels, Harun Bayraktar, Azzam Haidar, Dan Ernst:
Hardware Trends Impacting Floating-Point Computations In Scientific Applications. CoRR abs/2411.12090 (2024) - 2023
- [j339]Daniel A. Reed, Dennis Gannon, Jack J. Dongarra:
HPC Forecast: Cloudy and Uncertain. Commun. ACM 66(2): 82-90 (2023) - [j338]Jack J. Dongarra, Bernard Tourancheau:
Guest editors note: Special issue on clusters, clouds, and data for scientific computing. Int. J. High Perform. Comput. Appl. 37(3-4): 211-212 (2023) - [j337]Piotr Luszczek
, Wissam M. Sid-Lakhdar, Jack J. Dongarra:
Combining multitask and transfer learning with deep Gaussian processes for autotuning-based performance engineering. Int. J. High Perform. Comput. Appl. 37(3-4): 229-244 (2023) - [j336]Sergey V. Kovalchuk, Clélia de Mulatier, Derek Groen, Maciej Paszynski, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, Peter M. A. Sloot:
The computational planet. J. Comput. Sci. 72: 102102 (2023) - [c449]Neil Lindquist
, Piotr Luszczek
, Jack J. Dongarra
:
Using Additive Modifications in LU Factorization Instead of Pivoting. ICS 2023: 14-24 - [c448]Wissam M. Sid-Lakhdar, Sébastien Cayrols, Daniel Bielich, Ahmad Abdelfattah, Piotr Luszczek, Mark Gates
, Stanimire Tomov
, Hans Johansen, David B. Williams-Young
, Timothy A. Davis, Jack J. Dongarra, Hartwig Anzt:
PAQR: Pivoting Avoiding QR factorization. IPDPS 2023: 322-332 - [c447]Daniel Barry, Heike Jagode, Anthony Danalis, Jack J. Dongarra:
Memory Traffic and Complete Application Profiling with PAPI Multi-Component Measurements. IPDPS Workshops 2023: 393-402 - [c446]Ahmad Abdelfattah
, Stanimire Tomov
, Piotr Luszczek
, Hartwig Anzt
, Jack J. Dongarra
:
GPU-based LU Factorization and Solve on Batches of Matrices with Band Structure. SC Workshops 2023: 1670-1679 - [c445]Dalal Sukkari
, Mark Gates
, Mohammed A. Al Farhan
, Hartwig Anzt
, Jack J. Dongarra
:
Task-Based Polar Decomposition Using SLATE on Massively Parallel Systems with Hardware Accelerators. SC Workshops 2023: 1680-1687 - [e132]Jirí Mikyska
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2023 - 23rd International Conference, Prague, Czech Republic, July 3-5, 2023, Proceedings, Part I. Lecture Notes in Computer Science 14073, Springer 2023, ISBN 978-3-031-35994-1 [contents] - [e131]Jirí Mikyska
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2023 - 23rd International Conference, Prague, Czech Republic, July 3-5, 2023, Proceedings, Part II. Lecture Notes in Computer Science 14074, Springer 2023, ISBN 978-3-031-36020-6 [contents] - [e130]Jirí Mikyska
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2023 - 23rd International Conference, Prague, Czech Republic, July 3-5, 2023, Proceedings, Part III. Lecture Notes in Computer Science 14075, Springer 2023, ISBN 978-3-031-36023-7 [contents] - [e129]Jirí Mikyska
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2023 - 23rd International Conference, Prague, Czech Republic, July 3-5, 2023, Proceedings, Part IV. Lecture Notes in Computer Science 14076, Springer 2023, ISBN 978-3-031-36026-8 [contents] - [e128]Jirí Mikyska
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2023 - 23rd International Conference, Prague, Czech Republic, July 3-5, 2023, Proceedings, Part V. Lecture Notes in Computer Science 14077, Springer 2023, ISBN 978-3-031-36029-9 [contents] - [e127]Roman Wyrzykowski, Jack J. Dongarra, Ewa Deelman, Konrad Karczewski:
Parallel Processing and Applied Mathematics - 14th International Conference, PPAM 2022, Gdansk, Poland, September 11-14, 2022, Revised Selected Papers, Part I. Lecture Notes in Computer Science 13826, Springer 2023, ISBN 978-3-031-30441-5 [contents] - [e126]Roman Wyrzykowski, Jack J. Dongarra, Ewa Deelman, Konrad Karczewski:
Parallel Processing and Applied Mathematics - 14th International Conference, PPAM 2022, Gdansk, Poland, September 11-14, 2022, Revised Selected Papers, Part II. Lecture Notes in Computer Science 13827, Springer 2023, ISBN 978-3-031-30444-6 [contents] - [i25]Riley Murray
, James Demmel, Michael W. Mahoney, N. Benjamin Erichson, Maksim Melnichenko, Osman Asif Malik, Laura Grigori, Piotr Luszczek, Michal Derezinski
, Miles E. Lopes, Tianyu Liang, Hengrui Luo, Jack J. Dongarra:
Randomized Numerical Linear Algebra : A Perspective on the Field With an Eye to Software. CoRR abs/2302.11474 (2023) - [i24]Neil Lindquist, Piotr Luszczek, Jack J. Dongarra:
Generalizing Random Butterfly Transforms to Arbitrary Matrix Sizes. CoRR abs/2312.09376 (2023) - 2022
- [j335]Jack J. Dongarra:
The evolution of mathematical software. Commun. ACM 65(12): 66-72 (2022) - [j334]George Bosilca, Aurélien Bouteiller, Thomas Hérault
, Valentin Le Fèvre, Yves Robert, Jack J. Dongarra:
Comparing Distributed Termination Detection Algorithms for Modern HPC Platforms. Int. J. Netw. Comput. 12(1): 26-46 (2022) - [j333]Sergey V. Kovalchuk, Valeria V. Krzhizhanovskaya
, Maciej Paszynski, Dieter Kranzlmüller, Jack J. Dongarra, Peter M. A. Sloot:
Computational science for a better future. J. Comput. Sci. 62: 101745 (2022) - [j332]Dong Zhong
, Qinglei Cao
, George Bosilca
, Jack J. Dongarra
:
Using long vector extensions for MPI reductions. Parallel Comput. 109: 102871 (2022) - [j331]Sameh Abdulah
, Qinglei Cao
, Yu Pei, George Bosilca
, Jack J. Dongarra
, Marc G. Genton
, David E. Keyes
, Hatem Ltaief
, Ying Sun
:
Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach With PaRSEC. IEEE Trans. Parallel Distributed Syst. 33(4): 964-976 (2022) - [j330]Neil Lindquist
, Piotr Luszczek
, Jack J. Dongarra
:
Accelerating Restarted GMRES With Mixed Precision Arithmetic. IEEE Trans. Parallel Distributed Syst. 33(4): 1027-1037 (2022) - [j329]Qinglei Cao
, George Bosilca
, Nuria Losada, Wei Wu, Dong Zhong, Jack J. Dongarra
:
Evaluating Data Redistribution in PaRSEC. IEEE Trans. Parallel Distributed Syst. 33(8): 1856-1872 (2022) - [c444]Sébastien Cayrols, Jiali Li, George Bosilca, Stanimire Tomov
, Alan Ayala
, Jack J. Dongarra:
Lossy all-to-all exchange for accelerating parallel 3-D FFTs on hybrid architectures with GPUs. CLUSTER 2022: 152-160 - [c443]James Demmel, Jack J. Dongarra, Mark Gates
, Greg Henry, Julien Langou
, Xiaoye S. Li, Piotr Luszczek, Weslley S. Pereira, E. Jason Riedy, Cindy Rubio-González:
Proposed Consistent Exception Handling for the BLAS and LAPACK. Correctness@SC 2022: 1-9 - [c442]Jack J. Dongarra, Kenli Li, Hai Jin:
Message from the High Performance Computing and Communications 2022 General Chairs. HPCC/DSS/SmartCity/DependSys 2022: liv - [c441]Wissam M. Sid-Lakhdar, Mohsen Aznaveh, Piotr Luszczek, Jack J. Dongarra:
Deep Gaussian process with multitask and transfer learning for performance optimization. HPEC 2022: 1-7 - [c440]Ahmad Abdelfattah, Stan Tomov, Jack J. Dongarra:
Batch QR Factorization on GPUs: Design, Optimization, and Tuning. ICCS (1) 2022: 60-74 - [c439]Alan Ayala
, Stan Tomov, Miroslav Stoyanov
, Azzam Haidar, Jack J. Dongarra:
Performance Analysis of Parallel FFT on Large Multi-GPU Systems. IPDPS Workshops 2022: 372-381 - [c438]Qinglei Cao
, Rabab Alomairy
, Yu Pei, George Bosilca, Hatem Ltaief
, David E. Keyes
, Jack J. Dongarra:
A Framework to Exploit Data Sparsity in Tile Low-Rank Cholesky Factorization. IPDPS 2022: 414-424 - [c437]Mark Gates
, Asim YarKhan, Dalal Sukkari, Kadir Akbudak, Sébastien Cayrols, Daniel Bielich, Ahmad Abdelfattah, Mohammed A. Al Farhan, Jack J. Dongarra:
Portable and Efficient Dense Linear Algebra in the Beginning of the Exascale Era. P3HPC@SC 2022: 36-46 - [c436]Ichitaro Yamazaki, Christian Glusa, Jennifer A. Loe, Piotr Luszczek, Sivasankaran Rajamanickam, Jack J. Dongarra:
High-Performance GMRES Multi-Precision Benchmark: Design, Performance, and Challenges. PMBS@SC 2022: 112-122 - [c435]Yu Pei, George Bosilca, Jack J. Dongarra:
Sequential Task Flow Runtime Model Improvements and Limitations. ROSS 2022: 1-8 - [c434]Qinglei Cao
, Sameh Abdulah, Rabab Alomairy
, Yu Pei, Pratik Nag
, George Bosilca, Jack J. Dongarra, Marc G. Genton
, David E. Keyes
, Hatem Ltaief
, Ying Sun
:
Reshaping Geostatistical Modeling and Prediction for Extreme-Scale Environmental Applications. SC 2022: 2:1-2:12 - [c433]Ahmad Abdelfattah, Pieter Ghysels, Wajih Boukaram, Stanimire Tomov, Xiaoye Sherry Li, Jack J. Dongarra:
Addressing Irregular Patterns of Matrix Computations on GPUs and Their Impact on Applications Powered by Sparse Direct Solvers. SC 2022: 26:1-26:14 - [c432]Neil Lindquist
, Mark Gates
, Piotr Luszczek, Jack J. Dongarra:
Threshold Pivoting for Dense LU Factorization. ScalAH@SC 2022: 34-42 - [c431]Yaohung M. Tsai, Piotr Luszczek, Jack J. Dongarra:
Mixed-Precision Algorithm for Finding Selected Eigenvalues and Eigenvectors of Symmetric and Hermitian Matrices1. ScalAH@SC 2022: 43-50 - [e125]Derek Groen
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2022 - 22nd International Conference, London, UK, June 21-23, 2022, Proceedings, Part I. Lecture Notes in Computer Science 13350, Springer 2022, ISBN 978-3-031-08750-9 [contents] - [e124]Derek Groen
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2022 - 22nd International Conference, London, UK, June 21-23, 2022, Proceedings, Part II. Lecture Notes in Computer Science 13351, Springer 2022, ISBN 978-3-031-08753-0 [contents] - [e123]Derek Groen
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2022 - 22nd International Conference, London, UK, June 21-23, 2022, Proceedings, Part III. Lecture Notes in Computer Science 13352, Springer 2022, ISBN 978-3-031-08756-1 [contents] - [e122]Derek Groen
, Clélia de Mulatier
, Maciej Paszynski
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2022 - 22nd International Conference, London, UK, June 21-23, 2022, Proceedings, Part IV. Lecture Notes in Computer Science 13353, Springer 2022, ISBN 978-3-031-08759-2 [contents] - [d5]Mark Gates
, Asim YarKhan
, Dalal Sukkari
, Kadir Akbudak
, Sébastien Cayrols
, Daniel Bielich
, Ahmad Abdelfattah, Mohammed A. Al Farhan, Jack J. Dongarra
:
Reproducability Artifact for Running SLATE's GEMM and POTRF Operations on Summit and Crusher. Version 2. Zenodo, 2022 [all versions] - [d4]Mark Gates
, Asim YarKhan
, Dalal Sukkari
, Kadir Akbudak
, Sébastien Cayrols
, Daniel Bielich
, Mohammed A. Al Farhan, Jack J. Dongarra
:
Reproducability Artifact for Running SLATE's GEMM and POTRF Operations on Summit and Crusher. Version 1. Zenodo, 2022 [all versions] - [d3]Neil Lindquist
, Mark Gates
, Piotr Luszczek
, Jack J. Dongarra
:
Software for "Threshold Pivoting for dense LU Factorization". Version 2. Zenodo, 2022 [all versions] - [d2]Neil Lindquist
, Piotr Luszczek
, Jack J. Dongarra
:
Software for "Threshold Pivoting in LU Factorizations". Version 1. Zenodo, 2022 [all versions] - [i23]Daniel A. Reed, Dennis Gannon, Jack J. Dongarra:
Reinventing High Performance Computing: Challenges and Opportunities. CoRR abs/2203.02544 (2022) - [i22]James Demmel, Jack J. Dongarra, Mark Gates
, Greg Henry, Julien Langou, Xiaoye S. Li, Piotr Luszczek, Weslley da Silva Pereira, E. Jason Riedy, Cindy Rubio-González:
Proposed Consistent Exception Handling for the BLAS and LAPACK. CoRR abs/2207.09281 (2022) - 2021
- [j328]Zafar Iqbal
, Saeid Nooshabadi
, Ichitaro Yamazaki, Stanimire Tomov
, Jack J. Dongarra
:
Exploiting Block Structures of KKT Matrices for Efficient Solution of Convex Optimization Problems. IEEE Access 9: 116604-116611 (2021) - [j327]Ahmad Abdelfattah, Hartwig Anzt
, Erik G. Boman, Erin C. Carson, Terry Cojean
, Jack J. Dongarra, Alyson Fox
, Mark Gates
, Nicholas J. Higham, Xiaoye S. Li, Jennifer A. Loe
, Piotr Luszczek, Srikara Pranesh, Siva Rajamanickam, Tobias Ribizel
, Barry F. Smith, Kasia Swirydowicz, Stephen J. Thomas, Stanimire Tomov
, Yaohung M. Tsai, Ulrike Meier Yang
:
A survey of numerical linear algebra methods utilizing mixed-precision arithmetic. Int. J. High Perform. Comput. Appl. 35(4) (2021) - [j326]Tzanio V. Kolev
, Paul F. Fischer, Misun Min
, Jack J. Dongarra, Jed Brown
, Veselin Dobrev
, Tim Warburton
, Stanimire Tomov
, Mark S. Shephard
, Ahmad Abdelfattah, Valeria Barra
, Natalie Beams
, Jean-Sylvain Camier, Noel Chalmers, Yohann Dudouit
, Ali Karakus
, Ian Karlin, Stefan Kerkemeier, Yu-Hsiang Lan, David S. Medina, Elia Merzari
, Aleksandr Obabko, Will Pazner, Thilina Rathnayake
, Cameron W. Smith
, Lukas Spies
, Kasia Swirydowicz, Jeremy L. Thompson
, Ananias Tomboulides
, Vladimir Z. Tomov
:
Efficient exascale discretizations: High-order finite element methods. Int. J. High Perform. Comput. Appl. 35(6): 527-552 (2021) - [j325]Jack J. Dongarra
, Mark Gates
, Piotr Luszczek, Stanimire Tomov
:
Translational process: Mathematical software perspective. J. Comput. Sci. 52: 101216 (2021) - [j324]Sergey V. Kovalchuk, Valeria V. Krzhizhanovskaya
, Maciej Paszynski, Gábor Závodszky
, Michael Lees
, Jack J. Dongarra, Peter M. A. Sloot:
20 years of computational science: Selected papers from 2020 International Conference on Computational Science. J. Comput. Sci. 53: 101395 (2021) - [j323]Ahmad Abdelfattah, Timothy B. Costa, Jack J. Dongarra, Mark Gates
, Azzam Haidar, Sven Hammarling
, Nicholas J. Higham, Jakub Kurzak, Piotr Luszczek
, Stanimire Tomov
, Mawussi Zounon:
A Set of Batched Basic Linear Algebra Subprograms and LAPACK Routines. ACM Trans. Math. Softw. 47(3): 21:1-21:23 (2021) - [c430]Alan Ayala
, Stan Tomov
, Miroslav Stoyanov
, Azzam Haidar, Jack J. Dongarra:
Accelerating Multi - Process Communication for Parallel 3-D FFT. ExaMPI@SC 2021: 46-53 - [c429]Daniel Sharp
, Miroslav Stoyanov
, Stanimire Tomov
, Jack J. Dongarra:
A More Portable HeFFTe: Implementing a Fallback Algorithm for Scalable Fourier Transforms. HPEC 2021: 1-5 - [c428]Qinglei Cao
, Yu Pei, Kadir Akbudak
, George Bosilca, Hatem Ltaief
, David E. Keyes, Jack J. Dongarra:
Leveraging PaRSEC Runtime Support to Tackle Challenging 3D Data-Sparse Matrix Problems. IPDPS 2021: 79-89 - [c427]Thomas Hérault
, Yves Robert, George Bosilca, Robert J. Harrison, Cannada A. Lewis, Edward F. Valeev
, Jack J. Dongarra:
Distributed-memory multi-GPU block-sparse tensor contraction for electronic structure. IPDPS 2021: 537-546 - [c426]George Bosilca, Aurélien Bouteiller
, Thomas Hérault
, Valentin Le Fèvre, Yves Robert
, Jack J. Dongarra:
Revisiting Credit Distribution Algorithms for Distributed Termination Detection. IPDPS Workshops 2021: 611-620 - [c425]Alan Ayala
, Stanimire Tomov
, Miroslav Stoyanov, Jack J. Dongarra:
Scalability Issues in FFT Computation. PaCT 2021: 279-287 - [e121]Maciej Paszynski
, Dieter Kranzlmüller
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2021 - 21st International Conference, Krakow, Poland, June 16-18, 2021, Proceedings, Part I. Lecture Notes in Computer Science 12742, Springer 2021, ISBN 978-3-030-77960-3 [contents] - [e120]Maciej Paszynski
, Dieter Kranzlmüller
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2021 - 21st International Conference, Krakow, Poland, June 16-18, 2021, Proceedings, Part II. Lecture Notes in Computer Science 12743, Springer 2021, ISBN 978-3-030-77963-4 [contents] - [e119]Maciej Paszynski
, Dieter Kranzlmüller
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2021 - 21st International Conference, Krakow, Poland, June 16-18, 2021, Proceedings, Part III. Lecture Notes in Computer Science 12744, Springer 2021, ISBN 978-3-030-77966-5 [contents] - [e118]Maciej Paszynski
, Dieter Kranzlmüller
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2021 - 21st International Conference, Krakow, Poland, June 16-18, 2021, Proceedings, Part IV. Lecture Notes in Computer Science 12745, Springer 2021, ISBN 978-3-030-77969-6 [contents] - [e117]Maciej Paszynski
, Dieter Kranzlmüller
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2021 - 21st International Conference, Krakow, Poland, June 16-18, 2021, Proceedings, Part V. Lecture Notes in Computer Science 12746, Springer 2021, ISBN 978-3-030-77976-4 [contents] - [e116]Maciej Paszynski
, Dieter Kranzlmüller
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra
, Peter M. A. Sloot
:
Computational Science - ICCS 2021 - 21st International Conference, Krakow, Poland, June 16-18, 2021, Proceedings, Part VI. Lecture Notes in Computer Science 12747, Springer 2021, ISBN 978-3-030-77979-5 [contents] - [i21]Tzanio V. Kolev, Paul F. Fischer, Misun Min, Jack J. Dongarra, Jed Brown, Veselin Dobrev, Tim Warburton, Stanimire Tomov, Mark S. Shephard, Ahmad Abdelfattah, Valeria Barra, Natalie Beams, Jean-Sylvain Camier, Noel Chalmers, Yohann Dudouit, Ali Karakus, Ian Karlin, Stefan Kerkemeier, Yu-Hsiang Lan, David S. Medina, Elia Merzari, Aleksandr Obabko, Will Pazner, Thilina Rathnayake, Cameron W. Smith, Lukas Spies, Kasia Swirydowicz, Jeremy L. Thompson, Ananias Tomboulides, Vladimir Z. Tomov:
Efficient Exascale Discretizations: High-Order Finite Element Methods. CoRR abs/2109.04996 (2021) - 2020
- [j322]Yuechao Lu
, Ichitaro Yamazaki, Fumihiko Ino, Yasuyuki Matsushita
, Stanimire Tomov
, Jack J. Dongarra:
Reducing the amount of out-of-core data access for GPU-accelerated randomized SVD. Concurr. Comput. Pract. Exp. 32(19) (2020) - [j321]Mohammed A. Al Farhan
, Ahmad Abdelfattah, Stanimire Tomov
, Mark Gates
, Dalal Sukkari
, Azzam Haidar, Robert Rosenberg, Jack J. Dongarra:
MAGMA templates for scalable linear algebra on emerging architectures. Int. J. High Perform. Comput. Appl. 34(6) (2020) - [j320]Pedro J. S. Cardoso
, João M. F. Rodrigues
, Jânio M. Monteiro
, Roberto Lam, Valeria V. Krzhizhanovskaya
, Michael Harold Lees
, Jack J. Dongarra, Peter M. A. Sloot:
Computational Science in the Interconnected World: Selected papers from 2019 International Conference on Computational Science. J. Comput. Sci. 47: 101222 (2020) - [j319]Ahmad Abdelfattah, Stanimire Tomov
, Jack J. Dongarra:
Matrix multiplication on batches of small matrices in half and half-complex precisions. J. Parallel Distributed Comput. 145: 188-201 (2020) - [j318]Hartwig Anzt
, Terry Cojean
, Chen Yen-Chen, Jack J. Dongarra, Goran Flegar, Pratik Nayak
, Stanimire Tomov
, Yuhsiang M. Tsai, Weichung Wang:
Load-balancing Sparse Matrix Vector Product Kernels on GPUs. ACM Trans. Parallel Comput. 7(1): 2:1-2:26 (2020) - [c424]Dong Zhong, Pavel Shamis, Qinglei Cao
, George Bosilca, Shinji Sumimoto, Kenichi Miura, Jack J. Dongarra:
Using Arm Scalable Vector Extension to Optimize OPEN MPI. CCGRID 2020: 222-231 - [c423]Xi Luo, Wei Wu, George Bosilca, Yu Pei, Qinglei Cao
, Thananon Patinyasakdikul, Dong Zhong, Jack J. Dongarra:
HAN: a Hierarchical AutotuNed Collective Communication Framework. CLUSTER 2020: 23-34 - [c422]Qinglei Cao
, George Bosilca, Wei Wu, Dong Zhong, Aurelien Bouteiller
, Jack J. Dongarra:
Flexible Data Redistribution in a Task-Based Runtime System. CLUSTER 2020: 221-225 - [c421]Cade Brown, Ahmad Abdelfattah, Stanimire Tomov
, Jack J. Dongarra:
Design, Optimization, and Benchmarking of Dense Linear Algebra Algorithms on AMD GPUs. HPEC 2020: 1-7 - [c420]Piotr Luszczek, Yaohung M. Tsai, Neil Lindquist
, Hartwig Anzt
, Jack J. Dongarra:
Scalable Data Generation for Evaluating Mixed-Precision Solvers. HPEC 2020: 1-6 - [c419]Ahmad Abdelfattah, Stan Tomov
, Jack J. Dongarra:
Investigating the Benefit of FP16-Enabled Mixed-Precision Solvers for Symmetric Positive Definite Matrices Using GPUs. ICCS (2) 2020: 237-250 - [c418]Alan Ayala
, Stanimire Tomov
, Azzam Haidar, Jack J. Dongarra:
heFFTe: Highly Efficient FFT for Exascale. ICCS (1) 2020: 262-275 - [c417]Yu Pei, Qinglei Cao
, George Bosilca, Piotr Luszczek, Victor Eijkhout, Jack J. Dongarra:
Communication Avoiding 2D Stencil Implementations over PaRSEC Task-Based Runtime. IPDPS Workshops 2020: 721-729 - [c416]Florent Lopez, Edmond Chow, Stanimire Tomov
, Jack J. Dongarra:
Asynchronous SGD for DNN training on Shared-memory Parallel Architectures. IPDPS Workshops 2020: 995-998 - [c415]Qinglei Cao
, Yu Pei, Kadir Akbudak
, Aleksandr Mikhalev
, George Bosilca, Hatem Ltaief
, David E. Keyes
, Jack J. Dongarra:
Extreme-Scale Task-Based Cholesky Factorization Toward Climate and Weather Prediction Applications. PASC 2020: 2:1-2:11 - [c414]Hartwig Anzt
, Yuhsiang M. Tsai, Ahmad Abdelfattah, Terry Cojean
, Jack J. Dongarra:
Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse and Batched Computations. PMBS@SC 2020: 26-38 - [c413]Dong Zhong, Qinglei Cao
, George Bosilca, Jack J. Dongarra:
Using Advanced Vector Extensions AVX-512 for MPI Reductions. EuroMPI 2020: 1-10 - [c412]Neil Lindquist
, Piotr Luszczek, Jack J. Dongarra:
Replacing Pivoting in Distributed Gaussian Elimination with Randomized Techniques. ScalA@SC 2020: 35-43 - [c411]Natalie Beams
, Ahmad Abdelfattah, Stan Tomov
, Jack J. Dongarra, Tzanio V. Kolev, Yohann Dudouit:
High-Order Finite Element Method using Standard and Device-Level Batch GEMM on GPUs. ScalA@SC 2020: 53-60 - [c410]Rick Archibald
, Edmond Chow, Eduardo F. D'Azevedo, Jack J. Dongarra, Markus Eisenbach, Rocco Febbo, Florent Lopez, Daniel Nichols
, Stanimire Tomov
, Kwai Wong, Junqi Yin:
Integrating Deep Learning in Domain Sciences at Exascale. SMC 2020: 35-50 - [c409]Neil Lindquist
, Piotr Luszczek, Jack J. Dongarra
:
Improving the Performance of the GMRES Method Using Mixed-Precision Techniques. SMC 2020: 51-66 - [e115]Valeria V. Krzhizhanovskaya
, Gábor Závodszky
, Michael Harold Lees, Jack J. Dongarra
, Peter M. A. Sloot
, Sérgio Brissos, João Teixeira:
Computational Science - ICCS 2020 - 20th International Conference, Amsterdam, The Netherlands, June 3-5, 2020, Proceedings, Part I. Lecture Notes in Computer Science 12137, Springer 2020, ISBN 978-3-030-50370-3 [contents] - [e114]Valeria V. Krzhizhanovskaya
, Gábor Závodszky
, Michael Harold Lees, Jack J. Dongarra
, Peter M. A. Sloot
, Sérgio Brissos, João Teixeira:
Computational Science - ICCS 2020 - 20th International Conference, Amsterdam, The Netherlands, June 3-5, 2020, Proceedings, Part II. Lecture Notes in Computer Science 12138, Springer 2020, ISBN 978-3-030-50416-8 [contents] - [e113]Valeria V. Krzhizhanovskaya
, Gábor Závodszky
, Michael Harold Lees, Jack J. Dongarra
, Peter M. A. Sloot
, Sérgio Brissos, João Teixeira:
Computational Science - ICCS 2020 - 20th International Conference, Amsterdam, The Netherlands, June 3-5, 2020, Proceedings, Part III. Lecture Notes in Computer Science 12139, Springer 2020, ISBN 978-3-030-50419-9 [contents] - [e112]Valeria V. Krzhizhanovskaya
, Gábor Závodszky
, Michael Harold Lees, Jack J. Dongarra
, Peter M. A. Sloot
, Sérgio Brissos, João Teixeira:
Computational Science - ICCS 2020 - 20th International Conference, Amsterdam, The Netherlands, June 3-5, 2020, Proceedings, Part IV. Lecture Notes in Computer Science 12140, Springer 2020, ISBN 978-3-030-50422-9 [contents] - [e111]Valeria V. Krzhizhanovskaya
, Gábor Závodszky
, Michael Harold Lees, Jack J. Dongarra
, Peter M. A. Sloot
, Sérgio Brissos, João Teixeira:
Computational Science - ICCS 2020 - 20th International Conference, Amsterdam, The Netherlands, June 3-5, 2020, Proceedings, Part V. Lecture Notes in Computer Science 12141, Springer 2020, ISBN 978-3-030-50425-0 [contents] - [e110]Valeria V. Krzhizhanovskaya
, Gábor Závodszky
, Michael Harold Lees, Jack J. Dongarra
, Peter M. A. Sloot
, Sérgio Brissos, João Teixeira:
Computational Science - ICCS 2020 - 20th International Conference, Amsterdam, The Netherlands, June 3-5, 2020, Proceedings, Part VI. Lecture Notes in Computer Science 12142, Springer 2020, ISBN 978-3-030-50432-8 [contents] - [e109]Valeria V. Krzhizhanovskaya
, Gábor Závodszky
, Michael Harold Lees, Jack J. Dongarra
, Peter M. A. Sloot
, Sérgio Brissos, João Teixeira:
Computational Science - ICCS 2020 - 20th International Conference, Amsterdam, The Netherlands, June 3-5, 2020, Proceedings, Part VII. Lecture Notes in Computer Science 12143, Springer 2020, ISBN 978-3-030-50435-9 [contents] - [e108]Roman Wyrzykowski, Ewa Deelman, Jack J. Dongarra, Konrad Karczewski:
Parallel Processing and Applied Mathematics - 13th International Conference, PPAM 2019, Bialystok, Poland, September 8-11, 2019, Revised Selected Papers, Part I. Lecture Notes in Computer Science 12043, Springer 2020, ISBN 978-3-030-43228-7 [contents] - [e107]Roman Wyrzykowski, Ewa Deelman, Jack J. Dongarra, Konrad Karczewski:
Parallel Processing and Applied Mathematics - 13th International Conference, PPAM 2019, Bialystok, Poland, September 8-11, 2019, Revised Selected Papers, Part II. Lecture Notes in Computer Science 12044, Springer 2020, ISBN 978-3-030-43221-8 [contents] - [d1]Neil Lindquist
, Piotr Luszczek
, Jack J. Dongarra
:
Software for Linear Algebra Targeting Exascale (SLATE) with a Recursive Butterfly Transform based solver. Zenodo, 2020 - [i20]Ahmad Abdelfattah, Hartwig Anzt, Erik G. Boman, Erin C. Carson, Terry Cojean
, Jack J. Dongarra, Mark Gates, Thomas Grützmacher, Nicholas J. Higham, Xiaoye Sherry Li, Neil Lindquist, Yang Liu, Jennifer A. Loe, Piotr Luszczek, Pratik Nayak, Srikara Pranesh, Sivasankaran Rajamanickam, Tobias Ribizel, Barry Smith, Kasia Swirydowicz, Stephen J. Thomas, Stanimire Tomov, Yaohung M. Tsai, Ichitaro Yamazaki, Ulrike Meier Yang:
A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic. CoRR abs/2007.06674 (2020) - [i19]Neil Lindquist, Piotr Luszczek, Jack J. Dongarra:
Improving the Performance of the GMRES Method using Mixed-Precision Techniques. CoRR abs/2011.01850 (2020) - [i18]Rick Archibald, Edmond Chow, Eduardo F. D'Azevedo, Jack J. Dongarra, Markus Eisenbach, Rocco Febbo, Florent Lopez, Daniel Nichols, Stanimire Tomov, Kwai Wong, Junqi Yin:
Integrating Deep Learning in Domain Sciences at Exascale. CoRR abs/2011.11188 (2020)
2010 – 2019
- 2019
- [j317]Hartwig Anzt
, Jack J. Dongarra, Goran Flegar
, Nicholas J. Higham, Enrique S. Quintana-Ortí
:
Adaptive precision in block-Jacobi preconditioning for iterative sparse linear system solvers. Concurr. Comput. Pract. Exp. 31(6) (2019) - [j316]Azzam Haidar
, Heike Jagode
, Phil Vaccaro, Asim YarKhan
, Stanimire Tomov
, Jack J. Dongarra:
Investigating power capping toward energy-efficient scientific applications. Concurr. Comput. Pract. Exp. 31(6) (2019) - [j315]Jack J. Dongarra
, Steven Gottlieb, William T. C. Kramer:
Race to Exascale. Comput. Sci. Eng. 21(1): 4-5 (2019) - [j314]Ichitaro Yamazaki
, Akihiro Ida, Rio Yokota
, Jack J. Dongarra:
Distributed-memory lattice H-matrix factorization. Int. J. High Perform. Comput. Appl. 33(5) (2019) - [j313]Jack J. Dongarra, Bernard Tourancheau:
Guest editors' note: Special issue on clusters, clouds, and data for scientific computing. Int. J. High Perform. Comput. Appl. 33(6) (2019) - [j312]Heike Jagode
, Anthony Danalis, Hartwig Anzt
, Jack J. Dongarra:
PAPI software-defined events for in-depth performance analysis. Int. J. High Perform. Comput. Appl. 33(6) (2019) - [j311]M. Graham Lopez
, Wayne Joubert, Verónica G. Vergara Larrea
, Oscar R. Hernandez, Azzam Haidar, Stanimire Tomov, Jack J. Dongarra:
Evaluation of directive-based performance portable programming models. Int. J. High Perform. Comput. Netw. 14(2): 165-182 (2019) - [j310]Thomas Hérault
, Yves Robert, Aurélien Bouteiller, Dorian C. Arnold, Kurt B. Ferreira, George Bosilca, Jack J. Dongarra:
Checkpointing Strategies for Shared High-Performance Computing Platforms. Int. J. Netw. Comput. 9(1): 28-52 (2019) - [j309]Sergey V. Kovalchuk, Valeria V. Krzhizhanovskaya
, Yong Shi, Haohuan Fu, Michael Harold Lees
, Jack J. Dongarra, Peter M. A. Sloot:
Science at the intersection of data, modelling, and computation. J. Comput. Sci. 34: 117-119 (2019) - [j308]Hartwig Anzt
, Jack J. Dongarra, Enrique S. Quintana-Ortí
:
Fine-grained bit-flip protection for relaxation methods. J. Comput. Sci. 36 (2019) - [j307]Ian Masliah, Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov
, Marc Baboulin, Joël Falcou, Jack J. Dongarra:
Algorithms and optimization techniques for high-performance matrix-matrix multiplications of very small matrices. Parallel Comput. 81: 1-21 (2019) - [j306]Hartwig Anzt
, Jack J. Dongarra, Goran Flegar
, Enrique S. Quintana-Ortí
:
Variable-size batched Gauss-Jordan elimination for block-Jacobi preconditioning on graphics processors. Parallel Comput. 81: 131-146 (2019) - [j305]Valentin Le Fèvre, Thomas Hérault
, Yves Robert
, Aurélien Bouteiller
, Atsushi Hori
, George Bosilca
, Jack J. Dongarra:
Comparing the performance of rigid, moldable and grid-shaped applications on failure-prone HPC platforms. Parallel Comput. 85: 1-12 (2019) - [j304]Ichitaro Yamazaki, Edmond Chow, Aurélien Bouteiller
, Jack J. Dongarra:
Performance of asynchronous optimized Schwarz with one-sided communication. Parallel Comput. 86: 66-81 (2019) - [j303]Jack J. Dongarra, Mark Gates
, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Panruo Wu
, Ichitaro Yamazaki, Asim YarKhan
, Maksims Abalenkovs, Negin Bagherpour
, Sven Hammarling, Jakub Sístek
, David Stevens, Mawussi Zounon, Samuel D. Relton
:
PLASMA: Parallel Linear Algebra Software for Multicore Using OpenMP. ACM Trans. Math. Softw. 45(2): 16:1-16:35 (2019) - [j302]Dmitry Zaitsev
, Stanimire Tomov
, Jack J. Dongarra
:
Solving Linear Diophantine Systems on Parallel Architectures. IEEE Trans. Parallel Distributed Syst. 30(5): 1158-1169 (2019) - [c408]Jakub Kurzak
, Mark Gates
, Ali Charara
, Asim YarKhan
, Ichitaro Yamazaki
, Jack J. Dongarra
:
Linear Systems Solvers for Distributed-Memory Machines with GPU Accelerators. Euro-Par 2019: 495-506 - [c407]Ahmad Abdelfattah, Stanimire Tomov
, Jack J. Dongarra:
Progressive Optimization of Batched LU Factorization on GPUs. HPEC 2019: 1-6 - [c406]Piotr Luszczek, Ichitaro Yamazaki, Jack J. Dongarra:
Increasing Accuracy of Iterative Refinement in Limited Floating-Point Arithmetic on Half-Precision Accelerators. HPEC 2019: 1-6 - [c405]Jakub Kurzak, Yaohung M. Tsai, Mark Gates
, Ahmad Abdelfattah, Jack J. Dongarra:
Massively Parallel Automated Software Tuning. ICPP 2019: 92:1-92:10 - [c404]Jakub Kurzak, Mark Gates
, Ali Charara, Asim YarKhan
, Jack J. Dongarra:
Least squares solvers for distributed-memory machines with GPU accelerators. ICS 2019: 117-126 - [c403]Ahmad Abdelfattah, Stanimire Tomov
, Jack J. Dongarra:
Fast Batched Matrix Multiplication for Small Sizes Using Half-Precision Arithmetic on GPUs. IPDPS 2019: 111-122 - [c402]Hartwig Anzt
, Tobias Ribizel
, Goran Flegar, Edmond Chow, Jack J. Dongarra:
ParILUT - A Parallel Threshold ILU for GPUs. IPDPS 2019: 231-241 - [c401]Anthony Danalis, Heike Jagode
, Thomas Hérault
, Piotr Luszczek, Jack J. Dongarra:
Software-Defined Events through PAPI. IPDPS Workshops 2019: 363-372 - [c400]Ichitaro Yamazaki, Zhaojun Bai, Ding Lu
, Jack J. Dongarra:
Matrix Powers Kernels for Thick-Restart Lanczos with Explicit External Deflation. IPDPS 2019: 472-481 - [c399]Joshua Hoke Davis
, Tao Gao, Sunita Chandrasekaran, Heike Jagode, Anthony Danalis, Jack J. Dongarra, Pavan Balaji, Michela Taufer
:
Characterization of Power Usage and Performance in Data-Intensive Applications Using MapReduce over MPI. PARCO 2019: 287-298 - [c398]Hartwig Anzt
, Yen-Chen Chen, Terry Cojean
, Jack J. Dongarra, Goran Flegar
, Pratik Nayak
, Enrique S. Quintana-Ortí, Yuhsiang M. Tsai, Weichung Wang:
Towards Continuous Benchmarking: An Automated Performance Evaluation Framework for High Performance Software. PASC 2019: 9:1-9:11 - [c397]Ahmad Abdelfattah, Stanimire Tomov
, Jack J. Dongarra:
Towards Half-Precision Computation for Complex Matrices: A Case Study for Mixed Precision Solvers on GPUs. ScalA@SC 2019: 17-24 - [c396]Qinglei Cao
, Yu Pei, Thomas Hérault
, Kadir Akbudak
, Aleksandr Mikhalev
, George Bosilca, Hatem Ltaief
, David E. Keyes
, Jack J. Dongarra:
Performance Analysis of Tile Low-Rank Cholesky Factorization Using PaRSEC Instrumentation Tools. ProTools@SC 2019: 25-32 - [c395]Yu Pei, George Bosilca, Ichitaro Yamazaki, Akihiro Ida, Jack J. Dongarra:
Evaluation of Programming Models to Address Load Imbalance on Distributed Multi-Core CPUs: A Case Study with Block Low-Rank Factorization. PAW-ATM@SC 2019: 25-36 - [c394]Mark Gates
, Jakub Kurzak, Ali Charara, Asim YarKhan
, Jack J. Dongarra:
SLATE: design of a modern distributed and accelerated linear algebra library. SC 2019: 26:1-26:18 - [c393]Thomas Hérault
, Yves Robert
, George Bosilca, Jack J. Dongarra:
Generic Matrix Multiplication for Multi-GPU Accelerated Distributed-Memory Platforms over PaRSEC. ScalA@SC 2019: 33-41 - [c392]Daniel Nichols
, Nathalie-Sofia Tomov, Frank Betancourt, Stanimire Tomov
, Kwai Wong, Jack J. Dongarra:
MagmaDNN: Towards High-Performance Data Analytics and Machine Learning for Data-Driven Scientific Computing. ISC Workshops 2019: 490-503 - [c391]Kwai Wong, Stanimire Tomov
, Jack J. Dongarra:
Hands-On Research and Training in High Performance Data Sciences, Data Analytics, and Machine Learning for Emerging Environments. ISC Workshops 2019: 643-655 - [e106]João M. F. Rodrigues
, Pedro J. S. Cardoso
, Jânio M. Monteiro, Roberto Lam, Valeria V. Krzhizhanovskaya, Michael Harold Lees, Jack J. Dongarra, Peter M. A. Sloot:
Computational Science - ICCS 2019 - 19th International Conference, Faro, Portugal, June 12-14, 2019, Proceedings, Part I. Lecture Notes in Computer Science 11536, Springer 2019, ISBN 978-3-030-22733-3 [contents] - [e105]João M. F. Rodrigues, Pedro J. S. Cardoso
, Jânio M. Monteiro, Roberto Lam, Valeria V. Krzhizhanovskaya, Michael Harold Lees, Jack J. Dongarra, Peter M. A. Sloot:
Computational Science - ICCS 2019 - 19th International Conference, Faro, Portugal, June 12-14, 2019, Proceedings, Part II. Lecture Notes in Computer Science 11537, Springer 2019, ISBN 978-3-030-22740-1 [contents] - [e104]João M. F. Rodrigues
, Pedro J. S. Cardoso
, Jânio M. Monteiro, Roberto Lam, Valeria V. Krzhizhanovskaya, Michael Harold Lees, Jack J. Dongarra, Peter M. A. Sloot:
Computational Science - ICCS 2019 - 19th International Conference, Faro, Portugal, June 12-14, 2019, Proceedings, Part III. Lecture Notes in Computer Science 11538, Springer 2019, ISBN 978-3-030-22743-2 [contents] - [e103]João M. F. Rodrigues
, Pedro J. S. Cardoso
, Jânio M. Monteiro, Roberto Lam, Valeria V. Krzhizhanovskaya, Michael Harold Lees, Jack J. Dongarra, Peter M. A. Sloot:
Computational Science - ICCS 2019 - 19th International Conference, Faro, Portugal, June 12-14, 2019, Proceedings, Part IV. Lecture Notes in Computer Science 11539, Springer 2019, ISBN 978-3-030-22746-3 [contents] - [e102]João M. F. Rodrigues
, Pedro J. S. Cardoso
, Jânio M. Monteiro
, Roberto Lam, Valeria V. Krzhizhanovskaya, Michael Harold Lees, Jack J. Dongarra, Peter M. A. Sloot:
Computational Science - ICCS 2019 - 19th International Conference, Faro, Portugal, June 12-14, 2019, Proceedings, Part V. Lecture Notes in Computer Science 11540, Springer 2019, ISBN 978-3-030-22749-4 [contents] - 2018
- [j301]Jack J. Dongarra, Vladimir Getov
, Kevin Walsh:
The 30th Anniversary of the Supercomputing Conference: Bringing the Future Closer - Supercomputing History and the Immortality of Now. Computer 51(10): 74-85 (2018) - [j300]Heike Jagode
, Anthony Danalis, Reazul Hoque, Mathieu Faverge, Jack J. Dongarra:
Evaluation of dataflow programming models for electronic structure theory. Concurr. Comput. Pract. Exp. 30(17) (2018) - [j299]Joseph Dorris, Asim YarKhan
, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra:
Task based Cholesky decomposition on Xeon Phi architectures using OpenMP. Int. J. Comput. Sci. Eng. 17(3): 310-323 (2018) - [j298]Jack J. Dongarra, Bernard Tourancheau:
Guest editors' note. Int. J. High Perform. Comput. Appl. 32(1): 3 (2018) - [j297]George Bosilca, Aurélien Bouteiller
, Amina Guermouche, Thomas Hérault
, Yves Robert
, Pierre Sens, Jack J. Dongarra:
A failure detector for HPC platforms. Int. J. High Perform. Comput. Appl. 32(1): 139-158 (2018) - [j296]Hartwig Anzt
, Moritz Kreutzer, Eduardo Ponce, Gregory D. Peterson, Gerhard Wellein
, Jack J. Dongarra:
Optimization and performance evaluation of the IDR iterative Krylov solver on GPUs. Int. J. High Perform. Comput. Appl. 32(2): 220-230 (2018) - [j295]Mark Asch
, Terry Moore, Rosa M. Badia
, Micah Beck, Peter H. Beckman, T. Bidot, François Bodin, Franck Cappello, Alok N. Choudhary, Bronis R. de Supinski, Ewa Deelman, Jack J. Dongarra
, Anshu Dubey
, Geoffrey C. Fox, H. Fu, Sergi Girona
, William Gropp
, Michael A. Heroux, Yutaka Ishikawa, Katarzyna Keahey, David E. Keyes, Bill Kramer, J.-F. Lavignon, Y. Lu, Satoshi Matsuoka, Bernd Mohr, Daniel A. Reed, S. Requena, Joel H. Saltz, Thomas C. Schulthess, Rick L. Stevens, D. Martin Swany
, Alexander S. Szalay, William M. Tang, G. Varoquaux, Jean-Pierre Vilotte
, Robert W. Wisniewski, Z. Xu, Igor Zacharov
:
Big data and extreme-scale computing. Int. J. High Perform. Comput. Appl. 32(4): 435-479 (2018) - [j294]Heike Jagode
, Anthony Danalis, Jack J. Dongarra:
Accelerating NWChem Coupled Cluster through dataflow-based execution. Int. J. High Perform. Comput. Appl. 32(4): 540-551 (2018) - [j293]Sergey V. Kovalchuk, Valeria V. Krzhizhanovskaya
, Petros Koumoutsakos
, Eleni N. Chatzi, Michael Harold Lees
, Jack J. Dongarra, Peter M. A. Sloot:
The art of computational science: Bridging gaps - forming alloys. J. Comput. Sci. 26: 190-192 (2018) - [j292]Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
Batched one-sided factorizations of tiny matrices using GPUs: Challenges and countermeasures. J. Comput. Sci. 26: 226-236 (2018) - [j291]Tingxing Dong, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
Accelerating the SVD bi-diagonalization of a batch of small matrices using GPUs. J. Comput. Sci. 26: 237-245 (2018) - [j290]Edmond Chow, Hartwig Anzt
, Jennifer A. Scott, Jack J. Dongarra:
Using Jacobi iterations and blocking for solving sparse triangular systems in incomplete factorization preconditioning. J. Parallel Distributed Comput. 119: 219-230 (2018) - [j289]Zuoning Chen, Jack J. Dongarra, Zhiwei Xu:
Post-exascale supercomputing: research opportunities abound. Frontiers Inf. Technol. Electron. Eng. 19(10): 1203-1208 (2018) - [j288]Hartwig Anzt
, Thomas K. Huckle, Jürgen Bräckle, Jack J. Dongarra:
Incomplete Sparse Approximate Inverses for Parallel Preconditioning. Parallel Comput. 71: 1-22 (2018) - [j287]Mark Gates
, Stanimire Tomov
, Jack J. Dongarra:
Accelerating the SVD two stage bidiagonal reduction and divide and conquer using GPUs. Parallel Comput. 74: 3-18 (2018) - [j286]Franz Franchetti
, José M. F. Moura, David A. Padua, Jack J. Dongarra:
From High-Level Specification to High-Performance Code. Proc. IEEE 106(11): 1875-1878 (2018) - [j285]Jack J. Dongarra
, Mark Gates
, Jakub Kurzak
, Piotr Luszczek
, Yaohung M. Tsai:
Autotuning Numerical Dense Linear Algebra for Batched Computation With GPU Hardware Accelerators. Proc. IEEE 106(11): 2040-2055 (2018) - [j284]Prasanna Balaprakash
, Jack J. Dongarra
, Todd Gamblin, Mary W. Hall
, Jeffrey K. Hollingsworth, Boyana Norris
, Richard W. Vuduc
:
Autotuning in High-Performance Computing Applications. Proc. IEEE 106(11): 2068-2083 (2018) - [j283]Jack J. Dongarra, Mark Gates
, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov
, Ichitaro Yamazaki:
The Singular Value Decomposition: Anatomy of Optimizing an Algorithm for Extreme Scale. SIAM Rev. 60(4): 808-865 (2018) - [j282]Hartwig Anzt
, Edmond Chow
, Jack J. Dongarra:
ParILUT - A New Parallel Threshold ILU Factorization. SIAM J. Sci. Comput. 40(4): C503-C519 (2018) - [j281]Alexander S. Antonov, Jack J. Dongarra, Vladimir V. Voevodin:
AlgoWiki Project as an Extension of the Top500 Methodology. Supercomput. Front. Innov. 5(1): 4-10 (2018) - [j280]Piotr Luszczek, Jakub Kurzak, Ichitaro Yamazaki, David J. Keffer
, Vasileios Maroulas
, Jack J. Dongarra:
Autotuning Techniques for Performance-Portable Point Set Registration in 3D. Supercomput. Front. Innov. 5(4): 42-61 (2018) - [j279]Azzam Haidar, Ahmad Abdelfattah
, Mawussi Zounon
, Stanimire Tomov
, Jack J. Dongarra:
A Guide for Achieving High Performance with Very Small Matrices on GPU: A Case Study of Batched LU and Cholesky Factorizations. IEEE Trans. Parallel Distributed Syst. 29(5): 973-984 (2018) - [j278]Ichitaro Yamazaki
, Jakub Kurzak
, Panruo Wu
, Mawussi Zounon
, Jack J. Dongarra
:
Symmetric Indefinite Linear Solver Using OpenMP Task on Multicore Architectures. IEEE Trans. Parallel Distributed Syst. 29(8): 1879-1892 (2018) - [j277]Ahmad Abdelfattah
, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra
:
Analysis and Design Techniques towards High-Performance and Energy-Efficient Dense Linear Solvers on GPUs. IEEE Trans. Parallel Distributed Syst. 29(12): 2700-2712 (2018) - [c390]Valentin Le Fèvre, George Bosilca, Aurélien Bouteiller
, Thomas Hérault
, Atsushi Hori, Yves Robert
, Jack J. Dongarra:
Do Moldable Applications Perform Better on Failure-Prone HPC Platforms? Euro-Par Workshops 2018: 787-799 - [c389]Xi Luo, Wei Wu
, George Bosilca, Thananon Patinyasakdikul, Linnan Wang, Jack J. Dongarra:
ADAPT: an event-based adaptive collective communication framework. HPDC 2018: 118-130 - [c388]Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
Optimizing GPU Kernels for Irregular Batch Workloads: A Case Study for Cholesky Factorization. HPEC 2018: 1-7 - [c387]Azzam Haidar, Ahmad Abdelfattah, Mawussi Zounon, Panruo Wu
, Srikara Pranesh, Stanimire Tomov
, Jack J. Dongarra:
The Design of Fast and Energy-Efficient Linear Solvers: On the Potential of Half-Precision Arithmetic and Iterative Refinement Techniques. ICCS (1) 2018: 586-600 - [c386]Thomas Hérault
, Yves Robert
, Aurélien Bouteiller
, Dorian C. Arnold, Kurt B. Ferreira, George Bosilca, Jack J. Dongarra:
Optimal Cooperative Checkpointing for Shared High-Performance Computing Platforms. IPDPS Workshops 2018: 803-812 - [c385]Ichitaro Yamazaki, Ahmad Abdelfattah, Akihiro Ida, Satoshi Ohshima
, Stanimire Tomov
, Rio Yokota, Jack J. Dongarra:
Performance of Hierarchical-matrix BiCGStab Solver on GPU Clusters. IPDPS 2018: 930-939 - [c384]Hartwig Anzt
, Jack J. Dongarra, Goran Flegar
, Thomas Grützmacher
:
Variable-Size Batched Condition Number Calculation on GPUs. SBAC-PAD 2018: 132-139 - [c383]Hartwig Anzt
, Jack J. Dongarra:
A Jaccard Weights Kernel Leveraging Independent Thread Scheduling on GPUs. SBAC-PAD 2018: 229-232 - [c382]Azzam Haidar, Stanimire Tomov, Jack J. Dongarra, Nicholas J. Higham:
Harnessing GPU tensor cores for fast FP16 arithmetic to speed up mixed-precision iterative refinement solvers. SC 2018: 47:1-47:11 - [e101]Yong Shi, Haohuan Fu, Yingjie Tian, Valeria V. Krzhizhanovskaya, Michael Harold Lees, Jack J. Dongarra, Peter M. A. Sloot:
Computational Science - ICCS 2018 - 18th International Conference, Wuxi, China, June 11-13, 2018, Proceedings, Part I. Lecture Notes in Computer Science 10860, Springer 2018, ISBN 978-3-319-93697-0 [contents] - [e100]Yong Shi, Haohuan Fu, Yingjie Tian, Valeria V. Krzhizhanovskaya, Michael Harold Lees, Jack J. Dongarra, Peter M. A. Sloot:
Computational Science - ICCS 2018 - 18th International Conference, Wuxi, China, June 11-13, 2018, Proceedings, Part II. Lecture Notes in Computer Science 10861, Springer 2018, ISBN 978-3-319-93700-7 [contents] - [e99]Yong Shi, Haohuan Fu, Yingjie Tian, Valeria V. Krzhizhanovskaya, Michael Harold Lees, Jack J. Dongarra, Peter M. A. Sloot:
Computational Science - ICCS 2018 - 18th International Conference, Wuxi, China, June 11-13, 2018 Proceedings, Part III. Lecture Notes in Computer Science 10862, Springer 2018, ISBN 978-3-319-93712-0 [contents] - [e98]Roman Wyrzykowski, Jack J. Dongarra, Ewa Deelman, Konrad Karczewski:
Parallel Processing and Applied Mathematics - 12th International Conference, PPAM 2017, Lublin, Poland, September 10-13, 2017, Revised Selected Papers, Part I. Lecture Notes in Computer Science 10777, Springer 2018, ISBN 978-3-319-78023-8 [contents] - [e97]Roman Wyrzykowski, Jack J. Dongarra, Ewa Deelman, Konrad Karczewski:
Parallel Processing and Applied Mathematics - 12th International Conference, PPAM 2017, Lublin, Poland, September 10-13, 2017, Revised Selected Papers, Part II. Lecture Notes in Computer Science 10778, Springer 2018, ISBN 978-3-319-78053-5 [contents] - [i17]Linnan Wang, Wei Wu, Yiyang Zhao, Junyu Zhang, Hang Liu, George Bosilca, Jack J. Dongarra, Maurice Herlihy, Rodrigo Fonseca:
SuperNeurons: FFT-based Gradient Sparsification in the Distributed Training of Deep Neural Networks. CoRR abs/1811.08596 (2018) - 2017
- [j276]Ichitaro Yamazaki, Stanimire Tomov
, Jack J. Dongarra:
Non-GPU-resident symmetric indefinite factorization. Concurr. Comput. Pract. Exp. 29(5) (2017) - [j275]Marc Baboulin, Jack J. Dongarra, Adrien Rémy, Stanimire Tomov
, Ichitaro Yamazaki:
Solving dense symmetric indefinite systems using GPUs. Concurr. Comput. Pract. Exp. 29(9) (2017) - [j274]Jack J. Dongarra, Stanimire Tomov
, Piotr Luszczek, Jakub Kurzak, Mark Gates
, Ichitaro Yamazaki, Hartwig Anzt
, Azzam Haidar, Ahmad Abdelfattah:
With Extreme Computing, the Rules Have Changed. Comput. Sci. Eng. 19(3): 52-62 (2017) - [j273]Ichitaro Yamazaki, Saeid Nooshabadi
, Stanimire Tomov
, Jack J. Dongarra:
Structure-Aware Linear Solver for Realtime Convex Optimization for Embedded Systems. IEEE Embed. Syst. Lett. 9(3): 61-64 (2017) - [j272]Jack J. Dongarra, Bernard Tourancheau:
Guest Editor's Note: Special Issue on Clusters, Clouds and Data for Scientific Computing. Int. J. High Perform. Comput. Appl. 31(1): 3 (2017) - [j271]Hartwig Anzt
, Stanimire Tomov
, Jack J. Dongarra:
On the performance and energy efficiency of sparse linear algebra on GPUs. Int. J. High Perform. Comput. Appl. 31(5): 375-390 (2017) - [j270]Gordon Bell, David H. Bailey, Jack J. Dongarra, Alan H. Karp, Kevin Walsh:
A look back on 30 years of the Gordon Bell Prize. Int. J. High Perform. Comput. Appl. 31(6): 469-484 (2017) - [j269]Asim YarKhan
, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra:
Porting the PLASMA Numerical Library to the OpenMP Standard. Int. J. Parallel Program. 45(3): 612-633 (2017) - [j268]Sergey V. Kovalchuk, Tesfamariam M. Abuhay
, Ilkay Altintas, Michael L. Norman, Michael Harold Lees
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra, Peter M. A. Sloot:
Data through the Computational Lens. J. Comput. Sci. 20: 81-84 (2017) - [j267]Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
Fast Cholesky factorization on GPUs for batch and native modes in MAGMA. J. Comput. Sci. 20: 85-93 (2017) - [j266]Hartwig Anzt
, Mark Gates
, Jack J. Dongarra, Moritz Kreutzer, Gerhard Wellein
, Martin Koehler
:
Preconditioned Krylov solvers on GPUs. Parallel Comput. 68: 32-44 (2017) - [j265]Jakub Kurzak, Piotr Luszczek, Ichitaro Yamazaki, Yves Robert
, Jack J. Dongarra:
Design and Implementation of the PULSAR Programming System for Large Scale Computing. Supercomput. Front. Innov. 4(1): 4-26 (2017) - [c381]Ichitaro Yamazaki, Stanimire Tomov
, Jack J. Dongarra:
Sampling algorithms to update truncated SVD. IEEE BigData 2017: 817-826 - [c380]Piotr Luszczek, Jakub Kurzak, Ichitaro Yamazaki, David J. Keffer
, Jack J. Dongarra:
Scaling point set registration in 3D across thread counts on multicore and hardware accelerator platforms through autotuning for large scale analysis of scientific point clouds. IEEE BigData 2017: 2893-2902 - [c379]Jack J. Dongarra, Sven Hammarling, Nicholas J. Higham
, Samuel D. Relton
, Mawussi Zounon
:
Optimized Batched Linear Algebra for Modern Architectures. Euro-Par 2017: 511-522 - [c378]Azzam Haidar, Heike Jagode
, Asim YarKhan
, Phil Vaccaro, Stanimire Tomov
, Jack J. Dongarra:
Power-aware computing: Measurement, control, and performance analysis for Intel Xeon Phi. HPEC 2017: 1-7 - [c377]Azzam Haidar, Khairul Kabir, Diana Fayad, Stanimire Tomov
, Jack J. Dongarra:
Out of memory SVD solver for big data. HPEC 2017: 1-7 - [c376]Piotr Luszczek, Jakub Kurzak, Ichitaro Yamazaki, Jack J. Dongarra:
Towards numerical benchmark for half-precision floating point arithmetic. HPEC 2017: 1-5 - [c375]Petros Koumoutsakos
, Eleni N. Chatzi, Valeria V. Krzhizhanovskaya
, Michael Lees
, Jack J. Dongarra, Peter M. A. Sloot:
The Art of Computational Science, Bridging Gaps - Forming Alloys. Preface for ICCS 2017. ICCS 2017: 1-6 - [c374]Jack J. Dongarra, Sven Hammarling, Nicholas J. Higham
, Samuel D. Relton
, Pedro Valero-Lara
, Mawussi Zounon
:
The Design and Performance of Batched BLAS on Modern High-Performance Computing Systems. ICCS 2017: 495-504 - [c373]Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
Factorization and Inversion of a Million Matrices using GPUs: Challenges and Countermeasures. ICCS 2017: 606-615 - [c372]Tingxing Dong, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
Optimizing the SVD Bidiagonalization Process for a Batch of Small Matrices. ICCS 2017: 1008-1018 - [c371]Hartwig Anzt
, Jack J. Dongarra, Goran Flegar
, Enrique S. Quintana-Ortí
, Andrés E. Tomás:
Variable-Size Batched Gauss-Huard for Block-Jacobi Preconditioning. ICCS 2017: 1783-1792 - [c370]Hartwig Anzt
, Jack J. Dongarra, Goran Flegar
, Enrique S. Quintana-Ortí
:
Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning. ICPP 2017: 91-100 - [c369]Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
Novel HPC techniques to batch execution of many variable size BLAS computations on GPUs. ICS 2017: 5:1-5:10 - [c368]Jack J. Dongarra:
EduPar Keynote. IPDPS Workshops 2017: 314 - [c367]Mathieu Faverge, Julien Langou
, Yves Robert
, Jack J. Dongarra:
Bidiagonalization and R-Bidiagonalization: Parallel Tiled Algorithms, Critical Paths and Distributed-Memory Implementation. IPDPS 2017: 668-677 - [c366]Ichitaro Yamazaki, Mark Hoemmen, Piotr Luszczek, Jack J. Dongarra:
Improving Performance of GMRES by Reducing Communication and Pipelining Global Collectives. IPDPS Workshops 2017: 1118-1127 - [c365]Mark Gates
, Jakub Kurzak, Piotr Luszczek, Yu Pei, Jack J. Dongarra:
Autotuning Batch Cholesky Factorization in CUDA with Interleaved Layout of Matrices. IPDPS Workshops 2017: 1408-1417 - [c364]Hartwig Anzt, Jack J. Dongarra, Goran Flegar
, Enrique S. Quintana-Ortí:
Batched Gauss-Jordan Elimination for Block-Jacobi Preconditioner Generation on GPUs. PMAM@PPoPP 2017: 1-10 - [c363]Azzam Haidar, Ahmad Abdelfattah, Stanimire Tomov
, Jack J. Dongarra:
High-performance Cholesky factorization for GPU-only execution. GPGPU@PPoPP 2017: 42-52 - [c362]Hartwig Anzt
, Gary Collins, Jack J. Dongarra, Goran Flegar
, Enrique S. Quintana-Ortí:
Flexible batched sparse matrix-vector product on GPUs. ScalA@SC 2017: 3:1-3:8 - [c361]Reazul Hoque, Thomas Hérault
, George Bosilca, Jack J. Dongarra:
Dynamic task discovery in PaRSEC: a data-flow task-based runtime. ScalA@SC 2017: 6:1-6:8 - [c360]Azzam Haidar, Panruo Wu
, Stanimire Tomov
, Jack J. Dongarra:
Investigating half precision arithmetic to accelerate dense linear system solvers. ScalA@SC 2017: 10:1-10:8 - [c359]Khairul Kabir, Azzam Haidar, Stanimire Tomov
, Aurélien Bouteiller
, Jack J. Dongarra:
A Framework for Out of Memory SVD Algorithms. ISC 2017: 158-178 - [p15]Hartwig Anzt
, Jack J. Dongarra, Mark Gates
, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov
, Ichitaro Yamazaki:
Bringing High Performance Computing to Big Data Algorithms. Handbook of Big Data Technologies 2017: 777-806 - [e96]Petros Koumoutsakos, Michael Lees, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, Peter M. A. Sloot:
International Conference on Computational Science, ICCS 2017, 12-14 June 2017, Zurich, Switzerland. Procedia Computer Science 108, Elsevier 2017 [contents] - [e95]Vassil Alexandrov, Al Geist, Jack J. Dongarra:
Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, ScalA@SC 2017, Denver, CO, USA, November 13, 2017. ACM 2017, ISBN 978-1-4503-5125-6 [contents] - 2016
- [j264]Ahmad Abdelfattah, Hartwig Anzt
, Jack J. Dongarra, Mark Gates
, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov
, Ichitaro Yamazaki, Asim YarKhan
:
Linear algebra software for large-scale accelerated multicore computing. Acta Numer. 25: 1-160 (2016) - [j263]Ahmad Abdelfattah, Hatem Ltaief
, David E. Keyes
, Jack J. Dongarra:
Performance optimization of Sparse Matrix-Vector Multiplication for multi-component PDE-based applications using GPUs. Concurr. Comput. Pract. Exp. 28(12): 3447-3465 (2016) - [j262]Jack J. Dongarra, Michael A. Heroux, Piotr Luszczek:
High-performance conjugate-gradient benchmark: A new metric for ranking high-performance computing systems. Int. J. High Perform. Comput. Appl. 30(1): 3-10 (2016) - [j261]Hartwig Anzt
, Edmond Chow, Jens Saak
, Jack J. Dongarra:
Updating incomplete factorization preconditioners for model order reduction. Numer. Algorithms 73(3): 611-630 (2016) - [j260]Julien Herrmann
, George Bosilca
, Thomas Hérault
, Loris Marchal
, Yves Robert
, Jack J. Dongarra:
Assessing the cost of redistribution followed by a computational kernel: Complexity and performance results. Parallel Comput. 52: 22-41 (2016) - [j259]Ichitaro Yamazaki, Stanimire Tomov
, Jack J. Dongarra:
Stability and Performance of Various Singular Value QR Implementations on Multicore CPU with a GPU. ACM Trans. Math. Softw. 43(2): 10:1-10:18 (2016) - [j258]Jakub Kurzak, Hartwig Anzt
, Mark Gates
, Jack J. Dongarra:
Implementation and Tuning of Batched Cholesky Factorization and Solve for NVIDIA GPUs. IEEE Trans. Parallel Distributed Syst. 27(7): 2036-2048 (2016) - [c358]Ian Masliah, Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov
, Marc Baboulin, Joël Falcou, Jack J. Dongarra:
High-Performance Matrix-Matrix Multiplications of Very Small Matrices. Euro-Par 2016: 659-671 - [c357]Jack J. Dongarra:
With Extreme Scale Computing the Rules Have Changed. HPDC 2016: 123 - [c356]Wei Wu
, George Bosilca, Rolf Vandevaart, Sylvain Jeaugey, Jack J. Dongarra:
GPU-Aware Non-contiguous Data Movement In Open MPI. HPDC 2016: 231-242 - [c355]Azzam Haidar, Benjamin Brock, Stanimire Tomov
, Michael Guidry, Jay Jay Billings, Daniel Shyles, Jack J. Dongarra:
Performance analysis and acceleration of explicit integration for large kinetic networks using batched GPU computations. HPEC 2016: 1-7 - [c354]Azzam Haidar, Stanimire Tomov
, Konstantin Arturov, Murat Efe Guney, Shane Story, Jack J. Dongarra:
LU, QR, and Cholesky factorizations: Programming model, performance analysis and optimization techniques for the Intel Knights Landing Xeon Phi. HPEC 2016: 1-7 - [c353]Ilkay Altintas, Michael Normal, Michael Lees
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra, Peter M. A. Sloot:
Data through the Computational Lens, Preface for ICCS 2016. ICCS 2016: 1-7 - [c352]Ahmad Abdelfattah, Marc Baboulin, Veselin Dobrev
, Jack J. Dongarra, Christopher W. Earl, Joel Falcou, Azzam Haidar, Ian Karlin, Tzanio V. Kolev, Ian Masliah, Stanimire Tomov
:
High-Performance Tensor Contractions for GPUs. ICCS 2016: 108-118 - [c351]Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
Performance Tuning and Optimization Techniques of Fixed and Variable Size Batched Cholesky Factorization on GPUs. ICCS 2016: 119-130 - [c350]Jack J. Dongarra:
With Extreme Scale Computing the Rules Have Changed. ICMS 2016: 3-6 - [c349]Chris J. Newburn, Gaurav Bansal, Michael Wood, Luis Crivelli, Judit Planas
, Alejandro Duran, Paulo Souza, Leonardo Borges, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra, Hartwig Anzt
, Mark Gates
, Azzam Haidar, Yulu Jia, Khairul Kabir, Ichitaro Yamazaki, Jesús Labarta:
Heterogeneous Streaming. IPDPS Workshops 2016: 611-620 - [c348]Yulu Jia, Piotr Luszczek, Jack J. Dongarra:
Hessenberg Reduction with Transient Error Resilience on GPU-Based Hybrid Architectures. IPDPS Workshops 2016: 653-662 - [c347]Hartwig Anzt
, Jack J. Dongarra, Moritz Kreutzer, Gerhard Wellein, Martin Koehler
:
Efficiency of General Krylov Methods on GPUs - An Experimental Study. IPDPS Workshops 2016: 683-691 - [c346]Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures. IPDPS Workshops 2016: 1249-1258 - [c345]Piotr Luszczek, Mark Gates
, Jakub Kurzak, Anthony Danalis, Jack J. Dongarra:
Search Space Generation and Pruning System for Autotuners. IPDPS Workshops 2016: 1545-1554 - [c344]Yaohung M. Tsai, Piotr Luszczek, Jakub Kurzak, Jack J. Dongarra:
Performance-Portable Autotuning of OpenCL Kernels for Convolutional Layers of Deep Neural Networks. MLHPC@SC 2016: 9-18 - [c343]M. Graham Lopez
, Verónica G. Vergara Larrea
, Wayne Joubert, Oscar R. Hernandez, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
Towards Achieving Performance Portability Using Directives for Accelerators. WACCPD@SC 2016: 13-24 - [c342]Hartwig Anzt
, Edmond Chow, Thomas Huckle, Jack J. Dongarra:
Batched Generation of Incomplete Sparse Approximate Inverses on GPUs. ScalA@SC 2016: 49-56 - [c341]George Bosilca, Aurélien Bouteiller
, Amina Guermouche, Thomas Hérault
, Yves Robert
, Pierre Sens, Jack J. Dongarra:
Failure detection and propagation in HPC systems. SC 2016: 312-322 - [c340]Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
Performance, Design, and Autotuning of Batched GEMM for GPUs. ISC 2016: 21-38 - [c339]Joseph Dorris, Jakub Kurzak, Piotr Luszczek, Asim YarKhan
, Jack J. Dongarra:
Task-Based Cholesky Decomposition on Knights Corner Using OpenMP. ISC Workshops 2016: 544-562 - [c338]Hartwig Anzt
, Marc Baboulin, Jack J. Dongarra, Yvan Fournier, Frank Hülsemann, Amal Khabou, Yushan Wang:
Accelerating the Conjugate Gradient Algorithm with GPUs in CFD Simulations. VECPAR 2016: 35-43 - [p14]Hartwig Anzt
, Edmond Chow, Daniel B. Szyld, Jack J. Dongarra:
Domain Overlap for Iterative Sparse Triangular Solves on GPUs. Software for Exascale Computing 2016: 527-545 - [e94]Roman Wyrzykowski
, Ewa Deelman, Jack J. Dongarra, Konrad Karczewski, Jacek Kitowski
, Kazimierz Wiatr:
Parallel Processing and Applied Mathematics - 11th International Conference, PPAM 2015, Krakow, Poland, September 6-9, 2015. Revised Selected Papers, Part I. Lecture Notes in Computer Science 9573, Springer 2016, ISBN 978-3-319-32148-6 [contents] - [e93]Roman Wyrzykowski
, Ewa Deelman, Jack J. Dongarra, Konrad Karczewski, Jacek Kitowski
, Kazimierz Wiatr:
Parallel Processing and Applied Mathematics - 11th International Conference, PPAM 2015, Krakow, Poland, September 6-9, 2015. Revised Selected Papers, Part II. Lecture Notes in Computer Science 9574, Springer 2016, ISBN 978-3-319-32151-6 [contents] - [e92]Jack J. Dongarra, Daniel J. Holmes, Antonia B. K. Collis, Jesper Larsson Träff, Lorna Smith:
Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, Edinburgh, United Kingdom, September 25-28, 2016. ACM 2016, ISBN 978-1-4503-4234-6 [contents] - [e91]Julian M. Kunkel, Pavan Balaji, Jack J. Dongarra:
High Performance Computing - 31st International Conference, ISC High Performance 2016, Frankfurt, Germany, June 19-23, 2016, Proceedings. Lecture Notes in Computer Science 9697, Springer 2016, ISBN 978-3-319-41320-4 [contents] - [i16]Mathieu Faverge, Julien Langou, Yves Robert, Jack J. Dongarra:
Bidiagonalization with Parallel Tiled Algorithms. CoRR abs/1611.06892 (2016) - 2015
- [j257]Daniel A. Reed, Jack J. Dongarra:
Exascale computing and big data. Commun. ACM 58(7): 56-68 (2015) - [j256]Erich Strohmaier, Hans Werner Meuer, Jack J. Dongarra, Horst D. Simon
:
The TOP500 List and Progress in High-Performance Computing. Computer 48(11): 42-49 (2015) - [j255]Simplice Donfack, Jack J. Dongarra, Mathieu Faverge, Mark Gates
, Jakub Kurzak, Piotr Luszczek, Ichitaro Yamazaki:
A survey of recent developments in parallel implementations of Gaussian elimination. Concurr. Comput. Pract. Exp. 27(5): 1292-1309 (2015) - [j254]Fengguang Song, Jack J. Dongarra:
A scalable approach to solving dense linear algebra problems on hybrid CPU-GPU systems. Concurr. Comput. Pract. Exp. 27(14): 3702-3723 (2015) - [j253]Hartwig Anzt
, Blake Haugen, Jakub Kurzak, Piotr Luszczek
, Jack J. Dongarra:
Experiences in autotuning matrix multiplication for energy minimization on GPUs. Concurr. Comput. Pract. Exp. 27(17): 5096-5113 (2015) - [j252]Hartwig Anzt
, Stanimire Tomov
, Piotr Luszczek, William B. Sawyer, Jack J. Dongarra:
Acceleration of GPU-based Krylov solvers via data transfer reduction. Int. J. High Perform. Comput. Appl. 29(3): 366-383 (2015) - [j251]George Bosilca, Aurélien Bouteiller, Thomas Hérault
, Yves Robert, Jack J. Dongarra:
Composing resilience techniques: ABFT, periodic and incremental checkpointing. Int. J. Netw. Comput. 5(1): 2-25 (2015) - [j250]Mathieu Faverge, Julien Herrmann
, Julien Langou
, Bradley R. Lowery, Yves Robert
, Jack J. Dongarra:
Mixing LU and QR factorization algorithms to design high-performance dense linear algebra solvers. J. Parallel Distributed Comput. 85: 32-46 (2015) - [j249]Jack J. Dongarra, Bernard Tourancheau:
Guest Editors' Note: Special Issue on Clusters, Clouds and Data for Scientific Computing. Parallel Process. Lett. 25(3): 1502002:1-1502002:2 (2015) - [j248]Ichitaro Yamazaki, Stanimire Tomov
, Jack J. Dongarra:
Mixed-Precision Cholesky QR Factorization and Its Case Studies on Multicore CPU with Multiple GPUs. SIAM J. Sci. Comput. 37(3) (2015) - [j247]Ichitaro Yamazaki, Stanimire Tomov
, Jack J. Dongarra:
Computing Low-Rank Approximation of a Dense Matrix on Multicore CPUs with a GPU and Its Application to Solving a Hierarchically Semiseparable Linear System of Equations. Sci. Program. 2015: 246019:1-246019:17 (2015) - [j246]Jack J. Dongarra, Mark Gates
, Azzam Haidar, Yulu Jia, Khairul Kabir, Piotr Luszczek, Stanimire Tomov
:
HPC Programming on Intel Many-Integrated-Core Hardware with MAGMA Port to Xeon Phi. Sci. Program. 2015: 502593:1-502593:11 (2015) - [j245]Vladimir V. Voevodin
, Alexander S. Antonov
, Jack J. Dongarra:
AlgoWiki: an Open Encyclopedia of Parallel Algorithmic Features. Supercomput. Front. Innov. 2(1): 4-18 (2015) - [j244]Jack J. Dongarra, Maksims Abalenkovs, Ahmad Abdelfattah, Mark Gates
, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov
, Ichitaro Yamazaki, Asim YarKhan
:
Parallel Programming Models for Dense Linear Algebra on Heterogeneous Systems. Supercomput. Front. Innov. 2(4): 67-86 (2015) - [j243]Aurélien Bouteiller
, Thomas Hérault
, George Bosilca, Peng Du, Jack J. Dongarra:
Algorithm-Based Fault Tolerance for Dense Matrix Factorizations, Multiple Failures and Accuracy. ACM Trans. Parallel Comput. 1(2): 10:1-10:28 (2015) - [c337]Mark Gates
, Hartwig Anzt
, Jakub Kurzak, Jack J. Dongarra:
Accelerating collaborative filtering using concepts from high performance computing. IEEE BigData 2015: 667-676 - [c336]Anthony Danalis, Heike Jagode
, George Bosilca, Jack J. Dongarra:
PaRSEC in Practice: Optimizing a Legacy Chemistry Application through Distributed Task-Based Execution. CLUSTER 2015: 304-313 - [c335]Hartwig Anzt
, Edmond Chow, Jack J. Dongarra:
Iterative Sparse Triangular Solves for Preconditioning. Euro-Par 2015: 650-661 - [c334]Azzam Haidar, Asim YarKhan
, Chongxiao Cao, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra:
Flexible Linear Algebra Development and Scheduling with Cholesky Factorization. HPCC/CSS/ICESS 2015: 861-864 - [c333]Azzam Haidar, Stanimire Tomov
, Piotr Luszczek, Jack J. Dongarra:
MAGMA embedded: Towards a dense linear algebra library for energy efficient extreme computing. HPEC 2015: 1-6 - [c332]Khairul Kabir, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
Performance Analysis and Optimisation of Two-sided Factorization Algorithms for Heterogeneous Platform. ICCS 2015: 180-190 - [c331]Wei Wu
, Aurélien Bouteiller
, George Bosilca, Mathieu Faverge, Jack J. Dongarra:
Hierarchical DAG Scheduling for Hybrid Distributed Systems. IPDPS 2015: 156-165 - [c330]Chongxiao Cao, Thomas Hérault
, George Bosilca, Jack J. Dongarra:
Design for a Soft Error Resilient Dynamic Task-Based Runtime. IPDPS 2015: 765-774 - [c329]Marc Baboulin, Jack J. Dongarra, Adrien Rémy, Stanimire Tomov
, Ichitaro Yamazaki:
Dense Symmetric Indefinite Factorization on GPU Accelerated Architectures. PPAM (1) 2015: 86-95 - [c328]Heike Jagode
, Anthony Danalis, George Bosilca, Jack J. Dongarra:
Accelerating NWChem Coupled Cluster Through Dataflow-Based Execution. PPAM (1) 2015: 366-376 - [c327]Hartwig Anzt, Stanimire Tomov, Jack J. Dongarra:
Energy efficiency and performance frontiers for sparse computations on GPU supercomputers. PMAM@PPoPP 2015: 1-10 - [c326]Azzam Haidar, Tingxing Dong, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra:
Optimization for performance and energy for batched matrix computations on GPUs. GPGPU@PPoPP 2015: 59-69 - [c325]Azzam Haidar, Tingxing Dong, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra:
Towards batched linear solvers on accelerated hardware platforms. PPoPP 2015: 261-262 - [c324]Aurélien Bouteiller
, George Bosilca, Jack J. Dongarra:
Plan B: Interruption of Ongoing MPI Operations to Support Failure Recovery. EuroMPI 2015: 11:1-11:9 - [c323]Hrachya V. Astsatryan, Vladimir Sahakyan, Yuri Shoukourian
, Jack J. Dongarra, Pierre-Henri Cros, Michel J. Daydé, Per Öster
:
Strengthening compute and data intensive capacities of Armenia. RoEduNet 2015: 28-33 - [c322]Hartwig Anzt
, Jack J. Dongarra, Enrique S. Quintana-Ortí
:
Tuning stationary iterative solvers for fault resilience. ScalA@SC 2015: 1:1-1:8 - [c321]Hartwig Anzt
, Jack J. Dongarra, Enrique S. Quintana-Ortí
:
Adaptive precision solvers for sparse linear systems. E2SC@SC 2015: 2:1-2:10 - [c320]Blake Haugen, Stephen Richmond, Jakub Kurzak, Chad A. Steed
, Jack J. Dongarra:
Visualizing execution traces with task dependencies. VPA@SC 2015: 2:1-2:8 - [c319]Ichitaro Yamazaki, Stanimire Tomov
, Jakub Kurzak, Jack J. Dongarra, Jesse L. Barlow:
Mixed-precision block gram Schmidt orthogonalization. ScalA@SC 2015: 2:1-2:8 - [c318]Hartwig Anzt
, Eduardo Ponce, Gregory D. Peterson, Jack J. Dongarra:
GPU-accelerated co-design of induced dimension reduction: algorithmic fusion and kernel overlap. Co-HPC@SC 2015: 5:1-5:8 - [c317]Azzam Haidar, Yulu Jia, Piotr Luszczek, Stanimire Tomov
, Asim YarKhan
, Jack J. Dongarra:
Weighted dynamic scheduling with many parallelism grains for offloading of numerical workloads to multiple varied accelerators. ScalA@SC 2015: 5:1-5:8 - [c316]Raffaele Solcà
, Anton Kozhevnikov
, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra, Thomas C. Schulthess:
Efficient implementation of quantum materials simulations on distributed CPU-GPU systems. SC 2015: 10:1-10:12 - [c315]Thomas Hérault
, Aurélien Bouteiller
, George Bosilca, Marc Gamell, Keita Teranishi, Manish Parashar, Jack J. Dongarra:
Practical scalable consensus for pseudo-synchronous distributed systems. SC 2015: 31:1-31:12 - [c314]Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra:
Randomized algorithms to update partial singular value decomposition on a hybrid CPU/GPU cluster. SC 2015: 59:1-59:12 - [c313]Théo Mary
, Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra:
Performance of random sampling for computing low-rank approximations of a dense matrix on GPUs. SC 2015: 60:1-60:11 - [c312]Hartwig Anzt, Stanimire Tomov, Jack J. Dongarra:
Accelerating the LOBPCG method on GPUs using a blocked sparse matrix vector product. SpringSim (HPS) 2015: 75-82 - [c311]Khairul Kabir, Azzam Haidar, Stanimire Tomov, Jack J. Dongarra:
Performance analysis and design of a hessenberg reduction using stabilized blocked elementary transformations for new architectures. SpringSim (HPS) 2015: 135-142 - [c310]Edmond Chow, Hartwig Anzt
, Jack J. Dongarra:
Asynchronous Iterative Algorithm for Computing Incomplete Factorizations on GPUs. ISC 2015: 1-16 - [c309]Azzam Haidar, Tingxing Tim Dong, Stanimire Tomov
, Piotr Luszczek, Jack J. Dongarra:
A Framework for Batched and GPU-Resident Factorization Algorithms Applied to Block Householder Transformations. ISC 2015: 31-47 - [c308]Khairul Kabir, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
On the Design, Development, and Analysis of Optimized Matrix-Vector Multiplication Routines for Coprocessors. ISC 2015: 58-73 - [e90]Slawomir Koziel, Leifur Þ. Leifsson, Michael Lees, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, Peter M. A. Sloot:
Proceedings of the International Conference on Computational Science, ICCS 2015, Computational Science at the Gates of Nature, Reykjavík, Iceland, 1-3 June, 2015, 2014. Procedia Computer Science 51, Elsevier 2015 [contents] - [e89]Jack J. Dongarra, Alexandre Denis, Brice Goglin, Emmanuel Jeannot, Guillaume Mercier:
Proceedings of the 22nd European MPI Users' Group Meeting, EuroMPI 2015, Bordeaux, France, September 21-23, 2015. ACM 2015, ISBN 978-1-4503-3795-3 [contents] - [e88]Vassil Alexandrov, Al Geist, Jack J. Dongarra:
Proceedings of the 6th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, ScalA@SC 2015, Austin, Texas, USA, November 15, 2015. ACM 2015, ISBN 978-1-4503-4011-3 [contents] - 2014
- [j242]Anthony Danalis, Piotr Luszczek, Gabriel Marin, Jeffrey S. Vetter, Jack J. Dongarra:
BlackjackBench: Portable Hardware Characterization with Automated Results' Analysis. Comput. J. 57(7): 1002-1016 (2014) - [j241]Jack J. Dongarra, Mathieu Faverge, Hatem Ltaief
, Piotr Luszczek:
Achieving numerical accuracy and high performance using recursive tile LU factorization with partial pivoting. Concurr. Comput. Pract. Exp. 26(7): 1408-1431 (2014) - [j240]Ichitaro Yamazaki, Tingxing Dong, Raffaele Solcà
, Stanimire Tomov
, Jack J. Dongarra, Thomas C. Schulthess:
Tridiagonalization of a dense symmetric matrix on multiple GPUs and its application to symmetric eigenvalue problems. Concurr. Comput. Pract. Exp. 26(16): 2652-2666 (2014) - [j239]George Bosilca, Aurélien Bouteiller
, Elisabeth Brunet, Franck Cappello, Jack J. Dongarra, Amina Guermouche, Thomas Hérault
, Yves Robert
, Frédéric Vivien
, Dounia Zaidouni:
Unified model for assessing checkpointing protocols at extreme-scale. Concurr. Comput. Pract. Exp. 26(17): 2772-2791 (2014) - [j238]George Bosilca, Hatem Ltaief
, Jack J. Dongarra:
Power profiling of Cholesky and QR factorizations on distributed memory systems. Comput. Sci. Res. Dev. 29(2): 139-147 (2014) - [j237]Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra, Raffaele Solcà
, Thomas C. Schulthess:
A novel hybrid CPU-GPU generalized eigensolver for electronic structure calculations based on fine-grained memory aware tasks. Int. J. High Perform. Comput. Appl. 28(2): 196-209 (2014) - [j236]Jack J. Dongarra, Thomas Hérault
, Yves Robert:
Performance and reliability trade-offs for the double checkpointing algorithm. Int. J. Netw. Comput. 4(1): 23-41 (2014) - [j235]Piotr Luszczek, Jakub Kurzak, Jack J. Dongarra:
Looking back at dense linear algebra software. J. Parallel Distributed Comput. 74(7): 2548-2560 (2014) - [j234]Marc Baboulin, Dulceneia Becker, George Bosilca, Anthony Danalis, Jack J. Dongarra:
An efficient distributed randomized algorithm for solving large dense symmetric indefinite linear systems. Parallel Comput. 40(7): 213-223 (2014) - [j233]Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra:
Design and Implementation of a Large Scale Tree-Based QR Decomposition Using a 3D Virtual Systolic Array and a Lightweight Runtime. Parallel Process. Lett. 24(4) (2014) - [j232]Grey Ballard
, Dulceneia Becker, James Demmel, Jack J. Dongarra, Alex Druinsky, Inon Peled, Oded Schwartz, Sivan Toledo, Ichitaro Yamazaki:
Communication-Avoiding Symmetric-Indefinite Factorization. SIAM J. Matrix Anal. Appl. 35(4): 1364-1406 (2014) - [j231]Jack J. Dongarra, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov
, Asim YarKhan
:
Model-Driven One-Sided Factorizations on Multicore Accelerated Systems. Supercomput. Front. Innov. 1(1): 85-115 (2014) - [c307]Ichitaro Yamazaki, Théo Mary
, Jakub Kurzak, Stanimire Tomov
, Jack J. Dongarra:
Access-averse framework for computing low-rank matrix approximations. IEEE BigData 2014: 70-77 - [c306]Heike McCraw, Anthony Danalis, Thomas Hérault
, George Bosilca, Jack J. Dongarra, Karol Kowalski, Theresa L. Windus
:
Utilizing dataflow-based execution for coupled cluster methods. CLUSTER 2014: 296-297 - [c305]Heike McCraw, James Ralph, Anthony Danalis, Jack J. Dongarra:
Power monitoring with PAPI for extreme scale architectures and dataflow-based programming models. CLUSTER 2014: 385-391 - [c304]Tingxing Dong, Azzam Haidar, Piotr Luszczek, James Austin Harris
, Stanimire Tomov
, Jack J. Dongarra:
LU Factorization of Small Matrices: Accelerating Batched DGETRF on the GPU. HPCC/CSS/ICESS 2014: 157-160 - [c303]David Abramson
, Michael Lees
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra, Peter M. A. Sloot:
Big Data Meets Computational Science, Preface for ICCS 2014. ICCS 2014: 1-7 - [c302]Blake Haugen, Jakub Kurzak, Asim YarKhan
, Piotr Luszczek, Jack J. Dongarra:
Parallel Simulation of Superscalar Scheduling. ICPP 2014: 121-130 - [c301]Tingxing Dong, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
A Fast Batched Cholesky Factorization on a GPU. ICPP 2014: 432-440 - [c300]Fengguang Song, Jack J. Dongarra:
Scaling up matrix computations on shared-memory manycore systems with 1000 CPU cores. ICS 2014: 333-342 - [c299]Dimitar Lukarski, Hartwig Anzt
, Stanimire Tomov
, Jack J. Dongarra:
Hybrid Multi-elimination ILU Preconditioners on GPUs. IPDPS Workshops 2014: 7-16 - [c298]Ichitaro Yamazaki, Hartwig Anzt
, Stanimire Tomov
, Mark Hoemmen, Jack J. Dongarra:
Improving the Performance of CA-GMRES on Multicores with Multiple GPUs. IPDPS 2014: 382-391 - [c297]Azzam Haidar, Chongxiao Cao, Asim YarKhan
, Piotr Luszczek, Stanimire Tomov
, Khairul Kabir, Jack J. Dongarra:
Unified Development for Mixed Multi-GPU and Multi-coprocessor Environments Using a Lightweight Runtime Environment. IPDPS 2014: 491-500 - [c296]George Bosilca, Aurélien Bouteiller
, Thomas Hérault
, Yves Robert
, Jack J. Dongarra:
Assessing the Impact of ABFT and Checkpoint Composite Strategies. IPDPS Workshops 2014: 679-688 - [c295]Hartwig Anzt
, William B. Sawyer, Stanimire Tomov
, Piotr Luszczek, Ichitaro Yamazaki, Jack J. Dongarra:
Optimizing Krylov Subspace Solvers on Graphics Processing Units. IPDPS Workshops 2014: 941-949 - [c294]Simplice Donfack, Stanimire Tomov
, Jack J. Dongarra:
Dynamically Balanced Synchronization-Avoiding LU Factorization with Multicore and GPUs. IPDPS Workshops 2014: 958-965 - [c293]Tingxing Dong, Veselin Dobrev
, Tzanio V. Kolev
, Robert N. Rieben, Stanimire Tomov
, Jack J. Dongarra:
A Step towards Energy Efficient Computing: Redesigning a Hydrodynamic Application on CPU-GPU. IPDPS 2014: 972-981 - [c292]Mathieu Faverge, Julien Herrmann, Julien Langou, Bradley R. Lowery, Yves Robert
, Jack J. Dongarra:
Designing LU-QR Hybrid Solvers for Performance and Stability. IPDPS 2014: 1029-1038 - [c291]Azzam Haidar, Piotr Luszczek, Jack J. Dongarra:
New Algorithm for Computing Eigenvectors of the Symmetric Eigenvalue Problem. IPDPS Workshops 2014: 1150-1159 - [c290]Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra:
Design and Implementation of a Large Scale Tree-Based QR Decomposition Using a 3D Virtual Systolic Array and a Lightweight Runtime. IPDPS Workshops 2014: 1495-1504 - [c289]Gabriel Marin, Jack J. Dongarra, Daniel Terpstra:
MIAMI: A framework for application performance diagnosis. ISPASS 2014: 158-168 - [c288]Chongxiao Cao, Jack J. Dongarra, Peng Du, Mark Gates
, Piotr Luszczek, Stanimire Tomov
:
clMAGMA: high performance dense linear algebra with OpenCL. IWOCL 2014: 1:1-1:9 - [c287]Anthony Danalis, George Bosilca, Aurélien Bouteiller
, Thomas Hérault
, Jack J. Dongarra:
PTG: an abstraction for unhindered parallelism. WOLFHPC@SC 2014: 21-30 - [c286]Ichitaro Yamazaki, Stanimire Tomov
, Jack J. Dongarra:
Deflation strategies to improve the convergence of communication-avoiding GMRES. ScalA@SC 2014: 39-46 - [c285]Chongxiao Cao, Mark Gates
, Azzam Haidar, Piotr Luszczek, Stanimire Tomov
, Ichitaro Yamazaki, Jack J. Dongarra:
Performance and portability with OpenCL for throughput-oriented HPC workloads across accelerators, coprocessors, and multicore processors. ScalA@SC 2014: 61-68 - [c284]Ichitaro Yamazaki, Stanimire Tomov, Tingxing Dong, Jack J. Dongarra:
Mixed-Precision Orthogonalization Scheme and Adaptive Step Size for Improving the Stability and Performance of CA-GMRES on GPUs. VECPAR 2014: 17-30 - [c283]Azzam Haidar, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra:
Heterogenous Acceleration for Linear Algebra in Multi-coprocessor Environments. VECPAR 2014: 31-42 - [c282]Hartwig Anzt
, Dimitar Lukarski, Stanimire Tomov
, Jack J. Dongarra:
Self-adaptive Multiprecision Preconditioners on Multicore and Manycore Architectures. VECPAR 2014: 115-123 - [c281]Mark Gates
, Azzam Haidar, Jack J. Dongarra:
Accelerating Computation of Eigenvectors in the Dense Nonsymmetric Eigenvalue Problem. VECPAR 2014: 182-191 - [p13]Jack J. Dongarra, Mark Gates
, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov
, Ichitaro Yamazaki:
Accelerating Numerical Dense Linear Algebra Calculations with GPUs. Numerical Computations with GPUs 2014: 3-28 - [e87]David Abramson, Michael Lees, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, Peter M. A. Sloot:
Proceedings of the International Conference on Computational Science, ICCS 2014, Cairns, Queensland, Australia, 10-12 June, 2014. Procedia Computer Science 29, Elsevier 2014 [contents] - [e86]Roman Wyrzykowski, Jack J. Dongarra, Konrad Karczewski, Jerzy Wasniewski:
Parallel Processing and Applied Mathematics - 10th International Conference, PPAM 2013, Warsaw, Poland, September 8-11, 2013, Revised Selected Papers, Part I. Lecture Notes in Computer Science 8384, Springer 2014, ISBN 978-3-642-55223-6 [contents] - [e85]Roman Wyrzykowski, Jack J. Dongarra, Konrad Karczewski, Jerzy Wasniewski:
Parallel Processing and Applied Mathematics - 10th International Conference, PPAM 2013, Warsaw, Poland, September 8-11, 2013, Revised Selected Papers, Part II. Lecture Notes in Computer Science 8385, Springer 2014, ISBN 978-3-642-55194-9 [contents] - [e84]Jack J. Dongarra, Yutaka Ishikawa, Atsushi Hori:
21st European MPI Users' Group Meeting, EuroMPI/ASIA '14, Kyoto, Japan - September 09 - 12, 2014. ACM 2014, ISBN 978-1-4503-2875-3 [contents] - [e83]Trish Damkroger, Jack J. Dongarra:
International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2014, New Orleans, LA, USA, November 16-21, 2014. IEEE Computer Society 2014, ISBN 978-1-4799-5500-8 [contents] - [i15]Mathieu Faverge, Julien Herrmann, Julien Langou, Bradley R. Lowery, Yves Robert, Jack J. Dongarra:
Designing LU-QR hybrid solvers for performance and stability. CoRR abs/1401.5522 (2014) - [i14]Tim Mattson, David A. Bader, Jonathan W. Berry, Aydin Buluç, Jack J. Dongarra, Christos Faloutsos, John Feo, John R. Gilbert, Joseph Gonzalez, Bruce Hendrickson, Jeremy Kepner, Charles E. Leiserson, Andrew Lumsdaine, David A. Padua, Stephen W. Poole, Steven P. Reinhardt, Mike Stonebraker, Steve Wallach, Andrew Yoo:
Standards for Graph Algorithm Primitives. CoRR abs/1408.0393 (2014) - 2013
- [j230]Wesley Bland, Aurélien Bouteiller
, Thomas Hérault
, Joshua Hursey, George Bosilca, Jack J. Dongarra:
An evaluation of User-Level Failure Mitigation support in MPI. Computing 95(12): 1171-1184 (2013) - [j229]Aurélien Bouteiller
, Thomas Hérault
, George Bosilca, Jack J. Dongarra:
Correlated set coordination in fault tolerant message logging protocols for many-core clusters. Concurr. Comput. Pract. Exp. 25(4): 572-585 (2013) - [j228]Wesley Bland, Peng Du, Aurélien Bouteiller
, Thomas Hérault
, George Bosilca, Jack J. Dongarra:
Extending the scope of the Checkpoint-on-Failure protocol for forward recovery in standard MPI. Concurr. Comput. Pract. Exp. 25(17): 2381-2393 (2013) - [j227]George Bosilca, Aurélien Bouteiller
, Anthony Danalis, Mathieu Faverge, Thomas Hérault
, Jack J. Dongarra:
PaRSEC: Exploiting Heterogeneity to Enhance Scalability. Comput. Sci. Eng. 15(6): 36-45 (2013) - [j226]Jack J. Dongarra, Bernard Tourancheau:
Introduction for August Special Issue CCDSC. Int. J. High Perform. Comput. Appl. 27(3): 231 (2013) - [j225]Wesley Bland, Aurélien Bouteiller
, Thomas Hérault
, George Bosilca, Jack J. Dongarra:
Post-failure recovery of MPI communication capability: Design and rationale. Int. J. High Perform. Comput. Appl. 27(3): 244-254 (2013) - [j224]Peng Du, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra:
Soft error resilient QR factorization for hybrid system with GPGPU. J. Comput. Sci. 4(6): 457-464 (2013) - [j223]Teng Ma, George Bosilca, Aurélien Bouteiller
, Jack J. Dongarra:
Kernel-assisted and topology-aware MPI collective communications on multicore/many-core platforms. J. Parallel Distributed Comput. 73(7): 1000-1010 (2013) - [j222]Hartwig Anzt
, Stanimire Tomov
, Jack J. Dongarra, Vincent Heuveline:
A block-asynchronous relaxation method for graphics processing units. J. Parallel Distributed Comput. 73(12): 1613-1626 (2013) - [j221]Jack J. Dongarra, Mathieu Faverge, Thomas Hérault
, Mathias Jacquelin
, Julien Langou
, Yves Robert
:
Hierarchical QR factorization algorithms for multi-core clusters. Parallel Comput. 39(4-5): 212-232 (2013) - [j220]Jack J. Dongarra, Bernard Tourancheau:
Guest Editors' Note: Special Issue on Clusters, Clouds, and Data for Scientific Computing. Parallel Process. Lett. 23(2) (2013) - [j219]Yinan Li, Asim YarKhan
, Jack J. Dongarra, Keith Seymour, Aurélie Hurault:
Enabling workflows in GridSolve: request sequencing and service trading. J. Supercomput. 64(3): 1133-1152 (2013) - [j218]Marc Baboulin, Jack J. Dongarra, Julien Herrmann, Stanimire Tomov
:
Accelerating Linear System Solutions Using Randomization Techniques. ACM Trans. Math. Softw. 39(2): 8:1-8:13 (2013) - [j217]Fred G. Gustavson, Jerzy Wasniewski, Jack J. Dongarra, José R. Herrero
, Julien Langou:
Level-3 Cholesky Factorization Routines Improve Performance of Many Cholesky Algorithms. ACM Trans. Math. Softw. 39(2): 9:1-9:10 (2013) - [j216]Hatem Ltaief
, Piotr Luszczek, Jack J. Dongarra:
High-performance bidiagonal reduction using tile algorithms on homogeneous multicore architectures. ACM Trans. Math. Softw. 39(3): 16:1-16:22 (2013) - [j215]Jakub Kurzak, Piotr Luszczek, Mathieu Faverge, Jack J. Dongarra:
LU Factorization with Partial Pivoting for a Multicore System with Accelerators. IEEE Trans. Parallel Distributed Syst. 24(8): 1613-1621 (2013) - [c280]Aurélien Bouteiller
, Franck Cappello, Jack J. Dongarra, Amina Guermouche, Thomas Hérault
, Yves Robert:
Multi-criteria Checkpointing Strategies: Response-Time versus Resource Utilization. Euro-Par 2013: 420-431 - [c279]Guillaume Aupy, Mathieu Faverge, Yves Robert
, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra:
Implementing a Systolic Algorithm for QR Factorization on Multicore Clusters with PaRSEC. Euro-Par Workshops 2013: 657-667 - [c278]Tim Mattson, David A. Bader, Jonathan W. Berry, Aydin Buluç, Jack J. Dongarra, Christos Faloutsos
, John Feo, John R. Gilbert, Joseph Gonzalez, Bruce Hendrickson, Jeremy Kepner, Charles E. Leiserson, Andrew Lumsdaine, David A. Padua, Stephen Poole, Steven P. Reinhardt, Mike Stonebraker, Steve Wallach, Andrew Yoo:
Standards for graph algorithm primitives. HPEC 2013: 1-2 - [c277]Vassil Alexandrov
, Michael Lees
, Valeria V. Krzhizhanovskaya
, Jack J. Dongarra, Peter M. A. Sloot:
Computation at the Frontiers of Science, preface for ICCS 2013. ICCS 2013: 1-9 - [c276]Yushan Wang, Marc Baboulin, Jack J. Dongarra, Joël Falcou, Yann Fraigneau, Olivier P. Le Maître
:
A Parallel Solver for Incompressible Fluid Flows. ICCS 2013: 439-448 - [c275]Azzam Haidar, Mark Gates
, Stanimire Tomov
, Jack J. Dongarra:
Toward a scalable multi-GPU eigensolver via compute-intensive kernels and efficient communication. ICS 2013: 223-232 - [c274]Volodymyr Turchenko
, George Bosilca, Aurélien Bouteiller
, Jack J. Dongarra:
Efficient parallelization of batch pattern training algorithm on many-core and cluster architectures. IDAACS 2013: 692-698 - [c273]Jack J. Dongarra:
HCW 2013 Keynote Talk. IPDPS Workshops 2013: 6 - [c272]Jakub Kurzak, Piotr Luszczek, Mark Gates
, Ichitaro Yamazaki, Jack J. Dongarra:
Virtual Systolic Array for QR Decomposition. IPDPS 2013: 251-260 - [c271]Jack J. Dongarra, Thomas Hérault
, Yves Robert
:
Revisiting the Double Checkpointing Algorithm. IPDPS Workshops 2013: 706-715 - [c270]Grey Ballard
, Dulceneia Becker, James Demmel, Jack J. Dongarra, Alex Druinsky, Inon Peled, Oded Schwartz, Sivan Toledo, Ichitaro Yamazaki:
Implementing a Blocked Aasen's Algorithm with a Dynamic Scheduler on Multicore Architectures. IPDPS 2013: 895-907 - [c269]Ichitaro Yamazaki, Tingxing Dong, Stanimire Tomov
, Jack J. Dongarra:
Tridiagonalization of a Symmetric Dense Matrix on a GPU Cluster. IPDPS Workshops 2013: 1070-1079 - [c268]Jack J. Dongarra, Mark Gates
, Azzam Haidar, Yulu Jia, Khairul Kabir, Piotr Luszczek, Stanimire Tomov
:
Portable HPC Programming on Intel Many-Integrated-Core Hardware with MAGMA Port to Xeon Phi. PPAM (1) 2013: 571-581 - [c267]Yulu Jia, Piotr Luszczek, George Bosilca, Jack J. Dongarra:
CPU-GPU hybrid bidiagonal reduction with soft error resilience. ScalA@SC 2013: 2:1-2:5 - [c266]Yulu Jia, George Bosilca, Piotr Luszczek, Jack J. Dongarra:
Parallel reduction to hessenberg form with algorithm-based fault tolerance. SC 2013: 88:1-88:11 - [c265]Guillaume Aupy, Anne Benoit
, Thomas Hérault
, Yves Robert
, Jack J. Dongarra:
Optimal Checkpointing Period: Time vs. Energy. PMBS@SC 2013: 203-214 - [c264]Azzam Haidar, Raffaele Solcà
, Mark Gates
, Stanimire Tomov
, Thomas C. Schulthess, Jack J. Dongarra:
Leading Edge Hybrid Multi-GPU Algorithms for Generalized Eigenproblems in Electronic Structure Calculations. ISC 2013: 67-80 - [c263]Heike McCraw, Daniel Terpstra, Jack J. Dongarra, Kris Davis, Roy G. Musselman:
Beyond the CPU: Hardware Performance Counter Monitoring on Blue Gene/Q. ISC 2013: 213-225 - [e82]Erik H. D'Hollander, Jack J. Dongarra, Ian T. Foster, Lucio Grandinetti, Gerhard R. Joubert:
Transition of HPC Towards Exascale Computing - Selected Papers from the High Performance Computing Workshop, Cetraro, Italy, June 25-29, 2012. Advances in Parallel Computing 24, IOS Press 2013, ISBN 978-1-61499-323-0 [contents] - [e81]Vassil Alexandrov, Michael Lees, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, Peter M. A. Sloot:
Proceedings of the International Conference on Computational Science, ICCS 2013, Barcelona, Spain, 5-7 June, 2013. Procedia Computer Science 18, Elsevier 2013 [contents] - [e80]Jack J. Dongarra, Javier García Blas, Jesús Carretero:
20th European MPI Users's Group Meeting, EuroMPI '13, Madrid, Spain - September 15 - 18, 2013. ACM 2013, ISBN 978-1-4503-1903-4 [contents] - [i13]Guillaume Aupy, Anne Benoit, Thomas Hérault, Yves Robert, Jack J. Dongarra:
Optimal Checkpointing Period: Time vs. Energy. CoRR abs/1310.8456 (2013) - 2012
- [j214]Jack J. Dongarra, Aad J. van der Steen:
High-performance computing systems: Status and outlook. Acta Numer. 21: 379-474 (2012) - [j213]Hatem Ltaief
, Piotr Luszczek, Jack J. Dongarra:
Profiling high performance dense linear algebra algorithms on multicore architectures for power and energy efficiency. Comput. Sci. Res. Dev. 27(4): 277-287 (2012) - [j212]Horst D. Simon
, Jack J. Dongarra, Hemant Shukla:
Introduction to the Special Issue. Int. J. High Perform. Comput. Appl. 26(4): 335-336 (2012) - [j211]George Bosilca, Aurélien Bouteiller
, Anthony Danalis, Thomas Hérault
, Pierre Lemarinier
, Jack J. Dongarra:
DAGuE: A generic distributed DAG engine for High Performance Computing. Parallel Comput. 38(1-2): 37-51 (2012) - [j210]Peng Du, Rick Weber, Piotr Luszczek, Stanimire Tomov
, Gregory D. Peterson, Jack J. Dongarra:
From CUDA to OpenCL: Towards a performance-portable solution for multi-platform GPU programming. Parallel Comput. 38(8): 391-407 (2012) - [j209]Christof Vömel, Stanimire Tomov
, Jack J. Dongarra:
Divide and Conquer on Hybrid GPU-Accelerated Multicore Systems. SIAM J. Sci. Comput. 34(2) (2012) - [j208]Azzam Haidar, Hatem Ltaief
, Jack J. Dongarra:
Toward a High Performance Tile Divide and Conquer Algorithm for the Dense Symmetric Eigenvalue Problem. SIAM J. Sci. Comput. 34(6) (2012) - [j207]Anthony Danalis, Piotr Luszczek, Gabriel Marin, Jeffrey S. Vetter, Jack J. Dongarra:
BlackjackBench: portable hardware characterization. SIGMETRICS Perform. Evaluation Rev. 40(2): 74-79 (2012) - [j206]Jakub Kurzak, Stanimire Tomov
, Jack J. Dongarra:
Autotuning GEMM Kernels for the Fermi GPU. IEEE Trans. Parallel Distributed Syst. 23(11): 2045-2057 (2012) - [c262]Jack J. Dongarra, Hatem Ltaief
, Piotr Luszczek, Vincent M. Weaver:
Energy Footprint of Advanced Dense Numerical Linear Algebra Using Tile Algorithms on Multicore Architectures. CGC 2012: 274-281 - [c261]Hartwig Anzt
, Stanimire Tomov
, Jack J. Dongarra, Vincent Heuveline
:
Weighted Block-Asynchronous Iteration on GPU-Accelerated Systems. Euro-Par Workshops 2012: 145-154 - [c260]George Bosilca, Aurélien Bouteiller
, Anthony Danalis, Thomas Hérault
, Jack J. Dongarra:
From Serial Loops to Parallel Execution on Distributed Systems. Euro-Par 2012: 246-257 - [c259]Wesley Bland, Peng Du, Aurélien Bouteiller
, Thomas Hérault
, George Bosilca, Jack J. Dongarra:
A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI. Euro-Par 2012: 477-488 - [c258]Hartwig Anzt
, Piotr Luszczek, Jack J. Dongarra, Vincent Heuveline
:
GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement. Euro-Par 2012: 908-919 - [c257]George Bosilca, Aurélien Bouteiller
, Anthony Danalis, Thomas Hérault
, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra:
Scalable Dense Linear Algebra on Heterogeneous Hardware. High Performance Computing Workshop (2) 2012: 65-103 - [c256]Jack J. Dongarra, Piotr Luszczek:
Anatomy of a globally recursive embedded LINPACK benchmark. HPEC 2012: 1-6 - [c255]Fengguang Song, Stanimire Tomov
, Jack J. Dongarra:
Enabling and scaling matrix computations on heterogeneous multi-core and multi-GPU systems. ICS 2012: 365-376 - [c254]Marc Baboulin, Dulceneia Becker, Jack J. Dongarra:
A Parallel Tiled Solver for Dense Symmetric Indefinite Systems on Multicore Architectures. IPDPS 2012: 14-24 - [c253]Azzam Haidar, Hatem Ltaief
, Piotr Luszczek, Jack J. Dongarra:
A Comprehensive Study of Task Coalescing for Selecting Parallelism Granularity in a Two-Stage Bidiagonal Reduction. IPDPS 2012: 25-35 - [c252]Hartwig Anzt
, Stanimire Tomov
, Jack J. Dongarra, Vincent Heuveline
:
A Block-Asynchronous Relaxation Method for Graphics Processing Units. IPDPS Workshops 2012: 113-124 - [c251]Jack J. Dongarra, Mathieu Faverge, Thomas Hérault
, Julien Langou, Yves Robert:
Hierarchical QR Factorization Algorithms for Multi-core Cluster Systems. IPDPS 2012: 607-618 - [c250]Teng Ma, George Bosilca, Aurélien Bouteiller
, Jack J. Dongarra:
HierKNEM: An Adaptive Framework for Kernel-Assisted and Topology-Aware Collective Communications on Many-core Clusters. IPDPS 2012: 970-982 - [c249]Peng Du, Aurélien Bouteiller
, George Bosilca, Thomas Hérault
, Jack J. Dongarra:
Algorithm-based fault tolerance for dense matrix factorizations. PPoPP 2012: 225-234 - [c248]Wesley Bland, Aurélien Bouteiller
, Thomas Hérault
, Joshua Hursey, George Bosilca, Jack J. Dongarra:
An Evaluation of User-Level Failure Mitigation Support in MPI. EuroMPI 2012: 193-203 - [c247]Emmanuel Agullo, George Bosilca, Bérenger Bramas, Cedric Castagnede, Olivier Coulaud, Eric Darve, Jack J. Dongarra, Mathieu Faverge, Nathalie Furmento
, Luc Giraud, Xavier Lacoste, Julien Langou, Hatem Ltaief
, Matthias Messner, Raymond Namyst, Pierre Ramet
, Toru Takahashi
, Samuel Thibault, Stanimire Tomov
, Ichitaro Yamazaki:
Abstract: Matrices Over Runtime Systems at Exascale. SC Companion 2012: 1330-1331 - [c246]Emmanuel Agullo, George Bosilca, Bérenger Bramas, Cedric Castagnede, Olivier Coulaud, Eric Darve, Jack J. Dongarra, Mathieu Faverge, Nathalie Furmento, Luc Giraud, Xavier Lacoste, Julien Langou, Hatem Ltaief, Matthias Messner, Raymond Namyst, Pierre Ramet, Toru Takahashi, Samuel Thibault, Stanimire Tomov, Ichitaro Yamazaki:
Poster: Matrices over Runtime Systems at Exascale. SC Companion 2012: 1332 - [c245]Raffaele Solcà, Azzam Haidar, Stanimire Tomov
, Thomas C. Schulthess, Jack J. Dongarra:
Abstract: A Novel Hybrid CPU-GPU Generalized Eigensolver for Electronic Structure Calculations Based on Fine Grained Memory Aware Tasks. SC Companion 2012: 1338-1339 - [c244]Raffaele Solcà, Azzam Haidar, Stanimire Tomov, Thomas C. Schulthess, Jack J. Dongarra:
Poster: A Novel Hybrid CPU-GPU Generalized Eigensolver for Electronic Structure Calculations Based on Fine Grained Memory Aware Tasks. SC Companion 2012: 1340 - [c243]Fengguang Song, Jack J. Dongarra:
A scalable framework for heterogeneous GPU-based clusters. SPAA 2012: 91-100 - [c242]Jakub Kurzak, Piotr Luszczek, Mathieu Faverge, Jack J. Dongarra:
Programming the LU Factorization for a Multicore System with Accelerators. VECPAR 2012: 28-35 - [c241]Ahmad Abdelfattah, Jack J. Dongarra, David E. Keyes
, Hatem Ltaief
:
Optimizing Memory-Bound SYMV Kernel on GPU Hardware Accelerators. VECPAR 2012: 72-79 - [c240]Hesham H. Ali, Yong Shi, Deepak Khazanchi
, Michael Lees
, G. Dick van Albada, Jack J. Dongarra, Peter M. A. Sloot:
Empowering Science through Computing, Preface for ICCS 2012. ICCS 2012: 1-6 - [c239]Hartwig Anzt
, Stanimire Tomov
, Mark Gates
, Jack J. Dongarra, Vincent Heuveline
:
Block-asynchronous Multigrid Smoothers for GPU-accelerated Systems. ICCS 2012: 7-16 - [c238]Marc Baboulin, Simplice Donfack, Jack J. Dongarra, Laura Grigori, Adrien Rémy, Stanimire Tomov
:
A Class of Communication-avoiding Algorithms for Solving General Dense Linear Systems on CPU/GPU Parallel Machines. ICCS 2012: 17-26 - [c237]Yulu Jia, Piotr Luszczek, Jack J. Dongarra:
Multi-GPU Implementation of LU Factorization. ICCS 2012: 106-115 - [c236]Peng Du, Piotr Luszczek, Jack J. Dongarra:
High Performance Dense Linear System Solver with Resilience to Multiple Soft Errors. ICCS 2012: 216-225 - [p12]Jack J. Dongarra, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov
:
Dense Linear Algebra on Accelerated Multicore Hardware. High-Performance Scientific Computing 2012: 123-146 - [e79]Hesham H. Ali, Yong Shi, Deepak Khazanchi, Michael Lees, G. Dick van Albada, Jack J. Dongarra, Peter M. A. Sloot:
Proceedings of the International Conference on Computational Science, ICCS 2012, Omaha, Nebraska, USA, 4-6 June, 2012. Procedia Computer Science 9, Elsevier 2012 [contents] - [e78]Roman Wyrzykowski, Jack J. Dongarra, Konrad Karczewski, Jerzy Wasniewski:
Parallel Processing and Applied Mathematics - 9th International Conference, PPAM 2011, Torun, Poland, September 11-14, 2011. Revised Selected Papers, Part I. Lecture Notes in Computer Science 7203, Springer 2012, ISBN 978-3-642-31463-6 [contents] - [e77]Roman Wyrzykowski, Jack J. Dongarra, Konrad Karczewski, Jerzy Wasniewski:
Parallel Processing and Applied Mathematics - 9th International Conference, PPAM 2011, Torun, Poland, September 11-14, 2011. Revised Selected Papers, Part II. Lecture Notes in Computer Science 7204, Springer 2012, ISBN 978-3-642-31499-5 [contents] - [e76]Jesper Larsson Träff, Siegfried Benkner
, Jack J. Dongarra:
Recent Advances in the Message Passing Interface - 19th European MPI Users' Group Meeting, EuroMPI 2012, Vienna, Austria, September 23-26, 2012. Proceedings. Lecture Notes in Computer Science 7490, Springer 2012, ISBN 978-3-642-33517-4 [contents] - [i12]Raffaele Solcà, Thomas C. Schulthess, Azzam Haidar, Stanimire Tomov, Ichitaro Yamazaki, Jack J. Dongarra:
A hybrid Hermitian general eigenvalue solver. CoRR abs/1207.1773 (2012) - 2011
- [j205]Azzam Haidar, Hatem Ltaief
, Asim YarKhan
, Jack J. Dongarra:
Analysis of dynamically scheduled tile algorithms for dense linear algebra on multicore architectures. Concurr. Comput. Pract. Exp. 24(3): 305-321 (2011) - [j204]Jeffrey S. Vetter, Richard Glassbrook, Jack J. Dongarra, Karsten Schwan, Bruce Loftis, Stephen Taylor McNally, Jeremy S. Meredith, James H. Rogers, Philip C. Roth, Kyle Spafford, Sudhakar Yalamanchili:
Keeneland: Bringing Heterogeneous GPU Computing to the Computational Science Community. Comput. Sci. Eng. 13(5): 90-95 (2011) - [j203]Emmanuel Agullo, Camille Coti
, Thomas Hérault
, Julien Langou, Sylvain Peyronnet, Ala Rezmerita, Franck Cappello, Jack J. Dongarra:
QCG-OMPI: MPI applications on grids. Future Gener. Comput. Syst. 27(4): 357-369 (2011) - [j202]Jack J. Dongarra, Peter H. Beckman, Terry Moore, Patrick Aerts, Giovanni Aloisio
, Jean-Claude Andre, David Barkai, Jean-Yves Berthou, Taisuke Boku, Bertrand Braunschweig, Franck Cappello, Barbara M. Chapman, Xuebin Chi, Alok N. Choudhary, Sudip S. Dosanjh, Thom H. Dunning, Sandro Fiore
, Al Geist, Bill Gropp
, Robert J. Harrison
, Mark Hereld, Michael A. Heroux, Adolfy Hoisie, Koh Hotta, Zhong Jin, Yutaka Ishikawa, Fred Johnson, Sanjay Kale, Richard Kenway, David E. Keyes, Bill Kramer, Jesús Labarta
, Alain Lichnewsky, Thomas Lippert, Bob Lucas, Barney Maccabe
, Satoshi Matsuoka, Paul Messina, Peter Michielse, Bernd Mohr
, Matthias S. Müller
, Wolfgang E. Nagel, Hiroshi Nakashima, Michael E. Papka
, Daniel A. Reed, Mitsuhisa Sato, Edward Seidel, John Shalf
, David Skinner, Marc Snir, Thomas L. Sterling, Rick Stevens, Frederick H. Streitz
, Bob Sugar, Shinji Sumimoto, William M. Tang
, John A. Taylor, Rajeev Thakur
, Anne E. Trefethen, Mateo Valero
, Aad J. van der Steen, Jeffrey S. Vetter, Peg Williams, Robert W. Wisniewski, Katherine A. Yelick
:
The International Exascale Software Project roadmap. Int. J. High Perform. Comput. Appl. 25(1): 3-60 (2011) - [j201]Jack J. Dongarra, Bernard Tourancheau:
Selected papers of the Workshop on Clusters, Clouds and Grids for Scientific Computing (CCGSC). Int. J. High Perform. Comput. Appl. 25(3): 259-260 (2011) - [j200]Heike Jagode
, Andreas Knüpfer
, Jack J. Dongarra, Matthias Jurenz, Matthias S. Müller
, Wolfgang E. Nagel:
Trace-based performance analysis for the petascale simulation code FLASH. Int. J. High Perform. Comput. Appl. 25(4): 428-439 (2011) - [j199]James Buford White III
, Jack J. Dongarra:
High-performance high-resolution semi-Lagrangian tracer transport on a sphere. J. Comput. Phys. 230(17): 6778-6799 (2011) - [j198]Jack J. Dongarra, Bernard Tourancheau:
Guest Editors Note. Parallel Process. Lett. 21(2): 109 (2011) - [j197]Piotr Luszczek, Jack J. Dongarra:
Linear algebra - software issues. Scholarpedia 6(4): 9699 (2011) - [c235]Emmanuel Agullo, Cédric Augonnet
, Jack J. Dongarra, Mathieu Faverge, Julien Langou, Hatem Ltaief
, Stanimire Tomov
:
LU factorization for accelerator-based systems. AICCSA 2011: 217-224 - [c234]François Trahay, François Rué, Mathieu Faverge, Yutaka Ishikawa, Raymond Namyst, Jack J. Dongarra:
EZTrace: A Generic Framework for Performance Analysis. CCGRID 2011: 618-619 - [c233]George Bosilca, Thomas Hérault
, Ala Rezmerita, Jack J. Dongarra:
On Scalability for MPI Runtime Systems. CLUSTER 2011: 187-195 - [c232]Teng Ma, Thomas Hérault
, George Bosilca, Jack J. Dongarra:
Process Distance-Aware Adaptive MPI Collective Communications. CLUSTER 2011: 196-204 - [c231]Peng Du, Piotr Luszczek, Jack J. Dongarra:
High Performance Dense Linear System Solver with Soft Error Resilience. CLUSTER 2011: 272-280 - [c230]George Bosilca, Aurélien Bouteiller
, Thomas Hérault
, Pierre Lemarinier
, Narapat Ohm Saengpatsa, Stanimire Tomov
, Jack J. Dongarra:
Performance Portability of a GPU Enabled Factorization with the DAGuE Framework. CLUSTER 2011: 395-402 - [c229]Aurélien Bouteiller
, Thomas Hérault
, George Bosilca, Jack J. Dongarra:
Correlated Set Coordination in Fault Tolerant Message Logging Protocols. Euro-Par (2) 2011: 51-64 - [c228]Emmanuel Agullo, Jack J. Dongarra, Rajib Nath, Stanimire Tomov
:
A Fully Empirical Autotuned Dense QR Factorization for Multicore Architectures. Euro-Par (2) 2011: 194-205 - [c227]Piotr Luszczek, Eric Meek, Shirley Moore, Daniel Terpstra, Vincent M. Weaver, Jack J. Dongarra:
Evaluation of the HPC Challenge Benchmarks in Virtualized Environments. Euro-Par Workshops (2) 2011: 436-445 - [c226]Teng Ma, George Bosilca, Aurélien Bouteiller
, Brice Goglin
, Jeffrey M. Squyres, Jack J. Dongarra:
Kernel Assisted Collective Intra-node MPI Communication among Multi-Core and Many-Core CPUs. ICPP 2011: 532-541 - [c225]James Buford White III
, Jack J. Dongarra:
Overlapping Computation and Communication for Advection on Hybrid Parallel Computers. IPDPS 2011: 59-67 - [c224]Yves Robert, William J. Dally, Jack J. Dongarra, Satoshi Matsuoka, Robert Schreiber, Horst D. Simon, Uzi Vishkin:
Panel Statement. IPDPS 2011: 505 - [c223]Jack J. Dongarra:
Architecture-aware Algorithms and Software for Peta and Exascale Computing. IPDPS 2011: 507 - [c222]Emmanuel Agullo, Cédric Augonnet
, Jack J. Dongarra, Mathieu Faverge, Hatem Ltaief
, Samuel Thibault, Stanimire Tomov
:
QR Factorization on a Multicore Node Enhanced with Multiple GPU Accelerators. IPDPS 2011: 932-943 - [c221]Piotr Luszczek, Hatem Ltaief
, Jack J. Dongarra:
Two-Stage Tridiagonal Reduction for Dense Symmetric Matrices Using Tile Algorithms on Multicore Architectures. IPDPS 2011: 944-955 - [c220]George Bosilca, Aurélien Bouteiller
, Anthony Danalis, Thomas Hérault
, Pierre Lemarinier
, Jack J. Dongarra:
DAGuE: A Generic Distributed DAG Engine for High Performance Computing. IPDPS Workshops 2011: 1151-1158 - [c219]George Bosilca, Aurélien Bouteiller
, Anthony Danalis, Mathieu Faverge, Azzam Haidar, Thomas Hérault
, Jakub Kurzak, Julien Langou, Pierre Lemarinier
, Hatem Ltaief
, Piotr Luszczek, Asim YarKhan
, Jack J. Dongarra:
Flexible Development of Dense Linear Algebra Algorithms on Massively Parallel Architectures with DPLASMA. IPDPS Workshops 2011: 1432-1441 - [c218]Hatem Ltaief
, Piotr Luszczek, Azzam Haidar, Jack J. Dongarra:
Solving the Generalized Symmetric Eigenvalue Problem using Tile Algorithms on Multicore Architectures. PARCO 2011: 397-404 - [c217]Jack J. Dongarra, Mathieu Faverge, Hatem Ltaief
, Piotr Luszczek:
Exploiting Fine-Grain Parallelism in Recursive LU Factorization. PARCO 2011: 429-436 - [c216]Dulceneia Becker, Marc Baboulin, Jack J. Dongarra:
Reducing the Amount of Pivoting in Symmetric Indefinite Systems. PPAM (1) 2011: 133-142 - [c215]Hatem Ltaief
, Piotr Luszczek, Jack J. Dongarra:
Enhancing Parallelism of Tile Bidiagonal Transformation on Multicore Architectures Using Tree Reduction. PPAM (1) 2011: 661-670 - [c214]Piotr Luszczek, Jack J. Dongarra:
Reducing the Time to Tune Parallel Dense Linear Algebra Routines with Partial Execution and Performance Modeling. PPAM (1) 2011: 730-739 - [c213]Mohamad Chaarawi, Edgar Gabriel, Rainer Keller, Richard L. Graham, George Bosilca, Jack J. Dongarra:
OMPIO: A Modular Software Architecture for MPI I/O. EuroMPI 2011: 81-89 - [c212]Teng Ma, Aurélien Bouteiller
, George Bosilca, Jack J. Dongarra:
Impact of Kernel-Assisted MPI Communication over Scientific Applications: CPMD and FFTW. EuroMPI 2011: 247-254 - [c211]George Bosilca, Thomas Hérault
, Pierre Lemarinier
, Ala Rezmerita, Jack J. Dongarra:
Scalable Runtime for MPI: Efficiently Building the Communication Infrastructure. EuroMPI 2011: 342-344 - [c210]Shirley Moore, Daniel Terpstra, Vincent M. Weaver, Heike Jagode
, James Ralph, Jack J. Dongarra:
Poster: new features of the PAPI hardware counter library. SC Companion 2011: 3-4 - [c209]Ioan Raicu, Dan Reed, Jack J. Dongarra, Daniel S. Katz
, David Abramson
:
Panel: many-task computing meets exascales. MTAGS@SC 2011: 3-4 - [c208]Rajib Nath, Stanimire Tomov
, Tingxing Dong, Jack J. Dongarra:
Optimizing symmetric dense matrix-vector multiplication on GPUs. SC 2011: 6:1-6:10 - [c207]Azzam Haidar, Hatem Ltaief
, Jack J. Dongarra:
Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels. SC 2011: 8:1-8:11 - [c206]Peng Du, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra:
Soft error resilient QR factorization for hybrid system with GPGPU. ScalA@SC 2011: 11-14 - [c205]Jack J. Dongarra, Mathieu Faverge, Hatem Ltaief
, Piotr Luszczek:
High performance matrix inversion based on LU factorization for multicore architectures. MTAGS@SC 2011: 33-42 - [c204]Mitsuhisa Sato, Satoshi Matsuoka, Peter M. A. Sloot, G. Dick van Albada, Jack J. Dongarra:
Preface. ICCS 2011: 1-6 - [e75]Mitsuhisa Sato, Satoshi Matsuoka, Peter M. A. Sloot, G. Dick van Albada, Jack J. Dongarra:
Proceedings of the International Conference on Computational Science, ICCS 2011, Nanyang Technological University, Singapore, 1-3 June, 2011. Procedia Computer Science 4, Elsevier 2011 [contents] - [e74]Yiannis Cotronis, Anthony Danalis, Dimitrios S. Nikolopoulos, Jack J. Dongarra:
Recent Advances in the Message Passing Interface - 18th European MPI Users' Group Meeting, EuroMPI 2011, Santorini, Greece, September 18-21, 2011. Proceedings. Lecture Notes in Computer Science 6960, Springer 2011, ISBN 978-3-642-24448-3 [contents] - [e73]Vassil Alexandrov, Al Geist, Jack J. Dongarra:
Proceedings of the second workshop on Scalable algorithms for large-scale systems, ScalA@SC 2011, Seattle, WA, USA, November 14, 2011. ACM 2011, ISBN 978-1-4503-1180-9 [contents] - [r10]Jack J. Dongarra, Piotr Luszczek:
Benchmarks. Encyclopedia of Parallel Computing 2011: 127-129 - [r9]Jack J. Dongarra, Piotr Luszczek:
HPC Challenge Benchmark. Encyclopedia of Parallel Computing 2011: 844-850 - [r8]Jack J. Dongarra, Piotr Luszczek:
LAPACK. Encyclopedia of Parallel Computing 2011: 1005-1006 - [r7]Jack J. Dongarra, Piotr Luszczek:
Linear Algebra Software. Encyclopedia of Parallel Computing 2011: 1021-1026 - [r6]Jack J. Dongarra, Piotr Luszczek:
LINPACK Benchmark. Encyclopedia of Parallel Computing 2011: 1033-1036 - [r5]Jack J. Dongarra, Piotr Luszczek:
Livermore Loops. Encyclopedia of Parallel Computing 2011: 1041-1043 - [r4]Jack J. Dongarra, Piotr Luszczek:
PLASMA. Encyclopedia of Parallel Computing 2011: 1568-1570 - [r3]Jack J. Dongarra, Piotr Luszczek:
ScaLAPACK. Encyclopedia of Parallel Computing 2011: 1773-1775 - [r2]Jack J. Dongarra, Piotr Luszczek:
TOP500. Encyclopedia of Parallel Computing 2011: 2055-2057 - [i11]Emmanuel Agullo, Jack J. Dongarra, Rajib Nath, Stanimire Tomov:
Fully Empirical Autotuned QR Factorization For Multicore Architectures. CoRR abs/1102.5328 (2011) - [i10]Jack J. Dongarra, Mathieu Faverge, Thomas Hérault, Julien Langou, Yves Robert:
Hierarchical QR factorization algorithms for multi-core cluster systems. CoRR abs/1110.1553 (2011) - 2010
- [j196]Jakub Kurzak, Hatem Ltaief
, Jack J. Dongarra, Rosa M. Badia
:
Scheduling dense linear algebra operations on multicore processors. Concurr. Comput. Pract. Exp. 22(1): 15-44 (2010) - [j195]Aurélien Bouteiller, George Bosilca, Jack J. Dongarra:
Redesigning the message logging model for high performance. Concurr. Comput. Pract. Exp. 22(16): 2196-2211 (2010) - [j194]Thomas Brady, Jack J. Dongarra, Michele Guidolin
, Alexey L. Lastovetsky
, Keith Seymour:
SmartGridRPC: The new RPC model for high performance Grid computing. Concurr. Comput. Pract. Exp. 22(18): 2467-2487 (2010) - [j193]Thara Angskun, Graham E. Fagg, George Bosilca, Jelena Pjesivac-Grbovic, Jack J. Dongarra:
Self-healing network for scalable fault-tolerant runtime environments. Future Gener. Comput. Syst. 26(3): 479-485 (2010) - [j192]Rajib Nath, Stanimire Tomov
, Jack J. Dongarra:
An Improved Magma Gemm For Fermi Graphics Processing Units. Int. J. High Perform. Comput. Appl. 24(4): 511-515 (2010) - [j191]Peter M. A. Sloot, Peter V. Coveney, Jack J. Dongarra:
Preface. J. Comput. Sci. 1(1): 3-4 (2010) - [j190]Stanimire Tomov
, Jack J. Dongarra, Marc Baboulin:
Towards dense linear algebra for hybrid GPU accelerated manycore systems. Parallel Comput. 36(5-6): 232-240 (2010) - [j189]Stanimire Tomov
, Rajib Nath, Jack J. Dongarra:
Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing. Parallel Comput. 36(12): 645-654 (2010) - [j188]Hatem Ltaief
, Jakub Kurzak, Jack J. Dongarra, Rosa M. Badia
:
Scheduling two-sided transformations using tile algorithms on multicore architectures. Sci. Program. 18(1): 35-50 (2010) - [j187]Fred G. Gustavson, Jerzy Wasniewski, Jack J. Dongarra, Julien Langou:
Rectangular full packed format for cholesky's algorithm: factorization, solution, and inversion. ACM Trans. Math. Softw. 37(2): 18:1-18:21 (2010) - [j186]Hatem Ltaief
, Jakub Kurzak, Jack J. Dongarra:
Parallel Two-Sided Matrix Reduction to Band Bidiagonal Form on Multicore Architectures. IEEE Trans. Parallel Distributed Syst. 21(4): 417-423 (2010) - [c203]Peng Du, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra:
Mixed-Tool Performance Analysis on Hybrid Multicore Architectures. ICPP Workshops 2010: 236-244 - [c202]Emmanuel Agullo, Camille Coti, Jack J. Dongarra, Thomas Hérault
, Julien Langou:
QR factorization of tall and skinny matrices in a grid computing environment. IPDPS 2010: 1-11 - [c201]Bilel Hadri, Hatem Ltaief
, Emmanuel Agullo, Jack J. Dongarra:
Tile QR factorization with parallel panel processing for multicore architectures. IPDPS 2010: 1-10 - [c200]Stanimire Tomov
, Rajib Nath, Hatem Ltaief
, Jack J. Dongarra:
Dense linear algebra solvers for multicore with GPU accelerators. IPDPS Workshops 2010: 1-8 - [c199]Jakub Kurzak, Rajib Nath, Peng Du, Jack J. Dongarra:
An Implementation of the Tile QR Factorization for a GPU and Multiple CPUs. PARA (2) 2010: 248-257 - [c198]George Bosilca, Aurélien Bouteiller
, Thomas Hérault
, Pierre Lemarinier
, Jack J. Dongarra:
Dodging the Cost of Unavoidable Memory Copies in Message Logging Protocols. EuroMPI 2010: 189-197 - [c197]Teng Ma, George Bosilca, Aurélien Bouteiller
, Jack J. Dongarra:
Locality and Topology Aware Intra-node Communication among Multicore CPUs. EuroMPI 2010: 265-274 - [c196]Fengguang Song, Hatem Ltaief
, Bilel Hadri, Jack J. Dongarra:
Scalable Tile Communication-Avoiding QR Factorization on Multicore Cluster Systems. SC 2010: 1-11 - [c195]Rajib Nath, Stanimire Tomov
, Jack J. Dongarra:
Accelerating GPU Kernels for Dense Linear Algebra. VECPAR 2010: 83-92 - [c194]Hatem Ltaief
, Stanimire Tomov
, Rajib Nath, Peng Du, Jack J. Dongarra:
A Scalable High Performant Cholesky Factorization for Multicore with GPU Accelerators. VECPAR 2010: 93-101 - [c193]Emmanuel Agullo, Henricus Bouwmeester, Jack J. Dongarra, Jakub Kurzak, Julien Langou, Lee Rosenberg:
Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Multicore Architectures. VECPAR 2010: 129-138 - [c192]Volodymyr Turchenko
, Lucio Grandinetti, George Bosilca, Jack J. Dongarra:
Improvement of parallelization efficiency of batch pattern BP training algorithm using Open MPI. ICCS 2010: 525-533 - [p11]Wesley Alvaro, Jakub Kurzak, Jack J. Dongarra:
Implementing Matrix Multiplication on the Cell B. E. Scientific Computing with Multicore and Accelerators 2010: 3-20 - [p10]Jakub Kurzak, Jack J. Dongarra:
Implementing Matrix Factorizations on the Cell B. E. Scientific Computing with Multicore and Accelerators 2010: 21-35 - [p9]Stanimire Tomov, Jack J. Dongarra:
Dense Linear Algebra for Hybrid GPU-Based Systems. Scientific Computing with Multicore and Accelerators 2010: 37-55 - [p8]Rajib Nath, Stanimire Tomov, Jack J. Dongarra:
BLAS for GPUs. Scientific Computing with Multicore and Accelerators 2010: 57-80 - [e72]Jakub Kurzak, David A. Bader
, Jack J. Dongarra:
Scientific Computing with Multicore and Accelerators. Chapman and Hall / CRC computational science series, CRC Press / Taylor & Francis 2010, ISBN 978-1-4398-2536-5 [contents] - [e71]Peter M. A. Sloot, G. Dick van Albada, Jack J. Dongarra:
Proceedings of the International Conference on Computational Science, ICCS 2010, University of Amsterdam, The Netherlands, May 31 - June 2, 2010. Procedia Computer Science 1(1), Elsevier 2010 [contents] - [e70]Roman Wyrzykowski, Jack J. Dongarra, Konrad Karczewski, Jerzy Wasniewski:
Parallel Processing and Applied Mathematics, 8th International Conference, PPAM 2009, Wroclaw, Poland, September 13-16, 2009. Revised Selected Papers, Part I. Lecture Notes in Computer Science 6067, Springer 2010, ISBN 978-3-642-14389-2 [contents] - [e69]Roman Wyrzykowski, Jack J. Dongarra, Konrad Karczewski, Jerzy Wasniewski:
Parallel Processing and Applied Mathematics, 8th International Conference, PPAM 2009, Wroclaw, Poland, September 13-16, 2009, Revised Selected Papers, Part II. Lecture Notes in Computer Science 6068, Springer 2010, ISBN 978-3-642-14402-8 [contents] - [e68]Rainer Keller, Edgar Gabriel, Michael M. Resch, Jack J. Dongarra:
Recent Advances in the Message Passing Interface - 17th European MPI Users' Group Meeting, EuroMPI 2010, Stuttgart, Germany, September 12-15, 2010. Proceedings. Lecture Notes in Computer Science 6305, Springer 2010, ISBN 978-3-642-15645-8 [contents] - [i9]Emmanuel Agullo, Henricus Bouwmeester, Jack J. Dongarra, Jakub Kurzak, Julien Langou, Lee Rosenberg:
Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Multicore Architectures. CoRR abs/1002.4057 (2010)
2000 – 2009
- 2009
- [b8]Alexey L. Lastovetsky, Jack J. Dongarra:
High Performance Heterogeneous Computing. Wiley series on parallel and distributed computing, Wiley 2009, ISBN 978-0-470-04039-3, pp. I-XI, 1-267 - [j185]Lamia Youseff, Keith Seymour, Haihang You, Dmitrii Zagorodnov, Jack J. Dongarra, Richard Wolski:
Paravirtualization effect on single- and multi-threaded memory-intensive linear algebra software. Clust. Comput. 12(2): 101-122 (2009) - [j184]Marc Baboulin, Alfredo Buttari
, Jack J. Dongarra, Jakub Kurzak, Julie Langou, Julien Langou, Piotr Luszczek, Stanimire Tomov
:
Accelerating scientific computations with mixed precision algorithms. Comput. Phys. Commun. 180(12): 2526-2533 (2009) - [j183]Jack J. Dongarra, Julien Langou:
The Problem With the Linpack Benchmark 1.0 Matrix Generator. Int. J. High Perform. Comput. Appl. 23(1): 5-13 (2009) - [j182]Jack J. Dongarra, Bernard Tourancheau:
Editorial. Int. J. High Perform. Comput. Appl. 23(3): 195 (2009) - [j181]Jack J. Dongarra, Peter H. Beckman, Patrick Aerts, Franck Cappello, Thomas Lippert, Satoshi Matsuoka, Paul Messina, Terry Moore, Rick Stevens, Anne E. Trefethen, Mateo Valero
:
The International Exascale Software Project: a Call To Cooperative Action By the Global High-Performance Community. Int. J. High Perform. Comput. Appl. 23(4): 309-322 (2009) - [j180]George Bosilca, Remi Delmas, Jack J. Dongarra, Julien Langou
:
Algorithm-based fault tolerance applied to high performance computing. J. Parallel Distributed Comput. 69(4): 410-416 (2009) - [j179]Marc Baboulin, Jack J. Dongarra, Serge Gratton, Julien Langou:
Computing the conditioning of the components of a linear least-squares solution. Numer. Linear Algebra Appl. 16(7): 517-533 (2009) - [j178]Alfredo Buttari
, Julien Langou
, Jakub Kurzak, Jack J. Dongarra:
A class of parallel tiled linear algebra algorithms for multicore architectures. Parallel Comput. 35(1): 38-53 (2009) - [j177]Jakub Kurzak, Wesley Alvaro, Jack J. Dongarra:
Optimizing matrix multiplication for a short-vector SIMD architecture - CELL processor. Parallel Comput. 35(3): 138-150 (2009) - [j176]Franck Cappello, Thomas Hérault
, Jack J. Dongarra:
Foreword. Parallel Comput. 35(12): 571 (2009) - [j175]Jakub Kurzak, Jack J. Dongarra:
QR factorization for the Cell Broadband Engine. Sci. Program. 17(1-2): 31-42 (2009) - [j174]Zizhong Chen
, Jack J. Dongarra:
Highly Scalable Self-Healing Algorithms for High Performance Scientific Computing. IEEE Trans. Computers 58(11): 1512-1524 (2009) - [c191]Aurélien Bouteiller
, Thomas Ropars, George Bosilca, Christine Morin, Jack J. Dongarra:
Reasons for a pessimistic or optimistic message logging protocol in MPI uncoordinated failure, recovery. CLUSTER 2009: 1-9 - [c190]Fengguang Song, Shirley Moore, Jack J. Dongarra:
Analytical modeling and optimization for affinity based thread scheduling on multicore systems. CLUSTER 2009: 1-10 - [c189]Fengguang Song, Jack J. Dongarra, Shirley Moore:
A Scalable Non-blocking Multicast Scheme for Distributed DAG Scheduling. ICCS (1) 2009: 195-204 - [c188]Heike Jagode
, Jack J. Dongarra, Sadaf R. Alam, Jeffrey S. Vetter, Wyatt Spear, Allen D. Malony:
A Holistic Approach for Performance Measurement and Analysis for Petascale Applications. ICCS (2) 2009: 686-695 - [c187]Yinan Li, Jack J. Dongarra, Stanimire Tomov
:
A Note on Auto-tuning GEMM for GPUs. ICCS (1) 2009: 884-892 - [c186]Rinku Gupta, Peter H. Beckman, Byung-Hoon Park, Ewing L. Lusk, Paul Hargrove
, Al Geist, Dhabaleswar K. Panda, Andrew Lumsdaine
, Jack J. Dongarra:
CIFTS: A Coordinated Infrastructure for Fault-Tolerant Systems. ICPP 2009: 237-245 - [c185]George Bosilca, Camille Coti, Thomas Hérault
, Pierre Lemarinier
, Jack J. Dongarra:
Constructing Resiliant Communication Infrastructure for Runtime Environments. PARCO 2009: 441-451 - [c184]Daniel Terpstra, Heike Jagode
, Haihang You, Jack J. Dongarra:
Collecting Performance Data with PAPI-C. Parallel Tools Workshop 2009: 157-173 - [c183]Torsten Hoefler, Andrew Lumsdaine, Jack J. Dongarra:
Towards Efficient MapReduce Using MPI. PVM/MPI 2009: 240-249 - [c182]Emmanuel Agullo, Bilel Hadri, Hatem Ltaief
, Jack J. Dongarra:
Comparative study of one-sided factorizations with multiple software packages on multi-core hardware. SC 2009 - [c181]Fengguang Song, Asim YarKhan
, Jack J. Dongarra:
Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems. SC 2009 - [p7]Jack J. Dongarra, Hans Werner Meuer, Horst D. Simon
, Erich Strohmaier:
Recent trends in high performance computing. The Birth of Numerical Analysis 2009: 93-107 - [e67]Gabrielle Allen, Jaroslaw Nabrzyski, Edward Seidel, G. Dick van Albada, Jack J. Dongarra, Peter M. A. Sloot:
Computational Science - ICCS 2009, 9th International Conference, Baton Rouge, LA, USA, May 25-27, 2009, Proceedings, Part I. Lecture Notes in Computer Science 5544, Springer 2009, ISBN 978-3-642-01969-2 [contents] - [e66]Gabrielle Allen, Jaroslaw Nabrzyski, Edward Seidel, G. Dick van Albada, Jack J. Dongarra, Peter M. A. Sloot:
Computational Science - ICCS 2009, 9th International Conference, Baton Rouge, LA, USA, May 25-27, 2009, Proceedings, Part II. Lecture Notes in Computer Science 5545, Springer 2009, ISBN 978-3-642-01972-2 [contents] - [e65]Matti Ropo
, Jan Westerholm, Jack J. Dongarra:
Recent Advances in Parallel Virtual Machine and Message Passing Interface, 16th European PVM/MPI Users' Group Meeting, Espoo, Finland, September 7-10, 2009. Proceedings. Lecture Notes in Computer Science 5759, Springer 2009, ISBN 978-3-642-03769-6 [contents] - [i8]