default search action
Enrique S. Quintana-Ortí
Person information
- affiliation: Jaume I University, Castellón de la Plana, Spain
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j183]Adrián Castelló, Héctor Martínez, Sandra Catalán, Francisco D. Igual, Enrique S. Quintana-Ortí:
Experience-guided, mixed-precision matrix multiplication with apache TVM for ARM processors. J. Supercomput. 81(1): 214 (2025) - [j182]Roberto Díaz-Cano Lozano, Francesc Folch, Enrique S. Quintana-Ortí, Pedro Alonso-Jordá:
Acceleration of the MVS workflow using graphics processors. J. Supercomput. 81(2): 364 (2025) - 2024
- [j181]Rafael Rodríguez-Sánchez, Adrián Castelló, Sandra Catalán, Francisco D. Igual, Enrique S. Quintana-Ortí:
Experiences with nested parallelism in task-parallel applications using malleable BLAS on multicore processors. Int. J. High Perform. Comput. Appl. 38(2): 55-68 (2024) - [j180]Cristián Ramírez, Adrián Castelló, Héctor Martínez, Enrique S. Quintana-Ortí:
Communication-Avoiding Fusion of GEMM-Based Convolutions for Deep Learning in the RISC-V GAP8 MCU. IEEE Internet Things J. 11(21): 35640-35653 (2024) - [j179]Héctor Martínez, Sandra Catalán, Adrián Castelló, Enrique S. Quintana-Ortí:
Parallel GEMM-based convolutions for deep learning on multicore ARM and RISC-V architectures. J. Syst. Archit. 153: 103186 (2024) - [j178]María Engracia Gómez, Julio Sahuquillo, Andrea Biagioni, Nikos Chrysos, Damien Berton, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Pier Stanislao Paolucci, Elena Pastorelli, Francesco Simula, Matteo Turisini, Piero Vicini, Roberto Ammendola, Carlotta Chiarini, Chiara De Luca, Fabrizio Capuani, Adrián Castelló, Jose Duro, Eugenio Stabile, Enrique S. Quintana-Ortí, Pascale Bernier-Bruna, Claire Chen, Pierre-Axel Lagadec, Gregoire Pichon, Etienne Walter, Manolis Katevenis, Sokratis Bartzis, Orestis Mousouros, Pantelis Xirouchakis, Vangelis Mageiropoulos, Michalis Gianioudis, Harisis Loukas, Aggelos Ioannou, Nikos Kallimanis, Miguel Sánchez de la Rosa, Gabriel Gomez-Lopez, Francisco Alfaro-Cortés, Jesús Escudero-Sahuquillo, Pedro Javier García, Francisco J. Quiles, José L. Sánchez, Gaetan De Gassowski, Matthieu Hautreaux, Stephane Mathieu, Gilles Moreau, Marc Pérache, Hugo Taboada, Torsten Hoefler, Timo Schneider, Matteo Barnaba, Giuseppe Piero Brandino, Francesco De Giorgi, Matteo Poggi, Iakovos Mavroidis, Yannis Papaefstathiou, Nikolaos Tampouratzis, Benjamin Kalisch, Ulrich Krackhardt, Mondrian Nuessle, Wolfgang Frings, Dominik Gottwald, Felime Guimaraes, Max Holicki, Volker Marx, Yannik Müller, Carsten Clauss, Hugo Falter, Xu Huang, Jennifer Lopez Barillao, Thomas Moschny, Simon Pickartz:
RED-SEA Project: Towards a new-generation European interconnect. Microprocess. Microsystems 110: 105102 (2024) - [j177]Antoine Grenier, Jie Lei, Hans Jakob Damsgaard, Enrique S. Quintana-Ortí, Aleksandr Ometov, Elena Simona Lohan, Jari Nurmi:
Hard SyDR: A Benchmarking Environment for Global Navigation Satellite System Algorithms. Sensors 24(2): 409 (2024) - [j176]Cristián Ramírez, Adrián Castelló, Héctor Martínez, Enrique S. Quintana-Ortí:
Parallel GEMM-based convolution for deep learning on multicore RISC-V processors. J. Supercomput. 80(9): 12623-12643 (2024) - [j175]Guillermo Alaejos, Héctor Martínez, Adrián Castelló, Manuel F. Dolz, Francisco D. Igual, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí:
Automatic generation of ARM NEON micro-kernels for matrix multiplication. J. Supercomput. 80(10): 13873-13899 (2024) - [j174]Guillermo Alaejos, Adrián Castelló, Pedro Alonso-Jordá, Francisco D. Igual, Héctor Martínez, Enrique S. Quintana-Ortí:
Algorithm 1039: Automatic Generators for a Family of Matrix Multiplication Routines with Apache TVM. ACM Trans. Math. Softw. 50(1): 6:1-6:34 (2024) - [c195]Piotr Kluska, Adrián Castelló, Florian Scheidegger, A. Cristiano I. Malossi, Enrique S. Quintana-Ortí:
QAttn: Efficient GPU Kernels for mixed-precision Vision Transformers. CVPR Workshops 2024: 3648-3657 - [c194]Héctor Martínez, Francisco D. Igual, Rafael Rodríguez-Sánchez, Sandra Catalán, Adrián Castelló, Enrique S. Quintana-Ortí:
Inference with Transformer Encoders on ARM and RISC-V Multicore Processors. Euro-Par (2) 2024: 377-392 - [c193]Roberto Díaz-Cano Lozano, Francesc Folch, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí:
Acceleration of the Pre-processing Stage of the MVS Workflow using Graphics Processors. PMAM@PPoPP 2024: 1-10 - [c192]Héctor Martínez, Sandra Catalán, Carlos García, Francisco D. Igual, Rafael Rodríguez-Sánchez, Adrián Castelló, Enrique S. Quintana-Ortí:
Performance Analysis of BERT on RISC-V Processors with SIMD Units. ISC Workshops 2024: 325-338 - [i29]Andrés E. Tomás, Enrique S. Quintana-Ortí, Hartwig Anzt:
Fast Truncated SVD of Sparse and Dense Matrices on Graphics Processors. CoRR abs/2403.06218 (2024) - [i28]Cristián Ramírez, Adrián Castelló, Héctor Martínez, Enrique S. Quintana-Ortí:
Performance Analysis of Matrix Multiplication for Deep Learning on the Edge. CoRR abs/2403.07731 (2024) - [i27]Jie Lei, Enrique S. Quintana-Ortí:
Mapping Parallel Matrix Multiplication in GotoBLAS2 to the AMD Versal ACAP for Deep Learning. CoRR abs/2404.15043 (2024) - [i26]S. Ares de Parga, Jose Raul Bravo, N. Sibuet, J. A. Hernandez, Riccardo Rossi, Stefan Boschert, Enrique S. Quintana-Ortí, Andrés E. Tomás, Cristian Catalin Tatu, Fernando Vázquez-Novoa, Jorge Ejarque, Rosa M. Badia:
Parallel Reduced Order Modeling for Digital Twins using High-Performance Computing Workflows. CoRR abs/2409.09080 (2024) - 2023
- [j173]Adrián Castelló, Mar Catalán, Manuel F. Dolz, Enrique S. Quintana-Ortí, José Duato:
Analyzing the impact of the MPI allreduce in distributed training of convolutional neural networks. Computing 105(5): 1101-1119 (2023) - [j172]Sandra Catalán, José R. Herrero, Francisco D. Igual, Enrique S. Quintana-Ortí, Rafael Rodríguez-Sánchez:
Fine-grain task-parallel algorithms for matrix factorizations and inversion on many-threaded CPUs. Concurr. Comput. Pract. Exp. 35(27) (2023) - [j171]José Ignacio Aliaga, Hartwig Anzt, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Sparse matrix-vector and matrix-multivector products for the truncated SVD on graphics processors. Concurr. Comput. Pract. Exp. 35(28) (2023) - [j170]José Ignacio Aliaga, Hartwig Anzt, Thomas Grützmacher, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Compressed basis GMRES on high-performance graphics processing units. Int. J. High Perform. Comput. Appl. 37(2): 82-100 (2023) - [j169]Andrés E. Tomás, Enrique S. Quintana-Ortí, Hartwig Anzt:
Fast truncated SVD of sparse and dense matrices on graphics processors. Int. J. High Perform. Comput. Appl. 37(3-4): 380-393 (2023) - [j168]Sandra Catalán, Francisco D. Igual, José R. Herrero, Rafael Rodríguez-Sánchez, Enrique S. Quintana-Ortí:
Programming parallel dense matrix factorizations and inversion for new-generation NUMA architectures. J. Parallel Distributed Comput. 175: 51-65 (2023) - [j167]Sergio Barrachina, Adrián Castelló, Manuel F. Dolz, Tze Meng Low, Héctor Martínez, Enrique S. Quintana-Ortí, Upasana Sridhar, Andrés E. Tomás:
Reformulating the direct convolution for high-performance deep learning inference on ARM processors. J. Syst. Archit. 135: 102806 (2023) - [j166]Thomas Grützmacher, Hartwig Anzt, Enrique S. Quintana-Ortí:
Using Ginkgo's memory accessor for improving the accuracy of memory-bound low precision BLAS. Softw. Pract. Exp. 53(1): 81-98 (2023) - [j165]Guillermo Alaejos, Adrián Castelló, Héctor Martínez, Pedro Alonso-Jordá, Francisco D. Igual, Enrique S. Quintana-Ortí:
Micro-kernels for portable and efficient matrix multiplication in deep learning. J. Supercomput. 79(7): 8124-8147 (2023) - [j164]Manuel F. Dolz, Héctor Martínez, Adrián Castelló, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí:
Efficient and portable Winograd convolutions for multi-core processors. J. Supercomput. 79(10): 10589-10610 (2023) - [c191]Andrés E. Tomás, Enrique S. Quintana-Ortí:
Tall-and-Skinny QR Factorization for Clusters of GPUs Using High-Performance Building Blocks. Euro-Par Workshops (1) 2023: 306-317 - [c190]Antoine Grenier, Hans Jakob Damsgaard, Jie Lei, Enrique S. Quintana-Ortí, Aleksandr Ometov, Elena Simona Lohan, Jari Nurmi:
Towards Benchmarking GNSS Algorithms on FPGA using SyDR. ICL-GNSS 2023: 1-7 - [c189]Jie Lei, José Flich, Enrique S. Quintana-Ortí:
Toward Matrix Multiplication for Deep Learning Inference on the Xilinx Versal. PDP 2023: 227-234 - [c188]Francisco D. Igual, Luis Piñuel, Sandra Catalán, Héctor Martínez, Adrián Castelló, Enrique S. Quintana-Ortí:
Automatic Generation of Micro-kernels for Performance Portability of Matrix Multiplication on RISC-V Vector Processors. SC Workshops 2023: 1521-1532 - [c187]Jie Lei, Héctor Martínez, José Flich, Enrique S. Quintana-Ortí:
GEMM-Like Convolution for Deep Learning Inference on the Xilinx Versal. ISC Workshops 2023: 593-604 - [i25]Jie Lei, José Flich, Enrique S. Quintana-Ortí:
Toward matrix multiplication for deep learning inference on the Xilinx Versal. CoRR abs/2302.07594 (2023) - [i24]Héctor Martínez, Sandra Catalán, Francisco D. Igual, José R. Herrero, Rafael Rodríguez-Sánchez, Enrique S. Quintana-Ortí:
Co-Design of the Dense Linear AlgebravSoftware Stack for Multicore Processors. CoRR abs/2304.14480 (2023) - [i23]Guillermo Alaejos, Adrián Castelló, Pedro Alonso-Jordá, Francisco D. Igual, Héctor Martínez, Enrique S. Quintana-Ortí:
Automatic Generators for a Family of Matrix Multiplication Routines with Apache TVM. CoRR abs/2310.20347 (2023) - [i22]José Duato, José I. Mestre, Manuel F. Dolz, Enrique S. Quintana-Ortí:
GreenLightningAI: An Efficient AI System with Decoupled Structural and Quantitative Knowledge. CoRR abs/2312.09971 (2023) - 2022
- [j163]José Ignacio Aliaga, Hartwig Anzt, Thomas Grützmacher, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units. Concurr. Comput. Pract. Exp. 34(14) (2022) - [j162]Jorge Ejarque, Rosa M. Badia, Loïc Albertin, Giovanni Aloisio, Enrico Baglione, Yolanda Becerra, Stefan Boschert, Julian R. Berlin, Alessandro D'Anca, Donatello Elia, François Exertier, Sandro Fiore, José Flich, Arnau Folch, Steven J. Gibbons, Nikolay Koldunov, Francesc Lordan, Stefano Lorito, Finn Løvholt, Jorge Macías Sánchez, Fabrizio Marozzo, Alberto Michelini, Marisol Monterrubio Velasco, Marta Pienkowska, Josep de la Puente, Anna Queralt, Enrique S. Quintana-Ortí, Juan Esteban Rodriguez, Fabrizio Romano, Riccardo Rossi, Jedrzej Rybicki, Miroslaw Kupczyk, Jacopo Selva, Domenico Talia, Roberto Tonini, Paolo Trunfio, Manuela Volpe:
Enabling dynamic and intelligent workflows for HPC, data analytics, and AI convergence. Future Gener. Comput. Syst. 134: 414-429 (2022) - [j161]Emmanuel Agullo, Mirco Altenbernd, Hartwig Anzt, Leonardo Bautista-Gomez, Tommaso Benacchio, Luca Bonaventura, Hans-Joachim Bungartz, Sanjay Chatterjee, Florina M. Ciorba, Nathan DeBardeleben, Daniel Drzisga, Sebastian Eibl, Christian Engelmann, Wilfried N. Gansterer, Luc Giraud, Dominik Göddeke, Marco Heisig, Fabienne Jézéquel, Nils Kohl, Xiaoye Sherry Li, Romain Lion, Miriam Mehl, Paul Mycek, Michael Obersteiner, Enrique S. Quintana-Ortí, Francesco Rizzi, Ulrich Rüde, Martin Schulz, Fred Fung, Robert Speck, Linda Stals, Keita Teranishi, Samuel Thibault, Dominik Thönnes, Andreas Wagner, Barbara I. Wohlmuth:
Resiliency in numerical algorithm design for extreme scale simulations. Int. J. High Perform. Comput. Appl. 36(2): 251-285 (2022) - [j160]Sergio Iserte, Aina Macías, Raúl Martínez-Cuenca, Sergio Chiva, Roberto Paredes, Enrique S. Quintana-Ortí:
Accelerating urban scale simulations leveraging local spatial 3D structure. J. Comput. Sci. 62: 101741 (2022) - [j159]Sergio Barrachina, Manuel F. Dolz, Pablo San Juan, Enrique S. Quintana-Ortí:
Efficient and portable GEMM-based convolution operators for deep neural network training on multicore processors. J. Parallel Distributed Comput. 167: 240-254 (2022) - [j158]Adrián Castelló, Sergio Barrachina, Manuel F. Dolz, Enrique S. Quintana-Ortí, Pau San Juan, Andrés E. Tomás:
High performance and energy efficient inference for deep learning on multicore ARM processors using general optimization techniques and BLIS. J. Syst. Archit. 125: 102459 (2022) - [j157]Cristián Ramírez, Adrián Castelló, Enrique S. Quintana-Ortí:
A BLIS-like matrix multiplication for machine learning in the RISC-V ISA-based GAP8 processor. J. Supercomput. 78(16): 18051-18060 (2022) - [j156]Hartwig Anzt, Terry Cojean, Goran Flegar, Fritz Göbel, Thomas Grützmacher, Pratik Nayak, Tobias Ribizel, Yuhsiang Mike Tsai, Enrique S. Quintana-Ortí:
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing. ACM Trans. Math. Softw. 48(1): 2:1-2:33 (2022) - [c186]Andrea Biagioni, Paolo Cretaro, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Pier Stanislao Paolucci, Elena Pastorelli, Francesco Simula, Matteo Turisini, Piero Vicini, Roberto Ammendola, Pascale Bernier-Bruna, Claire Chen, Said Derradji, Stéphane Guez, Pierre-Axel Lagadec, Gregoire Pichon, Etienne Walter, Gaetan De Gassowski, Matthieu Hautreaux, Stephane Mathieu, Gilles Moreau, Marc Pérache, Hugo Taboada, Torsten Hoefler, Timo Schneider, Matteo Barnaba, Giuseppe Piero Brandino, Francesco De Giorgi, Matteo Poggi, Iakovos Mavroidis, Yannis Papaefstathiou, Nikolaos Tampouratzis, Benjamin Kalisch, Ulrich Krackhardt, Mondrian Nuessle, Pantelis Xirouchakis, Vangelis Mageiropoulos, Michalis Gianioudis, Harisis Loukas, Aggelos Ioannou, Nikos Kallimanis, Nikos Chrysos, Manolis Katevenis, Wolfgang Frings, Dominik Gottwald, Felime Guimaraes, Max Holicki, Volker Marx, Yannik Müller, Carsten Clauss, Hugo Falter, Xu Huang, Jennifer Lopez Barillao, Thomas Moschny, Simon Pickartz, Francisco J. Alfaro, Jesús Escudero-Sahuquillo, Pedro Javier García, Francisco J. Quiles, José L. Sánchez, Adrián Castelló, Jose Duro, María Engracia Gómez, Enrique S. Quintana-Ortí, Julio Sahuquillo, Eugenio Stabile:
RED-SEA: Network Solution for Exascale Architectures. DSD 2022: 712-719 - [c185]Manuel F. Dolz, Adrián Castelló, Enrique S. Quintana-Ortí:
Towards Portable Realizations of Winograd-based Convolution with Vector Intrinsics and OpenMP. PDP 2022: 39-46 - [c184]Adrián Castelló, Enrique S. Quintana-Ortí, Francisco D. Igual:
Anatomy of the BLIS Family of Algorithms for Matrix Multiplication. PDP 2022: 92-99 - [c183]Pedro Alonso-Jordá, Héctor Martínez, Enrique S. Quintana-Ortí, Cristián Ramírez:
Performance Analysis of Convolution Algorithms for Deep Learning on Edge Processors. PPAM (2) 2022: 236-247 - [c182]Sandra Catalán, Francisco D. Igual, Rafael Rodríguez-Sánchez, José R. Herrero, Enrique S. Quintana-Ortí:
NUMA-Aware Dense Matrix Factorizations and Inversion with Look-Ahead on Multicore Processors. SBAC-PAD 2022: 91-99 - [c181]Manuel F. Dolz, Héctor Martínez, Pedro Alonso, Enrique S. Quintana-Ortí:
Convolution Operators for Deep Learning Inference on the Fujitsu A64FX Processor. SBAC-PAD 2022: 160-169 - [c180]Cristián Ramírez, Adrián Castelló, Héctor Martínez, Enrique S. Quintana-Ortí:
Performance Analysis of Matrix Multiplication for Deep Learning on the Edge. ISC Workshops 2022: 65-76 - [c179]Adrián Castelló, Sandra Catalán, Francisco D. Igual, Enrique S. Quintana-Ortí, Rafael Rodríguez-Sánchez:
QR Factorization Using Malleable BLAS on Multicore Processors. ISC Workshops 2022: 176-189 - [i21]Jorge Ejarque, Rosa M. Badia, Loïc Albertin, Giovanni Aloisio, Enrico Baglione, Yolanda Becerra, Stefan Boschert, Julian R. Berlin, Alessandro D'Anca, Donatello Elia, François Exertier, Sandro Fiore, José Flich, Arnau Folch, Steven J. Gibbons, Nikolay Koldunov, Francesc Lordan, Stefano Lorito, Finn Løvholt, Jorge Macías Sánchez, Fabrizio Marozzo, Alberto Michelini, Marisol Monterrubio Velasco, Marta Pienkowska, Josep de la Puente, Anna Queralt, Enrique S. Quintana-Ortí, Juan Esteban Rodriguez, Fabrizio Romano, Riccardo Rossi, Jedrzej Rybicki, Miroslaw Kupczyk, Jacopo Selva, Domenico Talia, Roberto Tonini, Paolo Trunfio, Manuela Volpe:
Enabling Dynamic and Intelligent Workflows for HPC, Data Analytics, and AI Convergence. CoRR abs/2204.09287 (2022) - 2021
- [j155]Adrián Castelló, Enrique S. Quintana-Ortí, José Duato:
Accelerating distributed deep neural network training with pipelined MPI allreduce. Clust. Comput. 24(4): 3797-3813 (2021) - [j154]Pedro Alonso-Jordá, Davor Davidovic, Marin Sapunar, José R. Herrero, Enrique S. Quintana-Ortí:
Efficient update of determinants for many-electron wave function overlaps. Comput. Phys. Commun. 258: 107521 (2021) - [j153]Peter Benner, Enrique S. Quintana-Ortí, Jens Saak:
Introduction to the Special Issue related to the Power-Aware Computing Workshop 2019 - PACO 2019. Int. J. High Perform. Comput. Appl. 35(3) (2021) - [j152]Ernesto Dufrechou, Pablo Ezzatti, Enrique S. Quintana-Ortí:
Selecting optimal SpMV realizations for GPUs via machine learning. Int. J. High Perform. Comput. Appl. 35(3) (2021) - [j151]Ernesto Dufrechou, Pablo Ezzatti, Manuel Freire, Enrique S. Quintana-Ortí:
Machine learning for optimal selection of sparse triangular system solvers on GPUs. J. Parallel Distributed Comput. 158: 47-55 (2021) - [j150]Sergio Iserte, Rafael Mayo, Enrique S. Quintana-Ortí, Antonio J. Peña:
DMRlib: Easy-Coding and Efficient Resource Management for Job Malleability. IEEE Trans. Computers 70(9): 1443-1457 (2021) - [j149]Jose A. Belloch, José M. Badía, Diego Francisco Larios Marín, Enrique Personal, Miguel Ferrer, Laura Fuster, Mihaita Lupoiu, Alberto González, Carlos León, Antonio M. Vidal, Enrique S. Quintana-Ortí:
On the performance of a GPU-based SoC in a distributed spatial audio system. J. Supercomput. 77(7): 6920-6935 (2021) - [j148]Peter Benner, Ernesto Dufrechou, Pablo Ezzatti, Rodrigo Gallardo, Enrique S. Quintana-Ortí:
Factorized solution of generalized stable Sylvester equations using many-core GPU accelerators. J. Supercomput. 77(9): 10152-10164 (2021) - [j147]Pablo San Juan, Rafael Rodríguez-Sánchez, Francisco D. Igual, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí:
Low precision matrix multiplication for efficient deep learning in NVIDIA Carmel processors. J. Supercomput. 77(10): 11257-11269 (2021) - [j146]Goran Flegar, Hartwig Anzt, Terry Cojean, Enrique S. Quintana-Ortí:
Adaptive Precision Block-Jacobi for High Performance Preconditioning in the Ginkgo Linear Algebra Software. ACM Trans. Math. Softw. 47(2): 14:1-14:28 (2021) - [c178]Sandra Catalán, Francisco D. Igual, Rafael Rodríguez-Sánchez, Enrique S. Quintana-Ortí:
Scalable Hybrid Loop- and Task-Parallel Matrix Inversion for Multicore Processors. IPDPS Workshops 2021: 679-687 - [c177]Adrián Castelló, Mar Catalán, Manuel F. Dolz, José I. Mestre, Enrique S. Quintana-Ortí, José Duato:
Performance Modeling for Distributed Training of Convolutional Neural Networks. PDP 2021: 99-108 - [c176]Adrián Castelló, Mar Catalán, Manuel F. Dolz, José I. Mestre, Enrique S. Quintana-Ortí, José Duato:
Evaluation of MPI Allreduce for Distributed Training of Convolutional Neural Networks. PDP 2021: 109-116 - [c175]Pau San Juan, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí:
High Performance and Energy Efficient Integer Matrix Multiplication for Deep Learning. PDP 2021: 122-125 - [c174]Sandra Catalán, Francisco D. Igual, Rafael Rodríguez-Sánchez, José R. Herrero, Enrique S. Quintana-Ortí:
A New Generation of Task-Parallel Algorithms for Matrix Inversion in Many-Threaded CPUs. PMAM@PPoPP 2021: 1-10 - [i20]Adrián Castelló, Sergio Barrachina, Manuel F. Dolz, Enrique S. Quintana-Ortí, Pau San Juan:
High performance and energy efficient inference for deep learning on ARM processors. CoRR abs/2105.09187 (2021) - 2020
- [j145]Sandra Catalán, Adrián Castelló, Francisco D. Igual, Rafael Rodríguez-Sánchez, Enrique S. Quintana-Ortí:
Programming parallel dense matrix factorizations with look-ahead and OpenMP. Clust. Comput. 23(1): 359-375 (2020) - [j144]Roman Iakymchuk, María Barreda Vayá, Stef Graillat, José Ignacio Aliaga, Enrique S. Quintana-Ortí:
Reproducibility of parallel preconditioned conjugate gradient in hybrid programming environments. Int. J. High Perform. Comput. Appl. 34(5) (2020) - [j143]Roman Iakymchuk, Maria Barreda, Matthias Wiesenberger, José Ignacio Aliaga, Enrique S. Quintana-Ortí:
Reproducibility strategies for parallel Preconditioned Conjugate Gradient. J. Comput. Appl. Math. 371: 112697 (2020) - [j142]Adrián Castelló, Rafael Mayo Gual, Sangmin Seo, Pavan Balaji, Enrique S. Quintana-Ortí, Antonio J. Peña:
Analysis of Threading Libraries for High Performance Computing. IEEE Trans. Computers 69(9): 1279-1292 (2020) - [j141]Rafael Rodríguez-Sánchez, Francisco D. Igual, Enrique S. Quintana-Ortí:
Integration and exploitation of intra-routine malleability in BLIS. J. Supercomput. 76(4): 2860-2875 (2020) - [j140]Andrés E. Tomás, Enrique S. Quintana-Ortí:
Tall-and-skinny QR factorization with approximate Householder reflectors on graphics processors. J. Supercomput. 76(11): 8771-8786 (2020) - [j139]Maria Barreda, Manuel F. Dolz, M. Asunción Castaño, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí:
Performance modeling of the sparse matrix-vector product via convolutional neural networks. J. Supercomput. 76(11): 8883-8900 (2020) - [j138]Thomas Grützmacher, Terry Cojean, Goran Flegar, Hartwig Anzt, Enrique S. Quintana-Ortí:
Acceleration of PageRank with Customized Precision Based on Mantissa Segmentation. ACM Trans. Parallel Comput. 7(1): 4:1-4:19 (2020) - [c173]José Ignacio Aliaga, Hartwig Anzt, Enrique S. Quintana-Ortí, Andrés E. Tomás, Yuhsiang M. Tsai:
Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs. Euro-Par Workshops 2020: 83-95 - [c172]Fritz Göbel, Hartwig Anzt, Terry Cojean, Goran Flegar, Enrique S. Quintana-Ortí:
Multiprecision Block-Jacobi for Iterative Triangular Solves. Euro-Par 2020: 546-560 - [c171]Rocío Carratalá-Sáez, Mathieu Faverge, Grégoire Pichon, Guillaume Sylvand, Enrique S. Quintana-Ortí:
Tiled Algorithms for Efficient Task-Parallel ℌ-Matrix Solvers. IPDPS Workshops 2020: 757-766 - [c170]Pablo San Juan, Adrián Castelló, Manuel F. Dolz, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí:
High Performance and Portable Convolution Operators for Multicore Processors. SBAC-PAD 2020: 91-98 - [i19]Sergio Iserte, Rafael Mayo, Enrique S. Quintana-Ortí, Vicenç Beltran, Antonio J. Peña:
DMR API: Improving cluster productivity by turning applications into malleable. CoRR abs/2005.05910 (2020) - [i18]Pablo San Juan, Adrián Castelló, Manuel F. Dolz, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí:
High Performance and Portable Convolution Operators for ARM-based Multicore Processors. CoRR abs/2005.06410 (2020) - [i17]Roman Iakymchuk, Maria Barreda, Stef Graillat, José Ignacio Aliaga, Enrique S. Quintana-Ortí:
Reproducibility of Parallel Preconditioned Conjugate Gradient in Hybrid Programming Environments. CoRR abs/2005.07282 (2020) - [i16]Hartwig Anzt, Terry Cojean, Goran Flegar, Fritz Göbel, Thomas Grützmacher, Pratik Nayak, Tobias Ribizel, Yu-Hsiang Tsai, Enrique S. Quintana-Ortí:
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing. CoRR abs/2006.16852 (2020) - [i15]José Ignacio Aliaga, Hartwig Anzt, Thomas Grützmacher, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Compressed Basis GMRES on High Performance GPUs. CoRR abs/2009.12101 (2020) - [i14]Emmanuel Agullo, Mirco Altenbernd, Hartwig Anzt, Leonardo Bautista-Gomez, Tommaso Benacchio, Luca Bonaventura, Hans-Joachim Bungartz, Sanjay Chatterjee, Florina M. Ciorba, Nathan DeBardeleben, Daniel Drzisga, Sebastian Eibl, Christian Engelmann, Wilfried N. Gansterer, Luc Giraud, Dominik Göddeke, Marco Heisig, Fabienne Jézéquel, Nils Kohl, Xiaoye Sherry Li, Romain Lion, Miriam Mehl, Paul Mycek, Michael Obersteiner, Enrique S. Quintana-Ortí, Francesco Rizzi, Ulrich Rüde, Martin Schulz, Fred Fung, Robert Speck, Linda Stals, Keita Teranishi, Samuel Thibault, Dominik Thönnes, Andreas Wagner, Barbara I. Wohlmuth:
Resiliency in Numerical Algorithm Design for Extreme Scale Simulations. CoRR abs/2010.13342 (2020)
2010 – 2019
- 2019
- [j137]Sandra Catalán, José R. Herrero, Enrique S. Quintana-Ortí, Rafael Rodríguez-Sánchez, Robert A. van de Geijn:
A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization With Partial Pivoting. IEEE Access 7: 17617-17633 (2019) - [j136]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Nicholas J. Higham, Enrique S. Quintana-Ortí:
Adaptive precision in block-Jacobi preconditioning for iterative sparse linear system solvers. Concurr. Comput. Pract. Exp. 31(6) (2019) - [j135]Pablo Ezzatti, Enrique S. Quintana-Ortí, Alfredo Remón, Jens Saak:
Power-aware computing. Concurr. Comput. Pract. Exp. 31(6) (2019) - [j134]Roman Iakymchuk, Stef Graillat, David Defour, Enrique S. Quintana-Ortí:
Hierarchical approach for deriving a reproducible unblocked LU factorization. Int. J. High Perform. Comput. Appl. 33(5) (2019) - [j133]Hartwig Anzt, Goran Flegar, Thomas Grützmacher, Enrique S. Quintana-Ortí:
Toward a modular precision ecosystem for high-performance computing. Int. J. High Perform. Comput. Appl. 33(6) (2019) - [j132]Rocío Carratalá-Sáez, Sven Christophersen, José Ignacio Aliaga, Vicenç Beltran, Steffen Börm, Enrique S. Quintana-Ortí:
Exploiting nested task-parallelism in the H-LU factorization. J. Comput. Sci. 33: 20-33 (2019) - [j131]Rocío Carratalá-Sáez, Sven Christophersen, José Ignacio Aliaga, Vicenç Beltran, Steffen Börm, Enrique S. Quintana-Ortí:
Erratum to "Exploiting nested task-parallelism in theH-LU factorization" [J. Comput. Sci. 33 (2019) 20-33]. J. Comput. Sci. 35: 110 (2019) - [j130]Hartwig Anzt, Jack J. Dongarra, Enrique S. Quintana-Ortí:
Fine-grained bit-flip protection for relaxation methods. J. Comput. Sci. 36 (2019) - [j129]Rafael Rodríguez-Sánchez, Sandra Catalán, José R. Herrero, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD. Numer. Algorithms 80(2): 635-660 (2019) - [j128]Andrés E. Tomás, Rafael Rodríguez-Sánchez, Sandra Catalán, Rocío Carratalá-Sáez, Enrique S. Quintana-Ortí:
Dynamic look-ahead in the reduction to band form for the singular value decomposition. Parallel Comput. 81: 22-31 (2019) - [j127]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí:
Variable-size batched Gauss-Jordan elimination for block-Jacobi preconditioning on graphics processors. Parallel Comput. 81: 131-146 (2019) - [j126]José Ignacio Aliaga, Ernesto Dufrechou, Pablo Ezzatti, Enrique S. Quintana-Ortí:
Accelerating the task/data-parallel version of ILUPACK's BiCG in multi-CPU/GPU configurations. Parallel Comput. 85: 79-87 (2019) - [j125]