


ACM Transactions on Parallel Computing, Volume 11
Volume 11, Number 1, March 2024
- Anne Benoit, Lucas Perotin, Yves Robert, Frédéric Vivien: Checkpointing Strategies to Tolerate Non-Memoryless Failures on HPC Platforms. 1:1-1:26
- Lucas Perotin, Hongyang Sun: Improved Online Scheduling of Moldable Task Graphs under Common Speedup Models. 2:1-2:31
- Shengle Lin, Wangdong Yang, Yikun Hu, Qinyun Cai, Minlu Dai, Haotian Wang, Kenli Li: HPS Cholesky: Hierarchical Parallelized Supernodal Cholesky with Adaptive Parameters. 3:1-3:22
- Romolo Marotta, Mauro Ianni, Alessandro Pellegrini, Francesco Quaglia: A Conflict-Resilient Lock-Free Linearizable Calendar Queue. 4:1-4:32
- Stefan K. Muller, Jan Hoffmann: Modeling and Analyzing Evaluation Cost of CUDA Kernels. 5:1-5:53
- Qinyun Cai, Guoqing Xiao, Shengle Lin, Wangdong Yang, Keqin Li, Kenli Li: ABSS: An Adaptive Batch-Stream Scheduling Module for Dynamic Task Parallelism on Chiplet-based Multi-Chip Systems. 6:1-6:24
Volume 11, Number 2, June 2024
- Qiang Fu, Yuede Ji, Thomas B. Rolinger, H. Howie Huang: TLPGNN: A Lightweight Two-level Parallelism Paradigm for Graph Neural Network Computation on Single and Multiple GPUs. 7
- Zixuan Li, Yunchuan Qin, Qi Xiao, Wangdong Yang, Kenli Li: cuFasterTucker: A Stochastic Optimization Strategy for Parallel Sparse FastTucker Decomposition on GPU Platform. 8
- Sébastien Darche, Michel R. Dagenais: Low-Overhead Trace Collection and Profiling on GPU Compute Kernels. 9
- Ziyang Li, Dongsheng Li, Yingwen Chen, Kai Chen, Yiming Zhang: Decentralized Scheduling for Data-Parallel Tasks in the Cloud. 10
- Guoqing Xiao, Tao Zhou, Yuedan Chen, Yikun Hu, Kenli Li: Machine Learning-Based Kernel Selector for SpMV Optimization in Graph Analysis. 11
- Zixuan Li, Yikun Hu, Mengquan Li, Wangdong Yang, Kenli Li: cuFastTucker: A Novel Sparse FastTucker Decomposition For HHLST on Multi-GPUs. 12
Volume 11, Number 3, September 2024
- Yiqian Liu, Noushin Azami, Avery Vanausdal, Martin Burtscher: Indigo3: A Parallel Graph Analytics Benchmark Suite for Exploring Implementation Styles and Common Bugs. 13:1-13:29
- Johan Bontes, James Gain: Redzone stream compaction: removing k items from a list in parallel O(k) time. 14:1-14:16
Volume 11, Number 4, December 2024
- Cu Cui: Acceleration of Tensor-Product Operations with Tensor Cores. 15:1-15:24
- Wim H. Hesselink, Peter A. Buhr, Colby A. Parsons: First-Come-First-Served as a Separate Principle. 16:1-16:20
- Johannes Pahlke, Ivo F. Sbalzarini: Proven Distributed Memory Parallelization of Particle Methods. 17:1-17:45
- Hermann Bogning Tepiele, Vianney Kengne Tchendji, Mathias Akong Onabid, Jean Frédéric Myoupo, Armel Nkonjoh Ngomade: Dominant Point-Based Sequential and Parallel Algorithms for the Multiple Sequential Substring Constrained-LCS Problem. 18:1-18:31
