


default search action
IEEE Transactions on Parallel and Distributed Systems, Volume 36
Volume 36, Number 1, January 2025
- Junqiang Jiang
, Zhifang Sun
, Ruiqi Lu
, Li Pan
, Zebo Peng
:
Real Relative Encoding Genetic Algorithm for Workflow Scheduling in Heterogeneous Distributed Computing Systems. 1-14 - Sanaz Rabinia
, Niloofar Didar
, Marco Brocanelli
, Daniel Grosu
:
Algorithms for Data Sharing-Aware Task Allocation in Edge Computing Systems. 15-28 - Qiang He
, Guobiao Zhang, Jiawei Wang, Ruikun Luo
, Xiaohai Dai
, Yuchong Hu
, Feifei Chen
, Hai Jin
, Yun Yang
:
EdgeHydra: Fault-Tolerant Edge Data Distribution Based on Erasure Coding. 29-42 - Jonatha Anselmi
, Josu Doncel
:
Balanced Splitting: A Framework for Achieving Zero-Wait in the Multiserver-Job Model. 43-54 - Ruikun Luo
, Qiang He
, Feifei Chen
, Song Wu
, Hai Jin
, Yun Yang
:
Ripple: Enabling Decentralized Data Deduplication at the Edge. 55-66 - Haoyu Liao
, Tong-Yu Liu
, Jianmei Guo
, Bo Huang
, Dingyu Yang
, Jonathan Ding:
Retrospecting Available CPU Resources: SMT-Aware Scheduling to Prevent SLA Violations in Data Centers. 67-83 - Ruikun Luo
, Qiang He
, Mengxi Xu, Feifei Chen
, Song Wu
, Jing Yang
, Yuan Gao
, Hai Jin
:
Edge Data Deduplication Under Uncertainties: A Robust Optimization Approach. 84-95 - Guillaume Raffin
, Denis Trystram
:
Dissecting the Software-Based Measurement of CPU Energy Consumption: A Comparative Analysis. 96-107
Volume 36, Number 2, February 2025
- Hariharan Devarajan
, Gerd Heber
, Kathryn M. Mohror
:
H5Intent: Autotuning HDF5 With User Intent. 108-119 - Diletta Olliaro
, Adityo Anggraito
, Marco Ajmone Marsan
, Simonetta Balsamo
, Andrea Marin
:
The Impact of Service Demand Variability on Data Center Performance. 120-132 - Shuai Lin, Rui Wang
, Yongkun Li
, Yinlong Xu
, John C. S. Lui
:
Two-Dimensional Balanced Partitioning and Efficient Caching for Distributed Graph Analysis. 133-149 - Zhi Ling
, Xiaofeng Jiang
, Xiaobin Tan
, Huasen He
, Shiyin Zhu, Jian Yang
:
Joint Dynamic Data and Model Parallelism for Distributed Training of DNNs Over Heterogeneous Infrastructure. 150-167 - Diandian Gu
, Yihao Zhao
, Peng Sun, Xin Jin
, Xuanzhe Liu
:
GreenFlow: A Carbon-Efficient Scheduler for Deep Learning Workloads. 168-184 - Pengwei Wang
, Junye Qiao, Yuying Zhao, Zhijun Ding
:
Cost-Effective and Low-Latency Data Placement in Edge Environment Based on PageRank-Inspired Regional Value. 185-196 - Xiaodong Dong
, Lihai Nie
, Zheli Liu
, Yang Xiang
:
Slark: A Performance Robust Decentralized Inter-Datacenter Deadline-Aware Coflows Scheduling Framework With Local Information. 197-211 - Jialiang Han
, Yudong Han
, Xiang Jing, Gang Huang
, Yun Ma
:
DegaFL: Decentralized Gradient Aggregation for Cross-Silo Federated Learning. 212-225 - Zhongyi Lin
, Ning Sun, Pallab Bhattacharya
, Xizhou Feng, Louis Feng, John D. Owens
:
Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms. 226-238 - Zhangrong Qin
, Xusheng Lu, Long Lv
, Zhongxiang Tang
, Binghai Wen
:
An Efficient GPU Algorithm for Lattice Boltzmann Method on Sparse Complex Geometries. 239-252 - J. Gregory Pauloski
, Valérie Hayot-Sasson
, Logan T. Ward
, Alexander Brace, André Bauer, Kyle Chard
, Ian T. Foster
:
Object Proxy Patterns for Accelerating Distributed Applications. 253-265 - Changyao Lin
, Zhenming Chen, Ziyang Zhang
, Jie Liu
:
TOP: Task-Based Operator Parallelism for Asynchronous Deep Learning Inference on GPU. 266-281 - Jing Hou
, Guang Chen
, Ruiqi Zhang
, Zhijun Li
, Shangding Gu, Changjun Jiang
:
Spreeze: High-Throughput Parallel Reinforcement Learning Framework. 282-292 - Guangyao Zhou
, Wenhong Tian
, Rajkumar Buyya
, Kui Wu
:
UMPIPE: Unequal Microbatches-Based Pipeline Parallelism for Deep Neural Network Training. 293-307 - Yuyang Jin
, Haojie Wang
, Xiongchao Tang, Zhenhua Guo, Yaqian Zhao, Torsten Hoefler
, Tao Liu
, Xu Liu, Jidong Zhai
:
Leveraging Graph Analysis to Pinpoint Root Causes of Scalability Issues for Parallel Applications. 308-325 - Giacomo Valente
, Gianluca Brilli
, Tania Di Mascio
, Alessandro Capotondi
, Paolo Burgio
, Paolo Valente
, Andrea Marongiu
:
Fine-Grained QoS Control via Tightly-Coupled Bandwidth Monitoring and Regulation for FPGA-Based Heterogeneous SoCs. 326-340
Volume 36, Number 3, March 2025
- Shuaibing Lu
, Ran Yan, Jie Wu
, Jackson Yang, Xinyu Deng, Shen Wu
, Zhi Cai
, Juan Fang
:
Online Elastic Resource Provisioning With QoS Guarantee in Container-Based Cloud Computing. 361-376 - Junyuan Liang
, Peiyuan Yao
, Wuhui Chen
, Zicong Hong
, Jianting Zhang
, Ting Cai
, Min Sun, Zibin Zheng
:
Sparrow: Expediting Smart Contract Execution for Blockchain Sharding via Inter-Shard Caching. 377-390 - Jialun Li
, Danyang Xiao
, Diying Yang
, Xuan Mo
, Weigang Wu
:
Integrated and Fungible Scheduling of Deep Learning Workloads Using Multi-Agent Reinforcement Learning. 391-406 - Saiman Dahal
, Pratyush Dhingra
, Krishu K. Thapa, Partha Pratim Pande
, Ananth Kalyanaraman
:
HpT: Hybrid Acceleration of Spatio-Temporal Attention Model Training on Heterogeneous Manycore Architectures. 407-421 - Yuhan Leng, Gaoyuan Zou, Hansheng Wang
, Panruo Wu
, Shaoshuai Zhang
:
High Performance Householder QR Factorization on Emerging GPU Architectures Using Tensor Cores. 422-436 - Lizhen Zhou
, Zichuan Xu
, Qiufen Xia
, Zhou Xu
, Wenhao Ren
, Wenbo Qi, Jinjing Ma, Song Yan, Yuan Yang:
Chasing Common Knowledge: Joint Large Model Selection and Pulling in MEC With Parameter Sharing. 437-454 - Binqi Sun
, Tomasz Kloda
, Jiyang Chen, Cen Lu, Marco Caccamo
:
Response Time Analysis and Optimal Priority Assignment for Global Non-Preemptive Fixed-Priority Rigid Gang Scheduling. 455-470 - Ziqu Yu
, Jinyu Gu
, Zijian Wu, Nian Liu, Jian Guo:
HTLL: Latency-Aware Scalable Blocking Mutex. 471-486 - Haining Yang
, Dengguo Feng, Jing Qin
:
Towards Efficient Verifiable Cloud Storage and Distribution for Large-Scale Data Streaming. 487-501 - Hongkuan Zhou
, Bingyi Zhang
, Rajgopal Kannan, Carl E. Busart
, Viktor K. Prasanna
:
ViTeGNN: Towards Versatile Inference of Temporal Graph Neural Networks on FPGA. 502-519 - Wenhan Xu
, Hui Ma
, Rui Zhang
, Jianhao Li
:
$ \mathsf{GPABE} $GPABE: GPU-Based Parallelization Framework for Attribute-Based Encryption Schemes. 520-536
Volume 36, Number 4, April 2025
- Junhee Ryu
, Dongeun Lee
, Kang G. Shin
, Kyungtae Kang
:
Paralfetch: Fast Application Launch on Personal Computing/Communication Devices. 616-632 - Yi Chen
, Qiang-Sheng Hua
, Zixiao Hong
, Lin Zhu, Hai Jin
:
FHE4DMM: A Low-Latency Distributed Matrix Multiplication With Fully Homomorphic Encryption. 645-658 - Zhengjun Cao
:
A Note on "AESM2 Attribute-Based Encrypted Search for Multi-Owner and Multi-User Distributed Systems". 675-676 - Yan Zeng
, Chengchuang Huang
, Yipeng Mei
, Lifu Zhang
, Teng Su
, Wei Ye
, Wenqi Shi, Shengnan Wang
:
EfficientMoE: Optimizing Mixture-of-Experts Model Training With Adaptive Load Balance. 677-688 - Luiz Gustavo Coutinho Xavier
, Cristina Meinhardt
, Odorico Machado Mendizabal
:
Beelog: Online Log Compaction for Dependable Systems. 689-700
Volume 36, Number 5, May 2025
- Omer F. Rana
, Josef Spillner
, Stephen Leak, Gerald F. Lofstead II, Rafael Tolosana-Calasanz
:
Guest Editorial:Special Section on SC22 Student Cluster Competition. 803 - Alexandros Nikolaos Ziogas
, Timo Schneider, Tal Ben-Nun, Alexandru Calotoiu, Tiziano De Matteis, Johannes de Fine Licht, Luca Lavarini, Torsten Hoefler:
Productivity, Portability, Performance, and Reproducibility: Data-Centric Python. 804-820 - Fu-Chiang Chang
, En-Ming Huang
, Pin-Yi Kuo, Chan-Yu Mou
, Hsu-Tzu Ting, Pang-Ning Wu, Jerry Chou
:
Reproducing Performance of Data-Centric Python by SCC Team From National Tsing Hua University. 821-825 - Zihan Yang
, Yi Chen
, Kaiqi Chen
, Xingjian Qian
, Shaojun Xu
, Yun Pan
, Chong Zeng
, Jianhai Chen
, Yin Zhang
, Zeke Wang
:
Critique of "Productivity, Portability, Performance: Data-Centric Python" by SCC Team From Zhejiang University. 826-829 - Han Huang
, Tengyang Zheng
, Tianxing Yang
, Yang Ye
, Siran Liu
, Zhe Tang
, Shengyou Lu
, Guangnan Feng
, Zhiguang Chen
, Dan Huang
:
Critique of "Productivity, Portability, Performance Data-Centric Python" by SCC Team From Sun Yat-sen University. 830-834 - Christopher Lompa
, Piotr Luczynski
:
Analysis and Reproducibility of "Productivity, Portability, Performance: Data-Centric Python". 835-840 - Anish Govind
, Yuchen Jing
, Stefanie Dao
, Michael Granado
, Rachel Handran
, Davit Margarian
, Matthew Mikhailov
, Danny Vo
, Matei-Alexandru Gardus
, Khai Vu
, Derek Bouius, Bryan Chin
, Mahidhar Tatineni
, Mary P. Thomas
:
Reproducibility of the DaCe Framework on NPBench Benchmarks. 841-846 - Yuan Gao
, Liquan Chen
, Jianchang Lai
, Tianyi Wang
, Xiaoming Wu
, Shui Yu
:
IoT-Dedup: Device Relationship-Based IoT Data Deduplication Scheme. 847-860 - Conor John Williams
, James Elliott:
Libfork: Portable Continuation-Stealing With Stackless Coroutines. 877-888 - Keyun Cheng
, Huancheng Puyang, Xiaolu Li
, Patrick P. C. Lee
, Yuchong Hu
, Jie Li
, Ting-Yi Wu:
Toward Load-Balanced Redundancy Transitioning for Erasure-Coded Storage. 889-902 - Junhan Liu
, Zinuo Cai
, Yumou Liu, Hao Li
, Zongpu Zhang
, Ruhui Ma
, Rajkumar Buyya
:
SMore: Enhancing GPU Utilization in Deep Learning Clusters by Serverless-Based Co-Location Scheduling. 903-917 - Hyeonjin Kim
, Taesoo Lim
, William J. Song
:
Graphite: Hardware-Aware GNN Reshaping for Acceleration With GPU Tensor Cores. 918-931 - S. M. Shovan
, Arindam Khanda
, Sajal K. Das
:
Parallel Multi Objective Shortest Path Update Algorithm in Large Dynamic Networks. 932-944 - Xiangyu Zou
, Wen Xia
, Philip Shilane
, Haijun Zhang
, Xuan Wang
:
The Design of a High-Performance Fine-Grained Deduplication Framework for Backup Storage. 945-960 - Qiange Wang
, Xin Ai, Yongze Yan
, Shufeng Gong
, Yanfeng Zhang
, Jing Chen
, Ge Yu
:
Towards Communication-Efficient Out-of-Core Graph Processing on the GPU. 961-976 - Huijing Yang
, Juan Fang
, Yumin Hou
, Xing Su
, Neal N. Xiong
:
Reinforcement Learning-Driven Adaptive Prefetch Aggressiveness Control for Enhanced Performance in Parallel System Architectures. 977-993 - Zerui Shao
, Beibei Li
, Peiran Wang
, Yi Zhang
, Kim-Kwang Raymond Choo
:
FedLoRE: Communication-Efficient and Personalized Edge Intelligence Framework via Federated Low-Rank Estimation. 994-1010 - Jingweijia Tan
, Xurui Li, An Zhong, Kaige Yan
, Xiaohui Wei
, Guanpeng Li
:
GEREM: Fast and Precise Error Resilience Assessment for GPU Microarchitectures. 1011-1024 - Jan Laukemann
, Ahmed E. Helal
, S. Isaac Geronimo Anderson, Fabio Checconi, Yongseok Soh, Jesmin Jahan Tithi
, Teresa M. Ranadive
, Brian J. Gravelle, Fabrizio Petrini, Jee W. Choi:
Accelerating Sparse Tensor Decomposition Using Adaptive Linearized Representation. 1025-1041 - Weihan Kong
, Shengan Zheng
, Yifan Hua
, Ruoyan Ma, Yuheng Wen
, Guifeng Wang
, Cong Zhou
, Linpeng Huang
:
PimBeam: Efficient Regular Path Queries Over Graph Database Using Processing-in-Memory. 1042-1057 - Zhaochen Zhang
, Xu Zhang
, Zhaoxiang Bao
, Liang Wei, Chaohong Tan, Wanchun Dou
, Guihai Chen
, Chen Tian
:
Courier: A Unified Communication Agent to Support Concurrent Flow Scheduling in Cluster Computing. 861-876
Volume 36, Number 6, June 2025
- Yuan Yao
, Yujiao Hu
, Yi Dang, Wei Tao, Kai Hu, Qiming Huang, Zhe Peng
, Gang Yang
, Xingshe Zhou:
Workload-Aware Performance Model Based Soft Preemptive Real-Time Scheduling for Neural Processing Units. 1058-1070 - Wei Gao
, Zhuoyuan Ouyang
, Peng Sun
, Tianwei Zhang
, Yonggang Wen
:
IceFrog: A Layer-Elastic Scheduling System for Deep Learning Training in GPU Clusters. 1071-1086 - Wenting Wei
, Huaxi Gu
, Zhe Xiao, Yi Chen:
Energy Efficient and Multi-Resource Optimization for Virtual Machine Placement by Improving MOEA/D. 1087-1099 - Wenming Li
, Zhihua Fan
, Tianyu Liu
, Zhen Wang
, Haibin Wu
, Meng Wu
, Kunming Zhang, Yanhuan Liu
, Ninghui Sun
, Xiaochun Ye
, Dongrui Fan
:
DFU-E: A Dataflow Architecture for Edge DSP and AI Applications. 1100-1114 - Yifeng Tang
, Huaman Zhou
, Zhuoran Ji
, Cho-Li Wang:
Cube-fx: Mapping Taylor Expansion Onto Matrix Multiplier-Accumulators of Huawei Ascend AI Processors. 1115-1129 - William Andrew Simon
, Irem Boybat
, Riselda Kodra
, Elena Ferro
, Gagandeep Singh
, Mohammed Alser
, Shubham Jain
, Hsinyu Tsai
, Geoffrey W. Burr
, Onur Mutlu
, Abu Sebastian
:
CiMBA: Accelerating Genome Sequencing Through On-Device Basecalling via Compute-in-Memory. 1130-1145 - Wei Zhang
, Yunlong Yu
, Xiao Jiang, Nan Guan
, Naijun Zhan
, Lei Ju
:
WCET Estimation for CNN Inference on FPGA SoC With Multi-DPU Engines. 1146-1160 - Haobin Tan
, Yao Xiao
, Amelie Chi Zhou
, Kezhong Lu
, Xuan Yang
:
Distributed and Adaptive Partitioning for Large Graphs in Geo-Distributed Data Centers. 1161-1174 - Kumseok Jung
, Julien Gascon-Samson
, Sathish Gopalakrishnan
, Karthik Pattabiraman
:
OneOS: Distributed Operating System for the Edge-to-Cloud Continuum. 1175-1192 - Luca Colagrande
, Luca Benini
:
Taming Offload Overheads in a Massively Parallel Open-Source RISC-V MPSoC: Analysis and Optimization. 1193-1205

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.