


default search action
IEEE Transactions on Parallel and Distributed Systems, Volume 35
Volume 35, Number 1, January 2024
- Qiufen Xia

, Zhiwei Jiao
, Zichuan Xu
:
Online Learning Algorithms for Context-Aware Video Caching in D2D Edge Networks. 1-19 - Qingxiao Sun, Yi Liu, Hailong Yang, Zhonghui Jiang, Zhongzhi Luan, Depei Qian:

Adaptive Auto-Tuning Framework for Global Exploration of Stencil Optimization on GPUs. 20-33 - Qixiang Chen

, Zhijun Chen
, Kai Zhang
, X. Sean Wang
:
CLIC: An Extensible and Efficient Cross-Platform Data Analytics System. 34-45 - Yuan Li

, Ahmed Louri
, Avinash Karanth
:
A High-Performance and Energy-Efficient Photonic Architecture for Multi-DNN Acceleration. 46-58 - Fangming Liu

, Yipei Niu
:
Demystifying the Cost of Serverless Computing: Towards a Win-Win Deal. 59-72 - Isam Mashhour Al Jawarneh

, Paolo Bellavista
, Antonio Corradi
, Luca Foschini
, Rebecca Montanari
:
SpatialSSJP: QoS-Aware Adaptive Approximate Stream-Static Spatial Join Processor. 73-88 - Zhe Jiang

, Kecheng Yang
, Nathan Fisher
, Nan Guan
, Neil C. Audsley
, Zheng Dong
:
Hopscotch: A Hardware-Software Co-Design for Efficient Cache Resizing on Multi-Core SoCs. 89-104 - Zichuan Xu

, Guangyuan Xu
, Hao Wang
, Weifa Liang
, Qiufen Xia
, Shangguang Wang
:
Enabling Streaming Analytics in Satellite Edge Computing via Timely Evaluation of Big Data Queries. 105-122 - Yunqi Gao

, Bing Hu
, Mahdi Boloursaz Mashhadi
, A-Long Jin
, Pei Xiao
, Chunming Wu
:
US-Byte: An Efficient Communication Framework for Scheduling Unequal-Sized Tensor Blocks in Distributed Deep Learning. 123-139 - Changlong Li

, Yu Liang
, Liang Shi, Chao Wang
, Chun Jason Xue
, Xuehai Zhou
:
Flexible and Efficient Memory Swapping Across Mobile Devices With LegoSwap. 140-153 - Qiliang Li

, Liangliang Xu
, Yongkun Li
, Min Lyu
, Wei Wang
, Pengfei Zuo
, Yinlong Xu
:
Enabling Efficient Erasure Coding in Disaggregated Memory Systems. 154-168 - Tiangang Li

, Shi Ying
, Yishi Zhao
, Jianga Shang
:
Batch Jobs Load Balancing Scheduling in Cloud Computing Using Distributional Reinforcement Learning. 169-185 - Yaozheng Fang

, Zhiyuan Zhou
, Surong Dai
, Jinni Yang
, Hui Zhang
, Ye Lu
:
PaVM: A Parallel Virtual Machine for Smart Contract Execution and Validation. 186-202
Volume 35, Number 2, February 2024
- Ajay Singh

, Trevor Alexander Brown
, Ali José Mashtizadeh
:
Simple, Fast and Widely Applicable Concurrent Memory Reclamation via Neutralization. 203-220 - Zhiyuan Wang

, Hongli Xu
, Yang Xu
, Zhida Jiang
, Jianchun Liu
, Suo Chen
:
FAST: Enhancing Federated Learning Through Adaptive Data Sampling and Local Training. 221-236 - Gang Zeng

, Jianfeng Zhu
, Yichi Zhang
, Ganhui Chen, Zhenhai Yuan, Shaojun Wei
, Leibo Liu
:
A High-Performance Genomic Accelerator for Accurate Sequence-to-Graph Alignment Using Dynamic Programming Algorithm. 237-249 - Junyan Qian

, Kunzhu Qiu
, Hao Ding
, Huimin Zhang
, Zhongyi Zhai
:
An Efficient Bottleneck Planes Exclusion Method for Reconfiguring 3D VLSI Arrays. 250-263 - Yong Dong

, Yiqin Dai
, Min Xie, Kai Lu
, Ruibo Wang
, Juan Chen, Mingtian Shao, Zheng Wang
:
Faster and Scalable MPI Applications Launching. 264-279 - Jing Wu

, Lin Wang
, Qirui Jin
, Fangming Liu
:
Graft: Efficient Inference Serving for Hybrid Deep Learning With SLO Guarantees via DNN Re-Alignment. 280-296 - Ning Li

, Jianmei Guo
, Bo Huang
, Yuyang Li
, Yilei Zhang
, Chengdong Li, Wenxin Huang:
TCSA: Efficient Localization of Busy-Wait Synchronization Bugs for Latency-Critical Applications. 297-309 - Hao-Rui Chen

, Lei Yang
, Xinglin Zhang
, Jiaxing Shen
, Jiannong Cao
:
Distributed Semi-Supervised Learning With Consensus Consistency on Edge Devices. 310-323 - Zhao Liu

, Xuesen Chu
, Xiaojing Lv
, Hongsong Meng
, Hanyue Liu
, Guanghui Zhu
, Haohuan Fu
, Guangwen Yang
:
SunwayLB: Enabling Extreme-Scale Lattice Boltzmann Method Based Computing Fluid Dynamics Simulations on Advanced Heterogeneous Supercomputers. 324-337 - Jiaqi Yang

, Hao Zheng
, Ahmed Louri
:
Versa-DNN: A Versatile Architecture Enabling High-Performance and Energy-Efficient Multi-DNN Acceleration. 349-361 - Dongyu Zheng

, Lei Liu
, Guoming Tang
, Yi Wang
, Weichao Li
:
Power Demand Reshaping Using Energy Storage for Distributed Edge Clouds. 362-376
Volume 35, Number 3, March 2024
- Di Wu

, Rehmat Ullah
, Philip Rodgers
, Peter Kilpatrick
, Ivor T. A. Spence, Blesson Varghese
:
EcoFed: Efficient Communication for DNN Partitioning-Based Federated Learning. 377-390 - Xing Chen

, Shengxi Hu
, Chujia Yu, Zheyi Chen
, Geyong Min
:
Real-Time Offloading for Dependent and Parallel Tasks in Cloud-Edge Environments Using Deep Reinforcement Learning. 391-404 - Linpeng Jia

, Yanxiu Liu
, Keyuan Wang
, Yi Sun
:
Estuary: A Low Cross-Shard Blockchain Sharding Protocol Based on State Splitting. 405-420 - Daoce Wang

, Jesus Pulido
, Pascal Grosset
, Sian Jin
, Jiannan Tian
, Kai Zhao
, James P. Ahrens
, Dingwen Tao
:
TAC+: Optimizing Error-Bounded Lossy Compression for 3D AMR Simulations. 421-438 - Weiling Yang

, Jianbin Fang
, Dezun Dong
, Xing Su
, Zheng Wang
:
Optimizing Full-Spectrum Matrix Multiplications on ARMv8 Multi-Core CPUs. 439-454 - Guangjing Huang

, Qiong Wu
, Peng Sun
, Qian Ma
, Xu Chen
:
Collaboration in Federated Learning With Differential Privacy: A Stackelberg Game Analysis. 455-469 - Fatemeh Elahi, Mahmood Fazlali

, Hadi Tabatabaee Malazi
, Mehdi Elahi:
Parallel Fractional Stochastic Gradient Descent With Adaptive Learning for Recommender Systems. 470-483 - Vinicius S. da Silva, Everton Camargo de Lima, Janaina Schwarzrock

, Fábio D. Rossi
, Marcelo Caggiani Luizelli
, Antonio Carlos Schneider Beck
, Arthur Francisco Lorenzon
:
Synergistically Rebalancing the EDP of Container-Based Parallel Applications. 484-498 - Jialun Li

, Jieqian Yao
, Danyang Xiao
, Diying Yang
, Weigang Wu
:
EvoGWP: Predicting Long-Term Changes in Cloud Workloads Using Deep Graph-Evolution Learning. 499-516
Volume 35, Number 4, April 2024
- Jian Yang

, Jiantong Jiang
, Zeyi Wen
, Ajmal Mian
:
Parallel and Distributed Bayesian Network Structure Learning. 517-530 - Jie Song

, Peimeng Zhu
, Yanfeng Zhang
, Ge Yu
:
CloudSimPer: Simulating Geo-Distributed Datacenters Powered by Renewable Energy Mix. 531-547 - Jie Xu

, Yulong Ming
, Zihan Wu
, Cong Wang
, Xiaohua Jia
:
X-Shard: Optimistic Cross-Shard Transaction Processing for Sharding-Based Blockchains. 548-559 - Zhaojie Wen

, Qiong Chen
, Yipei Niu
, Zhen Song
, Quanfeng Deng
, Fangming Liu
:
Joint Optimization of Parallelism and Resource Configuration for Serverless Function Steps. 560-576 - Dongsheng Li

, Shengwei Li
, Zhiquan Lai
, Yongquan Fu
, Xiangyu Ye
, Lei Cai
, Linbo Qiao
:
A Memory-Efficient Hybrid Parallel Framework for Deep Neural Network Training. 577-591 - Meilin Yang

, Jian Xu
, Wenbo Ding
, Yang Liu
:
FedHAP: Federated Hashing With Global Prototypes for Cross-Silo Retrieval. 592-603 - Amanda Jayanetti

, Saman K. Halgamuge
, Rajkumar Buyya
:
Multi-Agent Deep Reinforcement Learning Framework for Renewable Energy-Aware Workflow Scheduling on Distributed Cloud Data Centers. 604-615 - Jianyuan Lu

, Tian Pan
, Shan He
, Mao Miao
, Guangzhe Zhou
, Yining Qi
, Shize Zhang
, Enge Song
, Xiaoqing Sun
, Huaiyi Zhao
, Biao Lyu
, Shunmin Zhu
:
CloudSentry: Two-Stage Heavy Hitter Detection for Cloud-Scale Gateway Overload Protection. 616-633 - Subhadeep Karan, Zainul Abideen Sayed

, Jaroslaw Zola
:
End-to-End Bayesian Networks Exact Learning in Shared Memory. 634-645 - Ke Cheng

, Sheng Zhang
, Meizhao Liu
, Yingcheng Gu
, Liu Wei
, Huanyu Cheng
, Kai Liu
, Yu Song
, Xiaohang Shi
, Andong Zhu
, Lei Tang
:
GeoScale: Microservice Autoscaling With Cost Budget in Geo-Distributed Edge Clouds. 646-662 - Zhe Wang

, Jia Hu
, Geyong Min
, Zhiwei Zhao
, Zi Wang
:
Agile Cache Replacement in Edge Computing via Offline-Online Deep Reinforcement Learning. 663-674 - Wai-Kong Lee

, Raymond K. Zhao
, Ron Steinfeld
, Amin Sakzad
, Seong Oun Hwang
:
High Throughput Lattice-Based Signatures on GPUs: Comparing Falcon and Mitaka. 675-692 - Burak Aksar

, Efe Sencan
, Benjamin Schwaller
, Omar Aaziz
, Vitus J. Leung
, Jim M. Brandt
, Brian Kulis
, Manuel Egele
, Ayse K. Coskun
:
Runtime Performance Anomaly Diagnosis in Production HPC Systems Using Active Learning. 693-706
Volume 35, Number 5, May 2024
- Yi-Chien Lin

, Bingyi Zhang
, Viktor K. Prasanna
:
HitGNN: High-Throughput GNN Training Framework on CPU+Multi-FPGA Heterogeneous Platform. 707-719 - Tianyu Zeng

, Xiaoxi Zhang
, Jingpu Duan
, Chao Yu
, Chuan Wu
, Xu Chen
:
An Offline-Transfer-Online Framework for Cloud-Edge Collaborative Distributed Reinforcement Learning. 720-731 - Yuhang Liu

, Xin Deng
, Jiapeng Zhou
, Mingyu Chen
, Yungang Bao
:
Suppressing the Interference Within a Datacenter: Theorems, Metric and Strategy. 732-750 - Enge Song

, Tian Pan
, Haoyu Song
, Qiang Fu
, Yingjiang Liu
, Chenhao Jia
, Chuanying Yuan
, Minglan Gao
, Jiao Zhang
, Tao Huang
, Yunjie Liu
:
INT-Label: Lightweight In-Band Network-Wide Telemetry via Distributed Labeling. 751-767 - Fan Yuan

, Xiaojian Yang
, Shengguo Li
, Dezun Dong
, Chun Huang
, Zheng Wang
:
Optimizing Multi-Grid Preconditioned Conjugate Gradient Method on Multi-Cores. 768-779 - Yuanhong Zhang

, Weizhan Zhang
, Haipeng Du
, Caixia Yan
, Li Liu
, Qinghua Zheng
:
FHVAC: Feature-Level Hybrid Video Adaptive Configuration for Machine-Centric Live Streaming. 780-795 - Bowen Zhang

, Shengan Zheng
, Liangxu Nie
, Zhenlin Qi
, Hongyi Chen
, Linpeng Huang
, Hong Mei
:
Revisiting PM-Based B+-Tree With Persistent CPU Cache. 796-813 - Anshuman Misra

, Ajay D. Kshemkalyani
:
Byzantine-Tolerant Causal Ordering for Unicasts, Multicasts, and Broadcasts. 814-828 - Dingding Li

, Weijie Zhang
, Mianxiong Dong
, Kaoru Ota
:
DMA-Assisted I/O for Persistent Memory. 829-843 - Runzhou Han

, Mai Zheng
, Suren Byna
, Houjun Tang
, Bin Dong
, Dong Dai
, Yong Chen
, Dongkyun Kim, Joseph Hassoun, David Thorsley
:
PROV-IO$^+$+: A Cross-Platform Provenance Framework for Scientific Data on HPC Systems. 844-861
Volume 35, Number 6, June 2024
- Jiamin Fan

, Kui Wu
, Guoming Tang
, Yang Zhou
, Shengqiang Huang
:
Taking Advantage of the Mistakes: Rethinking Clustered Federated Learning for IoT Anomaly Detection. 707-721 - Xinyi Ji

, Jiankuo Dong
, Tonggui Deng
, Pinchang Zhang
, Jiafeng Hua
, Fu Xiao
:
HI-Kyber: A Novel High-Performance Implementation Scheme of Kyber Based on GPU. 722-736 - Chiranjeb Mondal

, Sanjay V. Rajopadhye
:
Taking RNA-RNA Interaction to Machine Peak. 737-749 - Siqi Wang

, Tianyu Feng
, Hailong Yang
, Xin You
, Bangduo Chen
, Tongxuan Liu
, Zhongzhi Luan
, Depei Qian
:
AtRec: Accelerating Recommendation Model Training on CPUs. 750-763 - Wei-Mei Chen

, Hsin-Hung Tsai
, Joon Fong Ling:
Parallel Computation of Dominance Scores for Multidimensional Datasets on GPUs. 764-776 - Zirui Liu

, Yikai Zhao
, Zhuochen Fan
, Tong Yang
, Xiaodong Li
, Ruwen Zhang
, Kaicheng Yang
, Zihan Jiang
, Zheng Zhong, Yi Huang
, Cong Liu, Jing Hu, Gaogang Xie
, Bin Cui
:
BurstBalancer: Do Less, Better Balance for Large-Scale Data Center Traffic. 777-794 - Jiesong Liu

, Feng Zhang
, Lv Lu
, Chang Qi
, Xiaoguang Guo
, Dong Deng
, Guoliang Li
, Huanchen Zhang
, Jidong Zhai
, Hechen Zhang
, Yuxing Chen
, Anqun Pan
, Xiaoyong Du
:
G-Learned Index: Enabling Efficient Learned Index on GPU. 795-812 - Amirhossein Taherpour

, Xiaodong Wang
:
HybridChain: Fast, Accurate, and Secure Transaction Processing With Distributed Learning. 813-827 - Pourya Soltani

, Farid Ashtiani
:
Analytical Modeling and Throughput Computation of Blockchain Sharding. 828-842 - Zheng Zhang

, Yaqi Xia
, Hulin Wang
, Donglin Yang
, Chuang Hu
, Xiaobo Zhou
, Dazhao Cheng
:
MPMoE: Memory Efficient MoE for Pre-Trained Models With Adaptive Pipeline Parallelism. 843-856 - Cheng Wang

, Kun Xie
, Jiazheng Tian
, Jigang Wen
, Xiaocan Li
, Gaogang Xie
, Kenli Li
:
HPETC: History Priority Enhanced Tensor Completion for Network Distance Measurement. 857-873 - Kaiyang Liu

, Jingrong Wang
, Zhiming Huang
, Jianping Pan
:
Sampling-Based Multi-Job Placement for Heterogeneous Deep Learning Clusters. 874-888 - Guoqing Xiao

, Chuanghui Yin
, Yuedan Chen
, Mingxing Duan
, Kenli Li
:
Efficient Utilization of Multi-Threading Parallelism on Heterogeneous Systems for Sparse Tensor Contraction. 889-900 - Xin Du

, Minglong Wang
, Zhihui Lu
, Qiang Duan
, Yuhao Liu
, Jianfeng Feng
, Huarui Wang
:
HRCM: A Hierarchical Regularizing Mechanism for Sparse and Imbalanced Communication in Whole Human Brain Simulations. 901-918 - Dazhao Cheng

, Kai Yan
, Xinquan Cai
, Yili Gong
, Chuang Hu
:
SLO-Aware Function Placement for Serverless Workflows With Layer-Wise Memory Sharing. 919-936 - Chen Wang

, Kathryn M. Mohror
, Marc Snir
:
Formal Definitions and Performance Comparison of Consistency Models for Parallel File Systems. 937-951 - Zhiyuan Wu

, Sheng Sun
, Yuwei Wang
, Min Liu
, Quyang Pan, Xuefeng Jiang
, Bo Gao
:
FedICT: Federated Multi-Task Distillation for Multi-Access Edge Computing. 952-966
Volume 35, Number 7, July 2024
- Runzhen Xue

, Dengke Han
, Mingyu Yan
, Mo Zou
, Xiaocheng Yang
, Duo Wang
, Wenming Li
, Zhimin Tang
, John Kim
, Xiaochun Ye
, Dongrui Fan
:
HiHGNN: Accelerating HGNNs Through Parallelism and Data Reusability Exploitation. 1122-1138 - Bowen Zhang

, Huaxi Gu
, Grace Li Zhang
, Yintang Yang
, Ziteng Ma, Ulf Schlichtmann
:
A 3D Hybrid Optical-Electrical NoC Using Novel Mapping Strategy Based DCNN Dataflow Acceleration. 1139-1154 - Chen Chen

, Hong Xu
, Wei Wang
, Baochun Li
, Bo Li
, Li Chen
, Gong Zhang
:
Synchronize Only the Immature Parameters: Communication-Efficient Federated Learning By Freezing Parameters Adaptively. 1155-1173 - Xiaqing Li

, Qi Guo
, Guangyan Zhang
, Siwei Ye
, Guanhua He
, Yiheng Yao
, Rui Zhang
, Yifan Hao
, Zidong Du
, Weimin Zheng
:
FastTuning: Enabling Fast and Efficient Hyper-Parameter Tuning With Partitioning and Parallelism of Search Space. 1174-1188 - Linsi Lan

, Junbo Wang
, Zhi Li
, Krishna Kant
, Wanquan Liu
:
FedREM: Guided Federated Learning in the Presence of Dynamic Device Unpredictability. 1189-1206 - Rahul Mishra

, Hari Prabhat Gupta
, Garvit Banga
, Sajal K. Das
:
Fed-RAC: Resource-Aware Clustering for Tackling Heterogeneity of Participants in Federated Learning. 1207-1220 - Yuyang Jin

, Haojie Wang
, Runxin Zhong
, Chen Zhang
, Xia Liao
, Feng Zhang
, Jidong Zhai
:
Graph-Centric Performance Analysis for Large-Scale Parallel Applications. 1221-1238 - Yuzhen Zhao

, Xiyu Liu
:
Spiking Neural P Systems With Microglia. 1239-1250 - Liang Zhang

, Wenli Zheng
, Kuangyu Zheng
, Hongzi Zhu
, Chao Li
, Minyi Guo
:
Bayesian-Driven Automated Scaling in Stream Computing With Multiple QoS Targets. 1251-1267 - Lu Zhao

, Fu Xiao
, Bo Li
, Jian Zhou
, Xiaolong Xu
, Yun Yang
:
Availability-Aware Revenue-Effective Application Deployment in Multi-Access Edge Computing. 1268-1280 - Kai Zhang

, Jiahui Hong
, Zhengying He
, Yinan Jing
, X. Sean Wang
:
AdaptChain: Adaptive Data Sharing and Synchronization for NFV Systems on Heterogeneous Architectures. 1281-1292 - Chilankamol Sunny

, Satyajit Das
, Kevin J. M. Martin
, Philippe Coussy
:
CREPE: Concurrent Reverse-Modulo-Scheduling and Placement for CGRAs. 1293-1306 - Daniela Loreti

, Marcello Artioli
, Anna Ciampolini
:
Rollback-Free Recovery for a High Performance Dense Linear Solver With Reduced Memory Footprint. 1307-1319 - Sai Zhang

, Li Tang
, Yan-Jun Liu
:
Adaptive Neural Control for a Network of Parabolic PDEs With Event-Triggered Mechanism. 1320-1330
Volume 35, Number 8, August 2024
- Jinfan Chen

, Shigang Li
, Ran Guo
, Jinhui Yuan
, Torsten Hoefler
:
AutoDDL: Automatic Distributed Deep Learning With Near-Optimal Bandwidth Cost. 1331-1344 - Isra Mohamed Ali

, Mohamed M. Abdallah
:
On Off-Chaining Smart Contract Runtime Protection: A Queuing Model Approach. 1345-1359 - Yanxi Zhang, Muyu Mei, Dongqi Yan, Xu Zhang, Qinghai Yang, Mingwu Yao:

Age-of-Event Aware: Sampling Period Optimization in a Three-Stage Wireless Cyber-Physical System With Diverse Parallelisms. 1360-1372 - Yang Zhou

, Fang Wang
, Zhan Shi
, Dan Feng
:
The Static Allocation is Not a Static: Optimizing SSD Address Allocation Through Boosting Static Policy. 1373-1386 - Changmao Wu

, Zhengwei Xu
, Xiaoming He
, Qi Lou
, Yuanyuan Xia
, Shuman Huang
:
Proactive Caching With Distributed Deep Reinforcement Learning in 6G Cloud-Edge Collaboration Computing. 1387-1399 - Xiaofeng Hou

, Xuehan Tang
, Jiacheng Liu
, Chao Li
, Luhong Liang
, Kwang-Ting Cheng
:
WASP: Efficient Power Management Enabling Workload-Aware, Self-Powered AIoT Devices. 1400-1414 - Shengwei Li

, Kai Lu
, Zhiquan Lai
, Weijie Liu
, Keshi Ge
, Dong Sheng Li
:
A Multidimensional Communication Scheduling Method for Hybrid Parallel DNN Training. 1415-1428 - Jingwen Zhou

, Feifei Chen
, Guangming Cui
, Yong Xiang
, Qiang He
:
FEUAGame: Fairness-Aware Edge User Allocation for App Vendors. 1429-1443 - Jiantong Jiang

, Zeyi Wen
, Atif Bin Mansoor
, Ajmal Mian
:
Faster-BNI: Fast Parallel Exact Inference on Bayesian Networks. 1444-1455 - Xinliang Wei

, Kejiang Ye
, Xinghua Shi, Cheng-Zhong Xu
, Yu Wang
:
Joint Participant and Learning Topology Selection for Federated Learning in Edge Clouds. 1456-1468 - Chengying Huan

, Yongchao Liu
, Heng Zhang
, Hang Liu
, Shiyang Chen
, Shuaiwen Leon Song
, Yanjun Wu
:
TeGraph+: Scalable Temporal Graph Processing Enabling Flexible Edge Modifications. 1469-1487 - Liang Geng

, Hao Wang
, Jingsong Meng
, Dayi Fan
, Sami Ben-Romdhane
, Hari Kadayam Pichumani
, Vinay Phegade
, Xiaodong Zhang
:
RR-Compound: RDMA-Fused gRPC for Low Latency, High Throughput, and Easy Interface. 1488-1505 - Junxue Zhang

, Xiaodian Cheng
, Liu Yang
, Jinbin Hu
, Han Tian
, Kai Chen
:
High-Performance Hardware Acceleration Architecture for Cross-Silo Federated Learning. 1506-1523
Volume 35, Number 9, September 2024
- Yi-Wei Ci

, Michael R. Lyu
, Zhan Zhang
, De-Cheng Zuo
, Xiao-Zong Yang:
KLNK: Expanding Page Boundaries in a Distributed Shared Memory System. 1524-1535 - Sheng Qi

, Chao Jin
, Mosharaf Chowdhury
, Zhenming Liu, Xuanzhe Liu
, Xin Jin
:
Pyxis: Scheduling Mixed Tasks in Disaggregated Datacenters. 1536-1550 - Ahmad Tarraf

, Martin Schreiber
, Alberto Cascajo
, Jean-Baptiste Besnard
, Marc-André Vef
, Dominik Huber
, Sonja Happ
, André Brinkmann
, David E. Singh
, Hans-Christian Hoppe
, Alberto Miranda
, Antonio J. Peña
, Rui Machado
, Marta Garcia-Gasulla
, Martin Schulz
, Paul M. Carpenter
, Simon Pickartz
, Tiberiu Rotaru
, Sergio Iserte
, Víctor López
, Jorge Ejarque
, Heena Sirwani
, Jesús Carretero
, Felix Wolf
:
Malleability in Modern HPC Systems: Current Experiences, Challenges, and Future Opportunities. 1551-1564 - Jiuchen Shi

, Kaihua Fu
, Jiawen Wang
, Quan Chen
, Deze Zeng
, Minyi Guo
:
Adaptive QoS-Aware Microservice Deployment With Excessive Loads via Intra- and Inter-Datacenter Scheduling. 1565-1582 - Dhruv Gajaria

, Kevin Antony Gomez
, Tosiron Adegbija
:
STT-RAM-Based Hierarchical in-Memory Computing. 1615-1629 - Rong Cong

, Zhiwei Zhao
, Linyuanqi Zhang
, Geyong Min
:
Cost-Effective Server Deployment for Multi-Access Edge Networks: A Cooperative Scheme. 1583-1597 - Yifan Hua

, Shengan Zheng
, Weihan Kong
, Cong Zhou, Kaixin Huang
, Ruoyan Ma, Linpeng Huang
:
RADAR: A Skew-Resistant and Hotness-Aware Ordered Index Design for Processing-in-Memory Systems. 1598-1614 - Ran Wang

, Cheng Xu
, Xiaotong Zhang
:
Toward Materials Genome Big-Data: A Blockchain-Based Secure Storage and Efficient Retrieval Method. 1630-1643 - Yuchen Zhong

, Guangming Sheng
, Juncheng Liu, Jinhui Yuan, Chuan Wu
:
Swift: Expedited Failure Recovery for Large-Scale DNN Training. 1644-1656 - Gabriele Mencagli

, Patrizio Dazzi
, Massimo Coppola
:
Springald: GPU-Accelerated Window-Based Aggregates Over Out-of-Order Data Streams. 1657-1671 - Cunyang Wei

, Haipeng Jia
, Yunquan Zhang
, Jianyu Yao, Chendi Li
, Wenxuan Cao:
IrGEMM: An Input-Aware Tuning Framework for Irregular GEMM on ARM and X86 CPUs. 1672-1689
Volume 35, Number 10, October 2024
- Jiaxing Qi

, Wencong Xiao, Mingzhen Li
, Chaojie Yang, Yong Li, Wei Lin, Hailong Yang
, Zhongzhi Luan
, Depei Qian
:
ElasticBatch: A Learning-Augmented Elastic Scheduling System for Batch Inference on MIG. 1708-1720 - Rong Chen

, Xingda Wei
, Xiating Xie, Haibo Chen
:
Locality-Preserving Graph Traversal With Split Live Migration. 1810-1825 - Mi Zhang

, Qihan Kang, Patrick P. C. Lee
:
FlexRaft: Exploiting Flexible Erasure Coding for Minimum-Cost Consensus and Fast Recovery. 1826-1840 - Peixuan Li

, Ping Xie
, Qiang Cao
:
SSRAID: A Stripe-Queued and Stripe-Threaded Merging I/O Strategy to Improve Write Performance of Serial Interface SSD RAID. 1841-1853 - Fatemeh Keshavarz-Kohjerdi

:
Paired Many-to-Many 2-Disjoint Path Covers in Meshes. 1854-1866 - Jiangfei Duan

, Xiuhong Li
, Ping Xu, Xingcheng Zhang
, Shengen Yan, Yun Liang
, Dahua Lin
:
Proteus: Simulating the Performance of Distributed DNN Training. 1867-1878
Volume 35, Number 11, November 2024
- Yichen Li

, Wenchao Xu
, Yining Qi
, Haozhao Wang
, Ruixuan Li
, Song Guo
:
SR-FDIL: Synergistic Replay for Federated Domain-Incremental Learning. 1879-1890 - Yuezhi Che

, Dazhao Cheng
, Xiao Wang
, Rujia Wang
:
Opca: Enabling Optimistic Concurrent Access for Multiple Users in Oblivious Data Storage. 1891-1903 - Yaqi Xia

, Zheng Zhang
, Donglin Yang
, Chuang Hu
, Xiaobo Zhou
, Hongyang Chen
, Qianlong Sang
, Dazhao Cheng
:
Redundancy-Free and Load-Balanced TGNN Training With Hierarchical Pipeline Parallelism. 1904-1919 - Haoran Zhou

, Wei Rang
, Hongyang Chen
, Xiaobo Zhou
, Dazhao Cheng
:
DeepTM: Efficient Tensor Management in Heterogeneous Memory for DNN Training. 1920-1935 - Sanjay Lall

, Calin Cascaval
, Martin Izzard, Tammo Spalink:
Logical Synchrony and the Bittide Mechanism. 1936-1948 - Peng Wang

, Hong Jiang
, Yu Liu
, Zhelong Zhao
, Ke Zhou
, Zhihai Huang:
Beyond Belady to Attain a Seemingly Unattainable Byte Miss Ratio for Content Delivery Networks. 1949-1963 - Shiyu Shen

, Hao Yang
, Wangchen Dai
, Hong Zhang, Zhe Liu
, Yunlei Zhao
:
High-Throughput GPU Implementation of Dilithium Post-Quantum Digital Signature. 1964-1976 - Hua Huang

, Edmond Chow
:
Exploring the Design Space of Distributed Parallel Sparse Matrix-Multiple Vector Multiplication. 1977-1988 - Zhaojie Wen

, Qiong Chen
, Quanfeng Deng
, Yipei Niu
, Zhen Song
, Fangming Liu
:
ComboFunc: Joint Resource Combination and Container Placement for Serverless Function Scaling With Heterogeneous Container. 1989-2005 - Huali Lu

, Feng Lyu
, Ju Ren
, Huaqing Wu
, Conghao Zhou
, Zhongyuan Liu, Yaoxue Zhang
, Xuemin Shen
:
CODE$^{+}$+: Fast and Accurate Inference for Compact Distributed IoT Data Collection. 2006-2022 - Di Mou

, Bo Wang, Dajiang Liu
:
SC-CGRA: An Energy-Efficient CGRA Using Stochastic Computing. 2023-2038 - Yin Xu

, Mingjun Xiao
, Jie Wu
, He Sun
:
Privacy Preserving Task Push in Spatial Crowdsourcing With Unknown Popularity. 2039-2053 - Lan Zhang

, Anran Li
, Hongyi Peng
, Feng Han, Fan Huang, Xiang-Yang Li
:
Privacy-Preserving Data Selection for Horizontal and Vertical Federated Learning. 2054-2068 - Kai Chen

, Qingjun Qu, Feng Zhu
, Zhengming Yi, Wenjie Tang
:
CPLNS: Cooperative Parallel Large Neighborhood Search for Large-Scale Multi-Agent Path Finding. 2069-2086 - Qiqi Duan

, Chang Shao
, Guochen Zhou
, Minghan Zhang, Qi Zhao
, Yuhui Shi
:
Distributed Evolution Strategies With Multi-Level Learning for Large-Scale Black-Box Optimization. 2087-2101 - Ping Luo

, Jieren Cheng
, Neal Xiong
, Zhenhao Liu, Jie Wu
:
FedVeca: Federated Vectorized Averaging on Non-IID Data With Adaptive Bi-Directional Global Objective. 2102-2113 - Hui Dou

, Yilun Wang
, Yiwen Zhang
, Pengfei Chen
, Zibin Zheng
:
DeepCAT+: A Low-Cost and Transferrable Online Configuration Auto-Tuning Approach for Big Data Frameworks. 2114-2131 - Biao Hou

, Song Yang
, Fan Li
, Liehuang Zhu
, Lei Jiao
, Xu Chen
, Xiaoming Fu
:
Gamora: Learning-Based Buffer-Aware Preloading for Adaptive Short Video Streaming. 2132-2146 - Feng Yao

, Qian Tao
, Shengyuan Lin
, Yanfeng Zhang
, Wenyuan Yu
, Shufeng Gong
, Qiange Wang
, Ge Yu
, Jingren Zhou
:
Towards Efficient Graph Processing in Geo-Distributed Data Centers. 2147-2160 - Darong Huang

, Luis Costero
, David Atienza
:
An Evaluation Framework for Dynamic Thermal Management Strategies in 3D MultiProcessor System-on-Chip Co-Design. 2161-2176 - Rui Tian

, Jiazhi Jiang
, Jiangsu Du
, Dan Huang
, Yutong Lu:
Sophisticated Orchestrating Concurrent DLRM Training on CPU/GPU Platform. 2177-2192 - Donglei Wu

, Weihao Yang, Xiangyu Zou
, Hao Feng
, Dingwen Tao
, Shiyi Li
, Wen Xia
, Binxing Fang:
BIRD+: Design of a Lightweight Communication Compressor for Resource-Constrained Distribution Learning Platforms. 2193-2207 - Yuyang Jin

, Runxin Zhong
, Saiqin Long
, Jidong Zhai
:
Efficient Inference for Pruned CNN Models on Mobile Devices With Holistic Sparsity Alignment. 2208-2223 - Shouxi Luo

, Renyi Wang, Ke Li
, Huanlai Xing
:
Efficient Cross-Cloud Partial Reduce With CREW. 2224-2238 - Renyou Xie

, Chaojie Li
, Xiaojun Zhou
, Zhaoyang Dong
:
Accelerating Communication-Efficient Federated Multi-Task Learning With Personalization and Fairness. 2239-2253 - Hanfei Yu

, Hao Wang
, Jian Li
, Xu Yuan
, Seung-Jong Park:
Freyr $^+$+: Harvesting Idle Resources in Serverless Computing via Deep Reinforcement Learning. 2254-2269 - Jiandong Liu

, Lan Zhang
, Fengxiang He
, Chi Zhang
, Shanyang Jiang
, Xiang-Yang Li
:
Communication-Efficient Regret-Optimal Distributed Online Convex Optimization. 2270-2283 - Renwen Ma, Kai Hwang

, Mo Li
, Yiming Miao
:
Trusted Model Aggregation With Zero-Knowledge Proofs in Federated Learning. 2284-2296
Volume 35, Number 12, December 2024
- Quan Deng

, Qiang Liu
, Ming Yuan, Xiaohui Duan
, Lin Gan
, Jinzhe Yang, Wenlai Zhao
, Zhenxiang Zhang
, Guiming Wu
, Wayne Luk
, Haohuan Fu
, Guangwen Yang
:
Acceleration of Multi-Body Molecular Dynamics With Customized Parallel Dataflow. 2297-2314 - Liang Wang

, Jinzhe Yang, Jidong Zhai
, Guangwen Yang
:
Optimizing I/O Performance Through Effective vCPU Scheduling Interference Management. 2315-2330 - Shuangwu Chen

, Jiangming Li
, Qifeng Yuan
, Huasen He
, Sen Li, Jian Yang
:
Two-Timescale Joint Optimization of Task Scheduling and Resource Scaling in Multi-Data Center System Based on Multi-Agent Deep Reinforcement Learning. 2331-2346 - Francesco De Pellegrini

, Vaibhav Kumar Gupta
, Rachid El Azouzi
, Serigne Gueye, Cédric Richier
, Jeremie Leguay
:
Fair Coflow Scheduling via Controlled Slowdown. 2347-2360 - Devki Nandan Jha

, Yinhao Li, Zhenyu Wen
, Graham Morgan
, Prem Prakash Jayaraman
, Maciej Koutny, Omer F. Rana
, Rajiv Ranjan
:
GeoDeploy: Geo-Distributed Application Deployment Using Benchmarking. 2361-2374 - Zhiqi Lin

, Youshan Miao
, Guanbin Xu, Cheng Li
, Olli Saarikivi, Saeed Maleki, Fan Yang:
Efficient Schedule Construction for Distributed Execution of Large DNN Models. 2375-2391 - Qiushi Zheng

, Jiong Jin
, Zhishu Shen
, Libing Wu
, Iftekhar Ahmad
, Yong Xiang
:
Distributed Task Processing Platform for Infrastructure-Less IoT Networks: A Multi-Dimensional Optimization Approach. 2392-2404 - Bingyi Zhang

, Rajgopal Kannan, Carl E. Busart, Viktor K. Prasanna
:
VisionAGILE: A Versatile Domain-Specific Accelerator for Computer Vision Tasks. 2405-2422 - Jinyu Hu

, Huizhang Luo
, Hong Jiang
, Guoqing Xiao
, Kenli Li
:
FastLoad: Speeding Up Data Loading of Both Sparse Matrix and Vector for SpMV on GPUs. 2423-2434 - Rong Hu

, Haotian Wang
, Wangdong Yang
, Renqiu Ouyang
, Keqin Li
, Kenli Li
:
BCB-SpTC: An Efficient Sparse High-Dimensional Tensor Contraction Employing Tensor Core Acceleration. 2435-2448 - Binghan Wu

, Wei Bao
, Bing Bing Zhou
:
Competitive Analysis of Online Elastic Caching of Transient Data in Multi-Tiered Content Delivery Network. 2449-2462 - Zhenhua Guo, Yinan Tang

, Jidong Zhai
, Tongtong Yuan
, Jian Jin
, Li Wang, Yaqian Zhao, Rengang Li
:
A Survey on Performance Modeling and Prediction for Distributed DNN Training. 2463-2478 - Hui Sun

, Deyan Kong, Song Jiang, Yinliang Yue
, Xiao Qin
:
TrieKV: A High-Performance Key-Value Store Design With Memory as Its First-Class Citizen. 2479-2496 - Keyuan Wang

, Linpeng Jia
, Zhaoxiong Song
, Yi Sun
:
Mitosis: A Scalable Sharding System Featuring Multiple Dynamic Relay Chains. 2497-2512 - Chunlin Tian

, Li Li
, Kahou Tam
, Yebo Wu
, Cheng-Zhong Xu
:
Breaking the Memory Wall for Heterogeneous Federated Learning via Model Splitting. 2513-2526 - Ruchi Bhoot

, Suved Sanjay Ghanmode
, Yogesh Simmhan
:
TARIS: Scalable Incremental Processing of Time-Respecting Algorithms on Streaming Graphs. 2527-2544

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














