default search action
IEEE Transactions on Parallel and Distributed Systems, Volume 33
Volume 33, Number 1, January 2022
- J. Rubén Titos Gil, Ricardo Fernández Pascual, Alberto Ros, Manuel E. Acacio:
DeTraS: Delaying Stores for Friendly-Fire Mitigation in Hardware Transactional Memory. 1-13 - Haozhao Wang, Song Guo, Zhihao Qu, Ruixuan Li, Ziming Liu:
Error-Compensated Sparsification for Communication-Efficient Decentralized Training in Edge Environment. 14-25 - Georgios Andreadis, Fabian Mastenbroek, Vincent van Beek, Alexandru Iosup:
Capelin: Data-Driven Compute Capacity Procurement for Cloud Datacenters Using Portfolios of Scenarios. 26-39 - Yibo Jin, Zhuzhong Qian, Song Guo, Sheng Zhang, Lei Jiao, Sanglu Lu:
$run$ runData: Re-Distributing Data via Piggybacking for Geo-Distributed Data Analytics Over Edges. 40-55 - Si Wu, Zhirong Shen, Patrick P. C. Lee, Yinlong Xu:
Optimal Repair-Scaling Trade-off in Locally Repairable Codes: Analysis and Evaluation. 56-69 - Gangzhao Lu, Weizhe Zhang, Zheng Wang:
Optimizing Depthwise Separable Convolution Operations on GPUs. 70-87 - Gingfung Yeung, Damian Borowiec, Renyu Yang, Adrian Friday, Richard Harper, Peter Garraghan:
Horus: Interference-Aware and Prediction-Based Scheduling in Deep Learning Systems. 88-100 - Shreshth Tuli, Shivananda R. Poojara, Satish Narayana Srirama, Giuliano Casale, Nicholas R. Jennings:
COSCO: Container Orchestration Using Co-Simulation and Gradient Based Optimization for Fog Computing Environments. 101-116 - Dipika Deb, Rohith M. K., John Jose:
FlitZip: Effective Packet Compression for NoC in MultiProcessor System-on-Chip. 117-128 - Tongfeng Weng, Xu Zhou, Kenli Li, Peng Peng, Keqin Li:
Efficient Distributed Approaches to Core Maintenance on Large Dynamic Graphs. 129-143 - Yidi Wu, Kaihao Ma, Xiao Yan, Zhi Liu, Zhenkun Cai, Yuzhen Huang, James Cheng, Han Yuan, Fan Yu:
Elastic Deep Learning in Multi-Tenant GPU Clusters. 144-158 - Zhen Xie, Guangming Tan, Weifeng Liu, Ninghui Sun:
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures. 159-175 - Federico Magnanini, Luca Ferretti, Michele Colajanni:
Scalable, Confidential and Survivable Software Updates. 176-191 - Yuhao Zhou, Qing Ye, Jiancheng Lv:
Communication-Efficient Federated Learning With Compensated Overlap-FedAvg. 192-205 - Saad Zia Sheikh, Muhammad Adeel Pasha:
Energy-Efficient Cache-Aware Scheduling on Heterogeneous Multicore Systems. 206-217 - Huan Wang, Guoming Tang, Kui Wu, Jianping Wang:
PLVER: Joint Stable Allocation and Content Replication for Edge-Assisted Live Video Delivery. 218-230 - Amelie Chi Zhou, Weilin Xue, Yao Xiao, Bingsheng He, Shadi Ibrahim, Reynold Cheng:
Taming System Dynamics on Resource Optimization for Data Processing Workflows: A Probabilistic Approach. 231-248
Volume 33, Number 2, February 2022
- Shutong Chen, Lei Jiao, Fangming Liu, Lin Wang:
EdgeDR: An Online Mechanism Design for Demand Response in Edge Clouds. 343-358 - Ping Gao, Xiaohui Duan, Bertil Schmidt, Wusheng Zhang, Lin Gan, Haohuan Fu, Wei Xue, Weiguo Liu, Guangwen Yang:
Optimization of Reactive Force Field Simulation: Refactor, Parallelization, and Vectorization for Interactions. 359-373 - Yipei Niu, Panpan Jin, Jian Guo, Yikai Xiao, Rong Shi, Fangming Liu, Chen Qian, Yang Wang:
PostMan: Rapidly Mitigating Bursty Traffic via On-Demand Offloading of Packet Processing. 374-387 - Konstantinos Iliakis, Sotirios Xydis, Dimitrios Soudris:
Repurposing GPU Microarchitectures with Light-Weight Out-Of-Order Execution. 388-402 - Li Chen, Shuhao Liu, Baochun Li:
Optimizing Network Transfers for Data Analytic Jobs Across Geo-Distributed Datacenters. 403-414 - Limei Lin, Yanze Huang, Li Xu, Sun-Yuan Hsieh:
A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity. 415-428 - Nhut-Minh Ho, Weng-Fai Wong:
Tensorox: Accelerating GPU Applications via Neural Approximation on Unused Tensor Cores. 429-443 - Abdurrahman Yasar, Sivasankaran Rajamanickam, Jonathan W. Berry, Ümit V. Çatalyürek:
A Block-Based Triangle Counting Algorithm on Heterogeneous Environments. 444-458 - Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression. 459-475 - Jie Cui, Bei Li, Hong Zhong, Geyong Min, Yan Xu, Lu Liu:
A Practical and Efficient Bidirectional Access Control Scheme for Cloud-Edge Data Sharing. 476-488 - Scott Pakin, Christof Teuscher, Catherine D. Schuman:
Guest Editorial: Special Section on Parallel and Distributed Computing Techniques for Non-Von Neumann Technologies. 249-250 - Chang Hyun Kim, Won Jun Lee, Yoonah Paik, Kiyong Kwon, Seok Young Kim, Il Park, Seon Wook Kim:
Silent-PIM: Realizing the Processing-in-Memory Computing With Standard Memory Requests. 251-262 - Purab Ranjan Sutradhar, Sathwika Bavikadi, Mark Connolly, Savankumar Prajapati, Mark A. Indovina, Sai Manoj Pudukotai Dinakarrao, Amlan Ganguly:
Look-up-Table Based Processing-in-Memory Architecture With Programmable Precision-Scaling for Deep Learning Applications. 263-275 - Leonid Yavits, Roman Kaplan, Ran Ginosar:
GIRAF: General Purpose In-Storage Resistive Associative Framework. 276-287 - Twisha Titirsha, Shihao Song, Anup Das, Jeffrey L. Krichmar, Nikil D. Dutt, Nagarajan Kandasamy, Francky Catthoor:
Endurance-Aware Mapping of Spiking Neural Networks to Neuromorphic Hardware. 288-301 - Kyle Henke, Garrett T. Kenyon, Ben Migliori:
Fast Post-Hoc Normalization for Brain Inspired Sparse Coding on a Neuromorphic Device. 302-309 - Elijah Pelofske, Georg Hahn, Hristo N. Djidjev:
Inferring the Dynamics of the State Evolution During Quantum Annealing. 310-321 - Karolos-Alexandros Tsakalos, Georgios Ch. Sirakoulis, Andrew Adamatzky, Jim Smith:
Protein Structured Reservoir Computing for Spike-Based Pattern Recognition. 322-331 - Bosheng Song, Kenli Li, Xiangxiang Zeng:
Monodirectional Evolutional Symport Tissue P Systems With Promoters and Cell Division. 332-342
Volume 33, Number 3, March 2022
- Shixiong Zhao, Fanxin Li, Xusheng Chen, Xiuxian Guan, Jianyu Jiang, Dong Huang, Yuhao Qing, Sen Wang, Peng Wang, Gong Zhang, Cheng Li, Ping Luo, Heming Cui:
vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training. 489-506 - Yishu Du, Loris Marchal, Guillaume Pallez, Yves Robert:
Optimal Checkpointing Strategies for Iterative Applications. 507-522 - Marcin Copik, Tobias Grosser, Torsten Hoefler, Paolo Bientinesi, Benjamin Berkels:
Work-Stealing Prefix Scan: Addressing Load Imbalance in Large-Scale Image Registration. 523-535 - Wei Yang Bryan Lim, Jer Shyuan Ng, Zehui Xiong, Jiangming Jin, Yang Zhang, Dusit Niyato, Cyril Leung, Chunyan Miao:
Decentralized Edge Intelligence: A Dynamic Resource Allocation Framework for Hierarchical Federated Learning. 536-550 - Yiwen Gao, Jia Xu, Hongbing Wang:
cuNH: Efficient GPU Implementations of Post-Quantum KEM NewHope. 551-568 - Hui Cai, Fan Ye, Yuanyuan Yang, Yanmin Zhu, Jie Li, Fu Xiao:
Online Pricing and Trading of Private Data in Correlated Queries. 569-585 - Yuan Wang, Hideaki Ishii, François Bonnet, Xavier Défago:
Resilient Real-Valued Consensus in Spite of Mobile Malicious Agents on Directed Graphs. 586-603 - Oliver Giersch, Jörg Nolte:
Fast and Portable Concurrent FIFO Queues With Deterministic Memory Reclamation. 604-616 - Chavit Denninnart, Mohsen Amini Salehi:
Harnessing the Potential of Function-Reuse in Multimedia Cloud Systems. 617-629 - Jed Mills, Jia Hu, Geyong Min:
Multi-Task Federated Learning for Personalised Deep Neural Networks in Edge Computing. 630-641 - John Gounley, Madhurima Vardhan, Erik W. Draeger, Pedro Valero-Lara, Shirley V. Moore, Amanda Randles:
Propagation Pattern for Moment Representation of the Lattice Boltzmann Method. 642-653 - Quan Zheng, Tao Yang, Yuanzhi Kan, Xiaobin Tan, Jian Yang, Xiaofeng Jiang:
On the Analysis of Cache Invalidation With LRU Replacement. 654-666 - Tayebeh Bahreini, Hossein Badri, Daniel Grosu:
Mechanisms for Resource Allocation and Pricing in Mobile Edge Computing Systems. 667-682 - Xing Chen, Jianshan Zhang, Bing Lin, Zheyi Chen, Katinka Wolter, Geyong Min:
Energy-Efficient Offloading for DNN-Based Smart IoT Systems in Cloud-Edge Environments. 683-697 - Anandarup Mukherjee, Pallav Kumar Deb, Sudip Misra:
Timed Loops for Distributed Storage in Wireless Networks. 698-709 - Umar Ibrahim Minhas, Roger F. Woods, Dimitrios S. Nikolopoulos, Georgios Karakonstantis:
Efficient, Dynamic Multi-Task Execution on FPGA-Based Computing Systems. 710-722 - Linsong Cheng, Jiliang Wang, Yinghui Li:
ViTrack: Efficient Tracking on the Edge for Commodity Video Surveillance Systems. 723-735
Volume 33, Number 4, April 2022
- Sadaf R. Alam, Lois Curfman McInnes, Kengo Nakajima:
IEEE Special Issue on Innovative R&D Toward the Exascale Era. 736-738 - Andrea Borghesi, Martin Molan, Michela Milano, Andrea Bartolini:
Anomaly Detection and Anticipation in High Performance Computing Systems. 739-750 - Yiming Wang, Weizhe Zhang, Meng Hao, Zheng Wang:
Online Power Management for Multi-Cores: A Reinforcement Learning Based Approach. 751-764 - Chao Chen, Greg Eisenhauer, Santosh Pande:
Near-Zero Downtime Recovery From Transient-Error-Induced Crashes. 765-778 - Juan M. Cebrian, Thibaud Balem, Adrián Barredo, Marc Casas, Miquel Moretó, Alberto Ros, Alexandra Jimborean:
Compiler-Assisted Compaction/Restoration of SIMD Instructions. 779-791 - Lazaros Papadopoulos, Dimitrios Soudris, Christoph W. Kessler, August Ernstsson, Johan Ahlqvist, Nikos Vasilas, Athanasios I. Papadopoulos, Panos Seferlis, Charles Prouveur, Matthieu Haefele, Samuel Thibault, Athanasios Salamanis, Theodoros Ioakimidis, Dionysios D. Kehagias:
EXA2PRO: A Framework for High Development Productivity on Heterogeneous Computing Systems. 792-804 - Christian R. Trott, Damien Lebrun-Grandié, Daniel Arndt, Jan Ciesko, Vinh Q. Dang, Nathan D. Ellingwood, Rahulkumar Gayatri, Evan Harvey, Daisy S. Hollman, Dan Ibanez, Nevin Liber, Jonathan R. Madsen, Jeff Miles, David Poliakoff, Amy Powell, Sivasankaran Rajamanickam, Mikael Simberg, Dan Sunderland, Bruno Turcksin, Jeremiah J. Wilke:
Kokkos 3: Programming Model Extensions for the Exascale Era. 805-817 - André Merzky, Matteo Turilli, Mikhail Titov, Aymen Al-Saadi, Shantenu Jha:
Design and Performance Characterization of RADICAL-Pilot on Leadership-Class Platforms. 818-829 - Jonas H. Müller Korndörfer, Ahmed Eleliemy, Ali Mohammed, Florina M. Ciorba:
LB4OMP: A Dynamic Load Balancing Library for Multithreaded Applications. 830-841 - Junchao Zhang, Jed Brown, Satish Balay, Jacob Faibussowitsch, Matthew G. Knepley, Oana Marin, Richard Tran Mills, Todd S. Munson, Barry F. Smith, Stefano Zampini:
The PetscSF Scalable Communication Layer. 842-853 - Keren Zhou, Xiaozhu Meng, Ryuichi Sai, Dejan Grubisic, John M. Mellor-Crummey:
An Automated Tool for Analysis and Tuning of GPU-Accelerated Code in HPC Applications. 854-865 - Ivy Bo Peng, Maya B. Gokhale, Karim Youssef, Keita Iwabuchi, Roger Pearce:
Enabling Scalable and Extensible Memory-Mapped Datastores in Userspace. 866-877 - Lipeng Wan, Axel Huebl, Junmin Gu, Franz Poeschel, Ana Gainaru, Ruonan Wang, Jieyang Chen, Xin Liang, Dmitry Ganyushin, Todd S. Munson, Ian T. Foster, Jean-Luc Vay, Norbert Podhorszki, Kesheng Wu, Scott Klasky:
Improving I/O Performance for Exascale Applications Through Online Data Layout Reorganization. 878-890 - Houjun Tang, Quincey Koziol, John Ravi, Suren Byna:
Transparent Asynchronous Parallel I/O Using Background Threads. 891-902 - Jérome Soumagne, Jordan Henderson, Mohamad Chaarawi, Neil Fortner, M. Scot Breitenfeld, Songyu Lu, Dana Robinson, Elena Pourmal, Johann Lombardi:
Accelerating HDF5 I/O for Exascale Using DAOS. 903-914 - Sayan Ghosh, Nathan R. Tallent, Mahantesh Halappanavar:
Characterizing Performance of Graph Neighborhood Communication Patterns. 915-928 - Arindam Khanda, Sriram Srinivasan, Sanjukta Bhowmick, Boyana Norris, Sajal K. Das:
A Parallel Algorithm Template for Updating Single-Source Shortest Paths in Large-Scale Dynamic Networks. 929-940 - Xinbiao Gan, Yiming Zhang, Ruibo Wang, Tiejun Li, Tiaojie Xiao, Ruigeng Zeng, Jie Liu, Kai Lu:
TianheGraph: Customizing Graph Search for Graph500 on Tianhe Supercomputer. 941-951 - Robert F. Bird, Nigel Tan, Scott V. Luedtke, Stephen Lien Harrell, Michela Taufer, Brian J. Albright:
VPIC 2.0: Next Generation Particle-in-Cell Simulations. 952-963 - Sameh Abdulah, Qinglei Cao, Yu Pei, George Bosilca, Jack J. Dongarra, Marc G. Genton, David E. Keyes, Hatem Ltaief, Ying Sun:
Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach With PaRSEC. 964-976 - Stephen Hudson, Jeffrey Larson, John-Luke Navarro, Stefan M. Wild:
libEnsemble: A Library to Coordinate the Concurrent Evaluation of Dynamic Ensembles of Calculations. 977-988 - Ariful Azad, Oguz Selvitopi, Md Taufique Hussain, John R. Gilbert, Aydin Buluç:
Combinatorial BLAS 2.0: Scaling Combinatorial Algorithms on Distributed-Memory Systems. 989-1001 - Gordon Euhyun Moon, Hyoukjun Kwon, Geonhwa Jeong, Prasanth Chatarasi, Sivasankaran Rajamanickam, Tushar Krishna:
Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication. 1002-1014 - Anil Gaihre, Xiaoye Sherry Li, Hang Liu:
gSoFa: Scalable Sparse Symbolic LU Factorization on GPUs. 1015-1026 - Neil Lindquist, Piotr Luszczek, Jack J. Dongarra:
Accelerating Restarted GMRES With Mixed Precision Arithmetic. 1027-1037
Volume 33, Number 5, May 2022
- Fabio Montagna, Stefan Mach, Simone Benatti, Angelo Garofalo, Gianmarco Ottavi, Luca Benini, Davide Rossi, Giuseppe Tagliavini:
A Low-Power Transprecision Floating-Point Cluster for Efficient Near-Sensor Data Analytics. 1038-1053 - Fei Lei, Dezun Dong, Xiangke Liao:
Exploring the Galaxyfly Family to Build Flexible-Scale Interconnection Networks. 1054-1068 - Zongyi Zhao, Xingang Shi, Zhiliang Wang, Qing Li, Han Zhang, Xia Yin:
Efficient and Accurate Flow Record Collection With HashFlow. 1069-1083 - Jiantong Jiang, Zeyi Wen, Ze-ke Wang, Bingsheng He, Jian Chen:
Parallel and Distributed Structured SVM Training. 1084-1096 - Jian Liu, Peilun Li, Raymond Cheng, N. Asokan, Dawn Song:
Parallel and Asynchronous Smart Contract Execution. 1097-1108 - Jiaqi Liu, Shiyue Huang, Deng Li, Sheng Wen, Hui Liu:
Addictive Incentive Mechanism in Crowdsensing From the Perspective of Behavioral Economics. 1109-1127 - Shaoqi Wang, Aidi Pi, Xiaobo Zhou:
Elastic Parameter Server: Accelerating ML Training With Scalable Resource Scheduling. 1128-1143 - Xiaoyu Xia, Feifei Chen, Qiang He, Guangming Cui, John C. Grundy, Mohamed Almorsy Abdelrazek, Xiaolong Xu, Hai Jin:
Data, User and Power Allocations for Caching in Multi-Access Edge Computing. 1144-1155 - Bruno Donassolo, Arnaud Legrand, Panayotis Mertikopoulos, Ilhem Fajjari:
Online Reconfiguration of IoT Applications in the Fog: The Information-Coordination Trade-Off. 1156-1172 - Lipeng Wang, Qiong Luo, Shengen Yan:
DIESEL+: Accelerating Distributed Deep Learning Tasks on Image Datasets. 1173-1184 - Zhi Ma, Sheng Zhang, Zhiqi Chen, Tao Han, Zhuzhong Qian, Mingjun Xiao, Ning Chen, Jie Wu, Sanglu Lu:
Towards Revenue-Driven Multi-User Online Task Offloading in Edge Computing. 1185-1198 - Jing Li, Weifa Liang, Wenzheng Xu, Zichuan Xu, Xiaohua Jia, Wanlei Zhou, Jin Zhao:
Maximizing User Service Satisfaction for Delay-Sensitive IoT Applications in Edge Computing. 1199-1212 - Takuya Kojima, Ayaka Ohwada, Hideharu Amano:
Mapping-Aware Kernel Partitioning Method for CGRAs Assisted by Deep Learning. 1213-1230 - YuAng Chen, Yeh-Ching Chung:
Workload Balancing via Graph Reordering on Multicore Systems. 1231-1245 - Junsong Fu, Na Wang, Baojiang Cui, Bharat K. Bhargava:
A Practical Framework for Secure Document Retrieval in Encrypted Cloud File Systems. 1246-1261 - Zhu Jin, Wen-Kang Jia:
DH-SVRF: A Reconfigurable Unicast/Multicast Forwarding for High-Performance Packet Forwarding Engines. 1262-1275
Volume 33, Number 6, June 2022
- Kiril Dichev, Daniele De Sensi, Dimitrios S. Nikolopoulos, Kirk W. Cameron, Ivor T. A. Spence:
Power Log'n'Roll: Power-Efficient Localized Rollback for MPI Applications Using Message Logging Protocols. 1276-1288 - Feng Zhang, Erkang Xue, Ruixin Guo, Guangzhi Qu, Gansen Zhao, Albert Y. Zomaya:
DS-ADMM++: A Novel Distributed Quantized ADMM to Speed up Differentially Private Matrix Factorization. 1289-1302 - Tsung-Wei Huang, Dian-Lun Lin, Chun-Xun Lin, Yibo Lin:
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System. 1303-1320 - John Augustine, Keerti Choudhary, Avi Cohen, David Peleg, Sumathi Sivasubramaniam, Suman Sourav:
Distributed Graph Realizations. 1321-1337 - Maciej Kokocinski, Tadeusz Kobus, Pawel T. Wojciechowski:
On Mixing Eventual and Strong Consistency: Acute Cloud Types. 1338-1356 - Yuxuan Li, Lin Gan, Mingcheng Chen, Yaojian Chen, Haitian Lu, Chao-Yang Lu, Jian-Wei Pan, Haohuan Fu, Guangwen Yang:
Benchmarking 50-Photon Gaussian Boson Sampling on the Sunway TaihuLight. 1357-1372 - Peijin Cong, Zhixing Zhang, Junlong Zhou, Xin Liu, Yao Liu, Tongquan Wei:
Customer Adaptive Resource Provisioning for Long-Term Cloud Profit Maximization under Constrained Budget. 1373-1392 - Haotian Wu, Zhe Peng, Songtao Guo, Yuanyuan Yang, Bin Xiao:
VQL: Efficient and Verifiable Cloud Query Services for Blockchain Systems. 1393-1406 - Kwangsung Oh, Minmin Zhang, Abhishek Chandra, Jon B. Weissman:
Network Cost-Aware Geo-Distributed Data Analytics System. 1407-1420 - Ziliang Wang, Xiaohong Zhang, Meng Yan, Ling Xu, Dan Yang:
HSA-Net: Hidden-State-Aware Networks for High-Precision QoS Prediction. 1421-1435 - Brian R. Tauro, Conghao Liu, Kyle C. Hale:
Modeling Speedup in Multi-OS Environments. 1436-1450 - Chen Zhao, Wu Gao, Feiping Nie, Huiyang Zhou:
A Survey of GPU Multitasking Methods Supported by Hardware Architecture. 1451-1463 - Ronan-Alexandre Cherrueau, Marie Delavergne, Alexandre van Kempen, Adrien Lebre, Dimitri Pertin, Javier Rojas Balderrama, Anthony Simonet, Matthieu Simonin:
EnosLib: A Library for Experiment-Driven Research in Distributed Computing. 1464-1477 - Abhishek Kumar Jain, Douglas L. Maskell, Suhaib A. Fahmy:
Coarse Grained FPGA Overlay for Rapid Just-In-Time Accelerator Compilation. 1478-1490