


ICPP 2020: Edmonton, AB, Canada
- José Nelson Amaral, Lizy Kurian John, Xipeng Shen:
  ICPP 2020: 49th International Conference on Parallel Processing, Edmonton, AB, Canada, August 17-20, 2020. ACM 2020, ISBN 978-1-4503-8816-0
Best-Paper Candidates
- Naoya Yamamoto, Koji Nakano, Yasuaki Ito, Daisuke Takafuji, Akihiko Kasagi, Tsuguchika Tabaru:
  Huffman Coding with Gap Arrays for GPU Acceleration. 1:1-1:11
- Jiya Su, Feng Zhang, Weifeng Liu, Bingsheng He, Ruofan Wu, Xiaoyong Du, Rujia Wang:
  CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs. 2:1-2:11
- Jianting Zhang, Zicong Hong, Xiaoyu Qiu, Yufeng Zhan, Song Guo, Wuhui Chen:
  SkyChain: A Deep Reinforcement Learning-Empowered Dynamic Blockchain Sharding System. 3:1-3:11
- Taha Atahan Akyildiz, Amro Alabsi Aljundi, Kamer Kaya:
  GOSH: Embedding Big Graphs on Small Hardware. 4:1-4:11
Distributed Systems
- Shangming Cai, Dongsheng Wang, Zhanye Wang, Haixia Wang:
  CARD: A Congestion-Aware Request Dispatching Scheme for Replicated Metadata Server Cluster. 5:1-5:11
- Chris Kjellqvist, Mohammad Hedayati, Michael L. Scott:
  Safe, Fast Sharing of memcached as a Protected Library. 6:1-6:8
- Ziyi Zhao, Zhang Jiang, Ximing Liu, Xiaoli Gong, Wenwen Wang, Pen-Chung Yew:
  DQEMU: A Scalable Emulator with Retargetable DBT on Distributed Platforms. 7:1-7:11
Edge Learning and Inference
- Jae-Won Chung, Jae-Yun Kim, Soo-Mook Moon:
  ShadowTutor: Distributed Partial Distillation for Mobile Video DNN Inference. 8:1-8:11
- Yeting Guo, Fang Liu, Zhiping Cai, Li Chen, Nong Xiao:
  FEEL: A Federated Edge Learning System for Efficient and Privacy-Preserving Mobile Healthcare. 9:1-9:11
- Sai Qian Zhang, Jieyu Lin, Qi Zhang:
  Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN. 10:1-10:11
Memory Systems
- Jianming Huang, Yu Hua, Pengfei Zuo, Wen Zhou, Fangting Huang:
  An Efficient Wear-level Architecture using Self-adaptive Wear Leveling. 11:1-11:11
- Xueliang Wei, Dan Feng, Wei Tong, Jingning Liu, Chengning Wang, Liuqing Ye:
  CCHL: Compression-Consolidation Hardware Logging for Efficient Failure-Atomic Persistent Memory Updates. 12:1-12:11
- Shanjiang Tang, Qifei Chai, Ce Yu, Yusen Li, Chao Sun:
  Balancing Fairness and Efficiency for Cache Sharing in Semi-external Memory System. 13:1-13:11
Fault-Tolerance
- Carlos Pachajoa, Christina Pacher, Markus Levonyak, Wilfried N. Gansterer:
  Algorithm-Based Checkpoint-Recovery for the Conjugate Gradient Method. 14:1-14:11
- Yishu Du, Loris Marchal, Guillaume Pallez Aupy, Yves Robert:
  Robustness of the Young/Daly formula for stochastic iterative applications. 15:1-15:11
- Li Han, Yiqin Gao, Jing Liu, Yves Robert, Frédéric Vivien:
  Energy-aware strategies for reliability-oriented real-time task allocation on heterogeneous platforms. 16:1-16:11
Scheduling and Placement in Networks
- Chi Lin, Ziwei Yang, Yu Sun, Jing Deng, Lei Wang, Guowei Wu:
  Cooperative Game for Multiple Chargers with Dynamic Network Topology. 17:1-17:10
- Yang Chen, Jie Wu, Bo Ji:
  Optimizing Flow Bandwidth Consumption with Traffic-diminishing Middlebox Placement. 18:1-18:10
- Yang Shi, Mei Wen, Chunyuan Zhang:
  Towards High-Efficiency Data Centers via Job-Aware Network Scheduling. 19:1-19:10
Systems for Machine Learning
- Lipeng Wang, Songgao Ye, Baichen Yang, Youyou Lu, Hequan Zhang, Shengen Yan, Qiong Luo:
  DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training. 20:1-20:11
- Abeda Sultana, Li Chen, Fei Xu, Xu Yuan:
  E-LAS: Design and Analysis of Completion-Time Agnostic Scheduling for Distributed Deep Learning Cluster. 21:1-21:11
- Zheng Chen, Feng Zhang, Amelie Chi Zhou, Jidong Zhai, Chenyang Zhang, Xiaoyong Du:
  ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs. 22:1-22:11
Graph Processing and Concurrent Data Structures
- Somesh Singh, Rupesh Nasre:
  Graffix: Efficient Graph Processing with a Tinge of GPU-Specific Approximations. 23:1-23:11
- Matthew Rodriguez, Michael F. Spear:
  Optimizing Linearizable Bulk Operations on Data Structures. 24:1-24:10
- Feng Sheng, Qiang Cao, Hong Jiang, Jie Yao:
  GraBi: Communication-Efficient and Workload-Balanced Partitioning for Bipartite Graphs. 25:1-25:11
Large-Scale Applications on Supercomputers
- Xinyuan Li, Huang Ye, Jian Zhang:
  Large-scale Simulations of Peridynamics on Sunway Taihulight Supercomputer. 26:1-26:11
- Sudip K. Seal, Seung-Hwan Lim, Dali Wang, Jacob D. Hinkle, Dalton D. Lunga, Aristeidis Tsaris:
  Toward Large-Scale Image Segmentation on Summit. 27:1-27:11
- Kai Xu, Xiaohui Duan, Xiangxu Meng, Xin Li, Bertil Schmidt, Weiguo Liu:
  SWMapper: Scalable Read Mapper on SunWay TaihuLight. 28:1-28:10
Machine Learning for Computing
- Xueying Zhang, Ruiting Zhou, Zhi Zhou, John C. S. Lui, Zongpeng Li:
  An Online Learning-Based Task Offloading Framework for 5G Small Cell Networks. 29:1-29:11
- Haoyu Wang, Haiying Shen, Qi Liu, Kevin Zheng, Jie Xu:
  A Reinforcement Learning Based System for Minimizing Cloud Storage Service Cost. 30:1-30:10
- Zixia Liu, Liqiang Wang, Gang Quan:
  Deep Reinforcement Learning based Elasticity-compatible Heterogeneous Resource Management for Time-critical Computing. 31:1-31:11
Performance Tools and Methodology
- Girish Mururu, Kaushik Ravichandran, Ada Gavrilovska, Santosh Pande:
  Generating Robust Parallel Programs via Model Driven Prediction of Compiler Optimizations for Non-determinism. 32:1-32:12
- Wei Liu, Yifan Gong, Hao Wu, Jidong Zhai, Jiangming Jin:
  Memory-Centric Communication Mechanism for Real-time Autonomous Navigation Applications. 33:1-33:11
- Christian Helm, Kenjiro Taura:
  Automatic Identification and Precise Attribution of DRAM Bandwidth Contention. 34:1-34:11
Storage Reliability & Memory Security
- Zizhong Wang, Haixia Wang, Airan Shao, Dongsheng Wang:
  An Adaptive Erasure-Coded Storage Scheme with an Efficient Code-Switching Algorithm. 35:1-35:11
- Kartik Ramkrishnan, Stephen McCamant, Pen-Chung Yew, Antonia Zhai:
  First Time Miss: Low Overhead Mitigation for Shared Memory Cache Side Channels. 36:1-36:11
- Tong Liu, Shakeel Alibhai, Xubin He:
  A Rack-Aware Pipeline Repair Scheme for Erasure-Coded Distributed Storage Systems. 37:1-37:11
Supporting Efficient Machine Learning
- Qingchang Han, Yongmin Hu, Fengwei Yu, Hailong Yang, Bing Liu, Peng Hu, Ruihao Gong, Yanfei Wang, Rui Wang, Zhongzhi Luan, Depei Qian:
  Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures. 38:1-38:12
- Jan Hückelheim, Michel Schanen, Sri Hari Krishna Narayanan, Paul D. Hovland:
  Vector Forward Mode Automatic Differentiation on SIMD/SIMT architectures. 39:1-39:11
- Zhenbo Hu, Xiangyu Zou, Wen Xia, Sian Jin, Dingwen Tao, Yang Liu, Weizhe Zhang, Zheng Zhang:
  Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity. 40:1-40:12
Data Center Networking
- Jinbin Hu, Jiawei Huang, Zhaoyi Li, Jianxin Wang, Tian He:
  AMRT: Anti-ECN Marking to Improve Utilization of Receiver-driven Transmission in Data Center. 41:1-41:10
- Wanchun Jiang, Kaiqin Liao, Yulong Yan, Jianxin Wang:
  PS: Periodic Strategy for the 40-100Gbps Energy Efficient Ethernet. 42:1-42:10
- Chang Ruan, Jianxin Wang, Wanchun Jiang, Tao Zhang:
  Polo: Receiver-Driven Congestion Control for Low Latency over Commodity Network Fabric. 43:1-43:10
Parallel Algorithms I
- Jesmin Jahan Tithi, Andrzej Stasiak, Sriram Aananthakrishnan, Fabrizio Petrini:
  Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge Pruning. 44:1-44:11
- Ashirbad Mishra, Shad Kirmani, Kamesh Madduri:
  Fast Spectral Graph Layout on Multicore Platforms. 45:1-45:11
- Tarequl Islam Sifat, Nirmal Prajapati, Sanjay V. Rajopadhye:
  Revisiting Sparse Dynamic Programming for the 0/1 Knapsack Problem. 46:1-46:10
Parallel and Distributed Machine Learning
- Junyu Li, Ligang He, Shenyuan Ren, Rui Mao:
  Developing a Loss Prediction-based Asynchronous Stochastic Gradient Descent Algorithm for Distributed Training of Deep Neural Networks. 47:1-47:10
- Canh T. Dinh, Nguyen Hoang Tran, Tuan Dung Nguyen, Wei Bao, Albert Y. Zomaya, Bing Bing Zhou:
  Federated Learning with Proximal Stochastic Variance Reduced Gradient Algorithms. 48:1-48:11
- Zijie Yan, Danyang Xiao, Mengqiang Chen, Jieying Zhou, Weigang Wu:
  Dual-Way Gradient Sparsification for Asynchronous Distributed Deep Learning. 49:1-49:10
Heterogeneous Systems
- Matthew Agostini, Francis O'Brien, Tarek S. Abdelrahman:
  Balancing Graph Processing Workloads Using Work Stealing on Heterogeneous CPU-FPGA Systems. 50:1-50:12
- Juan Carlos Saez, Fernando Castro, Manuel Prieto-Matías:
  Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors. 51:1-51:11
- Pengfei Zou, Ang Li, Kevin J. Barker, Rong Ge:
  Detecting Anomalous Computation with RNNs on GPU-Accelerated HPC Machines. 52:1-52:11
Performance Evaluation and Characterization
- Adrian Munera, Sara Royuela, Germán Llort, Estanislao Mercadal, Franck Wartel, Eduardo Quiñones:
  Experiences on the characterization of parallel applications in embedded systems with Extrae/Paraver. 53:1-53:11
- Pablo Prieto, Pablo Abad Fidalgo, Jose Angel Herrero, José-Ángel Gregorio, Valentin Puente:
  SPECcast: A Methodology for Fast Performance Evaluation with SPEC CPU 2017 Multiprogrammed Workloads. 54:1-54:11
- Davood Ghatreh Samani, Chavit Denninnart, Josef Bacik, Mohsen Amini Salehi:
  The Art of CPU-Pinning: Evaluating and Improving the Performance of Virtualization and Containerization Platforms. 55:1-55:11
Routing and Mapping in Networks
- Hongyun Gao, Laiping Zhao, Huanbin Wang, Zhao Tian, Lihai Nie, Keqiu Li:
  XShot: Light-weight Link Failure Localization using Crossed Probing Cycles in SDN. 56:1-56:11
- Felix Zahn, Holger Fröning:
  On Network Locality in MPI-Based HPC Applications. 57:1-57:10
- Bo He, Jingyu Wang, Qi Qi, Haifeng Sun, Zirui Zhuang, Cong Liu, Jianxin Liao:
  DeepHop on Edge: Hop-by-hop Routing by Distributed Learning with Semantic Attention. 58:1-58:11
Microarchitecture and Power Management
- Alexandra Angerd, Erik Sintorn, Per Stenström:
  A GPU Register File using Static Data Compression. 59:1-59:10
- Kramer Straube, Jason Lowe-Power, Christopher Nitta, Matthew K. Farrens, Venkatesh Akella:
  HCAPP: Scalable Power Control for Heterogeneous 2.5D Integrated Systems. 60:1-60:11
- Jiaxin Peng, Yousra Al-Kabani, Shuai Sun, Volker J. Sorger, Tarek A. El-Ghazawi:
  DNNARA: A Deep Neural Network Accelerator using Residue Arithmetic and Integrated Photonics. 61:1-61:11
Parallel Algorithms II
- Ryota Yasudo, Koji Nakano, Yasuaki Ito, Masaru Tatekawa, Ryota Katsuki, Takashi Yazane, Yoko Inaba:
  Adaptive Bulk Search: Solving Quadratic Unconstrained Binary Optimization Problems on Multiple GPUs. 62:1-62:11
- Zhengyang Lu, Yuyao Niu, Weifeng Liu:
  Efficient Block Algorithms for Parallel Sparse Triangular Solve. 63:1-63:11
- Shouxi Luo, Pingzhi Fan, Huanlai Xing, Hongfang Yu:
  Selective Coflow Completion for Time-sensitive Distributed Applications with Poco. 64:1-64:10
Resource Management on the Cloud
- Kaiyue Duan, Yusen Li, Trent G. Marbach, Gang Wang, Xiaoguang Liu:
  Improving Load Balance via Resource Exchange in Large-Scale Search Engines. 65:1-65:11
- Iryanto Jaya, Wentong Cai, Yusen Li:
  Rendering Server Allocation for MMORPG Players in Cloud Gaming. 66:1-66:11
- Zhuozhao Li, Tanmoy Sen, Haiying Shen, Mooi Choo Chuah:
  Impact of Memory DoS Attacks on Cloud Applications and Real-Time Detection Schemes. 67:1-67:11
GPU-Accelerated Applications
- David B. Williams-Young, Chao Yang:
  Parallel Shift-Invert Spectrum Slicing on Distributed Architectures with GPU Accelerators. 68:1-68:11
- Martin Krulis, Miroslav Kratochvíl:
  Detailed Analysis and Optimization of CUDA K-means Algorithm. 69:1-69:11
- Ichitaro Yamazaki, Sivasankaran Rajamanickam, Nathan D. Ellingwood:
  Performance Portable Supernode-based Sparse Triangular Solver for Manycore Architectures. 70:1-70:11
Data Centers and the Edge
- Xiaoqing Cai, Jiuchen Shi, Rui Yuan, Chang Liu, Wenli Zheng, Quan Chen, Chao Li, Jingwen Leng, Minyi Guo:
  OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment. 71:1-71:11
- Ahmed Mohamed Abdelmoniem, Hengky Susanto, Brahim Bensaou:
  Reducing Latency in Multi-Tenant Data Centers via Cautious Congestion Watch. 72:1-72:11
- Wei Zhang, Ningxin Zheng, Quan Chen, Yong Yang, Zhuo Song, Tao Ma, Jingwen Leng, Minyi Guo:
  URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds. 73:1-73:11
- Weifa Liang, Yu Ma, Wenzheng Xu, Xiaohua Jia, Sid Chi-Kin Chau:
  Reliability Augmentation of Requests with Service Function Chain Requirements in Mobile Edge-Cloud Networks. 74:1-74:11
Storage and I/O Optimization
- Yuchen Cheng, Chunghsuan Wu, Yanqiang Liu, Rui Ren, Hong Xu, Bin Yang, Zhengwei Qi:
  OPS: Optimized Shuffle Management System for Apache Spark. 75:1-75:11
- Fan Deng, Qiang Cao, Shucheng Wang, Shuyang Liu, Jie Yao, Yuanyuan Dong, Puyuan Yang:
  SeRW: Adaptively Separating Read and Write upon SSDs of Hybrid Storage Server in Clouds. 76:1-76:11
- Vinay Devadas, Matthew Curtis-Maury:
  Scalable Coordination of Hierarchical Parallelism. 77:1-77:11
- Yu Chen, Wei Tong, Dan Feng, Zike Wang:
  Mass: Workload-Aware Storage Policy for OpenStack Swift. 78:1-78:11















