


default search action
34th IPDPS 2020: New Orleans, LA, USA
- 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), New Orleans, LA, USA, May 18-22, 2020. IEEE 2020, ISBN 978-1-7281-6876-0

- Mark Clark, Yingping Chen, Avinash Karanth

, Dongsheng Brian Ma, Ahmed Louri:
DozzNoC: Reducing Static and Dynamic Energy in NoCs with Low-latency Voltage Regulators using Machine Learning. 1-11 - Vidushi Goyal, Xiaowei Wang, Valeria Bertacco, Reetuparna Das

:
Neksus: An Interconnect for Heterogeneous System-In-Package Architectures. 12-21 - Yunfan Li, Lizhong Chen:

Accelerated Reply Injection for Removing NoC Bottleneck in GPGPUs. 22-31 - Jahanzeb Maqbool Hashmi, Shulei Xu, Bharath Ramesh, Mohammadreza Bayatpour, Hari Subramoni, Dhabaleswar K. D. K. Panda:

Machine-agnostic and Communication-aware Designs for MPI on Emerging Architectures. 32-41 - Zhirong Shen, Jiwu Shu, Zhijie Huang

, Yingxun Fu:
ClusterSR: Cluster-Aware Scattered Repair in Erasure-Coded Storage. 42-51 - Jay F. Lofstead

, John Mitchell
, Enze Chen
:
Stitch It Up: Using Progressive Data Storage to Scale Science. 52-61 - Hariharan Devarajan, Anthony Kougkas, Xian-He Sun:

HFetch: Hierarchical Data Prefetching for Scientific Workflows in Multi-Tiered Storage Environments. 62-72 - Michael R. Wyatt II, Stephen Herbein, Kathleen Shoga, Todd Gamblin, Michela Taufer

:
CanarIO: Sounding the Alarm on IO-Related Performance Degradation. 73-83 - Vishwesh Jatala, Roshan Dathathri, Gurbinder Gill, Loc Hoang, V. Krishna Nandivada, Keshav Pingali:

A Study of Graph Analytics for Massive Datasets on Distributed Multi-GPUs. 84-94 - Hang Cao, Liang Yuan, He Zhang, Baodong Wu, Shigang Li, Pengqi Lu, Yunquan Zhang, Yongjun Xu, Minghua Zhang:

A Highly Efficient Dynamical Core of Atmospheric General Circulation Model based on Leap-Format. 95-104 - Sian Jin

, Pascal Grosset
, Christopher M. Biwer, Jesus Pulido, Jiannan Tian
, Dingwen Tao
, James P. Ahrens
:
Understanding GPU-Based Lossy Compression for Extreme-Scale Cosmological Simulations. 105-115 - Oguz Selvitopi, Md Taufique Hussain, Ariful Azad, Aydin Buluç

:
Optimizing High Performance Markov Clustering for Pre-Exascale Architectures. 116-126 - Yukun Cheng, Xiaotie Deng, Yuhao Li

:
Tightening Up the Incentive Ratio for Resource Sharing Over the Rings. 127-136 - Timo Bingmann, Peter Sanders, Matthias Schimek

:
Communication-Efficient String Sorting. 137-147 - Tianchen Ding, Shiyou Qian, Jian Cao, Guangtao Xue, Minglu Li:

SCSL: Optimizing Matching Algorithms to Improve Real-time for Content-based Pub/Sub Systems. 148-157 - John Augustine, Keerti Choudhary

, Avi Cohen, David Peleg, Sumathi Sivasubramaniam, Suman Sourav
:
Distributed Graph Realizations †. 158-167 - Sang Wook Stephen Do, Michel Dubois:

Transaction-Based Core Reliability. 168-179 - Seung-Hwan Lim, Ross G. Miller, Sudharshan S. Vazhkudai:

Understanding the Interplay between Hardware Errors and User Job Characteristics on the Titan Supercomputer. 180-190 - Han Qiu, Chentao Wu, Jie Li

, Minyi Guo, Tong Liu
, Xubin He
, Yuanyuan Dong, Yafei Zhao:
EC-Fusion: An Efficient Hybrid Erasure Coding Framework to Improve Both Application and Recovery Performance in Cloud Storage Systems. 191-201 - Tang Liu, Baijun Wu, Wenzheng Xu, Xianbo Cao, Jian Peng, Hongyi Wu:

Learning an Effective Charging Scheme for Mobile Devices. 202-211 - Cong Wang, Xin Wei, Pengzhan Zhou:

Optimize Scheduling of Federated Learning on Battery-powered Mobile Devices. 212-221 - Evangelos Georganas, Kunal Banerjee, Dhiraj D. Kalamkar, Sasikanth Avancha, Anand Venkat, Michael J. Anderson, Greg Henry, Hans Pabst, Alexander Heinecke:

Harnessing Deep Learning via a Single Building Block. 222-233 - Yufeng Zhan, Peng Li, Song Guo:

Experience-Driven Computational Resource Allocation of Federated Learning by Deep Reinforcement Learning. 234-243 - Jiepeng Zhang, Jingwei Sun, Wenju Zhou, Guangzhong Sun:

An Active Learning Method for Empirical Modeling in Performance Tuning. 244-253 - Bin Dong, Verónica Rodríguez Tribaldos

, Xin Xing, Suren Byna
, Jonathan Ajo-Franklin
, Kesheng Wu
:
DASSA: Parallel DAS Data Storage and Analysis for Subsurface Event Detection. 254-263 - Mahesh Balasubramanian, Trevor D. Ruiz

, Brandon Cook, Prabhat, Sharmodeep Bhattacharyya, Aviral Shrivastava
, Kristofer E. Bouchard:
Scaling of Union of Intersections for Inference of Granger Causal Networks from Observational Data. 264-273 - Xiaodong Yu, Fengguo Wei, Xinming Ou, Michela Becchi, Tekin Bicer, Danfeng Daphne Yao

:
GPU-Based Static Data-Flow Analysis for Fast and Scalable Android App Vetting. 274-284 - Dongyu Lu, Yuben Qu, Fan Wu, Haipeng Dai

, Chao Dong, Guihai Chen
:
Robust Server Placement for Edge Computing. 285-294 - Yoonsung Nam, Yongjun Choi, Byeonghun Yoo, Hyeonsang Eom, Yongseok Son:

EdgeIso: Effective Performance Isolation for Edge Devices. 295-305 - Runtian Ren, Xueyan Tang:

Busy-Time Scheduling on Heterogeneous Machines. 306-315 - Evripidis Bampis

, Konstantinos Dogeas, Alexander V. Kononov, Giorgio Lucarelli
, Fanny Pascual:
Scheduling Malleable Jobs Under Topological Constraints. 316-325 - Cheng Li, Abdul Dakkak, Jinjun Xiong

, Wei Wei, Lingjie Xu, Wen-Mei Hwu:
XSP: Across-Stack Profiling and Analysis of Machine Learning Models on GPUs. 326-327 - Ricardo Nobre, Aleksandar Ilic

, Sergio Santander-Jiménez
, Leonel Sousa
:
Exploring the Binary Precision Capabilities of Tensor Cores for Epistasis Detection. 338-347 - Pantea Zardoshti, Michael F. Spear

, Aida Vosoughi, Garret Swart:
Understanding and Improving Persistent Transactions on Optane™ DC Memory. 348-357 - Mengqian Zhang, Jichen Li, Zhaohua Chen, Hongyin Chen, Xiaotie Deng:

CycLedger: A Scalable and Secure Parallel Protocol for Distributed Ledger via Sharding. 358-367 - Jianshu Liu, Shungeng Zhang

, Qingyang Wang, Jinpeng Wei:
Mitigating Large Response Time Fluctuations through Fast Concurrency Adapting in Clouds. 368-377 - Yinggen Xu, Liu Liu, Zhijun Ding:

DAG-Aware Joint Task Scheduling and Cache Management in Spark Clusters. 378-387 - Tim Shaffer

, Nicholas L. Hazekamp, Jakob Blomer, Douglas Thain:
Solving the Container Explosion Problem for Distributed High Throughput Computing. 388-398 - Zijun Li

, Quan Chen, Shuai Xue, Tao Ma, Yong Yang, Zhuo Song, Minyi Guo:
Amoeba: QoS-Awareness and Reduced Resource Usage of Microservices with Serverless Computing. 399-408 - Zhao Zhang, Lei Huang, J. Gregory Pauloski, Ian T. Foster:

Efficient I/O for Neural Network Training with Compressed Data. 409-418 - Jun Yi, Chengliang Zhang, Wei Wang, Cheng Li, Feng Yan:

Not All Explorations Are Equal: Harnessing Heterogeneous Profiling Cost for Efficient MLaaS Training. 419-428 - Saeed Soori, Bugra Can, Mert Gürbüzbalaban, Maryam Mehri Dehnavi:

ASYNC: A Cloud Engine with Asynchrony and History for Distributed Machine Learning. 429-439 - Cheng Li, Abdul Dakkak, Jinjun Xiong

, Wen-Mei Hwu:
Benanza: Automatic μBenchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs. 440-450 - Debashis Ganguly, Ziyu Zhang, Jun Yang, Rami G. Melhem:

Adaptive Page Migration for Irregular Data-intensive Applications under GPU Memory Oversubscription. 451-461 - Alberto Zeni

, Giulia Guidi, Marquita Ellis, Nan Ding, Marco D. Santambrogio, Steven A. Hofmeyr, Aydin Buluç
, Leonid Oliker, Katherine A. Yelick
:
LOGAN: High-Performance GPU-Based X-Drop Long-Read Alignment. 462-471 - Qi Yu, Bruce R. Childers, Libo Huang, Cheng Qian, Hui Guo, Zhiying Wang:

Coordinated Page Prefetch and Eviction for Memory Oversubscription Management in GPUs. 472-482 - Lingqi Zhang

, Mohamed Wahib, Haoyu Zhang, Satoshi Matsuoka:
A Study of Single and Multi-device Synchronization Methods in Nvidia GPUs. 483-493 - Lili Gao, Fangyu Zheng, Niall Emmart, Jiankuo Dong, Jingqiang Lin, Charles C. Weems:

DPF-ECC: Accelerating Elliptic Curve Cryptography with Floating-Point Computing Power of GPUs. 494-504 - François-Henry Rouet, Cleve Ashcraft, Jef Dawson, Roger Grimes, Erman Guleryuz, Seid Koric, Robert F. Lucas, James S. Ong, Todd A. Simons, Ting-Ting Zhu:

Scalability Challenges of an Industrial Implicit Finite Element Code. 505-514 - Gregory D. Abram, Vignesh Adhinarayanan, Wu-chun Feng, David H. Rogers, James P. Ahrens

:
ETH: An Architecture for Exploring the Design Space of In-situ Scientific Visualization. 515-526 - Alexander van der Grinten, Henning Meyerhenke:

Scaling Betweenness Approximation to Billions of Edges by MPI-based Adaptive Sampling. 527-535 - Haoyu Wang, Haiying Shen, Charles Reiss, Arnim Jain, Yunqiao Zhang:

Improved Intermediate Data Management for MapReduce Frameworks. 536-545 - David Gureya, João Neto, Reza Karimi, João Barreto

, Pramod Bhatotia, Vivien Quéma, Rodrigo Rodrigues, Paolo Romano
, Vladimir Vlassov:
Bandwidth-Aware Page Placement in NUMA. 546-556 - Hariharan Devarajan, Anthony Kougkas, Luke Logan, Xian-He Sun:

HCompress: Hierarchical Data Compression for Multi-Tiered Storage Environments. 557-566 - Robert Underwood

, Sheng Di, Jon C. Calhoun, Franck Cappello:
FRaZ: A Generic High-Fidelity Fixed-Ratio Lossy Compression Framework for Scientific Floating-point Data. 567-577 - Nadja Holtryd, Madhavan Manivannan, Per Stenström, Miquel Pericàs:

DELTA: Distributed Locality-Aware Cache Partitioning for Tile-based Chip Multiprocessors. 578-589 - Mehrzad Nejat, Madhavan Manivannan, Miquel Pericàs, Per Stenström:

Coordinated Management of Processor Configuration and Cache Partitioning to Optimize Energy under QoS Constraints. 590-601 - Wenjie Liu, Ping Huang, Xubin He

:
StragglerHelper: Alleviating Straggling in Computing Clusters via Sharing Memory Access Patterns. 602-611 - Nicholas Buoncristiani, Sanjana Shah, David Donofrio, John Shalf

:
Evaluating the Numerical Stability of Posit Arithmetic. 612-621 - Ignacio Laguna:

Varity: Quantifying Floating-Point Variations in HPC Systems Through Randomized Testing. 622-633 - Da Yan, Wei Wang, Xiaowen Chu

:
Demystifying Tensor Cores to Optimize Half-Precision Matrix Multiply. 634-643 - Yuchen Li, Weifa Liang

, Wenzheng Xu, Xiaohua Jia:
Data Collection of IoT Devices Using an Energy-Constrained UAV. 644-653 - Qian Zhou

, Omkant Pandey, Fan Ye:
Argus: Multi-Level Service Visibility Scoping for Internet-of-Things in Enterprise Environments. 654-663 - Laphou Lao, Xiaohai Dai, Bin Xiao

, Songtao Guo:
G-PBFT: A Location-based and Scalable Consensus Protocol for IoT-Blockchain Applications. 664-673 - Giuseppe Antonio Di Luna, Emmanuelle Anceaume, Leonardo Querzoni

:
Byzantine Generalized Lattice Agreement. 674-683 - Yu Huang, Long Zheng, Pengcheng Yao

, Jieshan Zhao, Xiaofei Liao, Hai Jin, Jingling Xue
:
A Heterogeneous PIM Hardware-Software Co-Design for Energy-Efficient Graph Processing. 684-695 - Long Zheng, Jieshan Zhao, Yu Huang, Qinggang Wang, Zhen Zeng, Jingling Xue

, Xiaofei Liao, Hai Jin:
Spara: An Energy-Efficient ReRAM-Based Accelerator for Sparse Graph Analytics Applications. 696-707 - Zhijie Huang

, Hong Jiang, Zhirong Shen, Hao Che, Nong Xiao, Ning Li:
Optimal Encoding and Decoding Algorithms for the RAID-6 Liberation Codes. 708-717 - Pu Pang, Quan Chen, Deze Zeng, Chao Li, Jingwen Leng, Wenli Zheng, Minyi Guo:

Sturgeon: Preference-aware Co-location for Improving Utilization of Power Constrained Computers. 718-727 - Yu-Hang Tang

, Oguz Selvitopi, Doru-Thom Popovici, Aydin Buluç
:
A High-Throughput Solver for Marginalized Graph Kernels on GPU. 728-738 - Muhammad A. Awad

, Saman Ashkiani, Serban D. Porumbescu, John D. Owens:
Dynamic Graphs on the GPU. 739-748 - Lucas Erlandson, Difeng Cai

, Yuanzhe Xi, Edmond Chow:
Accelerating Parallel Hierarchical Matrix-Vector Products via Data-Driven Sampling. 749-758 - Changyong Hu, Vijay K. Garg:

NC Algorithms for Popular Matchings in One-Sided Preference Systems and Related Problems. 759-768 - Jiechao Gao

, Haoyu Wang, Haiying Shen:
Smartly Handling Renewable Energy Instability in Supporting A Cloud Datacenter. 769-778 - Vinodh Kumaran Jayakumar, Jaewoo Lee, In Kee Kim, Wei Wang

:
A Self-Optimized Generic Workload Prediction Framework for Cloud Computing. 779-788 - Ivana Marincic, Venkatram Vishwanath, Henry Hoffmann:

SeeSAw: Optimizing Performance of In-Situ Analytics Applications under Power Constraints. 789-798 - Tirthak Patel

, Adam Wagenhäuser, Christopher Eibel, Timo Hönig, Thomas Zeiser, Devesh Tiwari:
What does Power Consumption Behavior of HPC Jobs Reveal? : Demystifying, Quantifying, and Predicting Power Consumption Characteristics. 799-809 - Jie Yang

, Satish Puri
:
Efficient Parallel and Adaptive Partitioning for Load-balancing in Spatial Join. 810-820 - Xin Wang, Misbah Mubarak, Yao Kang, Robert B. Ross, Zhiling Lan:

Union: An Automatic Workload Manager for Accelerating Network Simulation. 821-830 - Harshitha Menon, Abhinav Bhatele, Todd Gamblin:

Auto-tuning Parameter Choices in HPC Applications using Bayesian Optimization. 831-840 - Zhihui Du, Xinning Hui, Yurui Wang, Jun Jiang, Jason Liu

, Baokun Lu, Chongyu Wang:
Inter-Job Scheduling of High-Throughput Material Screening Applications. 841-852 - Ana Gainaru, Brice Goglin, Valentin Honoré, Guillaume Pallez Aupy, Padma Raghavan, Yves Robert

, Hongyang Sun:
Reservation and Checkpointing Strategies for Stochastic Jobs. 853-863 - Shikha Singh, Sergey Madaminov, Michael A. Bender, Michael Ferdman, Ryan Johnson, Benjamin Moseley, Hung Q. Ngo, Dung Nguyen, Soeren Olesen, Kurt Stirewalt, Geoffrey Washburn:

A Scheduling Approach to Incremental Maintenance of Datalog Programs. 864-873 - Costas Busch, Maurice Herlihy, Miroslav Popovic

, Gokarna Sharma:
Dynamic Scheduling in Distributed Transactional Memory. 874-883 - Marcus Ritter

, Alexandru Calotoiu, Sebastian Rinke, Thorsten Reimann
, Torsten Hoefler, Felix Wolf:
Learning Cost-Effective Sampling Strategies for Empirical Performance Modeling. 884-895 - Abhinav Bhatele, Jayaraman J. Thiagarajan, Taylor L. Groves, Rushil Anirudh, Staci A. Smith, Brandon Cook, David K. Lowenthal:

The Case of Performance Variability on Dragonfly-based Systems. 896-905 - Donghe Kang, Oliver Rübel, Suren Byna

, Spyros Blanas:
Predicting and Comparing the Performance of Array Management Libraries. 906-915 - Ivy Bo Peng

, Kai Wu, Jie Ren
, Dong Li, Maya B. Gokhale:
Demystifying the Performance of HPC Scientific Applications on NVM-based Memory Systems. 916-925 - Rui Xia, Haipeng Dai

, Jiaqi Zheng, Hong Xu
, Meng Li, Guihai Chen
:
Packet-in Request Redirection for Minimizing Control Plane Response Time. 926-935 - Chao Tian, Lingxiao Ma, Zhi Yang, Yafei Dai:

PCGCN: Partition-Centric Processing for Accelerating Graph Convolutional Network. 936-945 - Guiyan Liu, Songtao Guo, Pan Li, Liang Liu

:
ConMidbox: Consolidated Middleboxes Selection and Routing in SDN/NFV-Enabled Networks. 946-955 - Gustavo Chávez, Yang Liu, Pieter Ghysels, Xiaoye Sherry Li, Elizaveta Rebrova

:
Scalable and Memory-Efficient Kernel Ridge Regression. 956-965 - Renping Liu, Xianzhang Chen

, Yujuan Tan, Runyu Zhang, Liang Liang, Duo Liu:
SSDKeeper: Self-Adapting Channel Allocation to Improve the Performance of SSD Devices. 966-975 - Madhurima Ray

, Krishna Kant, Peng Li, Sanjeev Trika:
FlashKey: A High-Performance Flash Friendly Key-Value Store. 976-985 - Yubo Liu, Yutong Lu, Zhiguang Chen, Ming Zhao:

Pacon: Improving Scalability and Efficiency of Metadata Service through Partial Consistency. 986-996 - Peter Pirkelbauer, Pei-Hung Lin

, Tristan Vanderbruggen, Chunhua Liao
:
XPlacer: Automatic Analysis of Data Access Patterns on Heterogeneous CPU/GPU Systems. 997-1007 - João P. L. de Carvalho

, Bruno C. Honorio, Alexandro Baldassin
, Guido Araujo:
Improving Transactional Code Generation via Variable Annotation and Barrier Elision. 1008-1017 - Hancheng Wu, Michela Becchi:

Evaluating Thread Coarsening and Low-cost Synchronization on Intel Xeon Phi. 1018-1029 - André Müller

, Bertil Schmidt
, Andreas Hildebrandt, Richard Membarth, Roland Leißa
, Matthis Kruse
, Sebastian Hack:
AnySeq: A High Performance Sequence Alignment Library based on Partial Evaluation. 1030-1040 - Lionel Eyraud-Dubois, Suraj Kumar:

Analysis of a List Scheduling Algorithm for Task Graphs on Two Types of Resources. 1041-1050 - Rory Hector, Ramachandran Vaidyanathan, Gokarna Sharma, Jerry L. Trahan

:
Optimal Convex Hull Formation on a Grid by Asynchronous Robots with Lights. 1051-1060 - Alberto Marchetti-Spaccamela

, Nicole Megow, Jens Schlöter, Martin Skutella, Leen Stougie:
On the Complexity of Conditional DAG Scheduling in Multiprocessor Systems. 1061-1070 - Xin Sunny Huang, Yiting Xia, T. S. Eugene Ng:

Weaver: Efficient Coflow Scheduling in Heterogeneous Parallel Networks. 1071-1081 - Diyu Zhou, Yuval Tamir:

Fault-Tolerant Containers Using NiLiCon. 1082-1091 - Anwesha Das, Frank Mueller, Barry Rountree:

Aarohi: Making Real-Time Node Failure Prediction Feasible. 1092-1101 - Pinchao Liu, Hailu Xu

, Dilma Da Silva, Qingyang Wang, Sarker Tanzir Ahmed, Liting Hu:
FP4S: Fragment-based Parallel State Recovery for Stateful Stream Applications. 1102-1111 - Maxime France-Pillois, Jérôme Martin, Frédéric Rousseau:

Implementation and Evaluation of a Hardware Decentralized Synchronization Lock for MPSoCs. 1112-1121 - Maciej Besta, Raghavendra Kanakagiri

, Harun Mustafa
, Mikhail Karasikov, Gunnar Rätsch, Torsten Hoefler, Edgar Solomonik:
Communication-Efficient Jaccard similarity for High-Performance Distributed Genome Comparisons. 1122-1132 - Kyle Berney

, Nodari Sitchinava:
Engineering Worst-Case Inputs for Pairwise Merge Sort on GPUs. 1133-1142 - Karolos Antoniadis, Diego Didona, Rachid Guerraoui

, Willy Zwaenepoel:
The Impossibility of Fast Transactions. 1143-1154

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














