


default search action
45th ICPP 2016: Philadelphia, PA, USA
- 45th International Conference on Parallel Processing, ICPP 2016, Philadelphia, PA, USA, August 16-19, 2016. IEEE Computer Society 2016, ISBN 978-1-5090-2823-8

Session 1A: Data Center and Cloud 1
- Jun Duan, Yuanyuan Yang

:
Efficient Virtual Network Embedding for Variable Size Virtual Machines in Fat-Tree Data Centers. 1-10 - Tingwei Zhu, Dan Feng, Yu Hua, Fang Wang, Qingyu Shi

, Jiahao Liu:
MIC: An Efficient Anonymous Communication System in Data Center Networks. 11-20 - Dian Shen, Junzhou Luo, Fang Dong, Junxue Zhang

:
AppBag: Application-Aware Bandwidth Allocation for Virtual Machines in Cloud Environment. 21-30 - Leonardo Piga, Indrani Paul

, Wei Huang:
Performance Boosting Opportunities under Communication Imbalance in Power-Constrained HPC Clusters. 31-40 - Zhenhua Li, Yuanyuan Yang

:
RRect: A Novel Server-centric Data Center Network with High Availability. 41-46
Session 1B: Architecture 1
- Yi Lin, Po-Chun Huang, Duo Liu, Xiao Zhu, Liang Liang:

Making In-Memory Frequent Pattern Mining Durable and Energy Efficient. 47-56 - Qingda Hu, Jiwu Shu, Jie Fan, Youyou Lu:

Run-Time Performance Estimation and Fairness-Oriented Scheduling Policy for Concurrent GPGPU Applications. 57-66 - Xiaqing Li, Guangyan Zhang, H. Howie Huang

, Zhufan Wang, Weimin Zheng:
Performance Analysis of GPU-Based Convolutional Neural Networks. 67-76 - Shuang Song, Meng Li, Xinnian Zheng, Michael LeBeane, Jee Ho Ryoo, Reena Panda, Andreas Gerstlauer, Lizy K. John:

Proxy-Guided Load Balancing of Graph Processing Workloads on Heterogeneous Clusters. 77-86 - Lei Cui, Zhiyu Hao, Chonghua Wang, Haiqiang Fei, Zhenquan Ding:

Piccolo: A Fast and Efficient Rollback System for Virtual Machine Clusters. 87-92
Session 2A: Parallel Algorithms
- Patrick Mackey, Robert R. Lewis:

Parallel k-Means++ for Multiple Shared-Memory Architectures. 93-102 - Oguz Kaya, Bora Uçar

:
High Performance Parallel Algorithms for the Tucker Decomposition of Sparse Tensors. 103-112 - Moohyeon Nam, Jinwoong Kim, Beomseok Nam:

Parallel Tree Traversal for Nearest Neighbor Query on the GPU. 113-122 - Anne Benoit

, Loic Pottier
, Yves Robert
:
Resilient Application Co-scheduling with Processor Redistribution. 123-132 - Jessica McClintock, Anthony Wirth:

Efficient Parallel Algorithms for k-Center Clustering. 133-138
Session 2B: Architecture 2
- Xin Wang, Xiaofeng Ji, Yunping Lu, Yi Li, Weijia Zhou, Weihua Zhang, Wenyun Zhao:

Understanding the Architectural Characteristics of EDA Algorithms. 139-148 - Jing Wang, Yanjun Liu, Weigong Zhang, Kezhong Lu, Keni Qiu, Xin Fu, Tao Li:

Exploring Variation-Aware Fault-Tolerant Cache under Near-Threshold Computing. 149-158 - Zheng Li, Fang Wang, Dan Feng, Yu Hua, Wei Tong

, Jingning Liu, Xiang Liu:
Tetris Write: Exploring More Write Parallelism Considering PCM Asymmetries. 159-168 - Ping Huang, Wenjie Liu, Kun Tang, Xubin He, Ke Zhou:

ROP: Alleviating Refresh Overheads via Reviving the Memory System in Frozen Cycles. 169-178 - Zhibin Yu, Lieven Eeckhout, Cheng-Zhong Xu

:
Thread Similarity Matrix: Visualizing Branch Divergence in GPGPU Programs. 179-184
Session 3A: Programming Techniques 1
- Sayan Ghosh, Jeff R. Hammond, Antonio J. Peña

, Pavan Balaji, Assefaw Hadish Gebremedhin, Barbara M. Chapman:
One-Sided Interface for Matrix Operations Using MPI-3 RMA: A Case Study with Elemental. 185-194 - Jintao Meng, Sangmin Seo, Pavan Balaji, Yanjie Wei, Bingqiang Wang, Shengzhong Feng:

SWAP-Assembler 2: Optimization of De Novo Genome Assembler at Extreme Scale. 195-204 - Indranil Roy, Ankit Srivastava, Srinivas Aluru:

Programming Techniques for the Automata Processor. 205-210 - Jinsu Park, Woongki Baek:

RCHC: A Holistic Runtime System for Concurrent Heterogeneous Computing. 211-216
Session 3B: Parallel Algorithms 2
- Matthew Graichen, Joseph Izraelevitz, Michael L. Scott

:
An Unbounded Nonblocking Double-Ended Queue. 217-226 - Jian-Jun Han, Xin Tao, Dakai Zhu, Hakan Aydin:

Criticality-Aware Partitioning for Multicore Mixed-Criticality Systems. 227-235 - Dominique LaSalle, George Karypis

:
A Parallel Hill-Climbing Refinement Algorithm for Graph Partitioning. 236-241 - Evangelia A. Sitaridi, René Müller, Tim Kaldewey, Guy M. Lohman, Kenneth A. Ross:

Massively-Parallel Lossless Data Decompression. 242-247
Session 4A: Data Cloud and Cloud 2
- Prasanna Balaprakash

, Vitali A. Morozov, Rajkumar Kettimuthu, Kalyan Kumaran, Ian T. Foster:
Improving Data Transfer Throughput with Direct Search Optimization. 248-257 - Alexandre Denis, François Trahay

:
MPI Overlap: Benchmark and Analysis. 258-267 - Jie Zhang, Xiaoyi Lu, Dhabaleswar K. Panda:

High Performance MPI Library for Container-Based HPC Cloud on InfiniBand Clusters. 268-277 - Rui Han, Siguang Huang, Fei Tang, Fu-Gui Chang, Jianfeng Zhan:

AccuracyTrader: Accuracy-Aware Approximate Processing for Low Tail Latency and High Result Accuracy in Cloud Online Services. 278-287 - Pradeep Subedi, Ping Huang, Tong Liu

, Joseph Moore, Stan Skelton, Xubin He:
CoARC: Co-operative, Aggressive Recovery and Caching for Failures in Erasure Coded Hadoop. 288-293
Session 4B: Cyberphysical Systems 1
- Guoju Gao, Mingjun Xiao, Zhenhua Zhao:

Optimal Multi-taxi Dispatch for Mobile Taxi-Hailing Systems. 294-303 - Jia Liu, Bin Xiao

, Xuan Liu, Lijun Chen:
Fast RFID Polling Protocols. 304-313 - Zongjian He, Daqiang Zhang, Jiannong Cao

, Xuefeng Liu, Xiaopeng Fan, Cheng-Zhong Xu
:
Exploiting Real-Time Traffic Light Scheduling with Taxi Traces. 314-323 - Ankur Sarker, Chenxi Qiu, Haiying Shen, Andrea Gil, Joachim Taiber, Mashrur Chowdhury, Jim Martin, Mac Devine, Andrew J. Rindos:

An Efficient Wireless Power Transfer System to Balance the State of Charge of Electric Vehicles. 324-333 - Huijie Chen

, Fan Li, Yu Wang:
EchoLoc: Accurate Device-Free Hand Localization Using COTS Devices. 334-339
Session 5A: Parallel Algorithms 3
- Koji Nakano

, Daisuke Takafuji, Satoshi Fujita, Hiroki Matsutani, Ikki Fujiwara
, Michihiro Koibuchi:
Randomly Optimized Grid Graph for Low-Latency Interconnection Networks. 340-349 - Davide Frey

, Hicham Lakhlef, Michel Raynal:
Optimal Collision/Conflict-Free Distance-2 Coloring in Wireless Synchronous Broadcast/Receive Tree Networks. 350-359 - Bapi Chatterjee, Ivan Walulya, Philippas Tsigas

:
Help-Optimal and Language-Portable Lock-Free Concurrent Data Structures. 360-369 - Zhengyuan Xue, Ruixuan Li, Heng Zhang, Xiwu Gu, Zhiyong Xu:

DC-Top-k: A Novel Top-k Selecting Algorithm and Its Parallelization. 370-379 - Napath Pitaksirianan

, Zhila Nouri, Yi-Cheng Tu:
Efficient 2-Body Statistics Computation on GPUs: Parallelization & Beyond. 380-385
Session 5B: Storage Systems
- Peter R. Denz, Matthew Curtis-Maury, Vinay Devadas:

Think Global, Act Local: A Buffer Cache Design for Global Ordering and Parallel Processing in the WAFL File System. 386-395 - Chu Li, Dan Feng, Yu Hua, Fang Wang:

Improving RAID Performance Using an Endurable SSD Cache. 396-405 - Houjun Tang

, Suren Byna
, Steve Harenberg, Wenzhao Zhang, Xiaocheng Zou, Daniel F. Martin, Bin Dong, Dharshi Devendran, Kesheng Wu
, David Trebotich, Scott Klasky, Nagiza F. Samatova:
In Situ Storage Layout Optimization for AMR Spatio-temporal Read Accesses. 406-415 - Sagar Thapaliya, Purushotham V. Bangalore, Jay F. Lofstead

, Kathryn M. Mohror
, Adam Moody:
Managing I/O Interference in a Shared Burst Buffer System. 416-425 - Hao Wen, David Hung-Chang Du, Milan Shetti, Doug Voigt, Shanshan Li:

Guaranteed Bang for the Buck: Modeling VDI Applications with Guaranteed Quality of Service. 426-431
Session 6A: Programming Techniques 2
- Benoît Pradelle, Benoît Meister, Muthu Manikandan Baskaran, Athanasios Konstantinidis, Thomas Henretty, Richard Lethin:

Scalable Hierarchical Polyhedral Compilation. 432-441 - Jingna Zeng, João Pedro Barreto

, Seif Haridi, Luís E. T. Rodrigues, Paolo Romano
:
The Future(s) of Transactional Memory. 442-451 - Sanjay Chatterjee, Nick Vrvilo, Zoran Budimlic, Kathleen Knobe, Vivek Sarkar:

Declarative Tuning for Locality in Parallel Programs. 452-457 - Vivekanandan Balasubramanian, Antons Treikalis, Ole Weidner, Shantenu Jha

:
Ensemble Toolkit: Scalable and Flexible Execution of Ensembles of Tasks. 458-463
Session 6B: Cyberphysical Systems 2
- Ziqi Zhao, Fan Wu, Shaolei Ren

, Xiaofeng Gao, Guihai Chen
, Yong Cui:
TECH: A Thermal-Aware and Cost Efficient Mechanism for Colocation Demand Response. 464-473 - Houssem Chihoub, Christine Collet:

A Scalability Comparison Study of Data Management Approaches for Smart Metering Systems. 474-483 - John W. Romein:

A Comparison of Accelerator Architectures for Radio-Astronomical Signal-Processing Algorithms. 484-489 - Kang Chen, Haiying Shen:

MobiSensing: Exploiting Human Mobility for Multi-application Mobile Data Sensing with Low User Intervention. 490-495
Session 7A: Performance Modeling
- Akrem Benatia, Weixing Ji, Yizhuo Wang, Feng Shi:

Sparse Matrix Format Selection with Multiclass SVM for SpMV on GPU. 496-505 - Jeffrey Daily, Ananth Kalyanaraman, Sriram Krishnamoorthy

, Bin Ren:
On the Impact of Widening Vector Registers on Sequence Alignment. 506-515 - Rong Ge, Xizhou Feng, Yangyang He, Pengfei Zou:

The Case for Cross-Component Power Coordination on Power Bounded Systems. 516-525 - Shi Sha

, Wujie Wen
, Ming Fan, Shaolei Ren
, Gang Quan
:
Performance Maximization via Frequency Oscillation on Temperature Constrained Multi-core Processors. 526-535 - Panfeng Zhang, Ping Huang, Xubin He, Hua Wang, Lingyu Yan, Ke Zhou:

RMD: A Resemblance and Mergence Based Approach for High Performance Deduplication. 536-541
Session 7B: GPU Applications
- Cen Chen, Kenli Li, Aijia Ouyang, Zhuo Tang, Keqin Li:

GFlink: An In-Memory Computing Architecture on Heterogeneous CPU-GPU Clusters for Big Data. 542-551 - Ming-Hsiang Huang, Wuu Yang:

Partial Flattening: A Compilation Technique for Irregular Nested Parallelism on GPGPUs. 552-561 - Feng Zhang, Peng Di

, Hao Zhou
, Xiangke Liao, Jingling Xue
:
RegTT: Accelerating Tree Traversals on GPUs by Exploiting Regularities. 562-571 - Xiaonan Tian, Dounia Khaldi, Deepak Eachempati, Rengan Xu, Barbara M. Chapman:

Optimizing GPU Register Usage: Extensions to OpenACC and Compiler Optimizations. 572-581 - Yi Yang, Min Feng, Srimat T. Chakradhar:

HppCnn: A High-Performance, Portable Deep-Learning Library for GPGPUs. 582-587
Session 8A: Applications
- Guillaume Aupy, JeongHyung Park, Padma Raghavan:

Locality-Aware Laplacian Mesh Smoothing. 588-597 - Sameh Shohdy, Abhinav Vishnu, Gagan Agrawal:

Fault Tolerant Support Vector Machines. 598-607 - Juliette Pardue, Andrey N. Chernikov:

Parallel Two-Dimensional Unstructured Anisotropic Delaunay Mesh Generation of Complex Domains for Aerospace Applications. 608-617
Session 8B: Scalable Software
- Sudip K. Seal, Steven P. Hirshman, Andreas Wingen

, Robert S. Wilcox
, Mark R. Cianciosa
, Ezekial A. Unterberg
:
PARVMEC: An Efficient, Scalable Implementation of the Variational Moments Equilibrium Code. 618-627 - Antons Treikalis, André Merzky, Haoyuan Chen, Tai-Sung Lee

, Darrin M. York
, Shantenu Jha
:
RepEx: A Flexible Framework for Scalable Replica Exchange Molecular Dynamics Simulations. 628-637 - Huan Feng, David M. Eyers

, Steven Mills
, Yongwei Wu, Zhiyi Huang:
PCAF: Scalable, High Precision k-NN Search Using Principal Component Analysis Based Filtering. 638-647

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














