


default search action
46th ICPP 2017: Bristol, UK
- 46th International Conference on Parallel Processing, ICPP 2017, Bristol, United Kingdom, August 14-17, 2017. IEEE Computer Society 2017, ISBN 978-1-5386-1042-8

Highlighted Papers (S1-T1)
- Ivy Bo Peng

, Roberto Gioiosa, Gokcen Kestor
, Erwin Laure
, Stefano Markidis:
Preparing HPC Applications for the Exascale Era: A Decoupling Strategy. 1-10 - Guojing Cong, Onkar Bhardwaj

, Minwei Feng:
An Efficient, Distributed Stochastic Gradient Descent Algorithm for Deep-Learning Applications. 11-20 - Yingrui Wang, Leisheng Li, Rong Tian:

Large-Scale Parallelization of Smoothed Particle Hydrodynamics Method on Heterogeneous Cluster. 21-30
Graph Analytics and ML (S2-T1)
- Erik Vermij, Leandro Fiorin, Christoph Hagleitner, Koen Bertels:

Boosting the Efficiency of HPCG and Graph500 with Near-Data Processing. 31-40 - Han Dong, Tao Li, Jiabing Leng, Lingyan Kong, Gang Bai:

GCN: GPU-Based Cube CNN Framework for Hyperspectral Image Classification. 41-49 - Mallipeddi Hardhik, Dip Sankar Banerjee

, Kiran Raj Ramamoorthy, Kishore Kothapalli, Kannan Srinathan:
Nearly Balanced Work Partitioning for Heterogeneous Algorithms. 50-59
Enhancing Programming Runtime Systems (S2-T2)
- Adrián Castelló

, Sangmin Seo, Rafael Mayo
, Pavan Balaji, Enrique S. Quintana-Ortí
, Antonio J. Peña
:
GLTO: On the Adequacy of Lightweight Thread Approaches for OpenMP Implementations. 60-69 - Jordyn Maglalang, Sriram Krishnamoorthy

, Kunal Agrawal:
Locality-Aware Dynamic Task Graph Scheduling. 70-80 - Tingzhe Zhou, Pantea Zardoshti, Michael F. Spear

:
Practical Experience with Transactional Lock Elision. 81-90
Linear Algebra Algorithms (S2-T3)
- Hartwig Anzt

, Jack J. Dongarra, Goran Flegar
, Enrique S. Quintana-Ortí
:
Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning. 91-100 - Yusuke Nagasaka, Akira Nukada, Satoshi Matsuoka:

High-Performance and Memory-Saving Sparse General Matrix-Matrix Multiplication for NVIDIA Pascal GPU. 101-110 - Shaden Smith, Alec Beri, George Karypis

:
Constrained Tensor Factorization with Accelerated AO-ADMM. 111-120
Data and Networks (S3-T1)
- Victor Garcia-Flores, Eduard Ayguadé, Antonio J. Peña

:
Efficient Data Sharing on Heterogeneous Systems. 121-130 - Vikram K. Narayana, Shuai Sun, Armin Mehrabian

, Volker J. Sorger, Tarek A. El-Ghazawi:
HyPPI NoC: Bringing Hybrid Plasmonics to an Opto-Electronic Network-on-Chip. 131-140 - Xiaokang Hu

, Wang Zhang, Jian Li, Ruhui Ma, Feng Wu, Haibing Guan:
ES2: Aiming at an Optimal Virtual I/O Event Path. 141-150
GPU & Runtime Systems (S3-T2)
- Akshay Venkatesh, Khaled Hamidouche, Sreeram Potluri, Davide Rossetti, Ching-Hsiang Chu

, Dhabaleswar K. Panda:
MPI-GDS: High Performance MPI Designs with GPUDirect-aSync for CPU-GPU Control Flow Decoupling. 151-160 - Ching-Hsiang Chu

, Xiaoyi Lu, Ammar Ahmad Awan, Hari Subramoni, Jahanzeb Maqbool Hashmi, Bracy Elton, Dhabaleswar K. Panda:
Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning. 161-170 - Burak Bastem, Didem Unat

, Weiqun Zhang, Ann S. Almgren
, John Shalf
:
Overlapping Data Transfers with Computation on GPU with Tiles. 171-180
Graphs and Networks (S3-T3)
- Jiawen Sun, Hans Vandierendonck, Dimitrios S. Nikolopoulos

:
Accelerating Graph Analytics by Utilising the Memory Locality of Graph Partitioning. 181-190 - Hari Sundar, Parmeshwar Khurd:

Parallel Algorithms for the Computation of Cycles in Relative Neighborhood Graphs. 191-200 - Minho Bae, Junho Eum, Donghoon Kim, Sangyoon Oh:

High Performance Query Processing for Web Scale RDF Data using BSP Style Communication and Balanced Distribution. 201-210
Storage (S4-T1)
- Xiaoyang Qu, Jiguang Wan, Fengguang Song, Xiaozhao Zhuang, Fei Wu, Changsheng Xie:

OptiMatch: Enabling an Optimal Match between Green Power and Various Workloads for Renewable-Energy Powered Storage Systems. 211-220 - Luyu Li, Houxiang Ji

, Chentao Wu, Jie Li
, Minyi Guo:
Favorable Block First: A Comprehensive Cache Scheme to Accelerate Partial Stripe Recovery of Triple Disk Failure Tolerant Arrays. 221-230 - Yanwen Xie, Dan Feng, Fang Wang:

Non-Sequential Striping for Distributed Storage Systems with Different Redundancy Schemes. 231-240
IO & Cloud (S4-T2)
- Yi Su, Dan Feng, Yu Hua, Zhan Shi:

Predicting Response Latency Percentiles for Cloud Object Storage Systems. 241-250 - Mehmet Fatih Aktas, Javier Diaz Montes, Ivan Rodero

, Manish Parashar:
WA-Dataspaces: Exploring the Data Staging Abstractions for Wide-Area Distributed Scientific Workflows. 251-260 - Matthew Curtis-Maury, Ram Kesavan, Mrinal K. Bhattacharjee:

Scalable Write Allocation in the WAFL File System. 261-270
Numerical Applications (S4-T3)
- Minyoung Jung, Jinwoo Park, Johann Blieberger

, Bernd Burgstaller:
Parallel Construction of Simultaneous Deterministic Finite Automata on Shared-Memory Multicores. 271-281 - Sudip K. Seal, Mark R. Cianciosa

, Steven P. Hirshman, Andreas Wingen
, Robert S. Wilcox
, Ezekial A. Unterberg
:
Parallel Reconstruction of Three Dimensional Magnetohydrodynamic Equilibria in Plasma Confinement Devices. 282-291 - Athena Elafrou, Georgios I. Goumas, Nectarios Koziris:

Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Modern Multi- and Many-Core Processors. 292-301
Networks (S5-T1)
- Lei Yang, Jiannong Cao, Zhenyu Wang, Weigang Wu:

Network Aware Multi-User Computation Partitioning in Mobile Edge Clouds. 302-311 - Chenxi Qiu, Haiying Shen:

Fading-Resistant Link Scheduling in Wireless Networks. 312-321 - Ryota Yasudo

, Michihiro Koibuchi, Koji Nakano
, Hiroki Matsutani, Hideharu Amano:
Order/Radix Problem: Towards Low End-to-End Latency Interconnection Networks. 322-331
Cloud Scheduling (S5-T2)
- MohammadReza HoseinyFarahabady

, Javid Taheri, Zahir Tari
, Albert Y. Zomaya
:
A Dynamic Resource Controller for a Lambda Architecture. 332-341 - Sunimal Rathnayake

, Dumitrel Loghin, Yong Meng Teo
:
CELIA: Cost-Time Performance of Elastic Applications on Cloud. 342-351 - Hervé Yviquel

, Guido Araujo:
The Cloud as an OpenMP Offloading Device. 352-361
GPU Applications (S5-T3)
- Takumi Honda, Shinnosuke Yamamoto, Hiroaki Honda, Koji Nakano

, Yasuaki Ito:
Simple and Fast Parallel Algorithms for the Voronoi Map and the Euclidean Distance Map, with GPU Implementations. 362-371 - Kubilay Atasu

, Thomas P. Parnell, Celestine Dünner, Michail Vlachos
, Haralampos Pozidis:
High-Performance Recommender System Training Using Co-Clustering on CPU/GPU Clusters. 372-381 - Govert G. Brinkmann

, Kristian F. D. Rietveld
, Frank W. Takes
:
Exploiting GPUs for Fast Force-Directed Visualization of Large-Scale Networks. 382-391
Data and IO (S6-T1)
- Long Cheng

, Ying Wang
, Yulong Pei
, Dick H. J. Epema:
A Coflow-Based Co-Optimization Framework for High-Performance Data Analytics. 392-401 - Zhipeng Li, Yinlong Xu, Yongkun Li, Chengjin Tian, Youhui Bai:

PDS: An I/O-Efficient Scaling Scheme for Parity Declustered Data Layout. 402-411 - Yang Wang, Shuibing He, Xiaopeng Fan, Chengzhong Xu

, Joseph C. Culberson, Joseph Horton:
Data Caching in Next Generation Mobile Cloud Services, Online vs. Off-Line. 412-421
Computation Optimization (S6-T2)
- Lijuan Jiang, Chao Yang, Yulong Ao, Wanwang Yin, Wenjing Ma, Qiao Sun, Fangfang Liu, Rongfen Lin, Peng Zhang:

Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor. 422-431 - James Lin, Zhigeng Xu, Akira Nukada, Naoya Maruyama, Satoshi Matsuoka:

Optimizations of Two Compute-Bound Scientific Kernels on the SW26010 Many-Core Processor. 432-441 - Shixiong Xu, David Gregg:

Bitslice Vectors: A Software Approach to Customizable Data Precision on Processors with SIMD Extensions. 442-451
Data Analytics (S6-T3)
- Yang You, James Demmel:

Runtime Data Layout Scheduling for Machine Learning Dataset. 452-461 - Kamesh Arumugam

, Desh Ranjan, Mohammad Zubair, Balsa Terzic
, Alexander N. Godunov, Tunazzina Islam:
A Machine Learning Approach for Efficient Parallel Simulation of Beam Dynamics on GPUs. 462-471 - Charalampos Stylianopoulos, Magnus Almgren

, Olaf Landsiedel, Marina Papatriantafilou
:
Multiple Pattern Matching for Network Security Applications: Acceleration through Vectorization. 472-482
Graph Algorithms (S7-T1)
- Erik Saule

, Dinesh Panchananam, Alexander Hohl
, Wenwu Tang
, Eric Delmelle:
Parallel Space-Time Kernel Density Estimation. 483-492 - Peng Ni, Masatoshi Hanai, Wen Jun Tan

, Chen Wang, Wentong Cai
:
Parallel Algorithm for Single-Source Earliest-Arrival Problem in Temporal Graphs. 493-502 - Mustafa Kemal Tas, Kamer Kaya, Erik Saule

:
Greed Is Good: Parallel Algorithms for Bipartite-Graph Partial Coloring on Multicore Architectures. 503-512
Performance & Power Tuning for Heterogeneous Platforms (S7-T2)
- Isuru Dilanka Fernando, Sanath Jayasena

, Milinda Fernando, Hari Sundar:
A Scalable Hierarchical Semi-Separable Library for Heterogeneous Clusters. 513-522 - Robert V. Lim, Boyana Norris

, Allen D. Malony:
Autotuning GPU Kernels via Static and Predictive Analysis. 523-532 - Aniket Chakrabarti, Srinivasan Parthasarathy

, Christopher Stewart
:
A Pareto Framework for Data Analytics on Heterogeneous Systems: Implications for Green Energy Usage and Performance. 533-542
Various Parallel Algorithms (S8-T1)
- Ayham Kassab, Jean-Marc Nicod, Laurent Philippe, Veronika Rehn-Sonigo:

Scheduling Independent Tasks in Parallel under Power Constraints. 543-552 - Eduardo Moscoso Rubino, Alberto Jose Alvares, Raúl Marín Prades

, Pedro Sanz Valero:
A Novel Minimum Time Parallel 2-D Discrete Wavelet Transform Algorithm for General Purpose Processors. 553-562 - Harshvardhan Das, Subodh Kumar:

A Parallel TSP-Based Algorithm for Balanced Graph Partitioning. 563-570
Resilience & Power Aware Scheduling (S8-T2)
- Xunyun Liu, Aaron Harwood, Shanika Karunasekera, Benjamin I. P. Rubinstein

, Rajkumar Buyya:
E-Storm: Replication-Based State Management in Distributed Stream Processing Systems. 571-580 - Aiman Fang, Aurélien Cavelan, Yves Robert

, Andrew A. Chien:
Resilience for Stencil Computations with Latent Errors. 581-590 - Rong Ge, Pengfei Zou, Xizhou Feng:

Application-Aware Power Coordination on Power Bounded NUMA Multicore Systems. 591-600

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














