


default search action
IPDPS 2015: Hyderabad, India
- 2015 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2015, Hyderabad, India, May 25-29, 2015. IEEE Computer Society 2015, ISBN 978-1-4799-8649-1

Keynote 1
- Phillip B. Gibbons:

Big data: Scale down, scale up, scale out. 3
Session 1: Graph and Social Analytics
- Hao Lu

, Mahantesh Halappanavar, Daniel G. Chavarría-Miranda, Assefaw Hadish Gebremedhin, Ananth Kalyanaraman:
Balanced Coloring for Parallel Computing Applications. 7-16 - George M. Slota, Sivasankaran Rajamanickam, Kamesh Madduri

:
High-Performance Graph Analytics on Manycore Processors. 17-27 - Xinyu Que, Fabio Checconi, Fabrizio Petrini, John A. Gunnels:

Scalable Community Detection with the Louvain Algorithm. 28-37 - Jonathan W. Berry, Michael J. Collins, Aaron Kearns, Cynthia A. Phillips, Jared Saia, Randy Smith:

Cooperative Computing for Autonomous Data Centers. 38-47
Session 2: Numerical Linear Algebra
- Gregoire Pichon

, Azzam Haidar, Mathieu Faverge, Jakub Kurzak:
Divide and Conquer Symmetric Tridiagonal Eigensolver for Multicore Architectures. 51-60 - Shaden Smith, Niranjay Ravindran, Nicholas D. Sidiropoulos

, George Karypis
:
SPLATT: Efficient and Parallel Sparse Tensor-Matrix Multiplication. 61-70 - Piyush Sao, Xing Liu, Richard W. Vuduc

, Xiaoye S. Li:
A Sparse Direct Solver for Distributed Memory Xeon Phi-Accelerated Systems. 71-81 - Tobias Maier, Peter Sanders, Jochen Speck:

Locality Aware DAG-Scheduling for LU-Decomposition. 82-92
Session 3: High Performance Networks and Congestion Management
- Jiwei Liu, Jun Yang, Rami G. Melhem:

GASOLIN: Global Arbitration for Streams of Data in Optical Links. 93-102 - Pablo Fuentes

, Enrique Vallejo
, Marina García, Ramón Beivide, Germán Rodríguez, Cyriel Minkenberg, Mateo Valero
:
Contention-Based Nonminimal Adaptive Routing in High-Radix Networks. 103-112 - Abhinav Bhatele, Andrew R. Titus, Jayaraman J. Thiagarajan, Nikhil Jain, Todd Gamblin, Peer-Timo Bremer, Martin Schulz

, Laxmikant V. Kalé:
Identifying the Culprits Behind Network Congestion. 113-122 - Jun Duan, Zhiyang Guo, Yuanyuan Yang

:
Embedding Nonblocking Multicast Virtual Networks in Fat-Tree Data Centers. 123-132
Session 4: Software for Heterogeneous Many-Core Systems
- Pieter Hijma, Ceriel J. H. Jacobs

, Rob van Nieuwpoort
, Henri E. Bal:
Cashmere: Heterogeneous Many-Core Computing. 135-145 - Tarun Beri, Sorav Bansal, Subodh Kumar:

A Scheduling and Runtime Framework for a Cluster of Heterogeneous Machines with Multiple Accelerators. 146-155 - Wei Wu

, Aurélien Bouteiller
, George Bosilca, Mathieu Faverge, Jack J. Dongarra:
Hierarchical DAG Scheduling for Hybrid Distributed Systems. 156-165 - Niall Emmart, Charles C. Weems:

Pushing the Performance Envelope of Modular Exponentiation Across Multiple Generations of GPUs. 166-176
Session 5: Scheduling Algorithms
- Sanjoy K. Baruah:

Federated Scheduling of Sporadic DAG Task Systems. 179-186 - Josué Feliu

, Julio Sahuquillo, Salvador Petit
, José Duato
:
Addressing Fairness in SMT Multicores with a Progress-Aware Scheduler. 187-196 - Mehmet Deveci, Kamer Kaya, Bora Uçar

, Ümit V. Çatalyürek:
Fast and High Quality Topology-Aware Task Mapping. 197-206 - Hao Lin, Xin Qi, Shuo Yang, Samuel P. Midkiff

:
Workload-Driven VM Consolidation in Cloud Data Centers. 207-216
Session 6: Concurrency in Memory Systems
- Matthieu Perrin, Achour Mostéfaoui, Claude Jard:

Update Consistency for Wait-Free Concurrent Objects. 219-228 - Aras Atalar, Anders Gidenstam, Paul Renaud-Goud, Philippas Tsigas

:
Modeling Energy Consumption of Lock-Free Queue Implementations. 229-238 - Yiannis Nikolakopoulos, Anders Gidenstam, Marina Papatriantafilou

, Philippas Tsigas
:
A Consistency Framework for Iteration Operations in Concurrent Data Structures. 239-248 - Aditya Dhoke, Roberto Palmieri

, Binoy Ravindran
:
An Automated Framework for Decomposing Memory Transactions to Exploit Partial Rollback. 249-258
Session 7: MapReduce Advances
- Yandong Wang, Huansong Fu, Weikuan Yu

:
Cracking Down MapReduce Failure Amplification through Analytics Logging and Migration. 261-270 - Xiao Yu, Bo Hong:

Grouping Blocks for MapReduce Co-Locality. 271-280 - Feng Liang

, Francis C. M. Lau:
SMapReduce: Optimising Resource Allocation by Managing Working Slots at Runtime. 281-290 - Md. Wasi-ur-Rahman, Xiaoyi Lu, Nusrat Sharmin Islam, Raghunath Rajachandrasekar, Dhabaleswar K. Panda:

High-Performance Design of YARN MapReduce on Modern HPC Clusters with Lustre and RDMA. 291-300
Session 8: Performance and Energy Optimizations
- Jesmin Jahan Tithi, Pramod Ganapathi

, Aakrati Talati, Sonal Aggarwal, Rezaul Alam Chowdhury:
High-Performance Energy-Efficient Recursive Dynamic Programming with Matrix-Multiplication-Like Flexible Kernels. 303-312 - Protonu Basu, Mary W. Hall

, Samuel Williams
, Brian van Straalen, Leonid Oliker, Phillip Colella:
Compiler-Directed Transformation for Higher-Order Stencils. 313-323 - Hung-Ching Chang, Bo Li, Godmar Back, Ali Raza Butt

, Kirk W. Cameron
:
LUC: Limiting the Unintended Consequences of Power Scaling on Parallel Transaction-Oriented Workloads. 324-333 - Kuangyu Zheng, Xiaodong Wang, Xiaorui Wang:

PowerFCT: Power Optimization of Data Center Network with Flow Completion Time Constraints. 334-343
Session 9: Dynamic Networks
- John Augustine, Tejas Kulkarni, Sumathi Sivasubramaniam:

Leader Election in Sparse Dynamic Networks with Churn. 347-356 - Alexander Mäcker, Manuel Malatyali, Friedhelm Meyer auf der Heide:

Online Top-k-Position Monitoring of Distributed Data Streams. 357-364 - Ashutosh Bhatia, R. C. Hansdah:

DSLR: A Distributed Schedule Length Reduction Algorithm for WSNs. 365-374 - Ramachandran Vaidyanathan, Costas Busch, Jerry L. Trahan

, Gokarna Sharma, Suresh Rai:
Logarithmic-Time Complete Visibility for Robots with Lights. 375-384
Session 10: Applications on GPUs
- Michael G. Gowanlock, Henri Casanova:

Indexing of Spatiotemporal Trajectories for Efficient Distance Threshold Similarity Searches on the GPU. 387-396 - Xiaoxin Tang, Zhiyi Huang, David M. Eyers

, Steven Mills
, Minyi Guo:
Efficient Selection Algorithm for Fast k-NN Search on GPUs. 397-406 - Steven Dalton, Sean Baxter, Duane Merrill, Luke N. Olson, Michael Garland:

Optimizing Sparse Matrix Operations on GPUs Using Merge Path. 407-416 - Moritz Kreutzer, Andreas Pieper, Georg Hager

, Gerhard Wellein, Andreas Alvermann, Holger Fehske:
Performance Engineering of the Kernel Polynomal Method on Large-Scale CPU-GPU Systems. 417-426
Session 11: Scheduling on Clusters
- Suraj Prabhakaran

, Marcel Neumann, Sebastian Rinke, Felix Wolf, Abhishek Gupta
, Laxmikant V. Kalé:
A Batch System with Efficient Adaptive Scheduling for Malleable and Evolving Applications. 429-438 - Zhou Zhou, Xu Yang, Zhiling Lan, Paul Rich, Wei Tang, Vitali A. Morozov, Narayan Desai:

Improving Batch Scheduling on Blue Gene/Q by Relaxing 5D Torus Network Allocation Constraints. 439-448 - Ana Jokanovic

, José Carlos Sancho, Germán Rodríguez, Alejandro Lucero, Cyriel Minkenberg, Jesús Labarta
:
Quiet Neighborhoods: Key to Protect Job Performance Predictability. 449-459 - Jeeva Paudel, Levi H. S. Lelis, José Nelson Amaral:

Stratified Sampling for Even Workload Partitioning Applied to IDA* and Delaunay Algorithms. 460-469
Session 12: Debugging and Verification
- Nicklas Bo Jensen, Niklas Quarfot Nielsen, Gregory L. Lee, Sven Karlsson

, Matthew P. LeGendre, Martin Schulz
, Dong H. Ahn:
A Scalable Prescriptive Parallel Debugging Model. 473-483 - Zhen Li, Ali Jannesari

, Felix Wolf:
An Efficient Data-Dependence Profiler for Sequential and Parallel Programs. 484-493 - Menna Mostafa, Borzoo Bonakdarpour:

Decentralized Runtime Verification of LTL Specifications in Distributed Systems. 494-503 - Jingyu Zhou, Jiannong Cao

, Bin Yao, Minyi Guo:
Fast Proof Generation for Verifying Cloud Search. 504-513
Keynote 2
- Alan Edelman:

Julia: A fresh approach to parallel programming. 517
Session 13: Randomized Algorithms
- Robert Elsässer, Dominik Kaaser:

On the Influence of Graph Density on Randomized Gossiping. 521-531 - Srikanta Tirthapura:

Distinct Random Sampling from a Distributed Stream. 532-541 - Petra Berenbrink, André Brinkmann

, Robert Elsässer, Tom Friedetzky
, Lars Nagel
:
Randomized Renaming in Shared Memory Systems. 542-549 - Petra Berenbrink, Tom Friedetzky

, Frederik Mallmann-Trenn, Sepehr Meshkinfamfard
, Chris Wastell:
Threshold Load Balancing with Weighted Tasks. 550-558
Session 14: Scientific Applications I
- Evangelos Georganas, Aydin Buluç

, Jarrod Chapman, Leonid Oliker, Daniel Rokhsar
, Katherine A. Yelick
:
merAligner: A Fully Parallel Sequence Aligner. 561-570 - William B. March, Bo Xiao, Chenhan D. Yu, George Biros:

An Algebraic Parallel Treecode in Arbitrary Dimensions. 571-580 - Salli Moustafa, Mathieu Faverge, Laurent Plagne, Pierre Ramet

:
3D Cartesian Transport Sweep for Massively Parallel Architectures with PaRSEC. 581-590 - Linchuan Chen, Xin Huo, Gagan Agrawal:

A Pattern Specification and Optimizations Framework for Accelerating Scientific Computations on Heterogeneous Clusters. 591-600
Session 15: Storage Systems Architecture
- Yingxun Fu, Jiwu Shu:

D-Code: An Efficient RAID-6 Code to Optimize I/O Loads and Read Performance. 603-612 - Shuibing He, Xian-He Sun, Adnan Haider:

HAS: Heterogeneity-Aware Selective Data Layout Scheme for Parallel File Systems on Hybrid Servers. 613-622 - Jiangling Yin, Jun Wang

, Jian Zhou
, Tyler Lukasiewicz, Dan Huang, Junyao Zhang:
Opass: Analysis and Optimization of Parallel Data Access on Distributed File Systems. 623-632 - Bo Mao, Suzhen Wu, Hong Jiang:

Improving Storage Availability in Cloud-of-Clouds with Hybrid Redundant Data Distribution. 633-642
Session 16: MPI and Charm++ Advances
- Thomas Ropars, Arnaud Lefray, Dohyun Kim, André Schiper:

Efficient Process Replication for MPI Applications: Sharing Work between Replicas. 645-654 - Nikhil Jain, Abhinav Bhatele, Jae-Seung Yeom

, Mark F. Adams, Francesco Miniati, Chao Mei, Laxmikant V. Kalé:
Charm++ and MPI: Combining the Best of Both Worlds. 655-664 - Min Si, Antonio J. Peña

, Jeff R. Hammond
, Pavan Balaji, Masamichi Takagi, Yutaka Ishikawa:
Casper: An Asynchronous Progress Model for MPI RMA on Many-Core Architectures. 665-676 - Xiang Ni, Laxmikant V. Kalé, Rasmus Tamstorf

:
Scalable Asynchronous Contact Mechanics Using Charm++. 677-686
Session 17: Combinatorial Algorithms and Optimization
- Ke Wang, Yanjun Qi, Jeffrey J. Fox

, Mircea R. Stan
, Kevin Skadron
:
Association Rule Mining with the Micron Automata Processor. 689-699 - Rong Gu, Shanyong Wang, Fangfang Wang, Chunfeng Yuan, Yihua Huang:

Cichlid: Efficient Large Scale RDFS/OWL Reasoning with Spark. 700-709 - Guojing Cong, Carol Meyers, Deepak Rajan, Tiziano Parriani:

Parallel Strategies for Solving Large Unit Commitment Problems in the California ISO Planning Model. 710-719
Session 18: Scientific Applications II
- Dheevatsa Mudigere, Srinivas Sridharan, Anand M. Deshpande, Jongsoo Park, Alexander Heinecke, Mikhail Smelyanskiy, Bharat Kaul, Pradeep Dubey, Dinesh K. Kaushik, David E. Keyes

:
Exploring Shared-Memory Optimizations for an Unstructured Mesh CFD Application on Modern Parallel Systems. 723-732 - David Ozog, Allen D. Malony, Andrew R. Siegel:

A Performance Analysis of SIMD Algorithms for Monte Carlo Simulations of Nuclear Reactor Cores. 733-742 - Doru-Thom Popovici, Francis P. Russell

, Karl A. Wilkinson, Chris-Kriton Skylaris, Paul H. J. Kelly, Franz Franchetti:
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL. 743-752 - Scott French, Yili Zheng, Barbara Romanowicz

, Katherine A. Yelick
:
Parallel Hessian Assembly for Seismic Waveform Inversion Using Global Updates. 753-762
Session 19: Resilience
- Chongxiao Cao, Thomas Hérault

, George Bosilca, Jack J. Dongarra:
Design for a Soft Error Resilient Dynamic Task-Based Runtime. 765-774 - Jeremy P. Erickson, Namhoon Kim, James H. Anderson:

Recovering from Overload in Multicore Mixed-Criticality Systems. 775-785 - Li Tan, Shuaiwen Leon Song, Panruo Wu

, Zizhong Chen
, Rong Ge, Darren J. Kerbyson:
Investigating the Interplay between Energy Efficiency and Resilience in High Performance Computing. 786-796
Session 20: Graph Analytics
- Harshvardhan, Brandon West, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger:

A Hybrid Approach to Processing Big Data Graphs on Memory-Restricted Systems. 799-808 - Yogesh L. Simmhan

, Neel Choudhury, Charith Wickramaarachchi, Alok Gautam Kumbhare, Marc Frîncu
, Cauligi S. Raghavendra, Viktor K. Prasanna:
Distributed Programming over Time-Series Graphs. 809-818 - Linchuan Chen, Xin Huo, Bin Ren, Surabhi Jain, Gagan Agrawal:

Efficient and Simplified Parallel Graph Processing over CPU and MIC. 819-828
Keynote 3
- Madhav V. Marathe:

Assisting H1N1 and Ebola Outbreak Response through High Performance Networked Epidemiology. 831
Best Papers Session
- Michael A. Bender, Jonathan W. Berry, Simon D. Hammond, K. Scott Hemmert, Samuel McCauley, Branden Moore, Benjamin Moseley, Cynthia A. Phillips, David S. Resnick, Arun Rodrigues:

Two-Level Main Memory Co-Design: Multi-threaded Algorithmic Primitives, Analysis, and Simulation. 835-846 - Yang You, James Demmel, Kenneth Czechowski, Le Song, Richard W. Vuduc

:
CA-SVM: Communication-Avoiding Support Vector Machines on Distributed Systems. 847-859 - J. P. Grossman, Brian Towles, Brian Greskamp, David E. Shaw:

Filtering, Reductions and Synchronization in the Anton 2 Network. 860-870 - Roberto Belli, Torsten Hoefler:

Notified Access: Extending Remote Memory Access Programming Models for Producer-Consumer Synchronization. 871-881
Session 21: Algorithms for Fault Tolerance
- Alejandro Z. Tomsic, Pierre Sens, João Garcia, Luciana Arantes

, Julien Sopena:
2W-FD: A Failure Detector Algorithm with QoS. 885-893 - Silvia Bonomi

, Maria Potop-Butucaru, Sébastien Tixeuil:
Stabilizing Byzantine-Fault Tolerant Storage. 894-903 - Jean Paul Bahsoun, Rachid Guerraoui

, Ali Shoker
:
Making BFT Protocols Really Adaptive. 904-913 - Naoto Sasaki, Kento Sato, Toshio Endo, Satoshi Matsuoka:

Exploration of Lossy Compression for Application-Level Checkpoint/Restart. 914-922
Session 22: Scheduling and Load Balancing
- Max Rietmann, Daniel Peter

, Olaf Schenk
, Bora Uçar
, Marcus J. Grote
:
Load-Balanced Local Time Stepping for Large-Scale Wave Propagation. 925-935 - Yinglong Xia, Lifeng Nai, Jui-Hsin Lai:

Towards Balance-Affinity Tradeoff in Concurrent Subgraph Traversals. 936-945 - Jingjing Wang, Nael B. Abu-Ghazaleh

, Dmitry V. Ponomarev:
Controlled Contention: Balancing Contention and Reservation in Multicore Application Scheduling. 946-955 - Dazhao Cheng, Jia Rao, Changjun Jiang, Xiaobo Zhou:

Resource and Deadline-Aware Job Scheduling in Dynamic Hadoop Clusters. 956-965
Session 23: Heterogeneous Systems
- Jingweijia Tan, Xin Fu:

Mitigating the Susceptibility of GPGPUs Register File to Process Variations. 969-978 - Jayvant Anantpur, R. Govindarajan:

PRO: Progress Aware GPU Warp Scheduling Algorithm. 979-988 - Tobias Fjalling, Per Stenström:

Performance Impact of Batching Web-Application Requests Using Hot-Spot Processing on GPUs. 989-999 - Lavanya Ramapantulu

, Dumitrel Loghin, Yong Meng Teo
:
An Approach for Energy Efficient Execution of Hybrid Parallel Programs. 1000-1009
Session 24: I/O Optimizations
- Ana Gainaru, Guillaume Aupy, Anne Benoit

, Franck Cappello, Yves Robert
, Marc Snir:
Scheduling the I/O of HPC Applications Under Congestion. 1013-1022 - Bogdan Nicolae

:
Leveraging Naturally Distributed Data Redundancy to Reduce Collective I/O Replication Overhead. 1023-1032 - Tong Jin, Fan Zhang, Qian Sun, Hoang Bui, Melissa Romanus

, Norbert Podhorszki, Scott Klasky, Hemanth Kolla, Jacqueline Chen, Robert Hager
, Choong-Seock Chang
, Manish Parashar:
Exploring Data Staging Across Deep Memory Hierarchies for Coupled Data Intensive Simulation Workflows. 1033-1042 - Pham Nguyen Quang Anh, Rui Fan, Yonggang Wen:

Reducing Vector I/O for Faster GPU Sparse Matrix-Vector Multiplication. 1043-1052
Session 25: Graph Algorithms
- Henning Meyerhenke

, Peter Sanders, Christian Schulz
:
Parallel Graph Partitioning for Complex Networks. 1055-1064 - Lélia Blin, Fadwa Boubekeur, Swan Dubois

:
A Self-Stabilizing Memory Efficient Algorithm for the Minimum Diameter Spanning Tree under an Omnipotent Daemon. 1065-1074 - Ariful Azad, Aydin Buluç

, Alex Pothen
:
A Parallel Tree Grafting Algorithm for Maximum Cardinality Matching in Bipartite Graphs. 1075-1084
Session 26: Resource Management
- Koyel Mukherjee, Partha Dutta, Gurulingesh Raravi, Thangaraj Rajasubramaniam, Koustuv Dasgupta, Atul Singh:

Fair Resource Allocation for Heterogeneous Tasks. 1087-1096 - Tan Li, Yufei Ren, Dantong Yu, Shudong Jin:

Resources-Conscious Asynchronous High-Speed Data Transfer in Multicore Systems: Design, Optimizations, and Evaluation. 1097-1106 - Tridib Mukherjee, Partha Dutta, Vinay Gangadhar Hegde, Sujit Gujar

:
RISC: Robust Infrastructure over Shared Computing Resources through Dynamic Pricing and Incentivization. 1107-1116
Session 27: Architectural Support for Runtime and Thermal Management
- Alberto Ros

, Alexandra Jimborean
:
A Dual-Consistency Cache Coherence Protocol. 1119-1128 - Tamer Dallou, Nina Engelhardt, Ahmed Elhossini, Ben H. H. Juurlink:

Nexus#: A Distributed Hardware Task Manager for Task-Based Programming Models. 1129-1138 - Kaicheng Zhang, Seda Ogrenci Memik

, Gokhan Memik, Kazutomo Yoshii, Rajesh Sankaran, Peter H. Beckman:
Minimizing Thermal Variation Across System Components. 1139-1148
Session 28: Performance Monitoring and Prediction
- Mihail Popov, Chadi Akel, Florent Conti, William Jalby, Pablo de Oliveira Castro

:
PCERE: Fine-Grained Parallel Benchmark Decomposition for Scalability Prediction. 1151-1160 - Anirudh Jayakumar, Prakash Murali, Sathish Vadhiyar:

Matching Application Signatures for Performance Predictions Using a Single Execution. 1161-1170 - Hammad Khan, Julien Gascon-Samson, Jörg Kienzle, Bettina Kemme:

Monitoring Large-Scale Location-Based Information Systems. 1171-1181

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














