


default search action
IPDPS 2013: Cambridge, MA, USA - Workshops
- 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, Cambridge, MA, USA, May 20-24, 2013. IEEE 2013

Heterogeneity in Computing Workshop - HCW
- Alexey L. Lastovetsky

, Uwe Schwiegelshohn:
HCW Introduction. 1-2
Table of Contents
- Behrooz A. Shirazi:

Message from the HCW Steering Committee Chair. 3 - Alexey L. Lastovetsky

:
Message from the HCW General Chair. 4 - Uwe Schwiegelshohn:

Message from the HCW Program Chair. 5 - Jack J. Dongarra:

HCW 2013 Keynote Talk. 6
HCW Session 1: Scheduling and Resource Allocation
- Piotr Skowron

, Krzysztof Rzadca:
Network Delay-Aware Load Balancing in Selfish and Cooperative Distributed Systems. 7-18 - Ryan D. Friese, Bhavesh Khemka, Anthony A. Maciejewski

, Howard Jay Siegel, Gregory A. Koenig, Sarah Powers, Marcia Hilton, Jendra Rambharos, Gene Okonski, Stephen W. Poole:
An Analysis Framework for Investigating the Trade-Offs between System Performance and Energy Consumption in a Heterogeneous Computing Environment. 19-30 - Henri Casanova, Fanny Dufossé, Yves Robert, Frédéric Vivien

:
Scheduling Tightly-Coupled Applications on Heterogeneous Desktop Grids. 31-42 - Ehsan Ullah Munir, Sajjad Mohsin, Altaf Hussain, Muhammad Wasif Nisar, Shoukat Ali:

SDBATS: A Novel Algorithm for Task Scheduling in Heterogeneous Computing Systems. 43-53
HCW Session 2: Heterogeneous Processors
- Arash Sadrieh, Stefano Charissis, Adam P. Hill:

An On-chip Heterogeneous Implementation of a General Sparse Linear Solver. 54-63 - Tim Süß, Andrew Schoenrock, Sebastian Meisner, Christian Plessl

:
Parallel Macro Pipelining on the Intel SCC Many-Core Computer. 64-73 - Vishal Gupta, Karsten Schwan:

Brawny vs. Wimpy: Evaluation and Analysis of Modern Workloads on Heterogeneous Processors. 74-83
HCW Session 3: Communication in Heterogeneous Systems
- Adam Hackett, Deepak Ajwani

, Shoukat Ali, Steve Kirkland, John P. Morrison:
Seeds for a Heterogeneous Interconnect. 84-92 - Jaroslaw Slawinski, Umberto Villa

, Tiziano Passerini, Alessandro Veneziani, Vaidy S. Sunderam:
Issues in Communication Heterogeneity for Message-Passing Concurrent Computing. 93-102
Reconfigurable Architectures Workshop - RAW
- Jürgen Becker, Ramachandran Vaidyanathan, Peter Athanas, Marco D. Santambrogio

, René Cumplido, Oliver Sander:
RAW Introduction. 103-105
RAW Session 1: Systems and Applications
- Lingkan Gong, Oliver Diessel

, Johny Paul, Walter Stechele:
RTL Simulation of High Performance Dynamic Reconfiguration: A Video Processing Case Study. 106-113 - Yun Qu, Viktor K. Prasanna:

High-Performance Pipelined Architecture for Tree-Based IP Lookup Engine on FPGA. 114-123 - Andrea Sanny, Thilan Ganegedara, Viktor K. Prasanna:

A Comparison of Ruleset Feature Independent Packet Classification Engines on FPGA. 124-133
RAW Session 2: Architectures and Algorithms I
- Björn Liebig, Jens Huthmann, Andreas Koch:

Architecture Exploration of High-Performance Floating-Point Fused Multiply-Add Units and their Automatic Use in High-Level Synthesis. 134-143 - Jingfei Jiang, Rongdong Hu, Mikel Luján:

A Flexible Memory Controller Supporting Deep Belief Networks with Fixed-Point Arithmetic. 144-152 - Jan Heisswolf, Andreas Weichslgartner

, Aurang Zaib
, Ralf König, Thomas Wild, Andreas Herkersdorf, Jürgen Teich, Jürgen Becker
:
Hardware Supported Adaptive Data Collection for Networks on Chip. 153-162
RAW Session 3: Software and Tools
- Wenwei Zha, Peter Athanas:

An FPGA Router for Alternative Reconfiguration Flows. 163-171 - Kizheppatt Vipin

, Suhaib A. Fahmy
:
Automated Partitioning for Partial Reconfiguration Design of Adaptive Systems. 172-181 - Angel Gallego, Javier Mora

, Andrés Otero
, Rubén Salvador
, Eduardo de la Torre, Teresa Riesgo:
A Novel FPGA-based Evolvable Hardware System Based on Multiple Processing Arrays. 182-191
RAW Session 4: Architectures and Algorithms II
- Gianluca Durelli, Alessandro Antonio Nacci, Riccardo Cattaneo

, Christian Pilato
, Donatella Sciuto
, Marco D. Santambrogio
:
A Flexible Interconnection Structure for Reconfigurable FPGA Dataflow Applications. 192-201 - Mansureh Shahraki Moghaddam, Kolin Paul, M. Balakrishnan:

Design and Implementation of High Performance Architectures with Partially Reconfigurable CGRAs. 202-211 - Jochem H. Rutgers, Marco Jan Gerrit Bekooij, Gerard J. M. Smit:

Portable Memory Consistency for Software Managed Distributed Memory in Many-Core SoC. 212-221
RAW Session 5: Software and Tools II
- Anil Kumar Sistla, Natalie Parde

, Krunalkumar Patel, Gayatri Mehta:
Cross-Architectural Study of Custom Reconfigurable Devices Using Crowdsourcing. 222-230 - Davide B. Bartolini, Matteo Carminati, Fabio Cancare, Marco D. Santambrogio

, Donatella Sciuto
:
HERA Project's Holistic Evolutionary Framework. 231-238 - Yoon Kah Leow, Ali Akoglu

:
A Hybrid FPGA Model to Estimate On-Chip Crossbar Logic Utilization in SoC Platforms. 239-246
RAW Poster Session 1
- Indranil Hatai

, Indrajit Chakrabarti, Swapna Banerjee:
Reconfigurable Architecture of a RRC Fir Interpolator for Multi-standard Digital Up Converter. 247-251 - Pierre Bomel, Kevin J. M. Martin, Jean-Philippe Diguet:

Virtual UARTs for Reconfigurable Multi-processor Architectures. 252-259 - George Afonso, Zeineb Baklouti, David Duvivier

, Rabie Ben Atitallah, Eli Billauer, Stephan Stilkerich:
Heterogeneous CPU/FPGA Reconfigurable Computing System for Avionic Test Application. 260-267 - Michael Mefenza, Christophe Bobda:

FPGA Implementation of Subcarrier Index Modulation OFDM Transceiver. 268-272 - Roland Christian Gamom Ngounou Ewo, Emmanuel Kiegaing, Martin Mbouenda, Hilaire Bertrand Fotsin, Bertrand Granado:

Hardware MPI-2 Functions for Multi-Processing Reconfigurable System on Chip. 273-280
RAW Poster Session 2
- Alexander Wold, Dirk Koch, Jim Tørresen:

Thermal Aware Module Placement for Heterogeneous 3D-IC Based FPGAs. 281-286 - Peng Li

, Angshuman Parashar, Michael Pellauer, Tao Wang, Joel S. Emer:
A Hierarchical Architectural Framework for Reconfigurable Logic Computing. 287-292 - Pranav Tendulkar, Sander Stuijk

:
A Case Study into Predictable and Composable MPSoC Reconfiguration. 293-300 - Miho Ueno, Masanori Hashimoto

, Takao Onoye:
Real-Time Supply Voltage Sensor for Detecting/Debugging Electrical Timing Failures. 301-305 - Sérgio Cruz, Daniel M. Muñoz

, Milton E. Conde, Carlos H. Llanos
, Geovany A. Borges
:
A Hardware Approach for Solving the Robot Localization Problem Using a Sequential EKF. 306-313
Workshop on High-Level Parallel Programming Models and Supportive
- Kathryn M. Mohror, Stephen L. Olivier:

HIPS Introduction. 314
HIPS Session 1: Support for Accelerator and Manycore Architectures
- Eric Holk, Milinda Pathirage, Arun Chauhan, Andrew Lumsdaine

, Nicholas D. Matsakis:
GPU Programming in Rust: Implementing High-Level Abstractions in a Systems-Level Language. 315-324 - Bryant C. Lam, Alan D. George

, Herman Lam:
TSHMEM: Shared-Memory Parallel Computing on Tilera Many-Core Processors. 325-334 - Safae Dahmani, Loïc Cudennec

, Guy Gogniat
:
Introducing a Data Sliding Mechanism for Cooperative Caching in Manycore Architectures. 335-344
HIPS Session 2: Scalable Tools for Correctness and Performance Analysis
- Kevin Pouget, Patricia López Cueva, Miguel Santana, Jean-François Méhaut:

Interactive Debugging of Dynamic Dataflow Embedded Applications. 345-354 - Andrea Martínez, Anna Sikora

, Eduardo César
, Joan Sorribes
:
How to Scale Dynamic Tuning to Large Parallel Applications. 355-364 - Tatsuya Abe

, Toshiyuki Maeda, Mitsuhisa Sato:
Model Checking Stencil Computations Written in a Partitioned Global Address Space Language. 365-374
HIPS Session 3: Programming Models and Abstractions
- Christopher D. Krieger, Michelle Mills Strout, Catherine Olschanowsky, Andrew Stone, Stephen M. Guzik, Xinfeng Gao, Carlo Bertolli, Paul H. J. Kelly, Gihan R. Mudalige

, Brian van Straalen, Samuel Williams
:
Loop Chaining: A Programming Abstraction for Balancing Locality and Parallelism. 375-384 - Tiffany M. Mintz, Oscar R. Hernandez, Christos Kartsaklis, David E. Bernholdt

, Markus Eisenbach
, Swaroop Pophale:
Toward Abstracting the Communication Intent in Applications to Improve Portability and Productivity. 385-393 - Min Feng, Rajiv Gupta

, Iulian Neamtiu:
Programming Support for Speculative Execution with Software Transactional Memory. 394-403
Workshop on Nature Inspired Distributed Computing - NIDISC
- Pascal Bouvry

, Franciszek Seredynski
, El-Ghazali Talbi:
NIDISC Introduction. 404
NIDISC Session 1: Multi-objective Optimization
- Bahri Oumayma, Nahla Ben Amor

, El-Ghazali Talbi:
A Possibilistic Framework for Solving Multi-objective Problems under Uncertainty: Definition of New Pareto Optimality. 405-414 - Santiago Iturriaga

, Patricia Ruiz
, Sergio Nesmachnow
, Bernabé Dorronsoro Díaz
, Pascal Bouvry
:
A Parallel Multi-objective Local Search for AEDB Protocol Tuning. 415-424 - David Hadka, Kamesh Madduri

, Patrick M. Reed:
Scalability Analysis of the Asynchronous, Master-Slave Borg Multiobjective Evolutionary Algorithm. 425-434 - Benoît Bertholon, Sébastien Varrette, Sebastien Martinez:

ShadObf: A C-Source Obfuscator Based on Multi-objective Optimisation Algorithms. 435-444
NIDISC Session 2: Application of Bio-Inspired Algorithms
- Antonina Tretyakova, Franciszek Seredynski

:
Application of Evolutionary Algorithms to Maximum Lifetime Coverage Problem in Wireless Sensor Networks. 445-453 - Paul D. Yoo

, Albert Y. Zomaya
, Khalfan Alromaithi, Sara Alshamsi:
Tree-Based Consensus Model for Proline Cis-Trans Isomerization Prediction. 454-458 - Marcin Seredynski, Wojciech Mazurczyk

, Djamel Khadraoui:
Multi-segment Green Light Optimal Speed Advisory. 459-465 - Boussaad Ismail, Amir Nakib

, Frederic Heliodore, Serge Poullain, Patrick Siarry:
Novel Levy Based Particle Swarm Optimization Algorithm for Electrical Power Grid. 466-473
NIDISC Session 3: Scheduling and Resource Allocation
- Thibaud Rohmer, Amir Nakib

, Abdelhamid Nafaa:
Optimal Peer Selection Strategy in P2P-VoD Systems Using Dynamic Evolution Strategy. 474-481 - Michail-Antisthenis I. Tsompanas

, Christoforos Kachris
, Georgios Ch. Sirakoulis
:
Optimization of Shared-Memory Multicore Systems Using Game Theory and Genetic Algorithms on Cellular Automata Lattices. 482-490 - Francois Legillon, Nouredine Melab, Didier Renard, El-Ghazali Talbi:

Cost Minimization of Service Deployment in a Public Cloud Environment. 491-498
Workshop on High Performance Computational Biology - HiCOMB
- Jaroslaw Zola, David A. Bader

, Srinivas Aluru:
HiCOMB Introduction. 499-500
HiCOMB Session I: Sequence and Structure
- Fernando Machado Mendonca, Alba Cristina Magalhaes Alves de Melo

:
Biological Sequence Comparison on Hybrid Platforms with Dynamic Workload Adjustment. 501-509 - Anuj Sharma, Antonis Papanikolaou, Elias S. Manolakos

:
Accelerating All-to-All Protein Structures Comparison with TMalign Using a NoC Many-Cores Processor Architecture. 510-519 - Daniel T. Yehdego, Boyu Zhang, Vikram Kumar Reddy Kodimala, Kyle L. Johnson, Michela Taufer

, Ming-Ying Leung:
Secondary Structure Predictions for Long RNA Sequences Based on Inversion Excursions and MapReduce. 520-529
HiCOMB Session II: Phylogenetics and Metagenomics
- Fernando Izquierdo-Carrasco, Nikolaos Alachiotis, Simon A. Berger, Tomás Flouri

, Solon P. Pissis
, Alexandros Stamatakis:
A Generic Vectorization Scheme and a GPU Kernel for the Phylogenetic Likelihood Library. 530-538 - Diego Darriba

, Andre J. Aberer, Tomás Flouri
, Tracy A. Heath
, Fernando Izquierdo-Carrasco, Alexandros Stamatakis:
Boosting the Performance of Bayesian Divergence Time Estimation with the Phylogenetic Likelihood Library. 539-548 - Zeehasham Rasheed, Huzefa Rangwala:

A Map-Reduce Framework for Clustering Metagenomes. 549-558
HiCOMB Session III: Molecular and Systems Biology
- Changjun Wu, Ananth Kalyanaraman:

GPU-Accelerated Protein Family Identification for Metagenomics. 559-568 - Jesmin Jahan Tithi, Rezaul Alam Chowdhury:

Polarization Energy on a Cluster of Multicores. 569-578 - Marco Maggioni, Tanya Y. Berger-Wolf

, Jie Liang:
GPU-Based Steady-State Solution of the Chemical Master Equation. 579-588
Advances in Parallel and Distributed Computing Models - APDCM
- Oscar H. Ibarra:

APDCM Introduction. 589-590
APDCM Session 1: Parallel Computing 1
- Koji Nakano

:
The Hierarchical Memory Machine Model for GPUs. 591-600 - Alejandro López-Ortiz, Alejandro Salinger, Robert Suderman:

Toward a Generic Hybrid CPU-GPU Parallelization of Divide-and-Conquer Algorithms. 601-610 - Patrick Poon, Quentin F. Stout:

Time-Power Tradeoffs for Sorting on a Mesh-Connected Computer with Optical Connections. 611-619
APDCM Session 2: Parallel Computing 2
- Kohei Tanaka, Akihiro Fujiwara:

Asynchronous P Systems for the Maximum Independent Set and Related Graph Problems. 620-629 - Jun Du, Ce Yu, Jizhou Sun, Chao Sun, Shanjiang Tang, Yanlong Yin:

EasyHPS: A Multilevel Hybrid Parallel System for Dynamic Programming. 630-639 - Martti Forsell, Erik Hansson, Christoph W. Kessler, Jari-Matti Mäkelä

, Ville Leppänen
:
Hardware and Software Support for NUMA Computing on Configurable Emulated Shared Memory Architectures. 640-648 - Mengjia Yan, Weiwei Fu, Chao Wang, Tianzhou Chen, Li Liu:

Agent-Based Traffic Merging in Network-on-Chip. 649-658
APDCM Session 3: Distributed Computing 1
- Yihua Ding, James Zijun Wang, Pradip K. Srimani:

Self-Stabilizing Master-Slave Token Circulation Algorithm in an Undirected Ring of Arbitrary Size and Its Orientation. 659-666 - Gianlorenzo D'Angelo

, Gabriele Di Stefano, Alfredo Navarra
, Nicolas Nisse, Karol Suchan
:
A Unified Approach for Different Tasks on Rings in Robot-Based Computing Systems. 667-676 - Keqin Li:

Parallel File Download in Peer-to-Peer Networks with Random Service Capacities. 677-686 - Jing Zhou, Gregor von Bochmann, Zhongzhi Shi:

Distributed Query Processing in an Ad-hoc Semantic Web Data Sharing System. 687-695
APDCM Session 4: Distributed Computing 2
- Jie Cai, Paola Flocchini, Nicola Santoro:

Network Decontamination from a Black Virus. 696-705 - Jack J. Dongarra, Thomas Hérault

, Yves Robert
:
Revisiting the Double Checkpointing Algorithm. 706-715
APDCM Session 5: Short Presentations
- Ajoy Kumar Datta, Lawrence L. Larmore, Stéphane Devismes

, Sébastien Tixeuil:
Fast Leader (Full) Recovery Despite Dynamic Faults. 716-725 - Marcin Kardas, Marek Klonowski, Dominik Pajak

, Kamil Wolny:
K-Selection Protocols from Energetic Complexity Perspective. 726-733 - Wei Chen, Liang Hong:

Cooperative MIMO Paradigms for Cognitive Radio Networks. 734-740 - Xia Zhang, Jinyu Zhan, Wei Jiang, Yue Ma, Ke Jiang:

Design Optimization of Energy- and Security-Critical Distributed Real-Time Embedded Systems. 741-750 - Amlan Chatterjee, Sridhar Radhakrishnan, John K. Antonio:

On Analyzing Large Graphs Using GPUs. 751-760 - Shinichi Yamagiwa, Ryoyu Watanabe, Koichi Wada:

Operation Synchronization Technique on Pipeline-Based Hardware Synthesis Applying Stream-Based Computing Framework. 761-770 - Xin Zhou, Norihiro Tomagou, Yasuaki Ito, Koji Nakano

:
Efficient Hough Transform on the FPGA using DSP Slices and Block RAMs. 771-778
Communication Architecture for Scalable Systems - CASS
- José Flich

, Scott Pakin, Craig B. Stunkel:
CASS Introduction. 779-780 - Turbo Majumder, Partha Pratim Pande, Ananth Kalyanaraman:

Network-on-Chip with Long-Range Wireless Links for High-Throughput Scientific Computation. 781-790 - Arne Garbade, Sebastian Weis, Sebastian Schlingmann, Bernhard Fechner

, Theo Ungerer:
Fault Localization in NoCs Exploiting Periodic Heartbeat Messages in a Many-Core Environment. 791-795 - José V. Escamilla, José Flich

, Pedro Javier García
:
Head-of-Line Blocking Avoidance in Networks-on-Chip. 796-805 - Roberto Ammendola

, Massimo Bernaschi, Andrea Biagioni
, Mauro Bisson, Massimiliano Fatica, Ottorino Frezza
, Francesca Lo Cicero
, Alessandro Lonardo, Enrico Mastrostefano, Pier Stanislao Paolucci
, Davide Rossetti, Francesco Simula
, Laura Tosoratto, Piero Vicini
:
GPU Peer-to-Peer Techniques Applied to a Cluster Interconnect. 806-815 - Min Si, Yutaka Ishikawa, Masamichi Takagi:

Direct MPI Library for Intel Xeon Phi Co-Processors. 816-824 - Abhinav Vishnu, Darren J. Kerbyson, Kevin J. Barker

, Hubertus Van Dam
:
Building Scalable PGAS Communication Subsystem on Blue Gene/Q. 825-833 - Xin Yuan, Santosh Mahapatra, Michael Lang

, Scott Pakin
:
RRR: A Load Balanced Routing Scheme for Slimmed Fat-Trees. 834-841 - Yuan He

, Hiroshi Sasaki, Shinobu Miwa, Hiroshi Nakamura
:
Predict-More Router: A Low Latency NoC Router with More Route Predictions. 842-850
High-Performance, Power-Aware Computing - HPPAC
- Bronis R. de Supinski, Dong Li:

HPPAC Introduction. 851-852
HPPAC Session 1: Power Efficient Hardware
- Sean Wallace, Venkatram Vishwanath, Susan Coghlan, Zhiling Lan, Michael E. Papka

:
Measuring Power Consumption on IBM Blue Gene/Q. 853-859 - Vishal Gupta, Karsten Schwan:

PowerTune: Differentiated Power Allocation in Over-Provisioned Multicore Systems. 860-867 - Gilbert Hendry:

Decreasing Network Power with on-off Links Informed by Scientific Applications. 868-875
HPPAC Session 2 : Energy/Power Measurement and Profiling
- Dominic DiTomaso, Randy Morris, Evan Jolley, Ashwini Sarathy, Ahmed Louri, Avinash Karanth Kodi:

Design of a Concentrated Torus Topology with Channel Buffers and Efficient Crossbars in NoCs. 876-883 - Allan Porterfield, Stephen L. Olivier

, Sridutt Bhalachandra, Jan F. Prins:
Power Measurement and Concurrency Throttling for Energy Reduction in OpenMP Programs. 884-891 - Dale Sartor, Rod Mahdavi, Ben D. Radhakrishnan, Natalie J. Bates, Anna Maria Bailey, Ralph Wescott:

General Recommendations for High Performance Computing Data Center Energy Management Dashboard Display. 892-898
HPPAC Session 3: Large Scale Power Management
- Taghrid Samak, Christine Morin, David H. Bailey:

Energy Consumption Models and Predictions for Large-Scale Systems. 899-906 - Carlo Mastroianni

, Michela Meo
, Giuseppe Papuzzo
:
Analysis of a Self-Organizing Algorithm for Energy Saving in Data Centers. 907-914 - Ehsan Totoni, Nikhil Jain, Laxmikant V. Kalé:

Toward Runtime Power Management of Exascale Networks by on/off Control of Links. 915-922
HPPAC Session 4: Compiler and Runtime Techniques
- Sanket Tavarageri, P. Sadayappan:

A Compiler Analysis to Determine Useful Cache Size for Energy Efficiency. 923-930 - Jan Christian Meyer, Juan M. Cebrian

, Lasse Natvig, Vasileios Karakasis, Dimitris Siakavaras, Konstantinos Nikas:
Energy-Efficient Sparse Matrix Autotuning with CSX - A Trade-off Study. 931-937 - Akshay Venkatesh, Krishna Chaitanya Kandalla, Dhabaleswar K. Panda:

Evaluation of Energy Characteristics of MPI Communication Primitives with RAPL. 938-945
High-Performance Grid and Cloud Computing Workshop - HPGC
- Eric E. Aubanel, Michael A. Frumkin:

HPGC Introduction. 946
HPGC Session 1: High Performance Cloud Computing
- Khaled Diab, M. Mustafa Rafique, Mohamed Hefeeda

:
Dynamic Sharing of GPUs in Cloud Systems. 947-954 - Yiming Han, Anthony T. Chronopoulos

:
Distributed Loop Scheduling Schemes for Cloud Systems. 955-962 - Petra Berenbrink, André Brinkmann

, Tom Friedetzky
, Dirk Meister, Lars Nagel
:
Distributing Storage in Cloud Environments. 963-973 - Christophe Cérin, Alain Takoudjou:

BOINC as a Service for the SlapOS Cloud: Tools and Methods. 974-983
HPGC Session 2: Virtual Machines and Data Clouds
- Liuyi Eric Zhang, Gaurav Dhiman, Tajana Simunic Rosing:

vGreenNet: Managing Server and Networking Resources of Co-Located Heterogeneous VMs. 984-991 - Ryousei Takano, Hidemoto Nakada

, Takahiro Hirofuchi, Yoshio Tanaka, Tomohiro Kudoh:
Ninja Migration: An Interconnect-Transparent Migration for Heterogeneous Data Centers. 992-1000 - Seung-Hwan Lim, James L. Horey, Yanjun Yao, Edmon Begoli

, Qing Cao:
Performance Implications from Sizing a VM on Multi-core Systems: A Data Analytic Application's View. 1001-1008 - Satish Puri

, Dinesh Agarwal, Xi He, Sushil K. Prasad
:
MapReduce Algorithms for GIS Polygonal Overlay Processing. 1009-1016
Accelerators and Hybrid Exascale Systems - ASHES
- Jiayuan Meng:

ASHES Introduction. 1017-1019
ASHES Session 1: Programing Model and Performance Optimizations
- Ashwin M. Aji, Pavan Balaji, James Dinan, Wu-chun Feng, Rajeev Thakur

:
Synchronization and Ordering Semantics in Hybrid MPI+GPU Programming. 1020-1029 - Toshihiro Hanawa

, Yuetsu Kodama, Taisuke Boku, Mitsuhisa Sato:
Tightly Coupled Accelerators Architecture for Minimizing Communication Latency among Accelerators. 1030-1039 - Yash Ukidave, David R. Kaeli:

Analyzing Optimization Techniques for Power Efficiency on Heterogeneous Platforms. 1040-1049 - Andra-Ecaterina Hugo, Abdou Guermouche, Pierre-André Wacrenier, Raymond Namyst:

Composing Multiple StarPU Applications over Heterogeneous Machines: A Supervised Approach. 1050-1059
ASHES Session 2: Accelerated Applications
- Dip Sankar Banerjee

, Parikshit Sakurikar, Kishore Kothapalli:
Fast, Scalable Parallel Comparison Sort on Hybrid Multicore Architectures. 1060-1069 - Ichitaro Yamazaki, Tingxing Dong, Stanimire Tomov

, Jack J. Dongarra:
Tridiagonalization of a Symmetric Dense Matrix on a GPU Cluster. 1070-1079 - Guanghao Jin, Toshio Endo, Satoshi Matsuoka:

A Multi-Level Optimization Method for Stencil Computation on the Domain that is Bigger than Memory Capacity of GPU. 1080-1087 - Yang You, Haohuan Fu, Xiaomeng Huang, Guojie Song, Lin Gan, Wenjian Yu, Guangwen Yang:

Accelerating the 3D Elastic Wave Forward Modeling on GPU and MIC. 1088-1096
ASHES Session 3: Emerging Hybrid Systems
- Michael Boyer, Jiayuan Meng, Kalyan Kumaran:

Improving GPU Performance Prediction with Data Transfer Modeling. 1097-1106 - Gaurav Mitra, Beau Johnston

, Alistair P. Rendell, Eric McCreath, Jun Zhou
:
Use of SIMD Vector Operations to Accelerate Application Code Performance on Low-Powered ARM and Intel Platforms. 1107-1116 - Tao Gao, Yutong Lu, Guang Suo:

Using MIC to Accelerate a Typical Data-Intensive Application: The Breadth-first Search. 1117-1125 - Robert E. Overman, Jan F. Prins, Laura A. Miller, Michael L. Minion

:
Dynamic Load Balancing of the Adaptive Fast Multipole Method in Heterogeneous Systems. 1126-1135
Multicore and GPU Programming Models, Languages, and Compilers Workshop-PLC
- Barbara M. Chapman, Weimin Zheng:

PLC Introduction. 1136-1137
PLC Session 1: Languages Extensions and Compilers for Multicore-SIMD Processors and GPUs
- Hiroyuki Takizawa

, Makoto Sugawara, Shoichi Hirasawa, Isaac Gelado, Hiroaki Kobayashi, Wen-mei W. Hwu:
clMPI: An OpenCL Extension for Interoperation with the Message Passing Interface. 1138-1148 - Xinmin Tian, Hideki Saito, Serguei Preis, Eric N. Garcia, Sergey Kozhukhov, Matt Masten, Aleksei G. Cherkasov, Nikolay Panchenko:

Practical SIMD Vectorization Techniques for Intel® Xeon Phi Coprocessors. 1149-1158 - Jin Lin, Xinmin Tian, John Ng:

Mis-speculation-Driven Compiler Framework for Aggressive Loop Automatic Parallelization. 1159-1168
PLC Session 2: Programming and Applications for MultiCore Processors and GPUs
- Rengan Xu, Sunita Chandrasekaran, Barbara M. Chapman:

Exploring Programming Multi-GPUs Using OpenMP and OpenACC-Based Hybrid Model. 1169-1176 - Joo Hwan Lee, Kaushik Patel, Nimit Nigania, Hyojong Kim, Hyesoon Kim:

OpenCL Performance Evaluation on Modern Multi Core CPUs. 1177-1185 - Bharath Ramesh, Calvin J. Ribbens, Srinidhi Varadarajan:

Towards Virtual Shared Memory for Non-cache-coherent Multicore Systems. 1186-1193 - Rodrigo Dominguez, David R. Kaeli:

Unstructured Control Flow in GPGPU. 1194-1202
PLC Session 3: Runtime Implementation and Optimizations
- Jean-Marc Gratien:

An Abstract Object Oriented Runtime System for Heterogeneous Parallel Architecture. 1203-1212 - Chris J. Newburn, Serguei Dmitriev, Ravi Narayanaswamy, John Wiegert, Ravi Murty, Francisco Chinchilla, Rajiv Deodhar, Russell McGuire:

Offload Compiler Runtime for the Intel® Xeon Phi Coprocessor. 1213-1225 - Feng Jiang, Alan Sussman

:
Reducing the Cost of Measuring Memory Hierarchy Communication Parameters. 1226-1233
NSF/TCPP Workshop on Parallel and Distributed Computing Education-EduPar
- Sushil K. Prasad

:
EduPar Introduction. 1234-1236 - Andrew Danner, Tia Newhall:

Integrating Parallel and Distributed Computing Topics into an Undergraduate CS Curriculum. 1237-1243 - Joel C. Adams

, Richard A. Brown, Elizabeth Shoop:
Patterns and Exemplars: Compelling Strategies for Teaching Parallel and Distributed Computing to CS Undergraduates. 1244-1251
EduPar Session: Introductory Courses (CS0/CS1/CS2)
- Sean Massung, Cinda Heeren:

Visualizing Parallelism in CS 2. 1252-1256 - Travis J. Desell:

Using Actors and the SALSA Programming Language to Introduce Concurrency in Computer Science II. 1257-1262 - Steven Bogaerts

:
Hands-On Exploration of Parallelism for Absolute Beginners with Scratch. 1263-1268
EduPar Session: Systems/Architecture Courses
- Cordelia M. Brown, Yung-Hsiang Lu, Samuel P. Midkiff

:
Introducing Parallel Programming in Undergraduate Curriculum. 1269-1274 - David P. Bunde, Karen L. Karavanic, Jens Mache

, Christopher T. Mitchell:
Adding GPU Computing to Computer Organization Courses. 1275-1282 - Crispín Gómez Requena, María Engracia Gómez

, Julio Sahuquillo
:
A New Methodology for Studying Realistic Processors in Computer Science Degrees. 1283-1290
EduPar Session: Multiple Course Adoptions and Techniques
- Clayton Ferner, Barry Wilkinson, Barbara Heath:

Toward Using Higher-Level Abstractions to Teach Parallel Computing. 1291-1296 - Marcelo Arroyo:

Teaching Parallel and Distributed Computing to Undergraduate Computer Science Students. 1297-1303 - Zhen Li, Eileen T. Kraemer:

Programming with Concurrency: Threads, Actors, and Coroutines. 1304-1311 - Hong Lin:

Teaching Parallel and Distributed Computing Using a Cluster Computing Portal. 1312-1317 - Adriano Branco

, Ana Lúcia de Moura, Noemi de La Rocque Rodriguez, Silvana Rossetto:
Teaching Concurrent and Distributed Computing - Initiatives in Rio de Janeiro. 1318-1323 - Peter Strazdins, Neal Naixue Xiong, Thomas Rauber, Yinglong Xia, Laurence T. Yang, Gudula Rünger:

PDSEC Introduction. 1324-1325 - Manaschai Kunaseth

, Rajiv K. Kalia, Aiichiro Nakano, Priya Vashishta, David F. Richards, James N. Glosli:
Performance Characteristics of Hardware Transactional Memory for Molecular Dynamics Application on BlueGene/Q: Toward Efficient Multithreading Strategies for Large-Scale Scientific Applications. 1326-1335 - Phillip M. Dickens, Timothy Morey:

Increasing the Scalability of PISM for High Resolution Ice Sheet Models. 1336-1344 - Scott Pakin

, Michael Lang
:
Understanding the Performance of Two Production Supercomputers. 1345-1354
PDSEC Session 2: Applications via Heterogeneous Computing
- Matthew Wezowicz, Trilce Estrada

, Sandeep Patel, Michela Taufer
:
Performance Dissection of a Molecular Dynamics Code across CUDA and GPU Generations. 1355-1364 - Dossay Oryspayev, Hugh D. Potter, Pieter Maris, Masha Sosonkina, James P. Vary, Sven Binder, Angelo Calci, Joachim Langhammer, Robert Roth:

Leveraging GPUs in Ab Initio Nuclear Physics Calculations. 1365-1372 - Jan Verschelde, Genady Yoffe:

Orthogonalization on a General Purpose Graphics Processing Unit with Double Double and Quad Double Arithmetic. 1373-1380
PDSEC Session 3: Linear Algebra and Infrastructure
- Corentin Rossignon, Pascal Hénon, Olivier Aumage, Samuel Thibault:

A NUMA-Aware Fine Grain Parallelization Framework for Multi-core Architecture. 1381-1390 - Ichitaro Yamazaki, Xiaoye S. Li, François-Henry Rouet, Bora Uçar

:
On Partitioning and Reordering Problems in a Hierarchically Parallel Hybrid Linear Solver. 1391-1400 - Kyungjoo Kim, Victor Eijkhout:

Scheduling a Parallel Sparse Direct Solver to Multiple GPUs. 1401-1408 - Chao Yang

, Xiao-Chuan Cai:
A Scalable Implicit Solver for Phase Field Crystal Simulations. 1409-1416
PDSEC Session 4: Cloud, Grid, and Distributed Systems
- Hongju Cheng, Naixue Xiong, Xingbo Huang, Laurence T. Yang:

An Efficient Scheduling Model for Broadcasting in Wireless Sensor Networks. 1417-1428 - Nitin Sukhija, Ioana Banicescu, Srishti Srivastava, Florina M. Ciorba

:
Evaluating the Flexibility of Dynamic Loop Scheduling on Heterogeneous Systems in the Presence of Fluctuating Load Using SimGrid. 1429-1438 - Patrick Cichowski, Jörg Keller:

Efficient and Fault-Tolerant Static Scheduling for Grids. 1439-1448 - Zhaomeng Zhu, Gongxuan Zhang, Yongping Zhang, Jian Guo, Naixue Xiong:

Briareus: Accelerating Python Applications with Cloud. 1449-1456
Dependable Parallel, Distributed, and Network-Centric Systems - DPDNS
- Dimiter Avresky, Erik Maehle, Salvatore Distefano

:
DPDNS Introduction. 1457-1459
DPDNS Session: Network Algorithms and Techniques
- Min Shen, Ajay D. Kshemkalyani

:
A Fault-Tolerant Strong Conjunctive Predicate Detection Algorithm for Large-Scale Networks. 1460-1469 - Frank Engelhardt

, Timo Lindhorst, Edgar Nett:
Tolerating Packet Losses in Wireless Mesh Networks. 1470-1479 - Tae Seung Kang, Maurício O. Tsugawa, José A. B. Fortes, Takahiro Hirofuchi:

Reducing the Migration Times of Multiple VMs on WANs Using a Feedback Controller. 1480-1489
DPDNS Session: Cloud Computing
- Antonio Celesti

, Nicola Peditto, Fabio Verboso, Massimo Villari
, Antonio Puliafito:
DRACO PaaS: A Distributed Resilient Adaptable Cloud Oriented Platform. 1490-1497 - Arash Rezaei, Frank Mueller:

Sustained Resilience via Live Process Cloning. 1498-1507 - Andreas Dittrich, Igor Kaitovic, Cristina Murillo, Rafael Rezende:

A Model for Evaluation of User-Perceived Service Properties. 1508-1517
DPDNS Session: High Performance/Availability Computing
- Mourad Elhadef

:
Symmetric Comparison-Based Fault Diagnosis of Multiprocessor and Distributed Systems Using Nonlinear Support Vector Machines. 1518-1527 - Pradeep Subedi, Xubin He:

A Comprehensive Analysis of XOR-Based Erasure Codes Tolerating 3 or More Concurrent Failures. 1528-1537 - Andreas C. Döring, Tibor Kiss:

Monitoring and Controlling System for Microservers. 1538-1541
DPDNS Session: Fault Tolerance
- Salvatore Distefano

:
Investigating Standby from a System Reliability Perspective. 1542-1549 - Ammar Amory, Benjamin Meyer, Christoph Osterloh, Thomas Tosik, Erik Maehle:

Towards Fault-Tolerant and Energy-Efficient Swarms of Underwater Robots. 1550-1553
Workshop on Multi-Threaded Architectures and Applications - MTAAP
- Luiz DeRose:

MTAAP Introduction. 1554
MTAAP Session: Compiler and Tools
- Yilin Zhang, Wei-Ming Lin:

Capping Speculative Traces to Improve Performance in Simultaneous Multi-threading CPUs. 1555-1564 - Bevin Brett, Pranith Kumar, Minjang Kim, Hyesoon Kim:

CHiP: A Profiler to Measure the Effect of Cache Contention on Scalability. 1565-1574 - Rakesh Krishnaiyer, Emre Kultursay, Pankaj Chawla, Serguei Preis, Anatoly Zvezdin, Hideki Saito:

Compiler-Based Data Prefetching and Streaming Non-temporal Store Generation for the Intel(R) Xeon Phi(TM) Coprocessor. 1575-1586
MTAAP Session: Scheduling and Runtime
- Martin Wimmer, Manuel Pöter, Jesper Larsson Träff:

The Pheet Task-Scheduling Framework on the Intel® Xeon Phi Coprocessor and other Multicore Architectures. 1587-1596 - John D. Leidel, Joe Bolding, Geoffrey Rogers:

Toward a Scalable Heterogeneous Runtime System for the Convey MX Architecture. 1597-1606 - Chen Chen, Yao Wu, Stéphane Zuckerman, Guang R. Gao:

Towards Memory-Load Balanced Fast Fourier Transformations in Fine-Grain Execution Models. 1607-1617
MTAAP Session: Graph Algorithms
- Scott Beamer, Aydin Buluç

, Krste Asanovic, David A. Patterson:
Distributed Memory Breadth-First Search Revisited: Enabling Bottom-Up Search. 1618-1627 - Jesmin Jahan Tithi, Dhruv Mátáni

, Gaurav Menghani, Rezaul Alam Chowdhury:
Avoiding Locks and Atomic Instructions in Shared-Memory Parallel BFS Using Optimistic Parallelization. 1628-1637 - David Ediger, David A. Bader

:
Investigating Graph Algorithms in the BSP Model on the Cray XMT. 1638-1645
MTAAP Session: Algorithms
- E. Jason Riedy

, David A. Bader
:
Multithreaded Community Monitoring for Massive Streaming Graph Data. 1646-1655 - David J. Haglin, Robert Adolf, Greg E. Mackey:

Scalable, Multithreaded, Partially-in-Place Sorting. 1656-1664
Workshop on Large-Scale Parallel Processing - LSPP
- Darren J. Kerbyson, Ram Rajamony, Charles C. Weems:

LSPP Introduction. 1665-1666
LSPP Session 1: Performance Analysis and Optimization
- Laura Carrington, Michael Laurenzano, Ananta Tiwari:

Inferring Large-Scale Computation Behavior via Trace Extrapolation. 1667-1674 - Hormozd Gahvari, William Gropp

, Kirk E. Jordan, Martin Schulz
, Ulrike Meier Yang
:
Systematic Reduction of Data Movement in Algebraic Multigrid Solvers. 1675-1682
LSPP Session 2: Modeling Performance for Scaling
- Fan Zhang, Majd F. Sakr:

Dataset Scaling and MapReduce Performance. 1683-1690 - Matthew Anderson, Maciej Brodowicz, Thomas L. Sterling, Hartmut Kaiser

, Bryce Adelstein-Lelbach:
Tabulated Equations of State with a Many-tasking Execution Model. 1691-1699 - Niall Emmart, Charles C. Weems:

Toward Automatic Optimized Code Generation for Multiprecision Modular Exponentiation on a GPU. 1700-1707
LSPP Session 3: Large-Scale Systems
- Faisal Shahzad, Markus Wittmann, Thomas Zeiser, Georg Hager

, Gerhard Wellein:
An Evaluation of Different I/O Techniques for Checkpoint/Restart. 1708-1716 - Richard F. Barrett, Courtenay T. Vaughan, Simon D. Hammond, Duncan Roweth:

Application Explorations for Future Interconnects. 1717-1724
LSPP Session 4: Scheduling
- Ankur Narang, Abhinav Srivastava, R. K. Shyamasundar:

High Performance Adaptive Distributed Scheduling Algorithm. 1725-1734 - Yiming Han, Anthony T. Chronopoulos

:
Scalable Loop Self-Scheduling Schemes Implemented on Large-Scale Clusters. 1735-1742
Parallel Computing and Optimization - PCO
- Didier El Baz

:
PCO Introduction. 1743
PCO Session 1: Algorithms
- Anne Benoit

, Johannes Langguth, Bora Uçar
:
Semi-Matching Algorithms for Scheduling Parallel Tasks under Resource Constraints. 1744-1753 - Yves Caniou, Philippe Codognet:

Sequential and Parallel Restart Policies for Constraint-Based Local Search. 1754-1763 - Juan F. R. Herrera

, Leocadio G. Casado
, Remigijus Paulavicius
, Julius Zilinskas, Eligius M. T. Hendrix
:
On a Hybrid MPI-Pthread Approach for Simplicial Branch-and-Bound. 1764-1770 - Tarek Menouer

, Bertrand Le Cun:
Anticipated Dynamic Load Balancing Strategy to Parallelize Constraint Programming Search. 1771-1777
PCO Session 2: GPU Computing and Optimization
- Vincent Boyer

, Didier El Baz
:
Recent Advances on GPU Computing in Operations Research. 1778-1787 - Kamil Rocki, Reiji Suda:

High Performance GPU Accelerated Local Optimization in TSP. 1788-1796 - Karl-Eduard Berger, François Galea:

An Efficient Parallelization Strategy for Dynamic Programming on GPU. 1797-1806 - Nhat-Phuong Tran, Myungho Lee, Sugwon Hong, Jaeyoung Choi:

High Throughput Parallel Implementation of Aho-Corasick Algorithm on a GPU. 1807-1816
PCO Session 3
- Keliang Zhang, Baifeng Wu:

Task Scheduling Greedy Heuristics for GPU Heterogeneous Cluster Involving the Weights of the Processor. 1817-1827 - Amgad Naiem, Mohammed El-Beltagy:

On the Optimality and Speed of the Deep Greedy Switching Algorithm for Linear Assignment Problems. 1828-1837 - Blair D. Sullivan

, Dinesh Weerapurage, Chris Groër:
Parallel Algorithms for Graph Optimization Using Tree Decompositions. 1838-1847 - Yingsheng Ji

, Guangwen Yang, Li Liu, Shu Wang:
Subdomain Mapping Approach to Enhance the Coupling in Earth System Modeling. 1848-1855
Parallel and Distributed Computing for Machine Learning and Inference Problems-ParLearning
- Sutanay Choudhury, George Chin Jr., Yinglong Xia:

ParLearning Introduction. 1856-1858 - Alfredo Goldman

, Joachim Lepping, Yanik Ngoko, Denis Trystram:
Combining Parallel Algorithms Solving the Same Application: What is the Best Approach? 1859-1868 - Che-Rung Lee, Ya-Fang Chang:

Enhancing Accuracy and Performance of Collaborative Filtering Algorithm by Stochastic SVD and Its MapReduce Implementation. 1869-1878 - Lifeng Nai, Hsien-Hsin S. Lee:

Reducing False Transactional Conflicts with Speculative Sub-Blocking State - An Empirical Study for ASF Transactional Memory System. 1879-1888 - Christian Trefftz, Jerry Scripps:

Revisiting a Pattern for Processing Combinatorial Objects in Parallel. 1889-1893 - Ryan Hafen, Terence Critchlow:

EDA and ML - A Perfect Pair for Large-Scale Data Analysis. 1894-1898 - David J. Haglin, Lawrence B. Holder:

Combining Structure and Property Values is Essential for Graph-Based Learning. 1899-1904
High Performance Data Intensive Computing - HPDIC
- Christophe Cérin, Congfeng Jiang, Yuqing Gao, Jilin Zhang:

HPDIC Introduction. 1905-1907
HPDIC Session 1: MapReduce, Hadoop, and New Architecture Support for Data Intensive Computing
- Md. Wasi-ur-Rahman, Nusrat Sharmin Islam, Xiaoyi Lu, Jithin Jose, Hari Subramoni, Hao Wang, Dhabaleswar K. Panda:

High-Performance RDMA-based Design of Hadoop MapReduce over InfiniBand. 1908-1917 - Max Grossman, Maurício Breternitz Jr.

, Vivek Sarkar:
HadoopCL: MapReduce on Distributed Heterogeneous Platforms through Seamless Integration of Hadoop and OpenCL. 1918-1927 - Prateek Tandon, Michael J. Cafarella, Thomas F. Wenisch:

Minimizing Remote Accesses in MapReduce Clusters. 1928-1936 - Sébastien Gambs, Marc-Olivier Killijian, Izabela Moise, Miguel Núñez del Prado Cortez

:
MapReducing GEPETO or Towards Conducting a Privacy Analysis on Millions of Mobility Traces. 1937-1946 - Tuan-Anh Nguyen, Greg Eisenhauer, Karsten Schwan, Matthew Wolf, Hasan Abbasi, Scott Klasky, Norbert Podhorszki:

Resource Provisioning for Staging Components. 1947-1953 - Shuibing He, Xian-He Sun, Yanlong Yin:

BPS: A Performance Metric of I/O System. 1954-1962
HPDIC Session 2: File System and Storage for Data Intensive Computing
- Paul Z. Kolano:

Transparent Optimization of Parallel File System I/O via Standard System Tool Enhancement. 1963-1970 - Ramya Prabhakar, Mahmut T. Kandemir, Padma Raghavan, Myoungsoo Jung:

Interference Resolver in Shared Storage Systems to Provide Fairness to I/O Intensive Applications. 1971-1980 - Yong-Jian Ren, YouQing Lin, Jilin Zhang, Jian Wan, Congfeng Jiang:

InfoStor: Highly Available Distributed Block Store. 1981-1988 - Bo Wang, Jinlei Jiang, Guangwen Yang:

tpNFS: Efficient Support of Small Files Processing over pNFS. 1989-1996 - Dongfang Zhao, Ioan Raicu:

HyCache: A User-Level Caching Middleware for Distributed File Systems. 1997-2006 - Rengan Xu, Mauricio Araya-Polo, Barbara M. Chapman:

Filesystem Aware Scalable I/O Framework for Data-Intensive Parallel Applications. 2007-2014
HPDIC Session 3: Data Analytics: Solutions and Case Studies
- Jai Dayal, Jianting Cao, Greg Eisenhauer, Karsten Schwan, Matthew Wolf, Fang Zheng, Hasan Abbasi, Scott Klasky, Norbert Podhorszki, Jay F. Lofstead

:
I/O Containers: Managing the Data Analytics and Visualization Pipelines of High End Codes. 2015-2024 - Mark Hereld, Tanu Malik, Venkatram Vishwanath:

Proactive Support for Large-Scale Data Exploration. 2025-2034 - Jed Kao-Tung Chang, Chen Liu

, Jean-Luc Gaudiot:
Enhancement for Potential Target in Cryptography Algorithms by Applying Processor-in-Memory Architecture. 2035-2044 - Jie Liang, Kenli Li, Lin Shi, Yingqiang Liao:

Accelerating Dynamics Simulation of Solidification Processes of Liquid Metals Using GPU with CUDA. 2045-2053 - Xin Cheng, Chungjin Hu, Yang Li, Wei Lin, Haolei Zuo:

Data Evolution Analysis of Virtual DataSpace for Managing the Big Data Lifecycle. 2054-2063 - Jian Wan, Shuting Han, Jilin Zhang, Baojin Zhu, Li Zhou:

An Image Management System Implemented on Open-Source Cloud Platform. 2064-2070 - Chen Meng, Long Wang, Zongyan Cao, Xianfeng Ye, Long-Long Feng:

Acceleration of a High Order Finite-Difference WENO Scheme for Large-Scale Cosmological Simulations on GPU. 2071-2078 - Lee Carraher, Philip A. Wilsey, Fred S. Annexstein:

A GPGPU Algorithm for c-Approximate r-Nearest Neighbor Search in High Dimensions. 2079-2088
Workflow Models, Systems, Services, and Applications in the Cloud - CloudFlow
- Yong Zhao, Cui Lin, Shiyong Lu:

CloudFlow Introduction. 2089-2090
CloudFlow Session: Modeling
- Philip Yeo, Syed Sibte Raza Abidi

:
Dataflow Oriented Similarity Matching for Scientific Workflows. 2091-2100 - Marc Frîncu, Stéphane Genaud

, Julien Gossa:
Comparing Provisioning and Scheduling Strategies for Workflows on Clouds. 2101-2110 - Ke Wang, Zhangjie Ma, Ioan Raicu:

Modeling Many-Task Computing Workloads on a Petaflop IBM Blue Gene/P Supercomputer. 2111-2120 - Xiaoliang Fan, Ruisheng Zhang

, Patrick Brézillon:
Investigating the Feasibility of Making Contexts Explicit in Designing Cloud Workflow. 2121-2128
CloudFlow Session: Application
- Siddharth Ramakrishnan, Robert Reutiman, Abhishek Chandra, Jon B. Weissman:

Accelerating Distributed Workflows with Edge Resources. 2129-2138 - Dinesh Agarwal, Sushil K. Prasad

:
AzureBOT: A Framework for Bag-of-Tasks Applications on the Azure Cloud Platform. 2139-2146 - Yi Wei, M. Brian Blake

, Iman Saleh:
Adaptive Resource Management for Service Workflows in Cloud Environments. 2147-2156 - David Chapman, Tyler A. Simon, Phuong Nguyen, Milton Halem:

A Data Intensive Statistical Aggregation Engine: A Case Study for Gridded Climate Records. 2157-2164
Workshop on Job Scheduling Strategies for Parallel Processing - JSSPP
- Walfredo Cirne, Narayan Desai:

JSSPP Introduction. 2165
Virtual Prototyping of Parallel and Embedded Systems - ViPES
- Michael Hübner:

ViPES Introduction. 2166-2167
ViPES Session 1: Novel Designs and Features for Full System Simulators
- Aditya Kumar, Ahmed Gheith, Michael Kistler:

Experiences with Dynamic Binary Translation in a Full System Simulator. 2168-2175 - Tayeb Bouhadiba, Matthieu Moy, Florence Maraninchi, Jérôme Cornet, Laurent Maillet-Contoz, Ilija Materic:

Co-simulation of Functional SystemC TLM Models with Power/Thermal Solvers. 2176-2181 - Daniele Bortolotti

, Christian Pinto, Andrea Marongiu, Martino Ruggiero, Luca Benini
:
VirtualSoC: A Full-System Simulation Environment for Massively Parallel Heterogeneous System-on-Chip. 2182-2187 - Christoph Schumacher, Jan Henrik Weinstock, Rainer Leupers, Gerd Ascheid, Laura Tosoratto, Alessandro Lonardo, Dietmar Petras, Thorsten Grötker:

legaSCi: Legacy SystemC Model Integration into Parallel Systemc Simulators. 2188-2193
ViPES Session 2: Simulation Frameworks and Fault Tolerance for Virtual FPGAs
- Dionysios Diamantopoulos, Kostas Siozios

, Efstathios Sotiriou-Xanthopoulos, George Economakos, Dimitrios Soudris:
HVSoCs: A Framework for Rapid Prototyping of 3-D Hybrid Virtual System-on-Chips. 2194-2199 - Gabriel Marchesan Almeida, Oliver Bellaver Longhi, Thomas Bruckschlögl, Michael Hübner, Fabiano Hessel

, Jürgen Becker
:
Simplify: A Framework for Enabling Fast Functional/Behavioral Validation of Multiprocessor Architectures in the Cloud. 2200-2205 - Kostas Siozios

, Dimitrios Soudris
, Michael Hübner:
On Supporting Adaptive Fault Tolerant at Run-Time with Virtual FPGAs. 2206-2211
PhD Forum
PhD Forum: Algorithms
- Ardavan Pedram:

Algorithm/Architecture Codesign of Low Power and High Performance Linear Algebra Compute Fabrics. 2214-2217 - Fei Cao, Mengxia Michelle Zhu:

Energy Efficient Workflow Job Scheduling for Green Cloud. 2218-2221 - Liudong Zuo

, Mengxia Michelle Zhu:
Toward Flexible and Fast Routing Strategies for Dynamic Network Provisioning. 2222-2225 - Xibo Jin, Fa Zhang, Zhiyong Liu:

Discrete Min-Energy Scheduling on Restricted Parallel Processors. 2226-2229 - Moussa Ehsan, Radu Sion

:
LiPS: A Cost-Efficient Data and Task Co-Scheduler for MapReduce. 2230-2233 - Vladimir Ufimtsev, Sanjukta Bhowmick:

Identifying High betweenness Centrality Vertices in Large Noisy Networks. 2234-2237
PhD Forum: Applications
- Satish Puri

, Sushil K. Prasad
:
Efficient Parallel and Distributed Algorithms for GIS Polygonal Overlay Processing. 2238-2241 - Alessandro Morari, Mateo Valero

:
HPC System Software for Regular and Irregular Parallel Applications. 2242-2245 - Thilan Ganegedara, Viktor K. Prasanna:

Wire Speed IPv6 Forwarding on Multi-core Platforms. 2246-2249 - Tekin Bicer

, Gagan Agrawal:
A Compression Framework for Multidimensional Scientific Datasets. 2250-2253
PhD Forum: Architecture
- Bin Wang, Weikuan Yu:

Performance and Power Simulation for Versatile GPGPU Global Memory. 2254-2257 - Scott Levy:

Exploiting Content Similarity to Improve Memory Performance in Large-Scale High-Performance Computing Systems. 2258-2261 - David Ediger, David A. Bader

:
Designing Hybrid Architectures for Massive-Scale Graph Analysis. 2262-2265 - Thilo Schondienst, Vinod Vokkarane

:
Reducing the Environmental Impact of Optical Networks. 2266-2269 - Wenwei Zha, Peter Athanas:

Fine-Grained Manipulation of FPGA Configuration for Incremental Design. 2270-2273 - Vito Giovanni Castellana, Fabrizio Ferrandi

:
Applications Acceleration through Adaptive Hardware Components. 2274-2277
PhD Forum: Software
- Radu Tudoran, Gabriel Antoniu, Luc Bougé:

SAGE: Geo-Distributed Streaming Data Analysis in Clouds. 2278-2281 - João Eugenio Marynowski:

Towards Dependability Testing of MapReduce Systems. 2282-2285 - Matthieu Dorier

:
Efficient I/O using Dedicated Cores in Large-Scale HPC Simulations. 2286-2289 - Houssem-Eddine Chihoub:

Self-Adaptive Cost-Efficient Consistency Management in the Cloud. 2290-2293 - Abhishek Gupta

, Laxmikant V. Kalé:
Towards Efficient Mapping, Scheduling, and Execution of HPC Applications on Platforms in Cloud. 2294-2297 - Silvia Lovergine, Fabrizio Ferrandi

:
Harnessing Adaptivity Analysis for the Automatic Design of Efficient Embedded and HPC Systems. 2298-2301 - Can Hankendi, Ayse K. Coskun:

Adaptive Power and Resource Management Techniques for Multi-threaded Workloads. 2302-2305

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














