


default search action
IPDPS 2017: Orlando / Buena Vista, FL, USA - Workshops
- 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPS Workshops 2017, Orlando / Buena Vista, FL, USA, May 29 - June 2, 2017. IEEE Computer Society 2017, ISBN 978-1-5386-3408-0

HCW: Heterogeneity in Computing Workshop
- Erik Saule, Emmanuel Jeannot:

Introduction to HCW Workshop. 1 - Behrooz A. Shirazi:

Message from the HCW Steering Committee Chair. 2 - Erik Saule:

Message from the HCW General Chair. 3 - Emmanuel Jeannot:

Message from the HCW Program Committee Chair. 4 - Ricky Yu-Kwong Kwok:

HCW Keynote Talk. 5
Session 1: Managing the Different Components of Heterogeneous Systems
- Oliver Jakob Arndt, Fabian David Trager, Tobias Moß, Holger Blume

:
Portable Implementation of Advanced Driver-Assistance Algorithms on Heterogeneous Architectures. 6-17 - Siddharth Rai

, Mainak Chaudhuri:
Improving CPU Performance Through Dynamic GPU Access Throttling in CPU-GPU Heterogeneous Processors. 18-29 - Benjamin Marks, Tia Newhall:

Transparent Heterogeneous Backing Store for File Systems. 30-41
Session 2: Scheduling and Resource Allocation
- Sonia López, Stavan Satish Karia:

Alternative Processor Within Threshold: Flexible Scheduling on Heterogeneous Systems. 42-53 - Dylan Machovec, Sudeep Pasricha, Anthony A. Maciejewski

, Howard Jay Siegel, Gregory A. Koenig, Michael Wright, Marcia Hilton, Rajendra Rambharos, Thomas J. Naughton, Neena Imam:
Preemptive Resource Management for Dynamically Arriving Tasks in an Oversubscribed Heterogeneous Computing System. 54-64 - Lilia Zaourar, Massinissa Ait Aba, David Briand, Jean-Marc Philippe:

Modeling of Applications and Hardware to Explore Task Mapping and Scheduling Strategies on a Heterogeneous Micro-Server System. 65-76 - Thibaud Ecarot

, Djamal Zeghlache
, Cedric Brandily:
Consumer-and-Provider-Oriented Efficient IaaS Resource Allocation. 77-85
RAW: Reconfigurable Architectures Workshop
- Marco D. Santambrogio, Ramachandran Vaidyanathan:

Introduction to RAW Workshop. 86-87 - Ronald F. DeMara

, Georgi Gaydadjiev:
RAW Keynote Speakers. 88-89
Session 1: Architectures for Convolutional Neural Networks and Sliding Window
- Marco Bacis, Giuseppe Natale, Emanuele Del Sozzo

, Marco Domenico Santambrogio:
A Pipelined and Scalable Dataflow Implementation of Convolutional Neural Networks on FPGA. 90-97 - Haruyoshi Yonekawa, Hiroki Nakahara:

On-Chip Memory Based Binarized Convolutional Deep Neural Network Applying Batch Normalization Free Technique on an FPGA. 98-105 - Murad Qasaimeh

, Joseph Zambreno, Phillip H. Jones:
A Modified Sliding Window Architecture for Efficient BRAM Resource Utilization. 106-114
Session 2: Design and Programming Methods
- Gary Gréwal, Shawki Areibi, Matthew Westrik, Ziad Abuowaimer, Betty Zhao:

Automatic Flow Selection and Quality-of-Result Estimation for FPGA Placement. 115-123 - Javier Alejandro Varela, Norbert Wehn

, Qian Liang, Songyin Tang:
Exploiting Decoupled OpenCL Work-Items with Data Dependencies on FPGAs: A Case Study. 124-131 - Luca Stornaiuolo, Alberto Parravicini, Gianluca Durelli, Marco D. Santambrogio:

Exploiting FPGAs from Higher Level Languages A Signal Analysis Case Study. 132-140 - Philip Gottschling, Christian Hochberger:

ReEP: A Toolset for Generation and Programming of Reconfigurable Datapaths for Event Processing. 141-149
Session 3: Acceleration of Curran's Approximation and Elliptic Curve Crypto
- Anna Maria Nestorov, Enrico Reggiani, Hristina Palikareva, Pavel Burovskiy, Tobias Becker

, Marco D. Santambrogio:
A Scalable Dataflow Implementation of Curran's Approximation Algorithm. 150-157 - Rabia Shahid, Ted Winograd, Kris Gaj:

A Generic Approach to the Development of Coprocessors for Elliptic Curve Cryptosystems. 158-167
Session 4: Acceleration of Biological Signal Processing
- Luca Cerina

, Pierandrea Cancian
, Giuseppe Franco, Marco Domenico Santambrogio:
A Hardware Acceleration for Surface EMG Non-Negative Matrix Factorization. 168-174 - Giovanni Pietro Seu

, Gian Nicola Angotzi, Giuseppe Tuveri, Luigi Raffo
, Luca Berdondini, Alessandro Maccione
, Paolo Meloni:
On-FPGA Real-Time Processing of Biological Signals From High-Density MEAs: a Design Space Exploration. 175-183
Session 5: Design Methods
- Yosi Ben-Asher, Esti Stein, Ramachandran Vaidyanathan:

Combining Boolean Gates and Branching Programs in One Model can Lead to Faster Circuits. 184-191 - Utsav Agarwal, Ramachandran Vaidyanathan:

Efficient Totally-Ordered Subset Generation, with Application in Partial Reconfiguration. 192-201
Short Papers
- Godwin Enemali

, Adewale Adetomi, Tughrul Arslan:
FAReP: Fragmentation-Aware Replacement Policy for Task Reuse on Reconfigurable FPGAs. 202-206 - Tejaswini Ananthanarayana, Sonia López, Marcin Lukowiak:

Power Analysis of HLS-Designed Customized Instruction Set Architectures. 207-212 - Tajas Ruschke, Lukas Johannes Jung, Christian Hochberger:

A Near Optimal Integrated Solution for Resource Constrained Scheduling, Binding and Routing on CGRAs. 213-218 - Adewale Adetomi, Godwin Enemali

, Tughrul Arslan:
Clock Buffers, Nets, and Trees for On-Chip Communication: A Novel Network Access Technique in FPGAs. 219-222 - Enrico Reggiani, Eleonora D'Arnese

, Andrea Purgato, Marco D. Santambrogio:
Pearson Correlation Coefficient Acceleration for Modeling and Mapping of Neural Interconnections. 223-228 - Tripti Jain, Klaus Schneider

, Frederik Walk:
Out-of-Order Execution of Buffered Function Units in Exposed Data Path Architectures. 229-234 - Andres Jacoby, Daniel Llamocca

:
Dynamic Dual Fixed-Point CORDIC Implementation. 235-240 - Emanuele Del Sozzo

, Lorenzo Di Tucci, Marco D. Santambrogio:
A Highly Scalable and Efficient Parallel Design of N-Body Simulation on FPGA. 241-246 - Francesca Palumbo

, Carlo Sau
, Danilo Pani
, Paolo Meloni, Luigi Raffo
:
Feasibility Study of Real-Time Spiking Neural Network Simulations on a Swarm Intelligence Based Digital Architecture. 247-250
HiCOMB: 16th IEEE International Workshop on High Performance Computational Biology
- Alex Pothen

, Ananth Grama:
Introduction to HiCOMB Workshop. 251 - Radu Marculescu:

HiCOMB Keynote. 252
Session 1
- Cyrus Cousins, Christopher M. Pietras, Donna K. Slonim:

Scalable FRaC Variants: Anomaly Detection for Precision Medicine. 253-262 - Jae-Seung Yeom

, Tanya Kostova-Vassilevska, Peter D. Barnes Jr., David R. Jefferson, Tomas Oppelstrup:
Exploratory Modeling and Simulation of the Evolutionary Dynamics of Single-Stranded RNA Virus Populations. 263-272
Session 2
- Julia D. Warnke-Sommer, Hesham H. Ali:

Parallel NGS Assembly Using Distributed Assembly Graphs Enriched with Biological Knowledge. 273-282 - Vasudevan Rengasamy, Paul Medvedev, Kamesh Madduri:

Parallel and Memory-Efficient Preprocessing for Metagenome Assembly. 283-292
Session 3
- Philip E. Davis, Adam M. Terwilliger, David Zeitler, Gregory Wolffe:

Scalable Parallelization of a Markov Coalescent Genealogy Sampler. 293-302 - Mücahid Kutlu

, Gagan Agrawal, James S. Blachly:
Par-eXpress: A Tool for Analysis of Sequencing Experiments With Ambiguous Assignment of Fragments in Parallel. 303-310
EduPar: NSF/TCPP Workshop on Parallel and Distributed Computing Education
- Sheikh K. Ghafoor, Sushil K. Prasad

, Satish Puri
:
Introduction to EduPar Workshop. 311-313 - Jack J. Dongarra:

EduPar Keynote. 314
Session 1: Tools and Programming Environment
- Abdul Dakkak, Carl Pearson

, Cheng Li, Wen-mei W. Hwu:
RAI: A Scalable Project Submission System for Parallel Programming Courses. 315-322 - Brian Broll

, Ákos Lédeczi, Péter Völgyesi, János Sallai, Miklós Maróti
, Chris Vanags
:
Introducing Parallel and Distributed Computing to K12. 323-330 - Tianyi Bao, William B. Gardner:

Log Visualization Tool for Message-Passing Programming in Pilot. 331-338 - David A. Richie, James A. Ross:

I Can Has Supercomputer? A Novel Approach to Teaching Parallel and Distributed Computing Concepts Using a Meme-Based Programming Language. 339-345
Session 2: Pedagogy and Experience
- Joshua Eckroth:

Teaching Future Big Data Analysts: Curriculum and Experience Report. 346-351 - Jane Wyngaard

, Heather J. Lynch, Jaroslaw Nabrzyski, Allen Pope
, Shantenu Jha
:
Hacking at the Divide Between Polar Science and HPC: Using Hackathons as Training Tools. 352-359 - Vivek Sarkar, Max Grossman, Zoran Budimlic, Shams Imam:

Preparing an Online Java Parallel Computing Course. 360-366 - Jawwad Ahmed Shamsi:

A Laboratory Based Course on GPU Programming: Methods, Practices, and Lessons. 367-374
ParLearning: The 6th International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics
- Anand Panangadan:

Introduction to ParLearning Workshop. 375-376 - John Feo, Wei Tan:

ParLearning Keynotes. 377-378
Session 1
- Azalia Mirhoseini, Bita Darvish Rouhani, Ebrahim M. Songhori, Farinaz Koushanfar

:
ExtDict: Extensible Dictionaries for Data- and Platform-Aware Large-Scale Learning. 379-388 - Songze Li, Sucha Supittayapornpong, Mohammad Ali Maddah-Ali, Salman Avestimehr:

Coded TeraSort. 389-398 - Nitin A. Gawande, Joshua B. Landwehr, Jeff A. Daily, Nathan R. Tallent, Abhinav Vishnu, Darren J. Kerbyson:

Scaling Deep Learning Workloads: NVIDIA DGX-1/Pascal and Intel Knights Landing. 399-408 - Jing Chen, Jianbin Fang

, Weifeng Liu
, Tao Tang, Xuhao Chen
, Canqun Yang:
Efficient and Portable ALS Matrix Factorization for Recommender Systems. 409-418
Session 2
- Thomas P. Parnell, Celestine Dünner, Kubilay Atasu

, Manolis Sifalakis
, Haris Pozidis:
Large-Scale Stochastic Learning Using GPUs. 419-428 - Amaury Durand, Yanik Ngoko, Christophe Cérin:

Distributed and in-Situ Machine Learning for Smart-Homes and Buildings: Application to Alarm Sounds Detection. 429-432 - DeJiao Niu, Rui Xue, Tao Cai

, Hai Li
, Kingsley Effah, Hang Zhang:
The New Large-Scale RNNLM System Based on Distributed Neuron. 433-436 - Yuchen Qiao, Kazuma Hashimoto, Akiko Eriguchi, Haixia Wang, Dongsheng Wang, Yoshimasa Tsuruoka

, Kenjiro Taura
:
Cache Friendly Parallelization of Neural Encoder-Decoder Models Without Padding on Multi-core Architecture. 437-440
PDCO: 7th IEEE Workshop Parallel / Distributed Computing and Optimization
- Grégoire Danoy

, Didier El Baz
:
Introduction to PDCO Workshop. 441
Session 1: Scheduling I
- Laleh Ghalami, Daniel Grosu:

A Parallel Approximation Algorithm for Scheduling Parallel Identical Machines. 442-451 - Hadrien Croubois, Eddy Caron:

Communication Aware task Placement for Workflow Scheduling on DaaS-Based Cloud. 452-461 - Muhammad Qasim, Touseef Iqbal, Ehsan Ullah Munir, Nikos Tziritas, Samee U. Khan

, Laurence T. Yang:
Dynamic Mapping of Application Workflows in Heterogeneous Computing Environments. 462-471
Session 2: Scheduling II
- Jorge M. Cortés-Mendoza

, Andrei Tchernykh
, Igor V. Bychkov
, Alexander G. Feoktistov
, Pascal Bouvry
, Loic Didelot:
Load-Aware Strategies for Cloud-Based VoIP Optimization with VM Startup Prediction. 472-481 - David Pena

, Andrei Tchernykh
, Sergio Nesmachnow, Renzo Massobrio
, Alexander G. Feoktistov
, Igor V. Bychkov
:
Multiobjective Vehicle-type Scheduling in Urban Public Transport. 482-491
Session 3: Parallel Metaheuristics and Machine Learning
- Emmanuel Kieffer

, Grégoire Danoy
, Pascal Bouvry
, Anass Nagih
:
A new Co-evolutionary Algorithm Based on Constraint Decomposition. 492-500 - Javier A. Cruz-Lopez, Vincent Boyer, Didier El Baz

:
Training Many Neural Networks in Parallel via Back-Propagation. 501-509 - Amir Nakib

, Mohamed Hilia, Frederic Heliodore, El-Ghazali Talbi:
Design of Metaheuristic Based on Machine Learning: A Unified Approach. 510-518
Session 4: Graphs, Networks and Algorithms
- Raphael Kimmig, Henning Meyerhenke

, Darren Strash
:
Shared Memory Parallel Subgraph Enumeration. 519-529 - Julien Collet, Tanguy Sassolas, Yves Lhuillier, Renaud Sirdey

, Jacques Carlier:
Exploration of de Bruijn Graph Filtering for de novo Assembly Using GraphLab. 530-539 - He Li, Robson Eduardo De Grande, Azzedine Boukerche:

An Efficient CPP Solution for Resilience-Oriented SDN Controller Deployment. 540-549
Session 5: Parallel Algorithms
- Chris Rohlfs

, Mohamed Zahran
:
Optimal Bandwidth Selection for Kernel Regression Using a Fast Grid Search and a GPU. 550-556 - Numair Khan

, Mohamed Zahran
:
Space-Efficient Pointwise Computation of the Distance Transform on GPUs. 557-566 - Christian Herold, Olaf Krzikalla, Andreas Knüpfer

:
Optimizing One-Sided Communication of Parallel Applications Using Critical Path Methods. 567-576
GABB: Graph Algorithms Building Blocks
- Aydin Buluç

, Tim Mattson:
Introduction to GABB Workshop. 577 - Ümit V. Çatalyürek:

GABB Keynote. 578
Session 1
- Maryia Belova, Ming Ouyang:

Breadth-First Search with A Multi-Core Computer. 579-587 - George M. Slota, Sivasankaran Rajamanickam, Kamesh Madduri:

Order or Shuffle: Empirically Evaluating Vertex Order Impact on Parallel Graph Computations. 588-597 - Sayyad Nayyaroddeen, Mahak Gambhir, Kishore Kothapalli:

A Study of Graph Decomposition Algorithms for Parallel Symmetry Breaking. 598-607
Session 2
- Hayden Jananthan, Karia Dibert, Jeremy Kepner:

Constructing Adjacency Arrays from Incidence Arrays. 608-615 - Yangzihao Wang, Sean Baxter, John D. Owens:

Mini-Gunrock: A Lightweight Graph Analytics Framework on the GPU. 616-626 - Charles Colley, Junyuan Lin

, Xiaozhe Hu, Shuchin Aeron:
Algebraic Multigrid for Least Squares Problems on Graphs with Applications to HodgeRank. 627-636
Session 3
- David Ediger, James P. Fairbanks:

Deriving Streaming Graph Algorithms from Static Definitions. 637-642
Session 4
- Aydin Buluç

, Tim Mattson, Scott McMillan, José E. Moreira, Carl Yang:
Design of the GraphBLAS API for C. 643-652 - William Horn, Gabriel Tanase, Hao Yu, Pratap Pattnaik:

A Linear Algebra-Based Programming Interface for Graph Computations in Scala and Spark. 653-659
AsHES: The Seventh International Workshop on Accelerators and Hybrid Exascale Systems
- Sunita Chandrasekaran:

Introduction to AsHES Workshop. 660 - Tim Mattson:

AsHES Keynote. 661
Session 1: Programming Models and Runtime Systems
- Michael Wolfe, Seyong Lee

, Jungwon Kim
, Xiaonan Tian, Rengan Xu, Sunita Chandrasekaran, Barbara M. Chapman:
Implementing the OpenACC Data Model. 662-672 - Sergio Pino, Lori L. Pollock, Sunita Chandrasekaran:

Exploring Translation of OpenMP to OpenACC 2.5: Lessons Learned. 673-682 - Ivy Bo Peng

, Roberto Gioiosa, Gokcen Kestor
, Pietro Cicotti, Erwin Laure
, Stefano Markidis:
Exploring the Performance Benefit of Hybrid Memory System on HPC Environments. 683-692
Session 2: Algorithms
- Mehmet Deveci, Christian Trott, Sivasankaran Rajamanickam:

Performance-Portable Sparse Matrix-Matrix Multiplication for Many-Core Architectures. 693-702 - Antonio Gómez-Iglesias

, Miguel Cárdenas-Montes
:
Time and Energy to Solution Evaluation for the Three-Point Angular Correlation Function. 703-712 - Kaixi Hou, Wu-chun Feng, Shuai Che:

Auto-Tuning Strategies for Parallelizing Sparse Matrix-Vector (SpMV) Multiplication on Multi- and Many-Core Processors. 713-722
Session 3: Scheduling and Architectures
- Max Grossman, Vivek Kumar, Nick Vrvilo, Zoran Budimlic, Vivek Sarkar:

A Pluggable Framework for Composable HPC Scheduling Libraries. 723-732 - Sandra Catalán

, Rafael Rodríguez-Sánchez
, Enrique S. Quintana-Ortí
, José R. Herrero:
Static Versus Dynamic Task Scheduling of the Lu Factorization on ARM big. LITTLE Architectures. 733-742 - Zhigeng Xu, James Lin, Satoshi Matsuoka:

Benchmarking SW26010 Many-Core Processor. 743-752
HIPS: 22nd International Workshop on High Level Programming Models and Supportive Environments
- Bo Wu, Andreas Knüpfer:

Introduction to HIPS Workshop. 753-754 - Zizhong Chen:

HIPS Keynote. 755
Session 1
- Dana Akhmetova, Roman Iakymchuk

, Örjan Ekeberg, Erwin Laure
:
Performance Study of Multithreaded MPI and OpenMP Tasking in a Large Scientific Code. 756-765 - Solmaz Salehian, Jiawen Liu, Yonghong Yan:

Comparison of Threading Programming Models. 766-774 - Mostafa Mehrabi, Nasser Giacaman, Oliver Sinnen

:
Annotation-Based Parallelization of Java Code. 775-784
Session 2
- Alexis Engelke

, Josef Weidendorfer:
Using LLVM for Optimized Lightweight Binary Re-Writing at Runtime. 785-794 - Nathan Zhang, Michael B. Driscoll, Charles Markley, Samuel Williams

, Protonu Basu, Armando Fox:
Snowflake: A Lightweight Portable Stencil DSL. 795-804 - Pavel Shamis, M. Graham Lopez

, Gilad Shainer:
Enabling One-Sided Communication Semantics on ARM. 805-813
Session 3
- Jari-Matti Mäkelä

, Martti Forsell, Ville Leppänen
:
Towards a Language Framework for Thick Control Flows. 814-823 - Benjamin J. L. Wang, Uwe R. Zimmer:

Pure Concurrent Programming. 824-831
APDCM: 19th Workshop on Advances in Parallel and Distributed Computational Models
- Oscar H. Ibarra, Koji Nakano:

Introduction to APDCM Workshop. 832 - Hong Shen:

APDCM Keynote. 833
Session 1: Distributed Computing
- Aisha Aljohani

, Gokarna Sharma:
Complete Visibility for Mobile Agents with Lights Tolerating a Faulty Agent. 834-843 - Yonghwan Kim

, Haruka Ohno, Yoshiaki Katayama, Toshimitsu Masuzawa:
A Self-Stabilizing Algorithm for Constructing (1, 1)-Maximal Directed Acyclic Graph. 844-853 - Jonas Posner, Claudia Fohry:

Fault Tolerance for Cooperative Lifeline-Based Global Load Balancing in Java with APGAS and Hazelcast. 854-863 - Debarshi Dutta, Meher Chaitanya, Kishore Kothapalli, Debajyoti Bera:

Applications of Ear Decomposition to Efficient Heterogeneous Algorithms for Shortest Path/Cycle Problems. 864-873
Session 2: Scheduling and Hardware Models
- Guillaume Aupy, Anne Benoit

, Loïc Pottier
, Padma Raghavan, Yves Robert
, Manu Shantharam:
Co-Scheduling Algorithms for Cache-Partitioned Systems. 874-883 - Loris Marchal

, Samuel McCauley, Bertrand Simon, Frédéric Vivien
:
Minimizing I/Os in Out-of-Core Task Tree Scheduling. 884-893 - Basem Assiri, Costas Busch:

Approximate Count and Queue Objects in Transactional Memory. 894-903 - Max Plauth

, Christoph Sterz, Felix Eberhardt, Frank Feinbube, Andreas Polze:
Assessing NUMA Performance Based on Hardware Event Counters. 904-913
Session 3: Parallel Computing
- Daniel Dauwe, Sudeep Pasricha, Anthony A. Maciejewski

, Howard Jay Siegel:
An Analysis of Resilience Techniques for Exascale Computing Platforms. 914-923 - Tomoki Kawamura, Yoneda Kazunori, Takashi Yamazaki, Takashi Iwamura, Masahiro Watanabe, Yasushi Inoguchi

:
A Compression Method for Storage Formats of a Sparse Matrix in Solving the Large-Scale Linear Systems. 924-931 - Takahiro Nishimura, Jacir Luiz Bordim, Yasuaki Ito, Koji Nakano

:
Accelerating the Smith-Waterman Algorithm Using Bitwise Parallel Bulk Computation Technique on GPU. 932-941 - Yi Yang, Yasuaki Ito, Koji Nakano

:
Photomosaic Generation by Rearranging Subimages, with GPU Acceleration. 942-951
HPPAC: 13th Workshop on High-Performance, Power-Aware Computing
- Shuaiwen Leon Song, Richard W. Vuduc

:
HPPAC Workshop Introduction. 952 - Kirk W. Cameron:

HPPAC Keynote Talk. 953
Session 1
- Hayk Shoukourian, Torsten Wilde, Detlef Labrenz, Arndt Bode:

Using Machine Learning for Data Center Cooling Infrastructure Efficiency Prediction. 954-963 - Wissam Abu Ahmad, Andrea Bartolini

, Francesco Beneventi, Luca Benini
, Andrea Borghesi
, Marco Cicala, Privato Forestieri, Cosimo Gianfreda, Daniele Gregori, Antonio Libri, Filippo Spiga, Simone Tinti:
Design of an Energy Aware Petaflops Class High Performance Cluster Based on Power Architecture. 964-973 - Aniruddha Marathe

, Ghaleb Abdulla, Barry L. Rountree, Kathleen Shoga:
Towards a Unified Monitoring Framework for Power, Performance and Thermal Metrics: A Case Study on the Evaluation of HPC Cooling Systems. 974-983
Session 2
- Xinning Hui, Zhihui Du

, Jason Liu
, Hongyang Sun, Yuxiong He, David A. Bader
:
When Good Enough Is Better: Energy-Aware Scheduling for Multicore Servers. 984-993 - Shouq Alsubaihi, Jean-Luc Gaudiot:

A Runtime Workload Distribution with Resource Allocation for CPU-GPU Heterogeneous Systems. 994-1003
Session 3
- Vladimir A. Mironov

, Alexander A. Moskovsky
, Yuri Alexeev:
Power Measurements of Hartree-Fock Algorithms Using Different Storage Devices. 1004-1011 - Mohak Chadha, Thomas Ilsche, Mario Bielert, Wolfgang E. Nagel:

A Statistical Approach to Power Estimation for x86 Processors. 1012-1019
HPBDC: 3rd IEEE International Workshop on High-Performance Big Data Computing
- Xiaoyi Lu, Jianfeng Zhan, Dhabaleswar K. Panda:

Introduction to HPBDC Workshop. 1020
Session 1: High-Performance Graph Processing
- Manu Shantharam, Keita Iwabuchi, Pietro Cicotti, Laura Carrington, Maya B. Gokhale, Roger A. Pearce:

Performance Evaluation of Scale-Free Graph Algorithms in Low Latency Non-volatile Memory. 1021-1028 - Vito Giovanni Castellana, Marco Minutoli

, Shreyansh Bhatt, Khushbu Agarwal, Arthur Bleeker, John Feo, Daniel G. Chavarría-Miranda, David J. Haglin:
High-Performance Data Analytics Beyond the Relational and Graph Data Models with GEMS. 1029-1038 - Peter M. Kogge:

Graph Analytics: Complexity, Scalability, and Architectures. 1039-1047
Session 2: Benchmarking and Performance Analysis
- Saba Sehrish

, Jim Kowalkowski, Marc F. Paterno
:
Spark and HPC for High Energy Physics Data Analyses. 1048-1057 - Houliang Qi, Xu Chang, Xingwu Liu, Li Zha:

The Consistency Analysis of Secondary Index on Distributed Ordered Tables. 1058-1067 - Xinhui Tian, Shaopeng Dai, Zhihui Du

, Wanling Gao, Rui Ren, Yaodong Cheng
, Zhifei Zhang, Zhen Jia, Peijian Wang, Jianfeng Zhan:
BigDataBench-S: An Open-Source Scientific Big Data Benchmark Suite. 1068-1077 - Paras Jain, Chirag Tailor, Sam Ford, Liexiao Ding, Michael Phillips, Fang (Cherry) Liu

, Nagi Gebraeel, Duen Horng Chau
:
Scalable Architecture for Anomaly Detection and Visualization in Power Generating Assets. 1078-1082
CHIUW: The Fourth Annual Chapel Implementers and Users Workshop
- Tom MacDonald, Michael Ferguson:

Introduction to CHIUW Workshop. 1083-1084 - Jonathan Dursi:

CHIUW Keynote. 1085 - Jyothi Krishna V. S

, Vassily Litvinov:
Identifying Use-After-Free Variables in Fire-and-Forget Tasks. 1086-1094 - Ariful Azad, Aydin Buluç

:
Towards a GraphBLAS Library in Chapel. 1095-1104 - Engin Kayraklioglu, Wo Chang, Tarek A. El-Ghazawi:

Comparative Performance and Optimization of Chapel in Modern Manycore Architectures. 1105-1114
PDSEC: 18th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing
- Peter E. Strazdins, Keita Teranishi, Raphaël Couturier

, Joseph Antony, Thomas Rauber, Gudula Rünger, Laurence T. Yang:
Introduction to PDSEC Workshop. 1115-1116 - Pavan Balaji:

PDSEC Keynote. 1117
Session 1: Best Paper
- Ichitaro Yamazaki, Mark Hoemmen, Piotr Luszczek, Jack J. Dongarra:

Improving Performance of GMRES by Reducing Communication and Pipelining Global Collectives. 1118-1127
Session 2: Linear Algebra
- Bryce Adelstein-Lelbach, Hans Johansen

, Samuel Williams
:
Simultaneously Solving Swarms of Small Sparse Systems on SIMD Silicon. 1128-1137 - Gregoire Pichon

, Eric Darve, Mathieu Faverge, Pierre Ramet
, Jean Roman:
Sparse Supernodal Solver Using Block Low-Rank Compression. 1138-1147 - José Ignacio Aliaga

, Rocío Carratalá-Sáez
, Ronald Kriemann, Enrique S. Quintana-Ortí
:
Task-Parallel LU Factorization of Hierarchical Matrices Using OmpSs. 1148-1157
Session 3: Applications
- Ramachandran Kodanganallur Narayanan, Kamesh Madduri:

Parallel Particle-in-Cell Performance Optimization: A Case Study of Electrospray Simulation. 1158-1167 - Yann Barsamian

, Sever A. Hirstoaga, Eric Violard:
Efficient Data Structures for a Hybrid Parallel and Vectorized Particle-in-Cell Code. 1168-1177 - Hongzhang Shan, Samuel Williams

, Calvin W. Johnson, Kenneth S. McElvain:
A Locality-Based Threading Algorithm for the Configuration-Interaction Method. 1178-1187 - Yunfan Xiao, Min Huang, Qinghai Miao, Jun Xiao, Ying Wang:

Architecting the Discontinuous Deformation Analysis Method Pipeline on the GPU. 1188-1197
Session 4: Parallel Techniques
- Zahra Khatami, Hartmut Kaiser

, J. Ramanujam
:
Redesigning OP2 Compiler to Use HPX Runtime Asynchronous Techniques. 1198-1207 - Thomas Marrinan, Joseph A. Insley, Silvio Rizzi, Francois Tessier

, Michael E. Papka
:
Automated Dynamic Data Redistribution. 1208-1215 - Lina Yu, Hongfeng Yu, Hong Jiang, Jun Wang

:
An Application-Aware Data Replacement Policy for Interactive Large-Scale Scientific Visualization. 1216-1225 - Jackson DeBuhr, Bo Zhang, Luke Dalessandro:

Scalable Hierarchical Multipole Methods Using an Asynchronous Many-Tasking Runtime System. 1226-1234
JSSPP: 21st Workshop on Job Scheduling Strategies for Parallel Processing
- Walfredo Cirne, Narayan Desai, Dalibor Klusácek:

Introduction to JSSPP Workshop. 1235-1236
DPDNS: 22nd IEEE Workshop on Dependable Parallel, Distributed and Network-Centric Systems
- Dimiter R. Avresky, Erik Maehle:

Introduction to DPDNS Workshop. 1237
Session 1
- Satoshi Fujita:

Reliability Calculation of P2P Streaming Systems with Bottleneck Links. 1238-1244 - Chaoyang Li, Anu G. Bourgeois:

Lifetime and Full-View Coverage Guarantees Through Distributed Algorithms in Camera Sensor Networks. 1245-1250
Session 2
- Jason St. John, Thomas J. Hacker:

A Small-Scale Testbed for Large-Scale Reliable Computing. 1251-1258 - Santosh Aditham, Nagarajan Ranganathan, Srinivas Katkoori

:
LSTM-Based Memory Profiling for Predicting Data Attacks in Distributed Big Data Systems. 1259-1267 - Salvatore Distefano, Samuele Rodi:

An Outlook on Volunteer and Croudsourcing Based Computing. 1268-1273 - Rizwan A. Ashraf

, Roberto Gioiosa, Gokcen Kestor
, Ronald F. DeMara
:
Exploring the Effect of Compiler Optimizations on the Reliability of HPC Applications. 1274-1283
IPDRM: Second Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware
- Shuaiwen Leon Song, Torsten Hoefler:

IPDRM Workshop Introduction. 1284
Session 1
- Jaume Bosch

, Xubin Tan
, Carlos Álvarez
, Daniel Jiménez-González
, Xavier Martorell, Eduard Ayguadé:
Characterizing and Improving the Performance of Many-Core Task-Based Parallel Programming Runtimes. 1285-1292 - Kavitha Chandrasekar

, Xiang Ni, Laxmikant V. Kalé:
A Memory Heterogeneity-Aware Runtime System for Bandwidth-Sensitive HPC Applications. 1293-1300 - Alexis Champsaur, Jay F. Lofstead

, Jai Dayal, Matthew Wolf, Greg Eisenhauer, Patrick M. Widener
, Ada Gavrilovska:
SmartBlock: An Approach to Standardizing In Situ Workflow Components. 1301-1308
Session 2
- John Jenkins, Galen M. Shipman, Jamaludin Mohd-Yusof

, Kipton Barros, Philip H. Carns, Robert B. Ross
:
A Case Study in Computational Caching Microservices for HPC. 1309-1316 - Zahra Khatami, Sungpack Hong, Jinsoo Lee, Siegfried Depner, Hassan Chafi, J. Ramanujam

, Hartmut Kaiser
:
A Load-Balanced Parallel and Distributed Sorting Algorithm Implemented with PGX.D. 1317-1324
Session 3
- Carlos Rosales, Antonio Gómez-Iglesias

, Si Liu, Feng Chen, Lei Huang, Hang Liu, Antia Lamas-Linares, John Cazes:
Performance Prediction of HPC Applications on Intel Processors. 1325-1332 - Stefanos Gerangelos, Nectarios Koziris:

vPHI: Enabling Xeon Phi Capabilities in Virtual Machines. 1333-1340
iWAPT: 12th International Workshop on Automatic Performance Tuning
- Osni Marques, Reiji Suda:

Introduction to iWAPT Workshop. 1341
Session 1: New Methodology of Auto-Tuning
- Wilson Feng, Tarek S. Abdelrahman:

A Sampling Based Strategy to Automatic Performance Tuning of GPU Programs. 1342-1349 - Tianyi David Han, Tarek S. Abdelrahman:

Use of Synthetic Benchmarks for Machine-Learning-Based Performance Auto-Tuning. 1350-1361
Session 2: Auto-Tuning Software and Environment
- Tharindu Rusira

, Mary W. Hall
, Protonu Basu:
Automating Compiler-Directed Autotuning for Phased Performance Behavior. 1362-1371 - Hiroyuki Takizawa

, Daichi Sato, Shoichi Hirasawa, Daisuke Takahashi
:
A Customizable Auto-Tuning Scenario with User-Defined Code Transformations. 1372-1378 - Philip Pfaffe, Martin Peter Tillmann

, Sigmar Walter, Walter F. Tichy:
Online-Autotuning in the Presence of Algorithmic Choice. 1379-1388
Session 3: Case-Study of Auto-Tuning and Optimization
- Athena Elafrou, Georgios I. Goumas, Nectarios Koziris:

Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Intel Xeon Phi. 1389-1398 - Takahiro Katagiri, Satoshi Ohshima

, Masaharu Matsumoto:
Auto-Tuning on NUMA and Many-Core Environments with an FDM Code. 1399-1407 - Mark Gates

, Jakub Kurzak, Piotr Luszczek, Yu Pei, Jack J. Dongarra:
Autotuning Batch Cholesky Factorization in CUDA with Interleaved Layout of Matrices. 1408-1417
Session 4: Scientific Applications by Auto-Tuning
- Susumu Yamada, Toshiyuki Imamura, Takuya Ina, Narimasa Sasa, Yasuhiro Idomura

, Masahiko Machida:
Quadruple-Precision BLAS Using Bailey's Arithmetic with FMA Instruction: Its Performance and Applications. 1418-1425 - Masayoshi Mochizuki, Akihiro Fujii, Teruo Tanaka:

Fast Multidimensional Performance Parameter Estimation with Multiple One-Dimensional d-Spline Parameter Search. 1426-1433 - Luigi Nardi, Bruno Bodin

, Sajad Saeedi, Emanuele Vespa, Andrew J. Davison, Paul H. J. Kelly:
Algorithmic Performance-Accuracy Trade-off in 3D Vision Applications Using HyperMapper. 1434-1443
ParSocial: 2nd IEEE Workshop on Parallel and Distributed Processing for Computational Social System
- Eunice E. Santos, John Korah:

Introduction to ParSocial Workshop. 1444-1445 - Boleslaw K. Szymanski

:
ParSocial Keynote. 1446
Session 1
- Xiaoyan Lu, Boleslaw K. Szymanski

:
Predicting Viral News Events in Online Media. 1447-1456 - Julia Buwaya, José D. P. Rolim:

Mobile Crowdsensing from a Selfish Routing Perspective. 1457-1463 - George Cybenko:

Parallel Computing for Machine Learning in Social Network Analysis. 1464-1471
Session 2
- Gennaro Cordasco

, Carmine Spagnuolo
, Vittorio Scarano
:
Work Partitioning on Parallel and Distributed Agent-Based Simulation. 1472-1481 - Humayun Kabir

, Kamesh Madduri:
Parallel k-Core Decomposition on Multicore Platforms. 1482-1491 - Eric Tatara, Nicholson T. Collier, Jonathan Ozik, Charles M. Macal:

Endogenous Social Networks from Large-Scale Agent-Based Models. 1492-1499
Session 3
- Sindhuja Parimalarangan, George M. Slota, Kamesh Madduri:

Fast Parallel Graph Triad Census and Triangle Counting on Shared-Memory Platforms. 1500-1509 - Eunice E. Santos, John Korah, Vairavan Murugappan

, Suresh Subramanian
:
Efficient Anytime Anywhere Algorithms for Vertex Additions in Large and Dynamic Graphs. 1510-1519 - Wen-Jing Hsu, You Lu, Zhuo Qi Lee:

Accelerating Topic Exploration of Multi-Dimensional Documents. 1520-1527
BigDataEco: Big Data Regional Innovation Hubs and Spokes Workshop
- Chaitan Baru, Fen Zhao, Joanna Chan:

Introduction to BigDataEco Workshop. 1528
GraML: First Workshop on the Intersection of Graph Algorithms and Machine Learning
- Antonino Tumeo

, Mahantesh Halappanavar, John Feo:
Introduction to GraML Workshop. 1529-1530 - Sujith Ravi:

GraML Keynote. 1531 - Hristo N. Djidjev

, Daniel O'Malley, Hari S. Viswanathan, Jeffrey D. Hyman
, Satish Karra
, Gowri Srinivasan:
Learning on Graphs for Predictions of Fracture Propagation, Flow and Transport. 1532-1539 - Hongyuan Zhan, Kamesh Madduri:

Analyzing Community Structure in Networks. 1540-1549 - Ronald D. Hagan, Charles A. Phillips, Bradley J. Rhodes, Michael A. Langston:

Compound Analytics: Templates for Integrating Graph Algorithms and Machine Learning. 1550-1556
EMBRACE: Evolvable Methods for Benchmarking Realism and Community Engagement
- David A. Bader

:
Introduction to EMBRACE Workshop. 1557 - Torsten Hoefler:

EMBRACE Keynote. 1558
REPPAR: Workshop on Reproducibility in Parallel Computing
- Sascha Hunold

, Arnaud Legrand, Lucas Nussbaum:
Introduction to REPPAR Workshop. 1559 - Todd Gamblin:

REPPAR Keynote. 1560
Session 1
- Ivo Jimenez

, Michael Sevilla, Noah Watkins, Carlos Maltzahn, Jay F. Lofstead
, Kathryn M. Mohror
, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau:
The Popper Convention: Making Reproducible Systems Evaluation Practical. 1561-1570 - Lucas Nussbaum:

Towards Trustworthy Testbeds Thanks to Throughout Testing. 1571-1578 - Franziska Hoffeins, Florina M. Ciorba

, Ioana Banicescu:
Examining the Reproducibility of Using Dynamic Loop Scheduling Techniques in Scientific Applications. 1579-1587
Session 2
- Luka Stanisic, Lucas Mello Schnorr, Augustin Degomme, Franz C. Heinrich, Arnaud Legrand, Brice Videau:

Characterizing the Performance of Modern Architectures Through Opaque Benchmarks: Pitfalls Learned the Hard Way. 1588-1597 - Roman Iakymchuk

, Enrique S. Quintana-Ortí
, Erwin Laure
, Stef Graillat:
Towards Reproducible Blocked LU Factorization. 1598-1607

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














