default search action
32nd IPDPS 2018: Vancouver, BC, Canada - Workshops
- 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPS Workshops 2018, Vancouver, BC, Canada, May 21-25, 2018. IEEE Computer Society 2018, ISBN 978-1-5386-5555-9
HCW: Heterogeneity in Computing Workshop
- Alexey L. Lastovetsky, Sudeep Pasricha:
Introduction to HCW 2018. 1 - Behrooz A. Shirazi:
Message from the HCW Steering Committee Chair. 2 - Alexey L. Lastovetsky:
Message from the HCW General Chair. 3 - Sudeep Pasricha:
Message from the HCW Program Committee Chair. 4 - Manish Parashar:
HCW 2018 Keynote Talk 1. 5 - Ümit V. Çatalyürek:
HCW 2018 Keynote Talk 2. 6
Session 1: Reconfigurable and Cloud Systems
- Leslie Barron, Tarek S. Abdelrahman:
User-Transparent Translation of Machine Instructions to Programmable Hardware. 7-14 - Yves Caniou, Eddy Caron, Aurélie Kong Win Chang, Yves Robert:
Budget-Aware Scheduling Algorithms for Scientific Workflows with Stochastic Task Weights on Heterogeneous IaaS Cloud Platforms. 15-26 - Zheming Jin, Hal Finkel:
Optimizing Parallel Reduction on OpenCL FPGA Platform - A Case Study of Frequent Pattern Compression. 27-35
Session 2: Workload Scheduling and Architecture Analysis
- Massinissa Ait Aba, Lilia Zaourar, Alix Munier:
Approximation Algorithm for Scheduling Applications on Hybrid Multi-core Machines with Communications Delays. 36-45 - Sean Pennefather, Karen L. Bradshaw, Barry Irwin:
Exploration and Design of a Synchronous Message Passing Framework for a CPU-NPU Heterogeneous Architecture. 46-56 - Fei Lei, Lei Yu, Bing Shao, Fei Teng, Bo Zhou:
Large Scale Data Centers Simulation Based on Baseline Test Model. 57-68 - Anke Kreuzer, Norbert Eicker, Jorge Amaya, Estela Suarez:
Application Performance on a Cluster-Booster System. 69-78
RAW: Reconfigurable Architectures Workshop
- Marco D. Santambrogio, Diana Goehringer, Dirk Stroobandt, Ken Eguro:
Introduction to RAW 2018. 79-80 - Jürgen Becker, Viktor K. Prasanna, Markus Weimer, Wayne Luk, Kaveh Aasaraai, Derek Chiou:
RAW 2018 Invited Talks. 81-82
Session 1: Platforms and Memory
- Pekka Jääskeläinen, Aleksi Tervo, Guillermo Payá Vayá, Timo Viitanen, Nicolai Behmann, Jarmo Takala, Holger Blume:
Transport-Triggered Soft Cores. 83-90 - Francesco Peverelli, Marco Rabozzi, Emanuele Del Sozzo, Marco D. Santambrogio:
OXiGen: A Tool for Automatic Acceleration of C Functions Into Dataflow FPGA-Based Kernels. 91-98 - William E. Allcock, Bennett Bernardoni, Colleen Bertoni, Neil Getty, Joseph A. Insley, Michael E. Papka, Silvio Rizzi, Brian R. Toonen:
RAM as a Network Managed Resource. 99-106 - Catalin Bogdan Ciobanu, Giulio Stramondo, Cees de Laat, Ana Lucia Varbanescu:
MAX-PolyMem: High-Bandwidth Polymorphic Parallel Memories for DFEs. 107-114
Session 2: Applications
- Enrico Reggiani, Giuseppe Natale, Carlo Moroni, Marco D. Santambrogio:
An FPGA-Based Acceleration Methodology and Performance Model for Iterative Stencils. 115-122 - Hamid Reza Zohouri, Artur Podobas, Satoshi Matsuoka:
High-Performance High-Order Stencil Computation on FPGAs Using OpenCL. 123-130 - Alessandro Comodi, Davide Conficconi, Alberto Scolari, Marco D. Santambrogio:
TiReX: Tiled Regular eXpression Matching Architecture. 131-137 - Artur Podobas, Satoshi Matsuoka:
Hardware Implementation of POSITs and Their Application in FPGAs. 138-145
Session 3: Machine Learning 1
- Luca Cerina, Giuseppe Franco, Pierandrea Cancian, Marco D. Santambrogio:
Robustness of Surface EMG Classifiers with Fixed-Point Decomposition on Reconfigurable Architecture. 146-153 - Florian Kastner, Benedikt Janßen, Frederik Kautz, Michael Hübner, Giulio Corradi:
Hardware/Software Codesign for Convolutional Neural Networks Exploiting Dynamic Partial Reconfiguration on PYNQ. 154-161 - Chaim Baskin, Natan Liss, Evgenii Zheltonozhskii, Alexander M. Bronstein, Avi Mendelson:
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform. 162-169
Session 4: Machine Learning 2
- Niccolo Raspa, Giuseppe Natale, Marco Bacis, Marco D. Santambrogio:
A Framework with Cloud Integration for CNN Acceleration on FPGA Devices. 170-177 - Daniel Holanda Noronha, Philip Heng Wai Leong, Steven J. E. Wilton:
Kibo: An Open-Source Fixed-Point Tool-kit for Training and Inference in FPGA-Based Deep Learning Networks. 178-185 - Menbere Kina Tekleyohannes, Christian Weis, Norbert Wehn, Martin Klein, Michael Siegrist:
A Reconfigurable Accelerator for Morphological Operations. 186-193
Session 5: Short Papers 1
- Syed Waqar Nabi, Wim Vanderbauwhede:
MP-STREAM: A Memory Performance Benchmark for Design Space Exploration on Heterogeneous HPC Devices. 194-197 - Luca Stornaiuolo, Alberto Parravicini, Donatella Sciuto, Marco D. Santambrogio:
FIDA: A Framework to Automatically Integrate FPGA Kernels Within Data-Science Applications. 198-201 - Tien Thanh Nguyen, Mathieu Thevenin, Anthony Mouraud, Gwenolé Corre, Olivier Pasquier, Sébastien Pillement:
High-Level Reliability Evaluation of Reconfiguration-Based Fault Tolerance Techniques. 202-205 - Florian Oszwald, Jürgen Becker, Philipp Obergfell, Matthias Traub:
Dynamic Reconfiguration for Real-Time Automotive Embedded Systems in Fail-Operational Context. 206-209
Session 6: Short Papers 2
- Peter Rouget, Benoît Badrignans, Pascal Benoit, Lionel Torres:
FPGA Implementation of Pattern Matching for Industrial Control Systems. 210-213 - Lorenzo Di Tucci, Davide Conficconi, Alessandro Comodi, Steven A. Hofmeyr, David Donofrio, Marco D. Santambrogio:
A Parallel, Energy Efficient Hardware Architecture for the merAligner on FPGA Using Chisel HCL. 214-217 - Ayan Palchaudhuri, Anindya Sundar Dhar:
Redundant Binary to Two's Complement Converter on FPGAs Through Fabric Aware Scan Based Encoding Approach for Fault Localization Support. 218-221 - Matthias Goebel, Ilja Behnke, Ahmed Elhossini, Ben H. H. Juurlink:
An Application-Specific Memory Management Unit for FPGA-SoCs. 222-225
HiCOMB: High Performance Computational Biology
- Srinivas Aluru, David A. Bader, Paul Medvedev:
Introduction to HiCOMB 2018. 226 - James Taylor:
HiCOMB Keynote 1. 227 - Onur Mutlu:
HICOMB Keynote 2. 228 - Golnar Sheikhshab, Elizabeth Starks, Aly Karsan, Readman Chiu, Anoop Sarkar, Inanç Birol:
GraphNER: Using Corpus Level Similarities and Graph Propagation for Named Entity Recognition. 229-238 - William Arndt:
Modifying HMMER3 to Run Efficiently on the Cori Supercomputer Using OpenMP Tasking. 239-246 - Daniel L. Ayres, Michael P. Cummings:
Rerooting Trees Increases Opportunities for Concurrent Computation and Results in Markedly Improved Performance for Phylogenetic Inference. 247-256 - Raja Appuswamy, Jacques Fellay, Nimisha Chaturvedi:
Sequence Alignment Through the Looking Glass. 257-266
GABB: Graph Algorithms Building Blocks
- Tim Mattson:
Introduction to GABB 2018. 267
Keynote Session
- John R. Gilbert:
Graph Algorithms in the Language of Linear Algebra: How Did We Get Here, and Where Do We Go Next? 268 - Shad Kirmani, Kamesh Madduri:
Spectral Graph Drawing: Building Blocks and Performance Analysis. 269-277
Session 1: Generating Graphs with Known Properties
- Anil Kumar S. Vullikanti:
Parallel Generation of Large-Scale Random Graphs. 278 - Jeremy Kepner, Siddharth Samsi, William Arcand, David Bestor, Bill Bergeron, Tim Davis, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Anna Klein, Peter Michaleas, Roger Pearce, Lauren Milechin, Julie Mullen, Andrew Prout, Antonio Rosa, Geoffrey Sanders, Charles Yee, Albert Reuther:
Design, Generation, and Validation of Extreme Scale Power-Law Graphs. 279-286 - Geoffrey Sanders, Roger Pearce, Timothy La Fond, Jeremy Kepner:
On Large-Scale Graph Generation with Validation of Diverse Triangle Statistics at Edges and Vertices. 287-296
Session 2: GraphBLAS Implementations
- Scott McMillan:
Patterns of GraphBLAS Algorithms: Tales from the Trenches. 297 - José E. Moreira, Manoj Kumar, William P. Horn:
Implementing the GraphBLAS C API. 298-309 - Jesse Chamberlin, Marcin Zalewski, Scott McMillan, Andrew Lumsdaine:
PyGB: GraphBLAS DSL in Python with Dynamic Compilation Into Efficient C++. 310-319
Session 3: Graph Building Blocks Community Meeting
- Chris Long:
A Survey of Modern Analysis on Graphs: Open Problems. 320
EduPar: NSF/TCPP W. on Parallel and Distributed Computing Education
- Martina Barnas, Sushil K. Prasad, Satish Puri:
Introduction to EduPar 2018. 321-322 - Alexandru Iosup:
EduPar 2018 Keynote. 323
EduPar Session 1
- Marin Abernethy, Oliver Sinnen, Joel C. Adams, Giuseppe De Ruvo, Nasser Giacaman:
ParallelAR: An Augmented Reality App and Instructional Approach for Learning Parallel Programming Scheduling Concepts. 324-331 - Devangi N. Parikh, Jianyu Huang, Margaret E. Myers, Robert A. van de Geijn:
Learning from Optimizing Matrix-Matrix Multiplication. 332-339 - Emanuel Buzek, Martin Krulis:
An Entertaining Approach to Parallel Programming Education. 340-346 - Sunny Raj, Sumit Kumar Jha:
Predicting Success in Undergraduate Parallel Programming via Probabilistic Causality Analysis. 347-352
EduPar Session 2
- Jawwad Ahmed Shamsi, Syed Zain ul Hassan, Narmeen Zakaria Bawany, Nausheen Shoaib:
A Comprehensive Course on Big Data for Undergraduate Students. 353-360 - Erik Saule:
Experiences on Teaching Parallel and Distributed Computing for Undergraduates. 361-368 - Mohammad Amin Kuhail, Spencer Cook, Joshua W. Neustrom, Praveen Rao:
Teaching Parallel Programming with Active Learning. 369-376 - Debzani Deb, Sebastian Cousins, M. Muztaba Fuad:
Teaching Big Data and Cloud Computing: A Modular Approach. 377-383
HIPS: High Level Programming Models and Supportive Environments
- Karl Fuerlinger, Philip C. Roth:
Introduction to HIPS 2018. 384-385 - Christian Trott:
HIPS 2018 Keynote. 386
Session 1: Tool Support for Parallel Programming Environments
- Hartmut Mix, Christian Herold, Matthias Weber:
Visualization of Multi-layer I/O Performance in Vampir. 387-394 - Simone Atzeni, Ganesh Gopalakrishnan:
An Operational Semantic Basis for Building an OpenMP Data Race Checker. 395-404 - Mostafa Mehrabi, Nasser Giacaman, Oliver Sinnen:
Unobtrusive Support for Asynchronous GUI Operations with Java Annotations. 405-414
Session 2: Distributed Memory and Task-Based Programming
- Hongbo Li, Zizhong Chen, Rajiv Gupta, Min Xie:
Non-intrusively Avoiding Scaling Problems in and out of MPI Collectives. 415-424 - Bernie van Veen, Sung-Shik Jongmans:
Modular Programming of Synchronization and Communication Among Tasks in Parallel Programs. 425-435 - Matthew Whitlock, Hemanth Kolla, Sean Treichler, Philippe P. Pébay, Janine C. Bennett:
Scalable Collectives for Distributed Asynchronous Many-Task Runtimes. 436-445
HPBDC: High-Performance Big Data, Deep Learning, and Cloud Computing
- Xiaoyi Lu, Jianfeng Zhan, Dhabaleswar K. Panda:
Introduction to HPBDC 2018. 446 - Geoffrey C. Fox:
HPBDC 2018 Keynote. 447
Regular Paper Session 1: High-Performance Data Processing Systems
- Felix Seibert, Mathias Peters, Florian Schintke:
Improving I/O Performance Through Colocating Interrelated Input Data and Near-Optimal Load Balancing. 448-457 - Tanuj kr Aasawat, Tahsin Reza, Matei Ripeanu:
How Well do CPU, GPU and Hybrid Graph Processing Frameworks Perform? 458-466 - Can Wu, Xiaoning Wang, Haili Xiao, Rongqiang Cao, Yining Zhao, Xuebin Chi:
EASIS: An Optimized Information Service for High Performance Computing Environment. 467-476
Regular Paper Session 2: High-Performance Data Processing Applications
- Michael Gowanlock, Ben Karsin:
GPU Accelerated Self-Join for the Distance Similarity Metric. 477-486 - Jun Chen, Peigang Zou:
Implementing a Parallel Graph Clustering Algorithm with Sparse Matrix Computation. 487-496 - Christopher Harrison, Sündüz Keles, Rebecca Hudson, Sunyoung Shin, Inês Dutra:
atSNPInfrastructure, a Case Study for Searching Billions of Records While Providing Significant Cost Savings over Cloud Providers. 497-506
Short Paper Session 1: Data Processing on HPC and Cloud Environments
- Yining Zhao, Xiaodong Wang, Haili Xiao, Xuebin Chi:
Improvement of the Log Pattern Extracting Algorithm Using Text Similarity. 507-514 - Xu Chang, Li Zha:
The Performance Analysis of Cache Architecture Based on Alluxio over Virtualized Infrastructure. 515-519
AsHES: Accelerators and Hybrid Exascale Systems
- Sunita Chandrasekaran, Antonio J. Peña, Min Si:
Introduction to AsHES 2018. 520 - Michael Wolfe:
AsHES 2018 Keynote. 521
Session 1: Runtime Scheduling and Performance Analytics
- Stefano Markidis, Steven Wei Der Chien, Erwin Laure, Ivy Bo Peng, Jeffrey S. Vetter:
NVIDIA Tensor Core Programmability, Performance & Precision. 522-531 - Zheming Jin, Hal Finkel:
Optimizing an Atomics-Based Reduction Kernel on OpenCL FPGA Platform. 532-539 - Osman Seckin Simsek, Andi Drebes, Antoniu Pop:
Leveraging Data-Flow Task Parallelism for Locality-Aware Dynamic Scheduling on Heterogeneous Platforms. 540-549
Session 2: Algorithms and Applications
- Kyungjoo Kim, H. Carter Edwards, Sivasankaran Rajamanickam:
Tacho: Memory-Scalable Task Parallel Sparse Cholesky Factorization. 550-559 - Michael Gowanlock, Ben Karsin:
Sorting Large Datasets with Heterogeneous CPU/GPU Architectures. 560-569 - Shaolong Chen, Miquel A. Senar:
Improving Performance of Genomic Aligners on Intel Xeon Phi-Based Architectures. 570-578
Session 3: Emerging Accelerator Architectures
- Eric R. Hein, Tom Conte, Jeffrey Young, Srinivas Eswar, Jiajia Li, Patrick Lavin, Richard W. Vuduc, E. Jason Riedy:
An Initial Characterization of the Emu Chick. 579-588 - Sergio Rivas-Gomez, Antonio J. Peña, David Moloney, Erwin Laure, Stefano Markidis:
Exploring the Vision Processing Unit as Co-Processor for Inference. 589-598
PDCO: Parallel / Distributed Computing and Optimization
- Grégoire Danoy, Didier El Baz, Vincent Boyer, Bernabé Dorronsoro:
Introduction to PDCO 2018. 599-600
Session 1: Scheduling, Parallel Genetic Algorithms, Genetic Programming
- Jheisson López, Danny Munera, Daniel Diaz, Salvador Abreu:
On Integrating Population-Based Metaheuristics with Cooperative Parallelism. 601-608 - Emmanuel Kieffer, Grégoire Danoy, Pascal Bouvry, Anass Nagih:
A Competitive Approach for Bi-Level Co-Evolution. 609-618 - Yuanzhe Li, Laleh Ghalami, Loren Schwiebert, Daniel Grosu:
A GPU Parallel Approximation Algorithm for Scheduling Parallel Identical Machines to Minimize Makespan. 619-628 - Jia Luo, Didier El Baz:
A Survey on Parallel Genetic Algorithms for Shop Scheduling Problems. 629-636
Session 2: Parallel Distributed Computing Systems and Optimization, Applications
- Md. Naim, Fredrik Manne:
Scalable b-Matching on GPUs. 637-646 - Richard Neill, Andi Drebes, Antoniu Pop:
Automated Analysis of Task-Parallel Execution Behavior Via Artificial Neural Networks. 647-656 - Thanasis Loukopoulos, Nikos Tziritas, Maria G. Koziri, George I. Stamoulis, Samee U. Khan, Cheng-Zhong Xu, Albert Y. Zomaya:
Data Stream Processing at Network Edges. 657-665 - Andrei Tchernykh, Mikhail G. Babenko, Vanessa Miranda-López, Alexander Yu. Drozdov, Arutyun Avetisyan:
WA-RRNS: Reliable Data Storage System Based on Multi-cloud. 666-673
HPPAC: High-Performance, Power-Aware Computing
- Shuaiwen Leon Song, Natalie J. Bates, Ang Li:
Introduction to HPPAC 2018. 674 - Gregory A. Koenig:
HPPAC 2018 Keynote. 675 - Rolando Brondolin, Tommaso Sardelli, Marco D. Santambrogio:
DEEP-Mon: Dynamic and Energy Efficient Power Monitoring for Container-Based Infrastructures. 676-684 - Matthias Maiterth, Gregory A. Koenig, Kevin T. Pedretti, Siddhartha Jana, Natalie J. Bates, Andrea Borghesi, Dave Montoya, Andrea Bartolini, Milos Puzovic:
Energy and Power Aware Job Scheduling and Resource Management: Global Survey - Initial Analysis. 685-693 - Sean Rea, Ehsan Atoofian:
Mitigating Critical Path Decompression Latency in Compressed L1 Data Caches Via Prefetching. 694-701 - Satyabrata Sen, Neena Imam, Chung-Hsing Hsu:
Quality Assessment of GPU Power Profiling Mechanisms. 702-711 - Thomas Ilsche, Robert Schöne, Philipp Joram, Mario Bielert, Andreas Gocht:
System Monitoring with lo2s: Power and Runtime Impact of C-State Transitions. 712-715 - Zheming Jin, Hal Finkel:
Power and Performance Tradeoff of a Floating-Point Intensive Kernel on OpenCL FPGA Platform. 716-720 - Vignesh Adhinarayanan, Bishwajit Dutta, Wu-chun Feng:
Making a Case for Green High-Performance Visualization Via Embedded Graphics Processors. 721-724 - Kevin T. Pedretti, Ryan E. Grant, James H. Laros III, Michael J. Levenhagen, Stephen L. Olivier, Lee Ward, Andrew J. Younge:
A Comparison of Power Management Mechanisms: P-States vs. Node-Level Power Cap Control. 725-729
APDCM: Advances in Parallel and Distributed Computational Models
- Oscar H. Ibarra, Koji Nakano, Akihiro Fujiwara, Susumu Matsumae:
Introduction to APDCM 2018. 730-731 - Yuji Shinano:
APDCM 2018 Keynote. 732
Session 1: Parallel Computing Models
- Yan Gu:
Survey: Computational Models for Asymmetric Read and Write Costs. 733-743 - Martti Forsell, Jussi Roivainen, Ville Leppänen, Jesper Larsson Träff:
Implementation of Multioperations in Thick Control Flow Processors. 744-752 - Anup Zope, Edward Luke:
A Block Streaming Model for Irregular Applications. 753-762