


default search action
30th SBAC-PAD 2018: Lyon, France
- 30th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2018, Lyon, France, September 24-27, 2018. IEEE 2018, ISBN 978-1-5386-7769-8

Computer Architecture and Compilers
- Nishant Rao, Akshay Ramachandran, Amish Shah:

MLNoC: A Machine Learning Based Approach to NoC Design. 1-8 - Isaías B. Felzmann

, Matheus Martins Susin, Liana Dessandre Duenha, Rodolfo Azevedo, Lucas Francisco Wanner
:
ADeLe: Rapid Architectural Simulation for Approximate Hardware. 9-16 - Pedro Caldeira, Jeronimo Costa Penha

, Lucas Bragança
, Ricardo Ferreira, José Augusto Miranda Nacif, Renato Ferreira, Fernando Magno Quintão Pereira:
From Java to FPGA: An Experience with the Intel HARP System. 17-24 - Congmiao Li, Jean-Luc Gaudiot:

Online Detection of Spectre Attacks Using Microarchitectural Traces from Performance Counters. 25-28 - Luis Mattos, Divino Cesar S. Lucas, Juan Salamanca

, Joao P. L. de Carvalho
, Márcio Machado Pereira, Guido Araujo:
DOACROSS Parallelization Based on Component Annotation and Loop-Carried Probability. 29-32
Scheduling
- Louis-Claude Canon, Aurélie Kong Win Chang, Yves Robert

, Frédéric Vivien:
Scheduling Independent Stochastic Tasks Under Deadline and Budget Constraints. 33-40 - Jirí Dokulil, Siegfried Benkner:

Adaptive Scheduling of Collocated Applications Using a Task-Based Runtime System. 41-48 - Vinicius Freitas

, Alexandre de Limas Santana, Márcio Castro
, Laércio Lima Pilla
:
A Batch Task Migration Approach for Decentralized Global Rescheduling. 49-56 - Yubo Qin, Ivan Rodero

, Pradeep Subedi, Manish Parashar, Sandro Rigo:
Exploring Power Budget Scheduling Opportunities and Tradeoffs for AMR-Based Applications. 57-64 - Congfeng Jiang, Yumei Wang, Dongyang Ou, Yeliang Qiu, Youhuizi Li, Jian Wan, Bing Luo, Weisong Shi

, Christophe Cérin:
EASE: Energy Efficiency and Proportionality Aware Virtual Machine Scheduling. 65-68
Energy in the Cloud, Network
- David Guyon, Anne-Cécile Orgerie, Christine Morin:

Energy - Efficient IaaS-PaaS Co-Design for Flexible Cloud Deployment of Scientific Applications. 69-76 - Chaopeng Guo

, Jean-Marc Pierson:
Frequency Selection Approach for Energy Aware Cloud Database. 77-84 - Benjamin Camus, Fanny Dufossé, Anne Blavette, Martin Quinson

, Anne-Cécile Orgerie:
Network-Aware Energy-Efficient Virtual Machine Management in Distributed Cloud Infrastructures with On-Site Photovoltaic Production. 86-92 - Su-Hwan Jang, Jongpil Jeong

, Byungjun Park:
A Novel Broker-Based Hierarchical Authentication Scheme in Proxy Mobile IPv6 Networks. 93-96
Applications
- Yuankun Fu, Feng Li, Fengguang Song, Luoding Zhu:

Designing a Parallel Memory-Aware Lattice Boltzmann Algorithm on Manycore Systems. 97-106 - Jucele Franca de Alencar Vasconcellos

, Edson Norberto Cáceres, Henrique Mongelli, Siang Wun Song:
A New Efficient Parallel Algorithm for Minimum Spanning Tree. 107-114 - Natalia Kalinnik, Robert Kiesel

, Thomas Rauber, Marcel Richter, Gudula Rünger:
Exploring Self-Adaptivity Towards Performance and Energy for Time-Stepping Methods. 115-123 - Daniel Oliveira

, Francis Birck Moreira, Paolo Rech, Philippe Olivier Alexandre Navaux:
Predicting the Reliability Behavior of HPC Applications. 124-131
GPU Based Computing
- Hartwig Anzt

, Jack J. Dongarra, Goran Flegar
, Thomas Grützmacher
:
Variable-Size Batched Condition Number Calculation on GPUs. 132-139 - Ming-Hung Chen, I-Hsin Chung, Bülent Abali, Paul Crumley:

Towards a Single-Host Many-GPU System. 140-147 - Matthias Korch, Tim Werner:

Exploiting Limited Access Distance for Kernel Fusion Across the Stages of Explicit One-Step Methods on GPUs. 148-157 - Suren Chilingaryan

, Evelina Ametova
, Andreas Kopmann
, Alessandro Mirone:
Balancing Load of GPU Subsystems to Accelerate Image Reconstruction in Parallel Beam Tomography. 158-166 - Eugenio Gianniti, Li Zhang, Danilo Ardagna

:
Performance Prediction of GPU-Based Deep Learning Applications. 167-170
Programming Paradigms and Memory
- Romain Fontaine, Laure Gonnord, Lionel Morel:

Polyhedral Dataflow Programming: A Case Study. 171-179 - George Kornaros

, Marcello Coppola
:
Enabling Efficient Job Dispatching in Accelerator-Extended Heterogeneous Systems with Unified Address Space. 180-188 - Mohammad Shakeel Laghari

, Najeeb Ahmad
, Didem Unat
:
Phase-Based Data Placement Scheme for Heterogeneous Memory Systems. 189-196 - João Vieira

, Nuno Roma
, Pedro Tomás
, Paolo Ienne, Gabriel Falcão Paiva Fernandes:
Exploiting Compute Caches for Memory Bound Vector Operations. 197-200
Data Analytics, Locality and I/O
- Shouwei Chen, Ivan Rodero

:
Exploring the Potential of Next Generation Software-Defined in Memory Frameworks. 201-208 - Yevhen Alforov, Thomas Ludwig, Anastasiia Novikova

, Michael Kuhn, Julian M. Kunkel:
Towards Green Scientific Data Compression Through High-Level I/O Interfaces. 209-216 - Luiz Angelo Steffenel:

Improving the Performance of Fog Computing Through the Use of Data Locality. 217-224 - Alberto Miranda

, Ramon Nou
, Toni Cortes
:
ECHOFS: A Scheduler-Guided Temporary Filesystem to Leverage Node-Local NVMS. 225-228 - Hartwig Anzt

, Jack J. Dongarra:
A Jaccard Weights Kernel Leveraging Independent Thread Scheduling on GPUs. 229-232
Performance Prediction and Evaluation
- Markus Wittmann, Georg Hager

, Radim Janalík, Martin Lanser
, Axel Klawonn
, Oliver Rheinbach
, Olaf Schenk
, Gerhard Wellein:
Multicore Performance Engineering of Sparse Triangular Solves Using a Modified Roofline Model. 233-241 - Nelson Mimura Gonzalez, José R. Brunheroto, Fausto Artico, Yoonho Park, Tereza Cristina M. B. Carvalho

, Charles Christian Miers
, Maurício Aronne Pillon, Guilherme Piêgas Koslovski
:
Predicting the Performance Impact of Increasing Memory Bandwidth for Scientific Workflows. 242-249 - Milan Radulovic

, Kazi Asifuzzaman
, Darko Zivanovic, Nikola Rajovic, Guillaume Colin de Verdière, Dirk Pleiter, Manolis Marazakis, Nikolaos D. Kallimanis, Paul M. Carpenter, Petar Radojkovic, Eduard Ayguadé:
Mainstream vs. Emerging HPC: Metrics, Trade-Offs and Lessons Learned. 250-257 - Gabriel Fernandez, Francisco J. Cazorla, Jaume Abella

, Sylvain Girbal
:
Assessing Time Predictability Features of ARM Big. LITTLE Multicores. 258-261 - Pierre Huchant, Denis Barthou

, Marie Christine Counilh:
Adaptive Partitioning for Iterated Sequences of Irregular OpenCL Kernels. 262-265
IoT, Fog, Edge, and Cloud Computing
- Fabíola Martins Campos de Oliveira, Edson Borin:

Partitioning Convolutional Neural Networks for Inference on Constrained Internet-of-Things Devices. 266-273 - Ali Reza Zamani, Daniel Balouek-Thomert, Juan J. Villalobos, Ivan Rodero

, Manish Parashar:
Runtime Management of Data Quality for Scientific Observatories Using Edge and In-Transit Resources. 274-281 - Jose Pergentino Araujo Neto, Donald M. Pianto, Célia Ghedini Ralha

:
A Fault-Tolerant Agent-Based Architecture for Transient Servers in Fog Computing. 282-289
HPML 2018 Workshop: Section I
- Raul Puri, Robert Kirby, Nikolai Yakovenko, Bryan Catanzaro:

Large Scale Language Modeling: Converging on 40GB of Text in Four Hours. 290-297 - Guojing Cong, Giacomo Domeniconi, Joshua Shapiro, Fan Zhou, Barry Chen:

Accelerating Deep Neural Network Training for Action Recognition on a Cluster of GPUs. 298-305 - Renato Luiz de Freitas Cunha

, Eduardo R. Rodrigues, Matheus Palhares Viana, Dário Augusto Borges Oliveira
:
An Argument in Favor of Strong Scaling for Deep Neural Networks with Small Datasets. 306-313 - Kazumasa Sakivama, Shinpei Kato, Yutaka Ishikawa, Atsushi Hori, Abraham Monrroy:

Deep Learning on Large-Scale Muticore Clusters. 314-321 - Behzad Salami

, Osman S. Unsal, Adrián Cristal Kestelman:
On the Resilience of RTL NN Accelerators: Fault Characterization and Mitigation. 322-329 - David M. Chan

, Roshan Rao, Forrest Huang, John F. Canny:
T-SNE-CUDA: GPU-Accelerated T-SNE and its Applications to Modern Data. 330-338
HPML 2018 Workshop: Section II
- M. Todd Young, Jacob D. Hinkle

, Arvind Ramanathan, Ramakrishnan Kannan:
HyperSpace: Distributed Bayesian Hyperparameter Optimization. 339-347 - Marisol Monterrubio Velasco, José Carlos Carrasco-Jiménez

, Octavio Castillo Reyes, Fernando M. Cucchietti
, Josep de la Puente:
A Machine Learning Approach for Parameter Screening in Earthquake Simulation. 348-355 - Kenny Peou, Alan Kelly, Joel Falcou, Cécile Germain:

A Case Study on Optimizing Accurate Half Precision Average. 356-363 - Paul-Cristian Sarbu, Hans-Joachim Bungartz:

Optimization of a Sparse Grid-Based Data Mining Kernel for Architectures Using AVX-512. 364-371 - Matheus Alcântara Souza, Lucas Andrade Maciel

, Pedro Henrique Penna
, Henrique C. Freitas
:
Energy Efficient Parallel K-Means Clustering for an Intel® Hybrid Multi-Chip Package. 372-379
HPML 2018 Workshop: Section III
- Christina Diedhiou, Bryan Carpenter, Aamir Shafi

, Soumabha Sarkar, Ramazan Esmeli
, Ryan Gadsdon:
Performance Comparison of a Parallel Recommender Algorithm Across Three Hadoop-Based Frameworks. 380-387 - Shirin Tavara

, Alexander Schliep
:
Effect of Network Topology on the Performance of ADMM-Based SVMs. 388-393 - Luis Fernando L. Grim

, André Leon S. Gradvohl
:
High-Performance Ensembles of Online Sequential Extreme Learning Machine for Regression and Time Series Forecasting. 394-401
WAMCA 2018 Workshop: Architecture and Performance Analysis
- Matheus Alcântara Souza, Henrique C. Freitas

, Jean-François Méhaut:
Design Space Exploration of Energy Efficient NoC-and Cache-Based Many-Core Architecture. 402-409 - Fabio Verbosio

, Jurai Kardos, Mauro Bianco, Olaf Schenk
:
Highly Scalable Stencil-Based Matrix-Free Stochastic Estimator for the Diagonal of the Inverse. 410-419 - Shad Kirmani, Hongyang Sun, Padma Raghavan:

A Scalability and Sensitivity Study of Parallel Geometric Algorithms for Graph Partitioning. 420-427
WAMCA 2018 Workshop: OpenMP Parallelization
- Matheus Mortatti, Hervé Yviquel, Guido Araujo:

Automatic Ray-Tracer Cloud Offloading in OPENMP. 428-435 - Olfa Haggui, Claude Tadonki, Fatma Sayadi

, Bouraoui Ouni:
Evaluation of an OPENMP Parallelization of Lucas-Kanade on a NUMA-Manycore. 436-441 - Taylor Lloyd, Artem Chikin, Sanket Kedia, Dhruv Jain, José Nelson Amaral:

Automated GPU Grid Geometry Selection for OPENMP Kernels. 442-449 - Abdoul Wahid Mainassara Checkaraou, Alban Rousset

, Xavier Besseron
, Sébastien Varrette, Bernhard Peters
:
Hybrid MPI+openMP Implementation of eXtended Discrete Element Method. 450-457
WAMCA 2018 Workshop: Hybrid Parallelization
- Evan Coleman

, Erik J. Jensen, Masha Sosonkina:
Impacts of Three Soft-Fault Models on Hybrid Parallel Asynchronous Iterative Methods. 458-465 - Guillaume Latu, Yuuichi Asahi, Julien Bigot, Tamas B. Fehér, Virginie Grandgirard:

Scaling and Optimizing the Gysela Code on a Cluster of Many-Core Processors. 466-473

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














