


IPDPS 2025: Milano, Italy - Workshops
- 2025 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2025 - Workshops, Milano, Italy, June 3-7, 2025. IEEE 2025, ISBN 979-8-3315-2643-6

- Brian Donnelly, Michael Gowanlock:

Performance Characterization of Parallel Combination Generators on CPU and GPU Systems. 4-14 - Matyás Brabec, Jirí Klepl, Martin Krulis:

Slaying a Life: Optimizing GPU-accelerated Game of Life Stencil. 15-24 - Jordan M. Abt, Ali Farazdaghi, Elizabeth Reid, Curtis Shorts, Tooraj Taraz, Zachary Silva, Ethan Shama, Scott Levy, Whit Schonbein, Matthew G. F. Dosanjh, Amirreza Barati Sedeh, Ryan E. Grant:

Science per Dollar: Modeling Emerging Node Architectures for Accelerator-centric Computing. 25-34 - Ronaldo Canizales, Jedidiah McClurg:

Heterogeneity-Aware Software Performance Characterization via Graph Machine Learning. 35-44 - Paul Hübner, Andong Hu, Ivy Peng, Stefano Markidis:

Apple vs. Oranges: Evaluating the Apple Silicon M-Series SoCs for HPC Performance and Efficiency. 45-54 - Tom Springer, Peiyi Zhao, Robert Alexander, Thomas Jordan:

ARTEMIS: Adaptive Real-Time Task Execution & Management in Heterogeneous Systems. 55-58 - Majid Salimi Beni, Ruben Laso, Biagio Cosenza, Siegfried Benkner, Sascha Hunold:

Exploring NCCL Tuning Strategies for Distributed Deep Learning. 59-62 - Amir Hossein Sojoodi, Ali Farazdaghi, Hamed Sharifian, Ryan E. Grant, Ahmad Afsahi:

Collaborative Bandwidth-Efficient Intra-Node Allreduce. 63-67 - Calvin Bombis, Lena Oden:

nbshmem: Enabling GPU-Initiated Multi-GPU Communication in Python. 68-77 - Adrien Gegout, Djob Mvondo, Davide Frey, Pascal Monchon:

Towards an Efficient Containerized Cloud Gaming Platform. 78-86 - Mohamed Bouaziz, Suhaib A. Fahmy:

PRNGine: Massively Parallel Pseudo-Random Number Generation and Probability Distribution Approximations on AMD AI Engines. 91-98 - Mohamed Bouaziz, Suhaib A. Fahmy:

Benchmarking Floating Point Performance of Massively Parallel Dataflow Overlays on AMD Versal Compute Primitives. 99-103 - Felix Böseler, Jörg Walter, Verena Klös:

Enabling Manual-Controllable Compilation for Dataflow CGRAs. 104-111 - Hisako Ito, Takuya Kojima, Hideki Takase, Hiroshi Nakamura:

A Decoupled Coarse-Grained Reconfigurable Architecture by Introducing Data Flow Management Unit. 112-119 - Anh Nguyen, Sebastian Czyrny, Takahide Yoshikawa, Jason Helge Anderson:

RAAP-CGRA: Placement for CGRAs with Restricted Routing Architectures. 120-126 - Isaac David Núñez Araya, Michael Gerndt, Shajulin Benedict:

Serverless IoT Framework. 130-139 - Michele Martone, Julia Lawall:

Advances in Semantic Patching for HPC-oriented Refactorings with Coccinelle. 140-148 - Patrick J. Flynn, Xinyao Yi, Erik Saule, Gokcen Kestor, Yonghong Yan:

SpMM-Bench: Performance Characterization of Sparse Formats for Sparse-Dense Matrix Multiplication. 149-158 - Yao Xu, Grace Nansamba, Anthony Skjellum, Gene Cooperman:

The Case for ABI Interoperability in a Fault Tolerant MPI. 159-167 - Raneem Abu Yosef, Bokyeong Yoon, Martin Kong:

Exploring Communication Anomalies in Chapel. 168-177 - Aaron Welch, Oscar R. Hernandez, Stephen W. Poole, Wendy Poole:

Implementing Directive-Based Deferred Execution for Effective Network Aggregation. 178-186 - Josef Weidendorfer, Lukas Neef, Robert Hubinger, Amir Raoofy:

Data Transfer Schemes in the High-Level Communication Library LAIK. 187-196 - Ashish Bisht, Aniket P. Garade, Deepika H. V, Haribabu P, S. A. Kumar, S. D. Sudarsan:

SYCL for HPC: Adapting to Diverse CPU Architecture. 197-204 - Sanil Rao, Larry Tang, Franz Franchetti:

LibraryX-ASIC: A First Look. 205-208 - Upasana Sridhar, Elliott Binder, Tze Meng Low:

Gen-AI in a Bottle: Experiments with LLMs to Generate HPC Kernels. 215-224 - Mohammed Baydoun, Mohammad Sonji, Pedro Bruel, Dejan S. Milojicic, Eitan Frachtenberg, Izzat El Hajj:

Predicting Performance Variability. 225-234 - Aishwarya Parab, Prakhar Pradhan, Yogesh Simmhan, Arnab K. Paul:

A Blockchain-Enabled Framework for Storage and Retrieval of Social Data. 237-240 - Rohullah Akbari, Daniel Thilo Schroeder, Petra Filkuková, Johannes Langguth:

Monitoring Digital Wildfires: a Large-Scale Dataset of COVID-19 Conspiracy Tweets Created via Fast NLP Inference using the Graphcore IPU. 241-250 - Suman Raj, Bhavani A Madhabhavi, Kautuk Astu, Arnav A Rajesh, Pratham M, Yogesh Simmhan:

Ocularone-Bench: Benchmarking DNN Models on GPUs to Assist the Visually Impaired. 251-254 - Roman Wiatr, Renata G. Slota:

Predicting the Predictor: Linear Metamodeling for Evolving User Response Prediction. 255-264 - Pranav Pamidighantam, Vairavan Murugappan, Suresh Subramanian, Eunice E. Santos:

Towards community-based influence spread prediction (CIP) for edge changes in large-scale dynamic social networks. 265-274 - Mykhailo Novikov, Xavier Besseron:

Rapid Random Packing of Poly-disperse Spheres using Adam Stochastic Optimization. 277-286 - Chu-Yuan Huang, Kazuhiko Komatsu, Makoto Onoda, Masahito Kumagai, Masayuki Sato, Hiroaki Kobayashi:

A Compressed QUBO Format for Traveling Salesperson Problems. 287-296 - Byron DeVries, Christian Trefftz:

Reusable Object-Oriented Parallelization of Branch-and-Bound Algorithms. 297-306 - Mahmoud El Mehdi El Khadiri, El-Ghazali Talbi:

Parallel Fractal Decomposition Optimization Algorithms on Heterogeneous Architectures. 307-315 - Leszek Sliwko, Jolanta Mizera-Pietraszko:

Enhancing Cluster Scheduling in HPC: A Continuous Transfer Learning for Real-Time Optimization. 316-325 - Tarek Menouer, Patrice Darmon, Christophe Cérin, Jonathan Rivalan:

Dynamic configuration of Kubernetes containers resources with SLA classes. 326-332 - Bohdan Ivaniuk-Skulskyi, Nadiya Shvai, Amir Nakib, El-Ghazali Talbi:

Enhancing Generalization in Video Anomaly Detection through Multimodal Data Mixing. 333-342 - Andrei Tchernykh, Marianne Salgado-Ramos, Bernardo Pulido-Gaytan, Horacio González-Vélez, Esteban Mosckos, Mikhail G. Babenko:

Efficient Privacy-Preserving Convolutional Neural Networks with CKKS-RNS for Encrypted Image Classification. 343-352 - Desh Ranjan, Mohammad Zubair:

A Header-Based C++ Library for Computing Hessian on GPU using Automatic Differentiation. 355-364 - Jakub Homola, Radim Vavrík, Ondrej Meca, Tomás Brzobohatý, Lubomír Ríha:

Assembly of FETI dual operator using CUDA. 365-374 - Ryo Yoda, Matthias Bolten:

Block Epsilon-Circulant Preconditioning with GPU-Accelerated Spatial Solvers for Linear Time-Dependent PDEs. 375-384 - Peter E. Strazdins:

A Simple Tiled Approach to Teaching Parallel Computing. 385-394 - Subhajit Sahu, Mahen N, Kishore Kothapalli:

ν-LPA: Fast GPU-based Label Propagation Algorithm (LPA) for Community Detection. 395-404 - Jelle van Dijk, Gábor Závodszky, Ana Lucia Varbanescu, Andy D. Pimentel:

Embracing Load Imbalance for Energy Optimizations: a Case-Study. 405-412 - Jurdana Masuma Iqrah, Young Hyun Koo, Wei Wang, Hongjie Xie, Sushil K. Prasad:

Scalable Higher Resolution Polar Sea Ice Classification and Freeboard Calculation from ICESat-2 ATL03 Data. 413-422 - Sehrish Qummar, August Ernstsson, Christoph W. Kessler, Oleg Sysoev:

SkePU-DNN: Algorithmic Skeleton Programming for Deep Learning on Heterogeneous Systems. 423-432 - Wajih Halim Boukaram, Yang Liu, Pieter Ghysels, Xiaoye Sherry Li:

Adaptive Sketching Based Construction of H2 Matrices on GPUs. 433-442 - Kewei Yan, Yonghong Yan:

In-Situ Auto-Regressive Surrogate Modeling for Feature Extraction Using Ascent. 443-452 - Elvis Rojas, Luis Carlos N. Todd, Esteban Meneses:

Using Checkpoint Alteration to Gauge Fault Sensitivity of HPC Scientific Applications. 453-462 - Michail Boulasikis, Flavius Gruian, Robert-Zoltán Szász:

Thalassa: Transforming Symbolic PDEs into Tensor-Based Solvers Running on ML Accelerators. 463-472 - Luca Pennati, Måns I. Andersson, Klaus Steiniger, René Widera, Tapish Narwal, Michael Bussmann, Stefano Markidis:

A Parallel and Highly-Portable HPC Poisson Solver: Preconditioned Bi-CGSTAB with alpaka. 473-483 - Joseph Touzet, Oguz Kaya, Pablo Arrighi, Amélia Durbec:

QuIDS: A Large-Scale Distributed Framework for Quantum Irregular Dynamics Simulations. 491-500 - Océane Koska, Marc Baboulin, Arnaud Gazda:

A mixed-precision quantum-classical algorithm for solving linear systems. 501-508 - Giacomo Antonioli, Alessandro Berti, Alessandro Poggiali, Anna Bernasconi, Gianna M. Del Corso:

Outlier Detection and other applications of Quantum Matrix Multiplication. 509-518 - Robin Ollive, Stéphane Louise:

Gate Efficient Composition of Hamiltonian Simulation and Block-Encoding with its Application on HUBO, Chemistry and Finite Difference Method. 519-528 - Ashfaq A. Khokhar:

Q-CASA: Concluding Remarks From Theory to Execution: System-Level Challenges and Innovations in Scalable Quantum Computing*. 529 - Abdulfatah Bahbouh, Ishfaq Ahmad, Hansheng Lei, Saif Ul Islam:

Parallel Processing for Distributed Machine Learning: A Taxonomy of Techniques and Associated Security Risks. 533-542 - Dikshant Pratap Singh, Mathialakan Thavappiragasam, Brice Videau:

Efficient Intra-node Hierarchical Parallelisms And Dynamic Load Balancing Strategies On Heterogeneous Systems. 543-552 - Fernando H. L. Buzato, Alfredo Goldman:

Extending Microservices Performance Optimization Through Horizontal Pod Autoscaling: A Comprehensive Study. 553-562 - Youssef Elmougy, Nirjhar Deb, Akihiro Hayashi, Vivek Sarkar:

Enhancing Productivity and Performance of HClib-Actor with Efficient Task Termination. 563-567 - Kohya Shiozaki, Junya Nakamura:

Dynatune: Dynamic Tuning of Raft Election Parameters Using Network Measurement. 568-577 - Yonghwan Kim, Yoshiaki Katayama, Koichi Wada:

Pairbot: Enhancing Computational Capabilities by Pairing of Autonomous Mobile Robots. 578-587 - Victor Parque:

Towards a Fast and Generalizable Neural Inference Scheme for Tabular Data. 588-596 - Steven D. Harris, Roger D. Chamberlain, Christopher D. Gill:

Performance Modeling of Non-Uniform Heterogeneous Platforms. 597-607 - Reo Gakumi, Ryota Yasudo:

RL-assisted Annealing for QUBO on a Multi-GPU System. 608-617 - Shunsuke Tsukiyama, Xiaotian Li, Koji Nakano, Victor Parque, Yasuaki Ito, Takumi Kato, Yuya Kawamata, Kaiki Ii:

CUBO-to-QUBO Conversion: Reducing Cubic Formulations to Quadratic Formulations. 618-625 - Koji Nakano, Shunsuke Tsukiyama, Xiaotian Li, Yasuaki Ito, Victor Parque, Takumi Kato, Yuya Kawamata, Kaiki Ii:

QUBO++: A C++ Library for Developing and Solving QUBO Problems. 626-637 - Isil Öz, Chelsea Cropper:

Teaching Accelerated Computing with Hands-on Experience. 642-649 - Leonel Sousa:

Crash Course on Quantum Computing for Engineering Students. 650-657 - Mary L. Smith, Srishti Srivastava, David P. Bunde, April Renee Crockett, Michael C. Gerten, Peter Maher, Jaime Spacco, Xiaoyuan Suo, Jiayin Wang, Michelle Zhu:

A Visual Unplugged Activity to Introduce PDC. 658-665 - Brian P. Railing, Lukas Kebuladze, Nathan Deyak, Zachary Weinberg:

SFS: A Simple File System for Teaching Parallelism in Computer Systems. 666-672 - Srishti Srivastava, Mary L. Smith:

Assessing Parallel and Distributed Computing Knowledge Through a Card Game. 673-679 - Yuede Ji:

High-Performance Computing for Graph AI: A Top-Down Perspective. 680-683 - Christopher Atala, Meredith Morrison, Grey Ballard:

Visualizing MPI Collective Communication. 684-687 - Callie Stewart, Gerald C. Gannod:

Experience using AI in MPI Test Suite Development: Implications for Educators. 688-691 - H. Martin Bücker, Johannes Schoder, Xiaoyuan Suo, David P. Bunde:

Peachy Parallel Assignments (EduPar 2025). 692-696 - Sandra Catalán, Rocío Carratalá-Sáez, Vicente Lopez-Oliva, Katerina Michalickova, Shubbhi Taneja:

EduPar 2025 Posters. 697-700 - Méline Trochon, Julien Bigot, Virginie Grandgirard, Dorian Midou:

Checkpointing Optimisation to Prepare Future Exascale Plasma Turbulence Simulations. 703-711 - Jayesh Krishna, Danqing Wu, Robert L. Jacob, Dmitry Ganyushin:

SCORPIO: A Parallel I/O library for Exascale Earth System Models. 712-721 - Dlyaver Djebarov, Radita Liem, Sarah Neuwirth, Jean Luca Bez, Suren Byna:

Streamlining HDF5's AI Workloads Benchmarking. 722-730 - Mahamat Abdraman, Francieli Boito, Luan Teylo:

IOPS: I/O Performance Evaluation Suite. 731-736 - Yili Ma, Shengquan Yin, Jing Xing, Haoquan Long, Zheng Wei, Guangming Tan, Dingwen Tao:

NAPEH: An Asynchronous and NUMA-Aware KV Store Based on Non-Volatile Memory Architectures. 737-743 - Binglin Ji, Chenfeng Zhao, Roger D. Chamberlain:

FGI: Fast GNN Inference on Multi-Core Systems. 748-757 - Youssef Elmougy, Akihiro Hayashi, Vivek Sarkar:

Divide, Conquer, and Match: A Distributed and Asynchronous Approach for Subgraph Isomorphism. 758-761 - Joseph Zuber, Aishwarya Sarkar, Joseph Jennings, Ali Jannesari:

Enhanced Soups for Graph Neural Networks. 762-771 - Alok Tripathy, Alina Lazar, Xiangyang Ju, Paolo Calafiura, Katherine A. Yelick, Aydin Buluç:

Scaling Graph Neural Networks for Particle Track Reconstruction. 772-776 - Lance G. Fletcher, Trevor Steil, Roger Pearce:

RaNT-Graph: A Scalable Approach to Sampling Billions of Walks or Paths from Weighted Graphs. 777-786 - Mohammad Sonji, Mohammed Baydoun, Aditya Dhakal, Gourav Rattihalli, Dejan S. Milojicic, Izzat El Hajj:

Serverless Graph Analytics on Multi-Instance GPU. 787-794 - Saikat Dey, Sonal Jha, Frank Wanye, Wu-Chun Feng:

On the Landscape of Graph Clustering at Scale. 795-804 - Lorenzo Asquini, Manos Frouzakis, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu, Francesco Silvestri:

Accelerating Triangle Counting with Real Processing-in-Memory Systems. 805-814 - Albert d'Aviau de Piolant, Hayfa Tayeb, Bérenger Bramas, Mathieu Faverge, Abdou Guermouche, Amina Guermouche:

Improving energy efficiency of HPC applications using unbalanced GPU power capping. 820-829 - Daniel Velicka, Ondrej Vysocky, Lubomir Riha:

Methodology for GPU Frequency Switching Latency Measurement. 830-839 - Jianbo Wu, Jie Ren, Shuangyan Yang, Konstantinos Parasyris, Giorgis Georgakoudis, Ignacio Laguna, Dong Li:

LM-Offload: Performance Model-Guided Generative Inference of Large Language Models with Parallelism Control. 840-849 - Colleen Bertoni, Thomas Applencourt, Longfei Gao, Ti Leggett:

Millions of Matrix-Multiplications: GEMM Variations on Aurora. 850-856 - Mickaël Boichot, Adrien Roussel, Elisabeth Brunet, Patrick Carribault:

Leveraging Interaction Between Memory Footprint and Parallelism Degree for efficient GPU Portings. 857-865 - Carlos Lima, Rui Alves, José Rufino:

HaaS - A Platform for Password Cracking in Distributed Heterogeneous Systems. 866-875 - Martin Wilhelm, Thilo Pionteck:

Static task mapping for heterogeneous systems based on series-parallel decompositions. 876-885 - Steffen Christgau, Dylan Everingham, Max Lübke, Marco De Lucia, Danny Puhan, Niklas Schelten, Bettina Schnor, Hannes Signer, Johannes Spazier, Benno Stabernack, Fritjof Steinert, Serhii Yahdzhyiev:

On the Usability and Energy Efficiency of High-Level Synthesis for FPGA-based Network-Attached Accelerators. 886-895 - Diane Orhan, Yacine Idouar, Laércio Lima Pilla, Adrien Cassagne, Denis Barthou, Christophe Jégo:

Scheduling Strategies for Partially-Replicable Task Chains on Two Types of Resources. 896-905 - Filip Vaverka, Ondrej Vysocky, Lubomir Riha:

Heterogeneous Memory Pool Tuning. 906-912 - Ami Marowka:

On the Singularity of SYCL. 913-922 - Ferrol Aderholdt, Aamir Shafi, Manjunath Gorentla Venkata:

Proactive Endpoint Congestion Avoidance in UCC. 923-930 - Eleni Adam, Terry Stilwell, Desh Ranjan, Harold Riethman:

Development and Deployment of a Genomic Cancer Data Extraction Pipeline on the Cloud. 935-938 - Charly Airault, Charles Deltel, Florestan De Moor, Erwan Drezen, Meven Mognol, Dominique Lavenier:

Protein database search using Processing-in-Memory architecture. 939-948 - Nikolaos Alachiotis, Matthijs Leon Souilljee:

Exploring the AMD® Deep Learning Processor Unit for Accelerating Selective Sweep Detection. 949-958 - André Merzky, Mikhail Titov, Matteo Turilli, Ozgur O. Kilic, Tianle Wang, Shantenu Jha:

Scalable Runtime Architecture for Data-driven, Hybrid HPC and ML Workflow Applications. 962-969 - Moiz Arif, Avinash Maurya, Sudharshan Vazhkudai, Bogdan Nicolae:

Evaluating Expansion Memory for Optimizer State Offloading for Large Transformer Models. 970-977 - Thomas Randall, Akhilesh Bondapalli, Rong Ge, Prasanna Balaprakash:

Is In-Context Learning Feasible for HPC Performance Autotuning? 978-985 - Max H. Faykus, Luanzheng Guo, Rizwan A. Ashraf, Jan Strube, Jon C. Calhoun, Nathan R. Tallent:

Exploration of LLM Lossless Compression on Scientific Data. 986-990 - Ioanna Tasou, Petros Anastasiadis, Panagiotis Mpakos, Dimitrios Galanopoulos, Nectarios Koziris, Georgios I. Goumas:

Breaking Down LLM Inference: A preliminary performance analysis of sparsified transformers. 991-995 - Chinmay Sahasrabudhe, Yang Ho, Nick Winovich, Sivasankaran Rajamanickam:

Imperfect Recognition: A Study of OCR Limitations in the Context of Scientific Documents. 996-1002 - Shiva Sai Krishna Anand Tokal, Vaibhav Jha, Anand Eswaran, Praveen Jayachandran, Yogesh Simmhan:

Towards Orchestrating Agentic Applications as FaaS Workflows. 1003-1010 - Aymen Alsaadi, Jonathan Ash, Mikhail Titov, Matteo Turilli, André Merzky, Shantenu Jha, Sagar Khare:

Adaptive Protein Design Protocols and Middleware. 1011-1015 - Flavio Renzi, Haoyu Ren, Alessio Bernardo, Giacomo Ziffer, Darko Anicic, Emanuele Della Valle:

Online Learning Techniques for Occupancy Detection on Resource Constrained Devices. 1019-1026 - Christophe Cérin, Melvyn Chemin:

Trade-Offs in Resource-Constrained Dimensionality Reduction Algorithms. 1027-1034 - Paul Daniëlse, Hsiang-Ling Tai, Shashikant Ilager, Zhiming Zhao:

Investigating Efficient Edge Offloading Architectures for Serverless Systems. 1035-1038 - Narges Mehran, Zahra Najafabadi Samani, Reza Farahani, Josef Hammer, Dragi Kimovski:

DEEP: Edge-Based Dataflow Processing with Hybrid Docker Hub and Regional Registries. 1039-1042 - Sashko Ristov, Anna Meshcheriakova, Philipp Gritsch, Philipp Zech, Ruth Breu:

Goal-Driven building automation using serverless computing. 1043-1049 - Jyotishman Sarkar, Urmi Jana, Barnali Basak, Himadri Sekhar Paul, Swagata Biswas:

Towards Predicting Inference Latency of TinyML Models. 1050-1053 - Riccardo Cantini, Alessio Orsino, Domenico Talia, Paolo Trunfio:

Towards Interpretable Energy Estimation for Edge AI Applications. 1054-1057 - Matthijs Jansen, Maciej Kozub, Alexandru Iosup, Daniele Bonetta:

Memory Efficient WebAssembly Containers. 1058-1065 - Kurt Horvath, Shpresa Tuda, Blerta Idrizi, Stojan Kitanov, Fisnik Doko, Dragi Kimovski:

6G Infrastructures for Edge AI: An Analytical Perspective. 1066-1072 - Gabriele Russo Russo, Pierpaolo Spaziani, Valeria Cardellini:

Towards QoS-Aware Serverless Function Offloading in the Edge-Cloud Continuum through Reinforcement Learning. 1073-1080 - Alfredo Lipari, Gabriele Proietti Mattia, Roberto Beraldi:

Dynamic and Forecast-Based Containers Autoscaling for Kubernetes with Reinforcement Learning. 1081-1088 - Thomas Auer, Kurt Horvath, Dragi Kimovski:

Blockchain consensus mechanisms for democratic voting environments. 1089-1096 - Amir Ali Pour, Julien Gascon-Samson:

SDFLMQ: A Semi-Decentralized Federated Learning Framework over MQTT. 1100-1107 - Mayank Arya, Yogesh Simmhan:

Understanding the Performance and Power of LLM Inferencing on Edge Accelerators. 1108-1111 - Yongho Kim, Seongha Park, Swann Perarnau, Akhilesh Raj:

Charon: An End-to-End Infrastructure for Connecting AI@Edge to HPC. 1112-1119 - Vincenzo Barbuto, Claudio Savaglio, Giancarlo Fortino, Edward A. Lee:

Edge AI in the computing continuum: Consistency and Availability at Early Design Stages. 1120-1127 - Emanuele Petriglia, Federica Filippini, Michele Ciavotta, Marco Savi:

Multi-Agent Reinforcement Learning for Workload Distribution in FaaS-Edge Computing Systems. 1128-1131 - Mohamed Anisse Belhadj, Kods Trabelsi, Loïc Cudennec, Henri-Pierre Charles:

Software Container-based Energy Estimation Models for ARM Architecture. 1132-1139 - Gonzalo Salinas, Guilherme Sequeira, Alfonso Rodríguez, João Bispo, Nuno Paulino:

SIMD Acceleration of Matrix-Vector Operations on RISC-V for Variable Precision Neural Networks. 1140-1147 - Jin-Shyan Lee, Pin-Hsuan Lee:

Optimizing Speech Emotion Recognition with Dynamic Dilation Rates for Efficient Edge Deployment. 1148-1155 - Nishant Saurabh, Pradeep Kumar Mantha, Shantenu Jha, André Luckow:

Compositional Execution Motifs for Quantum-HPC Systems. 1157-1162 - Meriam Gay Bautista-Jurney, Patricia Gonzalez-Guerrero, Anastasiia Butko:

SFQ-Driven Pulse-Phase Sequence Generator for Superconducting Qubit Control. 1163-1169 - Alejandro Becerra, Abani K. Patra:

A Recursive Approach to Representation in Hilbert Spaces of Increasing Dimension: Applications to Quantum-centric HPC tool development. 1170-1174 - Sophia Keip, Daan Camps, Roel Van Beeumen:

QCLAB: A Matlab Toolbox for Quantum Computing. 1175-1181 - Kiyotaka Murashima:

Computational Speedup of Simulated Annealing with Nested Monte Carlo Loop. 1182-1187 - Gianluca Scanu, Marco Venere, Donatella Sciuto, Marco D. Santambrogio:

QABE: a Framework for Quantum Annealer Programming and Benchmarking. 1188-1193 - Muhammad Ali Farooq, Abid Rafique, Suhaib A. Fahmy, Aman Arora:

High Throughput Low Latency Network Intrusion Detection on FPGAs: A Raw Packet Approach. 1201-1207 - Brindusa Mihaela Damian-Kosterhon, Andreas Koch, Felix Kosterhon, Lucian Petrica:

Improving mapping of convolutional neural networks on FPGAs through tailored macro sizes. 1208-1215 - Rodrigo Olmos, Andrés Otero:

An FPGA-Accelerated Framework for Optimizing Decision Tree Ensembles in Supervised Learning. 1216 - Tomoya Yokono, Yoshiki Yamaguchi:

Accelerating CRS Format Conversion for Sparse Matrix Computation with an FPGA. 1217 - Eleonora Cabai, Giuseppe Sorrentino, Marco Domenico Santambrogio, Davide Conficconi:

A Hardware/Software Co-Design Approach for Versal-Based K-means Acceleration. 1218 - Federico Mansutti, Davide Ettori, Giuseppe Sorrentino, Marco Domenico Santambrogio, Davide Conficconi:

Towards a Methodology to Leverage Alveo Versal System Usability And Parallelization. 1219 - Aya Jendoubi, Jean-Christophe Prévotet, Philippe Tanguy, Pascal Cotret:

Security of Dynamically Reconfigurable RISC-V Systems: I/O Attack Focus. 1220 - Rohan Krishna Vijayaraghavan, Ahmed Kamaleldin, Matthias Nickel, Diana Göhringer:

A RISC-V Coprocessor for Seamless Integration of Stream-Based Accelerators. 1221-1227 - Martin Langhammer, Gregg Baeckler, Kim Bozman:

A 950 MHz SIMT Soft Processor. 1228-1235 - Luis Waucquez, Alfonso Rodriguez:

Reconfigurable Processor-Centric Accelerators for Safety-Critical Applications. 1236-1242 - Noemi D'Abbondanza, Stylianos Tzelepis, Nicolò Ghielmetti, Ioannis Kakogeorgiou, Vanya Buchova, Konstantinos Karantzalos, Katerina Kikaki, Nicolas-Marcel Lemoine, Maurizio Pierini, Sioni Summers, Simon Vellas, François de Vieilleville, Boyan-Nikola Zafirov:

Edge SpAIce: Deep Learning Deployment Pipeline for Onboard Data Reduction on Satellite FPGAs. 1243-1249 - Yngve Hafting, Alexander Wold:

Testbench analysis using non-invasive fault injection. 1250-1256 - Simone Pernice, Ahmad Tarraf, Jean-Baptiste Besnard, Barbara Cantalupo, Alberto Cascajo, David E. Singh, Felix Wolf, Jesús Carretero, Sameer Shende, Marco Aldinucci:

A Simulation-Based Framework to Reduce I/O Contention in HPC. 1258-1260 - Md Hasanur Rahman, Sheng Di, Guanpeng Li, Franck Cappello:

Characterizing Spatial Data Traits for Modeling Generic Lossy Rate-Distortion Quality. 1261-1262 - Toni Böhnlein, Pál András Papp, Raphael S. Steiner, Albert-Jan Nicholas Yzelman:

Efficient Parallel Scheduling for Sparse Triangular Solvers. 1263-1265 - Ellie Lipe, Neel Karia, Clifford Stein, Connor Espenshade, Olivier Tardieu, Asser N. Tantawi:

Energy Efficient Scheduling of AI/ML Workloads on Multi-Instance GPUs with Dynamic Repartitioning. 1266-1268 - Jun-Liang Lin, Kamesh Madduri, Mahmut Taylan Kandemir:

Enhancing Graph Transformer Training through Adaptive Graph Parallelism. 1269-1270 - Minyu Cui, Miquel Pericàs:

Evaluation and Mitigation of Performance Variability of OpenMP Applications on Modern Multicore Systems. 1271-1273 - Przemyslaw Dominikowski, Atte Torri, Brice Pointal, Oguz Kaya, Laércio Lima Pilla, Olivier Coulaud:

Exploring Near-Optimal Contraction Strategies for the Scalar Product in the Tensor-Train Format. 1274-1276 - Yiqing Wang, Hailong Yang, Xiaoyan Liu, Xinyu Yang, Pengbo Wang, Xin You, Qingxiao Sun, Mingzhen Li, Yi Liu, Zhongzhi Luan, Depei Qian:

INSPIRIT: Adaptive Priority-based Task Scheduling for Heterogeneous Hardware. 1277-1279 - Sanil Rao, Mohammad Alaul Haque Monil, Het Mankad, Narasinga Rao Miniskar, Keita Teranishi, Jeffrey S. Vetter, Franz Franchetti:

IRISX: A Dynamic Trade-off System for Harnessing Heterogeneity for Performance Portability. 1280-1282 - Yongfeng Qiu, Yuxiao Li, Xin Liang, Yafan Huang, Guanpeng Li, Sheng Di, Franck Cappello, Hanqi Guo:

Lossy Parallel Visualization of Large-Scale Volume Data with Error-Bounded Image Compositing. 1283-1285 - Si Chen, Simon Garcia De Gonzalo, Avani Wildani:

MetaCast: Generalizing HPC Application Runtime Prediction. 1286-1287 - Bartlomiej Wróblewski, Gioele Gottardo, Anastasios Zouzias:

Parallel Scan on Ascend AI Accelerators. 1290-1292 - Ivan Tagliaferro, Guillaume Helbecque, Ezhilmathi Krishnasamy, Nouredine Melab, Grégoire Danoy:

Performance and Portability in Multi-GPU Branch-and-Bound: Chapel Versus CUDA and HIP for Tree-Based Optimization. 1293-1295 - Amena Begum Farha, Abdullah Al-Mamun, Gagan Agrawal:

Poster: A Scalable and Fault-Tolerant Decentralized Middleware for CI/CD Workflow. 1296-1297 - Arivarasan Karmegam, Gabina Luz Bianchi, Margarita Capretto, Martín Ceresa, Antonio Fernández Anta, César Sánchez:

Setchain Algorithms for Blockchain Scalability (Extended Abstract). 1298-1300 - Marco D'Antonio, Son Thai Mai, Hans Vandierendonck:

Toward Efficient Asynchronous Single-Source Shortest Path. 1301-1303 - Ehan Sohn, Changjong Kim, Alex Sim, Dong Kyu Sung, Yongseok Son, Jisung Park, Sunggon Kim:

Toward Performance Prediction in Large-Scale Systems through Temporal System and Application Log Analysis. 1304-1306 - Shanghao Liu, Hailong Yang, Xin You, Zhongzhi Luan, Yi Liu, Depei Qian:

Towards Efficient Instruction Stream Scheduling for Stencil Computation on ARM Processors. 1307-1310 - Zheng Wei, Jing Xing, Yida Gu, Guangming Tan, Dingwen Tao:

TSUE: A Two-Stage Data Update Method for an Erasure Coded Cluster File System. 1311-1313 - Jingyao Zhang, Elaheh Sadredini:

Unlocking Energy-Efficient and High-Throughput Secure Data Communication in IoT with Memory-Centric Computing. 1314-1315















