


default search action
31st HiPC 2024: Bangalore, India
- 31st IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2024, Bangalore, India, December 18-21, 2024. IEEE 2024, ISBN 979-8-3315-0909-5

- Robin Boëzennec, Danilo Carastan-Santos, Fanny Dufossé, Guillaume Pallez:

Allocation Strategies for Disaggregated Memory in HPC Systems. 1-11 - Abrar Hossain, Abdel-Hameed A. Badawy, Mohammad A. Islam

, Tapasya Patki, Kishwar Ahmed:
HPC Application Parameter Autotuning on Edge Devices: A Bandit Learning Approach. 12-22 - Benjamin Michalowicz

, Kaushik Kandadi Suresh, Hari Subramoni, Mustafa Abduljabbar, Dhabaleswar K. Panda, Steve Poole:
Effective and Efficient Offloading Designs for One-Sided Communication to SmartNICs. 23-33 - Zhibo Xuan

, Xin You, Hailong Yang, Mingzhen Li, Zhongzhi Luan, Yi Liu, Depei Qian:
Retrospection on the Performance Analysis Tools for Large-Scale HPC Programs. 34-44 - Anastasia Khartikova, Denis Shaikhislamov, Ilya Timokhin, Roman Kostromin, Vladislav Muratov, Aleksey Demakov, Maxim Belov, Aleksey Teplov:

BigThrill: MPI-based Data Processing Engine. 45-56 - Lang Xu, Quentin Anthony, Jacob Hatef, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:

Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning. 57-67 - Aryan Kumar Singh, Arpit Saikia, Pranita Baro, Malaya Dutta Borah:

Transformer-based Self-Supervised Imputation and Attention GANs Oversampling for Medical Data Processing. 68-77 - Changxin Li, Sanmukh Kuppannagari:

Exploring Algorithmic Design Choices for Low Latency CNN Deployment. 78-88 - Ashwin Krishnan, Venkatesh Pasumarti, Samarth Inamdar, Arghyajoy Mondal, Manoj Nambiar, Rekha Singhal:

CAR-LLM: Cloud Accelerator Recommender for Large Language Models. 89-99 - Nawras Alnaasan, Bharath Ramesh, Jinghan Yao, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:

HyperSack: Distributed Hyperparameter Optimization for Deep Learning using Resource-Aware Scheduling on Heterogeneous GPU Systems. 100-110 - Revanth Reddy Munugala, Michael Gowanlock:

GDBOD: Density-Based Outlier Detection Exploiting Efficient Tree Traversals on the GPU. 111-121 - Chen-Chun Chen, Goutham Kalikrishna Reddy Kuncham, Hari Subramoni, Dhabaleswar K. Panda:

Design and Implementation of Kernel-based MPI Reduction Operations for Intel GPU s. 122-131 - Brian Donnelly, Michael Gowanlock:

Multi-Space Tree with Incremental Construction for GPU-Accelerated Range Queries. 132-142 - Andrew Geyko, Gerald Collom, Derek Schafer, Patrick G. Bridges, Amanda Bienz:

A More Scalable Sparse Dynamic Data Exchange. 143-154 - Kaushik Kandadi Suresh, Benjamin Michalowicz

, Nick Contini, Bharath Ramesh, Mustafa Abduljabbar, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
Using BlueField-3 SmartNICs to Offload Vector Operations in Krylov Subspace Methods. 155-165 - Sudhanshu Pravin Kulkarni, E. Wes Bethel:

From Bits to Qubits: Challenges in Classical-Quantum Integration. 166-176 - Kartikey Sarode:

Circuit Partitioning and Full Circuit Execution: A Comparative Study of GPU - Based Quantum Circuit Simulation. 177-187 - Bo Zhang, Philip E. Davis, Zhao Zhang, Keita Teranishi, Manish Parashar:

Dual Channel Dual Staging: Hierarchical and Portable Staging for GPU-Based In-Situ Workflow. 188-198 - Samuel Curtis, Harry Waugh, Tom Deakin, Gihan R. Mudalige

:
Mini-Combust - An Open-Source Unstructured FGM Combustion Mini-App for Co-Designing Aero-Engines at Extreme Scale. 199-209 - Andy Wolff, Avinash Karanth:

Training Photonic Mach Zehnder Meshes for Neural Network Acceleration. 210-220 - Yiheng Xu, Pranav Sivaraman, Hariharan Devarajan, Kathryn M. Mohror, Abhinav Bhatele:

ML-based Modeling to Predict I/O Performance on Different Storage Sub-systems. 221-231 - Julien Monniot, François Tessier, Henri Casanova, Gabriel Antoniu:

Simulation of Large-Scale HPC Storage Systems: Challenges and Methodologies. 232-242 - S. Haleh S. Dizaji, Reza Farahani, Joze M. Rozanec, Dragi Kimovski, Ahmet Soylu, Radu Prodan:

Graph Sampling Quality Prediction for Algorithm Recommendation. 243-254 - Cèdric Prigent, Melvin Chelli, Alexandru Costan, Loïc Cudennec, René Schubotz, Gabriel Antoniu:

Efficient Resource-Constrained Federated Learning Clustering with Local Data Compression on the Edge-to-Cloud Continuum. 255-265 - Advik Raj Basani, Siddharth Chaitra Vivek, Advaith Krishna, Arnab K. Paul:

When Less is More: Achieving Faster Convergence in Distributed Edge Machine Learning. 266-276 - Rahulkumar Gayatri, Shilei Tian, Stephen L. Olivier

, Eric Wright, Johannes Doerfert:
Leveraging LLVM OpenMP GPU Offload Optimizations for Kokkos Applications. 277-287

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














