


39th IPDPS 2025: Milano, Italy
- IEEE International Parallel and Distributed Processing Symposium, IPDPS 2025, Milano, Italy, June 3-7, 2025. IEEE 2025, ISBN 979-8-3315-3237-6

- Marco D. Santambrogio, Ananth Kalyanaraman:

Message from the 2025 General Co-chairs. xxii-xxiv - Saisha Kamat, Mai Zheng, Bo Fang, Dong Dai:

Be Aware of Metadata Corruption in Parallel File System: It can be Silent and Catastrophic. 1-13 - Qianpiao Ma, Junlong Zhou, Xiangpeng Hou, Jianchun Liu, Hongli Xu, Jianeng Miao, Qingmin Jia:

Air-FedGA: A Grouping Asynchronous Federated Learning Mechanism Exploiting Over-The-Air Computation. 1-12 - Nathaniel Tomczak, Sanmukh Kuppannagari:

Longer Attention Span: Increasing Transformer Context Length With Sparse Graph Processing Techniques. 1-12 - Chen Wang, Zhaobin Zhu, Kathryn M. Mohror, Sarah Neuwirth, Marc Snir:

VerifyIO: Verifying Adherence to Parallel I/O Consistency Semantics. 1-12 - David E. Keyes:

For What the Bell Tolls. 1 - Shihui Song, Robert Underwood, Sheng Di, Yafan Huang, Peng Jiang, Franck Cappello:

A Memory-Efficient and Computation-Balanced Lossy Compressor on Wafer-Scale Engine. 1-13 - Adrian Munera, Eduardo Quiñones, Sara Royuela:

GuardianOMP: A Framework for Highly Productive Fault Tolerance Via OpenMP Task-Level Replication. 1-12 - Yu Kuang, Li Yan, Zhuozhao Li:

P3 Forecast: Personalized Privacy-Preserving Cloud Workload Prediction Based on Federated Generative Adversarial Networks. 1-11 - Theodore Michailidis, Juno Kim, Linsong Guo, Steven Swanson, Jishen Zhao:

TOSS: Tiering of Serverless Snapshots for Memory-Efficient Serverless Computing. 2-14 - Xin Chen, Manoj Prabhakar Paidiparthy, Dilma Da Silva, Liting Hu:

Ekko: Fully Decentralized Scheduling for Serverless Edge Computing. 15-27 - Jing Wu, Lin Wang, Quanfeng Deng, Chen Yu, Dong Zhang, Bingheng Yan, Fangming Liu:

It Takes Two to Tango: Serverless Workflow Serving via Bilaterally Engaged Resource Adaptation. 28-41 - Xiaohui Peng, Wenkai Yan, Yifan Wang, Shoujian Zheng, Zhiwei Xu:

Tide: A Distributed Runtime Management Framework for Things-Edge-Cloud Computing Continuum. 42-53 - Jared Coleman, Bhaskar Krishnamachari:

PISA: An Adversarial Approach to Comparing Task Graph Scheduling Algorithms. 54-66 - Arnau Cinca, Aleix Roca, Kevin Sala, Raúl Peñacoba Veigas, David Álvarez, Vicenç Beltran:

Enhancing OmpSs-2 Suspendable Tasks by Combining Operating System and User-Level Threads with C++ Coroutines. 67-80 - Wenyi Wang, Maxime Gonthier, Poornima Nookala, Haochen Pan, Ian T. Foster, Ioan Raicu, Kyle Chard:

Optimizing Fine-Grained Parallelism Through Dynamic Load Balancing on Multi-Socket Many-Core Systems. 81-93 - Ayush Pandey, Julien Sopena, Marc Shapiro, Swan Dubois:

CALock: Multi-Granularity Locking in Dynamic Hierarchies. 94-105 - Roberto L. Castro, Diego Andrade, Basilio B. Fraguela:

Adapt-S: Effective DNN Pruning via Unified Accuracy and Performance Tuning. 106-117 - Xing Peng, Qinglin Wang, Chuhe Hong, Gencheng Liu, Rui Xia, Xinhai Chen, Zhigang Sun, Jie Liu:

An Efficient Adaptive Dual-Threshold SVM Based on Heterogeneous Collaboration. 118-129 - Shenghao Qiu, Chunwei Xia, Zheng Wang:

Accelerating Tensor-Train Decomposition on Graph Neural Networks. 130-141 - Lukas Gianinazzi, Tal Ben-Nun, Maciej Besta, Saleh Ashkboos, Yves Baumann, Piotr Luczynski, Torsten Hoefler:

Energy-Optimal and Low-Depth Algorithmic Primitives for Spatial Dataflow Architectures. 142-153 - XinYu Piao, JooYong Shim, Joongheon Kim, Jong-Kook Kim:

AQUA: Hardware-Agnostic Qubit Allocation for Quantum Multi-Programming. 154-161 - Aleksander Figiel, Darya Melnyk, Tijana Milentijevic, Stefan Schmid:

Distributed Construction of Demand-Aware Datacenter Networks. 162-172 - Amogh Lonkar, Scott Beamer:

PivotScale: A Holistic Approach for Scalable Clique Counting. 173-186 - Hans Vandierendonck:

Less is More: Faster Maximum Clique Search by Work-Avoidance. 187-198 - Yuanhui Chen, Lixiao Cui, Zebin Yao, Hao Zhou, Gang Wang, Xiaoguang Liu:

ALGAS: A Low-Latency GPU-Based Approximate Nearest Neighbor Search System. 199-209 - JaeHyuk Kwack, Colleen Bertoni, Umesh Unnikrishnan, Riccardo Balin, Khalid Hossain, Yasaman Ghadar, Timothy J. Williams, Abhishek Bagusetty, Mathialakan Thavappiragasam, Väinö Hatanpää, Archit Vasan, John R. Tramm, Scott Parker:

AI and HPC Applications on Leadership Computing Platforms: Performance and Scalability Studies. 210-222 - Zelin Xu, Jie Ren, Yupu Zhang, Jose Maria Gonzalez Ondina, Maitane Olabarrieta, Tingsong Xiao, Wenchong He, Zibo Liu, Shigang Chen, Kaleb E. Smith, Zhe Jiang:

Accelerate Coastal Ocean Circulation Model with AI Surrogate. 223-235 - Yuanchang Zhou, Siyu Hu, Chen Wang, Lin-Wang Wang, Guangming Tan, Weile Jia:

FastCHGNet: Training One Universal Interatomic Potential to 1.5 Hours with 32 GPUs. 236-246 - Lana Scravaglieri, Ani Anciaux-Sedrakian, Olivier Aumage, Thomas Guignon, Mihail Popov:

Compiler, Runtime, and Hardware Parameters Design Space Exploration. 247-260 - Clément Gavoille, Hugo Taboada, Jens Domke, Brice Goglin, Emmanuel Jeannot:

Performance Projection for Design-Space Exploration on future HPC Architectures. 261-272 - Catherine Guelque, Valentin Honoré, Philippe Swartvagher, Gaël Thomas, François Trahay:

PALLAS: A Generic Trace Format for Large HPC Trace Analysis. 273-284 - Daniel Salwasser, Daniel Seemaier, Lars Gottesbüren, Peter Sanders:

Tera-Scale Multilevel Graph Partitioning. 285-296 - Anju Mongandampulath Akathoott, Martin Burtscher:

A Bidirectional GPU Algorithm for Computing Maximum Matchings in Bipartite Graphs. 297-308 - Kelly Isham, Laura Monroe, Kartik Lakhotia, Aleyah Dawkins, Daniel Hwang, Ales Kubicek:

Edge-Disjoint Spanning Trees on Star Products. 309-321 - Chris Egersdoerfer, Arnav Sareen, Jean Luca Bez, Suren Byna, Dongkuan Xu, Dong Dai:

IOAgent: Democratizing Trustworthy HPC I/O Performance Diagnosis Capability via LLMs. 322-334 - Bowen Sun, Riccardo Pinciroli, Giuliano Casale, Evgenia Smirni:

DeepBAT: Performance and Cost Optimization of Serverless Inference Using Transformers. 335-346 - Youshao Xiao, Zhenglei Zhou, Fagui Mao, Weichang Wu, Shangchun Zhao, Lin Ju, Lei Liang, Xiaolu Zhang, Jun Zhou:

FlexRLHF: A Flexible Placement and Parallelism Framework for Efficient RLHF Training. 358-369 - Xuan Wu, Sheng Di, Congrong Ren, Pu Jiao, Mingze Xia, Cheng Wang, Hanqi Guo, Xin Liang, Franck Cappello:

Enabling Efficient Error-Controlled Lossy Compression for Unstructured Scientific Data. 370-382 - Jinman Zhao, Seyed Aryan Vahabpour, Xingyu Yue, Kai-Ting Amy Wang, Tarek S. Abdelrahman:

PolyMorphous: An MLIR-Based Polyhedral Compiler with Loop Transformation Primitives. 383-394 - Pascal Fradet, Alain Girault, Alexandre Honorat:

Parallel Scheduling of Task Graphs with Minimal Memory Requirements. 395-406 - Jeffrey Kelling, Vicente Bolea, Michael Bussmann, Ankush Checkervarty, Alexander Debus, Jan Ebert, Greg Eisenhauer, Vineeth Gutta, Stefan Kesselheim, Scott Klasky, Vedhas Pandit, Richard Pausch, Norbert Podhorszki, Franz Pöschel, David Rogers, Jeyhun Rustamov, Steve Schmerler, Ulrich Schramm, Klaus Steiniger, René Widera, Anna Willmann, Sunita Chandrasekaran:

The Artificial Scientist: in-Transit Machine Learning of Plasma Simulations. 407-418 - Srinivas Aluru:

The Power of Parallelism: Accelerating Discovery in the Biosciences. 419 - Hyungro Lee, Jesun Firoz, Nathan R. Tallent, Luanzheng Guo, Mahantesh Halappanavar:

FlowForecaster: Automatically Inferring Detailed & Interpretable Workflow Scaling Models for Forecasts. 420-432 - Raveesh Garg, Michael Pellauer, Sivasankaran Rajamanickam, Tushar Krishna:

Cello: Co-Designing Schedule and Hybrid Implicit/Explicit Buffer for Complex Tensor Reuse. 433-446 - Md Nahid Newaz, Sayan Ghosh, Nathan R. Tallent, Guangzhi Qu:

Locality Aware Process Remapping for Distributed-Memory Graph Workloads. 447-459 - Aranya Banerjee, Daniel Gibney, Helen Xu, Srinivas Aluru:

A Work-Optimal Parallel Algorithm for Aligning Sequences to Genome Graphs. 460-471 - Souvadra Hati, Akihiro Hayashi, Richard W. Vuduc:

An Asynchronous Distributed-Memory Parallel Algorithm for $k$-Mer Counting. 472-483 - Joy Kitson, Ian J. Costello, Jiangzhuo Chen, Diego Jiménez, Stefan Hoops, Henning S. Mortveit, Esteban Meneses, Jae-Seung Yeom, Madhav V. Marathe, Abhinav Bhatele:

Pandemics in Silico: Scaling Agent-Based Simulations on Realistic Social Contact Networks. 484-496 - Md Sirajul Islam, Sanjeev Panta, Fei Xu, Xu Yuan, Li Chen, Nian-Feng Tzeng:

SEAFL: Enhancing Efficiency in Semi-Asynchronous Federated Learning Through Adaptive Aggregation and Selective Training. 509-519 - Ahmad Faraz Khan, Xinran Wang, Qi Le, Zain ul Abdeen, Azal Ahmad Khan, Haider Ali, Ming Jin, Jie Ding, Ali Raza Butt, Ali Anwar:

IP-FL: Incentive-Driven Personalization in Federated Learning. 520-532 - Jiayu Zhao, Chunwei Xia, Zheng Wang:

Leveraging Compilation Statistics for Compiler Phase Ordering. 533-545 - Le Chen, Nesreen K. Ahmed, Mihai Capota, Ted Willke, Niranjan Hasabnis, Ali Jannesari:

PCEBench: A Multi-Dimensional Benchmark for Evaluating Large Language Models in Parallel Code Generation. 546-557 - Hangda Liu, Boyu Diao, Yu Yang, Wenxin Chen, Xiaohui Peng, Yongjun Xu:

Gensor: A Graph-Based Construction Tensor Compilation Method for Deep Learning. 558-569 - Weicong Chen, Sarah J. Carr, Jing Zhang, Curtis Tatsuoka, Xiaoyi Lu:

SPRT2: Scalable, Parallel, and Real-Time fMRI Data Analysis on Heterogeneous Architectures. 570-581 - Leon C. Oostrum, Bram Veenboer, Ronald Rook, Michael Brown, Pieter Kruizinga, John W. Romein:

The Tensor-Core Beamformer: A High-Speed Signal-Processing Library for Multidisciplinary Use. 582-592 - Shahaf Gargir, Sivan Toledo:

Parallel-in-Time Kalman Smoothing Using Orthogonal Transformations. 593-604 - Diletta Chiaro, Pian Qi, Edoardo Prezioso, Antonella Guzzo, Francesco Piccialli:

FLAME: Federated Learning for Attack Mitigation and Evasion. 605-615 - Mulin Li, Zhaolong Jian, Kaixuan Yang, Xueshuo Xie, Wajdy Othman, Tao Li:

Hybrid-Granularity Parallelism Support for Fast Transaction Processing in Blockchain-Based Federated Learning. 616-628 - Xiaoyu Fan, Kun Chen, Guosai Wang, Xiaowei Zhu, Haoqing He, Xie Yong, Xiaofeng Jia, Yidong Li, Wei Xu:

Pair-Then-Aggregate: Simplified and Efficient Parallel Programming Paradigm for Secure Multi-Party Computation. 629-640 - Zhuo Yuan, Haopeng Chen, Yucheng Tao, Zihong Lin:

LaOvl: Lifecycle-Aware Overlay File System for Efficient Container I/O in Cloud Computing. 654-665 - Kihwan Kim, Hyunsun Chung, Seonghoon Ahn, Junhyeok Park, Safdar Jamil, Hongsu Byun, Myungcheol Lee, Jinchun Choi, Youngjae Kim:

KVACCEL: A Novel Write Accelerator for LSM-Tree-Based KV Stores with Host-SSD Collaboration. 666-677 - Lucas Esclapez, Laurent Soucasse, Caspar Jungbacker, Fredrik Jansson, Stephan R. de Roode, Pedro Costa, Gijs van den Oord, Alessio Sclocco:

Accelerating the Dutch Atmospheric Large-Eddy Simulation (DALES) Model with OpenACC. 678-688 - George Bisbas, Rhodri Nelson, Mathias Louboutin, Fabio Luporini, Paul H. J. Kelly, Gerard Gorman:

Automated MPI-X Code Generation for Scalable Finite-Difference Solvers. 689-701 - Minseok Ryu, Geunyeong Byeon, Kibaek Kim:

A GPU-Accelerated Distributed Algorithm for Optimal Power Flow in Distribution Systems. 702-711 - Dipak Acharya, Tong Shu:

PredTOP: Latency Predictor Utilizing DAG Transformers for Distributed Deep Learning Training with Operator Parallelism. 712-724 - Guangqiang Luan, Pu Pang, Quan Chen, Chen Chen, Guoyao Xu, Chi Zhang, Yanyi Zi, Yinghao Yu, Guodong Yang, Liping Zhang, Minyi Guo:

Reducing the End-to-End Latency of DNN-Based Recommendation Systems in GPU Pools. 725-736 - Lihan Hu, Peng Jiang:

Improving Accuracy and Efficiency of Graph Embedding Training with Fine-Grained Parameter Management. 737-748 - Francieli Boito, Luan Teylo, Mihail Popov, Théo Jolivel, François Tessier, Jakob Lüttgau, Julien Monniot, Ahmad Tarraf, André Ramos Carneiro, Carla Osthoff:

A Deep Look into the Temporal I/O Behavior of HPC Applications. 749-762 - Md. Hasanur Rashid, Dong Dai:

AdapTBF: Decentralized Bandwidth Control via Adaptive Token Borrowing for HPC Storage. 775-788 - Arijus Lengvenis, Holger Dachsel, Laura Morgenstern, Ivo Kabadshow:

A New Spin on the Fast Multipole Method for GPUs: Rethinking the Far-Field Operators. 789-800 - Rongrong Liu, Zhuoqiang Guo, Qiuchen Sha, Tong Zhao, Haibo Li, Wei Hu, Lijun Liu, Guangming Tan, Weile Jia:

Large Scale Finite-Temperature Real-Time Time Dependent Density Functional Theory Calculation with Hybrid Functional on ARM and GPU Systems. 801-812 - Brian C. Dandurand, Hans Vandierendonck, Bronis R. de Supinski:

Improving Parallel Scalability for Molecular Dynamics Simulations in the Exascale Era. 813-823 - Lorenzo Carpentieri, Antonio De Caro, Majid Salimi Beni, Kaijie Fan, Biagio Cosenza:

Phase-Based Frequency Scaling for Energy-Efficient Heterogeneous Computing. 824-836 - Kejie Ma, Hailong Yang, Zizheng Zhang, Xin You, Zhibo Xuan, Qingxiao Sun, Zhongzhi Luan, Yi Liu, Depei Qian:

GNNPerf: Towards Effective Performance Profiling and Analysis Across GNN Frameworks. 837-849 - Zheng Chu, Ren Hang Zhang, Baozhu Li, Changtian Ying, Weiyun Li:

Graph Neural Network-Based Latency Prediction for Stream Processing Task. 850-859 - Robert Haas:

Next-gen Infrastructure for Scalable Generative AI: Focus on Advances in Storage, Computing and Orchestration. 860 - Grant Wilkins, Sheng Di, Jon C. Calhoun, Robert Underwood, Franck Cappello:

To Compress or Not to Compress: Energy Trade-Offs and Benefits of Lossy Compressed I/O. 861-873 - Alex Fallin, Noushin Azami, Sheng Di, Franck Cappello, Martin Burtscher:

Fast and Effective Lossy Compression on GPUs and CPUs with Guaranteed Error Bounds. 874-887 - Yukang Dong, Wenbin Jiang, Xinhai Shen, Haihong Guo, Zhiyuan Shao, Hai Jin:

BRP-SpMM: Block-Row Partition Based Sparse Matrix Multiplication with Tensor and CUDA Cores. 901-912 - Hanan Khan, Deniz Gurevin, Omer Khan:

Graph Input-Aware Matrix Multiplication for Pruned Graph Neural Network Acceleration. 913-925 - Cong Ma, Du Wu, Zhelang Deng, Jiang Chen, Xiaowen Huang, Jintao Meng, Wenxi Zhu, Bingqiang Wang, Amelie Chi Zhou, Peng Chen, Minwen Deng, Yanjie Wei, Shengzhong Feng, Yi Pan:

NM-SpMM: Accelerating Matrix Multiplication Using N:M Sparsity with GPGPU. 926-937 - Chen-Chun Chen, Jinghan Yao, Lang Xu, Hari Subramoni, Dhabaleswar K. Panda:

Unified Designs of Multi-Rail-Aware MPI Allreduce and Alltoall Operations Across Diverse GPU and Interconnect Systems. 938-949 - Mert Hidayetoglu, Simon Garcia de Gonzalo, Elliott Slaughter, Pinku Surana, Wen-mei W. Hwu, William Gropp, Alex Aiken:

HiCCL: A Hierarchical Collective Communication Library. 950-961 - Alexandre Denis, Charles Goedefroit:

NBLFQ: A Lock-Free MPMC Queue Optimized for Low Contention. 962-973 - Pu Jiao, Sheng Di, Mingze Xia, Xuan Wu, Jinyang Liu, Xin Liang, Franck Cappello:

Improving the Efficiency of Interpolation-based Scientific Data Compressors with Adaptive Quantization Index Prediction. 974-986 - Roberto Nuca, Matteo Parsani, George Turkiyyah:

An Adaptive Two-Stage Algorithm for Error-Bounded Scientific Data Compression. 987-997 - Fengkui Yang, Bo Mao, Yuhan Liu, Liang Bao, Weipeng Jiang, Dongying Zhang, Chunhua Li, Ke Zhou:

Achieving Better Benefits via Flexible Feature Matching in Post-Deduplication Delta Compression. 998-1010 - Alycia Lisito, Mathieu Faverge, Matthieu Kuhn, Florent Pruvost, Pierre Ramet:

Scalable and Portable LU Factorization with Partial Pivoting on Top of Runtime Systems. 1011-1022 - Tim Noack, Louis Krüger, Andreas Koch:

Accelerating Sparse Linear Solvers on Intelligence Processing Units. 1023-1035 - Robert Ernstbrunner, Wilfried N. Gansterer:

Adaptive s-Step GMRES with Randomized and Truncated Low-Synchronization Orthogonalization. 1036-1047 - Xi Wang, Jie Liu, Jianbo Wu, Shuangyan Yang, Jie Ren, Bhanu Shankar, Dong Li:

Performance Characterization of CXL Memory and Its Use Cases. 1048-1061 - Rashid Aligholipour, Pavlos Aimoniotis, Stefanos Kaxiras, Yuan Yao:

RXT: RefleXive Address Translation for Pointer-Chasing Workloads. 1062-1073 - Maksym Planeta, Jan Bierbaum, Michael Roitzsch, Hermann Härtig:

CoRD: Converged RDMA Dataplane. 1074-1090 - João Nuno Ferreira Alves, Samir Moustafa, Siegfried Benkner, Alexandre P. Francisco, Wilfried N. Gansterer, Luís M. S. Russo:

Accelerating Graph Neural Networks Using a Novel Computation-Friendly Matrix Compression Format. 1091-1103 - Jieyang Chen, Qian Gong, Yanliang Li, Xin Liang, Lipeng Wan, Qing Liu, Norbert Podhorszki, Scott Klasky:

HPDR: High-Performance Portable Scientific Data Reduction Framework. 1104-1116 - Alexandra Poulos, Robert Underwood, Jon C. Calhoun, Sheng Di, Franck Cappello:

Sensitivity and Impacts on Parallel Compression of Prediction of Lossy Compression Ratios for Scientific Data. 1117-1128 - Xinmiao Zhang, Cheng Liu, Shengwen Liang, Hayden Kwok-Hay So, Ying Wang, Lei Zhang, Huawei Li, Xiaowei Li:

Taijigraph: an Out-Of-Core Graph Processing System Enhanced with Computational Storage. 1129-1140 - Wahid Uz Zaman, Cyan Subhra Mishra, Saleh AlSaleh, Abutalib Aghayev, Mahmut Taylan Kandemir:

CORD: Parallelizing Query Processing Across Multiple Computational Storage Devices. 1141-1153 - Yihua Wei, Lihan Hu, Peng Jiang:

Matcha: A Language and Compiler for Backtracking-Based Subgraph Matching. 1154-1165 - Elliott Binder, Arvind Sudarsanam, Ravi Sunkavalli, Tze Meng Low:

FATHOM: Fast Attention Through Optimizing Memory. 1166-1178 - Jie Ye, Jaime Cernuda, Avinash Maurya, Xian-He Sun, Anthony Kougkas, Bogdan Nicolae:

Characterizing the Behavior and Impact of KV Caching on Transformer Inferences Under Concurrency. 1191-1202 - Zecheng Li, Shruti Shivakumar, Jiajia Li, Ramakrishnan Kannan:

SymProp: Scaling Sparse Symmetric Tucker Decomposition via Symmetry Propagation. 1203-1214 - Chiang-Heng Chien, Ahmad Abdelfattah, Benjamin B. Kimia:

Accelerating Homotopy Continuation with GPUs: Application to Trifocal Pose Estimation. 1215-1227 - Jieun Kim, Dukyun Nam:

Enhanced JPEG Decoding Using PIM Architectures with Parallel MCU Processing. 1228-1237 - Xiaobo Zheng, Lisha Qin, Shiyi Li, Wen Xia, Chentao Wu, Yunfei Gu, Qicong Lin, Jun Wan, Huifang Jiao, Rubing Huang:

An Effective Uncorrectable Memory Error Prediction Framework by Exploiting UPH Indicators in Production Environments. 1238-1248 - Xin Yi, Hengbiao Yu, Liqian Chen, Xiaoguang Mao, Ji Wang, Chun Huang, Deheng Yang:

Fine-Grained Global Search for Inputs Triggering Floating-Point Exceptions in GPU Programs. 1249-1260 - Dan Wu, Zhaoying Li, Tulika Mitra:

Inkstream: Instantaneous GNN Inference on Dynamic Graphs via Incremental Update. 1273-1285 - Ruiyang Chen, Xing Li, Xiaoyao Liang, Zhuoran Song:

GIFTS: Efficient GCN Inference Framework on PyTorch-CPU via Exploring the Sparsity. 1286-1297 - Waris Gill, Mohamed Elidrisi, Pallavi Kalapatapu, Ammar Ahmed, Ali Anwar, Muhammad Ali Gulzar:

MeanCache: User-Centric Semantic Caching for LLM Web Services. 1298-1310















