


default search action
"Subkv: Quantizing Long Context KV Cache for Sub-Billion Parameter Language ..."
Ziqian Zeng et al. (2025)
- Ziqian Zeng, Tao Zhang
, Zhengdong Lu, Wenjun Li, Huiping Zhuang, Hongen Shao, Sin G. Teo, Xiaofeng Zou:
Subkv: Quantizing Long Context KV Cache for Sub-Billion Parameter Language Models on Edge Devices. Softw. Pract. Exp. 55(8): 1287-1304 (2025)

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.