


default search action
39th AAAI 2025: Philadelphia, PA, USA
- Toby Walsh, Julie Shah, Zico Kolter:
AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25 - March 4, 2025, Philadelphia, PA, USA. AAAI Press 2025, ISBN 978-1-57735-897-8
Technical Tracks 1
- Seokho Ahn, Hyungjin Kim, Sungbok Shin, Young-Duk Seo:
Real-Time Calibration Model for Low-Cost Sensor in Fine-Grained Time Series. 3-11 - Randy Ardywibowo, Rakesh Sunki, Shin Tsz Lucy Kuo, Sankalp Nayak:
BayesCNS: A Unified Bayesian Approach to Address Cold Start and Non-Stationarity in Search Systems at Scale. 12-20 - Feiyang Cai, Chuchu Fan, Stanley Bak:
Scalable Surrogate Verification of Image-Based Neural Network Control Systems Using Composition and Unrolling. 21-30 - Biwei Cao, Qihang Wu, Jiuxin Cao, Bo Liu, Jie Gui:
External Reliable Information-enhanced Multimodal Contrastive Learning for Fake News Detection. 31-39 - Ji Cao, Tongya Zheng, Qinghong Guo, Yu Wang, Junshu Dai, Shunyu Liu, Jie Yang, Jie Song, Mingli Song:
Holistic Semantic Representation for Navigational Trajectory Generation. 40-48 - Jipeng Cen, Jiaxin Liu, Zhixu Li, Jingjing Wang:
SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent Collaboration. 49-57 - Geng Chen, Wuyuan Xie, Di Lin, Ye Liu, Miaohui Wang:
mmFAS: Multimodal Face Anti-Spoofing Using Multi-Level Alignment and Switch-Attention Fusion. 58-66 - Jie Chen
, Liangmin Wang, Huijuan Zhu, Victor S. Sheng:
CLEP: A Novel Contrastive Learning Method for Evolutionary Reentrancy Vulnerability Detection. 67-74 - Xiaocan Chen, Qilin Yin, Jiarui Liu, Wei Lu, Xiangyang Luo, Jiantao Zhou:
GLCF: A Global-Local Multimodal Coherence Analysis Framework for Talking Face Generation Detection. 75-83 - Zhe Chen, Zhe Fang, Wenhao Tian, Zhaoguang Long, Changzhi Sun, Yuefeng Chen, Hao Yuan, Honglin Li, Man Lan:
ReactGPT: Understanding of Chemical Reactions via In-Context Tuning. 84-92 - Kaihui Cheng, Ce Liu, Qingkun Su, Jun Wang, Liwei Zhang, Yining Tang, Yao Yao, Siyu Zhu, Yuan Qi:
4D Diffusion for Dynamic Protein Structure Prediction with Reference and Motion Guidance. 93-101 - Xiaoxia Cheng, Zeqi Tan, Zhe Zheng, Weiming Lu:
G2LDetect: A Global-to-Local Approach for Hallucination Detection. 102-109 - Sungjun Cho, Dae-Woong Jeong, Sung Moon Ko, Jinwoo Kim, Sehui Han, Seunghoon Hong, Honglak Lee, Moontae Lee:
3D Denoisers Are Good 2D Teachers: Molecular Pretraining via Denoising and Cross-Modal Distillation. 110-118 - Muzhi Dai, Zhuoer Dong, Weining Fu, Kui Xu, Qiangfeng Cliff Zhang:
CryoDomain: Sequence-free Protein Domain Identification from Low-resolution Cryo-EM Density Maps. 119-127 - Zhenlong Dai, Bingrui Chen, Zhuoluo Zhao, Xiu Tang, Sai Wu, Chang Yao, Zhipeng Gao, Jingyuan Chen:
Less Is More: Adaptive Program Repair with Bug Localization and Preference Learning. 128-136 - Chao Deng, Hongdong Li, Jianxin Wang:
Improving Cancer Gene Prediction by Enhancing Common Information Between the PPI Network and Gene Functional Association. 137-145 - Saaketh Desai, Sadhvikas Addamane, Jeffrey Y. Tsao, Igal Brener, Laura P. Swiler, Rémi Dingreville, Prasad P. Iyer:
AutoSciLab: A Self-Driving Laboratory for Interpretable Scientific Discovery. 146-154 - Zhihao Ding, Ting Zhang, Yiran Li, Jieming Shi, Chen Jason Zhang:
RingFormer: A Ring-Enhanced Graph Transformer for Organic Solar Cell Property Prediction. 155-163 - Zhiang Dong, Jingyuan Chen, Fei Wu:
Knowledge Is Power: Harnessing Large Language Models for Enhanced Cognitive Diagnosis. 164-172 - Yitong Duan, Weiran Wang, Jian Li:
FactorGCL: A Hypergraph-Based Factor Model with Temporal Residual Contrastive Learning for Stock Returns Prediction. 173-181 - Haodong Feng, Yue Wang, Dixia Fan:
How to Re-enable PDE Loss for Physical Systems Modeling Under Partial Observation. 182-190 - Myles Foley, Sergio Maffeis:
APIRL: Deep Reinforcement Learning for REST API Fuzzing. 191-199 - Daniel Freedman, Eyal Rozenberg, Alex M. Bronstein:
A Theoretical Framework for an Efficient Normalizing Flow-Based Solution to the Electronic Schrödinger Equation. 200-209 - Lihao Gan, Xin Man, Chenghong Zhang, Jie Shao:
EWMoE: An Effective Model for Global Weather Forecasting with Mixture-of-Experts. 210-218 - Zhangyang Gao, Cheng Tan, Jue Wang, Yufei Huang, Lirong Wu, Stan Z. Li:
FoldToken: Learning Protein Language via Vector Quantization and Beyond. 219-227 - Hao Guo, Zihan Ma, Zhi Zeng, Minnan Luo, Weixin Zeng, Jiuyang Tang, Xiang Zhao:
Each Fake News Is Fake in Its Own Way: An Attribution Multi-Granularity Benchmark for Multimodal Fake News Detection. 228-236 - Rong Han, Wenbing Huang, Lingxiao Luo, Xinyan Han, Jiaming Shen, Zhiqiang Zhang, Jun Zhou, Ting Chen:
HeMeNet: Heterogeneous Multichannel Equivariant Network for Protein Multi-task Learning. 237-245 - Rong Han, Xiaohong Liu, Tong Pan, Jing Xu, Xiaoyu Wang, Wuyang Lan, Zhenyu Li, Zixuan Wang, Jiangning Song, Guangyu Wang, Ting Chen:
CoPRA: Bridging Cross-domain Pretrained Sequence Models with Complex Structures for Protein-RNA Binding Affinity Prediction. 246-254 - Xiao Han, Zijian Zhang, Xiangyu Zhao, Yuanshao Zhu, Guojiang Shen, Xiangjie Kong, Xuetao Wei, Liqiang Nie, Jieping Ye:
GARLIC: GPT-Augmented Reinforcement Learning with Intelligent Control for Vehicle Dispatching. 255-263 - Zehua Han, Jing Xiao, Qirui Zhao, Zhexuan Cui, Yufeng Wang, Duona Zhang, Wenrui Ding:
Open-world Radio Frequency Fingerprint Identification via Augmented Semi-supervised Learning. 264-272 - Harish Haresamudram, Apoorva Beedu, Mashfiqui Rabbi, Sankalita Saha, Irfan Essa, Thomas Ploetz:
Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition - And Ways to Overcome Them. 273-281 - Meixia He, Peican Zhu, Keke Tang, Yangming Guo:
Hypergraph Attacks via Injecting Homogeneous Nodes into Elite Hyperedges. 282-290 - Qiang He, Yunting Bao, Hui Fang, Yuting Lin, Hao Sun:
HHAN: Comprehensive Infectious Disease Source Tracing via Heterogeneous Hypergraph Neural Network. 291-299 - Chia-Tung Ho, Haoxing Ren, Brucek Khailany:
VerilogCoder: Autonomous Verilog Coding Agents with Graph-based Planning and Abstract Syntax Tree (AST)-based Waveform Tracing Tool. 300-307 - Tran Thai Hoa, Tran Quang Duy, Khanh Quoc Tran, Kiet Van Nguyen:
ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese. 308-316 - Jingjing Hu, Dan Guo, Zhan Si, Deguang Liu, Yunfeng Diao, Jing Zhang, Jinxing Zhou, Meng Wang:
MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights. 317-325 - Xinlei Huang, Zhiqi Ma, Dian Meng, Yanran Liu, Shiwei Ruan, Qingqiang Sun, Xubin Zheng, Ziyue Qiao:
PRAGA: Prototype-aware Graph Adaptive Aggregation for Spatial Multi-modal Omics Analysis. 326-333 - Yinxuan Huang, Ke Liang, Yanyi Huang, Xiang Zeng, Kai Chen, Bin Zhou:
Social Recommendation via Graph-Level Counterfactual Augmentation. 334-342 - Zhiheng Huang, Yannan Liu, Daojing He, Yu Li:
DF-MIA: A Distribution-Free Membership Inference Attack on Fine-Tuned Large Language Models. 343-351 - Pengcheng Jiang, Cao Xiao, Tianfan Fu, Parminder Bhatia, Taha A. Kass-Hout, Jimeng Sun, Jiawei Han:
Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations. 352-360 - Sizhuo Jin, Shuo Chen, Jianjun Qian, Ying Tai, Jun Li:
Learning Generalized Residual Exchange-Correlation-Uncertain Functional for Density Functional Theory. 361-369 - Feifei Kou, Yuhan Yao, Siyuan Yao, Jiahao Wang, Lei Shi, Yawen Li, Xuejing Kang:
IWRN: A Robust Blind Watermarking Method for Artwork Image Copyright Protection Against Noise Attack. 370-378 - Yao Lai, Sungyoung Lee, Guojin Chen, Souradip Poddar, Mengkang Hu, David Z. Pan, Ping Luo:
AnalogCoder: Analog Circuit Design via Training-Free Code Generation. 379-387 - Hao Li, Ruoyuan Gong, Hao Jiang:
Political Actor Agent: Simulating Legislative System for Roll Call Votes Prediction with Large Language Models. 388-396 - Haoran Li, Yulin Chen, Zihao Zheng, Qi Hu, Chunkit Chan, Heshan Liu, Yangqiu Song:
Simulate and Eliminate: Revoke Backdoors for Generative Large Language Models. 397-405 - Haoran Li
, Xingjian Li, Jiahua Shi, Huaming Chen, Bo Du, Daisuke Kihara, Johan Barthelemy, Jun Shen
, Min Xu:
Vox-UDA: Voxel-wise Unsupervised Domain Adaptation for Cryo-Electron Subtomogram Segmentation with Denoised Pseudo-Labeling. 406-414 - Junxian Li, Di Zhang, Xunzhi Wang, Zeying Hao, Jingdi Lei, Qian Tan, Cai Zhou, Wei Liu, Yaotian Yang, Xinrui Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Wei Li, Mao Su, Shufei Zhang, Wanli Ouyang, Yuqiang Li, Dongzhan Zhou:
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area. 415-423 - Kai Li, Wenqi Ren, Jianshu Li, Wei Wang, Xiaochun Cao:
Critical Forgetting-Based Multi-Scale Disentanglement for Deepfake Detection. 424-432 - Mingxin Li, Yuchen Zhang, Haowei Xu, Xianghua Li, Chao Gao, Zhen Wang:
Learning Complex Heterogeneous Multimodal Fake News via Social Latent Network Inference. 433-441 - Tian Li, Xiao-Yue Xu, Chen Ding, Tian-Ci Tian, Wei-You Liao, Shuo Zhang, He-Liang Huang:
AI-Powered Algorithm-Centric Quantum Processor Topology Design. 442-450 - Zhiting Li, Shibai Yin, Tai-Xiang Jiang, Yexun Hu, Jia-Mian Wu, Guowei Yang, Guisong Liu:
Enhancing the Adversarial Robustness via Manifold Projection. 451-459 - Zhufeng Li, Sandeep Suresh Cranganore, Nicholas D. Youngblut, Niki Kilbertus:
Whole Genome Transformer for Gene Interaction Effects in Microbiome Habitat Specificity. 460-469 - Zongwei Li
, Xiaoqi Li, Wenkai Li, Xin Wang:
SCALM: Detecting Bad Practices in Smart Contracts Through LLMs. 470-477 - Yuqi Liang, Jun Luo, Xiaoxi Guo, Jianqi Bi:
An Evaluation Framework for Product Images Background Inpainting Based on Human Feedback and Product Consistency. 478-486 - Panfeng Liu, Guoliang Qiu, Biaoshuai Tao, Kuan Yang:
A Thorough Comparison Between Independent Cascade and Susceptible-Infected-Recovered Models. 487-495 - Runxin Liu, Tian Xie, Jiaming Li, Lingyun Yu, Hongtao Xie:
IDseq: Decoupled and Sequentially Detecting and Grounding Multi-Modal Media Manipulation. 496-504 - Xiangyu Liu, Yi Liu, Silei Chen, Wei Hu:
Controllable Protein Sequence Generation with LLM Preference Optimization. 505-513 - Xiyao Liu, Junxing Ma, Xinda Wang, Qianyu Lin, Jian Zhang, Gerald Schaefer, Cagatay Turkay, Hui Fang:
Recoverable Facial Identity Protection via Adaptive Makeup Transfer Adversarial Attacks. 514-522 - Xuan Liu, Menglu Li:
Knowledge-Guided Domain Adaptation Model for Transferring Drug Response Prediction from Cell Lines to Patients. 523-531 - Yupei Liu, Yanting Wang, Jinyuan Jia:
TrojanDec: Data-free Detection of Trojan Inputs in Self-supervised Learning. 532-540 - Yuxuan Liu, Hongda Sun, Wenya Guo, Xinyan Xiao, Cunli Mao, Zhengtao Yu, Rui Yan:
BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking. 541-549 - Zhendong Liu, Le Zhang, Bing Li, Yingjie Zhou, Zhenghua Chen, Ce Zhu:
WiFi CSI Based Temporal Activity Detection via Dual Pyramid Network. 550-558 - Weihai Lu, Yu Tong
, Zhiqiu Ye:
DAMMFND: Domain-Aware Multimodal Multi-view Fake News Detection. 559-567 - Bingjun Luo, Jinpeng Wang, Zewen Wang, Junjie Zhu, Xibin Zhao:
Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval. 568-576 - Rui Lv, Qi Liu, Weibo Gao, Haotian Zhang, Junyu Lu, Linbo Zhu:
GenAL: Generative Agent for Adaptive Learning. 577-585 - Tianxu Lv, Jie Zhu, Jinyi Liu, Shiyun Nie, Hongnian Tian, Yang Xiao, Yuan Liu, Lihua Li, Xiang Pan:
M²N: A Progressive Macro-to-Micro 3D Modeling Scheme for Unveiling Drug-Target Affinity. 586-594 - Takashi Matsubara, Takaharu Yaguchi:
Number Theoretic Accelerated Learning of Physics-Informed Neural Networks. 595-603 - Jian-Ping Mei, Weibin Zhang, Jie Chen, Xuyun Zhang, Tiantian Zhu:
Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy. 604-611 - Hui Miao, Yuanfang Guo, Zeming Liu, Yunhong Wang:
Multi-modal Deepfake Detection via Multi-task Audio-Visual Prompt Learning. 612-621 - Yuwei Miao, Yuzhi Guo, Hehuan Ma, Jingquan Yan, Feng Jiang, Rui Liao, Junzhou Huang:
GoBERT: Gene Ontology Graph Informed BERT for Universal Gene Function Prediction. 622-630 - Li Ni, Rui Ye, Wenjian Luo, Yiwen Zhang, Lei Zhang, Victor S. Sheng:
SLRL: Semi-Supervised Local Community Detection Based on Reinforcement Learning. 631-639 - Zhibin Ni, Chang Liu, Hai Wan, Xibin Zhao:
Robust Heterogeneous Graph Classification for Molecular Property Prediction with Information Bottleneck. 640-648 - Pedro Orvalho
, Mikolás Janota, Vasco M. Manquinho:
Counterexample Guided Program Repair Using Zero-Shot Learning and MaxSAT-based Fault Localization. 649-657 - Jimin Park, AHyun Ji, Minji Park, Mohammad Saidur Rahman, Se Eun Oh:
MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification. 658-666 - Gaozheng Pei, Shaojie Lyu, Ke Ma, Pinci Yang, Qianqian Xu, Yingfei Sun:
Exploring Query Efficient Data Generation Towards Data-Free Model Stealing in Hard Label Setting. 667-675 - Jiaxin Qi, Yan Cui, Kailei Guo, Xiaomin Zhang, Jianqiang Huang, Gaogang Xie:
A Simple and Comprehensive Benchmark for Single-Cell Transcriptomics. 676-684 - Xing Qiu
, Guang Cheng, Weizhou Zhu, Dandan Niu, Nan Fu:
Dual-Channel Interactive Graph Transformer for Traffic Classification with Message-Aware Flow Representation. 685-693 - Chenfan Qu, Yiwu Zhong, Fengjun Guo, Lianwen Jin:
Revisiting Tampered Scene Text Detection in the Era of Generative AI. 694-702 - Huiru Shao, Kaizhu Huang, Wei Wang, Xiaowei Huang, Qiufeng Wang:
Towards Better Robustness Against Natural Corruptions in Document Tampering Localization. 703-710 - Guobin Shen, Dongcheng Zhao, Aorigele Bao, Xiang He, Yiting Dong, Yi Zeng:
StressPrompt: Does Stress Impact Large Language Models and Human Performance Similarly? 711-719 - Ziqi Sheng, Wei Lu, Xiangyang Luo, Jiantao Zhou, Xiaochun Cao:
SUMI-IFL: An Information-Theoretic Framework for Image Forgery Localization with Sufficiency and Minimality Constraints. 720-728 - Yi Shi, Yun-Kai Wang, Xu-Peng Tian, Tie-Yi Zhang, Bing Yao, Hui Wang, Yong Shao, Cen-Cen Wang, Rong Zeng:
SpeHeaTal: A Cluster-Enhanced Segmentation Method for Sperm Morphology Analysis. 729-737 - Yiwei Shi, Muning Wen, Qi Zhang, Weinan Zhang, Cunjia Liu, Weiru Liu
:
Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation. 738-745 - Shibo Feng, Peilin Zhao, Liu Liu, Pengcheng Wu, Zhiqi Shen:
HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting. 746-754 - Xiaozhuang Song, Yuzhao Tu, Hangting Ye, Wei Fan, Qingquan Zhang, Xiaoxue Wang, Tianshu Yu:
Enhancing Generalizability in Molecular Conformation Generation with METRIZATION-Informed Geometric Diffusion Pretraining. 755-763 - Yunpeng Song, Jiawei Li, Yiheng Bian, Zhongmin Cai:
Predicting User Behavior in Smart Spaces with LLM-Enhanced Logs and Personalized Prompts. 764-772 - Nan Sun, Han Fang, Yuxing Lu, Chengxin Zhao, Hefei Ling:
END^2: Robust Dual-Decoder Watermarking Framework Against Non-Differentiable Distortions. 773-781 - Cheng Tan, Yijie Zhang, Zhangyang Gao, Yufei Huang, Haitao Lin, Lirong Wu, Fandi Wu, Mathieu Blanchette, Stan Z. Li:
dyAb: Flow Matching for Flexible Antibody Design with AlphaFold-driven Pre-binding Antigen. 782-790 - Lei Tan, Yuliang Xue, Guobiao Li, Zhenxing Qian, Sheng Li, Chunlei Bao:
Embedding Robust Watermarking into Pattern to Protect the Copyright of Ceramic Artifacts. 791-798 - Renshuai Tao, Manyi Le, Chuangchuang Tan, Huan Liu, Haotong Qin, Yao Zhao:
ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks. 799-807 - Chengyue Wang, Haicheng Liao, Bonan Wang, Yanchen Guan, Bin Rao, Ziyuan Pu, Zhiyong Cui, Cheng-Zhong Xu, Zhenning Li:
NEST: A Neuromodulated Small-world Hypergraph Trajectory Prediction Model for Autonomous Driving. 808-816 - Jia Wang, Liyan Zhu, Zhe Wang, Chenqiu Zhang, Yaoxing Wu, Jun Cui, Jianqiang Li:
PScalpel: A Machine Learning-based Guider for Protein Phase-Separating Behaviour Alteration. 817-825 - Jiabao Wang, Zepeng Wu, Qian Dong, Lingzhong Meng, Yunzhi Xue, Yukuan Yang:
Hybrid-Driving: An Autonomous Driving Decision Framework Integrating Large Language Models, Knowledge Graphs and Driving Rules. 826-833 - Jingyuan Wang, Yujing Lin, Yudong Li:
GTG: Generalizable Trajectory Generation Model for Urban Mobility. 834-842 - Lingzhi Wang, Xingshan Zeng, Jinsong Guo, Kam-Fai Wong, Georg Gottlob:
Selective Forgetting: Advancing Machine Unlearning Techniques and Evaluation in Language Models. 843-851 - Ruoqi Wang, Haitao Wang, Qiong Luo, Feng Wang, Hejun Wu:
VisRec: A Semi-Supervised Approach to Visibility Data Reconstruction in Radio Astronomy. 852-860 - Xiaozheng Wang, Yong Yang, Shuying Huang, Hangyuan Lu, Weiguo Wan, Aoqi Zhao:
FMPM-DNet: Hyperspectral Pansharpening Dynamic Network Based on Feature Modulation and Probability Mask. 861-868 - Yueqing Wang, Peng Zhang, Yushuang Liu, Jianing Zhao, Jie Lin, Yi Chen:
Aerodynamic Coefficients Prediction via Cross-Attention Fusion and Physical-Informed Training. 869-876 - Fang Wu, Bozhen Hu, Stan Z. Li:
Generalized Implicit Neural Representations for Dynamic Molecular Surface Modeling. 877-885 - Juntao Wu, Ziyu Song, Xiaoyu Zhang, Shujun Xie, Longxin Lin, Ke Wang:
Vision Transformers Beat WideResNets on Small Scale Datasets Adversarial Robustness. 886-894 - Lirong Wu, Haitao Lin, Yufei Huang, Zhangyang Gao, Cheng Tan, Yunfan Liu, Tailin Wu, Stan Z. Li:
Relation-Aware Equivariant Graph Networks for Epitope-Unknown Antibody Design and Specificity Optimization. 895-904 - Zhihao Wu, Yushi Cheng, Tianyang Sun, Xiaoyu Ji, Wenyuan Xu:
MYOPIA: Protecting Face Privacy from Malicious Personalized Text-to-Image Synthesis via Unlearnable Examples. 905-913 - Zeke Xia, Ming Hu, Dengke Yan, Ruixuan Liu, Anran Li, Xiaofei Xie, Mingsong Chen:
MultiSFL: Towards Accurate Split Federated Learning via Multi-Model Aggregation and Knowledge Replay. 914-922 - Di Xiong, Shuoyuan Wang, Lei Zhang, Wenbo Huang, Chaolei Han:
Generalizable Sensor-Based Activity Recognition via Categorical Concept Invariant Learning. 923-931 - Xovee Xu, Yifan Zhang, Fan Zhou, Jingkuan Song:
Improving Multimodal Social Media Popularity Prediction via Selective Retrieval Knowledge Augmentation. 932-940 - Yongxin Xu, Xinke Jiang, Xu Chu, Rihong Qiu, Yujie Feng, Hongxin Ding, Junfeng Zhao, Yasha Wang, Bing Xie:
DearLLM: Enhancing Personalized Healthcare via Large Language Models-Deduced Feature Correlations. 941-949 - Chenchen Yang, Hao Wu, Tao Shen, Kai Zou, Siqi Sun:
PriFold: Biological Priors Improve RNA Secondary Structure Predictions. 950-958 - Xinyu Yang, Yu Sun, Xinyang Chen, Ying Zhang, Xiaojie Yuan:
Graph Structure Learning for Spatial-Temporal Imputation: Adapting to Node and Feature Scales. 959-967 - Tong Ye, Yangkai Du, Tengfei Ma, Lingfei Wu, Xuhong Zhang, Shouling Ji, Wenhai Wang:
Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting. 968-976 - Na Yu, Yutong Deng, Shunyu Liu, Kaixuan Chen, Tongya Zheng, Mingli Song:
Disentangled Table-Graph Representation for Interpretable Transmission Line Fault Location. 977-985 - Xinquan Yu, Ziqi Sheng, Wei Lu, Xiangyang Luo, Jiantao Zhou:
RaCMC: Residual-Aware Compensation Network with Multi-Granularity Constraints for Fake News Detection. 986-994 - Zeqin Yu, Jiangqun Ni, Jian Zhang, Haoyi Deng, Yuzhen Lin:
Reinforced Multi-teacher Knowledge Distillation for Efficient General Image Forgery Detection and Localization. 995-1003 - Wenwu Zeng, Liangrui Pan, Boya Ji, Liwen Xu, Shaoliang Peng:
Accurate Nucleic Acid-Binding Residue Identification Based Domain-Adaptive Protein Language Model and Explainable Geometric Deep Learning. 1004-1012 - Xi Zeng, Fei Ni, Shaoqing Jiao, Dazhi Lu, Jianye Hao, Jiajie Peng:
SWAMamba: A Sliding Window Attention Mamba Framework for Predicting Translation Elongation Rates. 1013-1021 - Jiangou Zhan, Wenhui Zhang, Zheng Zhang, Huanran Xue, Yao Zhang, Ye Wu:
Portcullis: A Scalable and Verifiable Privacy Gateway for Third-Party LLM Inference. 1022-1030 - Chaowei Zhang, Zongling Feng, Zewei Zhang, Jipeng Qiang, Guandong Xu, Yun Li:
Is LLMs Hallucination Usable? LLM-based Negative Reasoning for Fake News Detection. 1031-1039 - Chongyu Zhang, Qiping Tao, Liangyu Chen, Min Zhang:
BERT-Based Code Learning for Exception Localization and Type Prediction. 1040-1047 - Haozhen Zhang, Haodong Yue, Xi Xiao, Le Yu, Qing Li, Zhen Ling, Ye Zhang:
Revolutionizing Encrypted Traffic Classification with MH-Net: A Multi-View Heterogeneous Graph Model. 1048-1056 - Honggen Zhang, Xiangrui Gao, June Zhang, Lipeng Lai:
mRNA2vec: mRNA Embedding with Language Model in the 5'UTR-CDS for mRNA Design. 1057-1065 - Kuiyuan Zhang, Zhongyun Hua, Rushi Lan, Yushu Zhang, Yifang Guo:
Phoneme-Level Feature Discrepancies: A Key to Detecting Sophisticated Speech Deepfakes. 1066-1074 - Kuiyuan Zhang, Zhongyun Hua, Rushi Lan, Yifang Guo, Yushu Zhang, Guoai Xu:
Multi-View Collaborative Learning Network for Speech Deepfake Detection. 1075-1083 - Lei Zhang, Guanyu Gao, Haiyan Yin, Huaizheng Zhang:
Multi-Edge Reinforced Collaborative Data Acquisition for Continuous Video Analytics by Prioritizing Quality over Quantity. 1084-1092 - Qianru Zhang, Xinyi Gao, Haixin Wang, Siu Ming Yiu, Hongzhi Yin:
Efficient Traffic Prediction Through Spatio-Temporal Distillation. 1093-1101 - Ran Zhang, Xuezhi Wang, Guannan Liu, Pengyang Wang, Yuanchun Zhou, Pengfei Wang:
Motif-Oriented Representation Learning with Topology Refinement for Drug-Drug Interaction Prediction. 1102-1110 - Rongchao Zhang, Yu Huang, Yiwei Lou, Yi Xin, Haixu Chen, Yongzhi Cao, Hanpin Wang:
Exploit Your Latents: Coarse-Grained Protein Backmapping with Latent Diffusion Models. 1111-1119 - Shiqi Zhang, Pan Mu, Cheng Huang, Jinglin Zhang, Cong Bai:
TC-Diffuser: Bi-Condition Multi-Modal Diffusion for Tropical Cyclone Forecasting. 1120-1128 - Xin Zhang, Peiliang Zhang, Jingling Yuan, Lin Li:
Zero-Shot Learning for Materials Science Texts: Leveraging Duck Typing Principles. 1129-1137 - Xiongqi Zhang, Junwei Xu, Yang Wang, Dongming Xiang, Wang Lin, Zuohua Ding:
Formal Synthesis of Barrier Certificates Using Fourier Kolmogorov-Arnold Network. 1138-1146 - Yudong Zhang, Xu Wang, Xuan Yu, Zhaoyang Sun, Kai Wang, Yang Wang:
Drawing Informative Gradients from Sources: A One-stage Transfer Learning Framework for Cross-city Spatiotemporal Forecasting. 1147-1155 - Zhenbang Zhang, Hongjia Li, Zhiqiang Xu, Wenjia Meng, Renmin Han:
A Gaussian Filter-Based 3D Registration Method for Series Section Electron Microscopy. 1156-1164 - Ziyang Zhang, Yang Zhao, Ming-Ching Chang, Changyao Lin, Jie Liu:
E4: Energy-Efficient DNN Inference for Edge Video Analytics via Early Exiting and DVFS. 1165-1173 - Guanhao Zhao, Zhenya Huang, Cheng Cheng, Yan Zhuang, Qingyang Mao, Xin Li, Shijin Wang, Enhong Chen:
Multi-Perspective Consolidation Enhanced Cognitive Diagnosis via Conditional Diffusion Model. 1174-1182 - Penghai Zhao, Qinghua Xing, Kairan Dou, Jinyu Tian, Ying Tai, Jian Yang, Ming-Ming Cheng, Xiang Li:
From Words to Worth: Newborn Article Impact Prediction with LLM. 1183-1191 - Qihua Zhou, Ruibin Li, Jingcai Guo, Yaodong Huang, Zhenda Xu, Laizhong Cui, Song Guo:
DeNC: Unleash Neural Codecs in Video Streaming with Diffusion Enhancement. 1192-1200 - Ziqi Zhou, Bowen Li, Yufei Song, Zhifei Yu, Shengshan Hu, Wei Wan, Leo Yu Zhang, Dezhong Yao, Hai Jin:
NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors. 1201-1209 - Ziyi Zhou, Xiaoming Zhang, Shenghan Tan, Litian Zhang, Chaozhuo Li:
Collaborative Evolution: Multi-Round Learning Between Large and Small Language Models for Emergent Fake News Detection. 1210-1218 - Jun Zhu, Yifu Li, Zhenchao Tang, Cheng Chang:
DUSTED: Dual-Attention Enhanced Spatial Transcriptomics Denoiser. 1219-1227 - Yu Zhu, Bo Lei, Chunfeng Song, Wanli Ouyang, Shan Yu, Tiejun Huang:
Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis. 1228-1236 - Linlin Zong, Wenmin Lin, Jiahui Zhou, Xinyue Liu, Xianchao Zhang, Bo Xu, Shimin Wu:
Text-Guided Fine-grained Counterfactual Inference for Short Video Fake News Detection. 1237-1245
Technical Tracks 2
- Qing Chang, Yao-Xiang Ding, Kun Zhou:
Enhancing Identity-Deformation Disentanglement in StyleGAN for One-Shot Face Video Re-Enactment. 1247-1255 - Xuping Chen, Wuzhen Shi:
Dynamic Interactive Bimodal Hypergraph Networks for Emotion Recognition in Conversations. 1256-1264 - Yuhong Chen, Ailin Song, Huifeng Yin, Shuai Zhong, Fuhai Chen, Qi Xu, Shiping Wang, Mingkun Xu:
Multi-View Incremental Learning with Structured Hebbian Plasticity for Enhanced Fusion Efficiency. 1265-1273 - Zhuang Chen, Yaru Cao, Guanqun Bi, Jincenzi Wu, Jinfeng Zhou, Xiyao Xiao, Si Chen, Hongning Wang, Minlie Huang:
SocialSim: Towards Socialized Simulation of Emotional Support Conversation. 1274-1282 - Mateus de Oliveira Oliveira, Wim Vanden Broeck
:
Symbolic Functional Decomposition: A Reconfiguration Approach. 1283-1290 - Yiting Dong, Xiang He, Guobin Shen, Dongcheng Zhao, Yang Li, Yi Zeng:
EventZoom: A Progressive Approach to Event-Based Data Augmentation for Enhanced Neuromorphic Vision. 1291-1299 - Yi Feng, Mingyang Song, Jiaqi Wang, Zhuang Chen, Guanqun Bi, Minlie Huang, Liping Jing, Jian Yu:
SS-GEN: A Social Story Generation Framework with Large Language Models. 1300-1308 - Xilin He, Haijian Liang, Boyi Peng, Weicheng Xie, Muhammad Haris Khan, Siyang Song, Zitong Yu:
MSAmba: Exploring Multimodal Sentiment Analysis with State Space Models. 1309-1317 - Jinbing Hou, Youpeng Zhao, Jian Zhao:
CraftFactory: A Conditioned Control Policy Benchmark for Compositional Generalization. 1318-1326 - Zhejing Hu, Yan Liu, Gong Chen, Bruce X. B. Yu:
Compose with Me: Collaborative Music Inpainter for Symbolic Music Infilling. 1327-1335 - Zihan Ji, Xuetao Tian, Ye Liu:
AFFAKT: A Hierarchical Optimal Transport Based Method for Affective Facial Knowledge Transfer in Video Deception Detection. 1336-1344 - Md Rysul Kabir, James Mochizuki-Freeman, Zoran Tiganj:
Deep Reinforcement Learning with Time-Scale Invariant Memory. 1345-1354 - Lucio La Cava, Andrea Tagarelli:
Open Models, Closed Minds? On Agents Capabilities in Mimicking Human Personalities through Open Large Language Models. 1355-1363 - Zhenxin Lei, Man Yao, Jiakui Hu, Xinhao Luo, Yanye Lu, Bo Xu, Guoqi Li:
Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation. 1364-1372 - Chengtai Li, Yee Yang Tan, Yuting He, Jianfeng Ren, Ruibin Bai, Yitian Zhao, Heng Yu, Xudong Jiang:
DARR: A Dual-Branch Arithmetic Regression Reasoning Framework for Solving Machine Number Reasoning. 1373-1382 - Jingmeng Li, Lukang Fu, Surun Yang, Hui Wei:
MI-CAPTCHA: Enhance the Security of CAPTCHA Using Mooney Images. 1383-1391 - Yinan Li, Jun Long, Zhan Yang:
Asymmetric Cross-Modal Hashing Based on Formal Concept Analysis. 1392-1401 - Yu Liang, Wenjie Wei, Ammar Belatreche, Honglin Cao, Zijian Zhou, Shuai Wang, Malu Zhang, Yang Yang:
Towards Accurate Binary Spiking Neural Networks: Learning with Adaptive Gradient Modulation Mechanism. 1402-1410 - Jinhao Lin, Yifei Wang, Yanwu Xu, Qi Liu:
Semi-IIN: Semi-Supervised Intra-Inter Modal Interaction Learning Network for Multimodal Sentiment Analysis. 1411-1419 - Wei Liu, Li Yang, Mingxuan Zhao, Dengfeng Xue, Shuxun Wang, Boyu Cai, Jin Gao, Wenjuan Li, Bing Li, Weiming Hu:
Towards More Discriminative Feature Learning in SNNs with Temporal-Self-Erasing Supervision. 1420-1428 - Xiaochuan Liu, Xin Cheng
, Yuchong Sun, Xiaoxue Wu, Ruihua Song, Hao Sun, Denghao Zhang:
EyEar: Learning Audio Synchronized Human Gaze Trajectory Based on Physics-Informed Dynamics. 1429-1437 - Yan-Kai Liu, Jinyu Cai, Bao-Liang Lu, Wei-Long Zheng:
Multi-to-Single: Reducing Multimodal Dependency in Emotion Recognition Through Contrastive Learning. 1438-1446 - Haifeng Lu, Jiuyi Chen, Feng Liang, Mingkui Tan, Runhao Zeng, Xiping Hu:
Understanding Emotional Body Expressions via Large Language Models. 1447-1455 - Ryo Masumura, Shota Orihashi, Mana Ihori, Tomohiro Tanaka, Naoki Makishima, Satoshi Suzuki, Saki Mizuno, Nobukatsu Hojo:
Multimodal Fine-Grained Apparent Personality Trait Recognition: Joint Modeling of Big Five and Questionnaire Item-level Scores. 1456-1464 - Wei Miao, Jiangrong Shen, Qi Xu, Timo Hämäläinen
, Yi Xu, Fengyu Cong:
SpikingYOLOX: Improved YOLOX Object Detection with Fast Fourier Convolution and Spiking Neural Networks. 1465-1473 - Philippe Pasquier, Jeff Ens, Nathan Fradet, Paul Triana, Davide Rizzotti, Jean-Baptiste Rolland, Maryam Safi:
MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition. 1474-1482 - Lang Qin, Ziming Wang, Runhao Jiang, Rui Yan, Huajin Tang:
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL. 1483-1491 - Mirabel Reid, Santosh S. Vempala:
Does GPT Really Get It? A Hierarchical Scale to Quantify Human and AI's Understanding of Algorithms. 1492-1500 - Yimeng Shan, Malu Zhang, Ruijie Zhu, Xuerui Qiu, Jason K. Eshraghian, Haicheng Qu:
Advancing Spiking Neural Networks Towards Multiscale Spatiotemporal Interaction Learning. 1501-1509 - Haojun Shi, Suyu Ye, Xinyu Fang, Chuanyang Jin, Leyla Isik, Yen-Ling Kuo, Tianmin Shu:
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind. 1510-1519 - Kazutoshi Shinoda, Nobukatsu Hojo, Kyosuke Nishida, Saki Mizuno, Keita Suzuki, Ryo Masumura, Hiroaki Sugiyama, Kuniko Saito:
ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind. 1520-1528 - Yuxuan Song, Qiudan Li, Yilin Wu, David Jingjun Xu, Daniel Dajun Zeng:
Knowledge-Enhanced Hierarchical Heterogeneous Graph for Personality Identification with Limited Training Data. 1529-1537 - Bin Tang, Keqi Pan, Miao Zheng, Ning Zhou, Jialu Sui, Dandan Zhu, Cheng-Long Deng, Shu-Guang Kuai:
Pose as a Modality: A Psychology-Inspired Network for Personality Recognition with a New Multimodal Dataset. 1538-1546 - Chuanqi Tao, Jiaming Li, Tianzi Zang, Peng Gao:
A Multi-Focus-Driven Multi-Branch Network for Robust Multimodal Sentiment Analysis. 1547-1555 - Neha Upadhyay, Vijay Marupudi, Kamala Varma, Sashank Varma:
Alignment of CNN and Human Judgments of Geometric and Topological Concepts. 1556-1564 - Miaohui Wang, Zhenming Li, Wuyuan Xie:
DDJND: Dual Domain Just Noticeable Difference in Multi-Source Content Images with Structural Discrepancy. 1565-1573 - Yusong Wang, Xuanye Fang, Huifeng Yin, Dongyuan Li, Guoqi Li, Qi Xu, Yi Xu, Shuai Zhong, Mingkun Xu:
BIG-FUSION: Brain-Inspired Global-Local Context Fusion Framework for Multimodal Emotion Recognition in Conversations. 1574-1582 - Ziqing Wang, Yuetong Fang, Jiahang Cao, Hongwei Ren, Renjing Xu:
Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Networks. 1583-1591 - Zachary Wojtowicz, Simon DeDeo:
Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier. 1592-1600 - Sheng Wu, Dongxiao He, Xiaobao Wang, Longbiao Wang, Jianwu Dang:
Enriching Multimodal Sentiment Analysis Through Textual Emotional Descriptions of Visual-Audio Content. 1601-1609 - Zijian Wu, Leijing Zhou, Shuanglin Li, Changzeng Fu, Jun Lu, Jing Han, Yi Zhang, Zhuang Zhao, Siyang Song:
DepMGNN: Matrixial Graph Neural Network for Video-based Automatic Depression Assessment. 1610-1619 - Dingyi Zeng, Yuchen Wang, Honglin Cao, Wanlong Liu, Yichen Xiao, Chengzhuo Lu, Wenyu Chen, Malu Zhang, Guoqing Wang, Yang Yang:
Leveraging Asynchronous Spiking Neural Networks for Ultra Efficient Event-Based Visual Processing. 1620-1628 - Dengming Zhang, Weitao You, Ziheng Liu, Lingyun Sun, Pei Chen:
Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning. 1629-1637 - Miao Zhang, Jiawei Wang
, Kui Xiao, Shihui Wang, Yan Zhang, Hao Chen, Zhifei Li:
Learning Concept Prerequisite Relation via Global Knowledge Relation Optimization. 1638-1646 - Chunyu Zhao, Wentao Mu, Xian Zhou, Wenbo Liu, Fei Yan, Tao Deng:
SalM²: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention. 1647-1655 - Shiyi Zheng, Peizhi Zhao, Zhilong Zheng, Peihang He, Haonan Cheng, Yi Cai, Qingbao Huang:
Look Around Before Locating: Considering Content and Structure Information for Visual Grounding. 1656-1664 - Hengde Zhu, Xiangyu Kong, Weicheng Xie, Xin Huang, Xilin He, Lu Liu, Linlin Shen, Wei Zhang, Hatice Gunes, Siyang Song:
PerReactor: Offline Personalised Multiple Appropriate Facial Reaction Generation. 1665-1673 - Jiankun Zhu, Sicheng Zhao, Jing Jiang, Wenbo Tang, Zhaopan Xu, Tingting Han, Pengfei Xu, Hongxun Yao:
Bridge Then Begin Anew: Generating Target-Relevant Intermediate Model for Source-Free Visual Emotion Adaptation. 1674-1682 - Linlin Zhu, Heli Sun, Qunshu Gao, Yuze Liu, Liang He:
Aspect Enhancement and Text Simplification in Multimodal Aspect-Based Sentiment Analysis for Multi-Aspect and Multi-Sentiment Scenarios. 1683-1691 - Yaohui Zhu, Kaiming Sun, Zhengdong Luo, Lingfeng Wang:
Progressive Self-Learning for Domain Adaptation on Symbolic Regression of Integer Sequences. 1692-1699 - Han Yang, Chuanguang Yang, Zhulin An, Libo Huang, Yongjun Xu:
HSRDiff: A Hierarchical Self-Regulation Diffusion Model for Stochastic Semantic Segmentation. 1701-1709 - Yihao, Limei Hu, Feng Chen, Sen Zhao
, Shukai Duan:
GRICP: Granular-Ball Iterative Closest Point with Multikernel Correntropy for Point Cloud Fine Registration. 1710-1718 - Shivang Agarwal, Jyoti Chaudhary, Sadiq Siraj Ebrahim, Mayank Vatsa, Richa Singh, Shyam Prasad Adhikari, Sangeeth Reddy Battu:
AQUAFace: Age-Invariant Quality Adaptive Face Recognition for Unconstrained Selfie vs ID Verification. 1719-1727 - Daechul Ahn, Yura Choi, San Kim, Youngjae Yu, Dongyeop Kang, Jonghyun Choi:
ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO. 1728-1736 - Tim Alpherts, Sennay Ghebreab, Nanne van Noord:
EMPLACE: Self-Supervised Urban Scene Change Detection. 1737-1745 - Jingkun An, Yinghao Zhu, Zongjian Li, Enshen Zhou, Haoran Feng, Xijie Huang, Bohua Chen, Yemin Shi, Chengwei Pan:
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation. 1746-1754 - Xiaoqi An, Lin Zhao, Chen Gong, Jun Li, Jian Yang:
Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation. 1755-1763 - Yajun An, Jiale Chen, Huan Lin, Zhenbing Liu, Siyang Feng, Hualong Zhang, Rushi Lan, Zaiyi Liu, Xipeng Pan:
CA-MLIF: Cross-Attention and Multimodal Low-Rank Interaction Fusion Framework for Tumor Prognostic Prediction. 1764-1772 - Kazi Hasan Ibn Arif, JinYi Yoon, Dimitrios S. Nikolopoulos, Hans Vandierendonck, Deepu John, Bo Ji:
HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models. 1773-1781 - Sithu Aung, Min-Cheol Sagong, Junghyun Cho:
Multi-View Pedestrian Occupancy Prediction with a Novel Synthetic Dataset. 1782-1790 - Hamed Ayoobi, Nico Potyka, Francesca Toni:
ProtoArgNet: Interpretable Image Classification with Super-Prototypes and Argumentation. 1791-1799 - Sana Ayromlou, Vahid Reza Khazaie, Fereshteh Forghani, Arash Afkanpour:
Can Generative Models Improve Self-Supervised Representation Learning? 1800-1808 - Zahra Babaiee, Peyman M. Kiasari, Daniela Rus, Radu Grosu:
The Master Key Filters Hypothesis: Deep Filters Are General. 1809-1816 - Lichen Bai, Zixuan Xiong, Hai Lin, Guangwei Xu, Xiangjin Xie, Ruijie Guo, Zhanhui Kang, Haitao Zheng, Hong-Gee Kim:
Frozen Language Models Are Gradient Coherence Rectifiers in Vision Transformers. 1817-1825 - Jingwei Bao
, Jinhua Hao, Pengcheng Xu, Ming Sun, Chao Zhou, Shuyuan Zhu:
Plug-and-Play Tri-Branch Invertible Block for Image Rescaling. 1826-1834 - Oren Barkan, Yehonatan Elisha, Jonathan Weill, Noam Koenigstein:
BEE: Metric-Adapted Explanations via Baseline Exploration-Exploitation. 1835-1843 - Jian Bi, Qianliang Wu, Jianjun Qian, Lei Luo, Jian Yang:
Dual Manifold Regularization Steered Robust Representation Learning for Point Cloud Analysis. 1844-1852 - Qi Bi, Jingjun Yi, Haolan Zhan, Wei Ji, Gui-Song Xia:
Learning Fine-grained Domain Generalization via Hyperbolic State Space Hallucination. 1853-1861 - Qi Bi, Jingjun Yi, Hao Zheng, Haolan Zhan, Wei Ji, Yawen Huang, Yuexiang Li:
DGFamba: Learning Flow Factorized State Space for Visual Domain Generalization. 1862-1870 - Xiuli Bi, Jian Lu, Bo Liu, Xiaodong Cun, Yong Zhang, Weisheng Li, Bin Xiao:
CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training. 1871-1879 - Yuxuan Bian, Ailing Zeng, Xuan Ju, Xian Liu, Zhaoyang Zhang, Wei Liu, Qiang Xu:
MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls. 1880-1888 - Yuntian Bo, Yazhou Zhu, Lunbo Li, Haofeng Zhang:
FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation. 1889-1897 - Lingling Cai, Kang Zhao, Hangjie Yuan, Yingya Zhang, Shiwei Zhang, Kejie Huang:
FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing. 1898-1906 - Rui Cai, Zhiyu Dong, Jianfeng Dong, Xun Wang:
Dynamic Adapter with Semantics Disentangling for Cross-lingual Cross-modal Retrieval. 1907-1916 - Shuo Cai, Xinzhe Han, Shuhui Wang:
Divide-and-Conquer: Tree-structured Strategy with Answer Distribution Estimator for Goal-Oriented Visual Dialogue. 1917-1925 - Wenxiao Cai, Wankou Yang:
Object-level Geometric Structure Preserving for Natural Image Stitching. 1926-1934 - Cong Cao, Huanjing Yue, Xin Liu, Jingyu Yang:
Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model. 1935-1943 - Qihang Cao, Huangxun Chen:
ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects. 1944-1952 - Yuan Cao, Xiangru Chen, Zifan Liu, Wenzhe Jia, Fanlei Meng, Jie Gui:
Deep Graph Online Hashing for Multi-Label Image Retrieval. 1953-1961 - Angela Castillo, Jonas Kohler, Juan C. Pérez, Juan Pablo Pérez, Albert Pumarola, Bernard Ghanem, Pablo Arbeláez, Ali K. Thabet:
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models. 1962-1970 - Jiazhong Cen, Jiemin Fang, Chen Yang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian:
Segment Any 3D Gaussians. 1971-1979 - Junuk Cha, Mengwei Ren, Krishna Kumar Singh, He Zhang, Yannick Hold-Geoffroy, Seunghyun Yoon, Hyunjoon Jung, Jae Shin Yoon, Seungryul Baek:
Text2Relight: Creative Portrait Relighting with Text Guidance. 1980-1988 - Keng Wei Chang, Zi-Ming Wang, Shang-Hong Lai:
KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences. 1989-1997 - Laibin Chang, Yunke Wang, Longxiang Deng, Bo Du, Chang Xu:
WaterDiffusion: Learning a Prior-involved Unrolling Diffusion for Joint Underwater Saliency Detection and Visual Restoration. 1998-2006 - Qikai Chang, Mingjun Chen, Changpeng Pi, Pengfei Hu, Zhenrong Zhang, Jiefeng Ma, Jun Du, Baocai Yin, Jinshui Hu:
RFL: Simplifying Chemical Structure Recognition with Ring-Free Language. 2007-2015 - Changgu Chen, Junwei Shu, Gaoqi He, Changbo Wang, Yang Li:
Motion-Zero: A Zero-Shot Trajectory Control Framework of Moving Object for Diffusion-Based Video Generation. 2016-2024 - Chao Chen, Yu-Shen Liu, Zhizhong Han:
Sharpening Neural Implicit Functions with Frequency Consolidation Priors. 2025-2033 - Dongpan Chen, Dehui Kong, Jinghua Li, Baocai Yin:
MaskPrompt: Open-Vocabulary Affordance Segmentation with Object Shape Mask Prompts. 2034-2042 - Haipeng Chen, Yuheng Yang, Yingda Lyu:
Skeleton-based Action Recognition with Non-linear Dependency Modeling and Hilbert-Schmidt Independence Criterion. 2043-2051 - Haipeng Chen, Sifan Wu, Zhigang Wang, Yifang Yin, Yingying Jiao, Yingda Lyu, Zhenguang Liu:
Causal-Inspired Multitask Learning for Video-Based Human Pose Estimation. 2052-2060 - Jiahao Chen, Zhou Feng, Rui Zeng, Yuwen Pu, Chunyi Zhou, Yi Jiang, Yuyou Gan, Jinbao Li, Shouling Ji:
Enhancing Adversarial Transferability with Adversarial Weight Tuning. 2061-2069 - Jie Chen, Xinyuan Liu, Xintong Liu, Jianqiang Li:
Adversarial Learning Under Hybrid Perturbations for Robust Acute Lymphoblastic Leukemia Classification. 2070-2078 - Jingyuan Chen, Fuchen Long, Jie An, Zhaofan Qiu, Ting Yao, Jiebo Luo, Tao Mei:
Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion. 2079-2087 - Junyi Chen, Weicai Ye, Yifan Wang, Danpeng Chen, Di Huang, Wanli Ouyang, Guofeng Zhang, Yu Qiao, Tong He:
GigaGS: 3D Gaussian Based Planar Representation for Large-Scene Surface Reconstruction. 2088-2096 - Kang Chen, Yajing Zheng, Tiejun Huang, Zhaofei Yu:
Rethinking High-speed Image Reconstruction Framework with Spike Camera. 2097-2104 - Kehua Chen, Zhenlong Yuan, Tianlu Mao, Zhaoqi Wang:
Dual-Level Precision Edges Guided Multi-View Stereo with Accurate Planarization. 2105-2113 - Lu Chen, Shaofeng Li, Benhao Huang, Fan Yang, Zheng Li, Jie Li, Yuan Luo:
Contrasting Adversarial Perturbations: The Space of Harmless Perturbations. 2114-2122 - Nan Chen, Mengqi Huang, Zhuowei Chen, Yang Zheng, Lei Zhang, Zhendong Mao:
CustomContrast: A Multilevel Contrastive Perspective for Subject-Driven Text-to-Image Customization. 2123-2131 - Qi Chen, Changli Wu, Jiayi Ji, Yiwei Ma, Danni Yang, Xiaoshuai Sun:
IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression Segmentation. 2132-2140 - Qibo Chen, Weizhong Jin, Jianyue Ge, Mengdi Liu, Yuchao Yan, Jian Jiang, Li Yu, Xuanjiang Guo, Shuchang Li, Jianzhong Chen:
CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection. 2141-2149 - Qihua Chen, Yue Ma, Hongfa Wang, Junkun Yuan, Wenzhe Zhao, Qi Tian, Hongmei Wang, Shaobo Min, Qifeng Chen, Wei Liu:
Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation. 2150-2158 - Qirui Chen, Shangzhe Di, Weidi Xie:
Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos. 2159-2167 - Qizhou Chen, Taolin Zhang, Chengyu Wang, Xiaofeng He, Dakan Wang, Tingting Liu:
Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit. 2168-2176 - Sen Chen, Hongying Liu, Chaowei Fang, Fanhua Shang, Yuanyuan Liu, Liang Wan, Dongmei Jiang, Yaowei Wang:
Unsupervised Degradation Representation Aware Transform for Real-World Blind Image Super-Resolution. 2177-2185 - Shengjia Chen
, Luping Ji, Weiwei Duan, Shuang Peng, Mao Ye:
Motion Prior Knowledge Learning with Homogeneous Language Descriptions for Moving Infrared Small Target Detection. 2186-2194 - Shunxin Chen, Ajian Liu, Junze Zheng, Jun Wan, Kailai Peng, Sergio Escalera, Zhen Lei:
Mixture-of-Attack-Experts with Class Regularization for Unified Physical-Digital Face Attack Detection. 2195-2203 - Sijia Chen, En Yu, Wenbing Tao:
Cross-View Referring Multi-Object Tracking. 2204-2211 - Siran Chen, Yuxiao Luo, Yue Ma, Yu Qiao, Yali Wang:
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving. 2212-2220 - Wei Chen, Jianwei Niu, Xuefeng Liu, Zhendong Wang, Shaojie Tang, Guogang Zhu:
DiffDVC: Accurate Event Detection for Dense Video Captioning via Diffusion Models. 2221-2229 - Xiao Chen, Xudong Jiang, Yunkang Tao, Zhen Lei, Qing Li, Chenyang Lei, Zhaoxiang Zhang:
FIRM: Flexible Interactive Reflection ReMoval. 2230-2238 - Xin Chen, Ben Kang, Wanting Geng, Jiawen Zhu, Yi Liu, Dong Wang, Huchuan Lu:
SUTrack: Towards Simple and Unified Single Object Tracking. 2239-2247 - Xingchi Chen, Zhuoran Zheng, Xuerui Li, Yuying Chen, Shu Wang, Wenqi Ren:
Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning. 2248-2255 - Xinyue Chen, Miaojing Shi, Zijian Zhou, Lianghua He, Sophia Tsoka:
Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer. 2256-2265 - Xiongren Chen, Jiuyong Li, Jixue Liu, Lin Liu, Stefan Peters, Thuc Duy Le, Wentao Gao, Xiaojing Du, Anthony Walsh:
Diffusion Models for Attribution. 2266-2274 - Xuesong Chen, Shaoshuai Shi, Tao Ma, Jingqiu Zhou, Simon See, Ka Chun Cheung, Hongsheng Li:
M3Net: Multimodal Multi-task Learning for 3D Detection, Segmentation, and Occupancy Prediction in Autonomous Driving. 2275-2283 - Yi Chen, Muyoung Son, Chuanbo Hua, Joo-Young Kim:
AoP-SAM: Automation of Prompts for Efficient Segmentation. 2284-2292 - Yi Chen, Jian Xu, Xu-Yao Zhang, Wen-Zhuo Liu, Yang-Yang Liu, Cheng-Lin Liu:
Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information. 2293-2301 - Yiliang Chen, Steven SC Ho, Cheng Xu, Yao Jie Xie, Wing-Fai Yeung, Shengfeng He, Jing Qin:
Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis. 2302-2310 - Yirui Chen, Xudong Huang, Quan Zhang, Wei Li, Mingjian Zhu, Qiangyu Yan, Simiao Li, Hanting Chen, Hailin Hu, Jie Yang, Wei Liu, Jie Hu:
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization. 2311-2319 - Yitong Chen, Wenhao Yao
, Lingchen Meng, Sihong Wu, Zuxuan Wu, Yu-Gang Jiang:
Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection. 2320-2328 - Yuchong Chen, Jian Yu
, Shaoyan Gai, Zeyu Cai, Feipeng Da:
3D Measurement of Complex Textured Objects Based on Bidirectional Fringe Projection. 2329-2338 - Yujia Chen, Rui Sun, Wangkai Li, Huayu Mai, Naisong Luo, Yuwen Pan, Tianzhu Zhang:
Alleviate and Mining: Rethinking Unsupervised Domain Adaptation for Mitochondria Segmentation from Pseudo-Label Perspective. 2339-2347 - Yuying Chen, Mingde Yao, Wenbo Li, Renjing Pei, Jinjing Zhao, Wenqi Ren:
Unsupervised Diffusion-Based Degradation Modeling for Real-World Super-Resolution. 2348-2356
Technical Tracks 3
- Zehao Chen, Rong Pan:
SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers. 2358-2366 - Zehao Chen, Zhan Lu, De Ma, Huajin Tang, Xudong Jiang, Qian Zheng, Gang Pan:
EvHDR-GS: Event-guided HDR Video Reconstruction with 3D Gaussian Splatting. 2367-2375 - Zehao Chen, Zhanfeng Liao, De Ma, Huajin Tang, Qian Zheng, Gang Pan:
EvHDR-NeRF: Building High Dynamic Range Radiance Fields with Single Exposure Images and Events. 2376-2384 - Zheng Chen, Yu Zeng, Zehui Chen, Hongzhi Gao, Lin Chen, Jiaming Liu, Feng Zhao:
VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping. 2385-2393 - Zhipeng Chen
, Lan Yang, Yonggang Qi, Honggang Zhang, Kaiyue Pang, Ke Li, Yi-Zhe Song:
VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis. 2394-2402 - Zhiyuan Chen, Jiajiong Cao, Zhiquan Chen, Yuming Li, Chenguang Ma:
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditions. 2403-2410 - Zikang Chen, Tao Jiang, Xiaowan Hu, Wang Zhang, Huaqiu Li, Haoqian Wang:
Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising. 2411-2419 - Zining Chen, Xingshuang Luo, Weiqiu Wang, Zhicheng Zhao, Fei Su, Aidong Men:
Filter or Compensate: Towards Invariant Representation from Distribution Shift for Anomaly Detection. 2420-2428 - Ziyang Chen, Yiwen Ye, Yongsheng Pan, Yong Xia:
Gradient Alignment Improves Test-Time Adaptation for Medical Image Segmentation. 2429-2437 - Jiaxiang Cheng, Pan Xie, Xin Xia, Jiashi Li, Jie Wu, Yuxi Ren, Huixia Li, Xuefeng Xiao, Shilei Wen, Lean Fu:
ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models. 2438-2446 - Junfeng Cheng, Yingkai Yang, Tania Stathaki:
3DPGS: 3D Probabilistic Graph Search for Archaeological Piece Grouping. 2447-2454 - Kun Cheng, Lei Yu, Zhijun Tu, Xiao He, Liyu Chen, Yong Guo, Mingrui Zhu, Nannan Wang, Xinbo Gao, Jie Hu:
Effective Diffusion Transformer Architecture for Image Super-Resolution. 2455-2463 - Yongkang Cheng, Shaoli Huang, Xuelin Chen, Jifeng Ning, Mingming Gong:
DIDiffGes: Decoupled Semi-Implicit Diffusion Models for Real-time Gesture Generation from Speech. 2464-2472 - Yutao Cheng, Zhao Zhang, Maoke Yang, Hui Nie, Chunyuan Li, Xinglong Wu, Jie Shao:
Graphic Design with Large Multimodal Model. 2473-2481 - Zesen Cheng, Kehan Li, Hao Li, Peng Jin, Xiawu Zheng, Chang Liu, Jie Chen:
Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation. 2482-2490 - Zhixin Cheng, Jiacheng Deng, Xinjun Li, Baoqun Yin, Tianzhu Zhang:
Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment. 2491-2499 - Cheol-Ho Cho, WonJun Moon, Woojin Jun, Minseok Jung, Jae-Pil Heo:
Ambiguity-Restrained Text-Video Representation Learning for Partially Relevant Video Retrieval. 2500-2508 - Kyusik Cho, Dong Yeop Kim, Euntai Kim:
Zero-Shot Scene Change Detection. 2509-2517 - Seungju Cho, Hongsin Lee, Changick Kim:
Enhancing Robustness in Incremental Learning with Adversarial Training. 2518-2526 - Suhwan Cho, Seoung Wug Oh, Sangyoun Lee, Joon-Young Lee:
Elevating Flow-Guided Video Inpainting with Reference Generation. 2527-2535 - Dasol Choi, Dongbin Na:
Distribution-Level Feature Distancing for Machine Unlearning: Towards a Better Trade-off Between Model Utility and Forgetting. 2536-2544 - Sooyoung Choi, Sungyong Park, Heewon Kim:
SIDL: A Real-World Dataset for Restoring Smartphone Images with Dirty Lenses. 2545-2554 - Wonhyeok Choi, Kyumin Hwang, Minwoo Choi, Kiljoon Han, Wonjoon Choi, Mingyu Shin, Sunghoon Im:
Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces. 2555-2563 - Yongjin Choi, Chanhun Park, Seung Jun Baek:
DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis. 2564-2572 - Jisheng Chu, Wenrui Li, Xingtao Wang, Kanglin Ning, Yidan Lu, Xiaopeng Fan:
Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion. 2573-2581 - Chaeyeon Chung, Sunghyun Park, Jeongho Kim, Jaegul Choo:
What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer. 2582-2590 - Jiwan Chung, Seungwon Lim, Sangkyu Lee, Youngjae Yu:
MASS: Overcoming Language Bias in Image-Text Matching. 2591-2599 - Antonio Emanuele Cinà, Jérôme Rony, Maura Pintor
, Luca Demetrio, Ambra Demontis, Battista Biggio, Ismail Ben Ayed, Fabio Roli:
AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples. 2600-2608 - Yubo Cui, Zhiheng Li, Jiaqiang Wang, Zheng Fang:
LOMA: Language-assisted Semantic Occupancy Network via Triplane Mamba. 2609-2617 - Ming Dai, Jian Li, Jiedong Zhuang, Xian Zhang, Wankou Yang:
Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints. 2618-2626 - Tao Dai, Yang Lin, Hang Guo, Jinbao Wang, Zexuan Zhu:
DCSF-KD: Dynamic Channel-wise Spatial Feature Knowledge Distillation for Object Detection. 2627-2635 - Tao Dai, Yanzi Wang, Jianyu Xiong, Yaohua Zha, Shu-Tao Xia, Zexuan Zhu:
GCD-Sampling: A General Cross-scale Decoupled Sampling for Point Cloud. 2636-2644 - Yuqin Dai, Wanlu Zhu, Ronghui Li, Zeping Ren, Xiangzheng Zhou, Jixuan Ying, Jun Li, Jian Yang:
Harmonious Music-driven Group Choreography with Trajectory-Controllable Diffusion. 2645-2653 - Quan Dao, Hao Phung, Trung Tuan Dao, Dimitris N. Metaxas, Anh Tuan Tran:
Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Image Generation. 2654-2662 - Shristi Das Biswas, Matthew Shreve, Xuelu Li, Prateek Singhal, Kaushik Roy:
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery. 2663-2671 - Gabriel della Maggiora, Luis Alberto Croquevielle, Harry Horsley, Thomas Heinis, Artur Yakimovich:
Single Exposure Quantitative Phase Imaging with a Conventional Microscope Using Diffusion Models. 2672-2680 - Hui Deng, Jiawei Shi, Zhen Qin, Yiran Zhong, Yuchao Dai:
Deep Non-Rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling. 2681-2689 - Jiacheng Deng, Jiahao Lu, Zhixin Cheng, Wenfei Yang:
DiffCorr: Conditional Diffusion Model with Reliable Pseudo-Label Guidance for Unsupervised Point Cloud Shape Correspondence. 2690-2698 - Jiacheng Deng, Jiahao Lu:
Adaptive Siamese Masked Autoencoder with Global Optimization for Unsupervised Point Cloud Shape Correspondence. 2699-2707 - Shangqi Deng, Jun Ma, Liang-Jian Deng, Ping Wei:
OTIAS: OcTree Implicit Adaptive Sampling for Multispectral and Hyperspectral Image Fusion. 2708-2716 - Xiongwen Deng, Haoyu Tang, Han Jiang, Qinghai Zheng, Jihua Zhu:
Boundary-Aware Temporal Dynamic Pseudo-Supervision Pairs Generation for Zero-Shot Natural Language Video Localization. 2717-2725 - Yuhui Deng, Yuqin Lu, Yangyang Xu, Yongwei Nie, Shengfeng He:
Occlusion-Insensitive Talking Head Video Generation via Facelet Compensation. 2726-2734 - Bonan Ding, Jin Xie, Jing Nie, Jiale Cao:
SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection. 2735-2743 - Guanqi Ding, Chengyu Yang, Shuhui Wang, Xincheng Li, Jinzhe Zhang, Xin Jin, Qingming Huang:
Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models. 2744-2752 - Yanbo Ding, Shaobin Zhuang, Kunchang Li, Zhengrong Yue, Yu Qiao, Yali Wang:
Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration. 2753-2761 - Ziheng Ding, Xiaze Zhang, Qi Jing, Ying Cheng, Rui Feng:
AS-Det: Active Sampling for Adaptive 3D Object Detection in Point Clouds. 2762-2770 - Chenghu Du, Junyin Wang, Yi Rong, Feng Yu, Shengwu Xiong:
GarFast: Realistic and Fast Garment Transfer with a Simplified Parser-Free Approach. 2771-2779 - Chenghu Du, Junyin Wang, Feng Yu, Shengwu Xiong:
Latent Diffusion-Enhanced Virtual Try-On via Optimized Pseudo-Label Generation. 2780-2788 - Keyu Du, Hao Xu, Haipeng Li, Hong Qu, Chi-Wing Fu, Shuaicheng Liu:
HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions. 2789-2797 - Yongkun Du, Zhineng Chen, Caiyan Jia, Xieping Gao, Yu-Gang Jiang:
Out of Length Text Recognition with Sub-String Matching. 2798-2806 - Chen Duan, Qianyi Jiang, Pei Fu, Jiamin Chen, Shengxi Li, Zining Wang, Shan Guo, Junfeng Luo:
InstructOCR: Instruction Boosting Scene Text Spotting. 2807-2815 - Zheng-Peng Duan, Jiawei Zhang, Siyu Liu, Zheng Lin, Chun-Le Guo, Dongqing Zou, Jimmy S. J. Ren, Chongyi Li:
A Diffusion-Based Framework for Occluded Object Movement. 2816-2824 - Zheng-Peng Duan, Jiawei Zhang, Zheng Lin, Xin Jin, Xundong Wang, Dongqing Zou, Chun-Le Guo, Chongyi Li:
DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts. 2825-2833 - Guodong Fan, Zishu Yao, Guang-Yong Chen, Jian-Nan Su, Min Gan:
IniRetinex: Rethinking Retinex-type Low-Light Image Enhancer via Initialization Perspective. 2834-2842 - Haozhi Fan, Yuan Cao:
Vision-guided Text Mining for Unsupervised Cross-modal Hashing with Community Similarity Quantization. 2843-2851 - Junkai Fan, Kun Wang, Zhiqiang Yan, Xiang Chen, Shangbing Gao, Jun Li, Jian Yang:
Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video. 2852-2860 - Rui Fan, Weidong Hao, Juntao Guan, Lai Rui, Lin Gu, Tong Wu, Fanhong Zeng, Zhangming Zhu:
EventPillars: Pillar-based Efficient Representations for Event Data. 2861-2869 - Wenxiao Fan, Kan Li:
Combating Semantic Contamination in Learning with Label Noise. 2870-2878 - Zhen Fan, Peng Dai, Zhuo Su, Xu Gao, Zheng Lv, Jiarui Zhang, Tianyuan Du, Guidong Wang, Yang Zhang:
EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs. 2879-2887 - Han Fang, Kejiang Chen, Zijin Yang, Bosen Cui, Weiming Zhang, Ee-Chien Chang:
CoSDA: Enhancing the Robustness of Inversion-based Generative Image Watermarking Framework. 2888-2896 - Shijie Fang, Hongping Gan:
SSUN-Net: Spatial-Spectral Prior-Aware Unfolding Network for Pan-Sharpening. 2897-2905 - Wenxuan Fang, Junkai Fan, Yu Zheng, Jiangwei Weng, Ying Tai, Jun Li:
Guided Real Image Dehazing Using YCbCr Color Space. 2906-2914 - Xiang Fang, Wanlong Fang, Changshuo Wang, Daizong Liu, Keke Tang, Jianfeng Dong, Pan Zhou, Beibei Li:
Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network. 2915-2923 - Chaoran Feng, Wangbo Yu, Xinhua Cheng, Zhenyu Tang, Junwu Zhang, Li Yuan, Yonghong Tian:
AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scenes. 2924-2932 - Chen Feng, Ziquan Liu, Zhuo Zhi, Ilija Bogunovic, Carsten Gerner-Beuerle, Miguel Rodrigues:
PROSAC: Provably Safe Certification for Machine Learning Models under Adversarial Attacks. 2933-2941 - Chun-Mei Feng, Yang Bai, Tao Luo, Zhen Li, Salman H. Khan, Wangmeng Zuo, Rick Siow Mong Goh, Yong Liu:
VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering. 2942-2950 - Dong Feng
, Ping Guo, Encheng Peng, Mingmin Zhu, Wenhao Yu, Peng Wang:
PoseLLaVA: Pose Centric Multimodal LLM for Fine-Grained 3D Pose Manipulation. 2951-2959 - Haoxuan Feng, Haohui Zhou, Tian Ye, Sixiang Chen, Lei Zhu:
Residual Diffusion Deblurring Model for Single Image Defocus Deblurring. 2960-2968 - Kunyu Feng, Yue Ma, Bingyuan Wang, Chenyang Qi, Haozhe Chen, Qifeng Chen, Zeyu Wang:
DiT4Edit: Diffusion Transformer for Image Editing. 2969-2977 - Mingtao Feng, Fenghao Tian, Jianqiao Luo, Zijie Wu, Weisheng Dong, Yaonan Wang, Ajmal Saeed Mian:
Semantic Ambiguity Modeling and Propagation for Fine-Grained Visual Cross View Geo-Localization. 2978-2986 - Siyang Feng, Huadeng Wang, Chu Han, Zhenbing Liu, Hualong Zhang, Rushi Lan, Xipeng Pan:
Weakly Supervised Gland Segmentation with Class Semantic Consistency and Purified Labels Filtration. 2987-2995 - Tonghui Feng, Chunsheng Yan, Qianru Wang, Jiangtao Cui, Xiaotian Qiao:
HDLayout: Hierarchical and Directional Layout Planning for Arbitrary Shaped Visual Text Generation. 2996-3003 - Yi Feng, Yu Han, Xijing Zhang, Tanghui Li, Yanting Zhang, Rui Fan:
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction. 3004-3012 - Zhida Feng, Li Chen, Yuenan Sun, Jiaxiang Liu, Shikun Feng:
Simplifying Control Mechanism in Text-to-Image Diffusion Models. 3013-3021 - Chenlin Fu, Yingying Zhu:
BGHR: Bridging the Gap Between HBox-Supervised and RBox-Supervised Oriented Object Detection via Adaptive Fine-Grained Sample Mining. 3022-3030 - Teng Fu, Haiyang Yu, Ke Niu, Bin Li, Xiangyang Xue:
Foundation Model Driven Appearance Extraction for Robust Multiple Object Tracking. 3031-3039 - Xinghe Fu, Zhiyuan Yan, Taiping Yao, Shen Chen, Xi Li:
Exploring Unbiased Deepfake Detection via Token-Level Shuffling and Mixing. 3040-3048 - Keke Gai, Dongjue Wang, Jing Yu, Mohan Wang, Liehuang Zhu, Qi Wu:
MFL-Owner: Ownership Protection for Multi-modal Federated Learning via Orthogonal Transform Watermark. 3049-3058 - Lianqiang Gan, Junyu Lai, Jingze Ju, Lianli Gao, Yi Bin:
DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction. 3059-3067 - Ge Gao, Ho Man Kwan, Fan Zhang, David Bull:
PNVC: Towards Practical INR-based Video Compression. 3068-3076 - Jun Gao, Qian Qiao, Tianxiang Wu, Zili Wang, Ziqiang Cao, Wenjie Li:
AIM: Let Any Multimodal Large Language Models Embrace Efficient In-Context Learning. 3077-3085 - Mingze Gao, Jingyu Liu, Mingda Li, Jiangtao Xie, Qingbin Liu, Kevin Zhao, Xi Chen, Hui Xiong:
TC-LLaVA: Rethinking the Transfer of LLava from Image to Video Understanding with Temporal Considerations. 3086-3094 - Xianqiang Gao, Pingrui Zhang, Delin Qu, Dong Wang, Zhigang Wang, Yan Ding, Bin Zhao:
Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding. 3095-3103 - Chengjie Ge, Xueyang Fu, Peng He, Kunyu Wang, Chengzhi Cao, Zheng-Jun Zha:
EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction. 3104-3112 - Shiping Ge, Qiang Chen, Zhiwei Jiang, Yafeng Yin, Liu Qin, Ziyao Chen, Qing Gu:
Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning. 3113-3121 - Xinyu Geng, Jiaming Wang, Xiaolin Huang, Fanglin Chen, Jun Xu:
ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis. 3122-3130 - Zichen Geng, Zeeshan Hayder, Wei Liu, Ajmal Saeed Mian:
Auto-Regressive Diffusion for Generating 3D Human-Object Interactions. 3131-3139 - Haifan Gong, Yu Lu, Xiang Wan, Haofeng Li:
Domain Generalized Medical Landmark Detection via Robust Boundary-Aware Pre-Training. 3140-3148 - Tao Gong, Qi Chu, Bin Liu, Nenghai Yu:
Rethinking Masked Data Reconstruction Pretraining for Strong 3D Action Representation Learning. 3149-3157 - Jiaxiang Gou, Luping Ji, Pei Liu, Mao Ye:
Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification. 3158-3166 - Anna Grim, Jayaram Chandrashekar, Uygar Sümbül:
Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function. 3167-3175 - Shengbo Gu, Yu-Kun Qiu, Yu-Ming Tang, Ancong Wu, Weishi Zheng:
MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning. 3176-3184 - Zijian Gu, Jianwei Ma, Yan Huang, Honghao Wei, Zhanye Chen, Hui Zhang, Wei Hong:
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection. 3185-3193 - Xianchao Guan, Yifeng Wang, Ye Zhang, Zheng Zhang, Yongbing Zhang:
OT-StainNet: Optimal Transport Driven Semantic Matching for Weakly Paired H&E-to-IHC Stain Transfer. 3194-3202 - Ming Gui, Johannes Schusterbauer, Ulrich Prestel, Pingchuan Ma, Dmytro Kotovenko, Olga Grebenkova, Stefan Andreas Baumann, Vincent Tao Hu, Björn Ommer:
DepthFM: Fast Generative Monocular Depth Estimation with Flow Matching. 3203-3211 - Chuchen Guo, Weijie Zhou, Zheng Liu, Ying He:
You Should Learn to Stop Denoising on Point Clouds in Advance. 3212-3219 - Diandian Guo, Weixin Si, Zhixi Li, Jialun Pei, Pheng-Ann Heng:
Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resection with Pringle Maneuver. 3220-3228 - Haipeng Guo, Huanyu Liu, Jiazheng Wen, Junbao Li:
Cross-Spectral Gaussian Splatting with Spatial Occupancy Consistency. 3229-3237 - Haojie Guo, Junyu Gao, Yuan Yuan:
Enhancing Low-Rank Adaptation with Recoverability-Based Reinforcement Pruning for Object Counting. 3238-3246 - Heng Guo, Jianfeng Zhang, Jiaxing Huang, Tony C. W. Mok, Dazhou Guo, Ke Yan, Le Lu, Dakai Jin, Minfeng Xu:
Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model Using 3D Whole-Body CT Scans. 3247-3256 - Jialong Guo, Ke Liu, Jiangchao Yao, Zhihua Wang, Jiajun Bu, Haishuai Wang:
MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance. 3257-3265 - Kun Guo, Qiang Ling:
PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts. 3266-3274 - Pinxue Guo, Hao Huang, Peiyang He, Xuefeng Liu, Tianjun Xiao, Wenqiang Zhang:
OpenVIS: Open-vocabulary Video Instance Segmentation. 3275-3283 - Puyuan Guo, Tuo Hao, Wenxin Fu, Yingming Gao, Ya Li:
Controllable 3D Dance Generation Using Diffusion-Based Transformer U-Net. 3284-3292 - Yijia Guo, Liwen Hu, Yuanxi Bai, Jiawei Yao, Lei Ma, Tiejun Huang:
SpikeGS: Reconstruct 3D Scene Captured by a Fast-Moving Bio-Inspired Camera. 3293-3301 - Yongxin Guo, Jingyu Liu, Mingda Li, Dingxin Cheng, Xiaoying Tang, Dianbo Sui, Qingbin Liu, Xi Chen, Kevin Zhao:
VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding. 3302-3310 - Ameer Hamza, Abdullah, Yong Hyun Ahn, Sungyoung Lee, Seong Tae Kim:
LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies. 3311-3319 - Feng Han, Kai Chen, Chao Gong, Zhipeng Wei, Jingjing Chen, Yu-Gang Jiang:
DuMo: Dual Encoder Modulation Network for Precise Concept Erasure. 3320-3328 - Huasong Han, Kaixuan Zhou, Xiaoxiao Long, Yusen Wang, Chunxia Xiao:
GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving. 3329-3337 - Jumin Han, Jun-Hee Kim, Seong-Whan Lee:
ProPose: Probabilistic 3D Human Pose Estimation with Instance-Level Distribution and Normalizing Flow. 3338-3346 - Wencheng Han, Dongqian Guo, Cheng-Zhong Xu, Jianbing Shen:
DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving. 3347-3355 - Xumeng Han, Longhui Wei, Xuehui Yu, Zhiyang Dou, Xin He, Kuiran Wang, Yingfei Sun, Zhenjun Han, Qi Tian:
Boosting Segment Anything Model Towards Open-Vocabulary Learning. 3356-3365 - Yushan Han, Hui Zhang, Honglei Zhang, Jing Wang, Yidong Li:
CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework. 3366-3373 - Zihao Han, Baoquan Zhang, Lisai Zhang, Shanshan Feng, Kenghong Lin, Guotao Liang, Yunming Ye, Joeq, Kola Ye:
AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting. 3374-3382 - Jinkun Hao, Junshu Tang, Jiangning Zhang, Ran Yi, Yijia Hong, Moran Li, Weijian Cao, Yating Wang, Chengjie Wang, Lizhuang Ma:
ID-Sculpt: ID-aware 3D Head Generation from Single In-the-wild Portrait Image. 3383-3391 - Gang He, Guancheng Quan, Chang Wu, Shihao Wang, Dajiang Zhou, Yunsong Li:
Multi-Frame Deformable Look-Up Table for Compressed Video Quality Enhancement. 3392-3400 - Hangzhou He, Lei Zhu, Xinliang Zhang, Shuang Zeng, Qian Chen, Yanye Lu:
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer. 3401-3409 - Qingdong He, Jiangning Zhang, Jinlong Peng, Haoyang He, Xiangtai Li, Yabiao Wang, Chengjie Wang:
PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning. 3410-3418 - Ruian He, Ri Cheng, Xinkai Lyu, Weimin Tan, Bo Yan:
Efficient Online Training for Zero-Shot Time-Lapse Microscopy Denoising and Super-Resolution. 3419-3427 - Xiankang He, Guangkai Xu, Bo Zhang, Hao Chen, Ying Cui, Dongyan Guo:
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation. 3428-3436 - Xu He, Zhiyong Wu, Xiaoyu Li, Di Kang, Chaopeng Zhang, Jiangnan Ye, Liyang Chen, Xiangjun Gao, Han Zhang, Haolin Zhuang:
MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement. 3437-3445 - Yina He, Lei Peng, Yongcun Zhang, Juanjuan Weng, Shaozi Li, Zhiming Luo:
Long-Tailed Out-of-Distribution Detection: Prioritizing Attention to Tail. 3446-3454 - Yulin He, Wei Chen, Siqi Wang, Tianci Xun, Yusong Tan:
Achieving Speed-Accuracy Balance in Vision-based 3D Occupancy Prediction via Geometric-Semantic Disentanglement. 3455-3463 - Yuwen He, Wei Wang, Wanyu Wu, Kui Jiang:
Disentangle Nighttime Lens Flares: Self-supervised Generation-based Lens Flare Removal. 3464-3472
Technical Tracks 4
- Zihao He, Shengchuan Zhang, Runze Hu, Yunhang Shen, Yan Zhang:
BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution. 3474-3482 - Miran Heo, Seoung Wug Oh, Seon Joo Kim, Joon-Young Lee:
Robust and Consistent Online Video Instance Segmentation via Instance Mask Propagation. 3483-3490 - Cuong Manh Hoang, Yeejin Lee, Byeongkeun Kang:
Generalized Class Discovery in Instance Segmentation. 3491-3499 - Yan Hong, Jianming Feng, Haoxing Chen, Jun Lan, Huijia Zhu, Weiqiang Wang, Jianfu Zhang:
WildFake: A Large-Scale and Hierarchical Dataset for AI-Generated Images Detection. 3500-3508 - Jie Hou, Jianghong Ma, Xiangyu Mu, Haijun Zhang, Zhao Zhang:
FashionTailor: Controllable Clothing Editing for Human Images with Appearance Preserving. 3509-3517 - Shiyu Hou, Tianfei Zhou, Shuai Zhang, Ye Yuan, Guoren Wang:
Prompt Tuning In a Compact Attribute Space. 3518-3526 - Wenjin Hou, Dingjie Fu, Kun Li, Shiming Chen, Hehe Fan, Yi Yang:
ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning. 3527-3535 - Xiaolu Hou, Mingcheng Li, Dingkang Yang, Jiawei Chen, Ziyun Qian, Xiao Zhao, Yue Jiang, Jinjie Wei, Qingyao Xu, Lihua Zhang:
BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation. 3536-3544 - Teng-Fang Hsiao, Bo-Kai Ruan, Hong-Han Shuai:
Training-and-Prompt-Free General Painterly Harmonization via Zero-Shot Disentenglement on Style and Content References. 3545-3553 - Jintong Hu, Bin Xia, Bin Chen, Wenming Yang, Lei Zhang:
GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution. 3554-3562 - Qiang Hu, Houqiang Zhong
, Zihan Zheng, Xiaoyun Zhang, Zhengxue Cheng, Li Song, Guangtao Zhai, Yanfeng Wang:
VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression. 3563-3571 - Qiang Hu, Zhenyu Yi, Ying Zhou, Fan Huang, Mei Liu, Qiang Li, Zhiwei Wang:
MonoBox: Tightness-Free Box-Supervised Polyp Segmentation Using Monotonicity Constraint. 3572-3580 - Xiantao Hu, Ying Tai, Xu Zhao, Chen Zhao, Zhenyu Zhang, Jun Li, Bineng Zhong, Jian Yang:
Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking. 3581-3589 - Xiao Hu, Libo Long, Jochen Lang:
Motion Decoupled 3D Gaussian Splatting for Dynamic Object Representation. 3590-3598 - Hang Hua, Yunlong Tang, Chenliang Xu, Jiebo Luo:
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning. 3599-3607 - Bin Huang, Xin Wang, Hong Chen, Houlun Chen, Yaofei Wu, Wenwu Zhu:
Identity-Text Video Corpus Grounding. 3608-3616 - Binyuan Huang, Yuqing Wen, Yucheng Zhao, Yaosi Hu, Yingfei Liu, Fan Jia, Weixin Mao, Tiancai Wang, Chi Zhang, Chang Wen Chen, Zhenzhong Chen, Xiangyu Zhang:
SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control. 3617-3625 - Chihan Huang, Xiaobo Shen:
HUANG: A Robust Diffusion Model-based Targeted Adversarial Attack Against Deep Hashing Retrieval. 3626-3634 - Dongshuo Huang, Xiaoshui Huang, Chengdong Zhang, Yilei Shi:
LPCG: A Self-conditional Architecture for Labeled Point Cloud Generation. 3635-3643 - Han Huang, Yulun Wu, Chao Deng, Ge Gao, Ming Gu, Yu-Shen Liu:
FatesGS: Fast and Accurate Sparse-View Surface Reconstruction Using Gaussian Splatting with Depth-Feature Consistency. 3644-3652 - Jiaqi Huang, Zunnan Xu, Ting Liu, Yong Liu, Haonan Han, Kehong Yuan, Xiu Li:
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation. 3653-3661 - Jie Huang, Rui Huang, Jinghao Xu, Siran Peng, Yule Duan, Liang-Jian Deng:
Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening. 3662-3670 - Lifeng Huang, Tian Su, Chengying Gao, Ning Liu, Qiong Huang:
AUTE: Peer-Alignment and Self-Unlearning Boost Adversarial Robustness for Training Ensemble Models. 3671-3679 - Muye Huang, Han Lai, Xinyu Zhang, Wenjun Wu, Jie Ma, Lingling Zhang, Jun Liu:
EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart Understanding. 3680-3688 - Muye Huang, Lingling Zhang, Han Lai, Wenjun Wu, Xinyu Zhang, Jun Liu:
VProChart: Answering Chart Question Through Visual Perception Alignment Agent and Programmatic Solution Reasoning. 3689-3696 - Pei-Kai Huang, Jun-Xiong Chong, Cheng-Hsuan Chiang, Tzu-Hsien Chen, Tyng-Luh Liu, Chiou-Ting Hsu:
SLIP: Spoof-Aware One-Class Face Anti-Spoofing with Language Image Pretraining. 3697-3706 - Qihan Huang, Siming Fu, Jinlong Liu, Hao Jiang, Yipeng Yu, Jie Song:
Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation. 3707-3714 - Shaofei Huang, Rui Ling, Hongyu Li, Tianrui Hui, Zongheng Tang, Xiaoming Wei, Jizhong Han, Si Liu:
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation. 3715-3723 - Shiqi Huang, Shuting He, Bihan Wen:
ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation. 3724-3732 - Tianyu Huang, Haoze Zhang, Yihan Zeng, Zhilu Zhang, Hui Li, Wangmeng Zuo, Rynson W. H. Lau:
DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors. 3733-3741 - Tingxuan Huang, Jiacheng Miao, Shizhuo Deng, Tong Jia, Dongyue Chen:
Efficient Indoor Depth Completion Network Using Mask-adaptive Gated Convolution. 3742-3750 - Wenbo Huang, Jinghui Zhang, Guang Li, Lei Zhang, Shuoyuan Wang, Fang Dong, Jiahui Jin, Takahiro Ogawa, Miki Haseyama:
Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence. 3751-3759 - Xiang Huang, Qing Zhang, Jian-Fang Hu, Wei-Shi Zheng:
CLIP-RestoreX: Restore Image Structure and Perception in Exposure Correction. 3760-3768 - Xiaofei Huang, Wenting Chen, Jie Liu, Qisheng Lu, Xiaoling Luo, Linlin Shen:
DAMPER: A Dual-Stage Medical Report Generation Framework with Coarse-Grained MeSH Alignment and Fine-Grained Hypergraph Matching. 3769-3778 - Xiaoshuang Huang, Lingdong Shen, Jia Liu, Fangxin Shang, Hongxiang Li, Haifeng Huang, Yehui Yang:
Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine. 3779-3787 - Xiaoshui Huang, Zhou Huang, Yifan Zuo, Yongshun Gong, Chengdong Zhang, Deyang Liu, Yuming Fang:
PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration. 3788-3796 - Xijie Huang, Xinyuan Wang, Hantao Zhang, Yinghao Zhu, Jiawen Xi, Jingkun An, Hao Wang, Hao Liang, Chengwei Pan:
Medical MLLM Is Vulnerable: Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models. 3797-3805 - Xun Huang, Ziyu Xu, Hai Wu, Jinlong Wang, Qiming Xia, Yan Xia, Jonathan Li, Kyle Gao, Chenglu Wen, Cheng Wang:
L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection. 3806-3814 - Yan Huang, Xiaoshan Liao, Jinxiu Liang, Yuhui Quan, Boxin Shi, Yong Xu:
Zero-Shot Low-Light Image Enhancement via Latent Diffusion Models. 3815-3823 - Yanglin Huang
, Kai Hu, Yuan Zhang, Zhineng Chen, Xieping Gao:
Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation. 3824-3832 - Yongle Huang, Haodong Chen, Zhenbang Xu, Zihan Jia, Haozhou Sun, Dian Shao:
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization. 3833-3841 - Yunlong Huang, Junshuo Liu, Ke Xian, Robert Caiming Qiu:
PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model. 3842-3850 - Jiayu Huo, Xi Ouyang, Sébastien Ourselin, Rachel Sparks:
Generative Medical Segmentation. 3851-3859 - Yixiong Huo, Guangfeng Jiang, Hongyang Wei, Ji Liu, Song Zhang, Han Liu, Xingliang Huang, Mingjie Lu, Jinzhang Peng, Dong Li, Lu Tian, Emad Barsoum:
EGSRAL: An Enhanced 3D Gaussian Splatting Based Renderer with Automated Labeling for Large-Scale Driving Scene. 3860-3867 - Junhwa Hur, Charles Herrmann, Saurabh Saxena, Janne Kontkanen, Wei-Sheng Lai, Yichang Shih, Michael Rubinstein, David J. Fleet, Deqing Sun:
High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion. 3868-3876 - Hyoseok Lee, Kyeong Seon Kim, Byung-Ki Kwon, Tae-Hyun Oh:
Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior. 3877-3885 - Muhammet Furkan Ilaslan, Ali Köksal
, Kevin Qinghong Lin, Burak Satar, Mike Zheng Shou, Qianli Xu:
VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting. 3886-3894 - Elkhan Ismayilzada, MD Khalequzzaman Chowdhury Sayem, Yihalem Yimolal Tiruneh, Mubarrat Tajoar Chowdhury, Muhammadjon Boboev, Seungryul Baek:
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects. 3895-3903 - Alexander Jaus, Constantin Marc Seibold, Simon Reiß, Zdravko Marinov, Keyi Li, Zeling Ye, Stefan Krieg, Jens Kleesiek, Rainer Stiefelhagen:
Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks. 3904-3912 - Yuxiang Ji, Boyong He, Zhuoyue Tan, Liaoni Wu:
Game4Loc: A UAV Geo-Localization Benchmark from Game Data. 3913-3921 - Yuzhou Ji, He Zhu, Junshu Tang, Wuyi Liu, Zhizhong Zhang, Xin Tan, Yuan Xie:
FastLGS: Speeding Up Language Embedded Gaussians with Feature Grid Mapping. 3922-3930 - Mingda Jia, Liming Zhao, Ge Li, Yun Zheng:
ContextHOI: Spatial Context Learning for Human-Object Interaction Detection. 3931-3939 - Mingda Jia, Liming Zhao, Ge Li, Yun Zheng:
Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection. 3940-3948 - Yizhen Jia, Rong Quan, Yue Feng, Haiyan Chen, Jie Qin:
Doubly Contrastive Learning for Source-Free Domain Adaptive Person Search. 3949-3957 - Yueru Jia, Aosong Cheng, Yuhui Yuan, Chuke Wang, Ji Li, Huizhu Jia, Shanghang Zhang:
DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework. 3958-3966 - Dadong Jiang, Xianghui Yang, Zibo Zhao, Sheng Zhang, Jiaao Yu, Zeqiang Lai, Shaoxiong Yang, Chunchao Guo, Xiaobo Zhou, Zhihui Ke:
FlexiTex: Enhancing Texture Generation via Visual Guidance. 3967-3975 - Hao Jiang, Yang Jin, Zhicheng Sun, Kun Xu, Kun Xu, Liwei Chen, Yang Song, Kun Gai, Yadong Mu:
Granularity-Adaptive Spatial Evidence Tokenization for Video Question Answering. 3976-3984 - Jianan Jiang, Hao Tang, Zhilin Jiang, Weiren Yu, Di Wu:
ARNet: Self-Supervised FG-SBIR with Unified Sample Feature Alignment and Multi-Scale Token Recycling. 3985-3993 - Jianfei Jiang, Liyong Wang, Haochen Yu, Tianyu Hu, Jiansheng Chen, Huimin Ma:
RRT-MVS: Recurrent Regularization Transformer for Multi-View Stereo. 3994-4002 - Jimao Jiang, Diya Sun, Tianbing Wang, Yuru Pei:
SCCS: Deep Neural Spectral Clustering for Self-Supervised Subcellular Structure Segmentation. 4003-4011 - Liyao Jiang, Negar Hassanpour, Mohammad Salameh, Mohammadreza Samadi, Jiao He, Fengyu Sun, Di Niu:
PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation. 4012-4020 - Luoqian Jiang, Yong Guo, Bingna Xu, Haolin Pan, Jiezhang Cao, Wenbo Li, Jian Chen:
Restabilizing Diffusion Models with Predictive Noise Fusion Strategy for Image Super-Resolution. 4021-4029 - Nan Jiang, Shanchao Liang, Chengxiao Wang, Jiannan Wang, Lin Tan:
LATTE: Improving Latex Recognition for Tables and Formulae with Iterative Refinement. 4030-4038 - Pengfei Jiang, Mingbao Lin, Fei Chao:
Move and Act: Enhanced Object Manipulation and Background Integrity for Image Editing. 4039-4047 - Rui Jiang, Xinghe Fu, Guangcong Zheng, Teng Li, Taiping Yao, Xi Li:
Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models. 4048-4056 - Sijia Jiang, Jing Hua, Zhizhong Han:
Query Quantized Neural SLAM. 4057-4065 - Sijia Jiang, Tong Wu, Jing Hua, Zhizhong Han:
Sensing Surface Patches in Volume Rendering for Inferring Signed Distance Functions. 4066-4074 - Yutao Jiang, Qiong Wu, Wenhao Lin, Wei Yu, Yiyi Zhou:
What Kind of Visual Tokens Do We Need? Training-Free Visual Token Pruning for Multi-Modal Large Language Models from the Perspective of Graph. 4075-4083 - Xianhe Jiao, Chenlei Lv, Junli Zhao, Ran Yi, Yu-Hui Wen, Zhenkuan Pan, Zhongke Wu, Yong-Jin Liu:
Weighted Poisson-disk Resampling on Large-Scale Point Clouds. 4084-4092 - Yingying Jiao, Zhigang Wang, Sifan Wu, Shaojing Fan, Zhenguang Liu, Zhuoyue Xu, Zheqi Wu:
SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled Videos. 4093-4101 - Yingying Jiao, Zhigang Wang, Zhenguang Liu, Shaojing Fan, Sifan Wu, Zheqi Wu, Zhuoyue Xu:
Optimizing Human Pose Estimation Through Focused Human and Joint Regions. 4102-4110 - Can Jin, Tianjin Huang, Yihua Zhang, Mykola Pechenizkiy, Sijia Liu, Shiwei Liu, Tianlong Chen:
Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective. 4111-4119 - Dongyang Jin, Chao Fan, Weihua Chen, Shiqi Yu:
Exploring More from Multiple Gait Modalities for Human Identification. 4120-4128 - Er Jin, Qihui Feng, Yongli Mou, Gerhard Lakemeyer, Stefan Decker, Oliver Simons, Johannes Stegmaier:
LogicAD: Explainable Anomaly Detection via VLM-based Text Feature Extraction. 4129-4137 - Jiandong Jin, Xiao Wang, Qian Zhu, Haiyang Wang, Chenglong Li:
Pedestrian Attribute Recognition: A New Benchmark Dataset and a Large Language Model Augmented Framework. 4138-4146 - Long Jin, Han Nong, Liangming Chen, Zhenming Su:
A Method for Enhancing Generalization of Adam by Multiple Integrations. 4147-4155 - Hyungjun Joo, Hyeonggeun Han, Sehwan Kim, Sangwoo Hong, Jungwoo Lee:
Constructing Fair Latent Space for Intersection of Fairness and Explainability. 4156-4165 - Woojin Jun, WonJun Moon, Cheol-Ho Cho, Minseok Jung, Jae-Pil Heo:
Bridging the Semantic Granularity Gap Between Text and Frame Representations for Partially Relevant Video Retrieval. 4166-4174 - Dachun Kai, Yueyi Zhang, Jin Wang, Zeyu Xiao, Zhiwei Xiong, Xiaoyan Sun:
Event-Enhanced Blurry Video Super-Resolution. 4175-4183 - Danial Kamali, Elham J. Barezi, Parisa Kordjamshidi:
NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization. 4184-4193 - Ben Kang, Xin Chen, Simiao Lai, Yang Liu, Yi Liu, Dong Wang:
Exploring Enhanced Contextual Information for Video-Level Object Tracking. 4194-4202 - Gyeongjin Kang, Younggeun Lee, Seungjun Oh, Eunbyung Park:
CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis. 4203-4211 - Jiahui Kang, Qing Cai, Runqing Tan, Yimei Liu, Zhi Liu:
C2PD: Continuity-Constrained Pixelwise Deformation for Guided Depth Super-Resolution. 4212-4220 - Jingcheng Ke, Waikeung Wong, Jia Wang, Mu Li, Lunke Fei, Jie Wen:
DiffusionREC: Diffusion Model with Adaptive Condition for Referring Expression Comprehension. 4221-4229 - Muhammad Uzair Khattak, Muhammad Ferjad Naeem, Muzammal Naseer, Luc Van Gool, Federico Tombari:
Learning to Prompt with Text Only Supervision for Vision-Language Models. 4230-4238 - Donghyun Kim, Hyeonkyeong Kwon, Yumin Kim, Seong Jae Hwang:
PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling. 4239-4247 - Hyeonseok Kim, Byeongkeun Kang, Yeejin Lee:
Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration. 4248-4256 - Hyunjun Kim, Nam Ik Cho:
APR-RD: Complemental Two Steps for Self-Supervised Real Image Denoising. 4257-4265 - Jihwan Kim, Miso Lee, Cheol-Ho Cho, Jihyun Lee, Jae-Pil Heo:
Prediction-Feedback DETR for Temporal Action Detection. 4266-4274 - Jisoo Kim, Jungbin Cho, Joonho Park, Soonmin Hwang, Da Eun Kim, Geon Kim, Youngjae Yu:
DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation. 4275-4283 - Jungho Kim, Changwon Kang, Dongyoung Lee, Sehwan Choi, Jun Won Choi:
ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder. 4284-4292 - Minkuk Kim, Hyeon Bae Kim, Jinyoung Moon, Jinwoo Choi, Seong Tae Kim:
HiCM²: Hierarchical Compact Memory Modeling for Dense Video Captioning. 4293-4301 - Seyeon Kim, Siyoon Jin, Jihye Park, Kihong Kim, Jiyoung Kim, Jisu Nam, Seungryong Kim:
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation. 4302-4310 - Soowoong Kim, Minseong Kwon, Junho Choi, Gun Bang, Seungjoon Yang:
TSDF-Based Efficient Motion-Compensated Temporal Interpolation for 3D Dynamic Sequences. 4311-4319 - Taewhan Kim
, Soeun Lee, Si-Woo Kim, Dong-Jin Kim:
ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning. 4320-4328 - Taewoong Kim, Byeonghwi Kim, Jonghyun Choi:
Multi-Modal Grounded Planning and Efficient Replanning for Learning Embodied Agents with a Few Examples. 4329-4337 - Younghyun Kim, Geunmin Hwang, Junyu Zhang, Eunbyung Park:
DiffuseHigh: Training-Free Progressive High-Resolution Image Synthesis Through Structure Guidance. 4338-4346 - Konstantin Klemmer, Esther Rolf, Caleb Robinson, Lester Mackey, Marc Rußwurm:
SatCLIP: Global, General-Purpose Location Embeddings with Satellite Imagery. 4347-4355 - Hyun-kyu Ko, Dongheok Park, Youngin Park, Byeonghyeon Lee, Juhee Han, Eunbyung Park:
Sequence Matters: Harnessing Video Models in 3D Super-Resolution. 4356-4364 - Maksim Kolodiazhnyi, Anna Vorontsova, Matvey Skripkin, Danila Rukhovich, Anton Konushin:
UniDet3D: Multi-dataset Indoor 3D Object Detection. 4365-4373 - Hanyang Kong, Xingyi Yang, Xinchao Wang:
Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling. 4374-4382 - Jiayi Kong, Xurui Song, Shuo Huai, Baixin Xu, Jun Luo, Ying He:
Do Not DeepFake Me: Privacy-Preserving Neural 3D Head Reconstruction Without Sensitive Images. 4383-4391 - Mengxun Kong, Jie Guo, Chen Wang, Ye Yuan, Yanwen Guo:
Real-Time Neural Denoising with Render-Aware Knowledge Distillation. 4392-4400 - Ming Kong, Xianzhou Zeng, Luyuan Chen, Yadong Li, Bo Yan, Qiang Zhu:
MHBench: Demystifying Motion Hallucination in VideoLLMs. 4401-4409 - Koen Kraaijveld, Yifan Jiang, Kaixin Ma, Filip Ilievski:
COLUMBUS: Evaluating COgnitive Lateral Understanding Through Multiple-Choice reBUSes. 4410-4418 - Akash Kumar, Sirshapan Mitra, Yogesh Singh Rawat:
Stable Mean Teacher for Semi-supervised Video Action Detection. 4419-4427 - Suruchi Kumari, Pravendra Singh:
A Unified Degradation-Robust Approach to SSL and UDA for 3D Medical Images. 4428-4436 - Myung-Joon Kwon, Wonjun Lee, Seung-Hun Nam, Minji Son, Changick Kim:
SAFIRE: Segment Any Forged Image Region. 4437-4445 - Jian Lan, Diego Frassinelli, Barbara Plank:
Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies Between Model Predictions and Human Responses in VQA. 4446-4454 - Yunwei Lan, Zhigao Cui, Chang Liu, Jialun Peng, Nian Wang, Xin Luo, Dong Liu:
Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training. 4455-4463 - Maria A. Larchenko, Alexander Lobashev, Dmitry Guskov, Vladimir Vladimirovich Palyulin:
Color Transfer with Modulated Flows. 4464-4472 - Quang-Hung Le, Long Hoang Dang, Ngan Hoang Le, Truyen Tran, Thao Minh Le:
Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models. 4473-4481 - Chan Lee, Seungho Shin, Gyeong-Moon Park, Jung Uk Kim:
Multispectral Pedestrian Detection with Sparsely Annotated Label. 4482-4490 - Hyunjee Lee, Youngsik Yun, Jeongmin Bae, Seoha Kim, Youngjung Uh:
Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space. 4491-4498 - Ji Soo Lee, Jongha Kim, Jeehye Na, Jinyoung Park, Hyunwoo J. Kim:
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning. 4499-4507 - Jooyoung Lee, Jaeyoon Lee, Jongwon Choi:
NBA3D: Neighbor-Based Confidence Adjustment for 3D Rare Object Detection Using LiDAR. 4508-4516 - JunGyu Lee, Yeji Choi, Haksub Kim, Ig-Jae Kim, Gi Pyo Nam:
Navigating Label Ambiguity for Facial Expression Recognition in the Wild. 4517-4525 - Minhyeok Lee, Suhwan Cho, Chajin Shin, Jungho Lee, Sunghun Yang, Sangyoun Lee:
Video Diffusion Models Are Strong Video Inpainter. 4526-4533
Technical Tracks 5
- Sangho Lee, Il Yong Chun, Hogun Park:
MAMS: Model-Agnostic Module Selection Framework for Video Captioning. 4535-4543 - Sanghyeon Lee, Jooyeol Yun, Jaegul Choo:
Enabling Region-Specific Control via Lassos in Point-Based Colorization. 4544-4552 - Subeen Lee, Jiyeon Han, Soyeon Kim, Jaesik Choi:
Diverse Rare Sample Generation with Pretrained GANs. 4553-4561 - Yuxiao Lee, Xiaofeng Cao, Jingcai Guo, Wei Ye, Qing Guo, Yi Chang:
Concept Matching with Agent for Out-of-Distribution Detection. 4562-4570 - Mengqi Lei, Haochen Wu, Xinhua Lv, Xin Wang:
ConDSeg: A General Medical Image Segmentation Framework via Contrast-Driven Feature Enhancement. 4571-4579 - Jiaqi Leng, Yakun Ju, Yuanxu Duan, Jiangnan Zhang, Qingxuan Lv, Zuxuan Wu, Hao Fan:
FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-from-gradients. 4580-4588 - Yicheng Leng, Chaowei Fang, Junye Chen, Yixiang Fang, Sheng Li, Guanbin Li:
Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal. 4589-4597 - Yarin Yerushalmi Levi, Edita Grolman, Idan Yankelev, Amit Giloni, Omer Hofman, Toshiya Shimizu, Asaf Shabtai, Yuval Elovici:
KDAT: Inherent Adversarial Robustness via Knowledge Distillation with Adversarial Tuning for Object Detection Models. 4598-4606 - Jaihyun Lew, Jooyoung Choi, Chaehun Shin, Dahuin Jung, Sungroh Yoon:
Disentangled Motion Modeling for Video Frame Interpolation. 4607-4615 - Bingliang Li, Fengyu Yang, Yuxin Mao, Qingwen Ye, Hongkai Chen, Yiran Zhong:
Tri-Ergon: Fine-Grained Video-to-Audio Generation with Multi-Modal Conditions and LUFS Control. 4616-4624 - Bonan Li, Zicheng Zhang, Xuecheng Nie, Congying Han, Yinhan Hu, Xinmin Qiu, Tiande Guo:
StyO: Stylize Your Face in Only One-Shot. 4625-4633 - Chade Li, Pengju Zhang, Bo Liu, Hao Wei, Yihong Wu:
FEAST-Mamba: FEAture and SpaTial Aware Mamba Network with Bidirectional Orthogonal Fusion for Cross-Modal Point Cloud Segmentation. 4634-4642 - Chen Li, Rui Zhao, Zeyu Wang, Huiying Xu, Xinzhong Zhu:
RemDet: Rethinking Efficient Model Design for UAV Object Detection. 4643-4651 - Chenxin Li, Xinyu Liu, Wuyang Li, Cheng Wang, Hengyu Liu, Yifan Liu, Zhen Chen, Yixuan Yuan:
U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation. 4652-4660 - Chuanhao Li, Zhen Li, Chenchen Jing, Xiaomeng Fan, Wenbo Ye, Yuwei Wu, Yunde Jia:
Consistency of Compositional Generalization Across Multiple Levels. 4661-4669 - Chunxiao Li, Xiaoxiao Wang, Boming Miao, Chuanlong Xie, Zizhe Wang, Yao Zhu:
An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques. 4670-4678 - Daxin Li, Yuanchao Bai, Kai Wang, Junjun Jiang, Xianming Liu, Wen Gao:
CALLIC: Content Adaptive Learning for Lossless Image Compression. 4679-4688 - Guangyuan Li, Yongkang Wang, Junsheng Luan, Lei Zhao, Wei Xing, Huaizhong Lin, Binkai Ou:
Cascaded Diffusion Models for Virtual Try-On: Improving Control and Resolution. 4689-4697 - Guoqiu Li, Jin Song, Yiyun Fei:
HomeDiffusion: Zero-Shot Object Customization with Multi-View Representation Learning for Indoor Scenes. 4698-4706 - Hao Li, Hao Fei, Zechao Hu, Zhengwei Yang, Zheng Wang:
VEGAS: Towards Visually Explainable and Grounded Artificial Social Intelligence. 4707-4715 - Haojin Li, Heng Li, Jianyu Chen, Rihan Zhong, Ke Niu, Huazhu Fu, Jiang Liu:
AIF-SFDA: Autonomous Information Filter Driven Source-Free Domain Adaptation for Medical Image Segmentation. 4716-4724 - Huafeng Li, Dayong Su, Qing Cai, Yafei Zhang:
BSAFusion: A Bidirectional Stepwise Feature Alignment Network for Unaligned Medical Image Fusion. 4725-4733 - Huaqiu Li, Wang Zhang, Xiaowan Hu, Tao Jiang, Zikang Chen, Haoqian Wang:
Prompt-SID: Learning Structural Representation Prompt via Latent Diffusion for Single Image Denoising. 4734-4742 - Jiafeng Li, Ying Wen, Lianghua He:
M²RL-Net: Multi-View and Multi-Level Relation Learning Network for Weakly-Supervised Image Forgery Detection. 4743-4751 - Jiahao Li, Yang Lu, Yuan Xie, Yanyun Qu:
MaskViM: Domain Generalized Semantic Segmentation with State Space Models. 4752-4760 - Jian Li, Siwang Zhou:
Block-Based Multi-Scale Image Rescaling. 4761-4769 - Jiawei Li
, Hongwei Yu, Jiansheng Chen, Xinlong Ding, Jinlong Wang, Jinyuan Liu, Bochao Zou, Huimin Ma:
A²RNet: Adversarial Attack Resilient Network for Robust Infrared and Visible Image Fusion. 4770-4778 - Jiaxing Li, Lin Jiang, Zeqi Ma, Kaihang Jiang, Xiaozhao Fang, Jie Wen:
Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval. 4779-4787 - Junyi Li, Zhilu Zhang, Wangmeng Zuo:
Rethinking Transformer-Based Blind-Spot Network for Self-Supervised Image Denoising. 4788-4796 - Ke Li, Di Wang, Zhangyuan Hu, Shaofeng Li, Weiping Ni, Lin Zhao, Quan Wang:
FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection. 4797-4805 - Ke Li, Gengyu Lyu, Hao Chen, Bochen Xie, Zhen Yang, Youfu Li, Yongjian Deng:
Know Where You Are From: Event-Based Segmentation via Spatio-Temporal Propagation. 4806-4814 - Kun Li, Dan Guo, Guoliang Chen, Chunxiao Fan, Jingyuan Xu, Zhiliang Wu, Hehe Fan, Meng Wang:
Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition. 4815-4823 - Kunxi Li, Tianyu Zhan
, Kairui Fu, Shengyu Zhang, Kun Kuang, Jiwei Li, Zhou Zhao, Fan Wu, Fei Wu:
MergeNet: Knowledge Migration Across Heterogeneous Models, Tasks, and Modalities. 4824-4832 - Ling Li, Ruiwen Gu, Chongyang Wang, Junliang Xing, Xinchun Yu, Xiao-Ping Zhang:
Multi-View 3D Human Pose Estimation with Weakly Synchronized Images. 4833-4841 - Maodong Li, Chao Zheng, Jian Wang, Bing Li:
Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization. 4842-4850 - Peize Li, Qingyi Si, Peng Fu, Zheng Lin, Yan Wang:
Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering. 4851-4859 - Pengna Li, Kangyi Wu, Jingwen Fu, Sanping Zhou:
REGNav: Room Expert Guided Image-Goal Navigation. 4860-4868 - Pu Li, Wenhao Zhang, Jianwei Guo, Jinglu Chen, Dong-Ming Yan:
Revisiting CAD Model Generation by Learning Raster Sketch. 4869-4877 - Qiang Li, Di Liu, Jun Kong, Sen Li, Hui Xu, Jianzhong Wang:
Temporal Action Localization with Cross Layer Task Decoupling and Refinement. 4878-4886 - Rong Li, Liang Li, Jiehua Zhang, Qiang Zhao, Hongkui Wang, Chenggang Yan:
Region-aware Difference Distilling with Attribute-guided Contrastive Regularization for Change Captioning. 4887-4895 - Ruihang Li, Tao Li, Shanding Ye, Kaikai Xiao, Huangnan Zheng, Zhe Yin, Zhijie Pan:
Enhancing Generalizability via Utilization of Unlabeled Data for Occupancy Perception. 4896-4904 - Ruihuang Li, Liyi Chen, Zhengqiang Zhang, Varun Jampani, Vishal M. Patel, Lei Zhang:
SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing. 4905-4913 - Ruoran Li, Runzhao Yang, Wenxin Xiang, Yuxiao Cheng, Tingxiong Xiao, Lu Yang, Jinli Suo:
A Compact Implicit Neural Representation for Efficient Storage of Massive 4D Functional Magnetic Resonance Imaging. 4914-4922 - Shijie Li, Weijun Lin, Qingyuan Xiang, Yunbin Tu, Shitan Asu, Zheng Li:
Unsupervised Photometric-Consistent Depth Estimation from Endoscopic Monocular Video. 4923-4931 - Shiyu Li, Pengxu Wei, Pengchong Qiao, Chang Liu, Jie Chen:
DigitalLLaVA: Incorporating Digital Cognition Capability for Physical World Comprehension in Multimodal LLMs. 4932-4940 - Teng Li, Xingjun Ma, Yu-Gang Jiang:
AIM: Additional Image Guided Generation of Transferable Adversarial Attacks. 4941-4949 - Tengpeng Li, Hanli Wang, Xianfei Li, Wenlong Liao, Tao He, Pai Peng:
Generative Planning with 3D-Vision Language Pre-training for End-to-End Autonomous Driving. 4950-4958 - Wenrui Li, Zhe Yang, Wei Han, Hengyu Man, Xingtao Wang, Xiaopeng Fan:
Hyperbolic-Constraint Point Cloud Reconstruction from Single RGB-D Images. 4959-4967 - Wenxue Li, Lie Ju, Feilong Tang, Peng Xia, Xinyu Xiong, Ming Hu, Lei Zhu, Zongyuan Ge:
Towards Realistic Semi-supervised Medical Image Classification. 4968-4976 - Wenyun Li, Zheng Zhang, Xiangyuan Lan, Dongmei Jiang:
Transferable Adversarial Face Attack with Text Controlled Attribute. 4977-4985 - Xiaohai Li, Bineng Zhong, Qihua Liang, Guorong Li, Zhiyi Mo, Shuxiang Song:
MambaLCT: Boosting Tracking via Long-term Context State Space Model. 4986-4994 - Xinzhe Li, Jiahui Zhan, Shengfeng He, Yangyang Xu, Junyu Dong, Huaidong Zhang, Yong Du:
PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium. 4995-5003 - Xudong Li, Yan Zhang, Yunhang Shen, Ke Li, Runze Hu, Xiawu Zheng, Sicheng Zhao:
Feature Denoising Diffusion Model for Blind Image Quality Assessment. 5004-5012 - Xueyang Li, Yunzhong Lou, Yu Song, Xiangdong Zhou:
Mamba-CAD: State Space Model for 3D Computer-Aided Design Generative Modeling. 5013-5021 - Yachao Li, Dong Liang, Tianyu Ding, Sheng-Jun Huang:
StructSR: Refuse Spurious Details in Real-World Image Super-Resolution. 5022-5030 - Yaowei Li, Xintao Wang, Zhaoyang Zhang, Zhouxia Wang, Ziyang Yuan, Liangbin Xie, Ying Shan, Yuexian Zou:
Image Conductor: Precision Control for Interactive Video Synthesis. 5031-5038 - Yayuan Li, Jintao Guo, Lei Qi, Wenbin Li, Yinghuan Shi:
Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP. 5039-5047 - Yiheng Li
, Yang Yang, Zhen Lei:
RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection. 5048-5056 - Yihui Li, Chengxin Lv, Hongyu Yang, Di Huang:
Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images. 5057-5065 - Yinghui Li, Qianyu Zhou, Jingyu Gong, Ye Zhu, Richard Dazeley, Xinkui Zhao, Xuequan Lu:
DAPoinTr: Domain Adaptive Point Transformer for Point Cloud Completion. 5066-5074 - Zhangbin Li, Jinxing Zhou, Jing Zhang, Shengeng Tang, Kun Li, Dan Guo:
Patch-level Sounding Object Tracking for Audio-Visual Question Answering. 5075-5083 - Zhangheng Li, Tianlong Chen, Linyi Li, Bo Li, Zhangyang Wang:
Sparse Transfer Learning Accelerates and Enhances Certified Robustness: A Comprehensive Study. 5084-5091 - Zhuoyuan Li, Yubo Ai, Jiahao Lu, Chuxin Wang, Jiacheng Deng, Hanzhi Chang, Yanzhe Liang, Wenfei Yang, Shifeng Zhang, Tianzhu Zhang:
Pamba: Enhancing Global Interaction in Point Clouds via State Space Model. 5092-5100 - Zixu Li, Zhiwei Chen, Haokun Wen, Zhiheng Fu, Yupeng Hu, Weili Guan:
ENCODER: Entity Mining and Modification Relation Binding for Composed Image Retrieval. 5101-5109 - Zonglin Li, Xiaoqian Lv, Qinglin Liu, Quanling Meng, Xin Sun, Shengping Zhang:
ProsodyTalker: 3D Visual Speech Animation via Prosody Decomposition. 5110-5118 - Zongyi Li, Jianbo Li, Yuxuan Shi, Jiazhong Chen, Shijuan Huang, Linnan Tu, Fei Shen, Hefei Ling:
Exploring the Potential of Large Vision-Language Models for Unsupervised Text-Based Person Retrieval. 5119-5127 - Baoyu Liang, Qile Su, Shoutai Zhu, Yuchen Liang, Chao Tong:
VidEvent: A Large Dataset for Understanding Dynamic Evolution of Events in Videos. 5128-5136 - Guoyan Liang, Qin Zhou, Zhe Wang, Jingyuan Chen, Lin Gu, Chang Yao, Sai Wu, Bingcang Huang, Kai Chen:
Semantic-guided Masked Mutual Learning for Multi-modal Brain Tumor Segmentation with Arbitrary Missing Modalities. 5137-5145 - Hanzhe Liang, Guoyang Xie, Chengbin Hou, Bingshu Wang, Can Gao, Jinbao Wang:
Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection. 5146-5154 - Li Liang, Naveed Akhtar, Jordan Vice, Xiangrui Kong, Ajmal Saeed Mian:
Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion. 5155-5163 - Yiyuan Liang, Zhiying Yan, Liqun Chen, Jiahuan Zhou, Luxin Yan, Sheng Zhong, Xu Zou:
DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes. 5164-5172 - Zixi Liang, Guowei Xu, Haifeng Wu, Ye Huang, Wen Li, Lixin Duan:
S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field. 5173-5181 - Bencheng Liao, Xinggang Wang, Lianghui Zhu, Qian Zhang, Chang Huang:
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention. 5182-5190 - Dongping Liao, Xitong Gao, Yabo Xu, Cheng-Zhong Xu:
Progressive Distribution Matching for Federated Semi-Supervised Learning. 5191-5199 - Sangbeom Lim, Seongchan Kim, Seungjun An, Seokju Cho, Paul Hongsuck Seo, Seungryong Kim:
Multi-Granularity Video Object Segmentation. 5200-5208 - Beibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Robby T. Tan:
NightHaze: Nighttime Image Dehazing via Self-Prior Learning. 5209-5217 - Ente Lin, Xujie Zhang, Fuwei Zhao, Yuxuan Luo, Xin Dong, Long Zeng, Xiaodan Liang:
DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder. 5218-5226 - Guixu Lin, Muyao Niu, Qingtian Zhu, Zhengwei Yin, Zhuoxiao Li, Shengfeng He, Yinqiang Zheng:
Adversarial Attacks on Event-Based Pedestrian Detectors: A Physical Approach. 5227-5235 - Jiaqi Lin, Zhihao Li, Binxiao Huang, Xiao Tang, Jianzhuang Liu, Shiyong Liu, Xiaofei Wu, Fenglong Song, Wenming Yang:
Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting. 5236-5244 - Jiayi Lin, Jiabo Huang, Jian Hu, Shaogang Gong:
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation. 5245-5253 - Jiaying Lin, Yuen Hei Yeung, Shuquan Ye, Rynson W. H. Lau:
Leveraging RGB-D Data with Cross-Modal Context Mining for Glass Surface Detection. 5254-5261 - Kaiqing Lin, Yuzhen Lin, Weixiang Li, Taiping Yao, Bin Li:
Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection. 5262-5270 - Min Lin, Gangwei Xu, Yun Wang, Xianqi Wang, Xin Yang:
FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation. 5271-5279 - Pei Lin:
HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models. 5280-5288 - Yangkai Lin, Jiabao Lei, Kui Jia:
Multi-StyleGS: Stylized Gaussian Splatting with Multiple Styles. 5289-5297 - Yiheng Lin, Yihan Hu, Chenyi Zhang, Ting Liu, Xiaochao Qu, Luoqi Liu, Yao Zhao, Yunchao Wei:
Memory Efficient Matting with Adaptive Token Routing. 5298-5306 - Yunlong Lin, Tian Ye, Sixiang Chen, Zhenqi Fu, Yingying Wang, Wenhao Chai, Zhaohu Xing, Wenxue Li, Lei Zhu, Xinghao Ding:
AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement. 5307-5315 - Yunlong Lin, Zhenqi Fu, Kairun Wen, Tian Ye, Sixiang Chen, Ge Meng, Yingying Wang, Chui Kong, Yue Huang, Xiaotong Tu, Xinghao Ding:
DPLUT: Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors. 5316-5324 - Yuxin Lin, Wei Wang, Xiaoling Luo, Zhihao Wu, Chengliang Liu, Jie Wen, Yong Xu:
Deep Hierarchies and Invariant Disease-Indicative Feature Learning for Computer Aided Diagnosis of Multiple Fundus Diseases. 5325-5333 - Zhihang Lin, Mingbao Lin, Luxi Lin, Rongrong Ji:
Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference. 5334-5342 - Peng Ling, Tiao Tan, Jiaqi Lin, Wenming Yang:
SOVGaussian: Sparse-View 3D Gaussian Splatting for Open-Vocabulary Scene Understanding. 5343-5351 - Baolong Liu, Ruiqing Yang, Roukai Huang, Wenhao Xu, Xin Pan, Chuanhuang Li, Bin Wang, Xun Wang, Jianfeng Dong:
Towards Ship License Plate Recognition in the Wild: A Large Benchmark and Strong Baseline. 5352-5360 - Chengzhi Liu, Zile Huang, Zhe Chen, Feilong Tang, Yu Tian, Zhongxing Xu, Zihong Luo, Yalin Zheng, Yanda Meng:
Incomplete Modality Disentangled Representation for Ophthalmic Disease Grading and Diagnosis. 5361-5369 - Chuang Liu, Yichao Cao, YingYing Zhang, Xiu Su, Haogang Zhu:
Perturbating, Tuning, and Collaborating: Harnessing Vision Foundation Models for Single Domain Generalization on Medical Imaging. 5370-5378 - Decheng Liu, Zongqi Wang, Chunlei Peng, Nannan Wang, Ruimin Hu, Xinbo Gao:
Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations. 5379-5387 - Delong Liu, Zhaohui Hou, Mingjie Zhan, Shihao Han, Zhicheng Zhao, Fei Su:
UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer. 5388-5396 - Dunqiang Liu, Shujun Huang, Wen Li, Siqi Shen, Cheng Wang:
Text to Point Cloud Localization with Multi-Level Negative Contrastive Learning. 5397-5405 - Duo Liu, Yiqi Shi, Guoyin Zhang, Sizhao Li, Liguo Zhang:
Zero-Shot Noise2Mean: Gap Minimization for Efficient Denoising from a Single Noisy Image. 5406-5414 - Fan Liu, Wenwen Cai, Jian Huo, Chuanyi Zhang, Delong Chen, Jun Zhou:
Making Large Vision Language Models to Be Good Few-Shot Learners. 5415-5423 - Gaofeng Liu, Zhiyuan Ma, Tao Fang:
DreamAlign: Dynamic Text-to-3D Optimization with Human Preference Alignment. 5424-5432 - Han Liu, Yuanyuan Wang, Xiaotong Zhang, Feng Zhang, Wei Wang, Fenglong Ma, Hong Yu:
Multi-Label Few-Shot Image Classification via Pairwise Feature Augmentation and Flexible Prompt Learning. 5433-5441 - Haomiao Liu, Hao Xu, Chuhuai Yue, Bo Ma:
UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection. 5442-5450 - Hongjian Liu, Qingsong Xie, Tianxiang Ye, Zhijie Deng, Chen Chen, Shixiang Tang, Xueyang Fu, Haonan Lu, Zheng-Jun Zha:
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation. 5451-5459 - Hongyuan Liu, Haochen Yu, Bochao Zou, Juntao Lyu, Qi Mei, Jiansheng Chen, Huimin Ma:
ProtoCar: Learning 3D Vehicle Prototypes from Single-View and Unconstrained Driving Scene Images. 5460-5468 - Huaizhuo Liu, Hai-Miao Hu, Yonglong Jiang, Yurui Liu:
PEIE: Physics Embedded Illumination Estimation for Adaptive Dehazing. 5469-5477 - Jiajie Liu, Mengyuan Liu, Hong Liu, Wenhao Li:
TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation. 5478-5486 - Jiapeng Liu, Liang Li, Shihao Rao, Xiyan Gao, Weixin Guan, Bing Li, Can Ma:
Union Is Strength! Unite the Power of LLMs and MLLMs for Chart Question Answering. 5487-5495 - Jingyu Liu, Minquan Wang, Ye Ma, Bo Wang, Aozhu Chen, Quan Chen, Peng Jiang, Xirong Li:
D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching. 5496-5503 - Man Liu, Huihui Bai, Feng Li, Chunjie Zhang, Yunchao Wei, Tat-Seng Chua, Yao Zhao:
Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning. 5504-5512