


ICME 2025: Nantes, France
- IEEE International Conference on Multimedia and Expo, ICME 2025, Nantes, France, June 30 - July 4, 2025. IEEE 2025, ISBN 979-8-3315-9495-4

- Xiaoyu Tan, Teqi Hao, Xihe Qiu, Shaojie Shi, Yuan Cheng, Wei Chu, Yinghui Xu, Yuan Qi:

Leave the Bias in Bias: Mitigating the Label Noise Effects in Continual Visual Instruction Fine-Tuning. 1-7 - Xi Liu, Fanfan Ji, Bo Liu, Xiao-Tong Yuan:

Hier-pFedMe: Hierarchical Personalized Federated Learning with Moreau Envelopes. 1-6 - Hang Yu, Yansen Yu, Jiayan Qiu:
Pedestrian Trajectory Prediction Driven by Bidirectional Intention-Interaction. 1-6 - Qianqian Sun, Lu Shi, Linna Zhang, Gaoyun An, Yi Jin, Yidong Li, Yigang Cen:

Injecting Cross-modal Fine-Grained Perception into LLMs for 3D Object-of-Interest Understanding. 1-6 - Haoxuan Ji, Zheng Lin, Yuyao Sun, Fei Gao, Yuhang Wang, Haichang Gao, Zhenxing Niu:

Towards Aligned Data Forgetting via Twin Machine Unlearning. 1-6 - Yidan Sun, Qin Chao, Yangfeng Ji, Boyang Li:

Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding. 1-6 - Bimei Wang, Huilin Song, Jisheng Dang, Fei Shen, Hui Zhang, Liting Wang, Mangang Xie, Jizhao Liu, Jiasi Weng:

AS-Memory: Adaptive Sparse Memory Meeting Video-Language Models. 1-6 - Zhouhao Ouyang, Wen Xue, Tianyi Chen, Yan Huang, Si Wu, Yong Xu, Patrick Le Callet, Dapeng Oliver Wu:
InpaintFormer: Prompt-guided High-Quality Face Inpainting with Mask-Aware Self-Attention. 1-6 - Shuo Zhang, Jinsong Zhang, Zhejun Zhang, Lei Li:

Multimodal Mixture of Low-Rank Experts for Sentiment Analysis and Emotion Recognition. 1-6 - Xinyao Yu, Hao Sun, Zeyu Ling, Ziwei Niu, Zhenjia Bai, Rui Qin, Yen-Wei Chen, Lanfen Lin:

EPIC: Efficient Prompt Interaction for Text-Image Classification. 1-6 - Ruicheng Zhang, Haowei Guo, Zeyu Zhang, Puxin Yan, Shen Zhao:

GAMED-Snake: Gradient-aware Adaptive Momentum Evolution Deep Snake Model for Multi-organ Segmentation. 1-6 - Malya Singh, Priyankar Choudhary, Abdulmotaleb El-Saddik, Mukesh Saini:

ASTAnet: Transformer-based Siamese Network for Robust Audio-to-Audio Alignment in Amateur User Generated Audio Clips. 1-6 - Taiyu Niu, Geng Tu, Hui Wang, Bing Qin, Ruifeng Xu:

A Multi-stage and Multi-target Knowledge Distillation Framework for Multimodal Conversational Emotion Recognition. 1-6 - Dilxat Abdureyim, Bo Ma, Yating Yang, Rui Dong, YiDu Chen, Azmat Anwar, Lei Wang:

Bidirectional Feature Fusion and Adaptive Decision Network for Multimodal Fake News Detection. 1-6 - Yanmin Chen, Shuo Wang, Mengyao Zhou, Chenglin Liu, Jun Luo:

Dataset Pruning: Optimizing Image Datasets with a Cross-Validation Method. 1-6 - Amar Tious, Toinon Vigier, Vincent Ricordel:

Subjective Quality Assessment for Point Clouds of Digital Humans with Shaded Rendering. 1-6 - Maosheng Su, Shuo Wang, Zhichuan Wang, Jun Luo:

VLCO:A Dual-Optimization Framework for Precise Camouflaged Object Localization and Segmentation. 1-6 - Rui Zhu, Zhaokang Lu, Bohan Liu, Yun Yang, Hua Yue, Chaogang Wang, Zixin Zhou:

IMTrack: Interlayer Interoperability and Multi-scene Optimization for Visual Multimodal Target Tracking. 1-6 - Aoran Liu, Kun Hu, Clinton Mo, Changyang Li, Zhiyong Wang:

Extended Short- and Long-Range Mesh Learning for Fast and Generalised Garment Simulation. 1-6 - Shuping Zhao, Chongli Zhuang, Li Yang, Yanling Zhong, Yanping Li, Yonghan Chen:

Toward Uncontrolled Palmprint Recognition via Multi-View Block Diagonal Structure Learning. 1-6 - Zijie Song, Zhenzhen Hu, Yixiao Ma, Jia Li, Richang Hong:
Video Flow as Time Series: Discovering Temporal Consistency and Variability for VideoQA. 1-6 - Wei Feng, Xin Wang, Hong Chen, Zeyang Zhang, Wenwu Zhu:

Multi-sentence Video Grounding for Long Video Generation. 1-6 - Yifan Lyu, Zehua Zang, Hongzhou Wu, Lixiang Liu, Jiangmeng Li:

RBDN: A Robust Background Denoising Network for Weakly Supervised Temporal Language Grounding. 1-6 - Jennifer Piane, Thiruvarangan Ramaraj, Jacob D. Furst, Daniela Raicu:

Video Label Refinement for Temporal Localization. 1-6 - Yulei Jian, Lingma Sun, Xiaofeng Wang, Jin Tang:

CLIP Guided Multimodal Prototype Learning for One-Shot Semantic Segmentation. 1-6 - Yang Li, Chengliang Wang, Xing Wu, Yonggang Luo, Peng Wang, Haidong Wang:

SAM-GA: SAM-Guided Grouped Aggregation Network for Weakly Supervised cardiac MRI Segmentation. 1-6 - Jiale Zhang, Qianxi Jia, Yang Liu, Wei Zhang, Wei Wei, Xin Tian:

SpatialMe: Stereo Video Conversion Using Depth-Warping and Blend-Inpainting. 1-6 - Yixuan Guan, Jianwei Niu, Tao Ren, Xuefeng Liu:
Enabling Communication-efficient and Robust Federated Learning over Packet Lossy Networks via Random Interleaved Vector Quantization. 1-6 - Teng Zhang, Yiqiang Chen, Xinlong Jiang, Wuliang Huang, Qian Chen, Chenlong Gao, Zhirui Wang, Bingjie Yan:

FairFHTL: Achieving Task-Agnostic Fairness in Federated Hetero-Task Learning. 1-6 - Pengfei Wang, Hao Zheng, Zhigang Hu, Aikun Xu, Meiguang Zheng, Liu Yang:

PCM-SAR: Physics-Driven Contrastive Mutual Learning for SAR Classification. 1-6 - Zhengyang Qi, Xiaohua Xu:

TrojFlow: Flow Models are Natural Targets for Trojan Attacks. 1-6 - Juepeng Zheng, Yibin Wen:

Evidential Graph Contrastive Alignment for Source-Free Blending-Target Domain Adaptation. 1-6 - Wei Zhang, Xuekang Peng, Zhichao Lian:

Free Try-On: Virtual Try-On without Garment-Agnostic Images and Warped Garments. 1-6 - Jiahao Fan, Weiting Chen, Zheming Fan, Ruizhi Yu:

Temporal Invariant Feature Combined with Arbitrary Enhancement for Missing Modality Emotion Recognition. 1-6 - Rui Niu, Weihao Wu, Jie Chen, Long Ma, Zhiyong Wu:

A Multi-Stage Framework for Multimodal Controllable Speech Synthesis. 1-6 - Qian Yao, Jun-Jie Huang, Yongjun Wang, Zihan Chen:
DRTNet: Diffusion Reconstruction Texture Network for AI-generated Image Detection. 1-6 - Xiangzheng Kong, Zhi Zeng, Chenxi Zhu, Zihan Ma, Minnan Luo:

Harmony in Chaos: A Progressive Noise-Resilient Network for Robust Fake News Video Detection. 1-6 - Limin Cheng, Hang Qin, Shouxu Kuang, Xinyu Wang, Ling Li, Yanjun Wu, Chen Zhao:

InterLayer: Efficient Inference with Interleaved Scheduling and Layer-Specific Optimization. 1-6 - Junyu Chen, Yihua Gao, Mingyong Li:

Visual Semantic Description Generation with MLLMs for Image-Text Matching. 1-6 - Feng Xiong, Geng Tu, Yice Zhang, Jun Wang, Shiwei Chen, Bin Liang, Yue Yu, Min Yang, Ruifeng Xu:

Multimodal Emotion Recognition in Conversations via Graph Structure Learning. 1-6 - Heqing Zou, Tianze Luo, Guiyang Xie, Victor Xiao Jie Zhang, Fengmao Lv, Guangcong Wang, Junyang Chen, Zhuochen Wang, Hansheng Zhang, Huaijian Zhang:
HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding. 1-6 - Bohong Li, Weiqi Luo, Peijia Zheng, Shunquan Tan, Jiwu Huang:

A GAN Framework for Asymmetric Embedding Costs Learning in JPEG Steganography. 1-6 - Zijian Wang, Yuqi Liu, Yan Zhao, Binghao Wang, Shen Cai, Yanting Zhang:

Neural Implicit Reconstruction and Fast Rendering Based on Dual Spherical Shell. 1-6 - Yiwei Lin, Yuying Bao, Tao Yu, Zhenqin Chen, Xu Cheng, Jinshan Xu:

Unsupervised Domain Adaptation for Fetal R-peak Detection at Trans-Pregnancy Stages based on Multiview Mixing. 1-6 - Fei Zhang, Hongxia Wang:

Robust Blind Spatio-Temporal Adaptive Video Watermarking Based on 3-D Symmetry. 1-6 - Qi Zhang, Haoqian Wang, Yuanxi Peng, Teng Li:

Geometrically-Inspired Irregular Expansion Techniques for Graph-based Point Cloud Learning. 1-6 - Kangrui Du, Yujun Qian, Juepeng Zheng:

Instance-Distance Active Learning for Source-Free Cross-Domain Object Detection. 1-6 - Qianyang Wu, Ye Shi, Xiaoshui Huang, Lan Xu, Jingyi Yu, Jingya Wang:

THOR: Text to Human-Object Interaction Diffusion via Relation Intervention. 1-7 - Yanda Li, Chi Zhang, Wenjia Jiang, Wanqi Yang, Bin Fu, Pei Cheng, Xin Chen, Meng Fang, Ling Chen, Yunchao Wei:

Adaptive Mobile Agent for Dynamic Interactions. 1-6 - Jingjing Lu, Huilong Pi, Yunchuan Qin, Zhuo Tang, Ruihui Li:

Self-Supervised Point Cloud Completion based on Multi-View Augmentations of Single Partial Point Cloud. 1-6 - Yuanyuan Wang, Hangting Chen, Dongchao Yang, Weiqin Li, Dan Luo, Guangzhi Li, Shan Yang, Zhiyong Wu, Helen Meng, Xixin Wu:

UniSep: Universal Target Audio Separation with Language Models at Scale. 1-6 - Yibin Wang, Changhai Zhou, Honghui Xu:

Enhancing Object Coherence in Layout-to-Image Synthesis. 1-6 - Jiming Yang, Xu Wang, Yi Jin, Yidong Li, Hui Yu:

Generative Adversarial Network-based Image and Tabular Data Generation with Differential Privacy. 1-6 - Jie Cheng, Hao Zheng, Meiguang Zheng, Lei Wang, Hao Wu, Jian Zhang:

ElimPCL: Eliminating Noise Accumulation with Progressive Curriculum Labeling for Source-Free Domain Adaptation. 1-6 - Weixiang Zhang, Wei Yao, Shuzhao Xie, Shijia Ge, Chen Tang, Zhi Wang:

Expansive Supervision for Neural Radiance Fields. 1-6 - Juncen Guo, Xiaoguang Zhu, Liangyu Teng, Hao Yang, Jing Liu, Yang Liu, Liang Song:

Adaptive Weighted Parameter Fusion with CLIP for Class-Incremental Learning. 1-6 - Yifan Yang, Jianheng Zhuo, Zengrui Jin, Ziyang Ma, Xiaoyu Yang, Zengwei Yao, Liyong Guo, Wei Kang, Fangjun Kuang, Long Lin, Daniel Povey, Xie Chen:

k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning. 1-6 - Mingyu Cao, Huibin Tan, Xueqiong Li, Wanrong Huang, Kedi Zhang, Yuhua Tang, Shaowu Yang:

Adaptive Distribution-Aware Modeling for Transformer Tracking. 1-6 - Zhiqiang Zeng, Longpei Wu, Xiaodong Wang, Fei Yan, Haiyan Huang:

Continuous Lane Detection Network with Hybrid Feature Fusion and Differential Aggregation. 1-6 - Xingyuan Li, Ruichao Hou, Tongwei Ren, Gangshan Wu:

KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection. 1-6 - Lingyu Qiu, Ke Jiang, Xiaoyang Tan:

RoGA: Towards Generalizable Deepfake Detection through Robust Gradient Alignment. 1-6 - Yu Qiao, Tianyu Meng, Huilin Ge, Xinning Wang, Jiayue Zhao, Qianchen Xia, Xin Yang:

Localization Hints Exploration for Object Matting. 1-6 - Yingru Chen, Zhihao Guo, Haimin Zhang, Min Xu:

STPM: Spatial-Temporal Point Mamba for Activity Recognition Using mmWave Radar Point Clouds. 1-6 - Xiang Zhang, Suping Wu, Weibin Qiu, Zhaocheng Jin, Sheng Yang:

Hyperbolic Space Learning Method Leveraging Temporal Motion Priors for Human Mesh Recovery. 1-6 - Jiaxi Yang, Haowen Hou:

RWKV-UI: UI Understanding with Enhanced Perception and Reasoning. 1-6 - Zhengrong Chen, Qinghua Zhu, An Zeng, YuZhu Ji, Baoyao Yang, Dan Pan:

Action Decomposition-based Actor-Critic for Supply Chain Optimization. 1-6 - Mingfu Yan, Jiancheng Huang, Yifan Liu, Shifeng Chen:

Component Adaptive Clustering for Generalized Category Discovery. 1-6 - Shun Zou, Yi Zou, Mingya Zhang, Shipeng Luo, Guangwei Gao, Guojun Qi:

Learning Dual-Domain Multi-Scale Representations for Single Image Deraining. 1-6 - Chunlin Lu, Yongheng Zhang, Peng Wang, Wenpeng Lu, Libo Qin:
MdCoT: Medical Diagnosis Chain-of-Thought with Self-Diagnostic Refinement for Alzheimer's Disease. 1-6 - Jinbin Wang, Aiping Yang, Yumeng Liu, Qinghua Hu:
Uncertainty-Driven Weakly Supervised Dehazing Network: Integrating Dynamic Attention and Multi-Scale Feature Fusion. 1-6 - Qing Xu, Shunbo Wang, Yunxiang Jiang, Simon Parkinson, Klaus Schoeffmann, Chuntie Chen:
Characterizing High-order Interactions between Eye Movement and Head Motion Variables in Augmented Reality-based Navigation Experience. 1-6 - Ying Zeng, Meiling Liu, Jiyun Zhou, Jingfeng Zhang:
Enhancing Hateful Meme Detection via Modality Enhancement and Multi-View Fusion. 1-6 - Wenjie Liu, Zhongliang Liu, Xiaoyan Yang, Man Sha, Yang Li:

ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting. 1-6 - Zhiyue Liu, Fanrong Ma, Xin Ling:

Target-oriented Multimodal Sentiment Classification with Counterfactual-enhanced Debiasing. 1-6 - Yuzhe Wu, Yipeng Xu, Tianyu Xu, Jialu Zhang, Jianfeng Ren, Xudong Jiang:
GCA-SUNet: A Gated Context-Aware Swin-UNet for Exemplar-Free Counting. 1-6 - Nasar Iqbal, Niki Martinel:

Pyramid-based Mamba Multi-class Unsupervised Anomaly Detection. 1-6 - Zunhai Su, Wang Shen, Linge Li, Zhe Chen, Hanyu Wei, Huangqi Yu, Kehong Yuan:

AKVQ-VL: Attention-Aware KV Cache Adaptive 2-Bit Quantization for Vision-Language Models. 1-6 - Guohong Huang, Ling-An Zeng, Zexin Zheng, Shengbo Gu, Wei-Shi Zheng:

Efficient Explicit Joint-level Interaction Modeling with Mamba for Text-guided HOI Generation. 1-6 - Longzhao Huang, Wenhao Xu, Changwei Wang, Rongtao Xu, Peng Lu, Shibiao Xu:

MVPS: Multi-View Adaptive Prompt Synergy for Zero-shot Anomaly Detection. 1-6 - Min Zhang, Zilin Wang, Liyan Chen, Kunhong Liu, Juncong Lin:

Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling. 1-6 - Bin Qin, Yi Li, Jiangmeng Li, Xuesong Wu, Yupeng Wang, Jianwen Cao:

Causal Deconfounding for Spurious Correlation in Domain Generalization. 1-6 - Xiangning Ruan, Baoxing Xie, Zhaohui Hou, Qixiang Yin, Fei Su, Zhicheng Zhao:

Think Twice: Empowering Action Recognition Models with Human-Like Deep Reasoning. 1-6 - Yefei Hou, Jie Tang:

True Match: Leveraging 2D-Assisted Queries for Multi-view 3D Detection in Polar Space. 1-6 - Minglin Hong, Bo Sun, Jun He, Yinghui Zhang:

TAD-IVR: Enhancing Temporal Action Detection via Instrumental Variable Regression. 1-6 - Zizhi Chen, Minghao Han, Xukun Zhang, Shuwei Ma, Tao Liu, Xing Wei, Lihua Zhang:

VGAT: A Cancer Survival Analysis Framework Transitioning from Generative Visual Question Answering to Genomic Reconstruction. 1-6 - Boyang Song, Jin Xiao, Xiaoguang Hu, Guofeng Zhang, Jiaqi Shi, Hao Jiang:

When Epipolar Transformers Meets Implicit Neural Super-Resolution in Multi-View Stereo. 1-6 - Kaili Lu, Jian Ji, Ruoxue Li, Falin Wang, Chengwei Xu:

PDFIN: Prompt-Guided Dynamic Feature Integration Network for Few-Shot Class-Incremental Remote Sensing Scene Classification. 1-6 - Xingqian Guo, Tingting Chai, Lunke Fei, Jialing Xu, Guanglu Zhou, Xiangqian Wu, Haoxing Cao:

TRAMFuse: Text image Tampering Detection via Directional Residual Attention Mechanism. 1-6 - Kai Li, Guo Chen, Runxuan Yang, Xiaolin Hu:

SPMamba: Leveraging Long-Sequence Modeling with State Space Models for Speech Separation. 1-6 - Houqiang Zhong, Shaocheng Shen, Ke Cai, Zhenlong Wu, Jiangchao Yao, Yuan Cheng, Xuefei Li, Xiaoyun Zhang, Li Song, Qiang Hu:
Serial Low-rank Adaptation of Vision Transformer. 1-6 - Shaokang Wang, Dingquan Li, Guoqing Xiang, Jinchang Xu, Shanghang Zhang, Xiaodong Xie:

Adaptive Semantic Compression: Compatible Bitstream for Scalable Human-Machine Perception Sample Adaption. 1-6 - Xiangfei Sheng, Weidong Zou, Pengfei Chen, Li Cai, Chao He, Leida Li:

Text-to-Image Diffusion Models are AI-Generated Image Quality Scorers. 1-6 - Li Yin, Baigang Mi, Yi Fan:

GateM2Net: A Gated Multi-Modal Network for Joint Emotion and Sentiment Analysis. 1-6 - Lingxing Chen, Yang Gu, Yi Guo, Jianqi Chen, Yingting Zhu, Yehong Zhuo, Dongmei Jiang, Yiqiang Chen:

DTAD: A Distribution-Transformed Supervised Anomaly Detection Method. 1-6 - Chenrui Liu, Zhichao Lian:

Latent Diffusion-based Face Anonymization with Identity and Attribute Decoupling. 1-6 - Luyao Ren, Wenxin Yu, Zhiqiang Zhang, Chang Liu, Jun Gong:

ECAIF: Efficient Context Aware Information Fusion Network for Medical Image Segmentation. 1-6 - Yan Hong, Chao He, Zhibo Rao, Zhen Chen, Nan Li, Congxuan Zhang:
WCG-Net: Warping Consistency Compensation Guided Multi-Feature Fusion For Stereo Matching. 1-6 - Qiuran Li, Yi Luo, Yan Sun, Tong Wu, Aiguo Chen:
Trustworthy Localized Corrections-guided Mutual Learning for Multi-View Learning. 1-6 - Menglin Zhang, Xiaoxin Guo, Bohao Qu, Xiaofeng Cao, Shuifa Sun, Qing Guo:

PhysLight: Accurate rPPG Heart Rate Measurement with Adaptive Video Relighting. 1-6 - Fangkai Li, Hao Hu, Feiyu Pan, Yanzhen Wang, Yiyou Guo, Xiankai Lu:

Context-Enhanced Zero-Shot Video Temporal Grounding with Adaptive Boundary Refinement. 1-6 - Chuankai Xu, Junhao Li, Ruxin Wang:

Mutual Teaching: Semi-supervised Medical Image Classification with Cross Structural Consistency Learning. 1-6 - Qi Shen, Liu Yang, Canguang Ruan:

MG-STK: Weakly Supervised Multi-Granularity Learning Guided by Semantic Topological Knowledge. 1-6 - Zhe Lei, Jie Zhang, Jingtao Li, Tianwei Zhang, Haibin Kan, Weiming Zhang, Nenghai Yu:

Aparecium: Revealing Secrets from Physical Photographs. 1-6 - Tao Lu, Shangyang Li:

Harnessing Pre-trained Language Models for EEG-based Epilepsy Detection. 1-6 - Qiang Zhang, Jiahang Cao, Jingkai Sun, Yecheng Shao, Gang Han, Wen Zhao, Yijie Guo, Renjing Xu:

ES-Parkour: Advanced Robot Parkour with Bio-inspired Event Camera and Spiking Neural Network. 1-6 - Yujun Wu, Chen Wang, Meixuan Chen, Tongguan Wang, Ying Sha:

Incongruity-aware Cross-modal Interaction Network for Multimodal Sarcasm Detection. 1-6 - Yizhang Yang, Jinshi Cui, Xi Guo, Xing Su, Wei Ni, Junshi Lu, Li Wang, Huimin Ma:

Gaze4ASD: A Novel Dataset and Visual Saliency Map-Based Method for Autism Screening. 1-6 - Ziyun Cai, Yawen Huang, Jie Song, Chang-Hui Hu, Tengfei Zhang:

Make Multi-source Task Greater Again: Adaptive Causal Diffusion Strategy. 1-6 - Fei Gao, Jiaqi Shi, Yuhao Lin, Xiaodan Zhang, Lihuo He, Nannan Wang:

Mixture-of-Modality-Experts for Unified Image Aesthetic Assessment with Multi-Level Adaptation. 1-6 - Weiguang Zhao, Chaolong Yang, Jianan Ye, Rui Zhang, Yuyao Yan, Xi Yang, Bin Dong, Amir Hussain, Kaizhu Huang:
From 2D Images to 3D Model: Weakly Supervised Multi-View Face Reconstruction with Deep Fusion. 1-6 - Yixiao He, Haifeng Sun, Qi Qi, Zirui Zhuang, Pengfei Ren, Huazheng Wang, Yafeng Nan, Jing-Yu Wang:

Mitigating Object Hallucination in Large Vision-Language Models via Visual Attention Direct Preference Optimization. 1-6 - Yiwen Guan, Viet Anh Trinh, Vivek Voleti, Jacob Whitehill:

Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy? 1-6 - Aotian Zheng, Jenq-Neng Hwang, Rania Hussein, Farron Wallace, Kelsey Magrane, Lauren Shiosaka:

Target Distribution Agnostic Domain Adaptation for in-the-Wild Image Classification under Both Domain and Label Shifts. 1-6 - Rongduo Han, Cihan Ruan, Shunye Tang, Haoyu Wu, Nam Ling, Haining Zhang:

Tactile Information Coding for DNA Storage with Prospects for AI Applications. 1-6 - Tianpei Zhang, Yiming Zhu, Jufeng Zhao, Guangmang Cui, Yuchen Zheng:

Exploring State Space Model in Wavelet Domain: An Infrared and Visible Image Fusion Network via Wavelet Transform and State Space Model. 1-6 - Qingzheng Wang, Jiazhi Xie, Ning Li:

Structure-Guided Camouflaged Object Detection with Progressive Enhancement Strategy. 1-6 - Naihao Wang, Can Zhang, Yunfeng Liu, Wentao Chen, Ruirui Li:

Double-Shrink: Enhancing Model Robustness under SDN Noise by Reducing Uncertain Confidence. 1-6 - Xiaoqian Han, Guanglin Niu, Mingliang Zhou, Xiaowei Zhang:

Knowledge Distilled Group Prompts Learning for HOI Detection with Large Vision-Language Models. 1-6 - Wenbin Yan, Qingwei Wu, Hua Chen, Xiaogang Zhang, Shengjie Hu:

Consolidating Selective SSM with Spatial-Angular and Bidirectional Structural Fusion Perception for Light Field Semantic Segmentation. 1-6 - Meng-Lun Yu, Wen-Jiin Tsai:

Automatic Natural Image Matting via Dual Encoder Aggregation. 1-6 - Xinru Ying, Jiaqi Mo, Jingyang Lin, Canghong Jin, Fangfang Wang, Lina Wei:

MamFusion: Multi-Mamba with Temporal Fusion for Partially Relevant Video Retrieval. 1-6 - Xian Gao, Luyang Wang, Jiacheng Ruan, Yuyang Zhang, Zongyun Zhang, Ting Liu, Yuzhuo Fu:

NLOSdiffuser: Generalized Steady-State Non-Line-of-sight Imaging toward Indoor Scenarios. 1-6 - Xiaoyan Wang, Zeju Li, Yifan Xu, Jiaxing Qi, Zhifei Yang, Ruifei Ma, Xiangde Liu, Chao Zhang:

Spatial 3D-LLM : Exploring Spatial Awareness in 3D Vision-Language Models. 1-6 - Kang Fu, Huiyu Duan, Zicheng Zhang, Xiaohong Liu, Xiongkuo Min, Jia Wang, Guangtao Zhai:

SI23DCQA: Perceptual Quality Assessment of Single Image-to-3D Content. 1-6 - Qiang Xu, Lixuan Meng, Guangjie Zhang, Wei Gao, Ge Li:

Coding-Free Multiscale Latent Variables for Lossless Point Cloud Attribute Compression. 1-6 - Mohsen Abdoli, Ramin G. Youvalari, Frank Plowman, Alexandre Tissier:

Merge Mode for Template-based Intra Mode Derivation (TIMD) in ECM. 1-6 - Zhixian He, Pengcheng Zhao, Shujin Lin:

QTG-VQA: Question-Type-Guided Architectural for VideoQA Systems. 1-6 - Shuai Cheng, Lin Wang, Xiaoshuai Hao, Wanqian Zhang, Xiaohua Chen, Wei Wang:

Multi-Granularity Based Collaborative Learning for Semi-Supervised Hashing. 1-6 - Jingxuan Zhang, Libao Zhang:

Clouds and Haze Co-Removal Based on Saliency-Guided Multi-Scale Diffusion Model for Remote Sensing Images. 1-6 - Siyu Cheng, Chao Yang, Bin Jiang:

Multi-Passage Retrieval-Augmented Multimodal Language Generation Model for Knowledge-Based Visual Question Answering. 1-6 - Zailong Chen, Peng Gao, Yujian Lee, Johan Barthelemy, Luping Zhou, Lei Wang:

Optimizing Efficiency and Visual-Textual Alignment for LLM-Based Radiology Report Generation. 1-6 - Disen Hu, Xun Jiang, Zhe Sun, Hao Yang, Chong Peng, Peng Yan, Xing Xu:

Social Optimum Assisted Gradient Modulation for Imbalanced Multimodal Learning. 1-6 - Zeyu An, Zichong Chen:

Rectified Mixed-Label Learning for Semi-Supervised Medical Image Segmentation. 1-6 - Xing Wei, Dexuan Zhao, Fan Yang, Taizhang Hu, Chong Zhao, Yang Lu:

Redundancy Optimization via Mutual Information for Unsupervised Domain Adaptation. 1-6 - Yixuan Ye, Yang Zhang, Liang Peng, Rui Li, Cheng Liu, Si Wu, Hau-San Wong:
Cross-View Neighborhood Contrastive Multi-View Clustering with View Mixup Feature Learning. 1-6 - Zhiyuan Zhao, Bin Wang, Linke Ouyang, Xiaoyi Dong, Jiaqi Wang, Conghui He:

Beyond Multimodal Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization. 1-6 - Yi-Hsin Chen, Kuan-Wei Ho, Martin Benjak, Jörn Ostermann, Wen-Hsiao Peng:
Conditional Residual Coding with Explicit-Implicit Temporal Buffering for Learned Video Compression. 1-6 - Xiongbo Lu, Yaxiong Chen, Shengwu Xiong:

AnyArtisticGlyph: Multilingual Controllable Artistic Glyph Generation. 1-6 - Dan Luo, Chengyuan Ma, Weiqin Li, Jun Wang, Wei Chen, Zhiyong Wu:

AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis. 1-6 - Jizhou Wang, Xiaodan Fang, Lei Huang, Yongfeng Huang:

TaxAgent: How Large Language Model Designs Fiscal Policy. 1-6 - Kang Fu, Zicheng Zhang, Huiyu Duan, Xiaohong Liu, Xiongkuo Min, Jiarui Wang, Guangtao Zhai:

VIP-PCQA: A Multi-Modal Framework for No-reference Point Cloud Quality Assessment. 1-6 - Ziqi Shi, Fan Lyu, Ye Liu, Fanhua Shang, Fuyuan Hu, Wei Feng, Zhang Zhang, Liang Wang:

Controllable Continual Test-Time Adaptation. 1-6 - Ziqi Gu, Chunyan Xu, Zhen Cui:

CLIP-driven Few-Shot Continual Learning. 1-6 - Wangjin Zhou, Tianjiao Du, Wenhao Guan, Meng Xiao, Chenglin Xu, Yi Zhao, Tatsuya Kawahara:

InvoxSVC: Any-to-any Zero-shot Singing Voice Conversion with In-Context Learning in Latent Flow Matching. 1-6 - Jinming Liu, Junhao Geng, Lexiang Lv, Wenjun Zeng, Xin Jin:

Representation Disentanglement for Semantic Coding. 1-6 - Qishan Wang, Shuyong Gao, Junjie Hu, Jiawen Yu, Xuan Tong, You Li, Wenqiang Zhang:

HSS-IAD: A Heterogeneous Same-Sort Industrial Anomaly Detection Dataset. 1-6 - Hao Xi, Meiqin Liu, Zechen Yang, Ping Wei:

Achieving Seamless Camouflage: Attention Fusion Diffusion Model for Image Synthesis. 1-6 - Chongyang Xu, Runtian Zheng, Ziliang Feng, Chengfang Zhang:

Multi-Scale Hypergraph Relational Reasoning for Weakly Supervised Recognition of Group Activities. 1-6 - Zhan Li, Mingyu Zhao, Xin Dong, Haibin Ling, Bingyao Huang:

CAPAA: Classifier-Agnostic Projector-Based Adversarial Attack. 1-6 - Yuhan Zhou, Xiaotian Li, Baojie Fan:

AdaptiveFusion: LiDAR-Camera Adaptive Fusion for 3D Object Detection. 1-6 - Yuming Chen, Rongyu Chen, Zhongqun Zhang, Yihua Cheng, Hyung Jin Chang:

Multi-Hypothesis 3D Hand Mesh Recovering from a Single Blurry Image. 1-8 - Jiaming He, Wenbo Jiang, Guanyu Hou, Qiyang Song, Ji Guo, Hongwei Li:
Weaponizing Tokens: Backdooring Text-to-Image Generation via Token Remapping. 1-6 - Linkai Liu, Xiaoyan Xiao, Yijian Yang, Yuchen Zhou, Zipeng Guo, Chao Gou:

Boosting Road Event Detection with Adaptive Multi-Modal Models. 1-6 - Yibin Wang, Yucan Zhou, Xiaoyan Gu, Weiping Wang:

FLR: Feature-based Label Recovery in Federated Learning with Classifier-free Communication. 1-6 - Lingxiao Kong, Jiahui Jiang, Wenchao Xu, Haozhao Wang, Ruixuan Li:

FedRF: Input-side Client Drift Mitigation for Federated Learning via Reusing Features. 1-6 - Yang Liu, Zhangtao Cheng, Bin Chen, Yan Liu, Xing He, Ting Zhong, Fan Zhou:

Missing Pieces, Complete Picture: Navigating Micro-Video Popularity with Flexible Mixture of Modality Experts. 1-6 - Jinliang Han, Xiongkuo Min, Wei Sun, Guangtao Zhai:

Perceiving Smoothness: Temporal Consistency Learning for Multi-Frame-Rate Video Quality Assessment. 1-6 - Zhanpeng Liu, Yuqiang Zhang, Tianlong Yu, Xi Lin, Yang Yang, Chenxi Huang, Bin Wang:

Feature and Temporal Disruption Attacks from Images to Videos. 1-6 - Yunhe Feng, Lingren Wang, Jiaxin Wang:

DWS-FedSeg: A Federated Learning Framework for Automatic Segmentation of CT and MRI Images. 1-6 - Meghna Kapoor, Vinit Jakhetiya, Badri Narayan Subudhi, Ankur Bansal, Weisi Lin:

Feature Affinity based Clustering for Test-Time Adaptation for Image Quality Assessment. 1-6 - Zhengli Shi, Chenhao Lin, Zhengyu Zhao, Peter Peer, Chao Shen:

Evading Deepfake Detectors via Adversarially Degrading and Restoring Forged Images. 1-6 - Xixiang He, Hao Yu, Qiyao Sun, Ao Cheng, Tailai Zhang, Cong Liu, Shuxuan Guo:

TACOS: Open Tagging and Comparative Scoring for Instruction Fine-Tuning Data Selection. 1-6 - Da Li, Di Zhou, Yishan Zou, Shenghua Li, Meng Liu:

MAPLE: Modality-Agnostic Prototype Learning for Egocentric Action Recognition. 1-6 - Xu Yang, Zhuo Tang, Boyao Hao, Xiong Xiao, Jiapeng Zhang:

FedMPQ: Secure and Efficient Federated Learning with Multi-codebook Product Quantization. 1-6 - Mingliang Xue, Chong Cao, Zhengyang Zhao, Xiaodong Duan, Shu Cao:

Visual-Textual Feature Learning for Rare Human-Object Interactions Detection. 1-6 - Tristan Tsoi, Jiajun Deng, Yaolong Ju, Benno Weck, Holger Kirchhoff, Simon Lui:

CrossMuSim: A Cross-Modal Framework for Music Similarity Retrieval with LLM-Powered Text Description Sourcing and Mining. 1-6 - Chun-Shuo Qiu, Feng-Lin Liu, Hongbo Fu, Fan Zhang, Yan-Pei Cao, Yu-Kun Lai, Lin Gao:

Audio-Driven Emotion-Aware 3D Talking Face Generation from Single Image. 1-6 - Jianfeng Guan, Haoyang Meng, Yizhong Hu, Pengcheng Wang, Kexian Liu:
MPCSFL: A Privacy-Preserving Split Federated Learning Framework in Edge Network. 1-6 - Zhi Yang, Chunyang Ma, Liejun Wang, Zhiqing Guo:

Insulator Defect Detection Method Based on Lightweight Feature Extraction and Efficient Cross-Scale Fusion. 1-6 - Yuheng Shao, Zhangkai Ni, Qinyuan Liu:

D2AD: Diffusion Distillation for Unsupervised Image Anomaly Detection. 1-6 - Yiqing Xu, Liwei Liao, Ronggang Wang:

NVPose: Novel View Data Augmentation for Human Pose Estimation. 1-6 - Haoyu Guan, Qianzi Yu, Kai Zhu, Yang Cao, Yu Kang:

FastAno: Accelerating Defect Image Generation with Efficient Sampling. 1-6 - Xiaoqiang Cui, Kaixuan Hou, Jianping Luo:

Lightweight Video Super-Resolution Network Based on Pyramid Optical Flow Extraction and Alignment. 1-6 - Wang Zhang, Huaqiu Li, Xiaowan Hu, Tao Jiang, Zikang Chen, Haoqian Wang:

Measuring and Controlling the Spectral Bias for Self-Supervised Image Denoising. 1-6 - Xingxing Yang, Jie Chen, Zaifeng Yang:

Learning Physics-Informed Color-Aware Transforms for Low-Light Image Enhancement. 1-6 - Dongdong Gong, Tengfei Gong, Yaxiong Chen, Jinglin Yuan, Shengwu Xiong:

Exploring Flexibility in Incremental Few-Shot Object Detection. 1-6 - Guoqing Zhu, Xiaojie Gan, Lingye Zhao, Luojun Lin:
Assessing the Generalizability of Deep Models without Out-of-Distribution Data. 1-6 - Wenwei Luo, Yuguo Hu, Jiafu Yan, Mengmeng Jing, Lin Zuo:

Diffusion-Driven Source Consistency for Gradual Domain Adaptation. 1-6 - Dong Liu, Yifan Yang, Zixiong Huang, Yuxin Gao, Mingkui Tan:

CHRIS: Clothed Human Reconstruction with Side View Consistency. 1-6 - Shixiang Cai, Liangzhen Liu, Zhirui Kuai, Li Kuang, Lingyan Zhang:

DBE: Dual Branch re-Extraction for Unseen Diffusion-Generated Image Detection. 1-6 - Longjuan Sun, Xixia Xu, Dongchen Zhu, Jiamao Li:

The Motion in the Details: Adapting CLIP for Action Recognition via Dual-prompt Guidance. 1-6 - Mingkai Sheng, Jichao Wang, Yi Liu, Wen Cheng, Lingfang Zeng:

3D-Contrastive Anchors and Structure Enhancement for Multi-modal Representations. 1-6 - Ruhui Zhang, Hezhe Qiao, Pengcheng Xu, Mingsheng Shang, Lin Chen:

Semantic-guided Representation Learning for Multi-Label Recognition. 1-6 - Yuqi Liao, Aodong Li, Yisha Chen, Qianfang Xu, Jiarui Xie, Anxin Li, Bo Xiao:

Learning from Noisy Data Using Pretrained Vision-Language Representations. 1-6 - Zhiying Yan, Yiyuan Liang, Shilv Cai, Tao Zhang, Sheng Zhong, Luxin Yan, Xu Zou:

Divide-And-Conquer: Dual-Hierarchical Optimization for Semantic 4D Gaussian Spatting. 1-6 - Xinyu Liu, Zhenghao Qi, Rong Ding:
OGS-Mapping: Object-Level 3D Gaussian Splatting Mapping. 1-6 - Yufei Tang, Daiheng Gao, Pingyu Wu, Wenbo Zhou, Bang Zhang, Weiming Zhang:

Beyond Sliders: Mastering the Art of Diffusion-based Image Manipulation. 1-6 - Zhen Tan, Wei Wei:

ConAvatar: Harnessing Facial Mesh for Controllable Avatar Animation. 1-6 - Lanning Zhang, Yali Shi, Shujie Lan, Fei Gao, Hao Qin, Nannan Wang:

ReCLIP: Reconstruction-Refined Zero-/Few-Shot Anomaly Classification and Segmentation. 1-6 - Haoquan Wang, Yong Chen, Shengbo Chen, Hong Rao:

CLAP: Overcoming Language Priors via Contrastive Learning and Answer Perturbation. 1-6 - Tengjun Liu, Qianbin Guo, Xuanchi Gong, Huan Zhang, Xianyi Chen:

Mimicing Real-world Knowledge to Generate 3D Adversarial Point Clouds. 1-6 - Lixuan Meng, Qiang Xu, Shan Liu, Wei Gao, Ge Li:

Towards Specialized and Generalizable Geometry Restoration of Compressed Point Clouds. 1-6 - Dongfang Zhao, Menghe Zhang, Yangwen Liang, Shuangquan Wang, Kee-Bong Song, Donghoon Kim:

Mobile-StereoHPE: Real-Time Mobile 3D Hand Pose Estimation from Stereo Gray Images. 1-6 - Mingwei Xing, Yao Wu, Yachao Zhang, Yanyun Qu:

Prompt-driven Multi-modal Unsupervised Domain Adaptation for 3D Semantic Segmentation. 1-6 - Ye Zhou, Wenfei Yang, Tianzhu Zhang, Xiang Liu:

Prototype Optimal Transport for Box-Supervised 3D Instance Segmentation. 1-6 - Tianle Fang, Zhenbing Liu, Yutao Tang, Yingxin Huang, Haoxiang Lu, Chuangtao Zheng:
RDFNet: Real-time Object Detection Framework for Foggy Scenes. 1-6 - Qingxue Zhao, Zhongjie Pan, Di Wu, Ge Tang, Jun Tian:

RLK-Net: An Efficient Residual Large Kernel Convolution with Channel-Wise Adaptive Feature Fusion for Medical Image Segmentation. 1-6 - Jingqi Qu, Hui Yu, Dongchen Zhu, Jiamao Li:

Mutual Semantic Bridged Tri-Tower Fusion for Audio-Visual Segmentation. 1-6 - Ming Zhao, Pingping Liu, Tongshun Zhang, Zhe Zhang:

ReF-LLE: Personalized Low-Light Enhancement via Reference-Guided Deep Reinforcement Learning. 1-6 - Yang Wei, Yi Pan, Limai Jiang, Juan He, Bokai Yang, Yufu Huo, Yunpeng Cai, Ruitao Xie:

Lesion Localization for Medical Imaging Using Counter-factual Generation Prompt Learning. 1-6 - Zhiqiang Yuan, Jiapei Zhang, Ying Deng, Yeshuang Zhu, Jie Zhou, Jinchao Zhang:

VSD2M: Large-scale Vision-language Sticker Dataset for Multi-frame Animated Sticker Generation. 1-6 - Shihao Dong, Xiaotong Zhou, Yuhui Zheng, Huiying Xu, Xinzhong Zhu:

Center-Oriented Prototype Contrastive Clustering. 1-6 - Wei-Xin Chen, Yong-Yong Chen, Shi-Chao Kan:

Scene Graph Generation with Large Vision-Language Model and Its Applications. 1-6 - Peijin Guo, Minghui Li, Hewen Pan, Ruixiang Huang, Lulu Xue, Shengqing Hu, Zikang Guo, Wei Wan, Shengshan Hu:

Multi-Modality Representation Learning for Antibody-Antigen Interactions Prediction. 1-6 - Mingya Zhang, Liang Wang, Limei Gu, Tingsheng Ling, Xianping Tao:

WT-BCP: Wavelet Transform based Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation. 1-6 - Chenbo Zhang, Yinglu Zhang, Jihong Guan, Shuigeng Zhou:

DuMo: A Dual-Model Framework for Effective Long-tailed Object Detection. 1-6 - Nana Zhang, Qian Liu, Dandan Zhu, Kun Zhu, Xiongkuo Min, Guangtao Zhai:

LD2Scan: A Lightweight Dual-Temporal Constrained Scanpath Prediction Model for Omnidirectional Images. 1-6 - Xuancheng Xu, Ming Tao, Bing-Kun Bao:

CLGC: Continuous Layout Guidance for Consistent Text-to-Video Editing. 1-6 - Shuoxi Zhang, Hanpeng Liu, Stephen Lin, Kun He:

Hyperspherical Dataset Distillation via Contrastive Embedding Alignment. 1-6 - Pei Wang, Xiaotong Luo, Zekun Ai, Yanyun Qu:

S3SR: Towards Efficient Image Super-Resolution with Selective State Space Model. 1-6 - Bingjie Gao, Bo Zhang, Li Niu:

Object Placement for Anything. 1-6 - Xuehui Liang, Ruomei Wang, Baoquan Zhao, Jiawei Feng:

Dynamic Feature-Focusing with Cross-Modal Semantic Alignment for Video Moment Retrieval and Highlight Detection. 1-6 - Yu Cai, Tianjiao Jing, Chang Liu, Zhengxuan Lian, Shi-Sheng Huang, Hua Huang:

Spatio-Temporally Consistent Depth Estimation for Dynamic Scenes using 3D Scene Flows. 1-6 - Xiaorui Jiang, Yu Gao, Hengwei Xu, Qi Zhang, Yong Liao, Peng Yuan Zhou:

Knowledge Rumination for Client Utility Evaluation in Heterogeneous Federated Learning. 1-6 - Tuo Xiong, Suping Wu, Xiang Zhang, Ruijie Peng, Bing Wang, Xitie Zhang, Zhijian Duan:

Complementary Multi-dimensional Variance Attention Learning for 3D Human Mesh Reconstruction from Videos. 1-6 - Xianjie Liu, Keren Fu, Yao Jiang, Qijun Zhao:

Promoting Segment Anything Model towards Highly Accurate Dichotomous Image Segmentation. 1-6 - Hanjing Zhou, Mingze Yin, Danny Z. Chen, Jian Wu, Jintai Chen:

Group-On: Boosting One-Shot Segmentation with Supportive Query. 1-6 - Hongyan An, Kuan Zhu, Xin He, Haiyun Guo, Chaoyang Zhao, Ming Tang, Jinqiao Wang:

FOCUS: Fine-grained Optimization with Semantic Guided Understanding for Pedestrian Attributes Recognition. 1-6 - Bingbing Dan, Xinyu Tian, Meihui Li, Tao Tang, Jing Zhang:

IRSTS Generalist: Improving Generalization in Infrared Small Target Segmentation Using One Shot. 1-6 - Xianchao Zhang, Senqi Guan, Yunlong Gao, Linlin Zong, Wenxin Liang, Xinyue Liu:

Effective Linear Vision Transformer Via Selective Sampling Softmax and Multi-Feature Enhancement. 1-6 - Zongyi Xu, Xinqi Jiang, Xinyu Gao, Shanshan Zhao, Qianni Zhang, Weisheng Li, Xinbo Gao:

ALCReg: Active Label Correction for Partial Point Cloud Registration. 1-6 - Junjie Li, Jianghong Ma, Xiaofeng Zhang, Yuhang Li, Jianyang Shi:

GiVE: Guiding Visual Encoder to Perceive Overlooked Information. 1-6 - Li Yin, Baigang Mi, Yi Fan:

Uncertainty-guided Multi-modal Sequential Recommendation. 1-6 - Yunrui Jian, Yi Xue, Yue Huang, Xueli Cheng, Weilun Feng, Zhenan Lin, Chao Zhou:

Content-Adaptive Motion Compensated Temporal Filter for Versatile Video Coding. 1-6 - Fei He, Houji Du, Fan Zhang, Yipeng Liu, Ce Zhu:

Incrementally Constrained Tucker Decomposition for Feature Extraction of Structural Diffusion Tensor Imaging Data. 1-6 - Chao Sun, Yaobo Liang, Yaming Yang, Shilin Xu, Tianmeng Yang, Yunhai Tong:

Direct Preference Optimization for LLM-Enhanced Recommendation Systems. 1-6 - Di Niu, Enyuan Zhao, Jie Nie, Min Ye, Shusong Yu, Xinyue Liang:

Time-Series Anomaly Detection Method Based on Frequency-Domain Decoupling and Correction. 1-6 - Shizhou Huang, Bo Xu, Changqun Li, Yang Yu, Xin Lin:

Low-Redundancy Knowledge Generation and Modality-Aware Interaction for Multimodal Information Extraction in Social Media. 1-6 - Shuguo Hu, Jun Hu, Junwei Lv, Huaiwen Zhang:

EarlyMix: Hierarchical Mixing for Early Time Series Classification. 1-6 - Maoyi Xiong, Jun-Jie Huang, Zihan Chen, Tianrui Liu, Xueqiong Li, Lin Liu, Wentao Zhao, Yuhua Tang:

DFDUN: Deep Infrared and Visible Image Fusion with Diffusion Prior Unfolding Network. 1-6 - Junlin Chen, Qiushan Guo, Ka Chun Cheung, Mingrui Liang, Dezhi Chen:

TEVLA: Text-oriented Enhancement for Vision-Language Alignment in Relation Extraction. 1-6 - Tong Ji, Yunting Tao, Fanyu Kong, Guoyan Zhang, Yuliang Shi, Jia Yu:

Privacy-Preserving Gait Authentication Scheme Based on Partial Euclidean Distance in Cloud Computing. 1-6 - Hongyi Liu, Xiaosong Huang, Mengxi Jia, Lingzhe Zhang, Tong Jia, Zhonghai Wu, Ying Li:

AAAD: Asynchronous Inter-Variable Relationship-Aware Anomaly Detection for Multivariate Time Series. 1-6 - Yixu Huang, Rui Zhong, Ségolène Rogge, Adrian Munteanu:

Global-to-Local Color Correction with Full-Region Coverage for Multi-view Light Field Images. 1-6 - Hu Li, Long Long, Lin Cheng, Zichen Liu, Jing Wang, Yucheng Zhang, Feng Dai:

LM-net: Integrating Linear Temporal Features and Multi-Scale Attention for Crop Yield Estimation. 1-6 - Fan Li, Xuan Wang, Min Qi, Zhaoxiang Zhang, Chengming Xu, Yuelei Xu:

Unlocking Instance Semantic Awareness for Domain Adaptive Semantic Segmentation. 1-6 - Zhuan Shi, Jing Yan, Xiaoli Tang, Lingjuan Lyu, Boi Faltings:

RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model. 1-6 - Zining Qin, Chenhao Wang, Jianxiong Guo, Huiling Qin, Weijia Jia:

Brainstorming Brings Power to Large Language Models of Knowledge Reasoning. 1-6 - Lingzhi Shen, Yunfei Long, Xiaohao Cai, Guanming Chen, Yuhan Wang, Imran Razzak, Shoaib Jameel:

LL4G: Self-Supervised Dynamic Optimization for Graph-Based Personality Detection. 1-6 - Yan Xing, Qi'ao Xu, Zongyu Guo, Rui Huang, Yuxiang Zhang:

GTPC-SSCD: Gate-guided Two-level Perturbation Consistency-based Semi-Supervised Change Detection. 1-6 - Song Zhao, Shuhua Wang, Xiaobing Zhou:

LKPM: Large Kernel Point Mamba for 3D Point Clouds. 1-6 - Mingjun Li, Zeming Zhuang, Feng Su:

Scene Text Image Super-Resolution with Visual Text Cues Transfer and Enhancement. 1-6 - Kun Wang, Donglin Di, Tonghua Su, Lei Fan:

EFDiT: Efficient Fine-grained Image Generation Using Diffusion Transformer Models. 1-6 - Yongqi Chen, Lin Zhao, Rizhao Cai, Zitong Yu, Changsheng Chen, Bin Li:

Forensicability Assessment: Not All Samples Qualify for Recapture Detection. 1-6 - Hongxu Li, Xiaodi Li, Fulin Su, Qinglang Guo:

Multi-Level Graph Pruning-Based Framework for Graph Retrieval-Augmented Generation. 1-6 - Desen Wang, Zhiming Chen, Xiang Qiu, Yishu Liu, Bingzhi Chen:

Enhancing Few-Shot Class-Incremental Learning via Cross-Modal Bias Alignment. 1-6 - Zhaoquan Yuan, Chengbin Zhao, Yuting Tang, Lishu Guo, Xiao Wu, Changsheng Xu:

Contrastive Invariant Risk Minimization for Grounded Situation Recognition. 1-6 - Yuqi Wang, Bob Zhang:

A GAN-based Data Poisoning Backdoor Attack Method for Palmprint Recognition CNNs. 1-6 - Yi Yang, Xinzhu Li, Yufeng Chen, Guanghui Yue, Wei Zhou, Zhuo Su, Ruomei Wang, Fan Zhou, Baoquan Zhao:

MCSMoG: Multi-Conditional Diffusion for Stylized Motion Generation with Parametric Control. 1-6 - Hao Li, Jingxuan Zhou, Jinlong Wang, Jiangmeng Li, Xiongxin Tang, Fanjiang Xu:

Learning Adaptive High-Frequency Semantic Guidance for Low-light Image Enhancement. 1-6 - Huikai Liu, Junyin Wang, Wenqian Zhu, Bin Fu, Shengwu Xiong, Cheng Liu:

MDC: Modality Distribution Consistent Distillation for Multi-View 3D Object Detection. 1-6 - Haiyan Wei, Hangrui Xu, Bingxu Zhu, Yulian Geng, Aolei Liu, Wenfei Yin, Jian Liu:

Patch-Wise Hypergraph Contrastive Learning with Dual Normal Distribution Weighting for Multi-Domain Stain Transfer. 1-6 - Xiaolan Tang, Yan Xiang, Zhengtao Yu, Yuxin Huang:

VFFG-CL: Virtual Fusion Feature Generation with Curriculum Learning for Missing-Modality Emotion Recognition. 1-6 - Xiufeng Liu, Zhongqiu Zhao, Yi Yang, Donghui Hu, Zhao Zhang:

SAM2-Cap: Segment Anything 2 with using Parts and Object Spatial Hierarchical Relationships for Image Segmentation. 1-6 - Yan Ma, Ruijie Peng, Suping Wu:

WL-MVSNet: Frequency-Aware and Regularized Learning for Multi-View Stereo. 1-6 - Siting Chen, Weijie Chen, Jiji Tang, Rongsheng Zhang, Xiaoshuai Sun:

InterID: Improving Multi-ID Interaction for Personalized Image Generation. 1-6 - A. V. Subramanyam:

Beckman Adversarial Defense. 1-6 - Xiqiao Fang, Qingfeng Wu, Lu Cao:

CI-MER: A Novel Causal Intervention Framework For Micro-Expression Recognition. 1-6 - Zhonghao Zhang, Ruonan Zhang, Libo Liu:

UniBind: Leveraging LLM-Augmented Knowledge Base for Scene Integration. 1-6 - Long Ma, Zhiyuan Yan, Qinglang Guo, Yong Liao, Haiyang Yu, Pengyuan Zhou:

Detecting AI-Generated Video via Frame Consistency. 1-6 - Yao Wang, Shuang Li, Ganggang Dong, Hongwei Liu:

CSDet: Clutter Suppression-Aided SAR Inshore Ship Detection Network. 1-6 - Yang Chang, Aoxing Li, Yuxuan Lin, Jianan Wang, Lizheng Liu, Yang Liu, Jing Liu, Liang Cao, Yan Wang, Zhongxue Gan, Wenqiang Zhang:

Towards Advanced Emotional Care: Embodied Emotional Care System for Humanoid Robots. 1-6 - Yunhan Ren, Ruihuang Li, Lingbo Liu, Changwen Chen:

Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling. 1-6 - Yanlin Xu, Yiwei Ru, Dongsen Zhang, Yongji Liu, Zhenan Sun:

Contextualizing Borderline ECG Analysis via Multi-Modal Feature Extraction and Large Language Model Inference. 1-6 - Yichen Guo, Rui Ding, Mai Xu, Lai Jiang, Shengxi Li, Xin Deng:

Quality Control For HEVC: A Deep Reinforcement Learning Approach. 1-6 - Huichuan Huang, Zhiqing Zhong, Guangyu Wei, Yonghao Wan, Wenlong Sun, Aimin Feng:

Bi-Grid Reconstruction for Image Anomaly Detection. 1-6 - Zhangye Han, Xun Jiang, Zheng Wang, Xin Liu, Fumin Shen, Xing Xu:

Egocentric Online Action Segmentation with Behavior-Centred Feature Augmentation. 1-6 - Hongyang Chen, Yuhong Yang, Xinmeng Xu, Xingyu Liu, Weiping Tu, Zhongyuan Wang, Cedar Lin, Xin Zhao:

PGD-N2L: A Parameter-Guided Disentanglement Approach for Normal-To-Lombard Speech Conversion. 1-6 - Shihui Zhang, Junbin Su, Jiawei Zhang, Ziteng Xue, Zhipeng Zhang:

TGATrack: Template-Guided Low-Rank Adaption for Robust RGB-T Tracking. 1-6 - Wenyu Wang, Yiquan Zhou, Jihua Zhu, Hongwu Ding, Jiacheng Xu, Shihao Li:

AVENet: Disentangling Features by Approximating Average Features for Voice Conversion. 1-6 - Weifei Jin, Junjie Su, Hejia Wang, Yulin Ye, Jie Hao:

Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization. 1-6 - Ziyu Zhao, Zilu Guo, Jun Du, Feng Ma, Jia Pan:

An Investigation on Audio-Prompt and Structure Guided Long-Duration Music Generation Based on Diffusion Models. 1-6 - Jiagen Li, Rui Yu, Huihao Huang, Huaicheng Yan:

Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion. 1-6 - Weiqing Xiao, Fengjun Zhong, Hao Zhao:

Uncertainty-Guided Iterative Architecture for Stereo Matching. 1-6 - Mengchao Liu, Chao Yang, Bin Jiang, Chenglong Lei:

Advancing Multi-Hop Question Answering via Alternating Retrieval and Reasoning over Multi-view Knowledge Integration. 1-6 - Qiao Wang, Menghao Zhang, Lei Zhang, Qi Qi, Haifeng Sun, Pengfei Ren, Bo He, Jing Wang, Jingyu Wang:

Masked Self-Supervised Learning and Semantic Noise Separation for Video Anomaly Detection. 1-6 - Yunpeng Zhou, Qiwen Liang, Xin Li, Jianping Ren, Liujinxiang Zhu, Shuhua Liu:

CLIP-based Robust Pedestrian Attribute Recognition via Attribute Localization and Data Augmentation. 1-6 - Fei Su, Cancan Li, Juan Liu:

Enhanced Self-Supervised Multi-View Representations with Modality-Missing Robustness for Audio-Visual Speech Recognition. 1-6 - Qiang Wan, Sanshuai Cui, Anjie Peng, Hui Zeng, Rong Wei:

Boosting Adversarial Transferability by Constructing Adversarial Trajectories. 1-6 - Xiangrui Yang, Zhihao Zeng, Jiawei Yang, Yekang Zhan, Qiang Cao, Jie Yao:

HeteroGNN: A Heterogeneous Stage Division Based GNN Training Framework to Maximize CPU-GPU Parallelism. 1-6 - Shuhao Zhang, Bo Cheng, Jiale Han, Yuli Chen, Zhixuan Wu, Changbao Li, Pingli Gu:

CEFW: A Comprehensive Evaluation Framework for Watermark in Large Language Models. 1-6 - Yunlong Zhao, Xiyun Li, Ziyi Wang, Haoran Wu, Minglun Han, Bo Xu:

Integrate-and-Fire Compressor: Learning to Compress Context for LLMs Adaptively. 1-6 - Jingwei Sun, Xuchong Zhang, Changfeng Sun, Qicheng Bai, Hongbin Sun:

Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection. 1-6 - Yang Dong, Zhuoqi Ma, Zejun You, Yunan Li, Qiguang Miao:

RetouchDiffusion: Unsupervised Personalized Image Retouching via Diffusion Models. 1-6 - Wei Li, Kuan Zhu, Haiyun Guo, Honghui Dong, Jinqiao Wang:

Semantic-aware Fine-grained Point Augmentation for 3D Multi-modal Object Detection. 1-6 - Yanxiang Huang, Kai Zhang, Yuxiang Wang, Dongtai Du, Yuping Yuan, Zheng Zhao:

Enhancing Open-Vocabulary Panoptic Segmentation with Semantic-Guided Q-Tuning. 1-6 - Xiaohui Chen, Xin Wang, Zuhui Yue, Zheng Li, Peipei Liu, Hongsong Zhu:

MalDenoise: Enhancing Robustness of API-Based Malware Detection Against Adversarial Attacks. 1-6 - Zilong Ling, Xinran Zhong, Siyu Zhou, Yu Yang, Zhongcheng Gui, Huabin Wang:

Defect Detection-Guided Reconstruction Network for Ground Penetrating Radar B-Scan Images. 1-6 - Yinan Xiao, Shijun Xiang:

General Distortion Metric Based Multiple Histograms Modification for Reversible Data Hiding. 1-6 - Zhiqiang Shen, Qinfeng Li, Xuhong Zhang, Yuxiang Cai, Xiaochu Chen, Ping An, Haiqin Weng, Yang Liu:

PatchSegDet: Attack-Agnostic Detection of Physical Adversarial Patches in Face Recognition Systems. 1-6 - Yuxiao Sun, Yao Zhao, Meiqin Liu, Chao Yao, Weisi Lin:

Embedding Compression Distortion in Video Coding for Machines. 1-6 - Kunshan Yang, Wenwei Luo, Yuguo Hu, Jiafu Yan, Mengmeng Jing, Lin Zuo:

A Semantic-Enhanced Heterogeneous Graph Learning Method for Flexible Objects Recognition. 1-6 - Zicheng Wu, Li-Hsuan Chang, Kuan-Wen Chen:

TC-NeRF:Temporal Consistent Neural Radiance Fields with Cross-View Complementation for Occluded Object Removal. 1-6 - Kaiyue Liu, Lei Wu, Mingzhe Yu, Xiaole Liu, Yajie Xu, Xiangxu Meng:

ArtTypo: Multi-Level Controlled Artistic Typography with Iterative Feedback. 1-6 - Weijian Zhang, Zhiwei Zhang, Tianfang Sun, Zhizhong Zhang, Xin Tan, Yuan Xie:

LFNet: Cross-Modal LiDAR-Fisheye Fusion Network for 3D Semantic Segmentation. 1-6 - Meng Wei, Xinzheng Xu, Peng Ying, Renke Sun, Guanjun Wang, Zhongnian Li:

Learning from Stochastic Labels. 1-6 - Xinzhu Li, Yi Yang, Yikun Chen, Guanghui Yue, Wei Zhou, Ruomei Wang, Xudong Mao, Juepeng Zheng, Fan Zhou, Ziqi Qiu, Baoquan Zhao:

MSPoint-Gait: Multi-Scale Point Cloud Analysis for 3D Gait Recognition via Cross-Modal Learning. 1-6 - Canhui Wu, Wei Xi, Dashan Gao, He Yang, Jizhong Zhao:

Advanced Backdoor Threats and Countermeasures in Dataset Condensation. 1-6 - Peiheng Wang, Haodan Zhang, Quanlu Jia, Jiangkai Wu, Liming Liu, Haoyang Wang, Xinggong Zhang:

3DGCoding: Novel Framework for 3D Gaussian Video Incremental Training and Coding. 1-6 - Chun Xie, Huimin Tong, Guoxi Xu, Yipeng Chen, Li Luking, Yiwei Chen:

Knowledge Calibration Distillation. 1-7 - Yiran Wang, Jiasheng Lu, Jun Chen, Xinyu Zhang, Yingshan Liang, Zhicheng Du, Qingyang Shi, Shao-Lun Huang:

Content-Style Disentangled Audio Style Transfer via Diffusion Model. 1-6 - Xiang Yuan, Xinrong Chen, Haochen Li, Hang Yang, Guanyu Wang, Weiping Li, Tong Mo:

Stepwise Schema-Guided Prompting Framework with Parameter Efficient Instruction Tuning for Multimedia Event Extraction. 1-6 - Kangwei Liu, Junwu Liu, Yun Cao, Jinlin Guo, Xiaowei Yi:

DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model. 1-6 - Jiahua Bao, Siyao Cheng, Jiaxing Du, Ziqian Li, Changjiang He, Jie Liu:

History Tracker: Retrieving Historical Image Embeddings for Efficient Fine-Grained Reasoning in Vision-Language Models. 1-6 - Xi Wang, Ziqi He, Yang Zhou:

Dynamic Importance in Diffusion U-Net for Enhanced Image Synthesis. 1-6 - Yufan Liu, Jinyang An, Huashan Chen, Wanqian Zhang, Ming Li, Dayan Wu, Jingzi Gu, Zheng Lin, Weiping Wang:

Corer: Concept Residue Erasing in Text-to-Image Diffusion Models. 1-6 - Bin Li, Dehong Gao, Yeyuan Wang, Linbo Jin, Shanqing Yu, Xiaoyan Cai, Libin Yang:

Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language Models. 1-6 - Chong Geng, Zhen Liu, Yannan Wang, Yiran Li:

Distributed Cloud-Edge Scheduling for Multimedia Data Requests: A MARL Approach. 1-6 - Yanting Mei, Zhilu Zhang, Xiaojun Wu, Wangmeng Zuo:

Image Demoiréing Using Dual Camera Fusion on Mobile Phones. 1-6 - Xiaocheng Fang, Jieyi Cai, Huanyu Liu, Wenxiu Cai, Yishu Liu, Bingzhi Chen:

Revisiting DETR for Small Object Detection via Noise-Resilient Query Optimization. 1-6 - Chuanzhi Xu, Langyi Chen, Haodong Chen, Vera Chung, Vincent Qu:

Towards End-to-End Neuromorphic Voxel-based 3D Object Reconstruction Without Physical Priors. 1-6 - Xinyu Li, Hao Xu, Zhiheng Yang, Hongxiang Zhou, Hong Lu, Xin Wang, Jin Zhao:

ForeNet: Unlocking Long-Term Series Forecasting in High-Dimensional Scenario via Forest Structure. 1-6 - Juncheng Hu, Ximing Xing, Jing Zhang, Qian Yu:

VectorPainter: Advanced Stylized Vector Graphics Synthesis Using Stroke-Style Priors. 1-6 - Botao Zhao, Zuheng Kang, Yayun He, Xiaoyang Qu, Junqing Peng, Jing Xiao, Jianzong Wang:

Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy. 1-6 - Zhou Feng, Jiahao Chen, Chunyi Zhou, Yuwen Pu, Qingming Li, Shouling Ji:

Poison in the Well: Feature Embedding Disruption in Backdoor Attacks. 1-6 - Wendong Li, Gaojie Wu, Xiang Huang, Wei-Shi Zheng:

CosGaussian: Towards Text-to-3D Semantically Controllable 3D Object Style Transfer with Gaussian Splatting. 1-6 - Shuaipeng Zhang, Lanju Kong, Yixin Zhang, Wei He, Yongqing Zheng, Han Yu, Lizhen Cui:

DAG-AFL: Directed Acyclic Graph-based Asynchronous Federated Learning. 1-6 - Zhe Tao, Lu Yu, Hantao Yao, Changsheng Xu:

Leveraging Multiple Deep Experts for Online Class-incremental Learning. 1-6 - Wenyu Li, Peng Qiao, Sidun Liu, Zongxin Ye, Ziteng Zhang, Zhenglun Sun, Yong Dou:

End-To-End Casual Video Reconstruction: Geometry, Pose and Motion. 1-6 - Wansong Qin, Zhijie Han, Yaru Li:

STGGait: A Graph Transformer Network for Pose-based Gait Recognition. 1-6 - Mengkui Li, Xinrui Chen, Hai Chen, Kang Zhao, Yanping Zhang, Shu Zhao, Fulan Qian:

Eff-DFQT: Efficient Model Inversion for Data-free Quantization of Vision Transformers. 1-6 - Yanchu Wu, Feng Tian:

GT-free_XAI: A Ground Truth-Free XAI Framework for Decision Interpretation and Evaluation. 1-6 - Anthony Trioux, Wei Zhang, Giuseppe Valenzise, Fuzheng Yang:

Exploring Compression Strategies for Blendshape-Based Avatar Facial Animation: Subjective and Objective Analysis. 1-6 - Jiandong Shi, Ming Li, Guoheng Huang, Siwei Zhou, Yongchun Gu, Zhanle Zhu:

Hierarchical Graph Learning Framework for Multimodal Conversational Emotion Recognition. 1-6 - Ruigeng Zeng, Wentao Ma, Qinglin Wang, Xinjun Mao, Jie Liu:

False Negatives Consensus Suppression for Text-to-Image Person Re-identificatio. 1-6 - Zhiwei Dong, Ying Liu:

BEV-MMC: Bird's-Eye-View-Based Multimodal Compression for Enhanced Visual Recognition. 1-6 - Junyang Chen, Hanjiang Lai:

Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval. 1-6 - Chao Yang, Chao Tian, Guoqing Zhu, Qiang Wang, Zhenyu He:

CMRFusion: Efficient Feature Decomposition for RGB-T Fusion via Cross Modality Mask Reconstruction. 1-6 - Xuli Shen, Hua Cai, Dingding Yu, Weilin Shen, Qing Xu, Xiangyang Xue:

EmoHead: Emotional Talking Head via Manipulating Semantic Expression Parameters. 1-6 - Siqian Nie, Xin Ding, Jiabo Wu, Sihan Lin, Qiong Liu:

Multi-view Video Coding with Decoupled Neural Representation for Multi-modal Traffic Data. 1-6 - Yiming Ding, Jianguo Wei:

Weak Semantic-Guided Entropy Model for Image Compression. 1-6 - Xu Wang, Yan Fu, Yanxia Wu, Dan Lin, Ye Yuan, Xue Zhang, Zhirou Ma:

SAG-KeyNet: Scale-Adaptive Keypoint Gaussian Heatmap Regression Network for Oriented SAR Ship Detection. 1-6 - Dongyang Li, Haoyang Qin, Mingyang Wu, Jiahua Tang, Chen Wei, Quanying Liu:

RealMind: Advancing Visual Decoding and Language Interaction via EEG Signals. 1-6 - Yuanwen Chen, Haoran Li, Yaran Chen, Dongbin Zhao:

LeAffordNav: Enhancing Open-vocabulary Mobile Manipulation with LLM-guided Exploration and Affordance-aware Navigation. 1-6 - Gaurav Rai, Ojaswa Sharma:

LiveImage: Motion Condition Guided Diffusion Model for Video Motion Transfer. 1-6 - Tongfei Bian, Yiming Ma, Mathieu Chollet, Victor Sanchez, Tanaya Guha:

Interact with me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions. 1-6 - Zeyu Lei, Lidan Fu, Anqi Xiao, Jie Tian, Zhenhua Hu:

WDiff: Wavelet-based Diffusion Models for Surgical Endoscopic Image Low-Light Enhancement. 1-6 - Ziniu Liu, Mingqing Liu, Fengxia Han, Xingtong Liu, Chuan Liu, Xi Zhang, Hao Deng, Shengjie Zhao:

ESTJ: Efficient Semantic Segmentation via Token Joint Merging. 1-6 - Zhiqiang Yuan, Jie Zhou, Jinchao Zhang:

A Synthetic-to-Real Dehazing Method based on Domain Unification. 1-6 - Hengrui Li, Tianyi Lu, Jianfeng Wang, Xiaopei Chen, Yongbing Zhang, Shaohui Liu:

MAMF-Net: Modality-Adaptive Masked Fusion Network for Speech Emotion Recognition. 1-6 - Leilei Wang, Renjie Lu, Fengzhao Sun, Yunxiang Zhang, Jun Yu, Qingsong Liu, Jianqing Sun, Jiaen Liang:

Optimization of Multimodal Inputs Based on Diffusion Models: Zero-Shot Semantic Image Generation. 1-6 - Yao Shen, Kaiyang Zeng, Guangyao Li:

Token-Driven Linkage Network: One-Shot Adaptation of SAM for Challenging Segmentation Scenarios. 1-6 - Yulong Bai, Songlin Li, Xiuhong Li, Kuan Wang, Rong Wan, Haochu Ku, Mengge Lu:

Edge and Localization Feature Guidance Network for Accurate Polyp Segmentation. 1-6 - Yulei Kang, Teng-Yue Chen, Xiaotong Lin, Siyu Jiang, Jian-Fang Hu:

Recovering Human Mesh from Videos by 2D and 3D Deformable Attentions. 1-6 - Chuming Wang, Yingshuang Zou, Haoqian Wang:

Leveraging 2D Annotations for Cost-Effective Dynamic Urban Scene Reconstruction. 1-6 - Wei Li, Hebei Li, Yansong Peng, Siying Wu, Yueyi Zhang, Xiaoyan Sun:

Create Anything Anywhere: Layout-Controllable Personalized Diffusion Model for Multiple Subjects. 1-6 - Lingbo Zhang, Ye Zhang, Linghan Cai, Xianchao Guan, Kai Zhang, Yongbing Zhang:

Relation-Aware Graph Attention Network for Nuclei Classification. 1-6 - Xingshen Wei, Wei Liu, Wenzhong Li, Sanglu Lu:

Graph Anomaly Detection via Structure to Attribute Reconstruction. 1-6 - Hongcheng Li, Yucan Zhou, Yibin Wang, Xiaoyan Gu, Bo Li, Weiping Wang:

Take What I Need: Active Data Distillation for Federated Learning. 1-6 - Yiying Wei, Hadi Amirpour, Christian Timmerer:

Neural Representations for Scalable Video Coding. 1-6 - Zheng Dai, Chun Ding, Tianyi Chen, Si Wu, Yong Xu, Runzhe Liang, Tianshi Xu, Yedong Li, Dapeng Wu:

IP-KGQA: Intent-Aware Prompt Learning for Knowledge Graph Question Answering. 1-6 - Wanqi Ma, Huanhuan Lv, Songru Jiang, Jiale Wu:

BFPS: A Boundary-Focused Polyp Segmentation Model via Frequency Domain Separation. 1-6 - Jie Shi, Xin Wen, Shijie Guo, Robert H. Deng, Jianan Xie, Rui Cao:

Multi-Level Normalizing Flow for Comprehensive Anomaly Detection and Localization. 1-6 - Xiaogang Du, Dong Wang, Tao Lei, Tongfei Liu, Yingbo Wang, Asoke K. Nandi:

HGCL: Semi-Supervised Polyp Segmentation via Hierarchical Granularity Contrastive Learning. 1-6 - Leidong Fan, Xiongkuo Min, Qing Li, Anjie Wang:

A Unified Inverse-Tone-Mapped HDR Video Quality Assessment Method across Two HDR Formats. 1-6 - Cai Pan, Guowei Zhang, Rui Zhong:

Trans-Diff:Transformer-based Video Summarization with Diffusion. 1-6 - Honglin Wu, Jun-Jie Huang, Huibin Tan, Wanrong Huang, Yuhua Tang, Xueqiong Li:

Multi-Resolution Infrared-Visible Image Fusion using Multi-Scale Residual Quantization. 1-6 - Yingjie Zhou, Farong Wen, Zicheng Zhang, Yanwei Jiang, Jun Jia, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai:

CAP: An Advanced No-Reference Quality Assessment Method for AI-Generated 3D Meshes. 1-6 - Yaokun Zhong, Siyu Jiang, Jian Zhu, Jian-Fang Hu:

Context Consistency Learning via Sentence Removal for Semi-Supervised Video Paragraph Grounding. 1-6 - Jianhua Li, Yongkang Liu, Gaoqi He, Wenxiang Liu, Weiliang Meng:

Shape-Preserving and Surface-Fitting Network for 3D Lane Detection. 1-6 - Ashutosh Singla, Irene Viola, Jack Jansen, Pablo César:

QoE Evaluation of Remote Physiotherapy in Volumetric Video and Video-Based Real-Time Communication. 1-6 - Yiming Ma, Victor Sanchez, Tanaya Guha:

CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification. 1-6 - Ruishuang Sun, Ruiting Wang, Enguang Zuo, Junyu Zhu, Chen Chen, Cheng Chen, Xiaoyi Lv:

SCGRL: Graph representation learning based on edge structure contrastive self-supervised framework. 1-6 - Boyu Chen, Lu Han, Zherui Zhang, Li Guo, Shibiao Xu:

DCSA-UNet: Lightweight UNet with Dual Cross-Shaped Attention For Skin Lesion Segmentation. 1-6 - Zichong Chen, Zeyu An, Jian Cheng:

Region Confidence Refinement with Progressive Semantic Mining for Source-Free Domain Adaptive Object Detection. 1-6 - Sijia Hu, Peng Chen, Xinxiao Wang, Luyue Sun, Guanghao Li, Hongyu Wang, Jian Pu:

JointDeblur-Gs: Joint Blur-Aware Gaussian Splatting. 1-6 - Ganghui Ru, Jieying Wang, Jiahao Zhao, Yulun Wu, Yi Yu, Nannan Jiang, Wei Wang, Wei Li:

HingeNet: A Harmonic-Aware Fine-Tuning Approach for Beat Tracking. 1-6 - Shiqi Mou, Zijie Li, Juxiang Zhou, Jun Wang, Jianhou Gan:

Enhancing Handwritten Mathematical Expression Recognition with Structure and Counting Aware Network. 1-6 - Junqing Huang, Tong Liu, Chan-Tong Lam, Xiaochen Yuan:

SAM-FE: Segment Anything Model Guided Feature Enhancement for Semantic Change Detection of Remote Sensing Images. 1-6 - Jingchao Wang, Wenlong Zhang, Dingjiang Huang, Hong Wang, Yefeng Zheng:

A Simple and Better Baseline for Visual Grounding. 1-6 - Xin Zhang, Siting Huang, Xiangyang Luo, Yifan Xie, Weijiang Yu, Heng Chang, Fei Ma, Fei Yu:

MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach. 1-6 - Yinxuan Gui, Bin Zhu, Jingjing Chen, Chong-Wah Ngo:

Efficient Prompt Tuning for Hierarchical Ingredient Recognition. 1-6 - Jiyun Li, Jie Pan, Chen Qian, Ying Shen, Jiabao Zhao:

MSA-SAM2Net: A Polyp Segmentation Framework Based on Large Kernel Multi-Scale Attention. 1-6 - Yating Liu, Yan Lu:

Consistency Change Detection Framework for Unsupervised Remote Sensing Change Detection. 1-6 - Yicheng Di, Jiansong Fan, Rui Zhang, Song Shen, Jiayu Bao, Rongsheng Hu, Yuan Liu:

Global Perception Federated Recommender System for Click-Through Rate Prediction. 1-6 - Yubao Zhao, Jiaju Kang, Tian Zhang, Puyu Han, Tong Chen:

ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis. 1-6 - Yiheng Sun, Xiaopeng Hu, Fan Wang, Xinrong Wu, Ying Zhou, Jie Zhao, Rongqi Zhu:

SODMAMBA-DETR:A Small Object DETR Detector Based on a Mamba Encoder. 1-6 - Tie Liu, Yue Yang, Peng Chen, Qijun Zhao:

Leveraging Hierarchical Spatio-Temporal Distribution Prompt for Zero-Shot Species Recognition. 1-6 - Jiaxuan Zhu, Siyu Huang:

A Low-Rank Defense Method for Adversarial Attack on Diffusion Models. 1-6 - Guohui Cai, Ruicheng Zhang, Hongyang He, Zeyu Zhang, Daji Ergu, Yuanzhouhan Cao, Jinman Zhao, Binbin Hu, Zhibin Liao, Yang Zhao, Ying Cai:

MSDet: Receptive Field Enhanced Multiscale Detection for Tiny Pulmonary Nodule. 1-6 - Jiati Cai, Yue Lei, Wenxin Tai, Xing He, Ting Zhong, Jia Chen, Fan Zhou:

Efficient Diffusion Bridge with Initial-Value Correction Strategy for Super-Resolution. 1-6 - Xinyu Lin, Yingjie Zhou, Zhen Long, Yipeng Liu, Lu Yang, Ce Zhu:

Unified Line Segment Detection and Description. 1-6 - Zijian Zhu, Ali Zia, Xuesong Li, Bingbing Dan, Yuebo Ma, Enhai Liu, Rujin Zhao:

SSTD: Stripe-Like Space Target Detection Using Single-Point Weak Supervision. 1-6 - Bingheng Pang, Wei Li, Zhuoxuan Liang, Yidan Chen, Zhihong Wang, Moustafa Youssef:

DiffMissing: Denoising Diffusion Model for Multivariate Time Series Forecasting with Variable Missing. 1-6 - Haolong Yan, Binghao Tang, Boda Lin, Jiachen Li, Si Li:

DF-Net: A Dual Fusion Network for Accurate Video Temporal Grounding. 1-6 - Canhui Wu, Wei Xi, Yuwei Fan, Yuhao Shen, Jizhong Zhao:

Fed3D: Enhancing Security in Federated Learning with Dataset Distillation. 1-6 - Ruicheng Zhang, Kanghui Tian, Zeyu Zhang, Qixiang Liu, Zhi Jin:

FDG-Diff: Frequency-Domain-Guided Diffusion Framework for Compressed Hazy Image Restoration. 1-6 - Haoyang Li, Siyu Zhou, Liang Wang, Guodong Long:

MAO: Efficient Model-Agnostic Optimization of Prompt Tuning for Vision-Language Models. 1-6 - Tao Wu, Purui Bai, Huaibo Huang, Jie Cao, Yuang Ai, Ran He:

Degradation-Aware Multi-Task Image Restoration with State Space Models. 1-6 - Hongquan Liu, Yixin Ren, Jihong Guan, Shuigeng Zhou:

Mitigating Knowledge Forgetting by Generative Knowledge Replay and Forgetting-aware Aggregation in Semi-Supervised Federated Learning. 1-6 - Jiaqi Li, Ruowei Wang, Yu Liu, Qijun Zhao:

QEMesh: Employing A Quadric Error Metrics-Based Representation for Mesh Generation. 1-6 - Shuhan Ye, Yuanbin Qian, Chong Wang, Sunqi Lin, Jiazhen Xu, Jiangbo Qian, Yuqi Li:

Cross Knowledge Distillation between Artificial and Spiking Neural Networks. 1-6 - Yuchuan Deng, Zhanpeng Hu, Zijie Xin, Chuang Deng, Qijun Zhao:

DAPL: Integration of Positive and Negative Descriptions in Text-Based Person Search. 1-6 - Chun Wang, Chenyang Liu, Wenze Xu, Weihong Deng:

Dual Information Speech Language Models for Emotional Conversations. 1-6 - Ying Gao, Jing Lin, Wentian Cai, Yandan Chen, Zihao Huang, Zhiyong Xia:

BMCA: Weakly Supervised Semantic Segmentation via Beta Modulation and Cross-Modality Alignment. 1-6 - Zhenglun Sun, Peng Qiao, Yong Dou, Rongchun Li, Sidun Liu, Wenyu Li, Wenjie Hu:

Only One Stage: A Chemical-Aware Model for Accurate Combustion Chemical Kinetics Prediction. 1-6 - Ziwei Zhu, Xinzhu Zhang, Zhikang Zhao, Jing Zhao:

ITJP: Image and Text Joint Prompts for Few-Shot Whole Slide Image Classification. 1-6 - Hua Zhang, Tingting Xiao, Li Sun, Qingli Li:

RobusTReID: Defending Vision Transformer for Robust Image ReID. 1-6 - Guojun Lei, Chi Wang, Yikai Wang, Hong Li, Ying Song, Weiwei Xu:

MotionFlow: Learning Implicit Motion Flow for Complex Camera Trajectory Control in Video Generation. 1-6 - Dan Fu, Wai Keung Wong, Lunke Fei, Tingting Chai, Yuzhu Ji, Qinghua Zhu:

Coarse-To-Fine Graph Reasoning for 3D Hand Mesh Reconstruction. 1-6 - Xueping Zhang, Yaxiong Chen, Ruilin Yao, Yunfei Zi, Shengwu Xiong:

Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization. 1-6 - Xinzhong Wang, Lingyong Fang, Jidong Li, Yichen Zhou, Gongshen Liu:

KMoP: Knowledge-injected Mixture-of-Prefix for Joint Multimodal Aspect-Based Sentiment Analysis. 1-6 - Zengyu Liu, Zhitao Liu, Yi Li, Zhenjiang Du, Lei Zhang, Ning Xie:

DCCL: Discriminative Cosine Center Learning for 3D Cross-Modal Retrieval with Real-world Image. 1-6 - Lohic Fotio Tiotsop, Andrés Altieri, Giuseppe Valenzise:

Non-Parametric Media Quality Recovery from Spammer-Affected Subjectively Annotated Datasets. 1-6 - Zeyu Wang, Yuankai Qi, Dong An, Xu Yang, Hongxin Li, Zhaoxiang Zhang:

Language-Conditioned Waypoint Predictor for Continuous Vision-and-Language Navigation. 1-6 - Jing Xiao, Chang You, Zhiyu Chen:

AlignKT: Explicitly Modeling Knowledge State for Knowledge Tracing with Ideal State Alignment. 1-6 - Jie Zhang, Qiongjie Cui, Xulei Yang, Na Zhao:

OcSplats: Rendering Occluded Humans with Prior Knowledge. 1-6 - Yu Cao, Sijia Li, Shiguang Liu:

Fitted-Singer: Singing Voice Synthesis with Style Control and Rhythm Control. 1-6 - Jingchao Wang, Zhijian Wu, Wenlong Zhang, Wenhui Liu, Jianwei Zhang, Dingjiang Huang:

Overcoming Feature Contamination by Unidirectional Information Modeling for Vision-Language Tracking. 1-6 - Xinlin Leng, Kangyu Hu, Hanlin Gu, Xiangui Kang, Wenyuan Yang:

Prototype-Based Communication Topology Optimization for Decentralized Federated Learning. 1-6 - Yunfei Chen, Yitian Long, Zhan Yang, Jun Long:

DMDH: Decentralized Multi-agent Distributed Hashing for Multimedia Retrieval. 1-6 - Shuangyi Tan, Mingzhi Mao, Guanbin Li:

NeRFSwap: A NeRF-Based Generative Model for Face Swapping. 1-6 - Qilin Wang, Zhengkai Jiang, Chengming Xu, Jiangning Zhang, Yabiao Wang, Xinyi Zhang, Yun Cao, Weijian Cao, Chengjie Wang, Zhanxiong Wang, Yanwei Fu:

VividPose: Vividly 3D-driven Stable Pose Diffusion of High Facial Fidelity. 1-6 - Di He, Xinshan Zhu, Lan Zhang, Siyu Wang, Zhong Zhang:

Cross-modal Shared Concept Learning for Text-to-Image Person Retrieval. 1-6 - Liu Yang, Mengni Chen, Tingxuan Chen, Jinqi Hu, Zidong Wang:

GraphDEH: Graph Diffusion Enhanced Hypergraph Method for Class-Imbalanced Node Classification. 1-6 - Sidharth Anand, Chaitanya Sai Chandu Yendru, Sreyasee Das Bhattacharjee, Junsong Yuan:

Multimodal Conversational Emotion Analysis with Robustness to Incomplete Modality Details. 1-6 - Xukun Zhou, Fengxin Li, Ziqiao Peng, Xinyu Wang, Hongyan Liu, Zhaoxin Fan, Jun He:

Meta-Learning Empowered Meta-Face: Personalized Speaking Style Adaptation for Audio-Driven 3D Talking Face Animation. 1-6 - Xue Xia, Zipeng Lin, Jingying Zhu, Jiebin Yan, Yuming Fang:

Cross-Structure and Semantic Enhancement for Diabetic Retinopathy Grading. 1-6 - Xianghui Fan, Zhaoyu Chen, Mengyang Pan, Anping Deng, Hang Yang:

Self-Supervised Learning for Transparent Object Depth Completion Using Depth from Non-Transparent Objects. 1-6 - Taixiang Lin, Shuyuan Lin, Yanjie Liang, Rong Chen, Yang Lu:

DTSNet: A Denoising Teacher-Student Network with Reverse Distillation for Anomaly Detection. 1-6 - Taorui Wang, Zitong Yu, Yong Xu:

TC-GS: Tri-plane based Compression for 3D Gaussian Splatting. 1-6 - Liuhan Chen, Zongjian Li, Bin Lin, Bin Zhu, Qian Wang, Shenghai Yuan, Xing Zhou, Xinhua Cheng, Li Yuan:

OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model. 1-6 - Wenxin Meng, Shenshen Li, Lei Wang, Hao Yang, Chong Peng, Peng Yan, Xing Xu:

Causal Intervention with Active Learning for Large Vision-Language Models in Egocentric Contexts. 1-6 - Ke Gu, Peng Bai, Zhen Lei, Yue Zhou, Zhicong Wu, Xiaodong Shi:

End-to-End Lyric-to-Melody Generation via Chord Integration and Bar-Level Modeling. 1-6 - Yi Liu, Keyu Fan, Bin Lan, Houde Liu:

DyPho-SLAM: Real-time Photorealistic SLAM in Dynamic Environments. 1-6 - Shaohui Pan, Yong Xu, Ruotao Xu, Zihan Zhou, Si Wu, Zhuliang Yu, Patrick Le Callet:

Rethinking 3D Robotic Perception: Elastic Voxel Representation with Splatting Distillation. 1-6 - Zongyun Zhang, Jiacheng Ruan, Xian Gao, Ting Liu, Yuzhuo Fu:

EIAD: Explainable Industrial Anomaly Detection Via Multi-Modal Large Language Models. 1-6 - Yiwen Wang, Xinning Chai, Yuhong Zhang, Zhengxue Cheng, Jun Zhao, Rong Xie, Li Song:

Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution. 1-6 - Yuanchao Li, Azalea Gui, Dimitra Emmanouilidou, Hannes Gamper:

Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio Distance. 1-6 - Yujian Lee, Peng Gao, Zailong Chen, Wentao Fan, Guquan Jing, Yiyang Hu:

Boosting Audio-Visual Segmentation via Triple-Modalities Alignment. 1-6 - Yuansu Hao, Fei Yu, Yanhao Wang, Yuehua Li, Quan Deng, Yuan Yu, Chen Huang, Nan Che:

Open-Scene Understanding-oriented 3D Scene Graph Generation. 1-6 - Yutao Wei, Hongzhu Fu, Yuxiang Li, Yichen Xin, Xovee Xu, Fan Zhou, Ting Zhong:

Decoding Emotional Silences: Reliable Multimodal Sentiment Analysis with Bipolar Uncertainty. 1-6 - Chao Liu, Chuanlin Liao, Tingting Zhang, Yi Lin:

MFA-Net: A Multi-Stage Network for Facial Acupoint Localization with Global-Local Feature Fusion and Acupoint Encoding. 1-6 - Zheyuan Liu, Jun Jia, Hongyi Miao, Yiwei Yang, Yanwei Jiang, Yingjie Zhou, Zhi Liu, Guangtao Zhai:

DiffDeid: High-Quality Face De-identification and Recovery via Diffusion Inversion. 1-6 - Xinyu Zhao, Weichen Xu, Jian Cao, Tianhao Fu, Ruilong Ren, Xing Zhang:

Human-Inspired Situated Question Answering with Large Language Models. 1-6 - Yifang Yin, Shengkai Chen, Yiyao Li, Lu Wang, Ruibing Jin, Wei Cui, Shili Xiang:

SimCast: Enhancing Precipitation Nowcasting with Short-to-Long Term Knowledge Distillation. 1-6 - Bohan Xiong, Kan Chang, Mingyang Ling, Shilin Huang, Shucheng Xia, Yujian Yuan:

Prompt-Based Two-Stage Enhancement for Low-Light Object Detection. 1-6 - Linxuan Xin, Zheng Zhang, Zhiyi Pan, Jinfu Wei, Duan Gao, Wei Gao:

DreamPBR: Text-driven High-Resolution SVBRDF Generation with Multimodal Guidance. 1-6 - Yi Li, Weichao Li, Xin Zheng, Haiyan Fu, Yanqing Guo:

Long-Tailed Federated Learning with Fixed Classifier. 1-6 - Chengzhi Wang, Peng Yang:

Compression Metadata-assisted RoI Extraction and Adaptive Inference for Efficient Video Analytics. 1-6 - Rongrong Lian, Xiangdong Li, Zhenkai Wu, Mengting Ma, Wei Zhang:

SLGN: Spatiotemporal Language-Guided Graph Network for Referring Video Segmentation. 1-6 - Junjie Wu, Yumeng Fu, Nan Yu, Chen Gong, Guohong Fu:

2S-DGM4: A Two-Stage Framework for Detecting and Grounding Multi-Modal Media Manipulation. 1-6 - Xiang Li, Duyi Pan, Hongru Xiao, Jiale Han, Jing Tang, Jiabao Ma, Wei Wang, Bo Cheng:

DialogueAgents: A Hybrid Agent-Based Speech Synthesis Framework for Multi-Party Dialogue. 1-6 - Zhen Long, Qingqing Cao, Hu Yao, Yipeng Liu, Le Zhang, Ce Zhu:

TRR-LGF: a Simple yet Efficient Classification Network. 1-6 - Guoming Lu, Guodong Zou, Dongnan Liu, Heng Yin, Jielei Wang, Guangchun Luo:

DEQuant: Distribution-Enhanced Reconstruction for Post-Training Quantization. 1-6 - Xinpan Yuan, Jiabao Li, Wei Xia, Wenguang Gan, Mengxi Ying, Liujie Hua:

Adapting Cross-Modal Semantic Discrepancy in Text-based Person Search. 1-6 - Zhuang Qi, Runhui Zhang, Lei Meng, Wei Wu, Yachong Zhang, Xiangxu Meng:

Global Intervention and Distillation for Federated Out-of-Distribution Generalization. 1-6 - Ziyi Huang, Binbin Yan, Dongliang Wang, Jinglun Feng, Shuo Chen, Xiangcheng Yi:

DynaGS-SLAM: Robust Dynamic SLAM with 3D Gaussian Splatting. 1-6 - Xiangyu Gao, Zhekai Luo, Peijia Zheng, Jian Li, Rui Yang:

Accurate and Efficient Privacy-Preserving Image SURF Feature Extraction. 1-6 - Ao Gao, Luosong Guo, Tao Chen, Zhao Wang, Ying Tai, Jian Yang, Zhenyu Zhang:

EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy. 1-6 - Yonghao Li, Xiangyu Zhao, Ping Ye, Qingxuan Jia:

GEST: Dual Structured Exploration with Graph ODE for Spatio-Temporal Dynamic System Modeling. 1-6 - Youbing Hu, Yun Cheng, Zimu Zhou, Anqi Lu, Zhiqiang Cao, Zhijun Li:

FoCTTA: Low-Memory Continual Test-Time Adaptation with Focus. 1-6 - Yafan Yuan, Zhen Liu, Xinxin Yang, Sibo Lu:

Contrastive Intent-Disentangled Variational AutoEncoder for Sequential Recommendation. 1-6 - Fuyao Cai, Daizong Liu, Xiang Fang, Jixiang Yu, Keke Tang, Pan Zhou:

Imperceptible Beam-Sensitive Adversarial Attacks for LiDAR-based Object Detection in Autonomous Driving. 1-6 - Peng Qian, Ning Wang, Carl C. Udora, Carlos Velez Redondo, Jingxuan Men, Rahim Tafazolli:

Enabling Haptic-Integrated Interactive Holographic Video Streaming Powered by 5G Edge Computing. 1-6 - Sashuai Zhou, Yan Xia, Hai Huang:

Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning. 1-6 - Wentao Xie, Xingyu Li:

Enhancing Object-Attribute Alignment in Diffusion Models via Training-Free Contrastive Parallel Denoising. 1-6 - Yangyang Huang, Xing Xi, Ronghua Luo:

DLLM: Enhancing Open-World Object Detection with Dynamic Learning and Large Models. 1-6 - Jinghua Zhao, Yuhang Jia, Shiyao Wang, Jiaming Zhou, Hui Wang, Yong Qin:

Chinese-LiPS: A Chinese Audio-Visual Speech Recognition Dataset with Lip-Reading and Presentation Slides. 1-6 - Zhewei Wu, Ruilong Yu, Shilin Qiu, Qihe Liu, Shijie Zhou, Zhun Zhang:

AADN++: Latent Feature Improves Adversarial Defense Transferability on Object Tracking. 1-6 - Cong Ming, Haojie Yuan, Xiangwen Wang, Qi Chu, Tao Gong, Bin Liu, Nenghai Yu:

Adversarial Examples Detection Based on Adversarial Attack Sensitivity. 1-6 - Junlong Ren, Hao Wang:

Enhanced Cross-modal 3D Retrieval via Tri-modal Reconstruction. 1-6 - Hailan Shen, Yixiang Jiang, Zailiang Chen, Xujing Liu, Jian Zhang:

DB-NeRF: An Effective Dual-Branch Representation for Neural Radiance Fields. 1-6 - Yuchen Guo, Ruoxiang Xu, Rongcheng Li, Weifeng Su:

DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion. 1-6 - Jingui Ma, Yang Hu, Luyang Tang, Jiayu Yang, Yongqi Zhai, Ronggang Wang:

Enhancing 3D Gaussian Splatting Compression via Spatial Condition-based Prediction. 1-6 - Cheng Qian, Jiwu Cao, Ying Mao, Kai Liu, Peng Zhu, Jun Sang:

AMS-Counter: Text-Guided Zero-shot Object Counting via Adaptive Multi-view Similarity-map. 1-6 - Junsong Li, Jie Zhou, Yutao Yang, Bihao Zhan, Qianjun Pan, Yuyang Ding, Qin Chen, Jiang Bo, Xin Lin, Liang He:

Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning. 1-6 - Manman Yuan, Ting Xu, Jiazhen Ye, Peican Zhu, Jiacheng Wang, Keke Tang:

PopuDet: Autism Spectrum Disorder Detection in Population Graphs via Micro-macro Relationship Construction and Multi-feature Fusion. 1-6 - Gaowei Zhang, Wei Wang, Yi Wang:

ZeroPose: Leveraging Diffusion Models and Large Language Models for Advanced Multi-Hypothesis 3D Construction Workers' Pose Estimation. 1-6 - Jiankang Wei, Xu Ma, Yuan Ma, Hongwei Zhou, Jingtong Huang, Xiaoyu Zhang:

Enhancing Federated Learning Robustness with Pre-trained Staged Modular Distillation. 1-6 - Xiufeng Liu, Zhongqiu Zhao, Yi Yang, Donghui Hu, Zhao Zhang:

SwinCAE: Capsule Autoencoder using Shifted Windows for 3D Human Pose Estimation. 1-6 - Jinfu Wei, Zheng Zhang, Qinchuan Zhang, Ran Liao, Duan Gao:

Few-Shot 3D Face Generation via a Controllable Diffusion Model Guided by Text and Images. 1-6 - Fei Zhao, Da Pan, Zelu Qi, Ping Shi:

Research on Audio-Visual Quality Assessment Dataset and Method for User-Generated Omnidirectional Video. 1-6 - Zheng Chen, Wengang Zhou, Houqiang Li:

Active Object Tracking with Occluded Targets Estimation and Adversarial Reinforcement Learning. 1-6 - Jie Wang, Yan Huang, Yunfei Zhang, Tianyi Chen, Si Wu, Yong Xu, Patrick Le Callet:

SemanticLoom: Category-aware Dynamic Fusion for Multi-class Few-shot Image Synthesis. 1-6 - Shuo Yang, Junyi Wang, Yue Qi:

Multi-mode Bidirectional Feature Fusion and Domain-consistency Refinement for Real-time Monocular 6D Object Pose Estimation. 1-6 - Yi Lu, Shu Li, Huanglong Dong, Shuxiang Hou, Yurong Qian:

Make Prototypes Perform Again: Prior-Prototypes Based Feature learning Framework for Few-Shot Hashing. 1-6 - Zhenqin Chen, Yuying Bao, Fengbo Wang, Yiwei Lin, Jinshan Xu:

A Refined ECG Delineation Framework Incorporating Single-Beat Mode and Conditional Random Field. 1-6 - Aofan Liu, Lulu Tang, Ting Pan, Yuguo Yin, Bin Wang, Ao Yang:

PiCo: Jailbreaking Multimodal Large Language Models via Pictorial Code Contextualization. 1-6 - Jiangfeng Li, Shijie Wang, Zijun Huang, Yifan Li:

CAMCKG: A Framework for Trigger-Action Recommendation Combining Attention Mechanism and Continuous Kernel Graph Convolution. 1-6 - Changhai Ma, Ziyu Wu, Yunkang Zhang, Qijun Ying, Boyan Liu, Xiaohui Cai:

From Camera to World: A Plug-and-Play Module for Human Mesh Transformation. 1-6 - Zihan Cheng, Xingyu Pan, Xi Chen, Shenghua Fan:

Wavelet-based Feature Representation Framework for Event Stream Recognition. 1-6 - Yujie Hu, Xuanyu Zhang, Weiqi Li, Jian Zhang:

TalkFashion: Intelligent Virtual Try-On Assistant Based on Multimodal Large Language Model. 1-6 - Zhekai Luo, Xiangyu Gao, Peijia Zheng, Jian Li, Weiqi Luo:

Privacy-Preserving Anti-Recompression Video Watermarking in Bitstream Domain. 1-6 - Shifeng Xu, Yanzhu Liu, Adams Wai-Kin Kong:

Variance-Reduction Guidance: Sampling Trajectory Optimization for Diffusion Models. 1-6 - Hailun Zhang, Qijun Zhao, Zhen Zhai, Xinrui Wang:

Perspective Makes Perfect: Prompt-tuning Vision-Language Models for Action Recognition with Diversified Multi-Modal Observation. 1-6 - Sik Chit Wu, Munan Ning, Dong Wei, Yefeng Zheng, Donghuan Lu, Li Yuan:

OpenDUN: To Discover Unknown Number of Visual Categories. 1-6 - Yixuan Wang, Kehan Wang, Huayu Zhang, Ming Fang, Shuhua Liu:

TriModal Enhanced Fusion Network: Advancing Multimodal Representation and Fusion for Enhanced Multimodal Intent Recognition. 1-6 - Puwei Lian, Xiao Ke, Zhou Tan, Jianping Cai, Ximeng Liu:

Achieving Zero-Glance Unlearning with Data-Free Inversion and Selective Parameters Suppression. 1-9 - Lei Tan, Yuliang Xue, Guobiao Li, Zhenxing Qian, Sheng Li, Xinpeng Zhang:

Texture-Aware Neural Radiance Fields Watermarking for Resisting Feature-Modulation Surrogate Model Attacks. 1-6 - Cheng Tang, Wenqi Lou, Qianyu Cheng, Jiayi Tuo, Wei Fu, Tianhao Jiang, Chao Wang, Xuehai Zhou:

Spectral Enhanced Tuning: An Efficient Plug-and-Play Framework for Frequency-Aware Dehazing. 1-6 - Lulu Tian, Hongxun Yao, Zhaopan Xu, Jiankun Zhu, Xi Chen, Yuxin Hou:

DreamAnimate: Temporal Consistency and Detail Preservation for Character Animation. 1-6 - Shang Song, Lin Liu, Rongmao Chen, Wei Peng:

Concretely Efficient Three-party Oblivious Selection. 1-6 - Hongpeng Pan, Yang Yang:

Coordinated Uni-modal Assistance for Enhancing Multi-modal Learning. 1-6 - Yutian Li, Jiaming Yang, Yiwen Hu, Lap-Kei Lee, Fu Lee Wang, Zhenguo Yang:

Aspect-attentioned Prompting for Multimodal Sentiment Analysis. 1-6 - Ganghui Ru, Jieying Wang, Jiahao Zhao, Yulun Wu, Yi Yu, Nannan Jiang, Wei Wang, Wei Li:

BeatFM: Improving Beat Tracking with Pre-trained Music Foundation Model. 1-6 - ShangXuan Xie, Haifeng Wu, Wen Li, Lixin Duan:

P2WNet: Homography Estimation for Part-To-Whole and Cross-Modality Scenarios. 1-6 - Jilong Luo, Yinsheng Chen, Yue Liu, Jinghai Wang, Zhiyi Yu, Shanlin Xiao:

Stair-LIF: Boosting the Representation of Spiking Neural Networks with Learnable Incremental Multi-Threshold Neurons. 1-6 - Dejie Yang, Zhu Xu, Xinjie Gao, Yang Liu:

Hierarchical Sub-action Tree for Continuous Sign Language Recognition. 1-6 - Kangwei Liu, Junwu Liu, Xiaowei Yi, Jinlin Guo, Yun Cao:

Controllable Expressive 3D Facial Animation via Diffusion in a Unified Multimodal Space. 1-6 - Jiaqi Yin, Jingyang Qiao, Tiong-Thye Goh, Yi Hu:

Enhancing Personalized Recommendation via Metacognitive Profile. 1-6 - Yongji Li, Luping Wang:

Spectrum-Assisted Mamba for Infrared Small Target Detection. 1-6 - Chaozhuo Li, Hui Pang, Xi Zhang, Litian Zhang, Feiran Huang, Ming Lu:

Harnessing Counterfactual Reasoning for Explainable Multi-Modal Fact Verification with Large Language Models. 1-6 - Junjie Chu, Yugeng Liu, Xinlei He, Michael Backes, Yang Zhang, Ahmed Salem:

Neeko: Model Hijacking Attacks Against Generative Adversarial Networks. 1-6 - Haodong Zhang, Liu Yang, Zihan Jiang:

RKU: Relevant Knowledge-aware Unlearning for Federated Continual Learning. 1-6 - Bimei Wang, Jingmei Jiao, Jisheng Dang, Qingrun Jiang, Jiyuan Lin, Zhixuan Chen, Teng Wang, Jun Yang:

Quality-Guided Dynamic Memory for LLMs-based Long-Term Video Understanding. 1-6 - Mingyu Cao, Xihuai He, Xueqiong Li, Kedi Zhang, Yuhua Tang, Wanrong Huang, Huibin Tan:

AGFT-Tracker: Adaptive Game-Based PEFT for Object Tracking with PLMs. 1-6 - Na Lu, Xiaojie Zhao, Li Yao:

An EEG Dataset with Subjective-Objective Perception Data for Assessing Stereoscopic Visual Discomfort Induced by 3D Motion Videos. 1-6 - Yongfeng Dong, Jiaji Wang, Zhen Wang, Guifang Wu, Hao Cheng:

Training Robust DNNs with Noisy Labels via Contrastive Re-Calibration Learning. 1-6 - Xudong Wang:

CDIQA: Collaborative Learning with Diffusion Extension for Semi-supervised Blind Image Quality Assessment. 1-6 - Wei Tao, Xiaoyang Qu, Kai Lu, Jiguang Wan, Guokuan Li, Jianzong Wang:

MADLLM: Multivariate Anomaly Detection via Pre-trained LLMs. 1-6 - Guquan Jing, Peng Gao, Yiyang Hu, Yujian Lee, Hui Zhang:

ESTI: An Efficient Spatial-Temporal Interaction Network For Video-Based Person Re-Identification. 1-6 - Meixuan Chen, Chen Wang, Liu Hui, Yujun Wu, Ying Sha:

Can MLLMs Tell Jokes Based on Images? A Visual Context-Driven Humor Generation Framework. 1-6 - Haiwen Li, Zining Chen, Ying Liu, Fei Su, Zhicheng Zhao:

Slot Inversion for Asymmetric Composed Image Retrieval. 1-6 - Jiaming Lu, Yunrui Zhu, Ruyu Liu, Xu Cheng, Jianhua Zhang, Bo Sun, Xiufeng Liu:

Mamba-SLAM: Enhancing Neural Implicit SLAM with Uncertainty and Mamba. 1-6 - Xiao Shao, Weiqi Yan, Yu Zang:

Iterative Multi-Collaborative Training Network for Point Cloud Learning with Noisy Annotations. 1-6 - Jingzhi Zhang, Chengjie Bai:

Discrimination-based Method for Image Object Detection with Random Distinct Proposals. 1-6 - Ludan Ruan, Lei Tian, Chuanwei Huang, Xu Zhang, Xinyan Xiao:

UniVG: Towards UNIfied-modal Video Generation. 1-6 - Yu Zhong, Tao Xie, Anna Zhu:

High Resolution Wire Segmentation with Domain Adaption. 1-6 - Beizhen Zhao, Yifan Zhou, Zijian Wang, Hao Wang:

EG-Gaussian: Epipolar Geometry and Graph Network Enhanced 3D Gaussian Splatting. 1-6 - Yuxiao Yang, Peihao Li, Yuhong Zhang, Junzhe Lu, Xianglong He, Minghan Qin, Weitao Wang, Haoqian Wang:

NOVA3D: Normal Aligned Video Diffusion Model for Single Image to 3D Generation. 1-6 - Saeed Ranjbar Alvar, Mohammad Akbari, David Ming Xuan Yue, Yong Zhang:

AMUSE: Adaptive Multi-Segment Encoding for Dataset Watermarking. 1-6 - Jianrong Wang, Shuyun Zhang, Ying Guo, Qi Li, Ju Zhang, Di Jin:

LiPlan: A Multimodal Dataset for Livable Urban Environment Layout Generation. 1-6 - Yunze Deng, Haijun Xiong, Bin Feng, Xinggang Wang, Wenyu Liu:

STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-To-4D Gaussian Splatting. 1-6 - Meng Wei, Zhongnian Li, Peng Ying, Ridong Han, Tongfeng Sun, Xinzheng Xu:

Determined Multi-Label Learning via Similarity-Based Prompt. 1-6 - Xu Zhang, Ming Lu, Yan Chen, Zhan Ma:

Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference. 1-6 - Qilong Xing, Zikai Song, Yuteng Ye, Yuke Chen, Youjia Zhang, Na Feng, Junqing Yu, Wei Yang:

CA-Diff: Collaborative Anatomy Diffusion for Brain Tissue Segmentation. 1-6 - Shukuan Yuan, Zihao Zhang, Yahong Han:

Object-Centric Feature Enrichment for Single-Domain Generalized Object Detection. 1-6 - Yi Zou, Haonan Cheng, Long Ye, Qin Zhang:

Pop-Diffuseq: Controllable Symbolic Music Multi-Instrument Infilling and Accompaniment Generation with Long-Axis Attention. 1-6 - Andreas Goulas, Vasileios Mezaris, Ioannis Patras:

VidCtx: Context-aware Video Question Answering with Image Models. 1-6 - Fanyu Meng, Zhixin Bai, Yanming Wang, Jing Huo, Boyan Wang, Xi Yang, Yang Gao:

Advancing Safe Language Generation: Exploring Alternative Constrained RLHF. 1-6 - Yang Qu, Zhencai Shen, Yingyi Chen, Ping Zhong:

Graph-based Meta-Learning and Feature Disentanglement for Domain Generalization Crowd Counting. 1-6 - Zuying Xie, Changtao Miao, Ajian Liu, Jiabao Guo, Feng Li, Dan Guo, Yunfeng Diao:

SUEDE: Shared Unified Experts for Physical-Digital Face Attack Detection Enhancement. 1-6 - Zeyuan Li, Yangfan He, Lewei He, Jianhui Wang, Tianyu Shi, Bin Lei, Yuchen Li, Qiuwu Chen:

FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding OptimizatioN. 1-6 - Chuyang Ye, Dongyan Wei, Zhendong Liu, Yuanyi Pang, Yixi Lin, Qinting Jiang, Jingyan Jiang, Dongbiao He:

DATTA: Domain Diversity Aware Test-Time Adaptation for Dynamic Domain Shift Data Streams. 1-6 - Qinjie Hu, Fei Qi, Kaiwen Fu, Chengyuan Chang, Xiaotian Wang, Kun Liu, Guangming Shi:

Redesigning Upsampling in Decoders with Aligned Feature Aggregation for Semantic Segmentation. 1-6 - Ye Liu, Yan Pan, Jian Yin:

Few-shot Prompt Learning with Large Vision-Language Model for Image Deep Hashing. 1-6 - Zhiliang Zeng, Mengyang Wu, Xianzhi Li, Wenzhao Gao, Shaohui Jiao, Chi-Wing Fu:

Geometrically-plausible and Semantically-consistent Generation of Indoor Panoramas. 1-6 - Guangmin Zheng, Jun Kong, Jin Wang, Xuejie Zhang:

Enhanced Multimodal Chain-of-Thought with Visual Self-Contrastive Distillation. 1-6 - Qianhan Tang, Yanan Liu, Ningxin Wang, Kangjian He, Hao Zhang, Dan Xu:

Construct a Powerful Discriminative Relationship for Few-Shot Action Recognition. 1-6 - Wenlong Wang, Dahua Gao, Pengfei He, Xinyu Liu, Danhua Liu:

3D Human Motion Corpus Moment Retrieval via Multi-Granularity Semantic Alignment. 1-6 - Xiaojie Yu, Mingzhi Pang, Zhongxu Bao, Xu Yang, Qiang Niu, Yuqing Yin:

Explore the Asymmetric Interference Sound Field for High-precision Localization. 1-6 - Zunian Wan, Jiancheng Zhao, Yepeng Ding, Lingfeng Zhang, Hiroyuki Sato, Takefumi Ogawa:

Spectrum-Adaptive Distribution of 2D Gaussians for Image Representation and Compression. 1-6 - Youmin Xu, Xuanyu Zhang, Xiandong Meng, Chong Mou, Jian Zhang:

Diffusion-Based Hierarchical Image Steganography. 1-6 - Kun Zhou, Xinyu Lin:

Mutual Guidance and Residual Integration for Image Enhancement. 1-6 - Yuxue Hu, Junsong Li, Meixuan Chen, Dongyu Su, Tongguan Wang, Ying Sha:

Keyword-Oriented Multimodal Modeling for Euphemism Identification. 1-6 - Wei Li, Jiawei Jiang, Ni Xu, Ying Cui, Yan Li, Jianwei Zheng:

Spatial-Spectral Fusion Neural Operator. 1-6 - Hailan Shen, Yuqi Li, Zailiang Chen, Hui Liu, Wenyan Zhong, Yudi Wang:

AWUR: Adaptive Wavelet and Uncertainty Refinement for Semi-Supervised Medical Image Segmentation. 1-6 - Meng Geng, Qian Huang, Yulin Chen, Xuejie Zhang:

WDRE-NET: Wavelet-Differential Convolution and Region-Expansion to Enhance Weakly Supervised Adjacent Nuclei Segmentation. 1-6 - Yejing Guo, Ziqi Wang, Xia Yuan, Chunxia Zhao:

Bilateral Enhanced Complementary Network for Camouflaged Object Detection. 1-6 - Quanlin Chen, Chunjin Ye, Yiming Ma, Jiahui Pan, Jingcong Li:

A Novel Perspective on Leveraging Hubness in VAE for Eliminating Representative Shift Vectors in Few-Shot Learning. 1-6 - Yuhai Wang, Maryam Pishgar:

Dynamic Token Selective Transformer for Aerial-Ground Person Re-Identification. 1-6 - Jia Wen, Jialin Li, Ting Zhang:

VoxelDet: Towards Accurate 3D Object Detection with Voxel Pruning and Fine Geometric Shape. 1-6 - Changyong He, Jin Zeng, Jiawei Zhang, Jiajie Guo:

Towards Robust Time-Of-Flight Depth Denoising with Confidence-Aware Diffusion Model. 1-6 - Yiwen Luo, Tao Wei, Yong Luo, Zengmao Wang:

Enhancing Multimodal Chain-of-Thought Reasoning with Tree-Searched Self-Training. 1-6 - Yiran Song, Qianyu Zhou, Xuequan Lu, Zhiwen Shao, Lizhuang Ma:

SU-SAM: A Simple Unified Framework for Adapting SAM in Underperformed Scene. 1-6 - Zi-Yu Zhang, Bing-Feng Seng, Ya-Feng Du, Kang Li, Zhe-Cheng Wang, Zheng-Jun Du:

Semantic Palette-Guided Color Propagation. 1-6 - Yukang Huo, Xianhui Meng, Li Zhang, Haonan Jiang, Yan Zhong, Mingyuan Yao, Haihua Wang:

Diff-Art: Category-level Articulation Pose Estimation via Conditional Diffusion. 1-6 - Yuchen Zhang, Mingxin Li, Chao Gao, Xianghua Li:

Confidence Breeds Success: Improving Fake News Video Detection via LVLM-Assisted Inference. 1-6 - Shun Zou, Yi Zou, Mingya Zhang, Shipeng Luo, Zhihao Chen, Guangwei Gao:

Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition. 1-6 - Lingfeng Ye, Kumie Gedamu, Jie Shao:

Decoupling Representations with Quantized Vectors for Semi-Supervised Action Quality Assessment. 1-6 - Kangsheng Wang, Xiao Zhang, Juntao Lyu, Tianyu Hu, Huimin Ma:

CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Causal Significance and Consistency. 1-6 - Xinyu Li, Yunqi Cai, Hao Xu, Xinyu Sun, Zhiheng Yang, Hong Lu, Xin Wang, Jin Zhao:

HMSformer: Hierarchical Multi-Scale Transformer for Multivariate Long-Term Series Forecasting. 1-6 - Chen Tang, Yangle Li, Tingrui Shen, Xinrong Gong, Tong Zhang:

Towards Trustworthy Model via Uncertainty Verification in Multimodal Sentiment Analysis. 1-6 - Xudong Ru, Xingce Wang, Peng Du, Yanghui Yan, Shaolong Liu, Yi-Cheng Zhu, Wuyang Shui, Zhongke Wu:

SE(3)-Equivariant Multi-Scale Graph Transformer for Multi-Resolution 3D Aneurysm Segmentation. 1-6 - Yuzhe Zhu, Lile Cai, Kangkang Lu, Fayao Liu, Xulei Yang:

Exploring Active Learning for Label-Efficient Training of Semantic Neural Radiance Field. 1-6 - Junbang Liu, Enpei Huang, Dongxing Mao, Hui Zhang, Xinyuan Song, Yongxin Ni:

ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting. 1-6 - Zitong Xu, Huiyu Duan, Guangji Ma, Liu Yang, Jiarui Wang, Qingbo Wu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet:

HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment. 1-6 - Wenhao Li, Qiangchang Wang, Jing Li, Shengnan Zhao, Mindi Ruan, Yilong Yin:

Semantic-Aware Adaptation with Hierarchical Multimodal Prompts for Few-Shot Learning. 1-6 - Qingzheng Wang, Ning Li, Jiazhi Xie:

Dual-Domain Iterative Refinement Network for Camouflaged Object Detection. 1-6 - Hanbing Liu, Zhi-Qi Cheng, Wangmeng Xiang, Jun-Yan He, Bin Luo, Yifeng Geng, Xuansong Xie:

Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation. 1-6 - Jun Liang, Yunyu Zou, Yang Peng, Yalong Cheng, Rui Luo, Yishu Liu, Bingzhi Chen:

Task-Aware Knowledge Prompt and Distillation for Cross-Domain Few-Shot Learning. 1-6 - Yongji Li, Luping Wang:

Infrared Small Target Detection via Multi-Path Deep Conduction. 1-6 - Zhengwei Peng, Conghan Yue, Tong Duan, Dongyu Zhang:

Inversion-Free Image Editing via Rectified Flow. 1-6 - Bowen Chen, Jiehua Zhang, Yuchen Sun, Li Liu:

Efficient Binarized Neural Network Intellectual Property Protection. 1-6 - Jianzhang Gao, Hao Pu, Yuchong Sun, Ruihua Song:

Uncovering Personality Traits via Multimodal LLM for Personalized Image Emotion Analysis. 1-6 - Jielei Wang, Zihan Cheng, Guoming Lu, Kexin Li, Guangchun Luo:

MRKD: Monotonic Relationship-based Knowledge Distillation for SAR Image Recognition. 1-6 - Pengyu Lin, Xunxun Zeng, Wanling Liu, Huayi Chen, Fei Chen:

Adaptive Pixel Classification and Equivalent Large Kernels for Lightweight Image Super-Resolution. 1-6 - Yuetong Li, Yilin Zhao, Qing Zhang, Qiangqiang Zhou, Yanjiao Shi:

Frequency-guided Camouflaged Object Detection with Perceptual Enhancement and Dynamic Balance. 1-6 - Yucheng Zeng, Aihua Mao, Xianghong Wang, Tianye Niu:

CT-MIE: Computed Tomography Multi-Task Image Enhancement via Vision-Language Model. 1-6 - Mingyang Liu, Fan Zhou, Ruomei Wang, Baoquan Zhao:

Multi-granularity Frequency Difference-Aware Attention for Video Question Answering. 1-6 - Yu Zhou, Xing Wu, Liangshan Zhu, Chengliang Wang, Zailin Yang, Yao Liu:

Nucleus-SAM: Point-Supervised SAM for Nucleus Segmentation. 1-6 - Jianxin Shi, Xiaolong Chen, Yusen Xie, Jinhao Chen, Fali Wang, Jun Ma, Tianyu Wo:

ScNet: Scene-Consistency Network Learning for Multi-Agent Motion Forecasting. 1-6 - Rui Yang, Qindong Sun, Jiaming Cai, Jiangtao Yu:

ADoP: A Universal, Robust, Efficient, and Plug-and-Play Adversarial Example Detector. 1-6 - Wenrui Li, Meijun Sun, Cheng Liu, Xinyu Yan, Zheng Wang:

Gradient-guided Attention Fusion Network for Camouflaged Object Detection. 1-6 - Yaru Zhang, Haichao Shi, Xiaoyu Zhang:

Concept-Centric Learning for Weakly-Supervised Temporal Sentence Grounding. 1-6 - Tianyi Ma, Muqing Wu, Zijian Zhang:

3DGlobalFormer: Three Domain Global Feature Fusion in 3D Human Estimation. 1-6 - Tianyi Gong, Boyan Li, Yifei Zhong, Fangxin Wang:

ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image. 1-6 - Wei Yan, Xiaoman Zhao:

SASG: Semantic-Aware Salient Guidance for Day-to-Night Domain Adaptive Object Detection. 1-6 - Haoyu Xiong, Qiuxia Yang, Chengchao Wang, Tianze Zhong, Zhengpeng Zhao, Yuanyuan Pu:

Layer-wise Parameter Robustness for Continual Test-time Adaptation. 1-6 - Pengyuan Qi, Ye Tian, Guisheng Yin:

Time-Series Acoustic Network for Underwater Acoustic Target Recognition. 1-6 - Miaomiao Dai, Qianyu Zhou, Lizhuang Ma:

StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture. 1-6 - Xiaodong Wang, Zijun He, Xin Yuan:

Texture-aware Intrinsic Image Decomposition with Model- and Learning-based Priors. 1-6 - Jinyang Wang, Wei Wu:

Progressively Enhanced Camouflaged Object Detection via Boundary Awareness. 1-6 - Yalong Xu, Mengting Jiang, Yang Gao, Junlong Mu, Di Wang, Lin Zhao:

MambaPose: Efficient 2D Human Pose Estimation with Pose-Prior Guided State Space Model. 1-6 - Wanning Zhu, Libao Zhang:

Contrastive Adversarial Learning for Region-Aware Weakly Annotated Object Segmentation in Hazy Remote Sensing Images. 1-6 - Xuhong Ren, Jianlang Chen, Wanli Xue, Lei Ma, Qing Guo, Jianjun Zhao, Shengyong Chen:

TGSR: Template-Guided Semantic Resampling against Adversarial Tracking Attacks. 1-6 - Xingyue Lin, Xingjian Hu, Shuai Peng, Jianhua Zhu, Liangcai Gao:

SketchRef: a Multi-Task Evaluation Benchmark for Sketch Synthesis. 1-6 - Baiqin Wang, Xiangyu Zhu, Fan Shen, Hao Xu, Shukai Chen, Zhen Lei:

ET-Talk: Effective Training Strategy to Enhance Synchrony and Fidelity for Talking Face Generation. 1-6 - Min Wang, Xin Huang, Qing Wang:

ATM-NeRF: Learning Adaptive Tone Mapping for Normal-Light Neural Radiance Field Reconstruction. 1-6 - Rui Zhou, Gangyi Jiang, Linwei Zhu, Yeyao Chen, Yueli Cui, Ting Luo, Haiyong Xu:

Mamba-Based Blind Stitched Wide Field of View Light Field Image Quality Assessment via Dual-Viewport Sampling. 1-6 

