


default search action
ICME 2024: Niagara Falls, ON, Canada
- IEEE International Conference on Multimedia and Expo, ICME 2024, Niagara Falls, ON, Canada, July 15-19, 2024. IEEE 2024, ISBN 979-8-3503-9015-5

- Xinyue Chen, Miaojing Shi:

Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation. 1-6 - Ziran Zhu, Tongda Xu, Ling Li, Yan Wang:

Noise Dimension of GAN: An Image Compression Perspective. 1-6 - Weijun Yuan, Zhan Li, Xiaohan Li, Liangda Fang, Qingfeng Zhang, Zhixiang Qiu:

Crowd Counting and Localization in Haze and Rain. 1-6 - Jiayang Liu, Kai Wang, Zheng Wang, Xing Xu:

SADA: Self-Adaptive Domain Adaptation From Black-Box Predictors. 1-6 - Jin Chen, Jiahe Tian, Cai Yu, Xi Wang, Zhaoxing Li, Yesheng Chai, Jiao Dai, Jizhong Han

:
ConfR: Conflict Resolving for Generalizable Deepfake Detection. 1-6 - Wentao Ma, Anni Tang, Jun Ling, Han Xue, Huiheng Liao, Yunhui Zhu, Li Song:

SingAvatar: High-fidelity Audio-driven Singing Avatar Synthesis. 1-6 - Yuchen Wang, Xiaoguang Li, Li Yang, Lu Zhou, Jianfeng Ma, Hui Li:

Adaptive Oriented Adversarial Attacks on Visible and Infrared Image Fusion Models. 1-6 - Xin Li, Haizhuang Liu, Rongquan Wang, Bochao Zou, Yuxin Lin, Huimin Ma:

EMo Transformer: Transformer-Based Depression Detection via Eye Movements. 1-6 - Lin Bie

, Shouan Pan, Kai Cheng, Li Han:
Build a Cross-modality Bridge for Image-to-Point Cloud Registration. 1-6 - Yibowen Zhao, Yonghui Xu, Ning Liu, Yixin Zhang, Wei Guo, Xudong Lu, Lizhen Cui:

Causal Denoising Framework for Generalizable Recommendation System using Graph Neural Network. 1-6 - Ting Cai, Yu Xiong, Chengyang He, Chao Wu, Song Zhou:

TBU: A Large-scale Multi-mask Video Dataset for Teacher Behavior Understanding. 1-6 - Ying Ren

, Kailai Shen, Zhe Ye, Diqun Yan:
EventTrojan: Manipulating Non-Intrusive Speech Quality Assessment via Imperceptible Events. 1-6 - Ziqiang Shi, Rujie Liu:

Multimedia Generative Modelling with High-Order Langevin Dynamics. 1-6 - Ye Bai, Chenxing Li, Hao Li, Yuanyuan Zhao, Xiaorui Wang:

Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation. 1-6 - Zekun Xu, Yipeng Zhou

, Quan Z. Sheng
, Chao Li, Tongtong Lou, Weipeng Jing:
Adaptive Global-local Fusion Network Based Deep Unsupervised Hashing for Remote Sensing Image Retrieval. 1-6 - Chen Wu, Zhuoran Zheng, Pengwen Dai, Chenggang Shan, Xiuyi Jia:

Rethinking Image Deraining via Text-guided Detail Reconstruction. 1-6 - Hanlin Li, Yueyi Zhang, Guanting Dong, Shida Sun, Zhiwei Xiong:

Joint Flow Estimation from Point Clouds and Event Streams. 1-6 - Yulin Zhao, Xiangling Ding:

One-Class HEVC Double Compression Detection with Same Coding Parameters. 1-6 - Sumei Li, Xiaoxuan Chen, Peiming Lin:

A Lightweight CNN and Spatial-Channel Transformer Hybrid Network for Image Super-Resolution. 1-6 - Yunzhe Xiao, Xueqiong Li, Shaowu Yang, Wenjing Yang, Yong Dou:

CRNet: Cross-Reconstruction Network for Inconsistent Point Cloud Registration. 1-6 - Biao Wu, Haitao Wang, Hejun Wu:

Task-Aware Lipschitz Confidence Data Augmentation in Visual Reinforcement Learning From Images. 1-6 - Yaoxun Xu, Xingchen Song, Zhiyong Wu, Di Wu, Zhendong Peng, Binbin Zhang:

Hydraformer: One Encoder for All Subsampling Rates. 1-6 - Hao Deng, Shengmei Chen, Cheng Liu, Bo Jiang, Lin Wang:

Geo GCN: Geometric-based Graph CNN for Learning on Point Cloud. 1-6 - Xiaotian Han, Yiqi Wang, Bohan Zhai, Quanzeng You, Hongxia Yang:

COCO is "ALL" You Need for Visual Instruction Fine-tuning. 1-5 - Shuai Zhao, Shibin Liu, Boyuan Zhang

, Yang Zhai, Ziyi Liu, Yahong Han:
A Patch-wise Adversarial Denoising Could Enhance the Robustness of Adversarial Training. 1-6 - Zixian Gao, Xun Jiang, Hua Chen, Yujie Li, Yang Yang, Xing Xu:

Uncertainty-Debiased Multimodal Fusion: Learning Deterministic Joint Representation for Multimodal Sentiment Analysis. 1-6 - Shifeng Liu, Xinglong Mao, Sirui Zhao, Chaoyou Fu, Ying Yu, Tong Xu, Enhong Chen:

TGMAE: Self-supervised Micro-Expression Recognition with Temporal Gaussian Masked Autoencoder. 1-6 - Tianci Xun, Wei Chen, Yulin He, Di Wu, Yuanming Gao, Jiuyuan Zhu, Weiwei Zheng:

Distinguishing Textual Prompt Importance: Image-Guided Text Weighting for CLIP-Based Few-shot Learning. 1-6 - Xinyu Xiao, Yun Hu, Eryun Liu:

Local-to-Global Self-Consistency Learning for Temporal Action Localization. 1-6 - Gakusei Sato, Taketo Akama:

Annotation-Free Automatic Music Transcription with Scalable Synthetic Data and Adversarial Domain Confusion. 1-6 - Ruisheng Yuan, Minzhe Tang, Dongliang Kou, Mingyang Sun, Dingkang Yang, Xiao Zhao, Lihua Zhang

:
IIPC: Intra-Inter Patch Correlations for Garment Collision Handling. 1-6 - Haoyu Tang, Shuaike Zhang, Ming Yan, Ji Zhang, Mingzhu Xu, Yupeng Hu, Liqiang Nie:

Two-Stage Information Bottleneck For Temporal Language Grounding. 1-6 - Zhixiang Yuan, Kaixin Zhang

, Tao Huang:
Positive Label Is All You Need for Multi-Label Classification. 1-6 - Stephen D. Voran:

Why Some Audio Signal Short-Time Fourier Transform Coefficients Have Nonuniform Phase Distributions. 1-6 - Yixuan Guan

, Xuefeng Liu, Tao Ren, Jianwei Niu
:
FedMDC: Enabling Communication-Efficient Federated Learning over Packet Lossy Networks via Multiple Description Coding. 1-7 - Guosheng Cui, Fusheng Hao, Dan Wu, Ye Li:

Fast label prediction based on shrunk anchor graph for semi-supervised incomplete multiview classification. 1-6 - Xingbei Guo, Ziping Ma, Qing Wang, Pengxu Wei:

Towards Real-world Continuous Super-Resolution: Benchmark and Method. 1-6 - Feihu Jiang, Chuan Qin, Jingshuai Zhang, Kaichun Yao, Xi Chen, Dazhong Shen, Chen Zhu, Hengshu Zhu, Hui Xiong:

Towards Efficient Resume Understanding: A Multi-Granularity Multi-Modal Pre-Training Approach. 1-6 - Ruoyan Pi, Jinglin Xu, Yuxin Peng:

FE-VAD: High-Low Frequency Enhanced Weakly Supervised Video Anomaly Detection. 1-6 - Alysa Ziying Tan, Siwei Feng, Han Yu:

FL-Clip: Bridging Plasticity and Stability in Pre-Trained Federated Class-Incremental Learning Models. 1-6 - Mingzhou Wu, Shiqi Dai

, Han Hu, Zhi Wang:
Collaborative Edge Caching in LEO Satellites Networks: A MAPPO Based Approach. 1-6 - Jiaxin Deng, Shiyao Wang, Dong Shen, Liqin Zhao, Fan Yang, Guorui Zhou, Gaofeng Meng:

A Multimodal Transformer for Live Streaming Highlight Prediction. 1-6 - Qilong Xu, Xiuyang Zhao:

Contour-Guided Modality Mitigation Network for Visible-Infrared Person Re-Identification. 1-6 - Xiaowen Ma, Jiawei Yang, Rui Che, Huanting Zhang, Wei Zhang:

DDLNet: Boosting Remote Sensing Change Detection with Dual-Domain Learning. 1-6 - Qi Jia, Shuilian Yao, Youcan Xu, Yu Liu, Dehao Kong, Longin Jan Latecki:

Fuzzy Boundary-Guided Network for Camouflaged Object Detection. 1-6 - Yutao Rao, Liwei Sun, Junjie Zhang, Haoran Jiang, Jian Zhang

, Dan Zeng:
Densely Connected Transformer with Frequency Awareness and Sam Guidance for Semi-Supervised Hyperspectral Image Classification. 1-6 - Jinglin Zhao, Debin Liu, Laurence T. Yang, Ruonan Zhao, Zheng Wang, Zhe Li:

TD3D: Tensor-based Discrete Diffusion Process for 3D Shape Generation. 1-6 - Tingting Li, Gensheng Pei, Xinhao Cai, Qiong Wang, Huafeng Liu, Yazhou Yao:

Universal Organizer of Segment Anything Model for Unsupervised Semantic Segmentation. 1-6 - Jiabang He, Jia Liu, Lei Wang, Xiyao Li, Xing Xu:

MoCoSA: Momentum Contrast for Knowledge Graph Completion with Structure-Augmented Pre-trained Language Models. 1-6 - Zhichao Jiang, Hongsong Wang, Xi Teng, Baopu Li:

Robust 3D Face Alignment with Multi-Path Neural Architecture Search. 1-6 - Zongyuan Jiang, Jiayu Chen, Chongyu Liu, Ning Zhang, Jun Huang, Xue Gao, Lianwen Jin:

RISC: Boosting High-quality Referring Image Segmentation via Foundation Model CLIP. 1-6 - Zhuang Qi, Weihao He, Xiangxu Meng, Lei Meng:

Attentive Modeling and Distillation for Out-of-Distribution Generalization of Federated Learning. 1-6 - Wenyu Li, Zongxin Ye, Sidun Liu, Ziteng Zhang, Xi Wang, Peng Qiao, Yong Dou:

ParaSurRe: Parallel Surface Reconstruction with No Pose Prior. 1-6 - Pengfei Yao, Yinglong Zhu, Tianlu Mao, Hao Jiang, Zhaoqi Wang:

Modeling Scene-Agent Interaction for Pedestrian Trajectory Prediction. 1-6 - Yu Wang

, Shengjie Zhao:
Weakly-Supervised Action Localization by Hierarchical Attention Mechanism with Multi-Scale Fusion Strategies. 1-6 - Liwen Hu, Lei Ma, Yijia Guo, Tiejun Huang:

SCSim: A Realistic Spike Cameras Simulator. 1-6 - Guiyu Zhao, Zewen Du, Zhentao Guo, Hongbin Ma:

VRHCF: Cross-Source Point Cloud Registration via Voxel Representation and Hierarchical Correspondence Filtering. 1-6 - Yihong Lu, Jianyi Liu, Ru Zhang:

An Images Regeneration Method for CG Anti-Forensics Based on Sensor Device Trace. 1-6 - Shuhua Wang, Ke Lu, Yang Zhao

, Hengsheng Lun, Zehai Niu, Jian Xue:
VS3D: A Vote-Based Semi-Supervised 3D Object Detection Framework for Point Clouds. 1-6 - Ziming Cheng, Xiangning Ruan, Qixiang Yin, Zhicheng Zhao:

The Root Element of Human Poses is Radian: MCPRL is All You Need. 1-6 - Zheng Lin, Zheng-Peng Duan, Xuying Zhang, Luojun Lin:

No-Reference Segmentation Annotation Quality Assessment. 1-6 - Kangze Xu, Ziqiang He, Xiangui Kang, Z. Jane Wang:

Transferable and high-quality adversarial example generation leveraging diffusion model. 1-6 - Jiaxin Chen

, Xin Liao, Zhenxing Qian
, Zheng Qin:
Multi-domain Probability Estimation Network for Forgery Detection over Online Social Network Shared Images. 1-6 - Hengsheng Lun, Ke Lu, Liping Hou, Shuhua Wang, Jian Xue:

From 3D to 4D: Fixing the Erroneous Coupling between IoU and Angle for Optimizing 3D Object Detection. 1-6 - Xiaogang Du, Meng Yang, Tao Lei, Xuejun Zhang, Yingbo Wang, Asoke K. Nandi:

HSVFormer: Robust and Unsupervised HSV-based Transformer Framework for Low-Light Image Enhancement. 1-6 - Xin Zheng, Ziang Peng, Yuan Cao, Hongming Shan, Junping Zhang:

SIAM: A Simple Alternating Mixer for Video Prediction. 1-10 - Yu Cai, Shihao Gao, Songzhi Su, Xizhi Chen, Xi Wang:

MeshStyle: Text-driven Efficient and High-Quality 3D Mesh Stylization via Hypergraph Convolution. 1-6 - Yijie Wei, Bo Liu, Peng Luan, Yinchi Ma:

Multi-Scale Dense Description for Blind Image Quality Assessment. 1-6 - Zining Chen, Weiqiu Wang, Zhicheng Zhao, Fei Su, Aidong Men:

Selective Cross-Correlation Consistency Loss for Out-of-Distribution Generalization. 1-6 - Guangxing Wu, Junxi Chen

, Qiu Li, Wentao Zhang, Wei-Shi Zheng, Ruixuan Wang:
Region Attention Fine-tuning with CLIP for Few-shot Classification. 1-6 - Yang Chen

, Yueqi Duan, Runzhong Zhang, Yap-Peng Tan:
Adaptive Margin Contrastive Learning for Ambiguity-aware 3D Semantic Segmentation. 1-6 - Mingrui Xiao, Zijian Zeng, Yue Zheng, Shu Yang, Yali Li, Shengjin Wang:

A Dataset with Multi-Modal Information and Multi-Granularity Descriptions for Video Captioning. 1-6 - Haotian Hu, Bin Jiang, Chao Yang, Xinjiao Zhou, Xiaofei Huo:

ScribbleEditor: Guided Photo-realistic and Identity-preserving Image Editing with Interactive Scribble. 1-6 - Ying Liu, Ge Bai, Chenji Lu, Shilong Li, Zhang Zhang, Ruifang Liu, Wenbin Guo:

Eliminating the Language Bias for Visual Question Answering with fine-grained Causal Intervention. 1-6 - Tian Feng, Jiaheng Wang, Junao Shen, Qiangguo Jin, Zhiyuan Zhu, Xinyu Wang:

Retinal Vessel Segmentation via Cross-attention Feature Fusion. 1-6 - Juncheng Yang, Zuchao Li, Shuai Xie, Weiping Zhu, Wei Yu, Shijun Li:

Cross-Modal Adapter: Parameter-Efficient Transfer Learning Approach for Vision-Language Models. 1-6 - Ting Liu, Xuyang Liu

, Siteng Huang, Honggang Chen, Quanjun Yin, Long Qin, Donglin Wang, Yue Hu:
DARA: Domain- and Relation-Aware Adapters Make Parameter-Efficient Tuning for Visual Grounding. 1-6 - Mengxi Zhang, Heqing Lian, Yiming Liu

, Jie Chen:
HARIS: Human-Like Attention for Reference Image Segmentation. 1-6 - Jing Zhao

, KokSheik Wong, Vishnu Monn Baskaran, Kiki Adhinugraha
, David Taniar:
Music Form Analysis: A Case Study of The Theme and Variations Form. 1-6 - Zhigang Wang, Yunpeng Gao, Xun Li, Peipei Gu, Bin Zhao, Xuelong Li

:
A Coarse-to-Fine Reconstruction Framework for Non-Lambertian Photometric Stereo. 1-6 - Xiaoxi Lu, Xingyue Wang, Jiansheng Fang, Na Zeng, Jingqi Huang, Chuangguang Huang, Jingfeng Zhang, Jianjun Zheng, Heng Meng, Jiang Liu:

3D Nodule Content-Based Metric Learning for Evidence-Based Lung Cancer Screening. 1-7 - Junjie Kang, Jinsong Wu, Shiqi Jiang:

Photorealistic image style transfer based on explicit affine transformation. 1-8 - Wenjing Wang, Si Li:

Consensus Co-teaching for Dynamically Learning with Noisy Labels. 1-6 - Bingheng Pang, Zhuoxuan Liang

, Wei Li, Xiangxu Meng, Chenhao Wang, Yilin Ren:
Brain Waves Unleashed: Illuminating Neonatal Seizure Detection via Multi-scale Hierarchical Modeling. 1-6 - Xiao Fu, Wei Xi, Zhao Yang, Rui Jiang, Dianwen Ng, Jie Yang, Jizhong Zhao:

MRFER: Multi-Channel Robust Feature Enhanced Fusion for Multi-Modal Emotion Recognition. 1-6 - Jianbo Ma

, Chuanming Tang, Fei Wu
, Can Zhao, Jianlin Zhang, Zhiyong Xu:
STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking. 1-6 - Zheng Wang, Junkun Zhao, Bifan Lai, XingHuai Zheng:

Structural Highlight Network for Camouflaged Object Detection. 1-6 - Qiong Chen, Yaochi Zhao, Yujia Chen, He Zhang, Zhuhua Hu:

Combining Soft and Hard Attentions for high-quality single-stage instance segmentation. 1-5 - Wajahat Khalid

, Bin Liu, Muhammad Waqas
:
Clothmix: A Cloth Augmentation Strategy for Cloth-Changing Person Re-Identification. 1-6 - Yujie Liu, Mingyue Li, Jiansen Jing, Yante Li

, Guoying Zhao:
Clothing Sampling Based on Active Learning For Cloth-Changing Person Re-identification. 1-6 - Depei Liu, Hongjie Fan, Junfei Liu:

PGDM: Multimodal Panoramic Image Generation with Diffusion Models. 1-6 - Sijing Xie, Chengxin Zhao, Nan Sun, Wei Li, Hefei Ling:

Picking watermarks from noise (PWFN): an improved robust watermarking model against intensive distortions. 1-6 - Zhuo Xie, Haoran Mo, Chengying Gao:

Video-Driven Sketch Animation Via Cyclic Reconstruction Mechanism. 1-6 - Huanting Zhang, Mengting Ma, Xinyu Wang, Jiawei Yang, Xiangdong Li, Wei Zhang:

SSETPAN: Spatial-Spectral Enhanced Transformer based network for pansharpening. 1-6 - Xin Zhou

, Tianyang Dong, Jing Fan, Wenyuan Ying
, Hubin Kong:
ODNet: Orthogonal-Perception and Dense-dilation Enhanced Network for Segmenting Complex Tree Branch Structures. 1-6 - Ruizhou Liu, Zongsheng Cao, Zhe Wu, Qianqian Xu, Qingming Huang:

Multimodal Knowledge Graph Embeddings via Lorentz-based Contrastive Learning. 1-6 - Haitao Yao, Zhenwei Wang, Mingli Zhang, Wen Zhu, Lizhi Zhang, Lijun He, Jianxin Zhang:

Second-Order Self-Supervised Learning for Breast Cancer Classification. 1-6 - Daowu Yang, Ying Liu, Qiyun Yang, Ruihui Li:

Talking Portrait with Discrete Motion Priors in Neural Radiation Field. 1-6 - Junjie Yang, Hao Wu, Ji Zhang, Lianli Gao, Jingkuan Song:

Effective and Efficient Few-shot Fine-tuning for Vision Transformers. 1-6 - Yulun Wu, Yaolong Ju, Simon Lui, Jing Yang, Fan Fan, Xuhao Du:

Cycle Frequency-Harmonic-Time Transformer for Note-Level Singing Voice Transcription. 1-6 - Jie Luo, Xin Jin, Mingyu Liu, Yihui Fan:

TrafficScene: A Multi-modal Dataset including Light Field for Semantic Segmentation of Traffic Scenes. 1-6 - Yuxuan Mu, Shihao Zou, Kangning Yin, Zheng Tian, Li Cheng, Weinan Zhang, Jun Wang:

RACon: Retrieval-Augmented Simulated Character Locomotion Control. 1-6 - Yang Li, Songlin Yang, Wei Wang

, Ziwen He, Bo Peng, Jing Dong:
Counterfactual Explanations for Face Forgery Detection via Adversarial Removal of Artifacts. 1-6 - Haiyan Jin, Yifan Shuai, Fengyuan Zuo, Haonan Su, Zhaolin Xiao, Bin Wang, Yuanlin Zhang:

A Channel-Wise Guidance Sparse Transformer for Effective Dark Image Enhancement. 1-6 - Zongyao He, Zhi Jin:

Dynamic Implicit Image Function for Efficient Arbitrary-Scale Super-Resolution. 1-6 - Yuebin Xie, Xiaochen He, Baoyao Yang, Fei Lyu, Siqi Liu:

CAM-Guided Translation for Unpaired Weakly-Supervised Medical Image Segmentation. 1-6 - Zihan Niu, Zheyong Xie, Tong Xu, Xiangfeng Wang, Yao Hu, Ying Yu, Enhong Chen:

Knowledge-Enhanced Multi-perspective Incongruity Perception Network for Multimodal Sarcasm Detection. 1-6 - Haoxuan Wang, Ping Wei, Shuaijia Chen, Zhimin Liao, Jialu Qin:

Local-to-Global Perception Network for Point Cloud Segmentation. 1-6 - Jiacheng Su, Kunhong Liu, Liyan Chen, Junfeng Yao, Qingsong Liu, Dongdong Lv:

Audio-driven High-resolution Seamless Talking Head Video Editing via StyleGAN. 1-6 - Meng Wang, Xiaojie Guo, Jiawan Zhang:

FNFORMER: A Transformer-Based Face Normal Estimator. 1-6 - Hanting Li, Hongjing Niu, Zhaoqing Zhu, Feng Zhao:

CLIPER: A Unified Vision-Language Framework for In-the-Wild Facial Expression Recognition. 1-6 - Chuanfei Hu, Hang Shao, Bo Dong, Zhe Wang, Yongxiong Wang:

ASD: Towards Attribute Spatial Decomposition for Prior-Free Facial Attribute Recognition. 1-9 - Dawei Dai, Yingge Liu, Shiyu Fu, Guoyin Wang:

Multimodal Image-Text Representation Learning for Sketch-Less Facial Image Retrieval. 1-6 - Junkun Hong, Yitian Long, Yueyi Luo, Qianqian Qi, Jun Long:

Multi-feature and Multi-branch Action Segmentation Framework for Modeling Long-Short-Term Dependencies. 1-6 - Chuqiao Wu, Haitao Huang, Wenming Yang:

Diffusion based Coarse-to-Fine Network for 3D Human Pose and Shape Estimation from monocular video. 1-6 - Jingru Wang, Xinguang Xiang:

Multi-scale Transformer with Prompt Learning for Remote Sensing Image Dehazing. 1-6 - Liman Jiang, Canlong Zhang, Lei Wu, Zhixin Li, Zhiwen Wang, Chunrong Wei:

Mask-guided Salient Feature Mining for Cloth-Changing Person Re-identification. 1-6 - Haoran Mo, Xusheng Lin, Chengying Gao, Ruomei Wang:

Text-Based Vector Sketch Editing with Image Editing Diffusion Prior. 1-6 - Zhibo Zhang, Ximing Yang, Weizhong Zhang, Cheng Jin:

ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation. 1-6 - Jianhao Fu

, Xiang Ling, Yaguan Qian, Changjiang Li, Tianyue Luo, Jingzheng Wu:
Towards Query-Efficient Decision-Based Adversarial Attacks Through Frequency Domain. 1-6 - Zemin Tang, Min Shi, Zhibang Yang, Xu Zhou, Cen Chen, Joey Tianyi Zhou:

Sentiment Confidence Separation: A Trust-Optimized Framework for Multimodal Sentiment Classification. 1-6 - Yiming Tang, Yi Yu, Yan Qiu Chen:

Prototype-Guided Prior Enhancement and Rectification in Few-shot Semantic Segmentation. 1-6 - Qingfeng Zheng, Peijia Zheng, Weiqi Luo

, Wei Lu:
A Fast and Tunable Privacy-Preserving Action Recognition Framework over Compressed Video. 1-6 - Jaakko Laitinen, Tero Partanen

, Alexandre Mercat
, Jarno Vanne
, Miska M. Hannuksela, Honglei Zhang
, Alireza Aminlou, Francesco Cricri:
Feasibility Study of Multi-Layer VVC Coding Scheme for Hybrid Machine-Human Consumption. 1-6 - Longjie Qi, Yue Ding, Hongtao Lu:

CGCUT: Unpaired Image-to-Image Translation via Cluster-Guided Contrastive Learning. 1-6 - Jiayi Lyu, Xing Lan, Guohong Hu, Hanyu Jiang, Wei Gan, Jian Xue:

ETAU: Towards Emotional Talking Head Generation Via Facial Action Unit. 1-6 - Yuhao Gao, Gensheng Pei, Mengmeng Sheng, Zeren Sun, Tao Chen, Yazhou Yao:

Relating CNN-Transformer Fusion Network for Remote Sensing Change Detection. 1-6 - Qianrui Teng, Rui Wang, Xing Cui, Peipei Li, Zhaofeng He:

Exploring 3D-aware Lifespan Face Aging via Disentangled Shape-Texture Representations. 1-6 - Xinlong Ding

, Hongwei Yu, Jiansheng Chen, Jinlong Wang, Jintai Du, Huimin Ma:
Invisible Pedestrians: Synthesizing Adversarial Clothing Textures To Evade Industrial Camera-Based 3D Detection. 1-6 - Zhiwei Dong

, Xi Zhu, Xiya Cao, Ran Ding, Caifa Zhou, Wei Li, Yongliang Wang, Qiangbo Liu:
BézierFormer: A Unified Architecture for 2D and 3D Lane Detection. 1-6 - Yicheng Pan, Zhenrong Zhang, Jiefeng Ma, Pengfei Hu, Jun Du, Qing Wang, Jianshu Zhang, Dan Liu, Si Wei:

Maths: Multimodal Transformer-Based Human-Readable Solver. 1-6 - Qi Li:

Parameter Efficient Fine-Tuning on Selective Parameters for Transformer-Based Pre-Trained Models. 1-6 - Jiancheng Huang, Mingfu Yan, Yifan Liu, Shifeng Chen:

Color-SD: Stable Diffusion Model Already has a Color Style Noisy Latent Space. 1-6 - Siyu Xing, Chen Gong, Hewei Guo, Xiao-Yu Zhang, Xinwen Hou, Yu Liu:

GAN Inversion for Image Editing via Unsupervised Domain Adaptation. 1-6 - Yongkang Ding, Rui Mao, Hanyue Zhu, Anqi Wang, Liyan Zhang:

Discriminative Pedestrian Features and Gated Channel Attention for Clothes-Changing Person Re-Identification. 1-6 - Zheng Wang, Bowen Tang, Yi Bin, Lei Zhu, Guoqing Wang, Yang Yang:

Shapley Ensemble Adversarial Attack. 1-6 - Haitao Cao, Baoping Cheng, Qiran Pu, Haocheng Zhang, Bin Luo, Yixiang Zhuang, Juncong Lin, Liyan Chen, Xuan Cheng:

DNPM: A Neural Parametric Model for the Synthesis of Facial Geometric Details. 1-6 - Yule Liu, Zhuben Dong, Shenglan Liu, Wujun Wen, Lin Feng:

Two-Step Temporal Divisive Clustering for Unsupervised Action Segmentation. 1-6 - Songlin Li, Xiuhong Li, Zhe Li, Hongbing Ma, Jiabao Sheng, Boyuan Li:

Dual Guidance Enhancing Camouflaged Object Detection via Focusing Boundary and Localization Representation. 1-6 - Xuanxi Chen, Ziqian Shao, Tong Lu:

SVT: Spectral Video Transformer for Video Restoration in Under-Display Camera. 1-6 - Pengfei Hu, Xiuzhe Wu, Yang Wu, Wenming Yang:

PortraitNeRF: A Single Neural Radiance Field for Complete and Coordinated Talking Portrait Generation. 1-6 - Shizhuo Deng, Da Teng, Zhubao Guo, Jiaqi Chen, Dongyue Chen, Tong Jia, Hao Wang:

Self-Supervised Federated Learning for Personalized Human Activity Recognition. 1-6 - Ke Cao, Xuanhua He, Keyu Yan, Tao Hu, Rui Li, Chengjun Xie, Jie Zhang:

Frequency Decomposition-Driven Network for JPEG Artifacts Removal. 1-6 - Xing Wei, Zhaoxin Ji, Bin Wen, Fan Yang, Chong Zhao, Yang Lu:

Unsupervised Multi-Target Domain Adaptation Incremental Method Based on Contrastive Learning. 1-6 - Zihang Huang

, Yukun Yang, Tianyu Zhao, Xin Yang:
A Noise Robust Framework via Uncertainty Guidance for Medical Image Segmentation with Noisy Label. 1-6 - Seunghwan Lee, Gwanmo Park, Hyewon Son, Jiwon Ryu, Han Joo Chae:

InFusionSurf: Refining Neural RGB-D Surface Reconstruction Using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning. 1-6 - Wuyang Chen, Kele Xu, Yong Dou, Tian Gao:

Voice-to-Face Generation: Couple of Self-Supervised Representation Learning with Diffusion Model. 1-6 - Xuan Wu

, Liang Chen, Ming Tan, Yi Wu:
Convolutional Modulation Feature Distillation Network for Image Super-resolution. 1-6 - Yuyang Ji, Lianlei Shan:

LDNET: Semantic Segmentation Of High-Resolution Images Via Learnable Patch Proposal And Dynamic Refinement. 1-6 - Xinyu Li, Xing Wang, Xiaoxiao Yang, Suping Wu, Xiangzheng Li, Xitie Zhang, Zhiyuan Zhou, Xiang Zhang:

Towards Accurate 3D Face Alignment Under Extreme Scenarios Via Multi-Granularity Perturbation Relearning. 1-6 - Xinyu Zhang, Hefei Huang, Xu Jia, Wenyue Chen, Dong Wang, Shengming Li, Huchuan Lu:

Multi-Stage Fusion for Event-based Multimodal Tracker. 1-6 - Xiaotong Chen, Shikui Wei, Gangjian Zhang, Yao Zhao:

Multi-granular Semantic Mining for Composed Image Retrieval. 1-6 - Hangjie Yi, Yuhang Ming, Dongjun Liu, Wanzeng Kong:

Time-Frequency Jointed Imperceptible Adversarial Attack to Brainprint Recognition with Deep Learning Models. 1-6 - Shibiao Xu, ShuChen Zheng, Wenhao Xu, Rongtao Xu, Changwei Wang

, Jiguang Zhang, Xiaoqiang Teng, Ao Li, Li Guo:
HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection. 1-6 - Zhangbin Qian, Jiawei Tan, Zhilong Ou, Hongxing Wang

:
CLIP-Driven Multi-Scale Instance Learning for Weakly Supervised Video Anomaly Detection. 1-6 - Haoquan Wang, Shengbo Chen, Xijun Wang

, Hong Rao, Yong Chen:
Defending Against Backdoor Attacks via Region Growing and Diffusion Model. 1-6 - Xiaorong Ma, Jiahe Tian, Yu Cai, Yesheng Chai, Zhaoxing Li, Jiao Dai, Liangjun Zang, Jizhong Han

:
HIDD: Human-perception-centric Incremental Deepfake Detection. 1-6 - Le Zhang, Tong Li, Yao Lu, Mixiao Hou, Guangming Lu:

Efficient U-Shape Invertible Neural Network for Image Steganography. 1-7 - Xin Liu, Yali Li, Shengjin Wang:

Representation Distillation for Efficient Self-Supervised Learning. 1-6 - Haoran Zhang, Xi Lin, Suxian Xiang, Chenxi Huang, Lvqing Yang, Yan Wang:

Boundary Contrast Domain Adaptation for Cross-modality Medical Image Segmentation. 1-6 - Chenyi Zhu, Dengshi Li, Aolei Chen, Yu Gao, Wei Li, Xi Wang:

Noise Adaptive Fine-grained Speech Intelligibility Enhancement With Soft-label Guided Diffusion. 1-6 - Jinyang An, Wanqian Zhang, Dayan Wu, Zheng Lin, Jingzi Gu, Weiping Wang

:
SD4Privacy: Exploiting Stable Diffusion for Protecting Facial Privacy. 1-6 - Yu Lu

, Kevin Bui, Roummel F. Marcia:
Alternating Direction Method of Multipliers for Negative Binomial Model with the Weighted Difference of Anisotropic and Isotropic Total Variation. 1-6 - Wenjing Wang, Si Li:

Focusing on All Refined Attention Regions for Noisy Label Facial Expression Recognition. 1-6 - Ziang Li

, Chengxiang Si, Zhenyu Cheng, Shuyuan Zhao, Yong Ding:
MTDM-MS: A Malicious Traffic Detection Model Based on Multi-Category Signals. 1-6 - Peilin Xiao, Yueyi Zhang, Dachun Kai, Yansong Peng, Zheyu Zhang, Xiaoyan Sun:

ESTME: Event-driven Spatio-temporal Motion Enhancement for Micro-Expression Recognition. 1-6 - Xiaolin Huang, Biqing Zeng, Jiahui Pan, Yujiang Yao, Zheng Zhou, Bingzhi Chen:

Ambiguity Consistency and Uncertainty Minimization for Semi-Supervised Medical Image Segmentation. 1-6 - Xiaoyu Qiu, Yuechen Wang, Jiaxin Shi, Wengang Zhou, Houqiang Li:

Cross-Lingual Transfer for Natural Language Inference via Multilingual Prompt Translator. 1-6 - Ben Chen, Xuechao Zou, Kai Li, Yu Zhang, Junliang Xing, Pin Tao:

High-Fidelity Lake Extraction Via Two-Stage Prompt Enhancement: Establishing A Novel Baseline and Benchmark. 1-6 - Yugan Chen, Lin Zhao, Yalong Xu, Honglei Zu, Xiaoqi An, Guangyu Li:

Domain Adaptive Pose Estimation Via Multi-level Alignment. 1-6 - Mengjiao Zhao, Mengting Ma, Xiangdong Li, Xiaowen Ma, Xinyu Wang, Ao Gao, Wei Zhang:

DuCoFPan: Dual-Condition Flow-based Network for Pan-sharpening. 1-6 - Laurie Van Bogaert, Armand Losfeld, Gauthier Lafruit, Mehrdad Teratani:

Single RGBD to Multilayer 3D Display Pipeline. 1-6 - Yanping Li, Zhaoshuai Qi, Xiuwei Zhang, Tao Zhuo, Yue Liang, Yanning Zhang:

Edge-Guided Detector-Free Network for Robust and Accurate Visible-Thermal Image Matching. 1-6 - Yulan Gao, Zhaoxiang Hou, Chengyi Yang, Zengxiang Li, Han Yu, Xiaoxiao Li:

The Prospect of Enhancing Large-Scale Heterogeneous Federated Learning with Foundation Models. 1-6 - Qingmao Wei, Bi Zeng, Guotian Zeng

:
Learning Motion Priors with DETR for Visual Tracking. 1-6 - Yang Zhang, Yue Zhou, Zonghao Yang, Ao Chen:

Cross-modal Prominent Fragments Enhancement Aligning Network for Image-text Retrieval. 1-6 - Qingzhi He

, Rong Quan, Weifeng Yang, Jie Qin:
Visual Feature Disentanglement for Zero-Shot Learning. 1-6 - Zenghao Guan, Yucan Zhou, Xiaoyan Gu, Bo Li:

GIE : Gradient Inversion with Embeddings. 1-6 - Haonan Lin, Wenbin An, Yan Chen, Feng Tian, Yuzhe Yao, Wei Ding, Qianying Wang, Ping Chen:

A Tri-Branch Network with Prototype-aware Matching for Universal Category Discovery. 1-6 - Xin Liu, Yue Xu, Kun He:

Improving the Sar Image Adversarial Transferability Through Dual-Loop Ensemble Gradient Attack. 1-6 - Yuan Liu

, Shu Wang, Zhe Qu, Xingyu Li, Shichao Kan, Jianxin Wang:
FedGCA: Global Consistent Augmentation Based Single-Source Federated Domain Generalization. 1-6 - Chengjie Wang, Chengming Xu, Zhenye Gan, Yuxi Li, Jianlong Hu, Wenbing Zhu, Lizhuang Ma:

PSPU: Enhanced Positive and Unlabeled Learning by Leveraging Pseudo Supervision. 1-6 - Fengqi Li, Mengchao Guo, Fengqiang Xu, Renxuan Xiong, Xiaohong Yan, Qian Sun, Deguang Wang:

STformer: Advancing Video Deraining Network Integrating with Spatial Transformers and Multiscale Feature Extraction. 1-6 - Chenqu Ren, Yeheng Shao, Haolei Qiu:

PVRF: Single-Plane and Single-Vector for Memory-Efficient Radiance Fields. 1-6 - Leqi Shen, Tao He, Sicheng Zhao, Zhelun Shen, Yuchen Guo, Tianshi Xu, Guiguang Ding:

X-ReID: Cross-Instance Transformer for Identity-Level Person Re-Identification. 1-6 - Liang Shi, Fuyong Xu, Ru Wang, Yongqing Wei, Guangjin Wang, Bao Wang, Peiyu Liu:

Information Aggregate and Sentiment Enhance Network to Handle Missing Modalities for Multimodal Sentiment Analysis. 1-6 - Sumei Li, Hangwei Liang, Mingxuan Xie, Xiaofei He:

Multi-Scale and Multi-Patch Aggregation Network Based on Dual-Column Vision Fusion for Image Aesthetics Assessment. 1-6 - Shuai Zhao, Tuo Li, Boyuan Zhang

, Yang Zhai, Ziyi Liu, Yahong Han:
Improving Transferability of Adversarial Examples with Adversaries Competition. 1-6 - Zhen Wang, Dianxi Shi, Chunping Qiu, Songchang Jin, Tongyue Li, Yanyan Shi:

ICF-Loc: An Infrared-Based Coarse-to-Fine Approach for UAV Visual Geolocation under GPS-Denied Environments. 1-6 - Yu Wu, Haiguang Wang, Mengxia Wu, Min Cao, Min Zhang:

LAIP: Learning Local Alignment from Image-Phrase Modeling for Text-based Person Search. 1-10 - Yuanwen Chen, Xinyao Zhang, Yaran Chen

, Dongbin Zhao
, Yunzhen Zhao, Zhe Zhao, Pengfei Hu:
Common Sense Language-Guided Exploration and Hierarchical Dense Perception for Instruction Following Embodied Agents. 1-6 - Qian Li, Cheng Wen, Rao Fu:

Improving Few-Shot Neural Radiance Field with Image Based Rendering. 1-6 - Liang He

, Zhida Song
, Shuanghong Liu, Mengqi Niu, Ying Hu, Hao Huang:
Speaker Recognition Based on Pre-Trained Model and Deep Clustering. 1-6 - Yige Wang, Risheng Huang, Haozhi Huang, Zongqing Lu:

FusionDreamer: Consistent Images Generation from Sparse-view Images. 1-6 - Shuoqian Wang, Mengbai Xiao, Yao Liu:

RoIRTC: Toward Region-of-Interest Reinforced Real-Time Video Communication. 1-6 - Yuejian Fang, Xiaodong Wang:

Enhancing Zero-shot 3D Photography via Mesh-represented Image Inpainting. 1-6 - Zezeng Li

, Weimin Wang, Ziliang Wang, Na Lei:
Point Cloud Compression via Constrained Optimal Transport. 1-6 - Jingmou Xian, Jian Zhu, Haolin Liao, Si Li:

Frequency-regularized Neural Representation Method for Sparse-view Tomographic Reconstruction. 1-6 - Ziliang Gan, Lei Jin, Lei Nie, Zheng Wang, Li Zhou, Liang Li, Zhecan Wang, Jianshu Li, Junliang Xing, Jian Zhao:

ASQuery: A Query-based Model for Action Segmentation. i-vi - Yufeng Wang, Wensen Feng, Haoqian Wang:

EyebrowNet: High-Precision Eyebrow Reconstruction and Matting. 1-6 - Bingzhi Chen, Haoming Zhou, Yishu Liu, Biqing Zeng, Jiahui Pan, Guangming Lu:

Enhancing Few-Shot Classification without Forgetting Through Multi-level Contrastive Constraints. 1-6 - Xiangwen Deng, Yufeng Wang, Yuanhao Cai, Jingxiang Sun, Yebin Liu, Haoqian Wang:

ITportrait: Image-Text Coupled 3D Portrait Domain Adaptation. 1-6 - Naifu Xue

, Qi Mao, Zijian Wang, Yuan Zhang, Siwei Ma:
Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer. 1-6 - Zhenqiang Zhang, Chuantao Li, Jian Song, Jialiang Lv, Chunxiao Wang, Zhigang Zhao, Jidong Huo:

STUI-NET: Semi-Supervised Transformer for Underwater Information Enhancement. 1-6 - Xiao Kang, Xingbo Liu, Xuening Zhang, Wen Xue, Xiushan Nie, Shaohua Wang, Yilong Yin:

Unsupervised Online Cross-modal Hashing With Multiple Association Exploitation. 1-6 - Yongkang Cheng, Mingjiang Liang, Shaoli Huang, Jifeng Ning, Wei Liu:

ExpGest: Expressive Speaker Generation Using Diffusion Model and Hybrid Audio-Text Guidance. 1-6 - Wudi Chen, Chao Zhang, Cheng Han, Yanjie Ma, Yongqing Cai:

Sttcnerf: Style Transfer of Neural Radiance Fields for 3d Scene Based on Texture Consistency Constraint. 1-6 - Anustup Choudhury

, Praneet Singh, Guan-Ming Su:
NeRVA: Joint Implicit Neural Representations for Videos and Audios. 1-6 - Shengjia Zhang, Suping Wu:

GFAvatar: A High-Quality Facial Avatar Reconstruction Method. 1-6 - Zejun He, Fei Chen, Fan Jiang, Wanling Liu, Zhangyan Ye:

A Dual-Branch Network Based on Connectivity Mask for Retinal Vessel Segmentation. 1-6 - Tianyang Dong, Huanbo Zhang, Hubin Kong, Shuqian Lv, Fenghao Li:

Align-RDW: Alignment-based Redirected Walking for Multi-User VR scenarios. 1-6 - Boyuan Li, Xiuhong Li, Songlin Li, Yuye Zhang, Kangwei Liu:

Adaptive Feature Fusion Network for Infrared Small Target Detection. 1-6 - Fan Dai, Yun Zhu, Yaqi Shen, Jin Xie, Jianjun Qian:

Dense Voxel Representation Network for Implicit Scene Completion. 1-6 - Jilin Tang, Lincheng Li, Xingqun Qi, Yingfeng Chen, Changjie Fan, Xin Yu

:
AS-NeRF: Learning Auxiliary Sampling for Generalizable Novel View Synthesis from Sparse Views. 1-6 - Huizhen Ji, Yaohua Zha, Qingmin Liao:

LR-MAE: Locate while Reconstructing with Masked Autoencoders for Point Cloud Self-supervised Learning. 1-6 - Tian Zhang, Kongming Liang, Ke Zhang, Zhanyu Ma:

Learning Conditional Prompt for Compositional Zero-Shot Learning. 1-6 - Zeqi Wu, Yuefeng Ma:

I2CL-ANE: A Novel Attribute Network Embedding based on Intra-Inter View Contrastive Learning. 1-6 - Njuod Alsudays, Jing Wu, Yu-Kun Lai, Ze Ji:

GRPSNET: Multi-Class Part Parsing Based on Graph Reasoning. 1-10 - Yung-Wei Fan, Sheng-Chun Huang, Shao-Yi Chien:

Graph Attention Convolutional Network for 3D Human Pose and Shape Estimation from Point Clouds. 1-6 - Zhiwei Xiong, Yunfan Zhang, Zhiqi Shen, Peiran Ren, Han Yu:

Multi-modal Learnable Queries for Image Aesthetics Assessment. 1-6 - Jiacheng Ruan, Jingsheng Gao, Mingye Xie, Daize Dong, Suncheng Xiang, Ting Liu, Yuzhuo Fu:

iDAT: inverse Distillation Adapter-Tuning. 1-6 - Yihang Zhang, Yun Liang, Shitong Weng, Hai Lin, Liping Chen, Shenlong Zheng:

Hierarchical Temporal Attention and Competent Teacher Network for Sound Event Detection. 1-6 - Dongze Hao, Qunbo Wang, Jing Liu:

Semantic-Visual Graph Reasoning for Visual Dialog. 1-6 - Haoran Jiang, Xiangjie Wang, Junjie Zhang, Jian Zhang

, Dan Zeng:
DSENet: An Object-Wise Density-Informed Coarse-to-Fine Object Detector for Aerial Image. 1-6 - Xicheng Chen, Haibo Ye, Fangyu Zhou:

Class-Aware Feature Perturbation for Long-Tailed Visual Recognition. 1-6 - Yuhang Cheng

, Ziyang Fan, Hongyu Wu, Xiaogang Wang:
High-Order Differential Regularizing Implicit Surface Representation of Point Cloud. 1-6 - Yaxin Liu, Yan Zhou, Ziming Li, Jinchuan Zhang, Yu Shang, Chenyang Zhang, Songlin Hu:

RNG: Reducing Multi-level Noise and Multi-grained Semantic Gap for Joint Multimodal Aspect-Sentiment Analysis. 1-6 - Haoyu Deng, Yanmei Fang, Fangjun Huang:

Enhancing Adversarial Transferability on Vision Transformer by Permutation-Invariant Attacks. 1-6 - Pengyu Wang, Jianmin Li, Wenbo Ding, Jiachen Zhong, Jianyong Ai:

Correcting Pseudo Labels in Semi Supervised Object Detection with SAM. 1-6 - Min Zhang, Zifeng Zhuang, Zhitao Wang, Donglin Wang:

RotoGBML: Towards Out-of-distribution Generalization for Gradient-based Meta-learning. 1-6 - Qi Jia, Zikun Zhao, Xiaomei Feng, Jinyuan Liu, Yu Liu, Xinwei Xue:

Joint edge detection learning for recurrent homography estimation. 1-6 - Xuewei Li, Yujie Diao, Mei Yu, Chenhan Wang, Jie Gao, Ruiguo Yu:

Area Intervention for Enhancing Class Activation Maps in Weakly Supervised Semantic Segmentation. 1-6 - Kang Zhu, Cunhang Fan, Jianhua Tao, Jun Xue, Heng Xie, Xuefei Liu, Yongwei Li, Zhengqi Wen, Zhao Lv:

Dual-View Multimodal Interaction in Multimodal Sentiment Analysis. 1-6 - Wang Yang, Lingchen Zhao, Dengpan Ye:

Reputation Defender: Local Black-Box Adversarial Attack against Image-Translation-Based DeepFake. 1-6 - Haifei Duan, Shenglan Liu, Chenwei Tan, Yuning Ding, Jirui Tian

, Feilong Wang:
Decoupling Spatio-Temporal Network for Fine-Grained Temporal Action Segmentation. 1-6 - Wenjing Zhu, Sining Sun, Changhao Shan, Peng Fan

, Qing Yang:
Skipformer: A Skip-and-Recover Strategy for Efficient Speech Recognition. 1-6 - Hao Li, Jinlong Wang, Hanxiang Yang, Xiongxin Tang, Fanjiang Xu:

Learning Semantic-aware Retinex Network with Spatial-Frequency Interaction for Low-light Image Enhancement. 1-6 - Yuxin Huang, Yiwei Yuan, Xiangyu Zeng, Ling Xie, Yiyu Fu, Guanghui Yue, Baoquan Zhao:

Full-Reference Motion Quality Assessment Based on Efficient Monocular Parametric 3D Human Body Reconstruction. 1-6 - Liang Zhao, Yukun Yuan, Qiongjie Xie, Ziyue Wang:

Anchor Based Multi-view Clustering for Partially View-Aligned Data. 1-5 - Li Fang, Kaijun Zou, Zhiye Chen, Long Ye:

HMDST: A Hybrid Model-Data Driven Approach for Spatio-Temporally Consistent Video Inpainting. 1-6 - Ke Chen, Zhihua Huang, Kexin Lu, Yonghong Yan:

CosDiff: Code-Switching TTS Model Based on A Multi-Task DDIM. 1-6 - Changsheng Chen, Yongyi Deng, Liangwei Lin, Zitong Yu, Zhimao Lai:

Multi-Modal Document Presentation Attack Detection with Forensics Trace Disentanglement. 1-8 - Hongjing Su, Fuxiang Lu:

HctMAE: Hybrid Convolution-Transformer Meets Masked Autoencoder for Plant Recognition. 1-6 - Chengxiang Fan

, Aohong Shen, Zhen Han, Cai Tong, Zhongyuan Wang, Dekang Yi:
Dual-Domain Multi-Model GAN Fingerprint Restoration for Compressed Fake Face Attribution. 1-6 - Zijian Zhang, Ruiguo Yu, Xi Wei, Jie Gao, Mei Yu, Xuewei Li, Zhiqiang Liu:

Unsupervised Domain Adaptation Semantic Segmentation on Thyroid Ultrasound Images Based on Task-Oriented Feature Disentanglement. 1-6 - Chenglin Liu, Binquan Wang, Ming Zhu:

ReCo-CXR: A Self-Supervised Pre-Training Framework for Pulmonary Nodule Detection in X-Ray Images. 1-6 - Yang Yu, Chen Xu, Kai Wang:

TS-SAM: Fine-Tuning Segment-Anything Model for Downstream Tasks. 1-6 - Yaoxin Wu, Hongwei Ding, Yunqi Liu, Zerui Wen, Xiaohui Cui:

Synthetic Data Augmentation for Infrared Small Target Detection via Exploring Frequency Components and Targets Prior. 1-6 - Xuewan He, Jielei Wang

, Qianxin Xia, Guoming Lu, Yuan Tang, Hongxia Lu:
Cross-Domain Feature Semantic Calibration for Zero-Shot Sketch-Based Image Retrieval. 1-6 - Dan Yang, Xiuhong Li, Zhe Li, Chenyu Zhou, Xiaofan Wang, Fan Chen:

Prompt Fusion Interaction Transformer For Aspect-Based Multimodal Sentiment Analysis. 1-6 - Kaihao Lin, Guoqing Wang, Yuhui Wu, Shuhang Gu, Xing Xu, Yang Yang:

Domain Prompt Learning Framework for Real Image Dehazing. 1-6 - Jintai Du, Jinlong Wang, Jiansheng Chen, Xinlong Ding

, Jiehui Wu, Tianyu Hu, Huimin Ma:
Analyzing Behavior and Intention in Multi-Agent Systems Using Graph Neural Networks. 1-6 - Haichuan Song, Zhihong Zheng, Zhizhong Zhang, Yuan Xie, Guchu Zou

, Zhenyi Qi, Xin Tan:
Mutual Positive and Negative Learning for Weakly-supervised Point Cloud Semantic Segmentation. 1-6 - Zhao Wu, Dunbo Ning, Wenjing Chen, Hao Sun, Wei Xie, Ming Dong:

Spatial Dual Context Learning for Weakly-supervised Group Activity Recognition in Still-images. 1-6 - Taizhang Hu, Fan Yang, Xing Wei, Chong Zhao, Li Meng, Bin Wen, Yang Lu:

BTC: Bilateral-Branch Vision Transformer via Hilbert Patch Embedding for Image Clustering. 1-6 - Chenbin Zhang, Zhiqiang Hu, Shuyu Dai, Qingyuan He, Defeng Liu, Kun Yan, Ping Wang:

Boundary-Aware Contrastive Learning for Single-Source Domain Generalization in Medical Image Segmentation. 1-6 - Honghui Xu, Yueqian Quan, Chuangjie Fang, Jianwei Zheng:

Robust Principal Component Analysis via High-Order Self-Learning Transform Tensor Nuclear Norm. 1-6 - Yuming Yang, Dongsheng Zou:

AdaStyleSpeech: A Fast Stylized Speech Synthesis Model Based on Adaptive Instance Normalization. 1-6 - Xinfa Zhu, Yuke Li, Yi Lei, Ning Jiang, Guoqing Zhao, Lei Xie:

Boosting Multi-Speaker Expressive Speech Synthesis with Semi-Supervised Contrastive Learning. 1-6 - Bo Kong

, Shengquan Liu, Liang He
, Liruizhi Jia
, Yi Liang
:
CSMA-CNER: Multi-modal Chinese NER task with Cross- and Self-Modality Attention. 1-6 - Rao Fu, Qian Li, Cheng Wen, Ning An, Fulin Tang:

A Region-Growing Supervised Geometry-Weighted Transformer for Normal Estimation. 1-6 - Haozheng Zhang, Yanhong Yang, Zhixuan Jing, Shengyong Chen:

DA-LGNet: Enhancing Spatial-Spectral feature representation with Dual-Attention Local-General Network for Hyperspectral images and Multispectral images Fusion. 1-6 - Gongxin Yao, Xinyang Li, Yixin Xuan, Yu Pan:

MaFreeI2P: A Matching-Free Image-to-Point Cloud Registration Paradigm with Active Camera Pose Retrieval. 1-6 - Luojun Lin, Qipeng Liu, Xiangwei Zheng, Zheng Lin:

Slow-Fast Adaptation for Source-Free Object Detection. 1-6 - Shaoqi Yu, Lili Chen, Xiaolin Zhang, Jiamao Li:

VTR: Bidirectional Video-Textual Transmission Rail for CLIP-based Video Recognition. 1-6 - Yulin He, Wei Chen, Zhengfa Liang, Ke Liang, Yusong Tan, Tianrui Liu, Yulan Guo:

Don't Turn a Blind Eye to Localization Noise: Localization Pseudo-label Correction and Learning for Semi-Supervised Object Detection. 1-6 - Qin Lei, Rui Yang, Jiang Zhong, Rongzhen Li, Muyang He, Mianxiong Dong, Kaoru Ota:

Expanding Crack Segmentation Dataset with Crack Growth Simulation and Feature Space Diversity. 1-6 - Yuan-Yuan Liu, Song-Lu Chen, Qi Liu, Feng Chen, Xu-Cheng Yin:

Towards Low-resource License Plate Recognition via Feature Shuffling. 1-6 - Yadang Chen, Wentao Zhu, Zhi-Xin Yang, Enhua Wu:

Space-time Reinforcement Network for Video Object Segmentation. 1-6 - Ying Tang, Wei Yang, Junqing Yu, Zikai Song:

Agnostic Feature Compression with Semantic Guided Channel Importance Analysis. 1-6 - Yuwei Feng, Gang Zhou, Sen Yang, Jiang Zhang, Jing Ma, Zhenhong Jia:

Intermediate Domain Meets Natural Hazy Tracking. 1-6 - Fanxu Min, Shaoxiang Guo

, Hao Fan, Junyu Dong:
GaitMA: Pose-guided Multi-modal Feature Fusion for Gait Recognition. 1-6 - Lintao Zhang, Xiangcheng Du, LeoWu TomyEnrique, Yiqun Wang, Yingbin Zheng, Cheng Jin:

Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling. 1-6 - Ru Zhen, Xingtao Zhang, Chao Min, Biao Li:

Winner Takes It All: An Efficient Overlap-Aware Hybrid Online Diarization with Partial Backtracking Mechanism. 1-6 - Yucheng Shu, Jiaxin Xie, Lihong Qiao, Bin Xiao, Weisheng Li, Xinbo Gao:

C3T: Contrastive Consistency Cross-Network Learning for Semi-Supervised Semantic Segmentation. 1-6 - Mingyu Wu, Zhiyi Tan, Bing-Kun Bao:

Inferring the effectiveness of epidemic prevention measures based on spatial heterogeneity modeling. 1-6 - Jiawei Feng, Ruomei Wang, Mingyang Liu, Yuanmao Luo, Fuwei Zhang:

Frequency-Domain Enhanced Cross-modal Interaction Mechanism for Joint Video Moment Retrieval and Highlight Detection. 1-8 - Xueqiang Sun, Jin Wang, Jiade Chen, Yunhui Shi, Nam Ling, Baocai Yin:

MC-PCGC: A Space-Channel Mixed Contextual Coding for Point Cloud Geometry Compression. 1-6 - Siqi Deng, Liu Yang:

Enhancing Consistent Federated Learning Objectives Through Uniform Feature Distributions. 1-6 - Ziheng Xu, Jianwei Niu

, Qingfeng Li, Tao Ren, Chen Chen:
NID-SLAM: Neural Implicit Representation-based RGB-D SLAM In Dynamic Environments. 1-6 - Ruiting Wang, Enguang Zuo, Chen Chen, Cheng Chen, Junyi Yan

, Jie Zhong, Ziwei Yan, Xiaoyi Lv:
SMAE: A Split Masked Graph Autoencoder. 1-6 - Xiaolong Wang, Ping Hu, Rongyao Hu, Xiaofeng Zhu:

GATrack: Group-Aware features for multiple object tracking. 1-6 - Tao He, Leqi Shen, Guiguang Ding, Zhiheng Zhou, Tianshi Xu, Xiaofeng Jin, Yuheng Huang:

Balanced Active Sampling for Person Re-identification. 1-6 - Xin Yan, Chi-Man Pun, Haolun Li, Mengqi Liu, Hao Gao:

Hierarchical Local Temporal Feature Enhancing for Transformer-Based 3D Human Pose Estimation. 1-6 - Bosheng Qin, Juncheng Li, Siliang Tang

, Tat-Seng Chua, Yueting Zhuang:
InstructVid2Vid: Controllable Video Editing with Natural Language Instructions. 1-6 - Hao-Yuan Ma, Li Zhang:

Multi-head multi-scale pixel localization network for crowd counting with highly dense and small-scale samples. 1-5 - Penghui Wen, Kun Hu, Dong Yuan, Zhiyuan Ning, Changyang Li, Zhiyong Wang:

Radio Frequency Signal based Human Silhouette Segmentation: A Sequential Diffusion Approach. 1-6 - Dongmei Zhang, Ray Zhang, Fan Yang, Yuan Li, Huizhu Jia, Xiaodong Xie, Shanghang Zhang:

VLUReID: Exploiting Vision-Language Knowledge for Unsupervised Person Re-Identification. 1-6 - Xiaoli Tang, Han Yu, Xiaoxiao Li:

Agent-Oriented Joint Decision Support for Data Owners in Auction-Based Federated Learning. 1-6 - Xuewei Liu, Shaofei Huang, Ruipu Wu, Hengyuan Zhao, Duo Xu, Xiaoming Wei, Jizhong Han

, Si Liu:
Reference Prompted Model Adaptation for Referring Camouflaged Object Detection. 1-6 - Ching-Chia Kao, Cheng-Yi Lee

, Chun-Shien Lu, Chia-Mu Yu, Chu-Song Chen:
On the Higher Moment Disparity of Backdoor Attacks. 1-6 - Liyan Guo, Kaiyu Song, Mengying Xu, Hanjiang Lai

:
DNAF: Diffusion with Noise-Aware Feature for Pose-Guided Person Image Synthesis. 1-6 - Cai Yu, Shan Jia, Xiaomeng Fu, Jin Liu, Jiahe Tian, Jiao Dai, Xi Wang, Siwei Lyu, Jizhong Han

:
Explicit Correlation Learning for Generalizable Cross-Modal Deepfake Detection. 1-6 - Wang Xia, Yao Lu, Shunzhou Wang, Wenjing Wang, Ziqi Wang, Peiqi Xia:

Omni Spatial-Angular Correlations Exploration for Light Field Image Super-Resolution. 1-6 - Zhiyi Pan, Guoqing Liu, Wei Gao, Thomas H. Li:

EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding. 1-6 - Yuxiang Yang

, Lu Wen, Yuanyuan Xu, Jiliu Zhou, Yan Wang:
Adaptive Prompt Learning with Negative Textual Semantics and Uncertainty Modeling for Universal Multi-Source Domain Adaptation. 1-6 - Donghui Zhang, Xiaobing Li, Di Lu, Yun Tie, Yan Gao, Lin Qi:

Multitrack Emotion-Based Music Generation Network Using Continuous Symbolic Features. 1-6 - Ying Zhong

, Ke-Ao Zhao, Leping Zhang, Fangming Zhao, Wentao Wei, Feilin Han:
The Correlation Analysis Between Cybersickness and Postural Behavior in Immersive VR Experience. 1-6 - Chengxin Zhao, Hefei Ling, Sijing Xie, Han Fang, Yaokun Fang, Nan Sun:

SSyncOA: Self-synchronizing Object-aligned Watermarking to Resist Crop-paste Attacks. 1-6 - Ning Pang, Wansen Wu, Yue Hu, Kai Xu, Quanjun Yin, Long Qin:

Enhancing Multimodal Sentiment Analysis via Learning from Large Language Model. 1-6 - Yifang Xu

, Yunzhuo Sun, Benxiang Zhai, Zien Xie, Youyao Jia, Sidan Du:
Multi-Modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection. 1-6 - Shunkai Zhou, Canlong Zhang, Zhixin Li, Zhiwen Wang, Chunrong Wei:

Person Re-identification utilizing Text to Search Video. 1-6 - Shuo Zhang

, Xiongpeng Hu, Jing Liu:
TranBF: Deep Transformer Networks and Bayesian Filtering for Time Series Anomalous Signal Detection in Cyber-physical Systems. 1-6 - Jin Wang, Yahong Han:

Symmetrical Two-Stream with Selective Sampling for Diversifying Video Captions. 1-6 - Hao Wu, Ke Lu, Yuqiu Li, Junhao Huang, Jian Xue:

MISTA: A Large-Scale Dataset for Multi-Modal Instruction Tuning on Aerial Images. 1-6 - Zhaochen Li, Kedian Mu:

Disentangling and Aggregating: A Data-Centric Training Framework for Cross-Domain Few-Shot Classification. 1-6 - Peng Yan, Guodong Long:

Client-Supervised Federated Learning: Towards One-Model-for-All Personalization. 1-6 - Zhiyu Zhang, Guo Lu, Huanxiong Liang, Anni Tang, Qiang Hu, Li Song:

Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization. 1-6 - Qi Li, Yucan Zhou, Jiang Zhou, Xiaoyan Gu, Bo Li:

Tackling Feature Skew in Heterogeneous Federated Learning with Semantic Enhancement. 1-6 - Shuwen Yang, Tianyu Huai, Anran Wu

, Xingjiao Wu, Wenxin Hu, Liang He:
Enhancing Out-of-Distribution Generalization in VQA through Gini Impurity-guided Adaptive Margin Loss. 1-6 - Yiwei Lou, Jiayu Zhang, Dexuan Xu, Yongzhi Cao

, Hanpin Wang, Yu Huang:
No-Reference MRI Quality Assessment via Contrastive Representation: Spatial and Frequency Domain Perspectives. 1-6 - Ren Nie, Jin Ding, Lingxiao He, Xue Zhou:

Latent Distribution Alignment for Domain Generalizable Person Re-identification. 1-6 - Yan Li, Qiong Wang:

Leveraging Hybrid Referring Expressions for Referring Video Object Segmentation. 1-6 - Bingyu Duan, Wanqian Zhang, Dayan Wu, Zheng Lin, Jingzi Gu, Weiping Wang

:
Exploiting Vision-Language Model for Visible-Infrared Person Re-identification via Textual Modality Alignment. 1-6 - Liman Wang, Hanyang Zhong:

FENet: Focusing Enhanced Network for Lane Detection. 1-6 - Bo Qian, Yang Wen, Bin Sheng:

Self-Paced Co-Training and Foundation Model for Semi-Supervised Medical Image Segmentation. 1-6 - Jingjing Lu, Yunchuan Qin, Fan Wu, Zhizhong Liu, Kenli Li, Ruihui Li:

DeformingNet: Deforming Multiple Uniform 3D Priors for 3D Point Cloud Completion. 1-6 - Zhaozhi Xie, Bochen Guan, Weihao Jiang, Muyang Yi, Yue Ding, Hongtao Lu, Lei Zhang:

PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation. 1-6 - Xianpeng Cao, Weixing Xie, Xianxing Cao, Qiqin Lin, Rongzhou Zhou, Junfeng Yao, Qingqi Hong:

ICR-Net: Semi-Supervised Medical Image Segmentation Guided By Intra-Sample Cross Reconstruction. 1-6 - Yu Wang, Bingchen Zhao, Yongchun Lu, Guoqiang Xiao, Quan Lu:

Debiased Prototypical Learning Improves Generalized Category Discovery. 1-6 - Mohan Chen, Yiren Zhang, Jueqi Wei, Yuejie Zhang, Rui Feng, Tao Zhang, Shang Gao:

Temporal Feature Aggregation for Efficient 2D Video Grounding. 1-6 - Na Jiang, Yuxuan Qiu, Wei Song, Jiawei Liu, Zhiping Shi, Liyang Wang:

Joint Visual-Textual Reasoning and Visible-Infrared Modality Alignment for Person Re-Identification. 1-6 - Yongsheng Yu, Jiebo Luo

:
Chain-of-Thought Prompting for Demographic Inference with Large Multimodal Models. 1-7 - You Wu, Zhixin Li:

Mining Similarity Relationships for Unsupervised Cross-Modal Hashing. 1-6 - Yeheng Zhu, Zhijian Wu, Jun Li, Jianhua Xu:

HURDNet: Heterogeneous UNet Structure With Range-Null Space Decomposition for Hyperspectral Image Reconstruction. 1-6 - Hui Wang

, Jie Sun, Tianyu Wo, Xudong Liu:
FedFRR: Federated Forgetting-Resistant Representation Learning. 1-6 - Peiming Lin, Sumei Li, Zilin Zhao, Huilin Zhang

:
I2GSRnet: Iterative Interaction Guidance Network for Stereo Image Super-Resolution. 1-6 - Ruitao Xie, Limai Jiang, Xiaoxi He, Yi Pan, Yunpeng Cai:

A Weakly Supervised and Globally Explainable Learning Framework for Brain Tumor Segmentation. 1-6 - Yaoxin Li, Deepak Sridhar, Hanwen Liang, Alexander Wong:

Spot the Difference! Temporal Coarse to Fine to Finer Difference Spotting for Action Recognition in Videos. 1-6 - Yong Tang, Qiang Huang, Yingying Zhu:

C2F-CCPE: Coarse-to-Fine Cross-View Camera Pose Estimation. 1-6 - Zicheng Zhang, Yu Fan, Wei Sun, Xiongkuo Min, Xiaohong Liu

, Chunyi Li, Haoning Wu, Weisi Lin, Ning Liu, Guangtao Zhai:
Optimizing Projection-Based Point Cloud Quality Assessment with Human Preferred Viewpoints Selection. 1-6 - Kang Xiao, Xu Wang, Yulin He, Baoliang Chen, Xuelin Shen:

Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement. 1-6 - Haoran Zhang, Xiangdong Su, Xingxiang Zhou, Guanglai Gao:

MEMix: Improving HMER with Diverse Formula Structure Augmentation. 1-6 - Yichi Zhang

, Zhihao Duan, Yuning Huang, Fengqing Zhu:
Theoretical Bound-Guided Hierarchical Vae For Neural Image Codecs. 1-6 - Yanyu Li, Jiangbo Xu, Ruoyu Zou:

Research on Image Aesthetic Assessment based on Graph Convolutional Network. 1-6 - Tiancheng Zhang, Xinyi Zhang:

Multi-contrast MRI Reconstruction with Deformable Attention and Invertible Network. 1-6 - Pei Wang

, Yun Yang, Zhenyu Yu
:
Multi-batch Nuclear-norm Adversarial Network for Unsupervised Domain Adaptation. 1-6 - Helin Zhao, Wei Chen, Peng Zhou

:
Deep Self-paced Active Learning for Image Clustering. 1-6 - Rui Ma, Mengxi Guo, Peidong Jia, Chenxuan Li, Yi Hou, Yuan Li, Xiaodong Xie, Shanghang Zhang:

Enhanced Blind Watermarking Against Black-Box Noise: Leveraging CIN Framework. 1-6 - Xiaolin Chen, Daoguang Zan, Wei Li, Bei Guan, Yongji Wang:

FIA-TE: Feature Inference Attack on Decision Tree Ensembles in Vertical Federated Learning. 1-6 - Yuzhou Zhao, Xinyu Zhou, Haijing Guo, Qianyu Guo, Yan Zuo, Shaoli Song, Shuyong Gao, Wenqiang Zhang:

Attention in Attention for PET-CT Modality Consensus Lung Tumor Segmentation. 1-7 - Bingzhi Chen, Shuobin Lin, Yishu Liu, Zheng Zhang, Guangming Lu, Lewei He:

Rethinking Adversarial Robustness Distillation VIA Strength-Dependent Adaptive Regularization. 1-6 - Meng Wang, Yue Qi:

Efficient Sampling and Volume Rendering Strategy for Neural Field SLAM. 1-6 - Md. Ershadul Haque

, Manoranjan Paul
:
Block-Wise Compression Of The Quantum Gray-Scale Image Using Lossy Preparation Approach. 1-6 - Jorge Kessler-Martín

, Pablo Fernández-Lagos
, David García-Lucas, Gabriel Cebrián-Márquez
, Belén Ríos-Sánchez, Guillermo Vigueras, Antonio Jesús Díaz-Honrubia:
Saliency Dataset and Predictive Model for Areas of Interest in VVC Perceptual Coding. 1-6 - Qiancheng Yang, Yong Luo, Bo Du:

Training-Free Robust Neural Network Search Via Pruning. 1-6 - Xianzhou Zeng, Hao Qin, Ming Kong, Luyuan Chen

, Qiang Zhu:
Probablistic Restoration with Adaptive Noise Sampling for 3D Human Pose Estimation. 1-6 - Yi Pan, Jun-Jie Huang, Zihan Chen, Wentao Zhao, Ziyue Wang:

SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks. 1-6 - Jiazhe Miao, Tao Peng, Fei Fang, Xinrong Hu, Ping Zhu, Feng Yu, Minghua Jiang:

SmPhy: Generating smooth and physically plausible 3D garment animations. 1-6 - Rui Zhang

, Junxiao Xue, Feng Lin, Qing Zhang, Pavel Smirnov, Xiao Ma
, Xiaoran Yan:
Enhancing Human Action Recognition with Fine-grained Body Movement Attention. 1-6 - Yuting Hu, Yue Ming, Panzi Zhao, Boyang Lyu, Kai Hong:

LMGSNet: A Lightweight Multi-scale Group Shift Fusion Network for Low-quality 3D Face Recognition. 1-6 - Hao Niu, Yun Xiong, Xiaosu Wang, Biao Yang, Yao Zhang:

How Does Textual Information Selection Influence Time Series Forecasting? A Cross-modal Perspective on Financial Volatility Prediction. 1-6 - Sanhita Pathak, Vinay Kaushik, Brejesh Lall:

Single Stage Warped Cloth Learning and Semantic-Contextual Attention Feature Fusion for Virtual Tryon. 1-6 - Beibei Li, Beihong Jin, Yisong Yu, Yiyuan Zheng, Jiageng Song, Wei Zhuo, Tao Xiang:

Orthogonal Hyper-category Guided Multi-interest Elicitation for Micro-video Matching. 1-6 - Wenwen Zhang, Jie Lian, Bingying Dong:

Multi-Scale Position-Aware Cell Nucleus Mask Attention for Tumor Budding Detection. 1-6 - Ming Guo

, Wenrui Li, Chao Wang, Yuxin Ge, Chongjun Wang:
Smile: Spiking Multi-Modal Interactive Label-Guided Enhancement Network for Emotion Recognition. 1-6 - Yucheng Shu, Longjin Cheng, Bin Xiao, Lihong Qiao, Weisheng Li, Xinbo Gao:

Focal-Guided Multi-Consistency for Unsupervised Partial-to-Partial Point Cloud Registration. 1-6 - Si Li, Jiaxing Liu, Peilin Li, Dichucheng Li, Xinlu Liu, Yongwei Gao, Wei Li:

Improving Drum Source Separation with Temporal-Frequency Statistical Descriptors. 1-6 - Zexian Yang, Dayan Wu, Wanqian Zhang, Jingzi Gu, Zheng Lin, Weiping Wang

:
Privacy-Preserving Replay and Adaptive Relation Distillation for Camera Incremental Person Re-Identification. 1-6 - Dingbang Li, Wenzhou Chen, Xin Lin:

Tina: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation. 1-6 - Bo Gao

, Junchi Ren, Fei Shen, Mengwan Wei, Zijun Huang:
Exploring Warping-Guided Features via Adaptive Latent Diffusion Model for Virtual try-on. 1-6 - Lichao Cui, Shanliang Yang:

Enhancing Multimodal Sentiment Recognition Based on Cross-Modal Contrastive Learning. 1-6 - Wen Xue, Xingbo Liu, Xiao Kang, Xuening Zhang, Xiushan Nie, Shaohua Wang, Yilong Yin:

Fast Multi-view Clustering With Binary Anchor Graph. 1-6 - Xuening Zhang

, Xingbo Liu, Xiao Kang, Wen Xue, Xiushan Nie, Shaohua Wang, Yilong Yin:
Completely Unpaired Cross-Modal Hashing Based on Coupled Subspace. 1-6 - Shuang Liang, Long Zhang, Chi Xie

, Lili Chen:
Causal Intervention for Panoptic Scene Graph Generation. 1-6 - Xu Wang, Kairui Zhang:

Adaptive Style Transfer Learning for Generalizable Person Re-identification. 1-6 - Fengyuan Zhang, Zhaopei Huang, Xinjie Zhang, Qin Jin:

Adaptive Temporal Motion Guided Graph Convolution Network for Micro-expression Recognition. 1-6 - Yongjie Guo, Siya Chen, Hongjian You:

Continual Semantic Segmentation via Mask-Based Class Rebalancing. 1-6 - Menglong Yang, Hanyong Wang, Yang Ren:

A Self-Attention Network for Stereo Matching. 1-10 - Ke Jia, Yonghong Song, Xiaomeng Wu, You Su:

Video Anomaly Detection Via Self-Supervised Learning With Frame Interval and Rotation Prediction. 1-6 - Weilong Peng, Yi Luo, Keke Tang, Kongyang Chen, Yangtao Wang

, Ping Li, Meie Fang:
IE-aware Consistency Losses for Detailed 3D Face Reconstruction from Multiple Images in the Wild. 1-6 - Zhenggang Yang, Faming Fang, Qiaosi Yi, Guixu Zhang, Fang Li:

HFF-Net: A High-Frequency Fidelity Model for Accelerated Parallel MRI Reconstruction. 1-6 - Zhihang Wei, Jinxin Shi, Jing Yang, Jiabao Zhao:

VIP-FSCIL: A More Robust Approach for FSCIL. 1-6 - Meng Pang, Binghui Wang, Nanrun Zhou, Yintao Zhou, Wei Huang:

Reconstructing Prototype From Contaminated Face With Variations Across Heterogeneous Domains. 1-6 - Xiaoke Yang, Haixu Song, Xiangyu Lu, Shao-Lun Huang, Yueqi Duan:

AdaForensics: Learning A Characteristic-aware Adaptive Deepfake Detector. 1-6 - Kangnan Bai, Pengyi Gao, Kai Chen, Xin Nie, Shenghui Li, Bingqian Li:

Mutual Compromised Multi-feature Fusion Method for Cross-modal Hashing Retrieval. 1-7 - Qingfeng Wang, Lingyu Liang, Shuangping Huang:

Document Image Dewarping Guided by 3D Geometry and Layout Priors. 1-6 - Xuechun Wang, Wentao Chao, Fuqing Duan:

Point Cloud Reconstruction Optimization of Light Field Image based on Intra-class Distance. 1-6 - Jiawei Zhu, Meirong Ding, Yishu Liu, Biqing Zeng, Guangming Lu, Bingzhi Chen:

Robust Visual Question Answering With Contrastive-Adversarial Consistency Constraints. 1-6 - Zhuoxin Chen, Zhenyu Wu, Yang Ji:

Decoupled Federated Learning on Long-Tailed and Non-IID data with Feature Statistics. 1-6 - Kangwei Liu, Xiuhong Li, Boyuan Li, Yuye Zhang, Chao Che:

Lightweight Camouflaged Object Detection Network Based on Feature Complementation and Enhancement. 1-6 - Lei Wang, Tianfu Cai, Pinyi Huang, Xiyao Liu, Wangyang Cai:

Two-Stage Facial Expression Spotting with Spectrum-Based Post-Processing. 1-6 - Yanjie Sun, Kele Xu, Yong Dou, Tian Gao:

Self-Supervised Learning-Based General Fine-tuning Framework For Audio Classification and Event Detection. 1-6 - Wenxin Liang, Bingkai Liu, Han Liu, Hong Yu:

Boosting Node Injection Attack with Graph Local Sparsity. 1-6 - Xu Wang

, Yanxia Wu, Ye Yuan
, Yan Fu, Xue Zhang:
Unpaired image despeckling based on adversarial speckle generation. 1-6 - Minglang Huang, Yiyi Zhou, Gen Luo, Guannan Jiang, Weilin Zhuang, Xiaoshuai Sun:

Towards Omni-supervised Referring Expression Segmentation. 1-6 - Youqian Zhang

, Chunxi Yang, Eugene Yujun Fu, Qinhong Jiang, Chen Yan, Sze-Yiu Chau, Grace Ngai, Hong Va Leong, Xiapu Luo, Wenyuan Xu:
Understanding Impacts of Electromagnetic Signal Injection Attacks on Object Detection. 1-6 - Baotong Su, Siyan Li, Wenguang Zheng, Yao Chen:

SFDE-net: A Spatial-Frequency Domain Feature Enhancement Network for Cloud Detection. 1-6 - Yuxuan Sun, Chenglu Zhu, Sunyi Zheng

, Yunlong Zhang, Honglin Li, Lin Yang:
Context-Aware Text-Assisted Multimodal Framework for Cervical Cytology Cell Diagnosis and Chatting. 1-6 - Zhongzhan Huang, Senwei Liang, Mingfu Liang, Wei He, Haizhao Yang

, Liang Lin:
Lottery Ticket Hypothesis for Attention Mechanism in Residual Convolutional Neural Network*. 1-6 - Jicheng Yuan, Anh Le-Tuan, Manfred Hauswirth, Danh Le Phuoc:

Cooperative Students: Navigating Unsupervised Domain Adaptation in Nighttime Object Detection. 1-6 - Nan Chen, Yonghe Wang, Xiangdong Su, Feilong Bao:

Efficient Speech-to-Text Translation: Progressive Pruning for Accelerated Speech Pre-trained Model. 1-6 - Lei Wang, Quan Zhang, Junyang Qiu, Jianhuang Lai:

Rotation Exploration Transformer for Aerial Person Re-identification. 1-6 - Xuan Dang, Guolong Wang, Xun Wu, Zheng Qin:

Improving Image Reconstruction and Synthesis by Balancing the Optimization from Frequency Perspective. 1-6 - Hengda Li, Yinglin Zheng, Qifeng Dai, Jintai Wang, Liang Song, Ming Zeng:

Multi-Modal Gait Recognition with Unidirectional Cross-modal Alignment. 1-6 - Yingxuan Li

, Kiyoharu Aizawa, Yusuke Matsui:
Manga109Dialog: A Large-Scale Dialogue Dataset for Comics Speaker Detection. 1-6 - Huan Zhao, Yi Ju, Yingxue Gao:

Bilevel Relational Graph Representation Learning-based Multimodal Emotion Recognition in Conversation. 1-6 - Kangwei Liu, Xiaowei Yi, Xianfeng Zhao:

ProDub: Progressive Growing of Facial Dubbing Networks for Enhanced Lip Sync and Fidelity. 1-6 - Yulun Wu, Weixing Wei, Dichucheng Li, Mengbo Li, Yi Yu, Yongwei Gao, Wei Li:

Harmonic Frequency-Separable Transformer for Instrument-Agnostic Music Transcription. 1-6 - Xinxin Zhang, Xiankai Lu, Jizhou Li

, Yongshun Gong, Qiangchang Wang, Yilong Yin:
Two-phase Parametric Registration for Retinal Images. 1-6 - Pengcheng Lei, Zaoming Yan, Tingting Wang, Faming Fang, Guixu Zhang:

Three-Stage Temporal Deformable Network for Blurry Video Frame Interpolation. 1-6 - Kaifen Cai, Kaiyu Song, Yan Pan, Hanjiang Lai

:
MALIP: Improving Few-Shot Image Classification with Multimodal Fusion Enhancement. 1-6 - Zhenghao Ke, Sheng Liu, Chengyuan Ke, Yuan Feng, Shengyong Chen:

Cross-Modality Consistency Mining For Continuous Sign Language Recognition with Text-Domain Equivalents. 1-6 - Zhixuan Shen, Haonan Luo, Sijia Li, Tianrui Li:

Adversarial Training with OCR modality Perturbation for Scene-Text Visual Question Answering. 1-6 - Jinkang Ji, Junao Shen, Xinyu Wang, Tian Feng, Sensen Wu:

WirePAuS: Auxiliary-free Single-shot Wireframe Parsing. 1-6 - Hao Wu, Ruochong Li, Hao Wang, Hui Xiong:

COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval. 1-6 - Xiao Liu, Guan Yuan, Rui Bing, Zhuo Cai, Shengshen Fu, Yonghao Yu:

When Skeleton Meets Motion: Adaptive Multimodal Graph Representation Fusion for Action Recognition. 1-6 - Guodong Li, Letu Qingge, Qingyi Pan, Pei Yang:

Edge-Guided Mural Image Inpainting by Integrating Local and Global Information and Multiple Color Spaces. 1-6 - Jialing Zou, Jiahao Mei, Xudong Nan, Jinghua Li, Daoguo Dong, Liang He:

TEAdapter: Supply Vivid Guidance for Controllable Text-to-Music Generation. 1-6 - Yongheng Zhang, Yuanqiang Cai, Danfeng Yan:

Restoring Real-World Images Affected by Varied Degradations Using a Semi-Supervised Domain Adaptation Network. 1-6 - Yizhu Wen

, Yiwei Wang, Kai Yi, Jing Ke, Yiqing Shen:
Diffimpute: Tabular Data Imputation with Denoising Diffusion Probabilistic Model. 1-6 - Dahe Peng, Rongrong Shen, Zhixin Li:

Robust VQA via Internal and External Interaction of Modal Information and Question Transformation. 1-6 - Zhangfeng Hu, Wenming Zheng, Yuan Zong, Mengting Wei, Xingxun Jiang, Mengxin Shi:

A Novel Decoupled Prototype Completion Network for Incomplete Multimodal Emotion Recognition. 1-6 - Ping Xu

, Jiangqun Ni, Jian Zhang
, Yulin Zhang, Shiyuan Tang:
Diff-IFL: Towards General Image Forgery Localization using Diffusion Probabilistic Model. 1-6 - Han Fang, Xianghao Zang, Chao Ban, Zerun Feng, Lanxiang Zhou, Zhongjiang He, Yongxiang Li, Hao Sun:

ProTA: Probabilistic Token Aggregation for Text-Video Retrieval. 1-6 - Mingchen Xu, Jing Wu, Yu-Kun Lai, Ze Ji:

Fusion of Short-term and Long-term Attention for Video Mirror Detection. 1-9 - Xiao Liang, Zijian Zhao, Weichao Zeng, Yutong He, Fupeng He, Yiyi Wang, Chengying Gao:

PianoBART: Symbolic Piano Music Generation and Understanding with Large-Scale Pre-Training. 1-6 - Ye-Wen Wang, Chen-Chen Zong, Ming-Kun Xie, Sheng-Jun Huang:

Dirichlet-Based Coarse-to-Fine Example Selection For Open-Set Annotation. 1-6 - Long Huang

, Zhiwei Dong, Song-Lu Chen, Ruiyao Zhang, Shutong Ti, Feng Chen, Xu-Cheng Yin:
HQOD: Harmonious Quantization for Object Detection. 1-6 - Fan Tian, Peichi Zhou, Chen Li, Changbo Wang:

Shadow Constrained DEM Refinement Based on Differentiable Rendering. 1-6 - Cheng Shang, Jidong Tian, Jiannan Ye, Xubo Yang:

Free-view Rendering of Dynamic Human from Monocular Video Via Modeling Temporal Information Globally and Locally among Adjacent Frames. 1-6 - Simiao Lai, Dong Wang, Huchuan Lu:

DepthRefiner: Adapting RGB Trackers to RGBD Scenes via Depth-Fused Refinement. 1-6 - Shuo Zhang

, Xiongpeng Hu, Jing Liu
:
Causal Fusion of Convolutional Neural Network and Vision Transformer for Image Anomaly Detection and Localization. 1-6 - Yueming Zhu, Qing Xu, Kai Zhen, Runlin Zhang, Shunbo Wang:

Quantitative Analysis of Eye-Tracking Data Based on Information-Theoretic Tools for Measuring Driver Drowsiness. 1-6 - Chen He, Shenshen Li, Zheng Wang, Fumin Shen, Yang Yang, Xing Xu:

Diverse Embedding Modeling with Adaptive Noise Filter for Text-based Person Retrieval. 1-6 - Zhengyang Li, Shanshan Huang, Jiawei Liu

, Laiming Jiang, Shen Chen, Yi Zhang, Jun Liao, Shu Wang, Li Liu:
Recognizing Cognitive Load by a Multi-instance Causal Learning Model from Multi-channel Physiological Data. 1-6 - Kun Hu, Zizhuo Wang, Zixuan Hu, Heng Gao, Xingjun Wang:

Stega-Matting: Irregular Matting Protection via Steganography. 1-6 - Zhenrong Cheng, Jiayan Guo, Hao Sun, Yan Zhang:

Boosting Disfluency Detection with Large Language Model as Disfluency Generator. 1-6 - Chen Liang, Zhiqian Dong, Sheng Yang, Peng Zhou

:
Jointly Learn the Base Clustering and Ensemble for Deep Image Clustering. 1-6 - Jinyi Wang, Fei Ben, Huangjie Zheng, Jiangchao Yao, Ya Zhang, Yanfeng Wang:

MVTexGen: Synthesising 3D Textures Using Multi-View Diffusion. 1-6 - Xiyao Liu, Fengkai Dong, Xin Liao, Yuhan Guo

, Jianbiao He, Jian Zhang, Gerald Schaefer, Hui Fang:
Multi-Strategy Adversarial Learning for Robust Face Forgery Detection Under Heterogeneous and Composite Attacks. 1-6 - Shaoyao Huang, Luozheng Qin, Ziqiang Cao

, Qian Qiao:
STRA: A Simple Token Replacement Strategy Alleviating Exposure Bias in Text Generation. 1-6 - Pan Mu, Binjia Zhou, Qirui Wang, Zhiying Du, Xiaoyan Wang:

BFMEF: Brightness-Free Multi-exposure Image Fusion via Adaptive Correction. 1-6 - Hanglin Li, Peng Yin, Xiaosu Zhu, Lianli Gao, Jingkuan Song:

BFD: Binarized Frequency-enhanced Distillation for Vision Transformer. 1-6 - Jianshe Duan, Yachao Zhang, Yanyun Qu:

Source-Free Domain Adaptation for Point Cloud Semantic Segmentation. 1-6 - Quoc-Huy Trinh, Minh-Van Nguyen, Phuoc-Thao Vo Thi:

KDAS: Knowledge Distillation via Attention Supervision Framework for Polyp Segmentation. 1-6 - Yunfei Yang

, Xiaojun Chen, Yuexin Xuan, Zhendong Zhao:
DualCOS: Query-Efficient Data-Free Model Stealing with Dual Clone Networks and Optimal Samples. 1-6 - Junkai Li, Huicheng Lai, Jun Ma, Tongguan Wang, Hutuo Quan, Dongji Chen:

Efficient Guided Query Network for Human-Object Interaction Detection. 1-6 - Dongyang Gao

, Chen Chen, Yichao Zhou, Haotian Zhang, Xiyuan Hu
:
TS-SAM: Two Small Steps for SAM, One Giant Leap for Abnormal detections. 1-6 - Zhenping Li, Si Wu, Xindian Wei

, Qianfen Jiao, Cheng Liu, Rui Li
:
Reference-conditional Makeup-aware Discrimination for Face Image Beautification. 1-6 - Zhenyu Yu

, Pei Wang
:
CaPAN: Class-aware Prototypical Adversarial Networks for Unsupervised Domain Adaptation. 1-6 - Yiran Liu

, Zhanjie Wu
, Mengjingcheng Mo, Ji Gan, Jiaxu Leng, Xinbo Gao:
Dual Space Embedding Learning For Weakly Supervised Audio-Visual Violence Detection. 1-6 - Yifei Pu

, Chi Wang, Xiaofeng Hou, Cheng Xu, Jiacheng Liu, Jing Wang, Minyi Guo, Chao Li:
M2SN: Adaptive and Dynamic Multi-modal Shortcut Network Architecture for Latency-Aware Applications. 1-6 - Yuanwu Xu, Mohan Chen, Yuejie Zhang, Rui Feng, Tao Zhang, Shang Gao:

Memory-Augmented Transformer for Efficient End-to-End Video Grounding. 1-6 - Yan Feng, Tian Jiang, Yunqi Liu, Zijian Huang, Xiaohui Cui:

Multimodal Semantic Fusion for Zero-Shot Learning. 1-6 - Xin Zhang, Teodor Boyadzhiev, Jinglei Shi, Jufeng Yang:

ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation. 1-6 - Haifan Gong, Wenhao Huang, Huan Zhang, Yu Wang, Xiang Wan, Hong Shen, Guanbin Li, Haofeng Li:

Intensity Confusion Matters: An Intensity-Distance Guided Loss For Bronchus Segmentation. 1-6 - Soumyya Kanti Datta

, Shan Jia, Siwei Lyu:
Exposing Lip-syncing Deepfakes from Mouth Inconsistencies. 1-6 - Ila Gokarn, Yigong Hu, Tarek F. Abdelzaher, Archan Misra:

JIGSAW: Edge-based Streaming Perception over Spatially Overlapped Multi-Camera Deployments. 1-6 - Yijia Zhang, Lingran Zhao, Shijie Cao, Sicheng Zhang, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. 1-6 - Dongyue Li, Songlin Du:

Beyond Global Cues: Unveiling the Power of Fine Details in Image Matching. 1-6 - Yin Tang, Guang Yang, Xili Wan:

SDViT: Towards Efficient Visual Foundation Model via Unifying Sparse and Dense Representation Learning. 1-6 - Jinhe Long, Zekai Chen, Fuyi Wang, Jianping Cai

, Ximeng Liu:
FedCL: Detecting Backdoor Attacks in Federated Learning with Confidence Levels. 1-6 - Bowen Qu, Haohui Li, Wei Gao:

Bringing Textual Prompt to AI-Generated Image Quality Assessment. 1-6 - Zhaofei Wang, Weijia Zhang, Min-Ling Zhang:

Proposal Feature Learning Using Proposal Relations for Weakly Supervised Object Detection. 1-6 - Fengshuo Zhang:

Multi-Hop Distillation for Efficient Cross-Layer Knowledge Transfer. 1-7 - Shengyang Sun, Xiaojin Gong:

Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection. 1-6 - Qianyun Gong, Kunheng Jiang, Jingjing Wen, Xinjing Yuan, Jianxin Shi, Lingjun Pu:

TailClip: Mitigating Tail Latency in Cloud Gaming via Smart Video Frame Generation. 1-6 - Mufan Liu, Le Yang

, Yiling Xu, Ye-Kui Wang, Jenq-Neng Hwang:
EVAN: Evolutional Video Streaming Adaptation via Neural Representation. 1-6 - Duo Liu, Linglan Zhao, Zhongqiang Zhang, Fuhan Cai, Xiangzhong Fang:

Distillation Excluding Positives for Few-Shot Class-Incremental Learning. 1-6 - Tienyi Hsieh, Qijun Zhao, Fan Pan, Pubu Danzeng, Dingguo Gao, Dorji Gesang:

Text and Edge Guided Thangka Image Inpainting with Diffusion Model. 1-10 - Xinyu Liu, Yong Yi, Ye Luo

:
A Cascade Multimodal Fine-Grained MRI Image Grading Network For Preoperative Microvascular Invasion In Hepatocellular Carcinoma. 1-6 - Enqi Liu, Liyuan Pan:

A Lightweight Multi-Level Relation Network for Few-shot Action Recognition. 1-6 - Jiangbin Zheng, Stan Z. Li:

Progressive Multi-Modality Learning for Inverse Protein Folding. 1-6 - Zheng Cui, Yongli Hu, Jiapu Wang, Junbin Gao, Yanfeng Sun, Baocai Yin:

Common-Memory Bridged Cross-Modal Adaptive Graph Embedding for Image-Text Retrieval. 1-6 - Hengyu Zhang

, Hang Lv, Yanchao Tan, Guofang Ma
, Fan Wang, Carl Yang:
ExpertODE: Continuous Diagnosis Prediction with Expert Enhanced Neural Ordinary Differential Equations. 1-6 - Zhuangzi Li, Shan Liu, Ge Li:

PointELM: Fast Point Cloud Classification Using Deep Random Mapping Based Extreme Learning Machines. 1-6 - Lan Yan, Kenli Li:

Unknown Instance Learning for Person Search. 1-6 - Zhijian Wu, Dingjiang Huang:

Ultralight-weight Binary Neural Network with 1K Parameters for Image Super-Resolution. 1-6 - Diwen Wan

, Jiaxiang Tang, Jingbo Wang, Xiaokang Chen, Lingyun Gan, Gang Zeng:
Open-set Hierarchical Semantic Segmentation for 3D Scene. 1-6 - Ziyi Huang, Binbin Yan, Shuo Chen, Dongliang Wang, Lu Yang:

Focal Stack Alignment Enhancement Network For Light Field Salient Object Detection. 1-6 - Zihan Ma, Huan Liu, Zhi Zeng

, Hao Guo, Xiang Zhao, Minnan Luo:
Learning Multimodal Attention Mixed with Frequency Domain Information as Detector for Fake News Detection. 1-6 - Ryandhimas E. Zezario

, Yu-Wen Chen, Szu-Wei Fu, Yu Tsao, Hsin-Min Wang, Chiou-Shann Fuh:
A Study On Incorporating Whisper For Robust Speech Assessment. 1-6 - Xuan Long, Meiqin Liu, Qi Tang

, Chao Yao, Jian Jin, Yao Zhao:
Noisy-Residual Continuous Diffusion Models for Real Image Denoising. 1-6 - Chunyi Li, Haoning Wu, Zicheng Zhang, Hongkun Hao, Kaiwei Zhang, Lei Bai, Xiaohong Liu

, Xiongkuo Min, Weisi Lin, Guangtao Zhai:
Q-Refine: A Perceptual Quality Refiner for AI-Generated Image. 1-6 - Yuxuan Jiang, Guobin Zhu, Yi Ding, Zhen Qin, Minghui Pang:

Gradient Saliency-aware CutMix for Semi-Supervised Medical Image Segmentation. 1-6 - Thanh Hai Phung, Hung-Jen Chen, Hong-Han Shuai:

Hierarchically Aggregated Identification Transformer Network for Camouflaged Object Detection. 1-6 - Peiqi Xia, Yao Lu, Sijia Zhang, Shunzhou Wang, Ziqi Wang, Wang Xia:

Revisiting Large Kernel Convolution for Light Field Image Angular Super-Resolution. 1-6 - Lifeng Zhou, Yuke Li:

Coarse-to-fine Alignment Makes Better Speech-image Retrieval. 1-6 - Yuan Yao, Yuanhan Zhang, Zhenfei Yin, Jiebo Luo

, Wanli Ouyang, Xiaoshui Huang
:
3D Point Cloud Pre-Training with Knowledge Distilled from 2D Images. 1-6 - Da Ai, Kai Jia, Yunqiao Wang, Ying Liu:

NIR-VIS Image Translation for the Cross-Spectral and Cross-Distance Face Recognition. 1-6 - Ruixue Qi, Chen Pang, Mengyang Zhang, Lei Lyu:

EGLA-Net: Edge Guided with Lesion Aware Network for Medical image segmentation. 1-6 - Li Jin, Xibin Song, Jia Li, Changhe Tu, Xueying Qin:

CSS-Net: Domain Generalization in Category-level Pose Estimation via Corresponding Structural Superpoints. 1-6 - Ke Ning, Rongrong Shen, Zhixin Li:

Robust Knowledge Distillation and Self-Contrast Reasoning for Debiased Visual Question Answering. 1-6 - Lirong Xue, Kang-Yang Huang, Rong Chao, Jhih-Ciang Wu

, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng:
Learning Efficient Interaction Anchor for HOI Detection. 1-6 - Chenyang Li, Xing Wei, Huazheng Zhao:

MultiQ: Multi-model Joint Learning via Synthetic Data for Data-Free Quantization. 1-6 - Jiaye Zhang, Zili Meng, Mingwei Xu:

Beimin: Serverless-based Adaptive Real-Time Video Processing. 1-6 - Yu Lu, Yizhou Jin, Yuyu Chen, Gang Zhou, Zhenghui Hu, Qingjie Liu, Di Huang, Yunhong Wang:

Fast Textile Pilling Classification Based on a Lightweight Network and 3D Point Clouds. 1-6 - Haoyu Wang

, Zilong Yin, Hangling Sun, Xin Guo:
Enhancing Vital Sign Monitoring with Reinforcement Learning and Wavelet Analysis in Sleep Disorders. 1-6 - Xiufeng Liu, Zhongqiu Zhao, Chen Ding:

Style-ACAE: Adversarial Capsule Autoencoder with Styles. 1-6 - Weitian Zhang, Sijing Wu, Yichao Yan, Ben Xue, Wenhan Zhu, Xiaokang Yang:

HQ-Avatar: Towards High-Quality 3D Avatar Generation via Point-based Representation. 1-6 - Lihong Qiao, Rui Wang, Yucheng Shu, Ximing Xu, Baobin Li, Weisheng Li, Xinbo Gao:

Re3adapter: Efficient Parameter Fing-Tuning with Triple Reparameterization for Adapter without Inference Latency. 1-6 - Pengxiang Ouyang, Jianan Chen, Qing Ma, Zheng Wang, Cong Bai:

Distinguishing Visually Similar Images: Triplet Contrastive Learning Framework for Image-text Retrieval. 1-6 - Yifan Zhang, Meiqin Liu, Chenming Xu, Qi Tang

, Chao Yao, Yao Zhao:
TLVC: Temporal Bit-rate Allocation for Learned Video Compression. 1-6 - Yang Xu

, Yifan Feng
, Yu Jiang:
Structure-aware Residual-center Representation for Self-Supervised Open-set 3D Cross-modal Retrieval. 1-6 - Jiachen Luo, Huy Phan, Lin Wang, Joshua D. Reiss:

Enhanced Speech Emotion Recognition Incorporating Speaker-Sensitive Interactions in Conversations. 1-6 - Songpei Xu, Xuri Ge, Chaitanya Kaul, Roderick Murray-Smith:

HpEIS: Learning Hand Pose Embeddings for Multimedia Interactive Systems. 1-6 - Xin Chen, Bin Wang, Yongsheng Gao:

Multiscale Binary-Pattern Dependency: A Novel Co-Occurrence Texture Descriptor for Fine-Grained Leaf Image Retrieval. 1-6 - Will Kerr, Crescent Jicol

, Tom S. F. Haines, Wenbin Li:
Camera Chameleon - The Creative Impact of Tracked Tangible Interfaces for Virtual Film Pre-Production. 1-6 - Yixiao Li, Xiaoyuan Yang, Jun Fu, Guanghui Yue, Wei Zhou:

Deep Bi-directional Attention Network for Image Super-Resolution Quality Assessment. 1-6 - Hongzhang Mu, Shuili Zhang, Quangang Li, Tingwen Liu

, Hongbo Xu:
Dynamic Multi-Modal Representation Learning For Topic Modeling. 1-6 - Zihao He, Shengchuan Zhang:

ESR-DDLN : Enhanced Single Image Super-Resolution Via Dual-Domain Learning Network. 1-6 - Junyuan Guo, Teng Wang, Chao Wang:

Mixed 3D Gaussian for Dynamic Scenes Representation and Rendering. 1-6 - Jiacheng Wang

, Ping Liu, Wei Xu:
Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance. 1-6 - Davide Berghi, Craig Cieciura

, Farshad Einabadi, Maxine Glancy, Oliver C. Camilleri
, Philip Foster, Asmar Nadeem
, Faegheh Sardari, Jinzheng Zhao, Marco Volino, Armin Mustafa
, Philip J. B. Jackson, Adrian Hilton
:
ForecasterFlexOBM: A Multi-View Audio-Visual Dataset for Flexible Object-Based Media Production. 1-6 - Yingying Zhu, Dafeng Li, Zhihang Liu, Hong Zhou:

ClipComb: Global-Local Composition Network based on CLIP for Composed Image Retrieval. 1-6 - Jiaxin Qiu, Guoyu Yang, Jie Lei, Zunlei Feng, Ronghua Liang:

Visual-guided Query with Temporal Interaction for Video Object Segementation. 1-6 - Yiheng Duan, Yunjie Ge, Zixuan Wang, Jiayi Yu, Shenyi Zhang, Libing Wu:

Enhancing the Transferability of Adversarial Examples with Noise Injection Augmentation. 1-6 - Yang Yao, Xin Wang, Yijian Qin, Ziwei Zhang, Wenwu Zhu, Hong Mei:

Customized Cross-device Neural Architecture Search with Images. 1-6 - Yaori Zhang, Shujin Lin, Fan Zhou, Ruomei Wang:

Hierarchical Attention Feature Fusion and Refinement Network for Point Cloud Upsampling. 1-8 - Chen Cai, Runzhong Zhang, Jianjun Gao, Kejun Wu, Kim-Hui Yap, Yi Wang:

Temporal Sentence Grounding with Temporally Global Textual Knowledge. 1-6 - Jiayu Li

, Xuechao Zou, Shiying Wang, Ben Chen, Junliang Xing, Pin Tao:
A Parallel Attention Network For Cattle Face Recognition. 1-6 - Minghao Han, Xukun Zhang, Dingkang Yang, Tao Liu, Haopeng Kuang, Jinghui Feng, Lihua Zhang

:
Multi-Scale Heterogeneity-Aware Hypergraph Representation for Histopathology Whole Slide Images. 1-6 - Yabin Zhang, Xu Chen:

Enhancing Sequential Recommendation Modeling Via Adversarial Training. 1-6 - Yuteng Wang, Xing Wu, Zhongshi He, Peng Wang, Haidong Wang, Hongqian Wang:

US-SAM: An Automatic Prompt Sam For Ultrasound Image. 1-6 - Jiabo Ye, Junfeng Tian, Xiaoshan Yang, Zhenru Zhang, Anwen Hu, Ming Yan, Ji Zhang, Liang He, Xin Lin:

VG-Annotator: Vision-Language Models as Query Annotators for Unsupervised Visual Grounding. 1-6 - Kailai Feng, Minheng Ni, Jiaxiu Jiang, Zhilu Zhang, Wangmeng Zuo:

Multi-Attentional Distance for Zero-Shot Classification with Text-to-Image Diffusion Model. 1-6 - Kaiyue Tian, Chen Chen, Yichao Zhou, Xiyuan Hu

:
Illumination Enlightened Spatial-temporal Inconsistency for Deepfake Video Detection. 1-6 - Hantao Zhou, Runze Hu, Xiu Li:

Video Object Segmentation with Dynamic Query Modulation. 1-6 - Ziyu Gong

, Chengcheng Mai, Yihua Huang:
AsCL: An Asymmetry-sensitive Contrastive Learning Method for Image-Text Retrieval with Cross-Modal Fusion. 1-6 - Zixuan Hu, Kun Hu, Zizhuo Wang, Ranran Pan, Xingjun Wang:

OWR: Optimizing Watermark Robustness for Screen Recording. 1-6 - Zhiyuan Zhu, Zhiyuan Ning, Hui Cui

, Junao Shen, Jiaheng Wang, Xinyu Wang, Tian Feng:
MuMoSNet: 3D MRI-based Brain Tumor Segmentation via Multi-modal and Multi-scale Feature Fusion. 1-6 - Kaiyu Jin, Chenwang Wu, Defu Lian:

Out-of-Distribution Generalization via Style and Spuriousness Eliminating. 1-6 - Mingdong Yu, Xiaofeng Jin, Guirong Wang, Bo Wang

, Jiaqi Chen:
SPformer: Hybrid Sequential-Parallel Architectures for Automatic Speech Recognition. 1-5 - Jiahao Nie, Shan Lin, Alex C. Kot:

Color Space Learning for Cross-Color Person Re-Identification. 1-6 - Xuan Hai, Xin Liu, Zhaorun Chen, Yuan Tan, Song Li, Weina Niu, Gang Liu, Rui Zhou, Qingguo Zhou:

Ghost-in-Wave: How Speaker-Irrelative Features Interfere DeepFake Voice Detectors. 1-6 - Changjuan Ran, Yeting Guo, Fang Liu, Shenglan Cui, Yunfan Ye:

FedStyle: Style-Based Federated Learning Crowdsourcing Framework for Art Commissions. 1-6 - Yanchao Tan, Zhenghong Lin, Sujie Pan, Siying Xu, Weiming Liu, Guofang Ma

, Shiping Wang:
Heterogeneous Hypergraph Structure Learning for Multimedia Recommendation. 1-6 - Liqing Zhu, Xun Jiang, Fumin Shen, Guoqing Wang, Yang Yang, Xing Xu:

Temporal Self-Paced Proposal Learning for Weakly-Supervised Video Moment Retrieval and Highlight Detection. 1-6 - Wei Han, Zhili Qin, Junming Shao:

Interpretable Function Embedding and Module in Convolutional Neural Networks. 1-6 - Zhicheng Cai, Qiu Shen:

Encoding Semantic Priors into the Weights of Implicit Neural Representation. 1-6 - Yiru Wang, Qianqian Li, Xinyue Wang, Qiao Yang, Shunli Zhang:

Unveiling the Significance of Width Dimension in Bird's-Eye View Segmentation. 1-6 - R. Gnana Praveen

, Jahangir Alam:
Cross-Attention is not always needed: Dynamic Cross-Attention for Audio-Visual Dimensional Emotion Recognition. 1-6 - Songshi Dou, Xianhao Chen, Kwan L. Yeung:

Enabling Practical and Pervasive Content Delivery from Emerging LEO Mega-Constellations. 1-6 - Bingxin Li, Ying Li, Shihui Ying:

Cross-Evaluation and Re-weighting for Multi-Source-Free Domain Adaptation. 1-6 - Yujiao Jiang, Qingmin Liao, Zhaolong Wang, Xiangru Lin, Zongqing Lu, Yuxi Zhao, Hanqing Wei, Jingrui Ye, Yu Zhang, Zhijing Shao:

SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations. 1-6 - Zipeng Guo, Yuchen Zhou, Chao Gou:

DrivingGen: Efficient Safety-Critical Driving Video Generation with Latent Diffusion Models. 1-6 - Md Adnan Faisal Hossain, Zhihao Duan, Fengqing Zhu:

Flexible Mixed Precision Quantization for Learne Image Compression. 1-8 - Chao Long, Mengning Yang, Kai Li, Zhifu Deng, Kunyuan Jian, Simin Wang:

LPTCGAN: Laplace Pyramid three-layer cyclic high definition image enhancement network. 1-6 - Yiqun Wang, Zhao Zhou, Xiangcheng Du, Xingjiao Wu, Yingbin Zheng, Cheng Jin:

Fine-Grained Scene Image Classification with Modality-Agnostic Adapter. 1-6 - Xinxin Jiao, Liejun Wang, Yinfeng Yu:

MFHCA: Enhancing Speech Emotion Recognition Via Multi-Spatial Fusion and Hierarchical Cooperative Attention. 1-5 - Yunqi Zhao, Yuchen Guo, Zheng Cao, Kai Ni, Ruqi Huang, Lu Fang:

DynamicTrack: Advancing Gigapixel Tracking in Crowded Scenes. 1-6 - Xiaoyuan Guan, Zhiyong Gan, Ling Deng, Wei Shi, Jiankang Chen, Shenshen Bu, Chunliang Zhao, Jianfang Hu, Yuren Zhou, Wei-Shi Zheng, Ruixuan Wang:

Out-of-Distribution Detection by Principal Component Correspondence. 1-6 - Tianjiao Du, Jun Chen, Jiasheng Lu, Qinmei Xu, Huan Liao, Yupeng Chen, Zhiyong Wu:

Controllable Text-to-Audio Generation with Training-Free Temporal Guidance Diffusion. 1-6 - Zichuan Liu, Ke Wang, Mingyuan Wu, Lantao Yu, Klara Nahrstedt, Xin Lu:

I-Matting: Improved Trimap-Free Image Matting. 1-6 - Chenyang Bu, Yunpeng Hong, Shiji Zang, Guojie Chang, Xindong Wu:

Automatic Fusion for Multimodal Entity Alignment: A New Perspective from Automatic Architecture Search. 1-6 - Fei Wang, Jianqiang Sheng, Kai Jiang, Zhineng Zhang, Juepeng Zheng, Baoquan Zhao:

Single Free-Hand Sketch Guided Free-Form Deformation For 3D Shape Generation. 1-6 - Ugochukwu Ejike Akpudo

, Yongsheng Gao, Jun Zhou, Andrew Lewis:
Coherentice: Invertible Concept-Based Explainability Framework for CNNs beyond Fidelity. 1-6 - Haoyu Huang

, Linxuan He
, Faqiang Liu, Rong Zhao, Luping Shi:
Neural Dynamics Pruning for Energy-Efficient Spiking Neural Networks. 1-6 - Jianjun Sun

, Yan Zhao, Xinbo Li, Shigang Wang, Jian Wei, Shibo Wang:
Fractional Order Spectrum in SAR Image Registration. 1-6 - Yiwen Tu, Wen Tan, Youneng Bao, Genhong Wang, Fanyang Meng, Yongsheng Liang:

Enhanced Interpretability in Learned Image Compression via Convolutional Sparse Coding. 1-6 - Zhimin Weng, Jinpu Zhang, Yuehuan Wang:

Joint Language Prompt and Object Tracking. 1-6 - Xingzhe Su, Daixi Jia, Fengge Wu, Junsuo Zhao, Changwen Zheng, Wenwen Qiang:

Unbiased Image Synthesis via Manifold Guidance in Diffusion Models. 1-6 - Wen-Li Wei, Jen-Chun Lin

:
Multi-Candidate Motion Modeling for 3D Human Pose and Shape Estimation from Monocular Video. 1-6 - Clement Bled

, François Pitié
:
Lightweight Video Denoising Using a Classic Bayesian Backbone. 1-6 - Sumei Li, Xiaofei He, Hangwei Liang:

Top-Down Guidance Based ViT-CNN Network Considering Theme Information for Image Aesthetic Assessment. 1-6 - Xudong Zhou, Tianxiang Chen:

FREQFORMER: Efficient Polyp Segmentation via Wavelet Transform. 1-6 - Yi Fan, Yu-Bin Yang:

Training-free Neural Architecture Search on Hybrid Convolution-attention Networks. 1-6 - Bowen Zhao, Licheng Zhang, Lei Zhang, Zhendong Mao:

Neighborhood-Adaptive Context Enhancement Learning For Scene Graph Generation. 1-6 - Shilv Cai, Xiaoguo Liang

, Shuning Cao, Luxin Yan, Sheng Zhong, Liqun Chen, Xu Zou:
Powerful Lossy Compression for Noisy Images. 1-6 - Pochun Chen, Nan Zhang, Guoqing Liu, Ge Li:

MFITrack: Multi-Frame Integration Strategy for Enhanced Motion-Centric Single Object Tracking. 1-6 - Yijia Guo, Yuanxi Bai, Liwen Hu, Mianzhi Liu, Ziyi Guo, Lei Ma, Tiejun Huang:

Spike-NeRF: Neural Radiance Field Based On Spike Camera. 1-6 - Hongzhao Li

, Hongyu Wang, Xia Sun, Hua He, Jun Feng:
Prompt-Guided Generation of Structured Chest X-Ray Report Using a Pre-trained LLM. 1-6 - Jicheng Yang, Qing Zhang, Yilin Zhao, Yuetong Li, Zeming Liu:

Bi-directional Boundary-object interaction and refinement network for Camouflaged Object Detection. 1-6 - Shenghao Chen, Zhe Liu, Jun Chen, Yuqing Song, Yi Liu, Qiaoying Teng:

Tutor Assisted Feature Distillation. 1-6 - Fengqiang Wan, Xiangyu Wu, Zhihao Guan, Yang Yang:

CoVLR: Coordinating Cross-Modal Consistency and Intra-Modal Relations for Vision-Language Retrieval. 1-6 - Xiaolong Xiong, Jinhan Cui, Jiaxiong Liu, Shuzhan Guo, Jun Zhou:

Inverse Optimization for Multi-View Multiple Clustering. 1-6 - Qipeng Zhu, Jie Chen, Junping Zhang, Jian Pu:

G-MIMO: Empowering GNNs with Diverse Sub-Networks for Graph Classification. 1-6 - Fuyang Yu, Runze Tian, Zhen Wang, Xiaochuan Wang, Xiaohui Liang

:
CUS3D: Clip-Based Unsupervised 3D Segmentation via Object-Level Denoise. 1-6 - Yaxiong Chen, Xueping Zhang, Yunfei Zi, Shengwu Xiong:

Adaptive Learning via a Negative Selection Strategy for Few-Shot Bioacoustic Event Detection. 1-6 - Jiefeng Lin, Chenlin Fu, Qiang Huang, Yingying Zhu:

Contextual Interaction Enhancement Network for Smoke Detection. 1-6 - Jie Zhang, Hao Xiong, Hecang Zang, Meng Zhou, Dong Liu, Zhonghua Liu, Hualei Shen:

AuxSegCount: Auxiliary Seg-Attention Based Network for Wheat Ears Counting in Field Conditions. 1-6 - Liang Wen, Lizhong Wang, Yuxing Zheng, Weijing Shi, Kwang Pyo Choi

:
FT-CSR: Cascaded Frequency-Time Method for Coded Speech Restoration. 1-6 - Chuang Ding, Yang Wu, Huihui Song, Kaihua Zhang, Xu Zhang, Zhenhua Guo:

Language-Guided Semantic Alignment for Co-saliency Detection. 1-6 - Yewei Gu, Xianfeng Zhao, Xiaowei Yi:

RLVC: Robust and Lightweight Voice Conversion Using Cross-Adaptive Instance Normalization. 1-6 - Chenhao Shuai, Rizhao Cai, Bandara Dissanayake, Amanda Newman, Dayan Guan, Dennis Sng, Ling Li, Alex C. Kot:

Controllable and Gradual Facial Blemishes Retouching Via Physics-Based Modelling. 1-6 - Weimin Wang, Yingxu Deng, Zezeng Li

, Yu Liu, Na Lei:
MergeNet: Explicit Mesh Reconstruction from Sparse Point Clouds via Edge Prediction. 1-6 - Wei Wang, Zhi Jin:

CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement. 1-6 - Kanghui Wu, Dongyan Guo:

Semantic Bridging and Feature Anchoring for Class Incremental Learning. 1-6 - Haixu Song, Fangfu Liu, Chenyu Zhang, Yueqi Duan:

ToW3D: Consistency-aware Interactive Point-based Mesh Editing on GANs. 1-6 - Sheng Chen, Fei Yang, Aimin Pan, Zhewei Mei:

Wi-Fi based Gait Recognition using Spectrogram and Phase. 1-6 - Jihao Dong, Hua Yang, Renjie Pan:

Exploring Interactive Semantic Alignment for Efficient HOI Detection with Vision-language Model. 1-6 - Suwei Zhang, Tai Ma, Ying Wen:

RC-Block: Refinement Coefficient for Rectifying Deformation Field. 1-6 - Haixiang Zhu, Jing Ye, Jianbing Tang, Yiping Song:

DiffuStra: A Diffusion Model for Dialog Strategy in Non-Collaborative Dialog Systems. 1-6 - Nesryne Mejri, Pavel Chernakov, Polina Kuleshova, Enjie Ghorbel, Djamila Aouada:

Facial Region-Based Ensembling for Unsupervised Temporal Deepfake Localization. 1-6 - Rukai Wei, Heng Cui, Yu Liu, Yanzhao Xie, Yufeng Hou, Ke Zhou:

Contrastive masked auto-encoders based self-supervised hashing for 2D image and 3D point cloud cross-modal retrieval. 1-6 - Jianing Han, Jiangrong Shen, Qi Xu, Jian K. Liu

, Huajin Tang:
The Balanced Multi-Modal Spiking Neural Networks with Online Loss Adjustment and Time Alignment. 1-6 - Yuwen Yang, Yuxiang Lu, Suizhi Huang, Shalayiding Sirejiding, Chang Liu, Muyang Yi, Zhaozhi Xie, Yue Ding, Hongtao Lu:

BARTENDER: A simple baseline model for task-level heterogeneous federated learning. 1-6 - Huan Li, Xinpeng Huang, Ping An:

Low Bitrate Light Field Video Compression with Two-step Refinement Reconstruction. 1-6 - Fanxiao Li, Ping Wei, Tingchao Fu, Yu Lin, Wei Zhou:

Imperceptible Text Steganography based on Group Chat. 1-6 - Xiaoke Zhu, Danyang Li, Xiaopan Chen, Fumin Qi, Fan Zhang, Xiao-Yuan Jing:

Similarity Mining via Implicit Matching Pattern Learning for Kinship Verification. 1-6 - Xiao Liang, Siyuan Duan, Lijie Zheng, Yuqian Zeng:

Unsupervised Monte Carlo Denoising via Learning Contrastive Disentanglement Representation. 1-6 - Ruihang Li, Shanding Ye, Zhe Yin, Tao Li, Zehua Zhang, Kaikai Xiao, Zhijie Pan:

M2Depth: A Novel Self-Supervised Multi-Camera Depth Estimation with Multi-Level Supervision. 1-6 - Daowan Peng, Wei Wei:

Overcoming Language Priors for Visual Question Answering Based on Knowledge Distillation. 1-6 - Jiaxu Leng, Zhanjie Wu

, Mengjingcheng Mo, Mingpi Tan, Shuang Li, Xinbo Gao:
Modality-Free Violence Detection via Cross-Modal Causal Attention and Feature Distillation. 1-6 - Laiming Jiang, Jiawei Liu

, Shu Wang, Jun Liao, Qingsong Li, Zhengyang Li, Shen Chen, Li Liu:
Multi-channel Spatio-Temporal Causal Representation Model for Cognitive Load Assessment in Physiological Signals. 1-6 - Bin Kang, Bin Chen, Junjie Wang, Weizhi Xian

, Huifeng Chang:
Multi-Attribute Consistency Driven Visual Language Framework for Surface Defect Detection. 1-5 - Rui Deng, Yuke Li:

DeCMG: Denoise with Cross-modality Guidance Makes Better Text-Video Retrieval. 1-6 - Qianyu Li, Xiaoli Tang, Siyao Zhou, Han Yu, Hengjie Song, Lizhen Cui, Xiaoxiao Li:

FedRMS: Privacy-Preserving Federated Knowledge Graph Embedding Through Randomization. 1-6 - Yuan Gao, Zilei Wang, Yixin Zhang:

Delve into Source and Target Collaboration in Semi-supervised Domain Adaptation for Semantic Segmentation. 1-6 - Kaiyue Zhou

, Ming Dong, Peiyuan Zhi, Shengjin Wang:
Cascaded Network with Hierarchical Self-Distillation for Sparse Point Cloud Classification. 1-6 - Keli Wen, Nan Zhang, Ge Li, Wei Gao:

MPVNN: Multi-resolution Point-Voxel Non-parametric Network for 3D Point Cloud Processing. 1-6 - Tianlong Zhang, Zhe Xue, Yuchen Dong, Junping Du, Meiyu Liang:

A Multi-View Double Alignment Hashing Network with Weighted Contrastive Learning. 1-6 - Li Keyao, Kai Liu, Min Peng, Bo Zhao, Li Jiangyuanhong, Jiahui Zhu:

MACFAN: A multi-channel fusion network for subjective aesthetic attributes with automated comments labeling pipeline. 1-6 - Chengji Wang, Zhiming Luo, Shaozi Li:

Omni-Granularity Embedding Network for Text-to-Image Person Retrieval. 1-6 - Zeyun Zhao, Rong Wang, Jianzhe Gao, Zhiming Luo, Shaozi Li:

Mask Matching Network for Self-supervised Few-shot Medical Image Segmentation. 1-6 - Chaoxiang He

, Yimiao Zeng, Xiaojing Ma, Bin Benjamin Zhu, Zewei Li, Shixin Li, Hai Jin:
MysticMask: Adversarial Mask for Impersonation Attack Against Face Recognition Systems. 1-6 - Han Cao

, Lingwei Wei, Wei Zhou, Songlin Hu:
Multi-source Knowledge Enhanced Graph Attention Networks for Multimodal Fact Verification. 1-6 - Tong Zhang, Wenxue Cui, Shaohui Liu, Feng Jiang:

SC-HVPPNet: Spatial and Channel Hybrid-Attention Video Post-Processing Network with CNN and Transformer. 1-6 - Chuanming Tang, Kai Wang, Joost van de Weijer:

IterInv: Iterative Inversion for Pixel-Level T2I Models. 1-6 - Weijie Li, Luwei Xiao, Xingjiao Wu, Tianlong Ma, Jiabao Zhao, Liang He:

Artistry in Pixels: FVS - A Framework for Evaluating Visual Elegance and Sentiment Resonance in Generated Images. 1-6 - Guoxuan Mao, Ting Cao, Ziyang Li, Yuan Dong:

Enhancing Shape Perception and Segmentation Consistency for Industrial Image Inspection. 1-6 - Shu Wang, Zhe Qu, Yuan Liu

, Shichao Kan, Yixiong Liang, Jianxin Wang:
FedMMR: Multi-Modal Federated Learning via Missing Modality Reconstruction. 1-6 - Xinyu Feng, Cong Li, Qingni Shen, Jisheng Dong, Wenjun Qian, Yuejian Fang, Zhonghai Wu:

HyPRE: Hybrid Proxy Re-Encryption for Secure Multimedia Data Sharing on Mobile Devices. 1-6 - Xiangru Lin, Shenghua Zhong, Yan Liu, Gong Chen:

Sal-Guide Diffusion: Saliency Maps Guide Emotional Image Generation through Adapter. 1-6 - Yuwu Lu, Chunzhi Liu:

Pseudolabel Distillation with Adversarial Contrastive Learning for Semisupervised Domain Adaptation. 1-6 - Zhenzhe Gao

, Zhenjun Tang, Zhaoxia Yin, Baoyuan Wu, Yue Lu:
Fragile Model Watermark for integrity protection: leveraging boundary volatility and sensitive sample-pairing. 1-6 - Bingfei Fu, Xiangyang Xue:

Unsupervised Object Discovery Via Object-Centric Representation. 1-6 - Sisi You, Bing-Kun Bao:

Dynamic Scene Graph Generation with Unified Temporal Modeling. 1-6 - Shuang Cheng, Zhanyu Ma, Jian Ye:

A Benchmark of Zero-Shot Cross-Lingual Task-Oriented Dialogue Based on Adversarial Contrastive Representation Learning. 1-6 - Zhongzhu Yang, Liang Luo, Yu Gu, Fuji Ren:

K-Face Net: A Two-Stage Framework for Balanced Feature Space in Facial Expression Recognition. 1-6 - Jiaqi Guo, Sitong Su, Junchen Zhu, Lianli Gao, Jingkuan Song:

Training-Free Semantic Video Composition via Pre-trained Diffusion Model. 1-6 - Qiqin Lin, Weixing Xie, Rongzhou Zhou, Xianpeng Cao, Jingze Chen, Junfeng Yao, Qingqi Hong:

DPP-Net: Difficulty Perception-Processing Heterogeneous Network for Semi-supervised Medical Image Segmentation. 1-6 - Zheng Zhou, Zongxin Liu, Yongyong Chen, Bingzhi Chen, Biqing Zeng, Yicong Zhou:

Deep Unfolding 3D Non-Local Transformer Network for Hyperspectral Snapshot Compressive Imaging. 1-6 - Zhenhu Zhang, Li Jin, Dan Song, Jiahua Dong, Ruofeng Tong:

FedDGP: Disentangling Global and Personal Models for Federated Learning. 1-6 - Ling Li, Junliang Xing, Xinchun Yu, Xiao-Ping Zhang:

Deviation Wing Loss for High-Performance 2D Pose Estimation. 1-6 - Jintao Tan, Xize Cheng, Lingyu Xiong, Lei Zhu, Xiandong Li, Xianjia Wu, Kai Gong, Minglei Li, Yi Cai:

Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation. 1-6 - Ning Xu, Jingqiu Li, Lanjun Wang, Anan Liu:

Rumor Detection Framework Based on Multi-source Knowledge Adaptation. 1-6 - Mingzhe Yu, Lei Wu, Changshuo Wang, Lei Meng, Xiangxu Meng:

LayoutDM: Precision Multi-Scale Diffusion for Layout-to-Image. 1-6 - Yuxin Tian

, Mouxing Yang, Yunfan Li, Dayiheng Liu, Xingzhang Ren, Xi Peng, Jiancheng Lv:
An Empirical Study of Parameter Efficient Fine-tuning on Vision-Language Pre-train Model. 1-6 - Yan Jiang, Guisheng Yin, Ye Yuan

, Jingjing Chen, Zhipeng Wei:
Cross-Point Adversarial Attack Based on Feature Neighborhood Disruption Against Segment Anything Model. 1-6 - Zhenyu Li, Congju Du, Huijuan Zhao, Li Yu:

Offset-based Disentangled Representation for Efficient Human Pose Estimation. 1-6 - Zhihang Zhu, Yunfeng Yan, Yi Chen, Haoyuan Jin, Xuesong Nie, Donglian Qi, Xi Chen:

SAMP: Adapting Segment Anything Model for Pose Estimation. 1-7 - Zhongqiang Zhang, Fuhan Cai, Duo Liu, Ge Liu, Xiangzhong Fang:

Mix background and foreground separately: Transformer-based Augmentation Strategies for Domain Generalization. 1-6 - Yanchao Liang

, Xiangqian Wu:
Do Keypoints Contain Crucial Information? Mining Keypoint Information to Enhance Cross-View Geo-Localization. 1-6 - Jiayang Gu, Xovee Xu

, Yulu Tian, Yurun Hu, Jiadong Huang, Wenliang Zhong, Fan Zhou, Lianli Gao:
RRE: A Relevance Relation Extraction Framework for Cross-domain Recommender System at Alipay. 1-6 - Hongfei Xue

, Qijie Shao, Kaixun Huang, Peikun Chen, Jie Liu, Lei Xie:
SSHR: Leveraging Self-supervised Hierarchical Representations for Multilingual Automatic Speech Recognition. 1-6 - Zhenrong Huang, Bin Chen:

Unsupervised Multi-Modal Medical Image Registration via query-selected attention and decoupled Contrastive Learning. 1-6 - Shuai Yu

, Xiaoliang He, Yanting Zhang:
RevNet: A Review Network with Group Aggregation Fusion for Singing Melody Extraction. 1-6 - Zhen Liang, Enyu Che, Guoqiang Xiao, Jingwei Qu:

Multi-granularity Correlation Refinement for Semantic Correspondence. 1-6 - Xiao Liang, Tao Shi, Yaoyuan Liang, Te Tao, Shao-Luo Huang:

Exploring Iterative Refinement with Diffusion Models for Video Grounding. 1-6 - Junqing Huang

, Xiaochen Yuan, Chan-Tong Lam, Wei Ke
:
MSFGNet: Multi-Scale Features Gathering Network for Change Detection of Remote Sensing Images. 1-6 - Gang Wu, Junjun Jiang, Kui Jiang, Xianming Liu:

Exploiting Self-Supervised Constraints in image Super-Resolution. 1-6 - Gang Liu, Jing Jia, Rui Mao, Yan Ji:

FedCA: Federated learning based on classification layer alignment. 1-6 - Hongyan Xu, Xiu Su, Arcot Sowmya, Ian Katz, Dadong Wang:

SCD-NAS: Towards Zero-Cost Training in Melanoma Diagnosis. 1-6 - Chuanfeng Yang, Kaiheng Li, Jiahui Chen

, Qingqi Hong:
FFnsr: Fast and Fine Neural Surface Reconstruction. 1-6 - Yuxuan Chen, Chengbo Wang, Xiuying Wang:

CMSCL: Cross-Modal Spatial Contrastive Learning for 3D Medical Image Classification. 1-6 - Ya Jiang, Qing Wang, Jun Du, Maocheng Hu, Pengfei Hu, Zeyan Liu, Shi Cheng, Zhaoxu Nian, Yuxuan Dong, Mingqi Cai, Xin Fang, Chin-Hui Lee:

Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios. 1-6 - Chun Wang

:
Data Standardization for Robust Lip Sync. 1-6 - Yi Fan, Yu-Bin Yang:

Training-free Neural Architectural Search on Transformer via Evaluating Expressivity and Trainability. 1-6 - Chuang Liu, Haogang Zhu, Xiu Su:

DomainVoyager: Embracing The Unknown Domain by Prompting for Automatic Augmentation. 1-7 - Dongming Zhou, Zhengbin Pang:

Heuristic Action-aware and Priority Communication for Multi-agent Path Finding. 1-6 - Songping Wang, Hanqing Liu, Haochen Zhao:

Public-Domain Locator for Boosting Attack Transferability on Videos. 1-6 - Tao He, Leqi Shen, Guiguang Ding, Zhiheng Zhou, Tianshi Xu, Xiaofeng Jin, Yuheng Huang:

Camera Bias Regularization for Person Re-identification. 1-6 - Chenyue Liang, Jiabei Zeng, Mingjie He, Dongmei Jiang, Shiguang Shan:

Facial Action Unit Detection with the Semantic Prompt. 1-6 - Tingyu Li, Junpeng Bao, Jiaqi Qin, Yuping Liang, Ruijiang Zhang, Jason Wang:

Multi-modal Intent Detection with LVAMoE: the Language-Visual-Audio Mixture of Experts. 1-6 - Ziqi Wang, Yao Lu, Shunzhou Wang, Wang Xia, Peiqi Xia, Wenjing Wang:

Trident Transformer for Light Field Image Super-Resolution. 1-6 - Jisheng Bai, Han Yin, Mou Wang, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen, Susanto Rahardja

:
Audiolog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning. 1-6 - Jiehang Xie, Xuanbai Chen, Shao-Ping Lu:

An Aesthetic-Guided Multimodal Framework for Video Summarization. 1-6 - Junyang Qiu, Zhanxiang Feng, Lei Wang, Jianhuang Lai:

Salient Part-Aligned and Keypoint Disentangling Transformer for Person Re-Identification in Aerial Imagery. 1-6 - Yuchen Li, Fan Wan, Yang Long:

SID-NERF: Few-Shot Nerf Based on Scene Information Distribution. 1-6

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














