


default search action
IEEE Transactions on Multimedia, Volume 27
Volume 27, 2025
- Yuan Yuan
, Hongjie He
, Yaolin Yang
, Hadi Amirpour
, Christian Timmerer
, Fan Chen
:
JPEG Image Encryption With DC Rotation and Undivided RSV-Based AC Group Permutation. 1-15 - Dizhan Xue
, Shengsheng Qian
, Quan Fang
, Changsheng Xu
:
LININ: Logic Integrated Neural Inference Network for Explanatory Visual Question Answering. 16-27 - Pingping Zhang, Shiqi Wang
, Meng Wang
, Peilin Chen
, Wenhui Wu
, Xu Wang
, Sam Kwong
:
HNR-ISC: Hybrid Neural Representation for Image Set Compression. 28-40 - Qingxin Sheng, Chong Fu
, Zhaonan Lin, Junxin Chen
, Xingwei Wang
, Chiu-Wing Sham
:
Content-Aware Tunable Selective Encryption for HEVC Using Sine-Modular Chaotification Model. 41-55 - Qiguang Miao, Wentian Xin
, Ruyi Liu, Yi Liu, Mengyao Wu, Cheng Shi
, Chi-Man Pun
:
Adaptive Pitfall: Exploring the Effectiveness of Adaptation in Skeleton-Based Action Recognition. 56-71 - Shizhou Zhang
, Dexuan Kong
, Yinghui Xing
, Yue Lu
, Lingyan Ran
, Guoqiang Liang
, Hexu Wang, Yanning Zhang
:
Frequency-Guided Spatial Adaptation for Camouflaged Object Detection. 72-83 - Yu Wang
, Shengjie Zhao
, Shiwei Chen
:
SQL-Net: Semantic Query Learning for Point-Supervised Temporal Action Localization. 84-94 - Kefan Tang
, Lihuo He
, Nannan Wang
, Xinbo Gao
:
Dual Semantic Reconstruction Network for Weakly Supervised Temporal Sentence Grounding. 95-107 - Yiting Liu
, Liang Li
, Yunbin Tu
, Beichen Zhang
, Zheng-Jun Zha
, Qingming Huang
:
Dynamic Strategy Prompt Reasoning for Emotional Support Conversation. 108-119 - Yunlong Tang, Yuxuan Wan, Lei Qi
, Xin Geng
:
DPStyler: Dynamic PromptStyler for Source-Free Domain Generalization. 120-132 - Zhenyu Shu
, Shiyang Li
, Shiqing Xin
, Ligang Liu
:
3D Shape Segmentation With Potential Consistency Mining and Enhancement. 133-144 - Min Dang
, Gang Liu
, Hao Li
, Di Wang
, Rong Pan
, Quan Wang
:
PRA-Det: Anchor-Free Oriented Object Detection With Polar Radius Representation. 145-157 - Yizhen Jia
, Rong Quan
, Haiyan Chen
, Jiamei Liu, Yichao Yan
, Song Bai
, Jie Qin
:
Disaggregation Distillation for Person Search. 158-170 - Shiqi Gao
, Huiyu Duan
, Xinyue Li
, Kang Fu
, Yicong Peng, Qihang Xu, Yuanyuan Chang, Jia Wang
, Xiongkuo Min
, Guangtao Zhai
:
Quality-Guided Skin Tone Enhancement for Portrait Photography. 171-185 - Yue Dai
, Shihui Ying
, Yue Gao
:
Exploring Local and Global Consistent Correlation on Hypergraph for Rotation Invariant Point Cloud Analysis. 186-197 - Hao Tan
, Zichang Tan
, Dunfang Weng
, Ajian Liu
, Jun Wan
, Zhen Lei
, Stan Z. Li
:
Vision Transformer With Relation Exploration for Pedestrian Attribute Recognition. 198-208 - Zhaofeng Shi
, Qingbo Wu
, Fanman Meng
, Linfeng Xu
, Hongliang Li
:
Cross-Modal Cognitive Consensus Guided Audio-Visual Segmentation. 209-223 - Ge Li
, Jiale Cao
, Hanqing Sun
, Rao Muhammad Anwer
, Jin Xie
, Fahad Khan
, Yanwei Pang
:
Video Instance Segmentation Without Using Mask and Identity Supervision. 224-235 - Guanghui Yue
, Shangjie Wu
, Tianwei Zhou
, Gang Li
, Jie Du
, Yu Luo
, Qiuping Jiang
:
Progressive Region-to-Boundary Exploration Network for Camouflaged Object Detection. 236-248 - Yumo Zhang
, Zhanchuan Cai
:
DNP-AUT: Image Compression Using Double-Layer Non-Uniform Partition and Adaptive U Transform. 249-262 - Sijia Wen
, Yinqiang Zheng
, Feng Lu
:
Polarization State Attention Dehazing Network With a Simulated Polar-Haze Dataset. 263-274 - Jiapeng Li
, Ruonan Zhang
, Ge Li
, Thomas H. Li:
SDE2D: Semantic-Guided Discriminability Enhancement Feature Detector and Descriptor. 275-286 - Xu Han
, Junyu Gao
, Chuang Yang
, Yuan Yuan, Qi Wang
:
Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection. 287-299 - Kai Hu
, Xiaobo Chen
, Zhineng Chen
, Yuan Zhang
, Xieping Gao
:
Multi-Perspective Pseudo-Label Generation and Confidence-Weighted Training for Semi-Supervised Semantic Segmentation. 300-311 - Xinru Guo
, Huaxiang Zhang
, Li Liu
, Dongmei Liu
, Xu Lu
, Hui Meng
:
Primary Code Guided Targeted Attack against Cross-modal Hashing Retrieval. 312-326 - Shichao Zhang
, Yibo Ding
, Tianxiang Huo
, Shukai Duan
, Lidan Wang
:
PointAttention: Rethinking Feature Representation and Propagation in Point Cloud. 327-339 - Mengzan Qi
, Sixian Chan
, Chen Hang
, Guixu Zhang
, Tieyong Zeng
, Zhi Li
:
Auxiliary Representation Guided Network for Visible-Infrared Person Re-Identification. 340-355 - Li Huang
, Yaping Huang
, Qingji Guan
:
Improving Image Inpainting via Adversarial Collaborative Training. 356-370 - Lin Jiang
, Jigang Wu
, Shuping Zhao
, Jiaxing Li
:
Cross-Scatter Sparse Dictionary Pair Learning for Cross-Domain Classification. 371-384 - Yusra Alkendi
, Rana Azzam
, Sajid Javed
, Lakmal D. Seneviratne
, Yahya H. Zweiri
:
Neuromorphic Vision-Based Motion Segmentation With Graph Transformer Neural Network. 385-400 - Guangzhao Dai
, Xiangbo Shu
, Wenhao Wu, Rui Yan
, Jiachao Zhang
:
GPT4Ego: Unleashing the Potential of Pre-Trained Models for Zero-Shot Egocentric Action Recognition. 401-413 - Nan Wang
, Shaohui Mei
, Yi Wang
, Yifan Zhang
, Duo Zhan
:
WHANet:Wavelet-Based Hybrid Asymmetric Network for Spectral Super-Resolution From RGB Inputs. 414-428 - Haojin Deng
, Yimin Yang
:
Context-Enriched Contrastive Loss: Enhancing Presentation of Inherent Sample Connections in Contrastive Learning Framework. 429-441 - Jingyi Xu, Xin Deng
, Yibing Fu, Mai Xu
, Shengxi Li
:
MDSC-Net: Multi-Modal Discriminative Sparse Coding Driven RGB-D Classification Network. 442-454 - Chen Guo
, Weiling Chen
, Aiping Huang
, Tiesong Zhao
:
Prototype Alignment With Dedicated Experts for Test-Agnostic Long-Tailed Recognition. 455-465 - Hefeng Wang
, Jiale Cao
, Jin Xie
, Aiping Yang, Yanwei Pang
:
Implicit and Explicit Language Guidance for Diffusion-Based Visual Perception. 466-476 - Meijing Zhang, Mengxue Chen, Qi Li
, Yanchen Chen, Rui Lin, Xiaolian Li, Shengfeng He
, Wenxi Liu
:
Category-Contrastive Fine-Grained Crowd Counting and Beyond. 477-488 - Kaiwei Zhang
, Dandan Zhu
, Xiongkuo Min
, Huiyu Duan
, Guangtao Zhai
:
Explain Vision Focus: Blending Human Saliency Into Synthetic Face Images. 489-502 - Shaowei Weng
, Jianhao Zhang, Tanguo Zhu, Lifang Yu
, Chunyu Zhang
:
DCM-Net: A Diffusion Model-Based Detection Network Integrating the Characteristics of Copy-Move Forgery. 503-514 - Meng Yang
, Jun Chen
, Xin Tian
, Longsheng Wei
, Jiayi Ma
:
VRTNet: Vector Rectifier Transformer for Two-View Correspondence Learning. 515-530 - Kai Ye, Zepeng Huang
, Yilei Xiong, Yu Gao, Jinheng Xie, Linlin Shen
:
Progressive Pseudo Labeling for Multi-Dataset Detection Over Unified Label Space. 531-543 - Yuxiu Lin
, Hui Liu
, Ren Wang, Qiang Guo
, Caiming Zhang
:
Multiview Feature Decoupling for Deep Subspace Clustering. 544-556 - Lili Huang
, Yiming Cao, Pengcheng Jia, Chenglong Li
, Jin Tang
, Chuanfu Li:
Knowledge-Guided Cross-Modal Alignment and Progressive Fusion for Chest X-Ray Report Generation. 557-567 - Min Liu
, Zhu Zhang
, Yuan Bian
, Xueping Wang
, Yeqing Sun
, Baida Zhang, Yaonan Wang
:
Cross-Modality Semantic Consistency Learning for Visible-Infrared Person Re-Identification. 568-580 - Ben Fei
, Liwen Liu
, Tianyue Luo
, Weidong Yang
, Lipeng Ma
, Zhijun Li
, Wenming Chen
:
Point Patches Contrastive Learning for Enhanced Point Cloud Completion. 581-596 - Shunjie Yuan
, Xinghua Li
, Yinbin Miao
, Haiyan Zhang
, Ximeng Liu
, Robert H. Deng
:
Combating Noisy Labels by Alleviating the Memorization of DNNs to Noisy Labels. 597-609 - Jiaping Yu, Muli Yang
, Aming Wu
, Cheng Deng
:
Memory-Enhanced Confidence Calibration for Class-Incremental Unsupervised Domain Adaptation. 610-621 - Yi Jin
, Xiaoxiao Ma
, Rui Zhang
, Huaian Chen
, Yuxuan Gu
, Pengyang Ling
, Enhong Chen
:
Masked Video Pretraining Advances Real-World Video Denoising. 622-636 - Kun Dai
, Zhiqiang Jiang
, Tao Xie
, Ke Wang
, Dedong Liu
, Zhendong Fan
, Ruifeng Li
, Lijun Zhao
, Mohamed Omar
:
SOFW: A Synergistic Optimization Framework for Indoor 3D Object Detection. 637-651 - Abdullah Aman Khan
, Jie Shao
, Yunbo Rao
, Lei She, Heng Tao Shen
:
LRDNet: Lightweight LiDAR Aided Cascaded Feature Pools for Free Road Space Detection. 652-664 - Shuhua Wang
, Ke Lv
, Jian Xue
, Yang Zhao
:
DA-Net: Density-Aware 3D Object Detection Network for Point Clouds. 665-678 - Congcong Wen
, Xiang Li
, Hao Huang, Yu-Shen Liu
, Yi Fang
:
3D Shape Contrastive Representation Learning With Adversarial Examples. 679-692 - Dong Liang, Dong Zhang, Qiong Wang, Zongqi Wei, Liyan Zhang:
CrossNet: Cross-Scene Background Subtraction Network via 3D Optical Flow. 693-706 - Zhanwen Liu
, Juanru Cheng
, Jin Fan
, Shan Lin
, Yang Wang
, Xiangmo Zhao
:
Multi-Modal Fusion Based on Depth Adaptive Mechanism for 3D Object Detection. 707-717 - Hui Tian
, Zheng Qin, Renjiao Yi, Chenyang Zhu, Kai Xu
:
Tensorformer: Normalized Matrix Attention Transformer for High-Quality Point Cloud Reconstruction. 718-730 - Mingtao Feng
, Haoran Hou
, Liang Zhang
, Yulan Guo
, Hongshan Yu
, Yaonan Wang
, Ajmal Mian
:
Exploring Hierarchical Spatial Layout Cues for 3D Point Cloud Based Scene Graph Prediction. 731-743 - Qiaoyun Wu
, Jun Wang
, Yi Zhang, Hua Dong, Cheng Yi
:
Accelerating Point Cloud Registration With Low Overlap Using Graphs and Sparse Convolutions. 744-753 - Qijian Zhang
, Junhui Hou
, Yue Qian:
PointMCD: Boosting Deep Point Cloud Encoders via Multi-View Cross-Modal Distillation for 3D Shape Recognition. 754-767 - Shuaihang Yuan
, Congcong Wen
, Yu-Shen Liu
, Yi Fang
:
Retrieval-Specific View Learning for Sketch-to-Shape Retrieval. 768-779 - Jing-Yu Yang
, Wenqiang Xu
, Yusen Hou, Xinchen Ye
, Pascal Frossard
, Kun Li
:
High-Quality Reconstruction of Depth Maps From Graph-Based Non-Uniform Sampling. 780-791 - Shaojie Zhuang
, Guangshun Wei
, Zhiming Cui, Yuanfeng Zhou
:
Robust Hybrid Learning for Automatic Teeth Segmentation and Labeling on 3D Dental Models. 792-803 - Jiawen Zhao
, Qing Zhu
, Yaonan Wang
, Weixing Peng
, Hui Zhang, Jianxu Mao
:
Registration of Multiview Point Clouds With Unknown Overlap. 804-819 - Jincen Jiang
, Xuequan Lu
, Lizhi Zhao
, Richard Dazeley
, Meili Wang
:
Masked Autoencoders in 3D Point Cloud Representation Learning. 820-831 - Xu Wang
, Yi Jin
, Yigang Cen
, Tao Wang
, Bowen Tang
, Yidong Li
:
LighTN: Light-Weight Transformer Network for Performance-Overhead Tradeoff in Point Cloud Downsampling. 832-847 - Shuangzhi Li
, Zhijie Wang
, Felix Juefei-Xu
, Qing Guo
, Xingyu Li
, Lei Ma
:
Common Corruption Robustness of Point Cloud Detectors: Benchmark and Enhancement. 848-859 - Shanshan Li
, Pan Gao
, Xiaoyang Tan
, Wei Xiang
:
RLGrid: Reinforcement Learning Controlled Grid Deformation for Coarse-to-Fine Point Cloud Completion. 860-874 - Xianglin Guo
, Yifan Wang
, Heng Liu
, Haoran Xie
, Gary Cheng
, Fu Lee Wang
:
Steerable Graph Neural Network on Point Clouds via Second-Order Random Walks. 875-888 - Junteng Zhang
, Jianqiang Wang
, Dandan Ding
, Zhan Ma
:
Scalable Point Cloud Attribute Compression. 889-899 - Wenting Cui
, Shaoyi Du
, Runzhao Yao
, Canhui Tang
, Aixue Ye
, Feng Wen
, Zhiqiang Tian
:
RDD: Learning Reinforced 3D Detectors and Descriptors Based on Policy Gradient. 900-913 - André F. R. Guarda
, Manuel Ruivo
, Luís Coelho
, Abdelrahman Seleem
, Nuno M. M. Rodrigues
, Fernando Pereira
:
Deep Learning-Based Point Cloud Coding and Super-Resolution: A Joint Geometry and Color Approach. 914-926 - Zicheng Zhang
, Wei Sun
, Yucheng Zhu
, Xiongkuo Min
, Wei Wu
, Ying Chen
, Guangtao Zhai
:
Evaluating Point Cloud From Moving Camera Videos: A No-Reference Metric. 927-939 - Lintai Wu
, Qijian Zhang
, Junhui Hou
, Yong Xu
:
Leveraging Single-View Images for Unsupervised 3D Point Cloud Completion. 940-953 - Xin Kang
, Chaoqun Wang
, Xuejin Chen
:
Region-Enhanced Feature Learning for Scene Semantic Segmentation. 954-964 - Weiquan Liu
, Minghao Liu, Shijun Zheng
, Siqi Shen
, Xuesheng Bian
, Yu Zang, Ping Zhong
, Cheng Wang
:
Interpreting Hidden Semantics in the Intermediate Layers of 3D Point Cloud Classification Neural Network. 965-977 - Elena Camuffo
, Umberto Michieli
, Simone Milani
:
Learning From Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation. 978-989 - Xiantong Zhao
, Yinan Han
, Shengjing Tian
, Jian Liu
, Xiuping Liu
:
OST: Efficient One-Stream Network for 3D Single Object Tracking in Point Clouds. 990-1002 - Shuai Guo
, Lei Shi
, Xiaoheng Jiang
, Pei Lv
, Qidong Liu
, Yazhou Hu
, Rongrong Ji
, Mingliang Xu
:
An Efficient Ungrouped Mask Method With two Learnable Parameters for 3D Object Detection. 1003-1017 - Yuan Liang
, Zitian Zhang
, Chuhua Xian
, Shengfeng He
:
Delving Into Multi-Illumination Monocular Depth Estimation: A New Dataset and Method. 1018-1032 - Yanyang Xiao
, Tieyi Zhang
, Juan Cao
, Zhonggui Chen
:
Accelerated Lloyd's Method for Resampling 3D Point Clouds. 1033-1046 - Qing Guo
, Zhijie Wang
, Lubo Wang, Haotian Dong, Felix Juefei-Xu
, Di Lin
, Lei Ma
, Wei Feng
, Yang Liu
:
CarveNet: Carving Point-Block for Complex 3D Shape Completion. 1047-1058 - Jingtao Sun
, Yaonan Wang
, Mingtao Feng
, Xiaofeng Guo
, Huimin Lu
, Xieyuanli Chen
:
Category-Level Multi-Object 9D State Tracking Using Object-Centric Multi-Scale Transformer in Point Cloud Stream. 1072-1085 - Xingyu Gao
, Zhenyu Chen
, Jianze Wei
, Rubo Wang, Zhijun Zhao:
Deep Mutual Distillation for Unsupervised Domain Adaptation Person Re-Identification. 1059-1071 - Yuanpeng Zeng, Ru Zhang, Hao Zhang
, Shaojie Qiao
, Faliang Huang
, Qing Tian
, Yuzhong Peng
:
GCCNet: A Novel Network Leveraging Gated Cross-Correlation for Multi-View Classification. 1086-1099 - Liangchen Liu
, Nannan Wang
, Dawei Zhou
, Decheng Liu
, Xi Yang
, Xinbo Gao
, Tongliang Liu
:
Generalizable Prompt Learning via Gradient Constrained Sharpness-Aware Minimization. 1100-1113 - Liangwei Chen
, Xiren Zhou
, Qiuju Chen, Fang Xiong
, Huanhuan Chen
:
Investigating the Effective Dynamic Information of Spectral Shapes for Audio Classification. 1114-1126 - Abdullah Aman Khan
, Jie Shao
, Sidra Shafiq, Shuyuan Zhu
, Heng Tao Shen
:
Enhancing Few-Shot 3D Point Cloud Classification With Soft Interaction and Self-Attention. 1127-1141 - Guanglin Zhou
, Zhongyi Han
, Shiming Chen
, Biwei Huang, Liming Zhu
, Tongliang Liu
, Lina Yao
, Kun Zhang
:
HCVP: Leveraging Hierarchical Contrastive Visual Prompt for Domain Generalization. 1142-1152 - Cairong Zhao
, Rui Shu
, Shuyang Feng
, Liang Zhu
, Xuekuan Wang:
Scene Text Image Super-Resolution Via Semantic Distillation and Text Perceptual Loss. 1153-1164 - Yuhui Quan
, Xi Wan, Tianxiang Zheng, Yan Huang
, Hui Ji
:
Dual-Path Deep Unsupervised Learning for Multi-Focus Image Fusion. 1165-1176 - Zihan Gao
, Lingling Li
, Xu Liu
, Licheng Jiao
, Fang Liu
, Shuyuan Yang
:
Uncertainty Guided Progressive Few-Shot Learning Perception for Aerial View Synthesis. 1177-1192 - Lingtong Min
, Ziman Fan, Shunzhou Wang
, Feiyang Dou, Xin Li
, Binglu Wang
:
Adaptive Fusion Learning for Compositional Zero-Shot Recognition. 1193-1204 - Jian Yang
, Jun Li
, Yunong Cai, Guoming Wu
, Zhi-Ping Shi
, Chaodong Tan, Xianglong Liu
:
Hard-Sample Style Guided Patch Attack With RL-Enhanced Motion Pattern for Video Recognition. 1205-1215 - Gaosheng Liu
, Huanjing Yue
, Bihan Wen
, Jing-Yu Yang
:
Learned Focused Plenoptic Image Compression With Local-Global Correlation Learning. 1216-1227 - Jingyun Tian
, Jinjing Gu
, Yuanyuan Pu
, Zhengpeng Zhao
:
Leveraging Enriched Skeleton Representation With Multi-Relational Metrics for Few-Shot Action Recognition. 1228-1241 - Shaocan Liu
, Xingtao Wang
, Ruiqin Xiong
, Xiaopeng Fan
:
GCN-Based Multi-Modality Fusion Network for Action Recognition. 1242-1253 - Deng Xu
, Chao Zhang
, Zechao Li
, Chunlin Chen
, Huaxiong Li
:
Fast Disentangled Slim Tensor Learning for Multi-View Clustering. 1254-1265 - Tae-Young Kim, Jufeng Yang
, Eunil Park
:
MSDLF-K: A Multimodal Feature Learning Approach for Sentiment Analysis in Korean Incorporating Text and Speech. 1266-1276 - Lei Zhao
, Bo Li
, Jixiang Jiang, Xingxing Wei
:
Classification Committee for Active Deep Object Detection. 1277-1288 - Lingzhi Zhao
, Ying Cui
, Yuhang Jia, Yunfei Zhang
, Klara Nahrstedt
:
Enhancing Neural Adaptive Wireless Video Streaming via Cross-Layer Information Exposure and Online Tuning. 1289-1304 - Wenyang Liu
, Kejun Wu
, Tianyi Liu
, Yi Wang
, Kim-Hui Yap
, Lap-Pui Chau
:
ByteNet: Rethinking Multimedia File Fragment Classification Through Visual Perspectives. 1305-1319 - Weikang Wang
, Yuting Su, Jing Liu
, Wei Sun
, Guangtao Zhai
:
Weakly Supervised Referring Video Object Segmentation With Object-Centric Pseudo-Guidance. 1320-1333 - Zeke Zexi Hu
, Xiaoming Chen
, Vera Yuk Ying Chung
, Yiran Shen
:
Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-Resolution. 1334-1348 - Yu Jiang
, Yuehang Wang
, Siqi Li
, Yongji Zhang
, Qianren Guo
, Qi Chu
, Yue Gao
:
EvCSLR: Event-Guided Continuous Sign Language Recognition and Benchmark. 1349-1361 - Bingzheng Liu
, Jianjun Lei
, Bo Peng
, Zhe Zhang
, Jie Zhu
, Qingming Huang
:
Advancing Generalizable Occlusion Modeling for Neural Human Radiance Field. 1362-1373 - Rui Tian
, Zuxuan Wu
, Qi Dai
, Micah Goldblum, Han Hu
, Yu-Gang Jiang
:
The Role of ViT Design and Training in Robustness to Common Corruptions. 1374-1385 - Yalan Qin
, Nan Pu
, Hanzhou Wu
, Nicu Sebe
:
Discriminative Anchor Learning for Efficient Multi-View Clustering. 1386-1396 - Jingyao Wang
, Luntian Mou
, Changwen Zheng
, Wen Gao:
Image-Based Freeform Handwriting Authentication With Energy-Oriented Self-Supervised Learning. 1397-1409 - Dong Chen
, Kaihang Pan, Guangyu Dai, Guoming Wang
, Yueting Zhuang
, Siliang Tang
, Mingliang Xu
:
Improving Vision Anomaly Detection With the Guidance of Language Modality. 1410-1419 - Li Wang
, Yunzhou Zhang
, Fawei Ge
, Wenjing Bai
, Yifan Wang
:
Learning Local Features by Reinforcing Spatial Structure Information. 1420-1431 - Feiwei Qin
, Gaoyang Zhan, Meie Fang
, C. L. Philip Chen
, Ping Li
:
VGNet: Multimodal Feature Extraction and Fusion Network for 3D CAD Model Retrieval. 1432-1447 - Chuanming Wang
, Huiyuan Fu
, Peiye Liu, Huadong Ma
:
Part-Level Relationship Learning for Fine-Grained Few-Shot Image Classification. 1448-1460 - Jinpu Zhang
, Ziwen Li
, Ruonan Wei
, Yuehuan Wang
:
Augment One With Others: Generalizing to Unforeseen Variations for Visual Tracking. 1461-1474 - Chunlei Peng
, Boyu Wang, Decheng Liu
, Nannan Wang
, Ruimin Hu, Xinbo Gao
:
Masked Attribute Description Embedding for Cloth-Changing Person Re-Identification. 1475-1485 - Xingfeng Li
, Yuangang Pan
, Yuan Sun
, Quansen Sun
, Yinghui Sun
, Ivor W. Tsang
, Zhenwen Ren
:
Incomplete Multi-View Clustering With Paired and Balanced Dynamic Anchor Learning. 1486-1497 - Guanghui Wu
, Lili Chen, Zengping Chen
:
Uni-DPM: Unifying Self-Supervised Monocular Depth, Pose, and Object Motion Estimation With a Shared Representation. 1498-1511 - Yi Liu
, Qiuping Jiang
, Xinyi Wang, Ting Luo
, Jingchun Zhou
:
Underwater Image Enhancement With Cascaded Contrastive Learning. 1512-1525 - Md. Moniruzzaman
, Zhaozheng Yin
:
Progressive Knowledge Distillation From Different Levels of Teachers for Online Action Detection. 1526-1537 - Mingze Yao
, Huibing Wang
, Yawei Chen
, Xianping Fu
:
Between/Within View Information Completing for Tensorial Incomplete Multi-View Clustering. 1538-1550 - Dongqing Wu
, Huihui Li
, Cang Gu
, Lei Guo, Hang Liu
:
Dual Stream Relation Learning Network for Image-Text Retrieval. 1551-1565 - Hailong Ma
, Sibo Feng
, Xi Xiao
, Chenyu Dong, Xingyue Cheng:
Image Shooting Parameter-Guided Cascade Image Retouching Network: Think Like an Artist. 1566-1573 - Song Chang
, Youfang Lin
, Shuo Zhang
:
Structure-Aware Pre-Selected Neural Rendering for Light Field Reconstruction. 1574-1587 - Jianxin Shi
, Miao Zhang
, Linfeng Shen
, Jiangchuan Liu
, Lingjun Pu
, Jingdong Xu
:
Towards Neural Codec-Empowered 360$^\circ$ Video Streaming: A Saliency-Aided Synergistic Approach. 1588-1600 - Xiating Jin
, Jiajun Bu
, Zhi Yu
, Hui Zhang
, Yaonan Wang
:
Federated Hallucination Translation and Source-Free Regularization Adaptation in Decentralized Domain Adaptation for Foggy Scene Understanding. 1601-1616 - Mehwish Ghafoor, Arif Mahmood
, Muhammad Bilal
:
Enhancing 3D Human Pose Estimation Amidst Severe Occlusion With Dual Transformer Fusion. 1617-1624 - Wei Gao
, Jintian Feng
, Mengqi Wei
, Rui Zou
, Jianwen Sun
:
Towards a Multi-Granulated Statistical Framework for Human-Machine Collaboration in Image Classification. 1625-1636 - Shishun Tian
, Tiantian Zeng
, Zhengyu Zhang
, Wenbin Zou
, Xia Li
:
Dual Residual-Guided Interactive Learning for the Quality Assessment of Enhanced Images. 1637-1651 - Weida Chen
, Jie Jiang
, Linfei Wang
, Huafeng Li
, Yibing Zhan
, Dapeng Tao
:
Cps-STS: Bridging the Gap Between Content and Position for Coarse-Point-Supervised Scene Text Spotter. 1652-1664 - Zhongwei Shen
, Xiaojun Wu
, Hui Li
, Tianyang Xu
, Cong Wu
:
I Know How You Move: Explicit Motion Estimation for Human Action Recognition. 1665-1676 - Hai Liu
, Cheng Zhang
, Yongjian Deng
, Bochen Xie
, Tingting Liu
, Youfu Li
:
TransIFC: Invariant Cues-Aware Feature Concentration Learning for Efficient Fine-Grained Bird Image Classification. 1677-1690 - Quanquan Xiao
, Haiyan Jin
, Haonan Su
, Yuanlin Zhang, Zhaolin Xiao
, Bin Wang
:
SPDFusion:A Semantic Prior Knowledge-Driven Method for Infrared and Visible Image Fusion. 1691-1705 - Renjie Zhang
, Di Lin
, Xin Wang
, George Baciu
, C. L. Philip Chen
, Ping Li
:
Accurate-PGNet: Learning to Assemble Perceptual Body Parts for Accurate Human Skeleton Establishment. 1706-1721 - Ke Liang
, Lingyuan Meng
, Hao Li, Meng Liu
, Siwei Wang
, Sihang Zhou
, Xinwang Liu
, Kunlun He
:
MGKsite: Multi-Modal Knowledge-Driven Site Selection via Intra and Inter-Modal Graph Fusion. 1722-1735 - Haoran Li
, Yulan Guo
, Jiali You
, Xiaojian You, Zhenwen Ren
:
Graph Proxy Fusion: Consensus Graph Intermediated Multi-View Local Information Fusion Clustering. 1736-1747 - Mina Han
, Kailong Yu
, Weiran Li
, Qiannan Guo
, Zhenbo Li
:
Colliding Depths and Fusion: Leveraging Adaptive Feature Maps and Restorable Depth Recharge for Infrared and Visible Scene Fusion. 1748-1759 - Yijun Chen
, Xianwei Zheng
, Zhulun Yang
, Xutao Li
, Jiantao Zhou
, Yuanman Li
:
DuPMAM: An Efficient Dual Perception Framework Equipped With a Sharp Testing Strategy for Point Cloud Analysis. 1760-1771 - Guozhang Li
, Xinpeng Ding, De Cheng
, Jie Li
, Nannan Wang
, Xinbo Gao
:
ETC: Temporal Boundary Expand Then Clarify for Weakly Supervised Video Grounding With Multimodal Large Language Model. 1772-1782 - Yi Xiao
, Qiangqiang Yuan
, Kui Jiang
, Yuzeng Chen
, Qiang Zhang
, Chia-Wen Lin
:
Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution. 1783-1796 - Shuo Wang
, Xinyu Zhang, Meng Wang
, Xiangnan He
:
Symmetric Hallucination With Knowledge Transfer for Few-Shot Learning. 1797-1807 - Yu Luo
, Xuanrong Chen, Jie Ling
, Chao Huang
, Wei Zhou
, Guanghui Yue
:
Unsupervised Low-Light Image Enhancement With Self-Paced Learning. 1808-1820 - Xiaoyang Hao
, Han Li
, Jing Sun
, Lei Wang
, Jianping Fan
:
A Twist Representation and Shape Refinement Method for Human Mesh Recovery. 1821-1834 - Yidi Li
, Hong Liu
, Bing Yang
:
STNet: Deep Audio-Visual Fusion Network for Robust Speaker Tracking. 1835-1847 - Hua Yu
, Yaqing Hou
, Wenbin Pei
, Yew-Soon Ong
, Qiang Zhang
:
DivDiff: A Conditional Diffusion Model for Diverse Human Motion Prediction. 1848-1859 - Leiyu Xie
, Yuxing Yang, Zeyu Fu
, Syed Mohsen Naqvi
:
Position and Orientation Aware One-Shot Learning for Medical Action Recognition From Signal Data. 1860-1873 - Yanan Zhu
, Jiaqiu Ai
, Le Wu
, Dan Guo
, Wei Jia
, Richang Hong
:
An Active Multi-Target Domain Adaptation Strategy: Progressive Class Prototype Rectification. 1874-1886 - Jie Zhang
, Kangneng Zhou
, Yan Luximon
, Tong-Yee Lee
, Ping Li
:
3DCMM: 3D Comprehensive Morphable Models With UV-UNet for Accurate Head Creation. 1887-1900 - Sheng Zheng
, Chaoning Zhang
, Xinhong Hao
:
Black-Box Targeted Adversarial Attack on Segment Anything (SAM). 1901-1913 - Hao Feng
, Wendi Wang
, Shaokai Liu
, Jiajun Deng
, Wengang Zhou
, Houqiang Li
:
DeepEraser: Deep Iterative Context Mining for Generic Text Eraser. 1914-1925 - Bo Ding
, Libao Zhang
, Hongbo Sun, Yongjun He
, Jian Qin
:
Semantic-Enhanced ULIP for Zero-Shot 3D Shape Recognition. 1926-1936 - Xu Han
, Junyu Gao
, Chuang Yang
, Yuan Yuan, Qi Wang
:
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera. 1937-1949 - Jingcheng Ke
, Dele Wang
, Jun-Cheng Chen
, I-Hong Jhuo
, Chia-Wen Lin
, Yen-Yu Lin
:
Make Graph-Based Referring Expression Comprehension Great Again Through Expression-Guided Dynamic Gating and Regression. 1950-1961 - Zhiqiang Fu
, Yao Zhao
, Dongxia Chang
, Yiming Wang
, Jie Wen
:
Reordered $k$-Means: A New Baseline for View-Unaligned Multi-View Clustering. 1962-1972 - Huafeng Li
, Shedan Yang
, Yafei Zhang
, Dapeng Tao
, Zhengtao Yu
:
Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval. 1973-1987 - Zhengyi Liu
, Sheng Deng
, Xinrui Wang, Linbo Wang
, Xianyong Fang
, Bin Tang
:
SSFam: Scribble Supervised Salient Object Detection Family. 1988-2000 - Hao Luo
, Baoliang Chen
, Lingyu Zhu
, Peilin Chen
, Shiqi Wang
:
RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement. 2001-2014 - Xu Cheng
, Hao Yu
, Kevin Ho Man Cheng, Zitong Yu
, Guoying Zhao
:
MDANet: Modality-Aware Domain Alignment Network for Visible-Infrared Person Re-Identification. 2015-2027 - Ying Fu
, Xinyu Zhu
, Xiaojie Li
, Xin Wang
, Xi Wu, Shu Hu
, Yi Wu
, Siwei Lyu
, Wei Liu
:
VB-KGN: Variational Bayesian Kernel Generation Networks for Motion Image Deblurring. 2028-2042 - Yiting Lu
, Xin Li
, Jianzhao Liu, Zhibo Chen
:
StyleAM: Perception-Oriented Unsupervised Domain Adaption for No-Reference Image Quality Assessment. 2043-2058 - Wenhao Xu
, Changwei Wang
, Rongtao Xu
, Shibiao Xu
, Weiliang Meng
, Man Zhang
, Xiaopeng Zhang
:
Token Masking Transformer for Weakly Supervised Object Localization. 2059-2069 - Rongqun Lin, Wenhan Yang
, Baoliang Chen
, Pingping Zhang, Yue Liu
, Shiqi Wang
, Sam Kwong
:
HFGlobalFormer: When High-Frequency Recovery Meets Global Context Modeling for Compressed Image Deraindrop. 2070-2082 - Zhen Lan
, Zixing Li, Chao Yan
, Xiaojia Xiang
, Dengqing Tang
, Han Zhou, Jun Lai
:
Adaptive Knowledge Distillation With Attention-Based Multi-Modal Fusion for Robust Dim Object Detection. 2083-2096 - Kai Zhang
, Ludan Sun
, Jun Yan
, Wenbo Wan
, Jiande Sun
, Shuyuan Yang
, Huaxiang Zhang
:
Texture-Content Dual Guided Network for Visible and Infrared Image Fusion. 2097-2111 - Gang Hu
, Yafei Lv
, Jianting Zhang, Qian Wu
, Zaidao Wen
:
CLIP-Based Modality Compensation for Visible-Infrared Image Re-Identification. 2112-2126 - Bowen Shi
, Xiaopeng Zhang
, Yaoming Wang
, Wenrui Dai
, Junni Zou
, Hongkai Xiong
:
MENSA: Multi-Dataset Harmonized Pretraining for Semantic Segmentation. 2127-2140 - Shaowei Wang
, Lingling Zhang
, Wenjun Wu
, Tao Qin
, Xinyu Zhang
, Jun Liu
:
Alignment-Guided Self-Supervised Learning for Diagram Question Answering. 2141-2154 - Fan Nie
, Jiangqun Ni
, Jian Zhang, Bin Zhang, Weizhe Zhang
:
DIP: Diffusion Learning of Inconsistency Pattern for General DeepFake Detection. 2155-2167 - Xingjian He
, Sihan Chen, Fan Ma
, Zhicheng Huang, Xiaojie Jin
, Zikang Liu, Dongmei Fu
, Yi Yang, Jing Liu
, Jiashi Feng
:
VLAB: Enhancing Video Language Pretraining by Feature Adapting and Blending. 2168-2180

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.