


default search action
13th CVM 2025: Hong Kong SAR, China - Part III
- Piotr Didyk
, Junhui Hou
:
Computational Visual Media - 13th International Conference, CVM 2025, Hong Kong SAR, China, April 19-21, 2025, Proceedings, Part III. Lecture Notes in Computer Science 15665, Springer 2025, ISBN 978-981-96-5814-5
Image and Video Analysis
- Wenbin Wu, Zhiwei Zhang, Xin Tan, Zhizhong Zhang, Lizhuang Ma:
DepthFisheye: Efficient Fine-Tuning of Depth Estimation Models for Fisheye Cameras. 3-18 - Shu Liu, Melikamu Liyih Sinishaw, Luo Zheng:
DIMATrack: Dimension Aware Data Association for Multi-Object Tracking. 19-36 - Qinghua Song, Xiaolei Wang:
Efficient Transformer Network for Visible and Ultraviolet Object Tracking. 37-51 - Mingming Li, Fei Wu
, Yinjie Wang
:
LightGR-Transformer: Light Grouped Residual Transformer for Multispectral Object Detection. 52-71 - Ruizhong Du
, Luman Zhao
, Mingyue Li
, Yidan Li, Shenyu Li, Caixia Ma
:
ADMMOA: Attribute-Driven Multimodal Optimization for Face Recognition Adversarial Attacks. 72-88 - Wei Ge, Yongwei Nie, Fei Ma, Keke Tang, Fei Richard Yu, Hongmin Cai, Ping Li:
Training-Free Language-Guided Video Summarization via Multi-Grained Saliency Scoring. 89-104
Multimodal Learning
- Yongbiao Gao
, Xiangcheng Sun
, Guohua Lv
, Deng Yu
, Sijiu Niu
:
Reinforced Label Denoising for Weakly-Supervised Audio-Visual Video Parsing. 107-124 - Jiangnan Xia, Zhiyuan Zhang
, Yanyin Guo, Qilong Wu, Yi Li, Jianghan Cheng, Junwei Li:
Bridging the Modality Gap: Advancing Multimodal Human Pose Estimation with Modality-Adaptive Pose Estimator and Novel Benchmark Datasets. 125-153 - Xiaole Zhu
, Zongtao Duan
, Junchen Huang, Xing Sheng
:
Momentum-Based Uni-modal Soft-Label Alignment and Multi-modal Latent Projection Networks for Optimizing Image-Text Retrieval. 154-176 - Hao Tong, Jiawei Liu, Yong Wu, Guozhi Zhao, Fanrui Zhang, Zheng-Jun Zha:
Multi-granularity and Multi-modal Prompt Learning for Person Re-Identification. 177-200 - Lu Xu, Shuaixin Li, Xin Zhou, Xiaozhou Zhu, Wen Yao:
Local and Global Feature Cross-Attention Multimodal Place Recognition. 201-220 - Zheng Zhang
, RuiQing Yang
, Chuanlei Zhang:
IML-CMM - A Multimodal Sentiment Analysis Framework Integrating Intra-modal Learning and Cross-Modal Mixup Enhancement. 221-243
Geometrical Processing
- Benchao Li
, Yun Zou
, Ruisheng Ran
:
MCFG with GUMAP: A Simple and Effective Clustering Framework on Grassmann Manifold. 247-265 - Yun Zou
, Benchao Li
, Ruisheng Ran
:
Joint UMAP for Visualization of Time-Dependent Data. 266-288 - Hongchao Zhong
, Li Yu
, Longkun Zou
, Ke Chen
:
Unsupervised Domain Adaptation on Point Cloud Classification via Imposing Structural Manifolds into Representation Space. 289-307
Applications
- Keyang Lin, Zhijun Fang
, Sicong Zang
, Hang Wu:
Learning Adaptive Basis Fonts to Fuse Content Features for Few-Shot Font Generation. 311-332 - Xiaoyu Guan, Yihao Li
, Tianyu Huang
:
TaiCrowd: A High-Performance Simulation Framework for Massive Crowd. 333-350 - Chengrong Yang, Qiwen Jin, Xiaoguo Zhang, Yujue Zhou:
Feature Disentanglement and Fusion Model for Multi-source Domain Adaptation with Domain-Specific Features. 351-372 - Kailang Hu, Yixiao Lu
, Huibing Li, Xuan Song
:
A Trademark Retrieval Method Based on Self-supervised Learning. 373-398 - Junjiang Liu
, Dandan Sun
, Hailun Xia
, Jiangtao Bai, Xinyue Fan:
Weaken Noisy Feature: Boosting Semi-supervised Learning by Noise Estimation. 399-418 - Weiye Peng, Shenghua Zhong
:
Multi-dimension Full Scene Integrated Visual Emotion Analysis Network. 419-434 - Shan Huang, Wenhua Qian:
Gap-KD: Bridging the Significant Capacity Gap Between Teacher and Student Model. 435-453

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.