


default search action
13th CVM 2025: Hong Kong SAR, China - Part III
- Piotr Didyk

, Junhui Hou
:
Computational Visual Media - 13th International Conference, CVM 2025, Hong Kong SAR, China, April 19-21, 2025, Proceedings, Part III. Lecture Notes in Computer Science 15665, Springer 2025, ISBN 978-981-96-5814-5
Image and Video Analysis
- Wenbin Wu, Zhiwei Zhang, Xin Tan, Zhizhong Zhang, Lizhuang Ma:

DepthFisheye: Efficient Fine-Tuning of Depth Estimation Models for Fisheye Cameras. 3-18 - Shu Liu, Melikamu Liyih Sinishaw, Luo Zheng:

DIMATrack: Dimension Aware Data Association for Multi-Object Tracking. 19-36 - Qinghua Song, Xiaolei Wang:

Efficient Transformer Network for Visible and Ultraviolet Object Tracking. 37-51 - Mingming Li, Fei Wu

, Yinjie Wang
:
LightGR-Transformer: Light Grouped Residual Transformer for Multispectral Object Detection. 52-71 - Ruizhong Du

, Luman Zhao
, Mingyue Li
, Yidan Li, Shenyu Li, Caixia Ma
:
ADMMOA: Attribute-Driven Multimodal Optimization for Face Recognition Adversarial Attacks. 72-88 - Wei Ge, Yongwei Nie, Fei Ma, Keke Tang, Fei Richard Yu, Hongmin Cai, Ping Li:

Training-Free Language-Guided Video Summarization via Multi-Grained Saliency Scoring. 89-104
Multimodal Learning
- Yongbiao Gao

, Xiangcheng Sun
, Guohua Lv
, Deng Yu
, Sijiu Niu
:
Reinforced Label Denoising for Weakly-Supervised Audio-Visual Video Parsing. 107-124 - Jiangnan Xia, Zhiyuan Zhang

, Yanyin Guo, Qilong Wu, Yi Li, Jianghan Cheng, Junwei Li:
Bridging the Modality Gap: Advancing Multimodal Human Pose Estimation with Modality-Adaptive Pose Estimator and Novel Benchmark Datasets. 125-153 - Xiaole Zhu

, Zongtao Duan
, Junchen Huang, Xing Sheng
:
Momentum-Based Uni-modal Soft-Label Alignment and Multi-modal Latent Projection Networks for Optimizing Image-Text Retrieval. 154-176 - Hao Tong, Jiawei Liu, Yong Wu, Guozhi Zhao, Fanrui Zhang, Zheng-Jun Zha:

Multi-granularity and Multi-modal Prompt Learning for Person Re-Identification. 177-200 - Lu Xu, Shuaixin Li, Xin Zhou, Xiaozhou Zhu, Wen Yao:

Local and Global Feature Cross-Attention Multimodal Place Recognition. 201-220 - Zheng Zhang

, RuiQing Yang
, Chuanlei Zhang:
IML-CMM - A Multimodal Sentiment Analysis Framework Integrating Intra-modal Learning and Cross-Modal Mixup Enhancement. 221-243
Geometrical Processing
- Benchao Li

, Yun Zou
, Ruisheng Ran
:
MCFG with GUMAP: A Simple and Effective Clustering Framework on Grassmann Manifold. 247-265 - Yun Zou

, Benchao Li
, Ruisheng Ran
:
Joint UMAP for Visualization of Time-Dependent Data. 266-288 - Hongchao Zhong

, Li Yu
, Longkun Zou
, Ke Chen
:
Unsupervised Domain Adaptation on Point Cloud Classification via Imposing Structural Manifolds into Representation Space. 289-307
Applications
- Keyang Lin, Zhijun Fang

, Sicong Zang
, Hang Wu:
Learning Adaptive Basis Fonts to Fuse Content Features for Few-Shot Font Generation. 311-332 - Xiaoyu Guan, Yihao Li

, Tianyu Huang
:
TaiCrowd: A High-Performance Simulation Framework for Massive Crowd. 333-350 - Chengrong Yang, Qiwen Jin, Xiaoguo Zhang, Yujue Zhou:

Feature Disentanglement and Fusion Model for Multi-source Domain Adaptation with Domain-Specific Features. 351-372 - Kailang Hu, Yixiao Lu

, Huibing Li, Xuan Song
:
A Trademark Retrieval Method Based on Self-supervised Learning. 373-398 - Junjiang Liu

, Dandan Sun
, Hailun Xia
, Jiangtao Bai, Xinyue Fan:
Weaken Noisy Feature: Boosting Semi-supervised Learning by Noise Estimation. 399-418 - Weiye Peng, Shenghua Zhong

:
Multi-dimension Full Scene Integrated Visual Emotion Analysis Network. 419-434 - Shan Huang, Wenhua Qian:

Gap-KD: Bridging the Significant Capacity Gap Between Teacher and Student Model. 435-453

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














