


default search action
21st ICIC 2025: Ningbo, China - Part VI
- De-Shuang Huang, Qinhu Zhang, Chuanlei Zhang, Wei Chen:

Advanced Intelligent Computing Technology and Applications - 21st International Conference, ICIC 2025, Ningbo, China, July 26-29, 2025, Proceedings, Part VI. Lecture Notes in Computer Science 15847, Springer 2025, ISBN 978-981-96-9855-4
Intelligent Computing in Computer Vision
- Zhuoxu Jiang, Abdusalam Dawut, Askar Hamdulla:

SiamMFT: Siamese MultiFrame Network in Infrared Small Target Tracking. 3-12 - Haoyu Liu, Xiaoqing Jiang, Chenyang Liang, Peizhi Sun, Jianwei Gu, Bang Li, Haoran Sun, Tuchuan Chen, Zhenxiang Chen:

DSF-SL: A Dual-Stream Fusion Network with Soft Labels for Robust Facial Expression Recognition. 13-26 - Jiashu Han, Yitong Ding, Minghao Liu, Yuyang Li:

CDS-YOLO: An Underwater Object Detection Algorithm Based on Improved YOLOv10s. 27-38 - Ziqi Shu, Qingfeng Wu:

Adaptive Drone-Based Tilt Photogrammetry and Real-Time 3D Reconstruction for Interactive Digital Scenography. 39-49 - Ruijie Chen, Xiongxin Tang, Fanjiang Xu, Chao Yin:

ASFST:Adaptive Spectral Filters Sparse Transformer for Hyperspectral Image Denoising. 50-62 - Zhenlin Cao, Jie Luo, Bingrong Xu:

STD-DETR: A Multi-scale Feature Fusion Network Based on RT-DETR for Small Object Detection. 63-72 - Yang Liu, Tengyu Fan, Mengfan Qu, Xinhui Zhou:

VI-YOLO: A Vehicle Detection Method Based on Bimodal Fusion for Drones. 73-83 - Hao Sun

, Ji Zhang
, Zhou Xuchuan, Jingzhong Xiao
, Jiamin Tang, Huimin Yang
:
DSCformer: Dynamic Spectrum Coordination Transformer for Multi-scale Image Deraining. 84-95 - Jin Wang, Yahong Han:

LBA: Multi-Scale Video Segment Sampling for Open-Ended Video Question Answering. 96-107 - Tingting Zhang, Zhen Xiao, Jinlin Guo, Xueliang Liu:

EmoGaussian High-Fidelity Emotional Talking Head Generation with 3D Gaussian Splatting. 108-119 - Benli Zou, Kun Liu, Meishu Li, Qinghao Peng:

YOLO-GH: An Enhanced Oriented Object Detection for Greenhouses in Remote Sensing Images. 120-130 - Kaihua Chen, Tingting Zhu, Shaofeng Li, Yinxue Shi:

Facial Keypoint-Based Segment-Level Driver Yawning Detection by Graph-Temporal Convolutional Neural Network Modeling. 131-142 - Yonggan Wu, Yueyi Bai, Hongqiang Wang, Xingpin Xie, Shumi Zhao:

DRIM-Net: Diversity-Enhanced Robust Information Mining Network for Visible-Infrared Person Re-identification. 143-156 - Shuwei Yan, Shuang Liang, Kenan Ye, Baihua Liu, Chi Xie, Shengjie Zhao:

Classifier Recalibration for Human-Object Interaction Detection. 157-168 - Zhiqun Yang, Eryong Wu, Chong Zhao, Shoufeng Huang:

HyDiffVeg: A Hybrid Diffusion-Based Framework for Uncertainty-Aware Vegetation Forecasting. 169-180 - Ming Xin

, Bo Wang, Xin Wang, Kaiting Gong, Caili Fang:
Enhancing Small Object Detection in UAV Aerial Imagery with DCI-DETR: A Deep Learning Approach. 181-192 - Shaohui Jin, Zhenjie Yu, Yanxin Zhang, Tianyu Liu, Hao Liu

:
Enhancing Non-line-of-Sight Imaging Through Contrastive Multiscale Context Aggregation. 193-204 - Xinnan Zhu

, Yicheng Zhu, Tixin Chen
, Wentao Wu, Yuanjie Dang
:
FDDet: Frequency-Decoupling for Boundary Refinement in Temporal Action Detection. 205-217 - Yan Jiang, Zhitao Dai, Yu Chen, Zhiqiang Chu:

An Improved Multi-task Model for Instance Segmentation and Pose Estimation Based on YOLOv11. 218-229 - Zuyin Wu, Zixuan Guo, Yifan Xie, Ying Tiffany He, Fei Ma, Fei Richard Yu:

DictAvatar: Expressive Facial Avatar Reconstruction with Facial Feature Dictionary. 230-241 - Shijie Zhang, Xuefeng Yan, Jiamei Xiong, Li Dai, Xiangping Zhai:

UniMamba: A Unified CNN-Mamba Model for Infrared Small Target Detection. 242-254 - Chengzhang Wei, Bo Yin:

Unified Identity and Attribute Learning for Visible-Infrared Person Re-identification. 255-266 - Z. Ye, Prashan Premaratne, Peter James Vial:

Hybrid Positional Encoding for Spatiotemporal Feature Separation in Sign Language Recognition. 267-279 - Feifei Xu, QiYe Cai, Fumiaoyue Jia, HaoRan Bi, Kang Han, BinEr Zuo:

Enhancing CLIP for Pedestrian Image-Text Retrieval via Bi-level Alignment and Weighted Similarity Distribution Matching Loss. 280-291 - Xueyang Sun, Weimin Wei, Qiang Wu, Wuyao Shi, Mei Xue:

Image Manipulation Localization via Enhanced Cross-Modal Fusion with Edge Supervision. 292-304 - Xinyue Li, Yinsai Guo, Liyan Ma, Shaorong Xie:

Zero-Shot Scene Graph Generation with Bias Correction and Unseen Space Optimization. 305-317 - Jiayong Zhong, Jinjun Wang, Kaifan Hou, Yongqiang Ma:

HomoMamba for Self-supervised Homography Estimation. 318-329 - Haoyang Long:

Hierarchical Local-Latent Diffusion Model for Efficient Video Deblurring. 330-342 - Mengyao Zhou

, Shuo Wang, Yanmin Chen, Jun Luo
:
CropAug: Controllable Region Cropping for Fine-Grained Data Augmentation. 343-354 - Bing Liu, Hu Liang, Jiacheng Qu, Yuchen Liu

, Shengrong Zhao:
WGMVSNet: An Efficient Dual-branch Self-supervised Multi-view Stereo Network for 3D Reconstruction. 355-367 - Yuhan Cai, Yang Hua, Wenjie Zhang, Xiaoning Song, Zhenhua Feng:

Catching Inter-Modal Artifacts: A Cross-Modal Framework for Temporal Forgery Localization. 368-380 - Jian Tang, Ranfeng Shi, Leilei Gu:

An Immersive Virtual Reality Training System for Enhancing Orientation and Mobility in Individuals with Visual Impairments. 381-393 - Kai Cheng

, MingZhe Yu, Lei Wu
, Manyi Li
, YaJie Xu
:
StoryWeaver: Consistent Multi-character Open-Ended Story Generation. 394-405 - Xianming Huang

, Hongxiang Yu, Ainan Liang, Yawen Gao, Pengju Si:
Efficient Surface Defect Detection via Multi-scale Features and Lightweight Attention. 406-417 - Dongsheng Yang, Meiling Zhu, Yinfeng Yu:

Landmark-Guided Knowledge for Vision-and-Language Navigation. 418-428 - Xinpan Yuan, Guorong Liang, Liujie Hua, Shaomin Xie, Wenguang Gan:

MAS-ZSAS: A Zero-Shot Anomaly Segmentation Framework with Multi-attribute Guided Text Prompts. 429-440 - Yuao Cao, Jundian Song, Yu Pan, Ming Chen:

HLD-DETR: A Lightweight Transformer-Based Model for Underwater Seafood Detection. 441-452 - Zhike Qiu, Yuhao Qin, Luping Zeng

, Liangming Wen:
MBFSNet: Multi-scale Binocular Fusion Semantic Co-occurrence Network for Multi-label Fundus Disease Diagnosis. 453-464 - Yanyan Su, Zhiliang Qiu

, Min Lu, Jun Xiang, Shenglian Lu:
Visibility-Guided GCN-Transformer: Enhancing 2D Pose Estimation Under Occlusion. 465-477 - Zhongteng Zhang

, Qing Peng, Liu Zhang, Zihao Zhang, Jing Chong, Weihong Huang
:
STPEFormer: Spatio-Temporal Pose Embedding-Enhanced Transformer for Energy Expenditure Estimation. 478-489 - Tao Zhang, Zebing Wei, Hongjun Xie, Panfeng An:

Detail Aware CompletionNet for Point Cloud Completion. 490-501 - Zihong Guo, Chen Wan, Yayin Zheng, Hailing Kuang, Xiaohai Lu:

Boosting Adversarial Transferability Against Defenses via Multi-scale Transformation. 502-513 - Qi Cao, Wenzheng Liu, Chen Sun, Jianhao Wei, Jin Zhang:

Information Bottleneck Driven Masked Autoencoders for Data-Efficient Auxiliary Learning. 514-525

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














