


Image and Vision Computing, Volume 162
Volume 162, 2025
- Junxiong Zhang, Jin Peng, Kaiyun Wang:
Athlete posture estimation and analysis based on embodied artificial intelligence. 105598
- Xing Liu, Hao Tang:
BCDPose: Diffusion-based 3D Human Pose Estimation with bone-chain prior knowledge. 105636
- Zhilong Ou, Hongxing Wang, Jiawei Tan, Jiaxin Li, Ziyi Zhao, Zhangbin Qian:
Label refinement for change detection in remote sensing. 105639
- Chunsing Lo, Hao Zhang, Andy J. Ma:
Invariant prompting with classifier rectification for continual learning. 105641
- Qiyue Sun, Yang Yang, Haoxuan Xu, Zezhou Li, Yunxia Liu, Hongjun Wang:
MG-KG: Unsupervised video anomaly detection based on motion guidance and knowledge graph. 105644
- Yingcheng Lin, Yuxiao Wang, Rui Ding, Haijun Liu, Xichuan Zhou:
Distribution-modulated binary neural network for image classification. 105646
- Yujie Wu, Hengliang Tan, Jiao Du, Shuo Yang, Guofeng Yan:
Deep Hybrid Manifold Network with joint metric learning for image set classification. 105647
- Dahuin Jung:
Harnessing consistency for improved test-time adaptation. 105650
- Minchen Yang, Ziyi Yang, Nur Intan Raihana Ruhaiyem:
MAFUNet: Mamba with adaptive fusion UNet for medical image segmentation. 105655
- Tichao Wang, Fusheng Hao, Qieshi Zhang, Jun Cheng:
Progressive background-foreground difference enhancement for few-shot 3D point cloud semantic segmentation. 105656
- Haotian Lei, Xiangyu Liu, Yan Zhou, Guo Niu, Chang'an Yi, Yuexia Zhou, Xiaofeng Liang, Fuhe Liu:
MMFEIR: Multi-attention Mutual Feature Enhance and Instance Reconstruction for category-level 6D object pose estimation. 105657
- Serena Lembo, Paola Barra, Luigi Di Biasi, Thierry Bouwmans, Genoveffa Tortora:
AI4RDD: Artificial Intelligence and Rare Disease Diagnosis: A proposal to improve the anamnesis process. 105658
- Qiqi Kou, Jiapeng Chen, Hailong Zhang, Tianshu Song, He Jiang, Deqiang Cheng, Liangliang Chen:
DMNet: Image dehazing via Dual-Domain Modulation. 105659
- Davide Caffagni, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara:
Augmenting and mixing Transformers with synthetic data for image captioning. 105661
- Wei Gao, Li Jin, Youssef Akoudad, Yang Yang:
APS-NeuS: Adaptive planar and skip-sampling for 3D object surface reconstruction in high-specular scenes. 105665
- Zeynep Hilal Kilimci, Mustafa Yalcin, Ayhan Kucukmanisa, Amit Kumar Mishra:
Advancing heart disease diagnosis with vision-based transformer architectures applied to ECG imagery. 105666
- Marcello Di Giammarco, Antonella Santone, Mario Cesarelli, Fabio Martinelli, Francesco Mercaldo:
Explainable retinal disease classification and localization through Convolutional Neural Networks. 105667
- Alessandro Sebastian Podda, Riccardo Balia, Marco Manolo Manca, Jacopo Martellucci, Livio Pompianu:
A deep learning strategy for the 3D segmentation of colorectal tumors from ultrasound imaging. 105668
- Liangyu Zhou, Xiaoyan Luo, Rui Xue:
Modal-aware contrastive learning for hyperspectral and LiDAR classification. 105669
- Rahim Khan, Nada Alzaben, Yousef Ibrahim Daradkeh, Xianxun Zhu, Inam Ullah:
Pyramidal attention with progressive multi-stage iterative feature refinement for salient object segmentation. 105670
- Yue Huang, Pan Wang, Yumei Zheng, Bochuan Zheng:
Lightweight multi-scale global attention enhancement network for image super-resolution. 105671
- Amirreza Fateh, Mohammad Reza Mohammadi, Mohammad Reza Jahed-Motlagh:
MSDNet: Multi-scale decoder for few-shot semantic segmentation via transformer-guided prototyping. 105672
- Hussein Hasan, Miguel Ángel García, Hatem A. Rashwan, Domenec Puig:
CoHAtNet: An integrated convolutional-transformer architecture with hybrid self-attention for end-to-end camera localization. 105674
- Zhongxuan Zhang, Bi Zeng, Xinyu Ni, Yimin Du:
BTMTrack: Robust RGB-T tracking via dual-template bridging and temporal-modal candidate elimination. 105676
- Huiyu Luo:
Combining spatio-temporal attention and multi-level feature fusion for video saliency prediction. 105678
- Zunair Safdar, Jinfang Sheng, Muhammad Usman Saeed, Muhammad Ramzan, A. Al-Zubaidi:
Empowering cardiovascular diagnostics with SET-MobileNet: A lightweight and accurate deep learning based classification approach. 105684
- Agniva Sengupta, Stefan Zachow:
Shape-from-template with generalised camera. 105579
- Mukhtiar Khan, Inam Ullah, Nadeem Khan, Sumaira Hussain, Muhammad Ilyas Khattak:
ADPNet: Attention-Driven Dual-Path Network for automated polyp segmentation in colonoscopy. 105648
- Yinzhe Cui, Jing Liu, Ze Teng, Shuangfeng Yang, Hongfeng Li, Pingkang Li, Jiabin Lu, Yajuan Gao, Yun Peng, Hongbin Han, Wanyi Fu:
Multi-scale feature fusion with task-specific data synthesis for pneumonia pathogen classification. 105662
- Weicheng Song, Siyou Guo, Mingliang Gao, Qilei Li, Xianxun Zhu, Imad Rida:
Deepfake detection via Feature Refinement and Enhancement Network. 105663
- Lorenzo Putzu, Simone Porcu, Andrea Loddo:
Distributed collaborative machine learning in real-world application scenario: A white blood cell subtypes classification case study. 105673
- Marcello Di Giammarco, Antonella Santone, Mario Cesarelli, Fabio Martinelli, Francesco Mercaldo:
A method for skin lesion detection and localization by means of Deep Learning and reliable prediction explainability. 105675
- Xudong Zhou, Jun Tang, Ke Wang, Nian Wang, Han Chen:
Graph hashing network for image retrieval. 105677
- Diego Gragnaniello, Antonio Greco, Carlo Sansone, Bruno Vento:
FOCUS: Improving fire detection on videos by scenario adaptation. 105679
- Xin Hu, Fen Chen, Zongju Peng, Lian Huang, Jiawei Xu:
MMCFNet: Multi-scale and multi-modal complementary fusion network for light field salient object detection. 105680
- Mengran Hou, Junmin Liu, Zengjie Song, Yongjun Wang:
Semantic-consistency multi-view deep subspace clustering network with frequency branches. 105681
- Zhenhua Bai, Qiangchang Wang, Lu Yang, Xinxin Zhang, Yanbo Gao, Yilong Yin:
Diverse Information Aggregation with Adaptive Graph Construction and prompts for deepfake detection. 105682
- Gianluca Zaza, Gabriella Casalino, Sergio Caputo, Giovanna Castellano:
Estimating blood pressure using video-based PPG and deep learning. 105683
- Ricardo Pizarro, Roberto Valle, José Miguel Buenaposada, Luis Miguel Bergasa, Luis Baumela:
Pose-guided token selection for the recognition of activities of daily living. 105686
- Zhiyu Zheng, Dake Zhou, Yiming Shao, Xin Yang:
EGU-GS: Efficient Gaussian utilization for real-time 3D Gaussian splatting. 105687
- Shihui Zhang, Haonan Yang, Jiawei Zhang, Xinyu Wang:
DBNet: A depth-guided and boundary-aware network for amodal instance segmentation. 105688
- Marco Caruso, Lucia Cimmino, Fabio Narducci, Chiara Pero, Gianluca Ronga:
Advancements in basketball action recognition: Datasets, methods, explainability, and synthetic data applications. 105689
- Wei Zhang, Chenglin Zhou, Xuekang Peng, Zhichao Lian:
InpaintingPose: Enhancing human pose transfer by image inpainting. 105690
- Yakup Abrek Er, Arda Güler, Mehmet Cagri Demir, Hande Uysal, Gamze Babur Guler, Ilkay Öksüz:
Spatiotemporal XAI: Explaining video regression models in echocardiography videos for ejection fraction prediction. 105691
- Manar N. Amin, Muhammad Ali Rushdi, Rasha Kamal, Amr Farouk, Mohamed Gomaa, Noha M. Fouad, Ahmed M. Mahmoud:
A deep learning approach for contrast-agent-free breast lesion detection and classification using adversarial synthesis of contrast-enhanced mammograms. 105692
- Song Guo:
Enhancing cross-domain generalization in retinal image segmentation via style randomization and style normalization. 105694
- Hui Li, Su Qin, Saiyu Li, Ying Gao, Yanli Wu:
Synergistic-aware cascaded association and trajectory refinement for multi-object tracking. 105695
- Asma Aldrees, Nihal Abuzinadah, Muhammad Umer, Dina Abdulaziz Alhammadi, Shtwai Alsubai, Raed Alharthi:
Deepfake detection using optimized VGG16-based framework enhanced with LIME for secure digital content. 105696
- Shiyuan Li, Hongbo Bi, Disen Mo, Cong Zhang, Yue Li:
ECNet: An edge-guided and cross-image perception network for collaborative camouflaged object detection. 105697
- Yuanqing Wang, Tao Wang, Xiangbo Shu, Yuhui Zheng, Jin Ding, Xianghui Fu, Zhaohui Zheng:
Structure-aware contrastive learning for glomerulus segmentation in renal pathology. 105698
- Yaohui Guo, Luanyuan Dai, Xinwei Gan, Yuting Huang, Miaohua Ruan, Detian Huang:
One-step diffusion for real-world image super-resolution via degradation removal and text prompts. 105699
- Liqin Huang, Hanyu Zheng, Lin Pan, Zhipeng Su, Qiang Wu:
Codebook prior-guided hybrid attention dehazing network. 105700
- Bohan Yang, Yong Luo, Bo Du, Dongjing Shan, Chuan Cheng, Gang Liu, Jun Zhang, Jingnan Liu:
DoseNet: Dose-adaptive prediction of the parotid glands deformation for radiotherapy planning. 105701
- Chuyue Zhao, Xin Huang, Xue Wang, Guoqing Zhou, Qing Wang:
Phase shift guided dynamic view synthesis from monocular video. 105702
- Liping Zhu, Xuan Li, Bohui Li, Chengyang Li, Bingyao Wang, Xianxiang Chang:
SAMUNet: Enhancing pillar-based 3D object detection in autonomous driving with Shape-aware Mini-Unet. 105703
- Yufeng Yin, Xiaoyan Liu, Qing Fan, Zichao Zhang:
A non-local adaptive hypothesis propagation for multi-view stereo. 105704
- Fan Yang, Kunchi Li, Nanfeng Jiang, Yun Wu, Ziyu Li, Da-Han Wang:
LDH-Net: Luminance-based Deep Hybrid Network for Document Image De-shadowing. 105705
- Liang Zhao, Zehan Bao, Yi Xie, Hong Chen, Yaohui Chen, Weifu Li:
TSGaussian: Semantic and depth-guided Target-Specific Gaussian Splatting from sparse views. 105706
- Shengkai Liu, Jun Miao, Yuanhua Qiao, Hainan Wang:
TPWGAN: Wavelet-aware text prior guided super-resolution for scene text images. 105707
- Inayatul Haq, Zheng Gong, Haomin Liang, Wei Zhang, Rashid Khan, Lei Gu, Roland Eils, Yan Kang, Bingding Huang:
A review of breast cancer histopathology image analysis with deep learning: Challenges, innovations, and clinical integration. 105708
- Zenghui Wang, Wenhao Song, Xuening Xing, Lina Liu, Xianxun Zhu, Mingliang Gao:
A Dual-branch Progressive Network with spatial-frequency constraint for image fusion. 105709
- Hongbo Bi, Jianing Yu, Disen Mo, Shiyuan Li, Cong Zhang:
Edge-guided semantic-aware network for camouflaged object detection with PVTv2. 105720
- Jiaxin Chen, Dong Xing, Mohammad Shabaz, Yongpei Zhu, Yong Wang, Xianxun Zhu:
DNLN: Image super-resolution with Deformable Non-Local attention and Multi-Branch Weighted Feature Fusion. 105721
- Xunzhan Yao, Ming Yin, Yonghua Wang, Yi Guo:
Flexible disentangled representation learning with soft-splitting for multi-view data. 105722
- Tao Wang, Ping Li, Zeyu Pan, Hao Wang:
Few-sample video captioning using pre-trained language model with gated bidirectional fusion. 105723
- Liangduan Wu, Yan Zhuang, Guoliang Liao, Lin Han, Zhan Hua, Rui Wang, Ke Chen, Jiangli Lin:
Breast tumor detection in ultrasound images with anatomical prior knowledge. 105724
- Shihao Liu, Cheng Xu, Songyin Dai, Nuoya Li, Weiguo Pan, Bingxin Xu, Hongzhe Liu:
Self-attention enhanced dynamic semantic multi-scale graph convolutional network for skeleton-based action recognition. 105725
- Hamad Aldawsari, Saad Alammar:
Integrating explainable AI with synthetic biometric data for enhanced image synthesis and privacy in computer vision systems. 105726
- Hongkuan Wang, Qingxi Yu, Zhenguang Di, Gang Yang:
Explicit Semantic Alignment Network for RGB-T salient object detection with Hierarchical Cross-Modal Fusion. 105730
- Emrah Simsek, Baris Ozyer:
Deep learning enhanced monocular visual odometry: Advancements in fusion mechanisms and training strategies. 105732
