


default search action
Image and Vision Computing, Volume 163
Volume 163, 2025
- Zenan Xu

, Zhengyao Bai
, Han Ma, Mingqiang Xu, Qiqin Huang, Tao Lin:
InceptionWTMNet: A hybrid network for Alzheimer's Disease detection using wavelet transform convolution and Mixed Local Channel Attention on finely fused multimodal images. 105693 - Syamantak Sarkar

, Revoti Prasad Bora
, Bhupender Kaushal
, Sudhish N. George
, Kiran B. Raja
:
Assessing the noise robustness of Class Activation Maps: A framework for reliable model interpretability. 105717 - Nicolò Francesco Resmini, Eugenio Lomurno

, Cristian Sbrolli, Matteo Matteucci:
Your image generator is your new private dataset. 105727 - Nishant Kumar, Amit Kumar Singh

:
Artificial intelligence content detection techniques using watermarking: A survey. 105728 - Shuang Zeng, Chee Hong Lee, Micky C. Nnamdi, Wenqi Shi, J. Ben Tamo, Hangzhou He, Xinliang Zhang, Qian Chen, May D. Wang, Lei Zhu, Yanye Lu

, Qiushi Ren:
Novel extraction of discriminative fine-grained feature to improve retinal vessel segmentation. 105729 - Neeraj Baghel, Shiv Ram Dubey

, Satish Kumar Singh:
UpAttTrans: Upscaled attention based transformer for facial image super-resolution. 105731 - Haishun Du

, Wenzhe Zhang, Sen Wang, Zhengyang Zhang, Linbing Cao:
FFENet: A frequency fusion and enhancement network for camouflaged object detection. 105733 - Ezequiel Perez-Zarate, Chunxiao Liu, Oscar Ramos-Soto, Diego Oliva

, Marco Pérez-Cisneros:
UNIR-Net: A novel approach for restoring underwater images with non-uniform illumination using synthetic data. 105734 - Kaikai Zhao

, Zhaoxiang Liu
, Peng Wang, Xin Wang, Zhicheng Ma, Yajun Xu, Wenjing Zhang, Yibing Nan, Kai Wang, Shiguo Lian:
MITS: A large-scale multimodal benchmark dataset for Intelligent Traffic Surveillance. 105736 - Shuren Zhou

, Qihang Zhou, Jiao Liu:
Dynamic sparse and weight allocation-based text-driven person retrieval. 105737 - Md Sarfaraz Momin, Abu Sufian

, Debaditya Barman, Marco Leo, Cosimo Distante, Naser Damer:
Explainable deepfake detection across different modalities: An overview of methods and challenges. 105738 - Juan Zhao, Lili Kong, Deshang Sun, Deng Xiong, Jiancheng Lv:

Mining fine-grained attributes for vision-semantics integration in few-shot learning. 105739 - Siqi Zhang

, Lu Zhang, Zhiyong Liu:
Test-time adaptation for object detection via Dynamic Dual Teaching. 105740 - Emrah Simsek

, Baris Ozyer
:
DeepDCT-VO: 3D directional coordinate transformation for low-complexity monocular visual odometry using deep learning. 105742 - Shuaishuai Deng

, Tianhua Chen, Qinghua Qiao:
DECF-FGVC: A discriminative enhancement and complementary fusion approach for fine-grained bird visual classification. 105744 - Yukang Huo, Mingyuan Yao, Tonghao Wang, Qingbin Tian, Jiayin Zhao, Xiao Liu, Haihua Wang:

PR-DETR: Extracting and utilizing prior knowledge for improved end-to-end object detection. 105745 - Dooho Choi, Yunsick Sung:

PixTention: Dynamic pixel-level adapter using attention maps. 105746 - Tao Wang, Weijie Wang, Fausto Giunchiglia, Fengzhi Zhao, Ye Zhang, Duo Yu, Guixia Liu

:
MBT-Polyp: A new Multi-Branch Memory-augmented Transformer for polyp segmentation. 105747 - Zahra Solatidehkordi, Tamer Shanableh:

Fall detection using deep learning with features computed from recursive quadratic splits of video frames. 105749 - Yunlei Sun, Pengxiao Shi, Tiancheng Chen, Danning Qi, Ke Xu:

MFET: Multi-frequency enhancement transformer for single-image super-resolution. 105751 - Ming Lu, Jian Li, Duo Han Zhao, Qin Wang:

CDAF: Cross-Modal and Dual-channel Upsample Adaptive Fusion network for Point Cloud Completion. 105735 - Pasquale Coscia, Angelo Genovese, Vincenzo Piuri, Fabio Scotti:

OneN: Guided attention for natively-explainable anomaly detection. 105741 - Soyoun Won, Hyeon Bae Kim, Yong Hyun Ahn, Hong Joo Lee, Seong Tae Kim:

Understanding adversarial robustness of deep neural networks via decision reliance. 105743 - Kai Lu, Long Liu, Xin Wang, Siying Ren:

PHMG: Prompt-based Human Motion Generation for action recognition. 105748 - Lifang Zhou, Zhen Hu:

Enhanced crowd counting with weighted attention network and multi-scale feature integration. 105750 - Zeyad Q. Habeeb, Branislav Vuksanovic, Imad Q. Alzaydi:

Modified ResNet model for medical image-based lung cancer detection. 105752 - Rita Delussu, Lorenzo Putzu, Fadi Boutros, Carmen Bisogni, Naser Damer, Giorgio Fumera:

Synthetic data sets for person Re-Identification: A critical analysis. 105753 - Daniele Venturini, Marco Raoul Marini, Luigi Cinque, Gian Luca Foresti:

Leveraging spatial-channel attention in U-Net for enhanced segmentation of martian dust storms. 105754 - Fuqin Deng, Caiyun Tang, Lanhui Fu, Wei Jin, Jiaming Zhong, Hongming Wang, Nannan Li:

GNN-based primitive recombination for compositional zero-shot learning. 105762 - Hao Zhai, Zhendong Xu, Zhi Zeng, Lei Yu, Bo Lin:

EDFusion: Edge-guided attention and dynamic receptive field with dense residual for multi-focus image fusion. 105763 - Chhaya Gupta, Nasib Singh Gill, Preeti Gulia, Giovanni Pau:

CODNet: Context-based object detection network for multimodal image captioning and virtual question answering. 105768 - Jinlong Liu, Yaping Zhang:

DPDNet : The lightweight stereo matching network based on disparity probability distribution consistency. 105771 - Peichao Jiang, Mayire Ibrayim, Sitong Shen:

TCaEx:Targeted Caption as External Knowledge for knowledge-based visual question answering. 105772 - Tengda Huang, Yu Zhang, Tianren Li, Yufu Qu, Fulin Liu, Zhenzhong Wei:

Burst image super-resolution via multi-cross attention encoding and multi-scan state-space decoding. 105773 - Zhengyu Zhu, Xinaoxue Zhang, Xiaobo Zhang, Zixuan Zhao, Feng Chen:

GLMambaNet: Mamba-based decoder with local detail enhancement for semantic segmentation of remote sensing imagery. 105774 - Kim Nhat Minh Nguyen, Hung Viet Vuong, Ngoc-Quan Ha-Phan, Myungsik Yoo:

FuPaSCo: Long-range and local context fusion for 3D panoptic scene completion. 105776 - Van Quang Nguyen, Thi-Thao Tran, Gia-Bao Truong, Nhu-Linh Than, Ngoc-Khai Hoang, Dinh-Hieu Nguyen, Van-Truong Pham:

A dense attention Mamba-based network with Adaptive Sigmoid Fowlkes-Mallows Loss for enhanced medical image segmentation. 105778 - Firos V. M., Alphonse P. J. A., Ugo Fiore, G. R. Gangadharan:

RLTNT: An explainable residual learning-based transformer model for kidney disease classification. 105781 - Jifeng Guo, Yutong Liu, Zhiqi Pang, Jing Liu, Yan Chen:

Merge-split collaborative learning for unsupervised visible-infrared person re-identification. 105783 - Shaqing Song, Kuo Tang, Ying Zhang:

PST-Mamba: Spatio-temporal selective state fusion for effective point cloud video understanding with state space models. 105785

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














