


default search action
The Visual Computer, Volume 41
Volume 41, Number 1, January 2025
- Nadia Magnenat-Thalmann:

Welcome to the Year 2025. 1-2 - Acknowledgement to reviewers 2024. 3-10

- Wenji Yang

, Liping Xie, Wenbin Qian, Canghai Wu, Hongyun Yang:
Coarse-to-fine cascaded 3D hand reconstruction based on SSGC and MHSA. 11-24 - Gusu Song, Shaoyan Gai

, Feipeng Da:
Memory-based gradient-guided progressive propagation network for video deblurring. 25-40 - Rohit Pratap Singh

, Dolendro Singh Laiphrakpam:
Dyhand: dynamic hand gesture recognition using BiLSTM and soft attention methods. 41-51 - Zhe Li

, Hui Lv, Libo Cheng, Xiaoning Jia:
Image deblocking algorithm based on GC and SSR. 53-66 - I-Chao Shen

, Li-Wen Su, Yu-Ting Wu, Bing-Yu Chen:
StylePart: image-based shape part manipulation. 67-78 - Youssef Ait Khouya

, Mohammed Ait Oussous, Abdeslam Jakimi
, Faouzi Ghorbel
:
Stable and invertible invariants description for gray-level images based on Radon transform. 79-97 - Mahmoud A. Eldosoky

, Jianping Li, Amin Ul Haq, Fanyu Zeng, Mao Xu, Shakir Khan
, Inayat Khan:
WallNet: Hierarchical Visual Attention-Based Model for Putty Bulge Terminal Points Detection. 99-114 - Rajendra Nagar

:
Robust extrinsic symmetry estimation in 3D point clouds. 115-128 - Chen Zhao, Weiling Cai

, Zheng Yuan:
Spectral normalization and dual contrastive regularization for image-to-image translation. 129-140 - Ziliang Feng, Ju Zhang, Xusong Ran, Donglu Li, Chengfang Zhang:

Ghost-Unet: multi-stage network for image deblurring via lightweight subnet learning. 141-155 - Chunlu Li

, Feipeng Da:
Refined dense face alignment through image matching. 157-171 - Xiongbo Lu, Feng Liu, Yi Rong, Yaxiong Chen, Shengwu Xiong:

MakeupDiffuse: a double image-controlled diffusion model for exquisite makeup transfer. 173-189 - Junjie Liu, Junlong Liu, Rongxin Jiang, Boxuan Gu, Yaowu Chen, Chen Shen:

Boosted verification using siamese neural network with DiffBlock. 191-208 - Xujia Qin

, Xinyu Li, Mengjia Li, Hongbo Zheng, Xiaogang Xu:
Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement. 209-227 - Xiaochun Lei

, Zeyu Chen
, Zhaoxin Yu
, Zetao Jiang
:
BENet: boundary-enhanced network for real-time semantic segmentation. 229-241 - Feihu Bian, Suya Xiong, Ran Yi, Lizhuang Ma:

Multi-view stereo-regulated NeRF for urban scene novel view synthesis. 243-255 - Hengrui Zhang

, Yongfeng Qi, Huili Chen, Panpan Cao, Anye Liang, Shengcong Wen:
LSDNet: lightweight stochastic depth network for human pose estimation. 257-270 - Zubair Ahmad Lone

, Alwyn Roshan Pais:
Salient object detection in HSI using MEV-SFS and saliency optimization. 271-280 - Clement Mailhe

, Amine Ammar, Francisco Chinesta, Dominique Baillargeat:
Towards improving synthetic-to-real image correlation for instance recognition in structure monitoring. 281-301 - Yue Yu, Yue Yang, Jingshuo Xing:

PMGAN: pretrained model-based generative adversarial network for text-to-image generation. 303-314 - Haoyu Xiong, Yu Xiang:

Robust gradient aware and reliable entropy minimization for stable test-time adaptation in dynamic scenarios. 315-330 - Zhixuan Tang, Haiyun Shen, Peng Yu, Kaisong Zhang, Jianyu Chen:

Infrared tracking for accurate localization by capturing global context information. 331-343 - Yixiu Liu, Long Zhan

, Yu Feng, Pengju Si, Shaowei Jiang, Qiang Zhao, Chenggang Yan:
Loose-tight cluster regularization for unsupervised person re-identification. 345-358 - Le-Anh Tran

, Dong-Chul Park:
Encoder-decoder networks with guided transmission map for effective image dehazing. 359-382 - Yixiu Liu, Tao Jiang, Pengju Si, Shangdong Zhu

, Chenggang Yan, Shuai Wang, Haibing Yin:
Unpaired semantic neural person image synthesis. 383-397 - Yan Huang, Xinchang Lu, Jia Fu:

Single image reflection removal via self-attention and local discrimination. 399-408 - Ziyang Chen

, Yang Zhao
, Junling He, Yujie Lu, Zhongwei Cui, Wenting Li
, Yongjun Zhang
:
Feature distribution normalization network for multi-view stereo. 409-421 - Dayu Jia, Yanwei Pang, Jiale Cao, Jing Pan:

SSNet: a joint learning network for semantic segmentation and disparity estimation. 423-435 - Ye Li, Wu Zhang, Meiling Wu, Di Zhang, Zhiguo Wang, Changjiang You:

Multi-keypoints matching network for clothing detection. 437-449 - Zhentao Zhang

, Wenhao Li, Yuxi Cheng, Qingnan Huang, Taorong Qiu:
An improved residual learning model and its application to hardware image classification. 451-464 - Ping Ma

, Xinyi He, Yiyang Chen, Yuan Liu:
ISOD: improved small object detection based on extended scale feature pyramid network. 465-479 - Jian Xiong, Jie Wu, Ming Tang, Pengwen Xiong, Yushui Huang, Hang Guo:

Combining YOLO and background subtraction for small dynamic target detection. 481-490 - Henry Senior, Gregory G. Slabaugh, Shanxin Yuan, Luca Rossi:

Graph neural networks in vision-language image understanding: a survey. 491-516 - Yuanhao Chai, Jingyu Gong, Xin Tan, Jiachen Xu, Yuan Xie, Lizhuang Ma:

Learnable scene prior for point cloud semantic segmentation. 517-534 - Kunhong Xiong, Linbo Qing

, Lindong Li, Li Guo, Yonghong Peng:
Facial expression recognition based on local-global information reasoning and spatial distribution of landmark features. 535-548 - Lixia Xue, Wenhao Wang, Ronggui Wang, Juan Yang:

Modular dual-stream visual fusion network for visual question answering. 549-562 - Jinguang Chen

, Xin Zhang, Lili Ma
, Bo Yang, Kaibing Zhang:
CS-VITON: a realistic virtual try-on network based on clothing region alignment and SPM. 563-577 - Huihui Li, Junhao Zhu, Guihua Wen, Haoyang Zhong:

Structural self-contrast learning based on adaptive weighted negative samples for facial expression recognition. 579-590 - Lihuan Zheng

, Wanru Xu, Zhenjiang Miao, Xinxiu Qiu, Shanshan Gong:
RESTHT: relation-enhanced spatial-temporal hierarchical transformer for video captioning. 591-604 - Yanxiang Hu

, Panpan Wu
, Bo Zhang
, Wenhao Sun
, Yaru Gao
, Caixia Hao
, Xinran Chen
:
A new multi-focus image fusion quality assessment method with convolutional sparse representation. 605-624 - Shuyu Xiao, Yongfang Wang, Yihan Wang:

SISIM: statistical information similarity-based point cloud quality assessment. 625-638 - Jing Wu, Hao Wu

, Guowu Yuan
:
Detail-aware image denoising via structure preserved network and residual diffusion model. 639-658 - Luhan Wang

, Jun Li, Shangwei Guo, Shaokun Han:
A cascaded graph convolutional network for point cloud completion. 659-674 - Zhongxu Li, Qihan He, Wenyuan Yang

:
E-FPN: an enhanced feature pyramid network for UAV scenarios detection. 675-693 - Jiakun Zhao, Yige Cai

:
SCAKD: a knowledge distillation framework based on spatial-corner attention for infrared and visible image fusion. 695-708 - Hao Zhou, Junjie Yin, Yilun Yang, Meie Fang

, Ping Li:
Topology-guided accelerated vector field streamline visualization. 709-722 - Kun Wu, Lei Zhu

, Weihang Shi, Wenwu Wang:
Automated fabric defect detection using multi-scale fusion MemAE. 723-737 - A. Lubna

, Saidalavi Kalady, A. Lijiya:
Visual question answering on blood smear images using convolutional block attention module powered object detection. 739-757 - Xiyu Wei, Yanmei Dong, Qin Liu, Lei Wang, Liantang Lou:

Robust corner detection in continuous space. 759-772 - Jing Zhao, Yongjun He, Zheng Shi, Jian Qin, Yining Xie

:
A style-aware network based on multi-task learning for multi-domain image normalization. 773-783
Volume 41, Number 2, January 2025
- Jianliang Li, Jinming Zhang

, Xiaohai Zhang, Ming Chen:
Edge-guided generative network with attention for point cloud completion. 785-798 - Haowei Zhu

, Suqin Bai, Jinlong Shi, Chenggen Wang, Yunhan Sun, Jiawen Lu, Xin Shu, Shucheng Huang:
IOFusion: instance segmentation and optical-flow guided 3D reconstruction in dynamic scenes. 799-813 - Chao Yang, Meng Yang

, Hongyu Li
, Linlu Jiang, Xiang Suo
, Lijuan Mao, Weiliang Meng, Zhen Li:
A survey on soccer player detection and tracking with videos. 815-829 - Sameer Bhimrao Patil

, Suresh Shirgave:
Instructor emotion recognition system using manta ray foraging algorithm for improving the content delivery in video lecture. 831-851 - Ting Yu, Weiliang Meng, Zhongqi Wu, Jianwei Guo, Xiaopeng Zhang:

Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow. 853-867 - Yasmeen Cheema

, Muhammad Nadeem Cheema, Anam Nazir, Fahad Ahmed KhoKhar, Ping Li, Ayaz Ahmed:
A novel approach for improving open scene text translation with modified GAN. 869-881 - Pengbin Fu, Ganyun Xiao, Huirong Yang:

SATD: syntax-aware handwritten mathematical expression recognition based on tree-structured transformer decoder. 883-900 - Roberto Alcover-Couso

, Juan C. SanMiguel, Marcos Escudero-Viñolo
, Pablo Carballeira:
Per-class curriculum for Unsupervised Domain Adaptation in semantic segmentation. 901-919 - Supriya Agrawal

, Prachi Natu:
OBB detector: occluded object detection based on geometric modeling of video frames. 921-943 - Xin Wang, Jin Feng, Jiajia Ding, Jun Gao:

Light field salient object detection based on discrete viewpoint selection and multi-feature fusion. 945-960 - Zhizhen Zhou, Yejing Huo, Guoheng Huang, An Zeng, Xuhang Chen

, Lian Huang, Zinuo Li:
QEAN: quaternion-enhanced attention network for visual dance generation. 961-973 - Shunsuke Takao

:
Underwater image sharpening and color correction via dataset based on revised underwater image formation model. 975-990 - Junqing Yuan, Mengting Fan, Zhenyang Liu, Tongxuan Han, Zhenzhong Kuang, Chihao Pan, Jiajun Ding:

Collaborative neural radiance fields for novel view synthesis. 991-1006 - Can Zhang, Feipeng Da, Shaoyan Gai:

Point clouds feature frequency domain analysis based on multilayer perceptron. 1007-1020 - Lei Wang, Xue-Song Tang

, Kuangrong Hao:
GFPE-ViT: vision transformer with geometric-fractal-based position encoding. 1021-1036 - Fahad Ahmed KhoKhar, Jamal Hussain Shah, Rabia Saleem, Anum Masood:

Harnessing deep learning for faster water quality assessment: identifying bacterial contaminants in real time. 1037-1048 - Yixiao Jin, Fu Gui, Minghao Chen, Xiang Chen, Haoxuan Li, Jingfa Zhang

:
Deep learning-driven automated quality assessment of ultra-widefield optical coherence tomography angiography images for diabetic retinopathy. 1049-1059 - Bo Qian, Xiangning Wang, Zhouyu Guan

, Dawei Yang, An-ran Ran, Tingyao Li, Zheyuan Wang, Yang Wen, Xinming Shu, Jinyang Xie, Shichang Liu, Guanyu Xing, Julio Silva-Rodríguez, Riadh Kobbi, Ping Li, Tingli Chen, Lei Bi, Jinman Kim, Weiping Jia
, Huating Li, Jing Qin, Ping Zhang, Ching-Yu Cheng, Pheng-Ann Heng, Tien Yin Wong, Carol Y. Cheung, Yih-Chung Tham, Nadia Magnenat-Thalmann, Bin Sheng:
HRDC challenge: a public benchmark for hypertension and hypertensive retinopathy classification from fundus images. 1061-1077 - Dapeng Yan, Gangyi Ding, Kexiang Huang

, Tianyu Huang:
Generating natural pedestrian crowds by learning real crowd trajectories through a transformer-based GAN. 1079-1096 - Yan Zhou, Xiang Chen, Tingyao Li, Shiqun Lin, Bin Sheng, Ruhan Liu, Rongping Dai:

GAMNet: a gated attention mechanism network for grading myopic traction maculopathy in OCT images. 1097-1108 - Gang Liu

, Jiebang Wang, Yao Qian, Yonghua Li:
Infrared and visible image fusion method based on visual saliency objects and fuzzy region attributes. 1109-1125 - Shweta Saboo

, Joyeeta Singha:
Semantic hand gesture integration system using self-co-articulation and movement epenthesis detection. 1127-1140 - Lars Zawallich

:
Unfolding polyhedra via tabu search. 1141-1154 - Bo Qian, Hao Chen, Yupeng Xu, Yang Wen, Huating Li, Yuan Xie, David Dagan Feng, Jinman Kim, Lei Bi, Xun Xu, Xiangui He, Bin Sheng

:
Deep contour attention learning for scleral deformation from OCT images. 1155-1170 - Lan Wei, Nikolaos M. Freris:

Multi-scale graph neural network for physics-informed fluid simulation. 1171-1181 - Mengsi Guo, Mingfu Xiong

, Jin Huang, Xinrong Hu, Tao Peng:
Face photo-sketch portraits transformation via generation pipeline. 1183-1196 - Mengsi Wang

, Yuan Mei, Lichun Yang, Bin Tian, Kaijun Wu:
SDR: stepwise deep rectangling model for stitched images. 1197-1211 - Qingkuo Meng

, Yongjian Huai, Fei Ma, Wentao Ye, Haifeng Xu, Siyu Yang:
Visualization of the occurrence and spread of wildfires in three-dimensional natural scenes. 1213-1226 - Xuan Miao, Shijie Li, Zheng Li, Wenzheng Xu, Ning Yang:

Multi-scale gated network for efficient image super-resolution. 1227-1239 - Václav Skala:

A new fully projective O(lg N) line convex polygon intersection algorithm. 1241-1249 - Gaoming Yang, Yifeng Ding

, Xianjin Fang, Ji Zhang, Yan Chu:
Fast face swapping with high-fidelity lightweight generator assisted by online knowledge distillation. 1251-1271 - Wensheng Li, Jing Zhang

, Jiafeng Li, Li Zhuo:
Unpaved road segmentation of UAV imagery via a global vision transformer with dilated cross window self-attention for dynamic map. 1273-1291 - Xiangning Wang, Zhouyu Guan

, Bo Qian, Tingli Chen, Qiang Wu:
A deep learning system for the detection of optic disc neovascularization in diabetic retinopathy using optical coherence tomography angiography images. 1293-1302 - Mei Zhang, Lingling Liu, Yongtao Pei, Guojing Xie, Jinghua Wen:

Semantic segmentation of multi-scale remote sensing images with contextual feature enhancement. 1303-1317 - Ya'nan Guan, Shujiao Liao, Wenyuan Yang:

AParC-DETR: Accelerate DETR training by introducing Adaptive Position-aware Circular Convolution. 1319-1333 - Yong Liu, Xingyuan Li, Yong Liu, Wei Zhong:

SimpliFusion: a simplified infrared and visible image fusion network. 1335-1350 - Liping Zhu, Silin Wu, Xianxiang Chang, Yixuan Yang, Xuan Li:

Rethinking group activity recognition under the open set condition. 1351-1366 - Yuanqi Hu, Jianqi Zhang, Ling Bai, Jing Li, Bing Li, Ying Zang, Wenjun Hu:

From sketch to reality: precision-friendly 3D generation technology. 1367-1378 - Wenxuan Liu, Xuemei Jia, Yihao Ju, Yakun Ju, Kui Jiang, Shifeng Wu, Luo Zhong, Xian Zhong:

Fragrant: frequency-auxiliary guided relational attention network for low-light action recognition. 1379-1394 - Wuzhen Shi, Fei Tao, Yang Wen:

Joint super-resolution-based fast face image coding for human and machine vision. 1395-1408 - Shengzhou Luo

, Jingxing Xu, John Dingliana, Mingqiang Wei, Lu Han, Lewei He, Jiahui Pan
:
Publisher Correction: Twinenet: coupling features for synthesizing volume rendered images via convolutional encoder-decoders and multilayer perceptrons. 1409-1411 - Liwen Huang, Shujiao Liao, Wenyuan Yang

:
Correction: DC-PSENet: a novel scene text detection method integrating double ResNet-based and changed channels recursive feature pyramid. 1413-1414
Volume 41, Number 3, February 2025
- Yanfeng Zhao, Zhenjian Yang, Yunjie Zhang, Yadong Chen:

BGFNet: boundary information-aided graph structure fusion network for semantic segmentation of remote sensing images. 1415-1433 - Shuo Tong, Han Liu, Runyuan Guo

, Wenqing Wang, Ding Liu:
Context-Aware Enhanced Virtual Try-On Network with fabric adaptive registration. 1435-1451 - Pengshu Du, Xiao Wang, Qi Zheng, Xi Wang, WeiGang Li, Xin Xu:

Glare countering and exploiting via dual stream network for nighttime vehicle detection. 1453-1466 - Yongli Liu, Degang Yang

, Tingting Song, Yichen Ye, Xin Zhang
:
YOLO-SSP: an object detection model based on pyramid spatial attention and improved downsampling strategy for remote sensing images. 1467-1484 - Robin G. C. Maack, Felix Raith

, Juan F. Pérez
, Gerik Scheuermann, Christina Gillmann:
A workflow to systematically design uncertainty-aware visual analytics applications. 1485-1498 - QiGuang Zhu, Qiang Cen, YuXin Wang, Weidong Chen, Shuo Liu

:
An underwater target recognition algorithm incorporating improved attention mechanism and downsampling. 1499-1509 - Wenyue Sun, Jindong Zhang, Yitong Liu:

Adversarial-based refinement dual-branch network for semi-supervised salient object detection of strip steel surface defects. 1511-1525 - Jun Yang, Zilu Wu, Renbiao Wu:

Micro-expression recognition based on contextual transformer networks. 1527-1541 - Ya Li, Ziming Li, Huiwang Liu, Qing Wang:

ZMNet: feature fusion and semantic boundary supervision for real-time semantic segmentation. 1543-1554 - Jindrich Adolf

, Peter Kán
, Tiare Feuchtner
, Barbora Adolfová
, Jaromír Dolezal
, Lenka Lhotská
:
Offistretch: camera-based real-time feedback for daily stretching exercises. 1555-1571 - Qunpo Liu

, Zhiwei Lu, Ruxin Gao, Xuhui Bu, Naohiko Hanajima:
SimpleMask: parameter link and efficient instance segmentation. 1573-1589 - Xiao Fang, Xin Gao, Baofeng Li, Feng Zhai, Yu Qin, Zhihang Meng, Jiansheng Lu, Chun Xiao:

A non-uniform low-light image enhancement method with multi-scale attention transformer and luminance consistency loss. 1591-1608 - Haibin Li, Aodi Guo, Yaqian Li

:
CCMA: CapsNet for audio-video sentiment analysis using cross-modal attention. 1609-1620 - Xun Zhao, Feiyun Xu

, Zheng Liu:
TransDehaze: transformer-enhanced texture attention for end-to-end single image dehaze. 1621-1635 - Qi Zhao, Congxuan Zhang

, Zhibo Rao, Zhen Chen, Zige Wang, Ke Lu:
GPDF-Net: geometric prior-guided stereo matching with disparity fusion refinement. 1637-1654 - Haihua Ding, Chuan Lin, Fuzhang Li, Yongcai Pan:

A feature aggregation network for contour detection inspired by complex cells properties. 1655-1671 - Zhengwu Yuan, Peixian Tang, Xinguang Sang, Fan Zhang, Zheqi Zhang:

Visionary: vision-aware enhancement with reminding scenes generated by captions via multimodal transformer for embodied referring expression. 1673-1688 - Munish Bhardwaj, Nafis Uddin Khan

, Vikas Baghel
:
Road crack detection using pixel classification and intensity-based distinctive fuzzy C-means clustering. 1689-1704 - Houfu Peng, Xing Lu, Daoxun Xia

, Xiaoyao Xie:
A novel image restoration solution for cross-resolution person re-identification. 1705-1717 - Caifeng Liu, Fangjie Gu:

Differential motion attention network for efficient action recognition. 1719-1731 - Gang Zhang, Yang Geng, Zhao G. Gong:

A comprehensive review of deep learning approaches for group activity analysis. 1733-1755 - Huijuan Wang, Xinyue Chen, Quanbo Yuan, Peng Liu:

A review of 3D object detection based on autonomous driving. 1757-1775 - Libo Sun, Yifan Li, Wenhu Qin:

PEPillar: a point-enhanced pillar network for efficient 3D object detection in autonomous driving. 1777-1788 - Mohamed Charfeddine Mzoughi, Najib Ben Aoun

, Sami Naouali:
A review on kinship verification from facial information. 1789-1809 - Jiawei Chen, Wen Su

, Mengjiao Ge, Ye He, Jun Yu:
To-Former: semantic segmentation of transparent object with edge-enhanced transformer. 1811-1825 - Ying Ma, Meng Wang, Guangyun Lu, Yajun Sun:

Multi-label semantic sharing based on graph convolutional network for image-to-text retrieval. 1827-1840 - Xiafan Li, Hongyan Quan:

MVPCL: multi-view prototype consistency learning for semi-supervised medical image segmentation. 1841-1854 - Yihe Nie, Xingbo Zhao, Yongxiang Li

, Qianwen Lu, Qingchuan Tao, Yanmei Yu:
DEAR: a novel deep-level semantics feature reinforce framework for Infrared Small Object Segmentation. 1855-1872 - Aokun Mei, Hua Huo, Jiaxin Xu, Ningya Xu:

Multistage attention region supplement transformer for fine-grained visual categorization. 1873-1889 - Tong Li, Zhaoxuan Zhang, Yuxin Wang, Yan Cui, Yuqi Li, Dongsheng Zhou, Baocai Yin, Xin Yang:

Self-supervised indoor scene point cloud completion from a single panorama. 1891-1905 - Xuyuan Zhang

, Chen Xu
, Yu Han
, George Baciu
:
Fabric image recolorization by fuzzy pretrained neural network. 1907-1920 - Shilong Wang

, Qianwen Hou, Jiaang Li, Jianlei Liu:
TSID-Net: a two-stage single image dehazing framework with style transfer and contrastive knowledge transfer. 1921-1938 - Xiaohong Zhang, Shengwu Xiong, Zhaoyang Sun, Jianwen Xiang:

Semi-hard constraint augmentation of triplet learning to improve image corruption classification. 1939-1956 - Huijuan Wang

, Boyan Cui, Quanbo Yuan, Gangqiang Pu, Xueli Liu, Jie Zhu:
Mini-3DCvT: a lightweight lip-reading method based on 3D convolution visual transformer. 1957-1969 - Zhigang Huang, Wanli Xue

, Yuxi Zhou
, Jinlu Sun, Yazhou Wu, Tiantian Yuan, Shengyong Chen:
Dual-stage temporal perception network for continuous sign language recognition. 1971-1986 - Zixuan Yu, Zhenjun Tang

, Xiaoping Liang, Hanyun Zhang, Ronghai Sun, Xianquan Zhang:
A novel image hashing with low-rank sparse matrix decomposition and feature distance. 1987-1998 - Shiyu Li, Zehao Liu, Meijing Gao, Yang Bai, Haozheng Yin:

MDSCN: multiscale depthwise separable convolutional network for underwater graphics restoration. 1999-2010 - Suyi Liu, Fang Xu, Chengdong Wu

, Jianning Chi, Xiaosheng Yu, Longxing Wei, Chuanjiang Leng:
CMT-6D: a lightweight iterative 6DoF pose estimation network based on cross-modal Transformer. 2011-2027 - Jun Wu

, Wanyu Nie
, Yu Zheng, Gan Zuo, Jiaming Dong, Siwei Wei:
Malleable pruning meets more scaled wide-area of attention model for real-time crack detection. 2029-2046 - Qiwang Li, Mingwen Shao, Fukang Liu, Yuanjian Qiao, Zhiyong Hu:

Contrastive local constraint for irregular image reconstruction and editability. 2047-2060 - Xiang Suo, Weidi Tang, Lijuan Mao, Zhen Li:

Correction: Digital human and embodied intelligence for sports science: advancements, opportunities and prospects. 2061 - Dhruv Meduri, Mohit Sharma, Vijay Natarajan:

Correction to: Jacobi set simplification for tracking topological features in time-varying scalar fields. 2063
Volume 41, Number 4, March 2025
- Hanqin Wang, Alexei Sourin:

Visual signatures for music mood and timbre. 2065-2077 - Khawla Ben Salah, Mohamed Othmani, Jihen Fourati, Monji Kherallah:

Advancing spatial mapping for satellite image road segmentation with multi-head attention. 2079-2089 - Mikolaj Maik

, Jakub Flotynski
, Krzysztof Walczak
:
Knowledge-based approach to adaptive XR interface design for non-programmers. 2091-2105 - Max Reimann

, Martin Büßemeyer, Benito Buchheim, Amir Semmo, Jürgen Döllner, Matthias Trapp:
Artistic style decomposition for texture and shape editing. 2107-2122 - Hiba Mzoughi

, Ines Njeh, Mohamed Ben Slima, Nouha Farhat, Chokri Mhiri:
Vision transformers (ViT) and deep convolutional neural network (D-CNN)-based models for MRI brain primary tumors images multi-classification supported by explainable artificial intelligence (XAI). 2123-2142 - Dingning Long

, Rongrong Chen
:
Cognitive capacity and aesthetics: the influence of visual working memory on landscape ink painting preference. 2143-2156 - Liangwei Wang

, Zhan Wang
, Xi Zhao
, Fugee Tsung
, Wei Zeng
:
Antarctica storytelling: creating interactive story maps for polar regions with graphic-based approach. 2157-2169 - Chuang Wu

, Tingqin He:
Efficient minor defects detection on steel surface via res-attention and position encoding. 2171-2185 - Junjie Zhang, Yi Lin, Xin Zhou, Pangrong Shi, Xiaoqiang Zhu, Dan Zeng:

Precision in pursuit: a multi-consistency joint approach for infrared anti-UAV tracking. 2187-2202 - Jiayi Xu, Xuan Tan, Yixuan Ju, Xiaoyang Mao, Shanqing Zhang:

High similarity controllable face anonymization based on dynamic identity perception. 2203-2217 - Mohamed Elsayed, Mohamed Reda, Ahmed S. Mashaly, Ahmed Saleh:

LERFNet: an enlarged effective receptive field backbone network for enhancing visual drone detection. 2219-2232 - Jialin Zhu

, He Wang, David Hogg, Tom Kelly:
Learning to sculpt neural cityscapes. 2233-2249 - Suresh Cheekaty, G. Muneeswari

:
Advancing autism prediction through visual-based AI approaches: integrating advanced eye movement analysis and shape recognition with Kalman filtering. 2251-2270 - Huaping Zhou, Bin Deng, Kelei Sun, Shunxiang Zhang, Yongqi Zhang:

UTE-CrackNet: transformer-guided and edge feature extraction U-shaped road crack image segmentation. 2271-2283 - Xiaoyang Zhao, Zhuo Wang, Zhongchao Deng, Hongde Qin, Zhongben Zhu:

Transmission-guided multi-feature fusion Dehaze network. 2285-2297 - Randa I. Elanwar, Margrit Betke

:
Generative adversarial networks for handwriting image generation: a review. 2299-2322 - Yixi Li, Yanzhe Liu, Rong Chen

, Hui Li, Na Zhao:
Point cloud upsampling via a coarse-to-fine network with transformer-encoder. 2323-2337 - Neil Patrick Del Gallego

, Joel Ilao
, Macario O. Cordel II, Conrado R. Ruiz Jr.
:
Training a shadow removal network using only 3D primitive occluders. 2339-2376 - Qunpo Liu

, Qi Tang, Bo Su
, Xuhui Bu, Naohiko Hanajima, Manli Wang:
Wire rope damage detection based on a uniform-complementary binary pattern with exponentially weighted guide image filtering. 2377-2390 - Jianjian Jiang, Ziwei Chen, Fangyuan Lei, Long Xu, Jiahao Huang

, Xiaochen Yuan:
Multi-granularity hypergraph-guided transformer learning framework for visual classification. 2391-2408 - Yueqian Pan, Qiaohong Chen, Xian Fang:

DAMAF: dual attention network with multi-level adaptive complementary fusion for medical image segmentation. 2409-2424 - Wei Li

, Bowen Li, Jingqi Wang
, Weiliang Meng, Jiguang Zhang, Xiaopeng Zhang:
ROMOT: Referring-expression-comprehension open-set multi-object tracking. 2425-2437 - Longfeng Shen

, Bin Hou, Yulei Jian, Xisong Tu, Yingjie Zhang, Lingying Shuai
, Fangzhen Ge, Debao Chen:
TransFGVC: transformer-based fine-grained visual classification. 2439-2459 - Avantika Saklani, Shailendra Tiwari

, H. S. Pannu:
Deep attentive multimodal learning for food information enhancement via early-stage heterogeneous fusion. 2461-2476 - Xiang Suo

, Weidi Tang, Lijuan Mao, Zhen Li:
Digital human and embodied intelligence for sports science: advancements, opportunities and prospects. 2477-2493 - Jiaxuan Zhu, Ming Shao, Libo Sun, Siyu Xia:

ACL-SAR: model agnostic adversarial contrastive learning for robust skeleton-based action recognition. 2495-2510 - JiaYan Wen, YuanSheng Zhuang, JunYi Deng:

EDM: a enhanced diffusion models for image restoration in complex scenes. 2511-2527 - Canlin Li, Xinyue Wang, Ran Yi, Wenjiao Zhang, Lihua Bi, Lizhuang Ma:

MCLGAN: a multi-style cartoonization method based on style condition information. 2529-2544 - Haobo Dong, Tianyu Song, Xuanyu Qi, Jiyu Jin, Guiyue Jin, Lei Fan:

Exploring high-quality image deraining Transformer via effective large kernel attention. 2545-2561 - Surendrabikram Thapa, Abhijit Sarkar

:
A deep dive into enhancing sharing of naturalistic driving data through face deidentification. 2563-2594 - Runtao Xi, Jiahao Lyu

, Kang Sun, Tian Ma:
Learning kernel parameter lookup tables to implement adaptive bilateral filtering. 2595-2605 - Yi-lun Wang, Yi-zheng Lang, Yunsheng Qian

:
Effective multi-scale enhancement fusion method for low-light images based on interest-area perception OCTM and "pixel healthiness" evaluation. 2607-2627 - Alireza Dehghanpour, Zahra Sharifi, Masoud Dehyadegari

:
Point cloud downsampling based on the transformer features. 2629-2638 - Yabo Wu

, Wenting Li
, Ziyang Chen
, Hui Wen, Zhongwei Cui, Yongjun Zhang:
Distribution-decouple learning network: an innovative approach for single image dehazing with spatial and frequency decoupling. 2639-2654 - Yumei Tan, Haiying Xia, Shuxiang Song:

Robust consistency learning for facial expression recognition under label noise. 2655-2667 - Wen-Kai Tsai

, Hsin-Chih Wang:
Real-time salient object detection based on accuracy background and salient path source selection. 2669-2690 - Nauman Ullah Gilal, Marwa K. Qaraqe, Jens Schneider, Marco Agus:

Autocleandeepfood: auto-cleaning and data balancing transfer learning for regional gastronomy food computing. 2691-2708 - Ying Ni, Xiaoli Wang, Hanghang Peng, Yonzhi Li, Jinyang Wang

, Haoxuan Li, Jin Huang:
Dual-branch dilated context convolutional for table detection transformer in the document images. 2709-2720 - Yubo Zhang, Lei Xu, Haibin Xiang, Haihua Kong, Junhao Bi, Chao Han:

LKSMN: Large Kernel Spatial Modulation Network for Lightweight Image Super-Resolution. 2721-2736 - Xiaoyu Song, Dezhi Han, Chongqing Chen, Xiang Shen

, Huafeng Wu:
Vman: visual-modified attention network for multimodal paradigms. 2737-2754 - Zekang Liu, Wei Feng, Liqing Gao, Lianyu Hu

:
DBL-SC: background-independent sign language recognition based on spatial channel separation computation. 2755-2766 - Ze Ouyang, Huihuang Zhao, Yudong Zhang, Long Chen:

STVDNet: spatio-temporal interactive video de-raining network. 2767-2782 - R. Raja Sekar

, T. Dhiliphan Rajkumar, Koteswara Rao Anne:
Deep fake detection using an optimal deep learning model with multi head attention-based feature extraction scheme. 2783-2800 - Lirong Li, Jiang Ding, Hao Cui, Zhiqiang Chen, Guisheng Liao:

LiteMSNet: a lightweight semantic segmentation network with multi-scale feature extraction for urban streetscape scenes. 2801-2815 - Saba Ghazanfar Ali, Xiaoxia Wang, Ping Li, Huating Li, Po Yang, Younhyun Jung, Jing Qin, Jinman Kim, Bin Sheng:

EGDNet: an efficient glomerular detection network for multiple anomalous pathological feature in glomerulonephritis. 2817-2834 - Pan Wu, Jin Tang:

FHFN: content and context feature hierarchical fusion networks for multi-focus image fusion. 2835-2856 - Ling-Xiao Qin, Hong-Mei Sun, Xiao-Meng Duan, Cheng-Yue Che, Rui-Sheng Jia:

Adaptive learning-enhanced lightweight network for real-time vehicle density estimation. 2857-2873 - Jit Chatterjee

, Maria Torres Vega
:
3D-Scene-Former: 3D scene generation from a single RGB image using Transformers. 2875-2889 - Xinyi Liu, Guoheng Huang, Xiaochen Yuan, Zewen Zheng, Guo Zhong, Xuhang Chen

, Chi-Man Pun:
Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression. 2891-2906 - Jiazhe Miao, Tao Peng, Fei Fang, Xinrong Hu, Li Li:

TDGar-Ani: temporal motion fusion model and deformation correction network for enhancing garment animation details. 2907-2921 - Wei Song, Kaili Yang:

Dual adaptive local semantic alignment for few-shot fine-grained classification. 2923-2937 - Changhong Shi, Weirong Liu, Jiahao Meng, Xiongfei Jia, Jie Liu:

Self-prior guided generative adversarial network for image inpainting. 2939-2951 - Chunyu Liu, Yixiao Jin, Zhouyu Guan

, Tingyao Li, Yiming Qin
, Bo Qian, Zehua Jiang, Yilan Wu, Xiangning Wang, Ying Feng Zheng, Dian Zeng:
Visual-language foundation models in medicine. 2953-2972 - Xin Zhao, Yinhuang Chen, Chengzhuan Yang, Lincong Fang:

FuseNet: a multi-modal feature fusion network for 3D shape classification. 2973-2985 - Hao Li, Guoheng Huang, Xiaochen Yuan, Zewen Zheng, Xuhang Chen

, Guo Zhong, Chi-Man Pun:
Psanet: prototype-guided salient attention for few-shot segmentation. 2987-3001
Volume 41, Number 5, March 2025
- Liang Zhang, Shifeng Li, Xi Luo, Xiaoru Liu, Ruixuan Zhang:

Video anomaly detection with both normal and anomaly memory modules. 3003-3015 - Hong Zhao, Wengai Li

, Dailin Huang, Jinhai Huang, Lijun Zhang:
M-GAN: multiattribute learning and multimodal feature fusion-based generative adversarial network for text-to-image synthesis. 3017-3035 - Xunan Tan, Xiang Suo, Wenjun Li, Lei Bi, Fangshu Yao:

Data visualization in healthcare and medicine: a survey. 3037-3058 - Junding Sun, Chenxu Wang, Haifeng Sima, Xiaosheng Wu, Shuihua Wang, Yudong Zhang:

Mfpenet: multistage foreground-perception enhancement network for remote-sensing scene classification. 3059-3076 - R. Varun Prakash, V. Karthikeyan, S. Vishali, M. Karthika

:
Multi-level LSTM framework with hybrid sonic features for human-animal conflict evasion. 3077-3093 - Xintao Liu, Yan Gao, Changqing Zhan, Qiao Wang, Yu Zhang, Yi He, Hongyan Quan:

Directional latent space representation for medical image segmentation. 3095-3107 - Yan Zhou, Haibin Zhou, Yin Yang, Jianxun Li, Richard Irampaye, Dongli Wang, Zhengpeng Zhang:

Lunet: an enhanced upsampling fusion network with efficient self-attention for semantic segmentation. 3109-3128 - Fengling Li, Zheng Yang, Yan Gui:

SES-yolov5: small object graphics detection and visualization applications. 3129-3142 - Xiaoying Chen, Weijie Ye:

Dual representations network for few-shot learning based on local descriptor importance: integrating global and local features. 3143-3154 - Zezheng Tang, Yihua Wu, Xinming Xu:

The study of recognizing ripe strawberries based on the improved YOLOv7-Tiny model. 3155-3171 - Daipeng Yang, Bo Peng, Xi Wu:

A bio-inspired edge and segment detection method by modeling multiple visual regions. 3173-3188 - Jianjun Zhu, Huihuang Zhao, Yudong Zhang:

Filter-deform attention GAN: constructing human motion videos from few images. 3189-3204 - Mingjian Li

, Younhyun Jung, Shaoli Song, Jinman Kim:
Attention-driven visual emphasis for medical volumetric image visualization. 3205-3219 - Jun Wang, Honghui Cao

, Chenhao Sun, Ziqing Huang, Yonghua Zhang:
Motion perception-driven multimodal self-supervised video object segmentation. 3221-3238 - Gang Chen, Wenju Wang, Haoran Zhou

, Xiaolin Wang:
EGCT: enhanced graph convolutional transformer for 3D point cloud representation learning. 3239-3261 - Haojie Gao, Peishun Liu, Xiaolong Ma, Zikang Yan, Ningning Ma, Wenqiang Liu, Xuefang Wang, Ruichun Tang:

TP-LSM: visual temporal pyramidal time modeling network to multi-label action detection in image-based AI. 3263-3281 - Guowei Zhang

, Wuzhi Li, Yutong Tang, Shuixuan Chen, Li Wang:
Lightweight CNN-ViT with cross-module representational constraint for express parcel detection. 3283-3295 - Jianglei Ye, Yigang Wang, Fengmao Xie, Qin Wang, Xiaoling Gu, Zizhao Wu

:
Slot-VTON: subject-driven diffusion-based virtual try-on with slot attention. 3297-3308 - Xingquan Cai, Haoyu Zhang, LiZhe Chen, YiJie Wu, Haiyan Sun:

3D human pose estimation using spatiotemporal hypergraphs and its public benchmark on opera videos. 3309-3327 - Zhiyuan Li, Xin Jin, Qian Jiang, Puming Wang, Shin-Jye Lee, Shaowen Yao, Wei Zhou:

Crafting imperceptible and transferable adversarial examples: leveraging conditional residual generator and wavelet transforms to deceive deepfake detection. 3329-3344 - Wan-He Kai, Kai-Xin Xing:

Video-driven musical composition using large language model with memory-augmented state space. 3345-3357 - Wenzhe Shi, Ziqi Hu, Hao Chen, Hengjia Zhang, Jiale Yang, Li Li:

Orhlr-net: one-stage residual learning network for joint single-image specular highlight detection and removal. 3359-3370 - Xu Liu, Tong Zhou, Chong Wang, Yuping Wang, Yuanxin Wang, Qinjingwen Cao, Weizhi Du, Yonghuan Yang, Junjun He, Yu Qiao, Yiqing Shen:

Toward the unification of generative and discriminative visual foundation model: a survey. 3371-3412 - Yaping Deng, Yingjiang Li, Zibo Wei, Keying Li:

GLDC: combining global and local consistency of multibranch depth completion. 3413-3422 - Weifeng Cao, Xiaoyan Lei

, Jun Shi, Wanyong Liang, Jie Liu, Zongfei Bai:
HASN: hybrid attention separable network for efficient image super-resolution. 3423-3435 - Sunhan Xu, Jinhua Wang, Ning He, Guangmei Xu, Geng Zhang:

Optimizing underwater image enhancement: integrating semi-supervised learning and multi-scale aggregated attention. 3437-3455 - Yazhuo Fan, Jianhua Song, Lei Yuan, Yunlin Jia:

HCT-Unet: multi-target medical image segmentation via a hybrid CNN-transformer Unet incorporating multi-axis gated multi-layer perceptron. 3457-3472 - Muhammad Fahad, Tao Zhang, Yasir Iqbal, Azaz Ikram, Fazeela Siddiqui

, Bin Younas Abdullah, Malik Muhammad Nauman, Xin Zhao, Yanzhang Geng:
Advanced deepfake detection with enhanced Resnet-18 and multilayer CNN max pooling. 3473-3486 - Jiajun Yang, Xuesong Zhang, Cunli Song:

Research on a small target object detection method for aerial photography based on improved YOLOv7. 3487-3501 - Pengbo Bo, Qingxiang Liu, Caiming Zhang:

Topological structure extraction for computing surface-surface intersection curves. 3503-3518 - Wenji Yang, Hang An, Wenchao Hu, Xinxin Ma, Liping Xie:

Text-guided floral image generation based on lightweight deep attention feature fusion GAN. 3519-3535 - Ali Salar, Ali Ahmadi:

Enhancing high-vocabulary image annotation with a novel attention-based pooling. 3537-3551 - Yiting Wu, Pinqi Fang, Xiangning Wang, Jie Shen:

Predicting pancreatic diseases from fundus images using deep learning. 3553-3564 - Shunzhou Wang, Yao Lu, Wang Xia, Peiqi Xia, Ziqi Wang, Wei Gao:

Light field angular super-resolution by view-specific queries. 3565-3580 - Xiaohu Wang, Xin Yang, Hengrui Li, Tao Li:

FDDCC-VSR: a lightweight video super-resolution network based on deformable 3D convolution and cheap convolution. 3581-3593 - Minsoo Choi, Christos Mousas

, Nicoletta Adamo, Sanjeevani Patankar, Klay Hauser, Fangzheng Zhao, Richard E. Mayer:
ASAP: animation system for agent-based presentations. 3595-3610 - Dinghao Guo

, Dali Chen, Xin Lin, Zheng Xue, Wei Zheng, Xianling Li:
Semi-supervised image semantic segmentation method with semantic regions patching and uncertainty-guided loss. 3611-3626 - YaTing Liu, ChengDong Lan, Wanjian Feng:

DLKN: enhanced lightweight image super-resolution with dynamic large kernel network. 3627-3644 - Andrea Bodonyi, István Csoba, Roland Kunkli:

Real-time ray transfer for lens flare rendering using sparse polynomials. 3645-3662 - Shijie Li, Shanhua Yao, Zhonggen Wang, Juan Wu:

FFCANet: a frequency channel fusion coordinate attention mechanism network for lane detection. 3663-3678
Volume 41, Number 6, April 2025
- Zhaijuan Ding, Yanyu Liu, Sen Liu, Kangjian He, Dongming Zhou:

$\hbox {KD}^{3}$mt: knowledge distillation-driven dynamic mixer transformer for medical image fusion. 3679-3693 - Lin Wang, Jie Li, Chun Qi, Fengping Wang, Pan Wang:

Progressive Crowd Enhancement De-Background Network for crowd counting. 3695-3717 - Baoan Li, Long Zhang, Shangzhi Teng, Xueqiang Lyu:

Attribute correlation mask fusion network for pedestrian attribute recognition. 3719-3734 - Yasmin M. Alsakar, Nehal A. Sakr, Shaker H. Ali El-Sappagh, Tamer Abuhmed, Mohammed Elmogy:

Underwater image restoration and enhancement: a comprehensive review of recent trends, challenges, and applications. 3735-3783 - Xiaopan Li, Shiqian Wu, Xin Yuan, Shoulie Xie, Sos S. Agaian:

Hierarchical wavelet-guided diffusion model for single image deblurring. 3785-3800 - Yawen Xiang, Heng Zhou, Chengyang Li, Fangwei Sun, Zhongbo Li, Yongqiang Xie:

Deep learning in motion deblurring: current status, benchmarks and future prospects. 3801-3827 - Yunxi Chen, Yuanjie Cao, Fei Fang, Jin Huang, Xinrong Hu, Ruhan He, Junjie Zhang:

SACANet: end-to-end self-attention-based network for 3D clothing animation. 3829-3842 - Yuanjie Dang, Jiangyun Chen, Peng Chen, Nan Gao, Ruohong Huan, Dongdong Zhao:

Generate anomalies from normal: a partial pseudo-anomaly augmented approach for video anomaly detection. 3843-3852 - Qian Wan, Bin Zhou, Yanjiang Wang:

BSCGAN: structured minority class image generation under class-balanced pretraining. 3853-3865 - Shize Wang, Gang Wu

, Jin Wang, Qing Zhu, Yunhui Shi, Baocai Yin:
SBC-Net: semantic-guided brightness curve estimation network for low-light image enhancement. 3867-3882 - Xinzhe Xie, Buyu Guo, Peiliang Li, Shuangyan He, Sangjun Zhou:

SwinMFF: toward high-fidelity end-to-end multi-focus image fusion via swin transformer-based network. 3883-3906 - Zitao Gao, Xiangjian Liu, Anna K. Wang, Liyu Lin:

A simulated two-stream network via multilevel distillation of reviewed features and decoupled logits for video action recognition. 3907-3923 - Ronghui Feng, Yuefei Wang, Jiajing Xue, Yuquan Xu

, Yutong Zhang, Xi Yu:
CLAC-Net: a composite medical image segmentation framework using self-attention and cross-layer asymmetric connections. 3925-3955 - Guowen Yue

, Ge Jiao
, Chen Li
, Jiahao Xiang
:
When CNN meet with ViT: decision-level feature fusion for camouflaged object detection. 3957-3972 - Shuo Yang, Xiaoling Gu, Zhenzhong Kuang, Feiwei Qin, Zizhao Wu:

Innovative AI techniques for photorealistic 3D clothed human reconstruction from monocular images or videos: a survey. 3973-4000 - Chen Li, Weiqi Yan, Hongwei Zhao, Shihua Zhou, Yueping Wang:

TFFD-Net: an effective two-stage mixed feature fusion and detail recovery dehazing network. 4001-4016 - Kailin Liu, Yonghong Hou, Zihui Guo, Wenjie Yin, Yi Ren:

Visual context learning based on cross-modal knowledge for continuous sign language recognition. 4017-4031 - Qiang Cen, QiGuang Zhu, YuXin Wang, Weidong Chen, Shuo Liu

:
YOLOv9-YX: lightweight algorithm for underwater target detection. 4033-4045 - Le-Anh Tran

, Dong-Chul Park:
Lightweight image dehazing networks based on soft knowledge distillation. 4047-4066 - Haiyuan Cao

, Deng Chen
, Yanduo Zhang, Huabing Zhou, Dawei Wen, Congcong Cao:
MFINet: a multi-scale feature interaction network for point cloud registration. 4067-4079 - Libo Sun, Jiahui Yan, Yongchun Qiu, Wenhu Qin:

The crowd cooperation approach for formation maintenance and collision avoidance using multi-agent deep reinforcement learning. 4081-4095 - Guowei An

, Yaonan Wang, Kai Zeng
, Qing Zhu, Xiaofang Yuan:
Deep spatial and discriminative feature enhancement network for stereo matching. 4097-4110 - Qiyang Liu, Yun Ge, Sijia Wang, Ting Wang, Jinlong Xu:

Dynamic manifold-based sample selection in contrastive learning for remote sensing image retrieval. 4111-4127 - Ziwei Zeng, Lihong Li, Zoufei Zhao, Qingqing Liu:

Improved fine-grained image classification in few-shot learning based on channel-spatial attention and grouped bilinear convolution. 4129-4141 - Yiqian Huang, Shuqi Liu, Fei Dong, Xu Li, Xin Yang, Ya Zhou, Jinxiang Huang

, Yong Song:
PL-MCT: pseudo-labeling and multi-frame consistency training for semi-supervised visual tracking. 4143-4156 - Yong Zhang, Qingguo Shan, Wenyun Chen, Wenzhe Liu:

EEG emotion recognition approach using multi-scale convolution and feature fusion. 4157-4169 - Guowei Zhang

, Weidong Zhang, Wuzhi Li, Li Wang, Huankang Cui:
A dynamic attention mechanism for object detection in road or strip environments. 4171-4181 - Youjie Zhou, Runyu Jiao, Zhonghan Tao, Xichang Liang, Yi Wan:

Spatial-frequency attention-based optical and scene flow with cross-modal knowledge distillation. 4183-4198 - Pham Thanh Huu, Nguyen Thai An, Nguyen Ngoc Trung, Huynh Ngoc Thien, Nguyen Sy Duc, Nguyen Thi Ty:

Judicial decision prediction using an integrated attention based bidirectional long-short term memory and dilated skip residual convolution neural network. 4199-4220 - Xinbiao Lu, Gaofan Zhan, Wen Wu, Wentao Zhang, Xiaolong Wu, Changjiang Han:

Van-DETR: enhanced real-time object detection with vanillanet and advanced feature fusion. 4221-4238 - Chenchen Xu

, Kaixin Han, Weiwei Xu
:
Image-aware layout generation with user constraints for poster design. 4239-4252 - Zhen Huang, Yongjian Zhu, Qiao Zhang

, Hongyan Zang, Tengfei Lei:
Exploration, fusion, and refinement: a multivariate features interaction network for visual camouflaged detection. 4253-4267 - Yongbo Yu, Weidong Li, Linyan Bai, Jinlong Duan, Xuehai Zhang:

UTDM: a universal transformer-based diffusion model for multi-weather-degraded images restoration. 4269-4285 - Liping Zhu, Haibo Zhou, Silin Wu, Tianrong Cheng, Hongjun Sun:

Polynomial for real-time rendering of neural radiance fields. 4287-4300 - Yong Zhang, Da Liu, Li Jiang, Huibing Wang, Wenzhe Liu:

Feature decomposition and structural learning for multi-diverse and multi-view data clustering. 4301-4320 - Pengjie Liu

, Yanzhan Chen, Fan Yu
, Qian Zhang
:
Mastering adverse weather: a two-stage approach for robust semantic segmentation in autonomous driving. 4321-4346 - Yuqi Xiao, Yongjun Wu:

A dual-channel correlation filtering tracker for real-time tracking based on deep features of improved CaffeNet and integrated manual features. 4347-4361 - Dejin Zhao, Yunjie Ma, Xiaolong Yuan, Tong Tong, Dechao Wang, Rui Sun, Lili Cheng, Jianhai Zhang:

SME: Spatial multi-scale enhanced attention for automated detection of micro-defect on automobile complex paint surfaces. 4363-4376 - Yuanhong Zhong, Ting Chen, Daidi Zhong, Xiaoming Liu:

Wavelet-guided network with fine-grained feature extraction for vessel segmentation. 4377-4392 - Ling-Xiao Qin, Hong-Mei Sun, Xiao-Meng Duan, Cheng-Yue Che, Rui-Sheng Jia:

Correction: Adaptive learning-enhanced lightweight network for real-time vehicle density estimation. 4393-4394
Volume 41, Number 7, May 2025
- Long Zhang, QingHua Zhou, Shuai Tang, Yunxiang Chen:

High-definition multi-scale voice-driven facial animation: enhancing lip-sync clarity and image detail. 4395-4403 - Qiaohong Chen, Shufan Xie, Xian Fang

, Qi Sun:
CTHFNet: contrastive translation and hierarchical fusion network for text-video-audio sentiment analysis. 4405-4418 - Xuanpeng Li, Hengshuo Cao, Jinming Li, Guangyu Li, Lin Zhao:

A shoreline extraction method based on dual-loop network framework. 4419-4430 - Viktor Leonhardt

, Alexander Wiebel
, Christoph Garth
:
A framework for visual comparison of scalar fields with uncertainty. 4431-4448 - Ye Liu, Lei Zhu, Liang Wan, Xing Wang:

Masked frequency-color fusion network for video instance-level hazy lane detection. 4449-4461 - Jibing Peng, Yaohua Yi, Ying Zhou:

DPDTRN: a dynamic pixel-level difficulty-aware texture reconstruction network for document super-resolution. 4463-4480 - Huangyuan Wu, Bin Li, Lianfang Tian, Chao Dong:

DDFA: a displacement and diffusion-based feature augmentation method for imbalanced image recognition. 4481-4495 - Yunfei Qiu, Shuai Jiao, Qingtang Su:

Enhancing color image watermarking via fast quaternion Schur decomposition: a high-quality blind approach. 4497-4515 - Rui Sun, Xiaolu Yu, Huidong Feng, Fei Wang, Xudong Zhang:

Motion-robust mask face presentation attack detection via dual-stream texture-rPPG network. 4517-4532 - Zhiwen Shao, Yifan Cheng, Yong Zhou, Xiang Xiang, Jian Li, Bing Liu, Dit-Yan Yeung:

High-level LoRA and hierarchical fusion for enhanced micro-expression recognition. 4533-4546 - Kesai Wang, Xifan Yao, Nanfeng Ma, Guangjun Ran:

PLMOT-SLAM: a point-line features fusion SLAM system with moving object tracking. 4547-4565 - Ping Lu, Youcheng Cai, Jiale Yang, Dong Wang, Tingting Wu:

Uanet: uncertainty-aware cost volume aggregation-based multi-view stereo for 3D reconstruction. 4567-4580 - Zhengyan Liu, Huiwen Wang, Lihong Wang, Shanshan Wang:

Locality-constrained double-layer structure scaled simplex multi-view subspace clustering. 4581-4601 - Tianxiang Huo

, Zhenqi Liu, Shichao Zhang, Jiening Wu, Rui Yuan, Shukai Duan, Lidan Wang:
CDNet: object detection based on cross-level aggregation and deformable attention for UAV aerial images. 4603-4621 - Krishnendu Maity

, Susanta Mukhopadhyay
:
LPSIS: a lossless secret image sharing scheme based on Legendre polynomials with low-cost reconstruction. 4623-4637 - Yuesong Tian

, Li Shen, Xiang Tian, Dacheng Tao
, Zhifeng Li, Wei Liu, Yaowu Chen:
DGL-GAN: discriminator-guided GAN compression. 4639-4660 - Javed Aymat Husen Shaikh

, Shailendrakumar M. Mukane, Santosh Nagnath Randive:
Lightweight progressive recurrent network for video de-hazing in adverse weather conditions. 4661-4672 - Jinchang Zhu, Dayang Sun, Yu Cheng, Hailong Wang, Yujing Chen, Yaowei Chen:

GaitHF: enhancing appearance-based gait recognition through height fused images. 4673-4686 - Wanjun Zhong, Haohao Hu, Yuerong Wang, Li Li, Tianyu Han, Chunyong Li, Peng Zan:

Hierarchical evidence aggregation in two dimensions for active water surface object detection. 4687-4702 - Julien Thomas, Boyu Kuang

, Yizhong Wang
, Stuart Barnes
, Karl Jenkins
:
Advanced semantic segmentation of aircraft main components based on transfer learning and data-driven approach. 4703-4722 - Hongfei Li, Xueyang Li:

Dim and small objects detection in aerial images with stacked attention mechanism and improved loss function. 4723-4739 - Yanliang Ge, Junchao Ren, Cong Zhang, Min He, Hongbo Bi, Qiao Zhang

:
Feature-aware and iterative refinement network for camouflaged object detection. 4741-4758 - Mohamad Haniff Junos, Anis Salwa Mohd Khairuddin

:
YOLO-MMS for aerial object detection model based on hybrid feature extractor and improved multi-scale prediction. 4759-4778 - Sardor Mamarasulov, Lianggangxu Chen, Changgu Chen, Yang Li, Changbo Wang:

Data augmentation with attention framework for robust deepfake detection. 4779-4798 - Jian Ni, Zheng Wang

, Yixiao Wang, Wenjian Tao, Ao Shen:
DRCL: rethinking jigsaw puzzles for unsupervised medical image segmentation. 4799-4813 - Huanshuo Zhang, Guobiao Ren:

Intelligent leaf disease diagnosis: image algorithms using Swin Transformer and federated learning. 4815-4838 - Václav Skala:

A new fully projective O(log N) point-in-convex polygon algorithm: a new strategy. 4839-4850 - Jianuo Wang, Huawei Li, Yumin Chen

:
Seg-invRender: fusing semantic segmentation based on NeRF for inverse rendering considering shadows. 4851-4864 - Wuzhen Shi, Aixue Yin, Yingxiang Li, Bo Qian:

Cross-view Transformer for enhanced multi-view 3D reconstruction. 4865-4877 - Jiaxing Yu, Zheng Chen, Jingkai Wang

, Linghe Kong
, Jiajie Yan, Wei Gu:
Enhancing Image Super-Resolution with Dual Compression Transformer. 4879-4892 - Saleha Masood, Mousa Ahmad Al Bashrawi, Muhammad Attique Khan, Anam Nazir:

Exploring ChatGPT applications in healthcare: a comprehensive overview. 4893-4914 - Yaqi Sun, Xiaolan Xie, Zhi Li, Huihuang Zhao:

Image style transfer with saliency constrained and SIFT feature fusion. 4915-4930 - Zean Jin, Yulong Bai, Wei Song, Qinghe Yu, Xiaoxin Yue:

EduCodeVR: VR for programming teaching through simulated farm and traffic. 4931-4955 - Zeyu Cai, Ziyu Zhang, Chengqian Jin, Feipeng Da:

DMDC: a cross-attention network for dynamic mask-based dual-camera snapshot hyperspectral Photography. 4957-4974 - Baokai Zu, Tong Cao, Yafang Li, Jianqiang Li, Hongyuan Wang, Quanzeng Wang:

RESwinT: enhanced pollen image classification with parallel window transformer and coordinate attention. 4975-4990 - Yaqian Li, Xin Zhan, Haibin Li, Wenming Zhang:

Selection and guidance: high-dimensional identity consistency preservation for face inpainting. 4991-5003 - Yang Yang, Changming Zhu:

Deep multi-view clustering based on global hybrid alignment with cross-contrastive learning. 5005-5017 - Tiago Madeira

, Miguel Oliveira
, Paulo Dias
:
Reflection-aware 3D mirror segmentation and pose estimation. 5019-5028 - Tao Shi, Yao Ding, Kui-feng Zhu, Yan-jie Su:

DFP-YOLO: a lightweight machine tool workpiece defect detection algorithm based on computer vision. 5029-5041 - Congying An, Jingjing Wu, Huanlong Zhang:

Occlusion-aware segmentation via RCF-Pix2Pix generative network. 5043-5057 - Zidi Cao, Jiayi Han, Sipeng Yang, Xiaogang Jin:

Fast best viewpoint selection with geometry-enhanced multiple views and cross-modal distillation. 5075-5086 - Hongru Wang, Hu Cheng, Jingtao Zhang:

Faster-PGYOLO: an efficient framework for floating debris detection in inland waters. 5087-5104 - Yanchen Liu, Changming Zhu:

DMVMLC-VT: Deep incomplete multi-view multi-label image classification with view translation and pseudo-label enhancement. 5105-5121 - Miao Yang, Meng Yang, Weiliang Meng, Ping Li, Zhen Li:

Msc-Net: multi-stage colorization network for real-world images with specular highlights. 5123-5134 - Kexuan Wang, Chenhua Liu, Rongfu Zhang:

CMA-SOD: cross-modal attention fusion network for RGB-D salient object detection. 5135-5151 - Yanliang Ge, Taichuan Liang, Junchao Ren, Jiaxue Chen, Hongbo Bi:

Enhanced salient object detection in remote sensing images via dual-stream semantic interactive network. 5153-5169 - Jianguo Ning, Lei Zhang, Xiangzhao Xu:

Virtual simulation for the dynamic response of concrete blocks under blast loading. 5171-5187 - Shue Liu, Siwei Zhao, Yiying Wang, Jiaming Xin, Dashe Li:

An enhanced underwater fish segmentation method in complex scenes using Swin transformer with cross-scale feature fusion. 5189-5203 - Zewei Zhao, Xiaotie Ma, Yingjie Shi, Xiaotong Yang:

Multi-scale defect detection for plaid fabrics using scale sequence feature fusion and triple encoding. 5205-5221
Volume 41, Number 8, June 2025
- Shiyun Zhang, Xing Deng, Haijian Shao, Yingtao Jiang:

ImpRes: implicit residual diffusion models for image super-resolution. 5223-5233 - Imen Labiadh, Larbi Boubchir, Hassene Seddik:

Optimization of 2D and 3D facial recognition through the fusion of CBAM AlexNet and ResNeXt models. 5235-5250 - He Yu

, Kang Yan, Jiexi Chen, Xuan Li, Jinming Guo, Xiaoxue Xing, Tao Huang
:
Study on the methods of hyperspectral image saliency detection based on MBCNN. 5251-5266 - Yanxiang Li, Wenzhe Meng, Dehua Ma, Siping Xu, Xiaoliang Zhu:

MCGFF-Net: a multi-scale context-aware and global feature fusion network for enhanced polyp and skin lesion segmentation. 5267-5282 - Yusong Li, Bin Xie, Yuling Li, Jiahao Zhang:

Multi-scale local regional attention fusion using visual transformers for fine-grained image classification. 5283-5298 - Yongpeng Zhao

, Guangyuan Zhang, Kefeng Li, Zhenfang Zhu, Xiaotong Li, Yongshuo Zhang, Zhiming Fan:
MFADU-Net: an enhanced DoubleU-Net with multi-level feature fusion and atrous decoder for medical image segmentation. 5299-5309 - Meichen Lu, Yi Chai, Kaixiong Xu, Weiqing Chen, Fei Ao, Wen Ji:

Multimodal fusion and knowledge distillation for improved anomaly detection. 5311-5322 - Jihua Peng, Yanghong Zhou, P. Y. Mok:

EHFusion: an efficient heterogeneous fusion model for group-based 3D human pose estimation. 5323-5345 - Xizhuo Yu, Chaojie Fan, Jiandong Pan, Guoliang Xiang, Chunyang Chen, Tianjian Yu, Yong Peng, Hanwen Deng:

X-ray security inspection for real-world rail transit hubs: a wide-ranging dataset and detection model with incremental learning block. 5347-5359 - Junli Shen, Yuman Hai

, Chongyu Lin:
CT-UFormer: an improved hybrid decoder for image segmentation. 5361-5371 - Yufang Yang, Yining Xie

, Jun Cao, Kaihua Yang:
Attention-guided dual feature extraction approach for small target detection in infrared images. 5373-5389 - Honglin Wu, Xinyu Yu, Zhaobin Zeng:

SSBFNet: a spectral-spatial fusion with BiFormer network for hyperspectral image classification. 5391-5404 - Fangfang Liang, Zilong Huang, Wenjian Wang, Zhenxue He, Qing En:

Dynamic text prompt joint multimodal features for accurate plant disease image captioning. 5405-5419 - Wei Cao

, Xin Chen, Jianping Lv, Liang Shao, Weixin Si:
Semi-supervised intracranial aneurysm segmentation via reliable weight selection. 5421-5433 - Wei-Jong Yang, Li-Yang Ho:

CSA-Lanenet: a contiguous spatial attention lane detection network with vision transformer modules. 5435-5445 - Simin Yan, Shuchang Xu, Aiping Lei, Sanyuan Zhang:

Advancing neural aesthetic assessment of artistic images based on bundle features integration. 5447-5459 - Donghui Wang, Jinhua Wang, Ning He, Jingzun Zhang, Sen Zhang, Shuai Liu:

Enhancing unsupervised shadow removal via multi-intensity shadow generation and diffusion modeling. 5461-5476 - Yunfei Lu, Chenxia Chang, Song Gao, Shaowen Yao, Ahmed Zahir:

Boosting adversarial example detection via local histogram equalization and spectral feature analysis. 5477-5494 - Canlin Li, Haowen Su, Xin Tan, Lihua Bi, Xiangfei Zhang, Lizhuang Ma:

Innovative collaborative multi-lookup table for real-time enhancement of low-light images. 5495-5515 - Zhao Liangjun, Yinqing Wang, Yueming Hu, Hui Dai, Xi Yubin, Feng Ning, He Zhongliang, Gang Liang, Yuanyang Zhang:

An image fusion algorithm based on image clustering theory. 5517-5537 - Jie Yin, Tao Sun, Guorong Zhang, Yuhao Wu, Xiao Zhang:

Deformation-aware image restoration from atmospheric turbulence based on quasiconformal geometry and pulse-coupled neural network. 5539-5562 - Hongwei Wei, Qi Li, Jie Pan, Junmei Chen, Yizhuo Zhang, Lizhuang Qi, Ying Zhou:

SPSNet: semantic-guided perspective shift network for robust person re-identification in drone imagery. 5563-5582 - Shuai Su, Chengju Liu, Qijun Chen:

Universally describing keypoints from a semi-global to local perspective, without any specific training. 5583-5596 - Yan Liu, Wenting Qi, Jingwen Wang

, Yanqiu Xiao, Guangzhen Cui, Li Han:
An efficient defogging network for RAW image sequences with high viewpoint. 5597-5608 - Yiyuan Ge

, Mingxin Yu, Zhihao Chen, Wenshuai Lu, Yuxiang Dai, Huiyu Shi:
Attention-enhanced controllable disentanglement for cloth-changing person re-identification. 5609-5624 - Maocheng Bai, Xiaosheng Yu

, Ying Wang, Jubo Chen
, Xiaofeng Zhang, Pengfei Lyu
:
Enhancing pixel-level analysis in medical imaging through visual instruction tuning: introducing PLAMi. 5625-5641 - Wei Liu, Cong Wang, Yongkang Zhang:

Industrial surface defect detection by multi-scale Inpainting-GAN. 5643-5660 - Yanzheng He, Pengjun Wang, Xiaochun Guan, Han Li:

Enhancing 3D Human Moiton Prediction with MSIGCN: A Novel Approach to Addressing Sensor Noise and State Accuracy. 5661-5674 - Saba Ghazanfar Ali, Xiangning Wang, Lei Bi, Younhyun Jung, Tingli Chen, Haifang Zhang:

Deep learning-based binocular system for automated diabetic retinopathy grading with prior clinical knowledge integration. 5675-5688 - Xuefeng Zhang, Bin Yan, Zhaohu Xing, Feng Gao, Yuandong Tao, Zhenyan Han, Weiming Wang

, Lei Zhu:
HADiff: hierarchy aggregated diffusion model for pathology image segmentation. 5689-5700 - Zhaobin Chang, Xiong Gao, Dongyi Kong, Na Li, Yonggang Lu:

Multi-prototype collaborative perception enhancement network for few-shot semantic segmentation. 5701-5718 - Kunyu Yan, Wenbin Zheng, Yujie Yang:

Lightweight weed detection using re-parameterized partial convolution and collection-distribution feature fusion. 5719-5731 - Xin Zhang, Degang Yang

, Tingting Song, Yichen Ye, Yingze Song, Jie Zhou, Jie Chen:
A lightweight object detector based on changeable-size lightweight convolution and context augmentation module for images captured by UAVs. 5733-5749 - Cuiyun Lin

, Chengxue Lao
, Tianrun Jing
, Wenxiao Wang
:
Predicting game ownership dynamics: a novel POAFD-trend analysis approach. 5751-5767 - Jiaze He, Jian Xiao, Yuanjie Cao, Jing He, Siyu Li, Jin Huang, Ruhan He, Jianlin Zhu

:
Region-assisted line drawing colorization through diffusion model. 5769-5780 - Jinsong Zhang, Yu-Kun Lai, Jingyu Yang, Kun Li:

PISE-V: person image and video synthesis with decoupled GAN. 5781-5798 - Zheyuan Wang, Ziyao Meng

, Yiming Qin:
MSPAN: lightweight image super-resolution with multi-semantic guidance. 5799-5814 - Zehao Cao, Zongji Wang, Yuanben Zhang, Cheng Jin, Weinan Cai, Zhihong Zeng, Junyi Liu:

Enhancing 3D Gaussian splatting for low-quality images: semantically guided training and unsupervised quality assessment. 5815-5833 - Liangjun Zhao, Xi Yubin, Yinqing Wang, Feng Ning, He Zhongliang, Gang Liang, Yuanyang Zhang:

MADNet: cropland change detection network for the complex terrain and dense vegetation hilly region in the Southwestern China. 5835-5854 - Qiaohong Chen

, Zhenyang Xu, Xian Fang
:
CaVMamba: convolution-augmented VMamba for medical image segmentation. 5855-5872 - Runlong Cao, Jianqi Zhang, Yun Shen, Huanhuan Zhou, Peiying Zhou, Guowei Shen, Zhengwen Xia, Ying Zang, Qingshan Liu, Wenjun Hu:

Dual-flow feature enhancement network for robust anomaly detection in stainless steel pipe welding. 5873-5889 - Yiming Chen

, Yihang Liu, Gizem Kayar-Ceylan
:
CSG-based ML-supported 3D translation of sketches into game assets for game designers. 5891-5903 - Yuanchuan Duan, Peng Wang, Yan Huang, Yuxin Hang, Qi Sun, Haibo Shao, Jinzhu Yang:

Optimizing semi-supervised medical image segmentation with imbalanced filtering and nnU-Net enhancement. 5905-5917 - Pengfei Zhao, Jianhua Ji, Yang Wen, Wuzhen Shi, Wenming Cao:

Dual prior guided depth image super-resolution with multi-scale transformer fusion network. 5919-5933 - Yaguang Lu, Yong Hu, Huiyan Feng, Pengshuai Duan, Xukun Shen:

Generating reconstructable collaborative virtual environments via graph matching for mixed reality remote collaboration. 5935-5947 - Yingjie Fan, Bin Wen, Hongfei Deng:

MRA-Net: an instance segmentation method based on multi-scale feature fusion for ethnic costumes images. 5949-5960 - Zhangmeng Chen, Ju Dai, Junjun Pan, Feng Zhou:

Diffusion model with temporal constraint for 3D human pose estimation. 5961-5977 - Zhenmin Yao, Qianqian Hu:

Accelerated local progressive-iterative approximation methods for curve and surface fitting. 5979-5993 - Ahmet Agaoglu

, Nezih Topaloglu:
Dynamic region of interest generation for maritime horizon line detection using time series analysis. 5995-6009 - Hu Wang, Hong-Mei Sun, Wen-Long Zhang, Yu-Xiang Chen, Rui-Sheng Jia:

FANN: a novel frame attention neural network for student engagement recognition in facial video. 6011-6025 - Tongtong Liu

, Chen Yang, Guoqiang Chen, Wenhui Li:
Open-vocabulary multi-label classification with visual and textual features fusion. 6027-6039 - Shang Ma, Xiaoying Nie, Gang Yang

, Chunqing Zhou:
A robust and efficient model for the interaction of fluids with deformable solids. 6041-6054 - Guoyou Zhang

, Zhixiang Hao
, Lihu Pan
, Wei Guo, Jiaxin Zuo, Xuenan Zhang:
MeshBLS: mesh-based broad learning 3D object classification network. 6055-6065 - Yajuan Zhang, Yongquan Liang, Junjie Wang, Houying Zhu

, Zhihui Wang:
Enhanced multi-object tracking via embedded graph matching and differentiable Sinkhorn assignment: addressing challenges in occlusion and varying object appearances. 6067-6085 - Xiao Li, Kai Wu, Haoran Chen

, Wenjun Song, Hongwei Tao
, Zuhe Li
, Yanan Du:
Deep residual PLSR model with manifold optimization and Gaussian filter for enhanced image classification. 6087-6102 - Hongzhi Li, Zhanghao Ren, Guoqing Zhu, Yaoju Liang, Han Cui, Chaozeyu Wang, Jiaxi Wang:

Enhancing medical image segmentation with MA-UNet: a multi-scale attention framework. 6103-6120 - Jianbing Xu, Jiangxin Zhou, Dongxu Xu, Yu Chen:

Local dual-branch attention feature learning framework from UAVs for visual defect detection. 6121-6132 - Zhanqiang Huo, Xiyan Zhan, Yingxu Qiao, Shan Zhao:

D3-Dehaze: a divide-and-conquer framework for enhanced single image dehazing. 6133-6148 - Jingya Shi, Dezhi Han, Chongqing Chen, Xiang Shen

:
SAFFNet: self-attention based on Fourier frequency domain filter network for visual question answering. 6149-6167 - Xiaodong Wang, Jiangtao Fan, Fei Yan, Hongmin Hu, Zhiqiang Zeng, Haiyan Huang:

Unsupervised fur anomaly detection with B-spline noise-guided Multi-directional Feature Aggregation. 6169-6185 - Tang Xu, Wenbin Wang

, Alin Zhong:
HOIEdit: Human-object interaction editing with text-to-image diffusion model. 6187-6199 - Xiangyang Wang, Kun Yang, Qiang Ding, Rui Wang, Jinhua Sun:

Tic action recognition for children tic disorder with end-to-end video semi-supervised learning. 6201-6217 - Elmira Bagheri, Amir Hossein Barshooi:

Nighttime driver behavior prediction using taillight signal recognition via CNN-SVM classifier. 6219-6235 - Yanmei Li, Tao Yu, Jian Luo, Xiaoshuang Li, Jingshi Deng, Qibin Yang:

JLEDNet: a nighttime UAV tracking method through joint low-light image enhancement using hybrid attention transformer and denoising. 6237-6249 - V. Karthikeyan

, S. Praveen, S. Sudeep Nandan:
Lightweight deep hybrid CNN with attention mechanism for enhanced underwater image restoration. 6251-6269 - Qian Ye

, Qingwu Li, Guanying Huo, Yan Liu, Yan Zhou:
Boundary-guided multi-scale refinement network for camouflaged object detection. 6271-6297 - Qiuquan Zhao, Jianyuan Li:

SPS-UNet: a super-pixel sampling UNet for extracting buildings from high-resolution satellite images. 6299-6312 - Enze Yang, Yuxin Liu, Shitao Zhao, Yiran Liu, Shuoyan Liu:

Learn from restoration: exploiting task-oriented knowledge distillation in self-supervised person re-identification. 6313-6326 - Satoshi Nishimura

:
Correction: Grid-induced bounding volume hierarchy for ray tracing dynamic scenes. 6329
Volume 41, Number 9, July 2025
- Nadia Magnenat-Thalmann:

Editorial issue July 2025. 6331-6333 - Wonjun Lee

:
Multilevel Monte Carlo for asymptotically efficient path tracing. 6335-6348 - Teng Zhang, Bo Yang, Jianlin Zhu

, Xincheng Hu:
Scene-Enhanced Social Interpretable Movement Behavior for Multimodal Pedestrian Trajectory Prediction. 6349-6361 - Naoki Kita:

StencilQR: connectivity-enhanced fabricable QR codes for stencil. 6363-6374 - Yi Jiang, Yiqian Wu, Hao Xu, Xiwen Shi, Xiaogang Jin:

Geometry guidance diffusion image morphing with large shape difference. 6375-6386 - Yanping Fu, Yuting Zhang, Dengdi Sun, Shaojie Zhang, Haifeng Zhao:

Single image shadow removal using 2D signed distance field. 6387-6399 - Xiaonan Fang, Muhan Chang:

Video sketching using multi-domain guidance and implicit encoding. 6401-6412 - Wenguang Chen, Dong Xiao, Renjie Chen:

Bijective spherical parameterization via stereographic projection. 6413-6424 - Haipeng Wang:

Submodular-based view selection for low-quality points rendering with multi-feature point-based NeRF. 6425-6437 - Shihao Zheng, Huisi Wu, Zhijian Gao, Ping Li:

Few-shot medical image segmentation via query transformation learning. 6439-6452 - Yuan-Hao Jiang, Kezong Tang, Zi-Wei Chen, Yuang Wei, Tian-Yi Liu, Jiayi Wu:

MAS-KCL: knowledge component graph structure learning with large language model-based agentic workflow. 6453-6464 - Xiaojiao Guo, Shenghong Luo, Yihang Dong, Zexiao Liang, Zimeng Li, Xiujun Zhang, Xuhang Chen

:
An asymmetric calibrated transformer network for underwater image restoration. 6465-6477 - Renjie Zhang, Xin Wang, George Baciu, Ping Li:

Distilling complementary information from temporal context for enhancing human appearance in human-specific NeRF. 6479-6491 - Feiwei Qin, Liangzhe Zhu, Zijian Xu, Meie Fang, Ping Li:

CADGCL: unsupervised retrieval of CAD models via boundary representations. 6493-6505 - Jie Zhao, Ju Dai, Feng Zhou, Junjun Pan, Hongwen Xu:

Dual-path spatio-temporal Mamba for skeleton-based action recognition. 6507-6519 - Shu Liu, Yilin Huang, Hongyun Yu, Yan Xu:

AMNet: an attention-enhanced multi-branch network for micro-expression recognition. 6521-6532 - Yun Pei

, Lingbo Liu, Runqing Jiang, Ye Zhang, Pengpeng Yu
, Liang Lin, Yulan Guo:
Energy-guided test-time adaptation for data shifts in multi-modal perception. 6533-6546 - Cheng Fang, Siyan Zhu, Junjun Pan:

Enhanced material point method with affine projection stabilizer for efficient hyperelastic simulations. 6547-6560 - Pengpei Hong, Chuhua Xian, Hongmin Cai, Jiazhou Chen, Guiqing Li:

Batch Specular Manifold Sampling for caustics rendering. 6561-6569 - Yuhang Yi, Yan Gui, Zhuo Liu:

Boosting memory network for video object segmentation in complex scenes. 6571-6585 - Yuval Onn, Haggai Maron, Ayellet Tal:

Attention-guided self-supervised distinctive region detection in point clouds. 6587-6600 - Qingzheng Wang, Ning Li, Jiazhi Xie, Wenhui Liu, Xingqin Wang, Zengwei Mai:

Unified cross-domain refinement network for camouflaged object detection. 6601-6615 - Runqiao Li, Qiujie Dong, Shuangmin Chen:

RevolRecon: Neural Representation for Reconstructing Surface of Revolution. 6617-6629 - Kai Yang, Wenhao Zhang, Ping Li, Jinxing Liang, Tao Peng, Jia Chen, Li Li, Xinrong Hu, Junping Liu:

ViT-BF: vision transformer with border-aware features for visual tracking. 6631-6644 - Xijun Wang, Xin Zhou, Yi Wang, Songto Zeng, Xinyu Liu, Haobo Shen, Xianying Wang, Ping Li, Lei Zhu:

RainRWKV: a deep RWKV model for video deraining. 6645-6656 - Sen Peng, Yihang Fu, Runjie Miu, Tianyi Lv, Baorong Yang, Xiao Dong:

GenericAvatar: generic human modeling from monocular video based on mesh-guided Gaussians. 6657-6670 - Shengjun Liu, Ting Zhang, Ruoxi Deng, Xinru Liu, Hanchao Liu:

Physics-guided deep learning framework with attention for image denoising. 6671-6685 - Qiuyue Zhang, Zhiwang Zhang, Shiting Wen, Chaoyi Pang, Fangyu Wu:

Boosting remote semantic segmentation using vision-and-language foundation model. 6687-6700 - Yixiao Feng, Weihua Tong, Zhangjin Huang:

High-quality neural surface reconstruction from unoriented point clouds via multilevel tensor product B-spline hash encoding and viscosity regularization. 6701-6714 - Jian Lin, Chengze Li, Xueting Liu, Zhongping Ge:

Instance-guided anime editing with a curated large-scale dataset. 6715-6727 - Baofeng Zhou, Xianyong Fang, Linbo Wang, Zhengyi Liu:

SemanticAvatar: human surface reconstruction based on semantically consistent biplane features. 6729-6743 - Muyang Zhang, Weiliang Meng, Mingda Jia, Jiaming Gu, Yihua Shao, Changwei Wang

, Rongtao Xu, Zhihao Ma, Xiaopeng Zhang:
PDFT: parameter-diminish fine-tuning for transformer-based models. 6745-6755 - Taoqi Bao, Jiangnan Ye, Zhankong Bao, Chee Siang Leow, Haoji Hu, Jianfeng Lu, Issei Fujishiro, Jiayi Xu:

L2H-NeRF: low- to high-frequency-guided NeRF for 3D reconstruction with a few input scenes. 6757-6768 - Taishi Ito, Yuki Endo, Yoshihiro Kanamori:

Selfage: personalized facial age transformation using self-reference images. 6769-6781 - Jianning Chi, Mingyang Sun, Zelan Li, Geng Lin, Ying Huang:

Adaptive box-level supervision with superpixel shape guidance for ultrasound image segmentation. 6783-6794 - J. Antony, Minu Reghunath, Safeer Babu Thayyil, M. Ramanathan:

ConDT: A 2D curve reconstruction algorithm based on a constrained neighbor proximity graph. 6795-6807 - Yiyi Wang, Jia Su, Song Zhang, Eisei Nakahara:

RaEUNet: a retentive and efficient UNet for medical image segmentation. 6809-6821 - Zizhao Peng, Zihan Wang, Mengying Sun, Zheng Lv, Yan Wang, Ping Li, Fengwei An:

Graph convolutional networks for 3D skeleton-based scoliosis screening using gait sequences. 6823-6835 - Min Shi, Guo-Liang Zhao, Shi-sheng Guo, Bi-lian Sun, Dengming Zhu, Xiu-juan Chai, Zhao-Xin Li, Xinru Zhuo:

Generating 3D fish motion skeleton via iterative optimization method and FishSkeletonNet. 6837-6849 - Peng Yu

, Zhiyang Ji, Aimin Hao, Yang Gao:
Real-time immersive haptic sculpting with elastoplastic virtual clay. 6851-6864 - Enxu Zhao, Jianchi Sun, Fei Luo, Chunxia Xiao:

EE-Head: emotion estimation for precise facial expression in NeRF head avatars. 6865-6878 - Linling Jiang, Xin Wang, Fan Zhang

, Caiming Zhang:
Transforming time and space: efficient video super-resolution with hybrid attention and deformable transformers. 6879-6890 - Huibiao Wen, Lei Wang, Shuang-Min Chen, Shiqing Xin, Chongyang Deng, Ying He, Wenping Wang, Changhe Tu:

ImS: implicit shell for the sandwich-walled space surrounding polygonal meshes. 6891-6904 - Tsukasa Fukusato, Akinobu Maejima, Takeo Igarashi:

Locality-Preserving Free-Form Deformation. 6905-6915 - Jiawei Xu

, Qiangqiang Zhou, Jiacong Yu, Chen Liao, Dandan Zhu:
Semantic-Orthogonal Multi-modal Attention Network for RGB-D Salient Object Detection. 6917-6929 - Yunlong Liao

, Yiting Lin
, Zheng Xing, Xiaochen Yuan:
Privacy Image Secrecy Scheme Based on Chaos-Driven Fractal Sorting Matrix and Fibonacci Q-Matrix. 6931-6941 - Ruiling Li, Ming Gao, Xiaogang Jin:

Recognize Me If You Can: Two-stream Adversarial Transfer for Facial Privacy Protection using Fine-grained Makeup. 6943-6954 - Minjae Seo, Inhyung Jung, Jinhoon Choi, Kyoungju Park:

PhysAvatar: physically plausible avatar generation from sparse tracking. 6955-6967 - Ruhao Wang, Yu Jiang, Huizhi Zhu, Fei Luo, Chunxia Xiao:

HumanIR-MGI: human inverse rendering via jointly optimizing geometry, material, and illumination. 6969-6982 - Bingchen Yang, Haiyong Jiang, Zhengda Lu, Jun Xiao:

Exploring Structural Lines for Interior Floorplan Segmentation. 6983-6997 - Haibo Wang, Qinsong Li, Ling Hu, Haojun Xu

, Jing Meng, Xinru Liu, Yu-Kun Lai, Shengjun Liu:
TriAlign: revisiting deep functional map from map representation alignment perspectives. 6999-7012
Volume 41, Number 10, August 2025
- Mengyao Liu, Ruhan Liu, Jia Shu, Qirong Liu, Yuan Zhang, Lixin Jiang:

AutoDDH: A dual-attention multi-task network for grading developmental dysplasia of the hip in ultrasound images. 7013-7025 - Lakshita Agarwal, Bindu Verma:

Enriching image description generation through multi-modal fusion of VGG16, scene graphs and BiGRU. 7027-7047 - Main Uddin, Zhangjie Fu, Xiang Zhang:

Deepfake face detection via multi-level discrete wavelet transform and vision transformer. 7049-7061 - Mengnan Hu, Qianli Zhou, Rong Wang:

Bridging visible and infrared modalities: a dual-level joint align network for person re-identification. 7063-7078 - Hao Liu, Ye Liu, Shuanglong Yao, Tongshuai Yu, Ke Gao, Pengcheng Hao, Shuqing He, Ji Chen, Xing Wang:

ISTFormer: lightweight transformer for enhanced super-resolution of coal rock images via iterative feature extraction. 7079-7092 - Zhehang Qiu, Huijuan Zhang

, Jie Zhou, Jianming Zhan:
Image restoration for both deblurring and dehazing based on multi-channel frequency information using deep neural network. 7093-7108 - Xi Li, Yulong Feng, Xianguo Yu, Yirui Cong, Lili Chen:

Epipolar constraint-guided differentiable keypoint detection and description. 7109-7121 - Wei Pan

, Zhe Yang
:
A lightweight enhanced YOLOv8 algorithm for detecting small objects in UAV aerial photography. 7123-7139 - Sung-Wook Park

, Se-Hoon Jung
, Chun-Bo Sim
:
NeXtSRGAN: enhancing super-resolution GAN with ConvNeXt discriminator for superior realism. 7141-7167 - Yuyan Liu, Qing Zhang, Yilin Zhao, Yanjiao Shi:

A dual-stream learning framework for weakly supervised salient object detection with multi-strategy integration. 7169-7184 - Guoquan Jiang, Canyu Wang, Zhanqiang Huo, Huan Xu:

Multi-channel correlated diffusion for text-driven artistic style transfer. 7185-7199 - Lihua Yang, Jinxian Zhao, Ziming Wang, Yuheng Liu, Dazhao Chi

:
M-KANUNet: enhanced defect segmentation in X-ray images of copper pipe welds via multi-scale representation and Kolmogorov-Arnold Networks. 7201-7214 - Xingyue Zou, Jiqiang Tang:

Guided fusion of infrared and visible images using gradient-based attentive generative adversarial networks. 7215-7232 - Lei Dai, Wen Gao, Chengyu Tang, Min Wang, Zhihua Chen:

MTMFNet: multi-threshold and multi-scale feature fusion network for text detection. 7233-7248 - Huaiguang Cai, Yang Yang, Yongqiang Tang, Zhengya Sun, Wensheng Zhang:

Shapley value-based class activation mapping for improved explainability in neural networks. 7249-7267 - Wei Song, Yaobin Huang:

Adaptive feature recalibration transformer for enhancing few-shot image classification. 7269-7283 - Jialin Zhang

, Xiao Wang, Hui Wei, Kui Jiang, Nan Mu, Zheng Wang:
Context-aware target texture perturbation attack for concealed object detection. 7285-7302 - Qida Cao, Jiajun Ding, Zhenyang Liu, Zhenzhong Kuang, Yijie Shao, Yilan Shen:

VC-GS: view-consistent deblurring Gaussian splatting via alternating branch optimization. 7303-7317 - Fuqiang Gou, Yonglong Li, Yanpian Mao, Chunyao Hou, Gang Wan, Jialong Li, Haoran Wang, Yongcan Chen:

Planar tunnel point cloud fine registration under multiple constraints. 7319-7340 - Haitian Ren, Quinten Kwok, Meng Sun, Xuyan Huang, Jianlin Zhu

, Haoxuan Li:
Toward artificial general intelligence in health care. 7341-7350 - Chen-Bin Feng

, Qi Lai, Kangdao Liu, Houcheng Su, Hao Chen, Kaixi Luo, Chi-Man Vong:
Learning few-shot semantic segmentation with error-filtered segment anything model. 7351-7365 - Peng Zhang, Yuming Yan, Yuangao Ai, Benhong Wang, Houming Shen, Zhonghan Peng:

Unet-based image segmentation and binarization for water level detection. 7367-7377 - Manuel Silva

, Antonio Seoane
, Omar A. Mures
, Antonio M. López
, José Antonio Iglesias Guitián:
Exploring the effects of synthetic data generation: a case study on autonomous driving for semantic segmentation. 7379-7397 - Ronggui Wang, Hong Chen, Juan Yang, Lixia Xue:

Adaptive sparse triple convolutional attention for enhanced visual question answering. 7399-7415 - Die Yu, Zhaoyan Fang, Yong Jiang:

Alleviating category confusion in fine-grained visual classification. 7417-7432 - Haomiao Liu, Hao Xu

, Chuhuai Yue, Bo Ma:
Adaptive objectness learning for enhanced unknown object detection. 7433-7446 - Xinbiao Lu, Yisen Chen, Yudan Chen, Xing Gao, Tieliu Yang, Guiyun Chen:

STIG-Net: a spatial-temporal interactive graph framework for recognizing violent behaviors in videos. 7447-7458 - Keqi Li, Yaping Wan, Gang Zou, Wangxiu Li, Jian Yang, Changyi Xie:

Enhancing facial action unit recognition through topological feature integration and relational learning. 7459-7475 - Yuenan Wang

, Hua Wang
, Fan Zhang
:
Mask autoencoder for enhanced image reconstruction with position coding offset and combined masking. 7477-7491 - Haowei Zhu

, Suqin Bai, Jinlong Shi, Jiawen Lu, Xin Zuo, Shucheng Huang, Xu Yao
:
Ellipsoid-SLAM: enhancing dynamic scene understanding through ellipsoidal object representation and trajectory tracking. 7493-7508 - Daikun Qu

, Hongwei Zhao, Mingzhu Zhou:
Unsupervised video object segmentation with mask transformer: boosting accuracy and efficiency through feature fusion. 7509-7520 - Cheng Zhong, Xiaomin Yu, Huan Xia, Rongdong Xie, Qingyi Xu:

Restoring intricate Miao embroidery patterns: a GAN-based U-Net with spatial-channel attention. 7521-7533 - Jinyang Wang

, Jihong Wang, Haoxuan Li, Xiaojun Huang, Jun Xia, Zhen Li, Weibing Wu, Bin Sheng:
Temporal goal-aware transformer assisted visual reinforcement learning for virtual table tennis agent. 7535-7549 - Junchi Ma, Yuanqing Wang, Guangmiao Ding, Wei Cao, Xiangyun Liao, Ping Zhang, Jianping Lv:

Mamba-enhanced hierarchical attention network for precise visualization of hippocampus and amygdala. 7551-7565 - Yuhao Zhang, Jiaqi Tong, Honglin Liu:

SCAP: enhancing image captioning through lightweight feature sifting and hierarchical decoding. 7567-7584 - Yan Zhang, Xueting Sang, Yemei Sun, Shudong Liu, Shengpei Zhou:

DMTNet: dual-domain adaptive multi-scale feature fusion network with transformer for small target detection. 7585-7601 - Xiaochun Wu, Ning Guo:

MGSLU-Net: a lightweight network for efficient detection of water leakage in subway tunnel linings. 7603-7616 - Kehao Chen, Zhiping Zhou, Kewei Li, Taoyong Su, Zhaozhong Zhang, Jinhua Liu, Chenghao Ying:

Red green blue-depth salient object detection based on multi-scale refinement and cross-modalities fusion network. 7617-7640 - Fang Zhou, Tingting Yang, Liuyan Tan, Xiaolong Xu, Mengdao Xing:

DAP-Net: enhancing SAR target recognition with dual-channel attention and polarimetric features. 7641-7656 - Cheng Jiang, Pengle Zhang, Ying Ni, Xiaoli Wang, Hanghang Peng, Sen Liu, Mengdi Fei, Yuxin He, Yaxuan Xiao, Jin Huang, Xingyu Ma, Tian Yang:

Multimodal retrieval-augmented generation for financial documents: image-centric analysis of charts and tables with large language models. 7657-7670 - Zhaozhao Yang, Yuhai Yu, Yongdong Huang, Jiana Meng:

Innovative approaches in image processing: enhancing feature extraction and recognition capabilities. 7671-7685 - Yihao Li, Junyu Liu, Xiaoyu Guan, Hanming Hou, Tianyu Huang:

Introducing anisotropic fields for enhanced diversity in crowd simulation. 7687-7702 - Liming Wan, Lin Song, Ying Zhou, Chenrui Kang, Shijian Zheng, Guo Chen:

Dynamic neighbourhood-enhanced UNet with interwoven fusion for medical image segmentation. 7703-7721 - Haomou Bai, Yue Sang:

Ultra-lightweight convolutional network for efficient single-image super-resolution. 7723-7733 - Sathish Mothe, Srinivas Kankanala:

Multi-stage residual network with two fold attention mechanisms for low-light image enhancement. 7735-7750 - Xie Chengjie, Lu Shuhua, Shi Yangyu, Zheng Diwen:

Joint perturbation consistency across image and feature levels for cross-domain adaptive crowd counting. 7751-7766 - Pengyun Chen, Shuang Cui, Ning Cao, Wenhao Zhang, Pengfei Wang, Shaohui Jin, Mingliang Xu:

Lightweight multi-scale feature fusion with attention guidance for passive non-line-of-sight imaging. 7767-7780 - Shili Wu, Yongkun Guo, Chao Qian, Ying Li, Xinyou Zhang:

Global attention and context encoding for enhanced medical image segmentation. 7781-7798 - Xiang Shijie, Zhou Dong, Tian Dan:

Multi-scale feature fusion network for real-time semantic segmentation of urban street scenes: enhancing detail retention and accuracy. 7799-7815 - Hao Li, Shengkun Wu, Lei Deng, Chenhua Liu, Yifan Chen, Hanrui Chen, Heng Yu, Mingli Dong, Lianqing Zhu:

Enhancing infrared and visible image fusion through multiscale Gaussian total variation and adaptive local entropy. 7817-7838 - Duo Liu, Guoyin Zhang, Yiqi Shi, Ye Tian, Liguo Zhang:

Efficient feature difference-based infrared and visible image fusion for low-light environments. 7839-7854 - Weichen Dai, Hexing Wu, Xiaoyang Weng, Wanzeng Kong:

Implicit guidance for enhancing low-light optical flow estimation via channel attention networks. 7855-7865 - Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Jose M. Martínez:

Layer-wise model merging for unsupervised domain adaptation in segmentation tasks. 7867-7882 - Xinzhi Li, Yong Liu, Peng Yan:

Optimizing feature map matching for marine benthic organism detection. 7883-7907 - Zhen Song, Jianhua Chen:

Adaptive rate compression for distributed video sensing in wireless visual sensor networks. 7909-7923 - Jinxing Liang, Kaifang Han, Dongsheng Li, Ruixin Gao, Jiajia Peng, Tao Peng, Xinrong Hu:

Enhancing low-frequency stitch code generation for knitted fabrics: an LFSCG-E-Net approach. 7925-7938 - JiaHao Wang, Yongqiang Wang, Congling Zhou, Jiawei Huang:

LF-RTMDet: an instance segmentation algorithm for real-time detection of water-filled barriers. 7939-7950 - Xijun Wang, Xin Zhou, Yi Wang, Songto Zeng, Xinyu Liu, Haobo Shen, Song Fei, Lei Zhu:

Msu-mamba: multi-scale defocus blur detection using cross-scale fusion and state-space models. 7951-7963 - Xite Wang, Changsheng Qin, Mei Bai, Qian Ma, Guanyu Li:

CAFormer: a connectivity-aware vision transformer for road extraction from remote sensing images. 7965-7981 - Zhenghao Xie, Junfen Chen, Yingying Wang, Bojun Xie:

Enhanced fine-grained relearning for skeleton-based action recognition. 7983-7995 - Doudou Zhang, Junchi Ma, Jie Chen, Linxia Xiao

, Xiangyun Liao, Yong Zhang, Weixin Si:
MF-SAM: enhancing multi-modal fusion with Mamba in SAM-Med3D for GPi segmentation. 7997-8008 - Wubin Shi, Shaoyan Gai, Feipeng Da, Zeyu Cai, Jiaoling Wang:

GRPoseNet: a generalizable and robust 6D object pose estimation network using sparse RGB views. 8009-8023 - Zongyu Ye, Hongjuan Yan, Yewang Sun, Bin Li, Lei Liu, Wenbo Wu:

MSPNet: real-time semantic segmentation with large kernel and atrous convolutions. 8025-8040 - Zhengwei Guo, Bo Wang:

Enhancing sandstorm images via color-guided spatial-frequency fusion network. 8041-8053 - Yu Pang, Yang Huang, Chenyu Weng, Jialin Lyu, Chuanyue Bai, Xiaosheng Yu:

Enhanced RGB-T saliency detection via thermal-guided multi-stage attention network. 8055-8073 - Xiang Chen, Yuanqi Yao, Zhouyu Guan, Chenyang Li

, Jian Guan, Jun Pu, Ruhan Liu, Bin Sheng, Shankai Yin, Yiming Qin
:
DSTS-GF: a dual-stream temporal-spatial transformer with gated fusion for the classification of Obstructive Sleep Apnea. 8075-8087 - Yuanqi Yao, Zehua Jiang, Zhouyu Guan, Yilun Luxue, Seungmin Lee, Xiang Chen, Haodong Yang, Yiming Qin:

A visual-language foundation model for disease diagnosis and doctor-patient co-decision. 8089-8101 - Shigang Hu, Darong Wu, Jianxin Wang, Shijun Huang:

The image super-resolution network based on dual-branch feature interaction attention mechanism. 8103-8116 - Tao Shi, Yao Ding, Kui-feng Zhu, Yan-jie Su:

Correction: DFP-YOLO: a lightweight machine tool workpiece defect detection algorithm based on computer vision. 8117 - Sung-Wook Park

, Se-Hoon Jung
, Chun-Bo Sim
:
Correction: NeXtSRGAN: enhancing super-resolution GAN with ConvNeXt discriminator for superior realism. 8119
Volume 41, Number 12, September 2025
- Yijie Yang, Jianlin Zhou, Wei Hu, Zhigang Tu:

End-to-end pose-action recognition via implicit pose encoding and multi-scale skeleton modeling. 9337-9353 - Mengnan Hu, Wenjing Zhang, Qianli Zhou, Rong Wang:

Fine-grained text-based person re-identification via interlaced cross-attention and LoRA fine-tuning. 9355-9372 - Xueqing Zhang, Shuo Wang, Fengjuan Feng, Jianlei Liu:

Enhancing Fine-Grained Visual Classification via Curriculum Learning and Global-Local Feature Interaction. 9373-9394 - Yaolin Lei, Kai Jin, Zifeng Qiu, Yiming Sun, Huihui Bai, Wenzhi He:

MPA-Det: multi-path aggregation-based object detection framework for aerial visual computing. 9395-9408 - Jiamei Tang, Chao Dong, MengKun Li:

AdaMaskNet: adaptive multi-scale masked kernels for enhanced sensor-based human activity recognition. 9409-9425 - Haochen Li, Sheng Tang, Zhang Wan, Juan Cao, Jintao Li:

Latent inversion for consistent identity preservation in character animation. 9427-9440 - Alessio Barbaro Chisari, Luca Guarnera, Alessandro Ortis, Wladimiro Carlo Patatu, Sebastiano Battiato, Mario Valerio Giuffrida:

Benchmarking computer vision architectures for cloud detection from lidar ceilometer backscatter data. 9441-9458 - Jiacheng Cao, Liyu Ren, Ao Deng, Feng Yu

, Li Liu, Minghua Jiang:
MHC-Segnet: Mamba-Hadamard collaboration segmentation network for multimodal MRI brain tumor. 9459-9470 - Qiyan Zhao, Lanying Liang, Xiaofeng Zhang, Tiange Zhang, Jiuze Li, Yuefeng Liu:

Few-shot cross-modal text detection via CLIP. 9471-9485 - Dabbrata Das

, Argho Deb Das, Farhan Sadaf
:
Enhanced encoder-decoder architecture for accurate monocular depth estimation. 9487-9508 - Pinqi Fang, Yiting Wu, Yufeng He, Haoxuan Li, Zhouyu Guan, Xiangning Wang, Tingli Chen, Jie Shen:

Research progress on AI-assisted screening and prediction of systemic diseases based on retinal images. 9509-9537 - Yang Wen, Bai Chen, Wuzhen Shi, Daquan Feng, Wenming Cao, Song Wu:

MSPFM: Multi-Scale Pyramid Fusion Mamba for Medical Image Classification. 9539-9554 - Liqi Zhu, Dezhi Han, Xiang Shen

, Chongqing Chen, Kuan-Ching Li:
Enhancing image-text matching through multi-level semantic consistency alignment. 9555-9570 - Jin Huang, Li Liu, Yue Lu, Ching Y. Suen:

Enhancing scene text script identification through multi-task self-supervised learning. 9571-9586 - Yizhi Cong, Haoran Zhu, Longyu Guo, Wei Zhang, Zhongqing Zhang:

Deep channel-spatial attention networks for enhancing super-resolution of high-magnification SEM images. 9587-9600 - Tingyao Li, Zheyuan Wang, Zehua Jiang, Huaiqin Zhong, Yiming Qin:

Generative artificial intelligence for ophthalmic images: developments, applications and challenges. 9601-9627 - Zhan Hu, Juan Zhang, Yongbin Gao, Bo Huang, Zhijun Fang:

Depth-guided color correction and multi-scale Retinex network for underwater image enhancement. 9629-9644 - Mohammad Raihanul Bashar

, Mayra Donaji Barrera Machuca, Wolfgang Stuerzlinger, Anil Ufuk Batmaz:
The effect of visual depth on the vergence-accommodation conflict on 3D selection performance within virtual reality headsets. 9645-9661 - Youyou Lu, Lixia Chen, Xuewen Wang, Xin Xu, Xiaoli Sun:

Foreground detection through feature fusion convolutional neural networks: enhancing robustness against complex backgrounds. 9663-9674 - Haiyu Liu, Shuai Zhang, Keyan Ren, Hu Zhao, Xuhong Li, Zhiyu Nie:

PGLRNet: target pose-guided and feature loss-reduced network for oriented object detection in remote sensing images. 9675-9690 - Qiang Lan

, Haifeng Wu:
Dynamic local affine transformation for enhanced text-to-image generation with GANs. 9691-9704 - Cheng Zhang, Zhuoyue Ding, Xiaoying Jing, Lei Huang, Run Ye, Bin Yan, Xiaojia Zhou, Jinhong Guo:

Lightweight segmentation network for real-time wildfire detection: LSNet's parallel feature multiplication and attentional fusion. 9705-9715 - Ji Cui, Litai Pang, Shiju Zhao, Zhengyuan Peng, Xiaojuan Hu, Lingzhi Zeng, Tao Jiang, Mengchen Liang, Jinlian Huang, Wang Yuan, Xin Tan, Lizhuang Ma, Jiatuo Xu:

Learning Pulse Image with Deep Dynamic Frequency Network for Cardiovascular Diseases Diagnosis. 9717-9735 - Xinhai Li, Kuo-Kun Tseng:

CFF: Chunked Fourier Features Mapping Let NeRF Learn Fine 3D Knowledge. 9737-9747 - Kaibo Zhang, Tong Jia, Weihua Chen:

Enhancing Adversarial Transferability through Dual-Domain Gradient Flatness Optimization. 9749-9763 - Hengtao Wang, Peishun Liu, Quanjie Dou, Yibao Song, Mengqi Luo, Rongjia Han, Boning Zhang:

Enhanced edge detection via Dual-branch attention fusion with Canny-assisted supervision. 9765-9780 - Biao Dong

, Lei Zhang:
Enhanced Temporal Representation and Spatial Alignment for High-Fidelity Talking Video Generation. 9781-9792 - Dezhi Wu, Hui Wang, Yueqiong Ni, Yurun Lu, Yong Wang, Huating Li, Luonan Chen:

Decoding the gut-brain axis: toward AI-driven integration of neuroimaging and gut microbiota in human health. 9793-9804 - Ferhat Tas:

Quaternion-based curves and surfaces for enhanced spatial motion generation using geometric algebra. 9805-9824 - Bo Chen, Chenyu Zhou, Xiaoli Sun:

Enhancing green screen matting with group normalization and perceptual loss for color overflow and complex edges. 9825-9837 - Maria Maqbool, Amna Khan, Mehak Rafiq, Shahzad Rasool:

Enhancing Energy Conservation Behaviors Through Audio, Visual, and Social Cues in Virtual Reality. 9839-9855 - Yansen Huang, Hongji Yang, Jiao Liu, Bo Ren:

IDiff-NeRF: single-view 3D human body reconstruction utilizing identity-based diffusion within implicit neural network framework. 9857-9868 - Yaojie Chen, Chengyu Deng:

Dtm: Density embeded transformer mamba hybird network for point cloud analysis. 9869-9884 - YoungWoo Kim, Sungmin Kwon, Duksu Kim:

RTPD: penetration depth calculation using hardware-accelerated ray-tracing. 9885-9899 - Ruiying Wang, Yong Jiang:

Effective enhancement and fusion of multi-perspective features for self-supervised real image denoising. 9901-9917 - Aolei Yang, Yinghong Zhou, Chenchen Lv, Banghua Yang, Zhonghua Miao, Minrui Fei:

TGST: A transformer-graph framework for enhanced spatiotemporal modeling in 3D human pose estimation. 9919-9932 - Jiang Xin, Xiaonan Fang, Xueling Zhu, Ruyi Dai, Ju Ren, Wenzhen Yue, Yaoxue Zhang:

Privacy-aware Real-Time Target Person Matting in Multi-Person Scenes Using Dual Encoder-Decoder Networks. 9933-9950 - Tao Chen, Qiliang Yang, Yin Chen, Qizhen Zhou:

Multi-scale spatial attention and network enhancement for single-image reflection removal. 9951-9962 - Hao Yin, Ran Yi, Bin Sheng:

Dost: a dual optimization method for text-guided face images style transfer. 9963-9975 - Arundhati Bhowal, Ruchira Naskar, Sarmistha Neogy:

Multi-approach survey and in-depth analysis of image forgery detection techniques. 9977-10035 - Wenji Yang, Xingyang Miao:

Enhancing hand-object interaction pose reconstruction through semantic-enhanced and reconstruction modules. 10037-10054 - Wei Xiao, Jie Chen, Chao Pan, Tao Wang, Lei Jiang:

Adaptive dynamic fusion of multi-modality features for enhanced image representation. 10055-10067 - Huimin Lu, Bingwang Dong, Bingxue Zhu, Songzhe Ma, Zexing Zhang

, Jianzhong Peng, Kaishan Song:
A survey on deep learning-based object detection for crop monitoring: pest, yield, weed, and growth applications. 10069-10094 - Sen Zhang, Jinhua Wang, Ning He, Sunhan Xu, Shuai Liu, Pengcheng Yu, XiaoYue Ma:

Adaptive multi-modal prompting for universal image restoration amidst diverse degradations. 10095-10107 - Saba Ghazanfar Ali, Xiaolong Yang, Saleha Masood, Zainab Ghazanfar, Younhyun Jung, Tingli Chen, Xiangning Wang:

Revolutionizing diabetic retinopathy and macular edema management: a systematic review on the transformative potential of artificial intelligence. 10109-10134 - Ling Li, Wei Wang, Aizeng Wang:

Efficient and adaptive T-spline surface fairing using bilateral filter. 10135-10151 - Fuzheng Zhang

, Xiaoyu Bi, Guina Wang, Guirong Weng, Yiyang Chen:
Anisotropic edge-enhanced active contour model with Gaussian difference for robust multi-category image segmentation. 10153-10170 - Tao Xiang, Jinfu Yang, Shu Cai, Jinglei Bai:

Edge-awareness and feature decoupling enhancement network for camouflaged object detection. 10171-10187 - Yiran Peng, Qingqing Hu

, Jing Xu, Cuiyun Lin, Chenheng Deng, Yiyao Huang
, Wenxiao Wang
, Kintak U:
A robust zero-watermark method based on deep learning and chaotic permutation. 10189-10204 - Guodong Li, Shiren Li, Yaoxue Lin, Sihua Tang, Wenguang Xu, Kangxian Chen, Guangguang Yang:

Cfseg-Net: context feature extraction network for medical image segmentation. 10205-10215 - Murtaza Hanif, Taj Muhammad, Muhammad Junaid Arshad:

An explainable AI-enhanced smart door security system using transfer learning. 10217-10226 - Gaffari Çelik

:
Multi-layer feature fusion for high-accuracy solid waste classification using a hybrid deep learning model. 10227-10249 - Wei Zhu, Hengyi Huang, Longxi Zhu, ChunYang Shao, Ningzhong Liu, Yu Wang:

Self-knowledge distillation through ensemble model averaging: a novel approach for image classification. 10251-10272 - Yuetao Yuan, Shuchang Xu, Junjie Cheng, Shudong Lin:

Hybrid annotation alignment-based multi-region crop model for high-resolution image. 10273-10288 - Cheng Ding, Zhongqiu Zhao, Hao Shen

, Xiufeng Liu:
Adaptive branch selection for accelerate image super-resolution. 10289-10302 - Zhifeng Wang, Renjiao Yi, Xin Wen, Chenyang Zhu, Kai Xu, Kunlun He:

Angio-Diff: learning a self-supervised adversarial diffusion model for angiographic geometry generation. 10303-10315 - Hong Qu, K. P. Chau, Pik-Yin Mok:

Recycling/upcycling graphic design: automatic design elements extraction and vectorization. 10317-10331 - Sanghyeon Lee, Jong Taek Lee:

Occlusion-aware heatmap generation for enhancing 3D human pose estimation in multi-person environments. 10333-10345 - Mansour Tchenegnon, Sylvie Gibet, Thibaut Le Naour:

MoCoSys: human motion correction based on deep learning coupled with 3D+t Laplacian motion representation. 10347-10362 - Huixian Lin, Haidong Deng, Hong Du, Yaohong Liu, Junhua Xu:

Low visibility underwater biological target detection based on the improved YOLOV5s. 10363-10376 - Zhengquan Piao, Fuyong Feng, Ruina Dang, Wenzheng Wang, Shichao Zhou, Yuqi Han:

Enhancing few-shot object detection through mixing and separating tuning strategies. 10377-10394 - Noor Ahmed, Xin Tan, Lizhuang Ma:

D2U-Net: a dual-path hybrid UNet architecture for precise medical image segmentation. 10395-10415 - Jacqueline Zhou, Meng Sun, Zhouyu Guan, Jie Shen, Tingli Chen, Dian Zeng, Jianlin Zhu

, Haoxuan Li:
Artificial intelligence in the management of hypertension: a narrative review. 10417-10431 - Song Zhang, Rui Zhang, Jian Li, Xuefeng Li:

Procedural generation of lens soiling data via physics-based simulation. 10433-10449 - Wenjing Gao, Xi Li, Chang Liu, JiaoJiao Wang, Dingguo Yu:

Disentangled text-driven stylization of 3D faces via directional CLIP losses. 10451-10466 - Hanrui Jiang, Tuo Zang, Wenwen Zhang, Siyu Li, Wenqiang Wang, Lingfeng Liu:

Dynamic multi-stream graph neural networks for efficient interactive action recognition. 10467-10480 - Ickbum Kim, Sandeep K. Singh:

3D point cloud denoising via Gaussian processes regression. 10481-10496 - Teng Zhang, Bo Yang, Jianlin Zhu

, Xincheng Hu:
PTMP: predefined trajectories for multimodal pedestrian trajectory prediction. 10497-10510 - Dapeng Lang

, Zheng Zhou, Hongyi Hao, Zechao Liu, Jinjie Huang, Deyun Chen:
AdvShadow: camouflaged adversarial attacks via conditional diffusion model-generated shadows. 10511-10528 - Jichun Wang, Guodong Yi, Shuyou Zhang, Yang Wang, Zili Wang, Xuewei Zhang, Zheyuan Zhou, Jinghua Xu:

Dense point-wise line voting for robust 6D Pose estimation in industrial bin-picking. 10529-10540
Volume 41, Number 13, October 2025
- Yulin Wang, Zheng Liang, Zetian Mi, Jiqing Zhang, Xianping Fu, Yujia Wang:

Dark Channel Low-Rank Prior for Enhanced Single Underwater Image Restoration. 10541-10557 - Ning Zhou, Jiawei Cao:

Logical reasoning-enhanced interactive clustering: an efficient algorithm for large-scale datasets. 10559-10576 - Shiyao Zhou, Mengyao Wang, Zhiyong Huang, Yuqin He, Xiao Han, Yunlan Zhao, Zhiyu Zhao:

Attention-guided fusion of transformers and CNNs for enhanced medical image segmentation. 10577-10597 - Jinxia Yu, Jie Wu, Yongli Tang:

A lightweight adaptive feature selection network for enhanced small object detection in UAV imagery. 10599-10615 - Jinyang Wang, Xuequan Lu, Haoxuan Li, Xiaojun Huang, Jun Xia:

Enhanced ping pong training assessment via VR: integrating time-spatial alignment and multi-modal fusion. 10617-10633 - Yan Wang, Mingwen Shao, Chao Wang, Kai Xu, Xiaolin Lu:

Interacthand: Robust 3D Hand Mesh Reconstruction via Interaction-Aware Segmentation and Refinement. 10635-10647 - Kai Chen, Zhihua Chen, Lei Dai, Zhe Wang, Xin Chen:

Hybrid Mamba-Transformer Multi-Agent Reinforcement Learning for scalable coordination in complex environments. 10649-10661 - Muhammad Nadeem Cheema, Lei Zhang, Anam Nazir, Yiran Li, John A. Detre, Ze Wang:

Transformer-based arterial spin labeling perfusion MRI denoising. 10663-10673 - Jinyang Wang, Xuequan Lu, Haoxuan Li, Xiaojun Huang, Jun Xia:

$\hbox {D}^3$M-GS: Dynamic endoscopy reconstruction via dual-domain deformation model. 10675-10689 - Yuanhong Wei

, Yuefei Wang
:
TlED-Net: optimizing semantic segmentation via triple-loop encoder-decoder architecture with dense skip connections. 10691-10722 - Xin Li, Kai Zhang, Qing Yuan, Xiafeng Shen, Rui Zhu, Yue Kai:

High-accuracy real-time mouth recognition and 3D positioning for autonomous feeding robots using YOLO and binocular vision. 10723-10737 - Yu Cheng, Bo Li, Penghao Jia:

MDF-Net: multilayer feature dynamic interactive fusion network for few-shot fine-grained image classification. 10739-10750 - Huilin Liu, Xinyue Wen, Xinwei Ye, Wenkang Zhang:

Progressive dual-branch transformer-based diffusion model: a novel approach for robust 2D human pose estimation. 10751-10766 - Xiaolong He, Feipeng Da:

Expression-driven monocular 3D face reconstruction based on cross-modal guidance. 10767-10787 - Shaohui Jin, Peng Zheng, Guangpeng Li, Huimin Wang, Manman Zhang, Wenhao Zhang, Hao Liu

:
Enhanced passive non-line-of-sight imaging via multi-scale polarization-guided diffusion model. 10789-10804 - Bing Wang, Zhihong Wei, Mengyi Ju, Zutong Zhao, Shiyin Zhang:

Efficient hierarchical multiscale convolutional attention for accurate medical image segmentation. 10805-10826 - Xue-bo Jin, Jiaxi Li, Hui-Jun Ma, Tingli Su, Jianlei Kong, Yuting Bai:

Cross-modal feature fusion via mutual assistance: a novel network for enhanced object detection. 10827-10840 - Bui Thanh Hung

, Huy Vo Quoc:
MTFIC: enhanced fashion image captioning via multi-transformer architecture with contrastive and bidirectional encodings. 10841-10855 - Shen-Bin Li, Rui-Sheng Jia:

Lightweight self-supervised anomaly detection via feature space synthesis for industrial applications. 10857-10872 - Yunhao Chi, Ling Li, Jinxiu Wang,



Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID