


default search action
Computer Vision and Image Understanding, Volume 262
Volume 262, 2025
- Ximing Li, Xiaoguang Di

, Maozhen Liu, Shaoxun Ye
:
Feature-aligned distillation for dense object detection via refined semantic guidance and distribution consistency. 104519 - Yongting Hu, Yuanhong Zhong, Jinkai Li, Xin Wang:

Learning multiscale residual prototypes and global-local correspondence for video anomaly detection. 104524 - Biyun Xu, Yan Zheng

, Suleman Mazhar
, Zhenghua Huang:
D2PCFN: Dual domain progressive cross-fusion network for remote sensing image pansharpening. 104525 - Chaoqun Ma, Rongsheng Cui, Feng Liu

, Chunli Cai:
SDC-Net: A novel selective dilated convolution network for medical images segmentation. 104526 - Agnieszka Anna Tomaka, Leszek Luchowski, Michal Tarnawski, Dariusz Pojda

:
Computer-aided design of personalized occlusal positioning splints using multimodal 3D data. 104527 - Tim J. Schoonbeek

, Shao-Hsuan Hung, Dan Lehman, Hans Onvlee, Jacek Kustra, Peter H. N. de With, Fons van der Sommen:
Learning to recognize correctly completed procedure steps in egocentric assembly videos through spatio-temporal modeling. 104528 - Alexandre Lopes

, Roberto Souza
, Hélio Pedrini:
CCNeXt: An effective self-supervised stereo depth estimation approach. 104529 - Bin Yu, Wei Li, Chen Zhang, Wenjie Mao

, Yu Xie
:
Dynamic deep multi-label image data augmentation based on self-paced learning. 104530 - Tiantian Wang

, Xinxin Zuo
, Fangzhou Mu
, Jian Wang
, Ming-Hsuan Yang
:
Towards 4D human video stylization. 104532 - Xiaolong Zhang, Haonan Miao

, Peizheng Zhao, Yuqi Sun, Fang Nan, Saberi Morteza, Yaqiang Wu, Feng Tian
:
Student gaze target estimation based on depth transformation on dual-view classroom images. 104533 - Hania Ghouse

, Muzammil Behzad:
MOSAIC: A multi-view 2.5D organ slice selector with cross-attentional reasoning for anatomically-aware CT localization in medical organ segmentation. 104522 - Diogo Lavado

, Alessandra Micheletti, Giovanni Bocchi
, Patrizio Frosini, Cláudia Soares:
SCENE-Net: Geometric induction for interpretable and low-resource 3D pole detection with Group-Equivariant Non-Expansive Operators. 104531 - Chongchong Mao, Yongsheng Dong, Lintao Zheng

, Ziang Jiao:
Gated-enhanced attention addition network for indoor RGB-D semantic segmentation. 104534 - Eric L. Wisotzky, Jost Triller, Michael Knoke, Brigitta Globke, Anna Hilsmann, Peter Eisert:

Real-time fusion of stereo vision and hyperspectral imaging for objective decision support during surgery. 104541 - Gaby Maroun, Salah Eddine Bekhouche, Jinan Charafeddine, Fadi Dornaika:

Integrating ConvNeXt and vision transformers for enhancing facial age estimation. 104542 - Cezar Mbiethieu

, Norbert Tsopzé, Engelbert Mephu Nguifo:
XLITE-Unet: Extremely Light and Efficient Deep learning architecture with selective atrous and axial depthwise convolution for image segmentation. 104543 - Jing Huo, Zheng Gu, Jiulin Zhang, Xiangde Liu, Shiyin Jin, Pinzhuo Tian, Wenbin Li, Jing Wu, Yu-Kun Lai, Yang Gao:

REST: A resolution preserving network for photorealistic style transfer via semantic distillation. 104544 - Jiajun Xu, Zixiang Lu

, Ping Gao, Qiguang Miao, Kun Xie:
VMM: Video-Music Mamba for generating background music from videos. 104545 - Wenrun Wang, Jianwu Dang, Yangping Wang, Rui Pan:

RefineHOS: A high-performance hand-object segmentation with fine-grained spatial features. 104548 - Fuming Wang, Wenlong Wang

, Dahua Gao, Xunliang Huang, Xiaodan Song
, Haoyuan Sun, Cheng Peng:
A LLM-guided hybrid Mamba-Transformer architecture for part-to-whole motion synthesis. 104549 - Min Mao, Ge Jiao, Wanhui Gao, Jixun Ye:

MSFENet: Multi-Scale Filter-Enhanced Network architecture for digital image forgery trace localization. 104550 - Jichen Gao

, Suiping Zhou, Hang Yu, Chenyang Li, Xiaoxi Hu
:
SCESS-Net: Semantic consistency enhancement and segment selection network for audio-visual event localization. 104551 - Pengxia Li, Zhonghao Du, Linhui Zhang, Yanyi Lv, Yujie Liu:

Channel-aware feature mining network for Visible-Infrared Person Re-identification. 104552 - Yifan Jiao, Xinran Liu, Xiaoqiong Liu, Xiaohui Yuan, Heng Fan, Libo Zhang:

PlanarTrack: A high-quality and challenging benchmark for large-scale planar object tracking. 104553 - Wenfei Xiong, Huabing Zhou, Yanduo Zhang, Tao Lu, Jiayi Ma:

DiffuseDoc: Document geometric rectification via diffusion model. 104554 - Omar Ikne

, Benjamin Allaert, Ioan Marius Bilasco, Hazem Wannous:
eMotion-GAN: A motion-based GAN for photorealistic and facial expression preserving frontal view synthesis. 104555 - Haoke Yin, Changdong Yu, Chengshang Wu, Kexin Dai, Junfeng Shi

, Yifan Xu, Yuan Zhu
:
Swin Transformer-based maritime objects instance segmentation with dual attention and multi-scale fusion. 104556 - Qian Li, Shen Yang, Peixuan Wu, Jin Wu:

HADF: A hybrid attention and dual-branch feature fusion method for infrared and visible image fusion. 104557 - Simona Correra, Francesco Mercaldo, Vittoria Nardone, Giulia Varriano

, Dalila De Lucia, Maria Chiara Brunese, Antonella Santone, Corrado Caiazzo
:
A method for automatic breast density classification in magnetic resonance imaging. 104558 - Matteo Pennisi

, Giovanni Bellitto, Simone Palazzo
, Isaak Kavasidis
, Mubarak Shah, Concetto Spampinato:
DiffExplainer: Towards cross-modal global explanations with diffusion models. 104559 - Douglas J. Townsell, Lingwei Chen, Mimi Xie, Chen Pan, Wen Zhang

:
STARS: Semantics-Aware Text-guided Aerial Image Refinement and Synthesis. 104561 - Shengwen Chen

, Haixing Song, Huiling Feng, Jieyuan Hu, Qiuyi Bai, Fuyun He:
SPSC-Net: Shared parallel space-channel attention mechanism transformer network for cell sequence image segmentation. 104562 - Mohsen Ahmadkhani, Eric Shook:

TopoSegNet: Scalable topology preservation in image segmentation via critical points. 104564 - Zhiping Wang, Peng Yu

, Xuchong Zhang, Hongbin Sun:
BAP-DETR: Efficient drone object detection network based on bipartite attentive processing and dual fusion encoder. 104565 - Simin Chen

, Qinxia Hu, Mingjin Zhu, Qiming Wu
, Xiao Hu:
TOODIB: Task-aligned one-stage object detection with interactions between branches. 104567 - Qida Yu, Rongrong Jiang, Xiaoyan Zhou, Yiru Wang, Guili Xu, Wu Quan:

An efficient direct solution of the perspective-three-point problem. 104568

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














