


Multimedia Systems, Volume 31
Volume 31, Number 1, February 2025
- Yanyan Jiang, Yongping Huang, Haipeng Chen, Yingda Lyu:
Multi-modality boundary-guided network for generalizable image manipulation localization. 1 - Kanglin Wang, Qingxuan Shi, Xiaoyang Li, Enyi Wu, Zifan Li:
Optimizing codebook training through control chart analysis. 2 - Liang Zhang, Shifeng Li, Yan Cheng, Xi Luo, Xiaoru Liu:
Learning dual updatable memory modules for video anomaly detection. 3 - Quy Hoang Nguyen, Minh-Van Truong Nguyen, Kiet Van Nguyen:
New benchmark dataset and fine-grained cross-modal fusion framework for Vietnamese multimodal aspect-category sentiment analysis. 4 - Jinghua Li, Zhuowei Bai, Dehui Kong, Dongpan Chen, Qianxing Li, Baocai Yin:
3d human pose estimation based on conditional dual-branch diffusion. 5 - Hongyan Li, Ziyang Zhang, Zhaoming Hao, Baoqing Xu, Weifeng Wang, Jing Sun:
PAR-mono: monocular video depth estimation network based on channel separation and dynamic attention. 6 - Liyan Xiong, Zhida Li, Xiaohui Huang, Heng Wang:
CSFNet: A novel counting network based on context features and multi-scale information. 7 - Sifeng Zhu, Duan Haowei, Yao Yaxing, Chen Hao, Hai Zhu:
Improved NSGA-II algorithm-based task offloading decision in the internet of vehicles edge computing scenario. 8 - Weina Zhou, Xianglin Gao:
Sat-DehazeGAN: an efficient dehazing model in water-sky background for river-sea transport. 9 - Jiachang Sun, Fuxian Zhu:
Multilayer interactive attention bottleneck transformer for aspect-based multimodal sentiment analysis. 10 - Keyang Cheng, Liutao Wei, Jingfeng Tang, Yongzhao Zhan:
Constraint embedding for prompt tuning in vision-language pre-trained model. 11 - Qian Zhang, Shasha Li, Mingwen Shao:
DS-Diff: a dual-stage network with degradation-aware and semantic-aware for adverse weather removal based on diffusion models. 12 - Junxian Wu, Yujia Zhang, Michael Kampffmeyer, Yi Pan, Chenyu Zhang, Shiying Sun, Hui Chang, Xiaoguang Zhao:
HierGAT: hierarchical spatial-temporal network with graph and transformer for video HOI detection. 13 - Fuqun Zhao, He Huang, Wenxiang Hu:
An optimized hierarchical point cloud registration algorithm. 14 - Wei Liu, Yurong Zheng, Zhihui Xiang, Yingmeng Wang, Zhao Tian, Wei She:
An efficient federated learning method based on enhanced classification-GAN for medical image classification. 15 - Junyi Wang, Zexin Guo, Dewei Yi, Yining Hua, Qinggang Meng:
Enhanced multi-branch learning for long-tailed image recognition. 16 - Wenhui Lian, Xinwu Liu, Yue Chen:
Non-convex fractional-order TV model for image inpainting. 17 - Lingyao Jia, Bingbing Zhang, Peihua Li:
Stochastic stylization transformer with self-supervision for iris recognition. 18 - Yunbo Gu, Qianyu Wu, Junting Zou, Baosheng Li, Xiaoli Mai, Yudong Zhang, Yang Chen:
Multi-modal clear cell renal cell carcinoma grading with the segment anything model. 19 - Haibo Zhang, Xizhi Wang, Haoran Sun, Yiwei Sun, Yanan Jin, Ruoxue Li, Guohua Geng:
CR-DM: A novel craniofacial reconstruction framework based on diffusion model. 20 - Yu Cao, Ran Ma, KaiFan Zhao, Ping An:
WFIL-NET: image inpainting based on wavelet downsampling and frequency integrated learning module. 21 - Anqi Shi, Xin Shu, Dan Xu, Fang Wang:
GCMR-Net: A Global Context-Enhanced Multi-scale Residual Network for medical image segmentation. 22 - Yuantao Wang, Yuanyao Lu, Yongsheng Qiu:
Gated image-adaptive network for driving-scene object detection under nighttime conditions. 23 - Huilin Wang, Huaming Qian:
SR-DAYOLOv8: cross-domain adaptive object detection based on super-resolution domain classifier. 24 - Zhongxu Li, Qihan He, Lingfei Ren, Wenyong Yao, Wenyuan Yang:
PCAF: UAV scenarios detector via pyramid converge-and-assign fusion network. 25 - Zhouwang Zheng, Weiwei Yu:
RG-YOLO: multi-scale feature learning for underwater target detection. 26 - Qin Guo, Xiangchao Feng, Peng Xue, Shoujun Sun, Xiangrong Li:
Dual-domain multi-scale feature extraction for image dehazing. 27 - Xiaonuo Dongye, Dongdong Weng, Haiyan Jiang, Zeyu Tian, Yihua Bao, Pukun Chen:
Personalized decision-making for agents in face-to-face interaction in virtual reality. 28 - Weijian Hu, Yinyin Xu, Ke Han, Lingfang Li, Jiang Wang:
TSPLNet: a three-stage progressive lightweight network for shadow removal. 29 - Yumeng Zhang, Kaixing Fan, Ying Yu:
Research on passengers behavior recognition method in public transport vehicles based on efficient 3D CNN. 30 - Chen He, Shenshen Li, Zheng Wang, Hua Chen, Fumin Shen, Xing Xu:
Chatting with interactive memory for text-based person retrieval. 31 - Guanxiao Li, Ke Zhang, Yu Su, Jingyu Wang:
Aggregating multi-scale flow-enhanced information in transformer for video inpainting. 32 - Md Shamim Hossain, Shamima Aktar, Weiyong Liu, Naijie Gu, Zhangjin Huang:
IGINet: integrating geometric information to enhance inter-modal interaction for fine-grained image captioning. 33 - Jixin Liu, Sufang Yao, Haigen Yang, Ning Sun:
Detection of typical abnormal behavior in home-based elderly care based on ViT-iECGAN significant information migration compensation. 34 - Rui Liu, Sicong Zhang, Yang Xu, Weida Xu, Xinlong He:
High-resolution network-based multi-feature fusion for generalized forgery detection. 35 - Penghao Li, Huanjie Tao, Hui Zhou, Ping Zhou, Yishi Deng:
Enhanced Multiview attention network with random interpolation resize for few-shot surface defect detection. 36 - Jindong Ma, Haitao Zhang:
HDR-DANet: single HDR image reconstruction via dual attention. 37 - Wu Zeng, Zhengying Xiao:
Enhancing long-tailed classification via multi-strategy weighted experts with hybrid distillation. 38 - Jian Zheng, Shumiao Ren, Jingyue Zhang, Shiyan Wang, Lin Li:
Binary classification for imbalanced data using data conformity mechanism. 39 - Feng Hou, Yao Zhang, Yang Liu, Jin Yuan, Cheng Zhong, Yang Zhang, Zhongchao Shi, Jianping Fan, Zhiqiang He:
Gradient-aware domain-invariant learning for domain generalization. 40 - Xiangchun Yu, Huofa Liu, Dingwen Zhang, Miaomiao Liang, Lingjuan Yu, Jian Zheng:
Ground truth is the best teacher: supervised semantic segmentation inspired by knowledge transfer mechanisms. 41 - Jinlong Qu, Qi Li, Jie Pan, Mingzheng Sun, Xingzheng Lu, Ying Zhou, Hongliang Zhu:
SS-YOLOv8: small-size object detection algorithm based on improved YOLOv8 for UAV imagery. 42 - Xue Li, Chunhua Zhu, Fei Zhou, Huawei Tao:
Facial expression recognition via joint loss constraining attention-modulated contextual spatial information network. 43 - Jiawei Ding, Zhiyi Tan, Guanming Lu, Jinsheng Wei:
Adaptive discriminant feature learning for GNN-based session recommendation. 44 - Modafar Al-Shouha, Gábor Szücs:
ReDiT: re-evaluating large visual question answering model confidence by defining input scenario difficulty and applying temperature mapping. 45 - Cunjuan Zhu, Yanyi Zhang, Qi Jia, Weimin Wang, Yu Liu:
Temporal refinement and multi-grained matching for moment retrieval and highlight detection. 46 - Haisheng Li, Rongrong Yuan, Qiuyi Li, Cong Hu:
Research on image captioning using dilated convolution ResNet and attention mechanism. 47 - Yufan Hu, Yi Zhang, Lixin Zhang:
Long-tailed video recognition via majority-guided diffusion model. 48 - Zongmin Li, Yachuan Li, Xavier Soria P., Chaozhi Yang, Qian Xiao, Yun Bai, Hua Li, Xiangdong Wang:
Compact twice fusion network for edge detection. 49 - Jun Tang, Enxue Ma, Yang Qu, Wenbo Gao, Yuchen Zhang, Lin Gan:
UAPT: an underwater acoustic target recognition method based on pre-trained Transformer. 50 - Chenglong Shao, Tongzhen Si, Xiaohui Yang:
Exploring granularity-associated invariance features for text-to-image person re-identification. 51 - Maojin Sun:
A method for solving the multiple degradation video quality enhancement problem: a processing framework for AI-based coding damage repair in concert with video super-resolution. 52 - Dahong Xu, Siyu Jiang, Yihan Zhang, Xi Li:
Psychological analysis of house-tree-person drawings based on multimodal large models. 53 - Tingyu Wang, Rui Zhai, Longge Wang, Junyang Yu, Han Li, Zhicheng Wang, Jinhu Wu:
Multi-scale attention and loss penalty mechanism for multi-view clustering. 54 - Ronqi Wang, Ronguo Zhang, Jing Hu, Rui Zhang, Lifang Wang, Xiaojun Liu:
Position-aware feature matching algorithm based non-rigid point cloud registration. 55 - Miaohui Zhang, Chenxing Shen, Yangyang Deng, Li Wang:
Camouflaged object detection via boundary refinement. 56 - Ziyi Miao, Lan Yao, Feng Zeng, Yi Wang, ZhiGuo Hong:
An effective retrieval model for home textile images based on deep feature extraction. 57 - Weihua Ou, Yingjie Chen, Linqing Liang, Jianping Gou, Jiahao Xiong, Jiacheng Zhang, Lingge Lai, Lei Zhang:
Cross-modal retrieval of chest X-ray images and diagnostic reports based on report entity graph and dual attention. 58 - Dingyu Lu, Zihou Liu, Dongming Zhang, Jing Zhang, Guoqing Jin:
Spatial-temporal transformer network for protecting person-of-interest from deepfaking. 59 - Zhenguang Wang, Huanjie Tao, Hui Zhou, Yishi Deng, Ping Zhou:
A content-style control network with style contrastive learning for underwater image enhancement. 60 - Danyang Cao, Hongbo Zhou, Yongfu Wang:
Improve the image caption generation on out-domain dataset by external knowledge augmented. 61 - Xiaowen Shi, Chao Zhou, Yuan-Gen Wang:
Generative adversarial defense via conditional diffusion model. 62 - Jieyu An, Binfen Ding, Wan Mohd Nazmee Wan Zainon:
Improving multimodal sentiment prediction through vision-language feature interaction. 63 - Kangkang Xu, Wen Han, Yixiang Fang, Yi Zhao, Jun Li, Junxiang Wang:
A robust image watermarking framework based on U2-net encoder and loss function weight assignment. 64 - Zhixue Liang, Wenyong Dong, Bo Zhang:
CLIP-TSA: CLIP-guided open-vocabulary semantic segmentation with two-level semantic awareness. 65 - Liangtai Zhou, Weiwei Zhang, Banghui Zhang, Xiaobin Li, Jianqing Zhu:
A strong benchmark for yoga action recognition based on lightweight pose estimation model. 66 - Xinmin Cheng, Maoke Ran, Benyao Chen, Hongwei Yin:
Image channel and spatial information integrated method for fall detection. 67 - Mingqi Liu, Zhixin Li:
A dissimilarity feature-driven decomposition network for multimodal sentiment analysis. 68 - Xianghua Kong, Ning Xu, Zefang Sun, Zhewen Shen, Bolun Zheng, Chenggang Yan, Jinbo Cao, Rongbao Kang, An-An Liu:
Counterfactual GAN for debiased text-to-image synthesis. 69 - Carlos Marín-Lora, Miguel Chover:
GameScript: a simplified scripting language for video game development. 70 - Pengqi Yin:
Visual-textual adversarial learning for person re-identification. 71 - Guangsheng Luo, ZhiJun Fang, JianLing Liu, YiFanBai Bai:
CLIP guided image caption decoding based on monte carlo tree search. 72 - Wei Zhang, Hongjie Li, Wei Ke:
LF-GIANet: cascaded global-view information adaptation-guided network for light field image super-resolution. 73 - Jiwu Sun, Cheng Xu, Cheng Zhang, Yujia Zheng, Pengfei Wang, Hongzhe Liu:
Flood scenarios vehicle detection algorithm based on improved YOLOv9. 74 - Lingtao Wang, Yong Hu:
Topic-guided multi-domain fake news detection. 75 - Tongchi Zhou, Hongyu He, Yanzhao Wang, Yuan Liao:
Improved gated recurrent units together with fusion for semantic segmentation of remote sensing images based on parallel hybrid network. 76 - Sujuan Li, Gengsheng Xie:
Relation-aware non-local attention network for person re-identification. 77 - Haomou Bai:
Inception-like Large Kernel network for lightweight image super-resolution. 78 - Zerui Xu, Dechao Chen, Wenyan Gong:
UMSSNet: a unified multi-scale segmentation network for heterogeneous medical images. 79 - Zhanyang Liang, Yan Wo:
From coarse to fine: a two-stage common semantic space construction for unpaired cross modal retrieval. 80 - Junyin Peng, Hong Tang, Wenbin Zheng:
Hierarchical heterogeneous graph network based multimodal emotion recognition in conversation. 81 - Jiliang Wang, Cancan Jin, Siwang Zhou:
Segmentation-aware image super-resolution with generative adversarial networks. 82 - Xiangchun Yu, Huofa Liu, Dingwen Zhang, Jianqing Wu, Jian Zheng:
Hierarchical Region-level Decoupling Knowledge Distillation for semantic segmentation. 83 - Feng Xue, Peng Li, Yu Li, Shujie Li:
WPELip: enhance lip reading with word-prior information. 84 - Zhiyong Xiao, Yang Li, Zhaohong Deng:
Food image segmentation based on deep and shallow dual-branch network. 85 - Jue Tian, Lele Guan, Yang Liu, Le Zhang, Yanping Chen:
Deepphysio: detecting deepFake with non-personalized feature of physiological signal. 86 - Andrea Morales-Garzón, Karel Gutiérrez-Batista, María J. Martín-Bautista:
Adaptafood: an intelligent system to adapt recipes to specialised diets and healthy lifestyles. 87 - Saif Ur Rehman Khan, Sohaib Asif, Ming Zhao, Wei Zou, Yangfan Li:
Optimize brain tumor multiclass classification with manta ray foraging and improved residual block techniques. 88 - Yubo Zhang, Liying Zheng, Qingming Huang:
Multi-object tracking based on graph neural networks. 89 - Libo Cheng, Wenlin Du, Zhe Li, Xiaoning Jia:
AFEV-INet: adaptive feature extraction variational interactive network for remote sensing image denoising. 90 - Juan Yang, Yuhang Wei, Ronggui Wang, Lixia Xue:
VTIENet: visual-text information enhancement network for image captioning. 91 - Xiangwei Chen, Chenghai Yu:
SCG-DETR: a high-precision railway turnout defect detection method based on attention feature fusion and SMP-CGLU approach. 92 - Penglei Wang, Xin Fan, Qimeng Yang, Shengwei Tian, Long Yu:
Object detection of mural images based on improved YOLOv8. 93 - Gaili Li, Yongna Yuan, Ruisheng Zhang:
A spatial-temporal graph attention network for protein-ligand binding affinity prediction based on molecular geometry. 94 - Dehua Ma, Xiaoliang Zhu, Yanxiang Li, Wenzhe Meng, Siping Xu:
RANet: A receptive aggregation network for polyp segmentation. 95 - Guangyong Gao, Xiaoan Chen, Li Li:
SSRH: screen-shooting robust hyperlink based on deep learning. 96 - Jihua Ye, Youcai Zou, Zhixiong Wang, Tiantian Wang, Chao Wang, Wentao Wan:
ADMF-ER: a novel approach for wild expression recognition integrating adaptive dropout and multi-level features. 97
Volume 31, Number 2, April 2025
- Liang Yang, Qi Yang, Jingjie Zeng, Tao Peng, Zhihao Yang, Hongfei Lin:
Dialogue sentiment analysis based on dialogue structure pre-training. 98 - Haozhe Tang, Lei Yu, Yu Shao:
MARCFusion: adaptive residual cross-domain fusion network for medical image fusion. 99 - Aizhong Mi, Xianru Huang, Zhanqiang Huo, Luyao Liu:
Context-aware learning and background activation suppression for weakly supervised semantic segmentation. 100 - Ang Li, Xinghao Yang, Baodi Liu, Honglong Chen, Dapeng Tao, Weifeng Liu:
Parentheses insertion based sentence-level text adversarial attack. 101 - Ruinian Shi, Qiang He, Hengyou Wang, Changlun Zhang:
FDC-Net: foreground dynamic capture with deep feature enhancement for video anomaly detection. 102 - Liman Jiang, Canlong Zhang, Lei Wu, Zhixin Li, Zhiwen Wang, Chunrong Wei:
Joint feature augmentation and posture label for cloth-changing person re-identification. 103 - Di Wu, Yuying Zheng, Peng Cheng:
Co-interaction for intent recognition and slot filling with global-local-spatial-temporal feature fusion. 104 - Zijie Song, Zhenzhen Hu, Richang Hong:
Grid Jigsaw Representation with CLIP: a new perspective on image clustering. 105 - Huy Quang Pham, Thang Kien-Bao Nguyen, Quan Van Nguyen, Dan Quang Tran, Nghia Hieu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen:
ViOCRVQA: novel benchmark dataset and VisionReader for visual question answering by understanding Vietnamese text in images. 106 - Huan Pan, Ruiya Ji, Wenming Cao, Zhao Huang, Jianqi Zhong:
Optimizing human motion prediction through decoupled motion spatio-temporal trends. 107 - Xiaowen Ruan, Zhaobo Qi, Yuanrong Xu, Weigang Zhang:
Dual-guided multi-modal bias removal strategy for temporal sentence grounding in video. 108 - Jian Gao, Yuhe Zhang, Jinghao Hu, Tong Yang, Pengbo Zhou, Wen Tang, Wuyang Shui, Guohua Geng:
IOPCNet: inner and outer point classification based low overlap rate local-to-global point cloud registration. 109 - Yang Xuan, Xiao-Yu Zhang, Chen Li, Hui Wang, Chaoxu Mu:
LAM-YOLOv11 for UAV transmission line inspection: overcoming environmental challenges with enhanced detection efficiency. 110 - Zhengyang Lu, Ying Chen:
Self-supervised monocular depth estimation via multiple bilateral consistency. 111 - Jinxia Yu, Fabao Xue, Zhanqiang Huo, Yingxu Qiao:
Combining implicit and explicit priors for zero-reference low-light image enhancement and denoising. 112 - Fatemeh Shafizadegan, Ahmad Reza Naghsh-Nilchi, Elham Shabaninia:
Hybrid embedding for multimodal few-frame action recognition. 113 - Jingcheng Zhang, Yu Zhu, Shengjun Peng, Axi Niu, Qingsen Yan, Jinqiu Sun, Yanning Zhang:
A multi-scale feature cross-dimensional interaction network for stereo image super-resolution. 114 - Tianyu Hong, Guowei Teng, Ping An, Liquan Shen:
Spherical rotation for high efficiency ERP 360-degree video coding. 115 - Zixu Hu, Zhengtao Yu, Junjun Guo:
Multi-level sentiment-aware clustering for denoising in multimodal sentiment analysis with ASR errors. 116 - Shihui Zhang, Xueqiang Han, Zhiguo Cui, Sheng Zhan, Qing Tian:
Fast-colorfool: faster and more transferable semantic adversarial attack with complementary colors and cumulative perturbation. 117 - Wei Song, Dong Li:
Region attention and label embedding for facial action unit detection. 118 - ShaoDong Cui, Kaibo Duan, Wen Ma, Hiroyuki Shinnou:
CCGN: consistency contrastive-learning graph network for multi-modal fake news detection. 119 - Yifei Yang, Zhengyong Feng, Wei Jin, Pengcheng Miao:
ADD-YOLO: a new model for object detection in aerial images. 120 - Muhammad Anwar, Zhiyue Yan, Wenming Cao, Naeem Hussain:
STHRA: selective transformer hierarchical reciprocal attention-based deformable medical image registration. 121 - Xuezhi Xiang, Xiankun Zhou, Xinyao Wang, Mingliang Zhai, Abdulmotaleb El-Saddik:
Multi-object tracking with scale-aware transformer and enhanced association strategy. 122 - Qiong Hu, Masrah Azrifah Azmi Murad, Qi Li:
Advancing music emotion recognition: large-scale dataset construction and evaluator impact analysis. 123 - Fudong Nian, Yanhong Gu, Wentao Wang, Aoyu Liu, Dong Zhang, Fanding Li:
Rwkv-vg: visual grounding with RWKV-driven encoder-decoder framework. 124 - Zhengwei Jin, Yun Wei:
UMPA: Unified multi-modal prompt with adapter for vision-language models. 125 - Chun Zhang, Jin Wang, Yunhui Shi, Baocai Yin, Nam Ling:
A CNN-transformer hybrid network with selective fusion and dual attention for image super-resolution. 126 - Jing Lv, Zhi Liu, Gongyang Li:
Few-shot fine-tuning with auxiliary tasks for video anomaly detection. 127 - Xu Liu, Chenhua Liu, Xianye Zhou, Guodong Fan:
EATNet: edge-aware and transformer-based network for RGB-D salient object detection. 128 - Xin Chao, Xiaosha Qi, Ruiqi Ding, Genlin Ji:
Vehicle lane change behavior recognition based on multi-scale three-stream 3D ResNets. 129 - Ang Li, Xinghao Yang, Baodi Liu, Honglong Chen, Dapeng Tao, Weifeng Liu:
Correction: Parentheses insertion based sentence-level text adversarial attack. 130 - Zhiwei Tang, ShuWei Xu, Haozhe Jin, Shichong Liu, Rui Zhai, Ke Lu:
Personalized federated learning via decoupling self-knowledge distillation and global adaptive aggregation. 131 - Mingyang Lei, Hong Song, Tianyu Fu, Deqiang Xiao, Danni Ai, Jingfan Fan, Yifei Yang, Ying Gu, Jian Yang:
SEMNet: a simple and efficient MLP-based network for 3D Face point clouds landmarks localization. 132 - Lehao Rong, Liqing Huang:
Image deblurring algorithm based on unsupervised network and alternating optimization iterations. 133 - Caifeng Liu, Lianyu Hu:
Rethinking the temporal downsampling paradigm for continuous sign language recognition. 134 - Zhiheng Gong, Huan Rong, Zhongfeng Chen, Yixiang Tang, Victor S. Sheng:
EDCM-EA: event prediction based on event development context mining considering event arguments. 135 - Dongliang Cao, Wang Ren, Changhong Yu, Bin Wu:
IFMOT: interactive perception and feature optimization network for multi-object tracking. 136 - Hongfei Liu, Ning He, Xunrui Huang, Runjie Li:
A video anomaly detection framework based on feature-strengthened and memory feature-ernhanced reconstruction. 137 - Yuxin Li, Hu Lu, Tingting Qin, Juanjuan Tu, Shengli Wu:
CM-DASN: visible-infrared cross-modality person re-identification via dynamic attention selection network. 138 - Hui Chen, Rong Chen, Yushi Li, Haoran Li, Nannan Li:
Unsupervised single-image dehazing via self-guided inverse-retinex GAN. 139 - Boyuan Ma, Donglin Zhang, Xiao-Jun Wu:
Food nutrition estimation with RGB-D fusion module and bidirectional feature pyramid network. 140 - Kang Tong, Yiquan Wu:
Small object detection using hybrid evaluation metric with context decoupling. 141 - Qingsong Tang, Yalei Ren, Zhanghui Shan, Chenyang Bao, Yang Liu:
Dual-branch aggregation and edge refinement network for few shot semantic segmentation. 142 - Younghoon Lee:
Enhancing plant health classification via diffusion model-based data augmentation. 143 - Hanqi Jiang, Jinlong Shi, Yongjie Gao, Xin Shu, Suqin Bai, Qiang Qian, Dan Xu:
Psg-6d: prior-free implicit category-level 6D pose estimation with SO(3)-equivariant network and point cloud global enhancement. 144 - Yuzhen Niu, Siling Chen, Shanshan Chen, Fusheng Li:
Progressive fusion of local and global image features for cross-modal image aesthetic assessment. 145 - Kuo Tan, Zhaobo Qi, Jianping Zhong, Yuanrong Xu, Weigang Zhang:
KN-VLM: KNowledge-guided Vision-and-Language Model for visual abductive reasoning. 146 - Jiacheng Zhao, Haojie Che, Yongxi Li:
Spatial enhanced multi-level alignment learning for text-image person re-identification with coupled noisy labels. 147 - Haojie Che, Jiacheng Zhao, Yongxi Li:
Multi-level fine-grained center calibration network for unsupervised person re-identification. 148 - Jiacheng Lu, Kaiwen Wang, Hui Ding, Zhuhong Shao, Rongyin Qin, Guoping Huo:
MSCA-Sp R-CNN: a segmentation algorithm for pneumonia small lesions integrating multi-scale channel attention and sub-pixel upsampling. 149 - Yang Liu, Wenyi Zhu, Linyu Dong, Yuzhong Zhang, Xiang Guo:
Enhancing interpretability in video-based personality trait recognition using SHAP analysis. 150 - Yalin Song, Peng Qian, Kexin Zhang, Shichong Liu, Rui Zhai, Ran Song:
An improved Multi-Scale Fusion and Small Object Enhancement method for efficient pedestrian detection in dense scenes. 151 - Jing Wang, Xiaohong Li, Xuesong Dai, Shuo Zhuang, Meibin Qi:
Contrastive learning-based joint pre-training for unsupervised domain adaptive person re-identification. 152 - Teerath Kumar, Alessandra Mileo, Malika Bendechache:
Saliency-based metric and FaceKeepOriginalAugment: a novel approach for enhancing fairness and Diversity. 153 - Xinwang Chen, Fengrui Ji, Renxin Chu, Baolin Liu:
Data-free pruning of CNN using kernel similarity. 154 - Xuanrui Xiong, Haihong Huang, Tianyu Li, Xiaolin Fan, Yuan Zhang:
DSFAT: a dual-stream framework assisted by textual information for person re-identification in real scenes. 155 - Tianming Zhan, Chenyang Lu, Huapeng Wu, Chenyun Wang:
A novel gradient and semantic-aware transformer network for low-light image enhancement. 156 - Jian Shi, Rui Xu, Baoli Sun, Tiantian Yan, Zhi-Hui Wang, Haojie Li:
Structure-preserving dental plaque segmentation via dynamically complementary information interaction. 157 - Abeer Ayoub, Walid El-Shafai, Fathi E. Abd El-Samie, Ehab K. I. Hamad, S. El-Rabaie:
Video and image quality enhancement using an enhanced lower bound on transmission map dehazing technique. 158 - Haochen Zhang, Shuai Zhang:
Federated semi-supervised polyp image detection based on client feature alignment. 159 - Xuehua Song, Junxing Zhou, Hua Jin, Xin Yuan, Chang-da Wang:
Enhancing cross-modality person re-identification through attention-guided asymmetric feature learning. 160 - Yiming Xing, Jindong Zhang:
Residual channel prior-guided multi-scale progressive dehazing network with hybrid attention. 161 - Jianming Zhang, Zhijian Feng, Jia Jiang, Xiangnan Shi, Jin Zhang:
RGB-Net: transformer-based lightweight low-light image enhancement network via RGB channel separation. 162 - Xianhui Nie, Yong Fang, Xin Liu, Hao Li, Zi Wang, Longzhen Qiu:
Incorporating human attention shifting features for enhanced local dimming performance. 163 - Min Dai, Wenshan Zhang, Wenguang Zheng:
Digital stabilization method for old movies based on mobilesam and optical flow. 164 - Zhixiong Liu, Fang Liu, Mohan Zhang, Shenglan Cui:
ACIH-VQT: aesthetic constraints incorporated hierarchical VQ-transformer for text logo synthesis. 165 - Lanhui Liu, Menglin Kong, Cong Cao, Zhanjie Shu, Kecheng Liu, Xingquan Li, Muzhou Hou:
Personalized music recommendation algorithm based on machine learning. 166 - Yang Liu, Xinyu Liu, Ling Zhao, Bo Mi:
Automatic extraction method for humming-to-Guzheng melody based on improved YIN algorithm. 167 - Dalang Liu, Yunbo Rao, Jialong Zhu, Yanjin Ma, Jie Li:
FSformer: fusing frequency and spatial domain transformer network for underwater image enhancement. 168 - Chiqin Li, Lun Xie, Xinheng Wang, Hang Pan, Zhiliang Wang:
A disentanglement mamba network with a temporally slack reconstruction mechanism for multimodal continuous emotion recognition. 169 - Hichem Metmer, Xiaoshan Yang:
FedMRG: federated medical report generation via text-aware learning rate adjustment and multi-level prototype collaboration. 170 - Hong Liang, Yu Li, Qian Zhang, Mingwen Shao:
Do-DETR: enhancing DETR training convergence with integrated denoising and RoI mechanism. 171 - Jianxin Wang, Haijian Shao, Xing Deng, Shuheng Lian:
Robust novel view synthesis from multi-view feature stereo matching priors. 172 - Wencong Zhang, Zhiyang Guo, Wengang Zhou, Houqiang Li:
AAGS: Appearance-Aware 3D Gaussian Splatting with Unconstrained Photo Collections. 173 - Linyu Huang, Zijie Xue, Qian Ning, Yong Guo, Yongsheng Li:
A guidance and alignment transformer model for visible-infrared person re-identification. 174 - Wen Li, Xiaoning Song, Wenjie Zhang, Yang Hua, Xiaojun Wu:
Link prediction via adversarial knowledge distillation and feature aggregation. 175 - Ming Fang, Qi Liu, Jianping Ren, Jie Li, Xinning Du, Shuhua Liu:
A three-stream fusion network for 3D skeleton-based action recognition. 176 - Ouafa Talha, Wenju Zhou, Naitong Yuan, Yuan Xu:
Improved YOLOv8-C2fCA for embryonic cell detection and counting. 177 - Chunman Yan, Huiling Li:
CAPNet: tomato leaf disease detection network based on adaptive feature fusion and convolutional enhancement. 178 - Eunsam Kim, Jinsung Kim, Choonhwa Lee:
Efficient time-extended TV viewing through hybrid data redundancy in networked appliances. 179 - Memoona Aziz, Umair Rehman, Muhammad Umair Danish, Syed Ali, Amir Zaib Abbasi:
Towards a unified evaluation framework: integrating human perception and metrics for AI-generated images. 180 - Zhenping Mou, Tianqi Song, Hong Luo:
Dual-visual collaborative enhanced transformer for image captioning. 181 - Dangguo Shao, Rui Xu, Lei Ma, Sanli Yi:
Tubular-aware mamba for accurate retinal vessel segmentation: preserving fine details and topological connectivity. 182 - Yongsheng Ye, Guoguang Tan, Qiang Liu, Liu Liu, Jiawei Chu, Bin Wen, Lili Li:
TSSSKD-YOLO: an intelligent classification and defect detection method of insulators on transmission lines by fusing knowledge distillation in multiple scenarios. 183
