


default search action
International Journal of Multimedia Information Retrieval, Volume 14
Volume 14, Number 1, March 2025
- Qiuhong Tian, Weilun Miao, Lizao Zhang, Ziyu Yang, Yang Yu, Yanying Zhao, Lan Yao:
STCA: an action recognition network with spatio-temporal convolution and attention. 1 - Fan Yang, Nor Azman Ismail, Pang Yee Yong, Alhuseen Omar Alsayed
:
CAMIR: fine-tuning CLIP and multi-head cross-attention mechanism for multimodal image retrieval with sketch and text features. 2 - Hao Wen, Ziqian Lu, Fengli Shen, Zheming Lu, Jia-Lin Cui:
Improving skeleton-based action recognition with interactive object information. 3 - Ziyong Lin, Xiaolong Jiang, Jie Zhang, Mingyong Li:
Dual-matrix guided reconstruction hashing for unsupervised cross-modal retrieval. 4 - Hao Chen, Wu Huang, Tao Zhang
:
Optimized RT-DETR for accurate and efficient video object detection via decoupled feature aggregation. 5 - Zhong Ji, Yuanheng Liu, Xuan Wang, Jingren Liu, Jiale Cao, YunLong Yu:
Multi-task classification network for few-shot learning. 6 - Changqin Huang, Zhenheng Lin, Zhongmei Han, Qionghao Huang, Fan Jiang, Xiaodi Huang
:
PAMoE-MSA: polarity-aware mixture of experts network for multimodal sentiment analysis. 7 - Digambar Pawar, Raghavendra Gowda, Krishna Chandra:
Image forgery classification and localization through vision transformers. 8 - Lixia Xue, Jiang Dong, Ronggui Wang, Juan Yang:
MFAFD: a few-shot learning method for cascading models with parameter free attention and finite discrete space. 9 - Qiang Zhang, Qin Shi, Teng Cheng, Junning Zhang, Jiong Chen:
VPC-VoxelNet: multi-modal fusion 3D object detection networks based on virtual point clouds. 10
Volume 14, Number 2, June 2025
- Weichen Zhao, Yuxing Lu
, Zhiyuan Liu, Yuan Yang, Ge Jiao:
Cross-modal alignment with synthetic caption for text-based person search. 11 - Hemraj Singh, Mridula Verma, Ramalingaswamy Cheruku:
DMFNet: geometric multi-scale pixel-level contrastive learning for video salient object detection. 12 - Manh-Duy Nguyen, Binh T. Nguyen, Cathal Gurrin:
Concept-based and embedding-based models in lifelog retrieval: an empirical comparison of performance. 13 - Pu Yan, Kang Ruan, Lili Wang, Yang Zhao, Xu Wang:
Multi-view learning for camouflaged object detection with PVTv2. 14 - Chao Yang, Yakun Chen, Zihao Li, Xianzhi Wang
, Kaize Shi, Lina Yao, Guandong Xu, Zhongwen Guo:
Deep multimodal learning for time series analysis in social computing: a survey. 15 - Xinxin Hao, Haishun Du, Jiangtao Guo
, Jieru Li:
A CNN-transformer hybrid model and a multi-modal multi-stage training strategy for visible-infrared person re-identification. 16 - Minh-Tam Nguyen, Quynh T. Nguyen, Minh-Son Dao, Binh T. Nguyen:
Multimodal scene-graph matching for cheapfakes detection. 17 - Qiaoyun Zhang, Chih-Yung Chang, Shih-Jung Wu, Hsiang-Chuan Chang, Diptendu Sinha Roy:
MMDL: a multi-modal deep learning for video highlight detection in sports. 18 - Lingling Kan, Ruixuan Liu, Hongwei Liang, Fengcai Huo, Wenfeng Wang:
Human behavior recognition based on DualBiNet model. 19 - Mikel Williams-Lekuona, Georgina Cosma:
FiCo-ITR: bridging fine-grained and coarse-grained image-text retrieval for comparative performance analysis. 20 - Muhammad Irzam Liaqat
, Shah Nawaz, Muhammad Zaigham Zaheer, Muhammad Saad Saeed, Hassan Sajjad, Tom De Schepper, Karthik Nandakumar, Muhammad Haris Khan, Ignazio Gallo, Markus Schedl:
Chameleon: A Multimodal Learning Framework Robust to Missing Modalities. 21
Volume 14, Number 3, September 2025
- Jian Wang, Jia Su, Zonghui Wen, Yongqing Sun:
Enhanced YOLOv10 for small object detection with context-aware and adaptive modules. 22 - Guan Yang, Weihao Sun, Xiaoming Liu, Yang Liu, Chen Wang:
Semantic Fusion and Contrastive Generation for Generalized Zero-Shot Learning. 23 - Xiaofei Zhang, Xiaoguang Di, Runwen Zhu:
TPE-YOLO: improved low-light object detection using a two-way pyramid enhancement network. 24 - Yunxue Shao, Zhiyang Wang, Lingfeng Wang:
MCDINO: Self-supervised learning of masks based on combination of multi-path channel attention and local feature weighting. 25 - Shiwei Zou
, Yingmei Wei, Yuxiang Xie, Mingrui Lao, Xidao Luan:
Remote Sensing Image Change Captioning: A Comprehensive Review. 26

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.