


default search action
International Journal of Multimedia Information Retrieval, Volume 11
Volume 11, Number 1, March 2022
- Silvan Heller

, Viktor Gsteiger
, Werner Bailer
, Cathal Gurrin
, Björn Þór Jónsson
, Jakub Lokoc
, Andreas Leibetseder
, Frantisek Mejzlík
, Ladislav Peska
, Luca Rossetto
, Konstantin Schall
, Klaus Schoeffmann
, Heiko Schuldt
, Florian Spiess
, Ly-Duyen Tran
, Lucia Vadicamo
, Patrik Veselý, Stefanos Vrochidis
, Jiaxin Wu
:
Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown. 1-18 - S. Suganyadevi

, V. Seethalakshmi
, K. Balasamy
:
A review on deep learning in medical image analysis. 19-38 - Sinda Elghoul, Faouzi Ghorbel

:
A fast and robust affine-invariant method for shape registration under partial occlusion. 39-59 - Mohammad Farhad Bulbul, Saiful Islam

, Zannatul Azme, Preksha Pareek
, Md. Humaun Kabir, Hazrat Ali
:
Enhancing the performance of 3D auto-correlation gradient features in depth action classification. 61-76 - Carlos de la Fuente, Jose J. Valero-Mas

, Francisco J. Castellanos, Jorge Calvo-Zaragoza
:
Multimodal image and audio music transcription. 77-84
Volume 11, Number 2, June 2022
- Devashree R. Patrikar

, Mayur Rajaram Parate
:
Anomaly detection using edge computing in video surveillance system: review. 85-110 - Jie Yan

, Yuxiang Xie, Xidao Luan, Yanming Guo, Quanzhi Gong, Suru Feng:
Caption TLSTMs: combining transformer with LSTMs for image captioning. 111-121 - Md. Meraz

, Md Afzal Ansari, Mohammed Javed, Pavan Chakraborty:
DC-GNN: drop channel graph neural network for object classification and part segmentation in the point cloud. 123-133 - Ohoud Nafea

, Wadood Abdul, Ghulam Muhammad:
Multi-sensor human activity recognition using CNN and GRU. 135-147 - Xiaoyi Wang, Jun Huang

:
A local representation-enhanced recurrent convolutional network for image captioning. 149-157 - Marco Fisichella

:
Siamese coding network and pair similarity prediction for near-duplicate image detection. 159-170 - Masum Shah Junayed

, Md Baharul Islam
, Hassan Imani
, Tarkan Aydin
:
PDS-Net: A novel point and depth-wise separable convolution for real-time object detection. 171-188 - Jian Li, Yanming Guo

, Songyang Lao, Xiang Zhao, Liang Bai, Haoran Wang:
Few2Decide: towards a robust model via using few neuron connections to decide. 189-198
Volume 11, Number 3, September 2022
- Xiaoping Zhou, Xiangyu Han, Haoran Li, Jia Wang, Xun Liang:

Cross-domain image retrieval: methods and applications. 199-218 - Deepak Dagar, Dinesh Kumar Vishwakarma

:
A literature review and perspectives in deepfakes: generation, detection, and applications. 219-289 - Veronica Naosekpam

, Nilkanta Sahu:
Text detection, recognition, and script identification in natural scene images: a Review. 291-314 - Ademola Enitan Ilesanmi, Taiwo Ilesanmi, Oluwagbenga Paul Idowu, Drew A. Torigian, Jayaram K. Udupa:

Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review. 315-331 - Ahmed Iqbal

, Muhammad Sharif
, Mussarat Yasmin
, Mudassar Raza
, Shabib Aftab
:
Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey. 333-368 - Hao Pan, Jun Huang

:
Semantic-enhanced discriminative embedding learning for cross-modal retrieval. 369-382 - Na He

, Sam Ferguson:
Music emotion recognition based on segment-level two-stage learning. 383-394 - Ihssane Houhou

, Athmane Zitouni, Yassine Ruichek
, Salah Eddine Bekhouche
, Mohamed Kas
, Abdelmalik Taleb-Ahmed:
RGBD deep multi-scale network for background subtraction. 395-407 - Sweta Panigrahi

, U. S. N. Raju
:
InceptionDepth-wiseYOLOv2: improved implementation of YOLO framework for pedestrian detection. 409-430 - Mehdi Ellouze:

How can users' comments posted on social media videos be a source of effective tags? 431-443 - Deepika Varshney, Dinesh Kumar Vishwakarma

:
A unified approach of detecting misleading images via tracing its instances on web and analyzing its past context for the verification of multimedia content. 445-459
Volume 11, Number 4, December 2022
- Pranjal Kumar

, Piyush Rawat, Siddhartha Chauhan
:
Contrastive self-supervised learning: review, progress, challenges and future research directions. 461-488 - Pranjal Kumar

, Siddhartha Chauhan
, Lalit Kumar Awasthi
:
Human pose estimation using deep learning: review, methodologies, progress and future research directions. 489-521 - Jianlong Wu, Richang Hong, Qi Tian:

Special issue on cross-modal retrieval and analysis. 523-524 - Lingtao Meng, Feifei Zhang

, Xi Zhang, Changsheng Xu:
Prototype local-global alignment network for image-text retrieval. 525-538 - Zhengjie Huang, Zhenguang Liu, Jianhai Chen, Qinming He, Shuang Wu, Lei Zhu, Meng Wang:

Who is gambling? Finding cryptocurrency gamblers using multi-modal retrieval methods. 539-551 - Ren Zhang

, Ning He
, Shengjie Liu, Ying Wu, Kang Yan, Yuzhe He, Ke Lu:
Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition. 553-566 - Zefan Zhang

, Tianling Jiang, Chunping Liu
, Yi Ji:
Multi-aware coreference relation network for visual dialog. 567-576 - Keyang Cheng

, Xuesen Zhu, Yongzhao Zhan, Yunshen Pei:
Video deblurring and flow-guided feature aggregation for obstacle detection in agricultural videos. 577-588 - Xiaowei Zhang, Quan Fang

, Jun Hu, Shengsheng Qian, Changsheng Xu:
TCKGE: Transformers with contrastive learning for knowledge graph embedding. 589-597 - Silin Cai, Changping Wang, Jiajun Ding, Jun Yu, Jianping Fan:

FDAM: full-dimension attention module for deep convolutional neural networks. 599-610 - Yuxiang Xie, Jie Yan, Lai Kang, Yanming Guo

, Jiahui Zhang, Xidao Luan:
FCT: fusing CNN and transformer for scene classification. 611-618 - Mohammad Javad Parseh

, Mohammad Rahmanimanesh, Parviz Keshavarzi, Zohreh Azimifar:
Semantic-aware visual scene representation. 619-638 - Mohamed Kas

, Youssef El Merabet, Yassine Ruichek
, Rochdi Messoussi:
Generative adversarial networks for 2D-based CNN pose-invariant face recognition. 639-651 - Benoughidene Abdel Halim

, Titouna Faiza:
A novel method for video shot boundary detection using CNN-LSTM approach. 653-667 - Zhiguang Liu, Liangwei Wang, Jian Qiao:

Visual and semantic ensemble for scene text recognition with gated dual mutual attention. 669-680 - Junyan Yang, Jie Jiang

, Yanming Guo:
MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning. 681-694 - Mohammadreza Sheikh Fathollahi, Rezvan Heidari:

Gender classification from face images using central difference convolutional networks. 695-703 - You Yang, Yongzhi An

, Juntao Hu
, Longyue Pan:
Tri-RAT: optimizing the attention scores for image captioning. 705-715 - Stefanos-Iordanis Papadopoulos

, Christos Koutlis
, Symeon Papadopoulos, Ioannis Kompatsiaris:
Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products. 717-729 - Ren Togo

, Yuki Honma, Maiku Abe, Takahiro Ogawa, Miki Haseyama:
Similar interior coordination image retrieval with multi-view features. 731-740

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














