


default search action
28th MMM 2022: Phu Quoc, Vietnam - Part I
- Björn Þór Jónsson

, Cathal Gurrin
, Minh-Triet Tran
, Duc-Tien Dang-Nguyen
, Anita Min-Chun Hu
, Huynh Thi Thanh Binh
, Benoit Huet
:
MultiMedia Modeling - 28th International Conference, MMM 2022, Phu Quoc, Vietnam, June 6-10, 2022, Proceedings, Part I. Lecture Notes in Computer Science 13141, Springer 2022, ISBN 978-3-030-98357-4
Best Paper Session
- Yaxuan Hu

, Yuehong Dai, Zhongxiang Wang:
Real-time Detection of Tiny Objects Based on a Weighted Bi-directional FPN. 3-14 - Boqun Li, Zhong Qian

, Peifeng Li, Qiaoming Zhu:
Multi-modal Fusion Network for Rumor Detection with Texts and Images. 15-27 - Yuan Chang

, Tao Peng, Ruhan He, Xinrong Hu, Junping Liu, Zili Zhang, Minghua Jiang:
PF-VTON: Toward High-Quality Parser-Free Virtual Try-On Network. 28-40 - Yuyan Yang, Xin Ni, Yanbin Hao, Chenyu Liu, Wenshan Wang, Yifeng Liu, Haiyong Xie:

MF-GAN: Multi-conditional Fusion Generative Adversarial Network for Text-to-Image Synthesis. 41-53
Applications 1
- Kezhen Xie, Lei Huang, Wenfeng Zhang, Qibing Qin, Zhiqiang Wei:

Learning to Classify Weather Conditions from Single Images Without Labels. 57-68 - Yongquan Wan, Cairong Yan, Bofeng Zhang, Guobing Zou:

Learning Image Representation via Attribute-Aware Attention Networks for Fashion Classification. 69-81 - Yuan Chang

, Tao Peng, Ruhan He, Xinrong Hu, Junping Liu, Zili Zhang, Minghua Jiang:
Toward Detail-Oriented Image-Based Virtual Try-On with Arbitrary Poses. 82-94 - Ilias Gialampoukidis

, Stelios Andreadis
, Nick Pantelidis, Sameed Hayat
, Li Zhong, Marios Bakratsas, Dennis Hoppe, Stefanos Vrochidis
, Ioannis Kompatsiaris
:
Parallel DBSCAN-Martingale Estimation of the Number of Concepts for Automatic Satellite Image Clustering. 95-106
Multimedia Applications - Perspectives, Tools and Applications (Special Session) and Brave New Ideas
- Werner Bailer

, Georg Thallinger
, Verena Krawarik
, Katharina Schell
, Victoria Ertelthalner
:
AI for the Media Industry: Application Potential and Automation Levels. 109-118 - Ladislav Peska

, Jakub Lokoc:
Rating-Aware Self-Organizing Maps. 119-130 - Yana van de Sande

, Martha A. Larson
:
Color the Word: Leveraging Web Images for Machine Translation of Untranslatable Words. 131-138
Activities and Events
- Jiankai Li

, Yunhong Wang, Weixin Li:
MGMP: Multimodal Graph Message Propagation Network for Event Detection. 141-153 - Jiewen Wang, Shuang Liang:

Pose-Enhanced Relation Feature for Action Recognition in Still Images. 154-165 - Tao Peng, Caiyin Tang, Jing Wang:

Prostate Segmentation of Ultrasound Images Based on Interpretable-Guided Mathematical Model. 166-177 - Lin Wang, Yan Song, Rui Yan, Xiangbo Shu:

Spatiotemporal Perturbation Based Dynamic Consistency for Semi-supervised Temporal Action Detection. 178-190
Multimedia Datasets for Repeatable Experimentation (Special Session)
- Jakub Lokoc

, Werner Bailer
, Kai Uwe Barthel
, Cathal Gurrin
, Silvan Heller
, Björn Þór Jónsson
, Ladislav Peska
, Luca Rossetto
, Klaus Schoeffmann
, Lucia Vadicamo
, Stefanos Vrochidis
, Jiaxin Wu
:
A Task Category Space for User-Centric Comparative Multimedia Search Evaluations. 193-204 - Konstantin Schall, Kai Uwe Barthel, Nico Hezel, Klaus Jung:

GPR1200: A Benchmark for General-Purpose Content-Based Image Retrieval. 205-216 - Ly-Duyen Tran

, Thanh Cong Ho, Lan Anh Pham, Binh T. Nguyen, Cathal Gurrin
, Liting Zhou
:
LLQA - Lifelog Question Answering Dataset. 217-228
Learning
- Yijie Zhong

, Zhengxing Sun, Shoutong Luo, Yunhan Sun, Wei Zhang:
Category-Sensitive Incremental Learning for Image-Based 3D Shape Reconstruction. 231-244 - Zhaoliang He, Yuan Wang, Chen Tang, Zhi Wang, Wenwu Zhu, Chenyang Guo, Zhibo Chen:

AdaConfigure: Reinforcement Learning-Based Adaptive Configuration for Video Analytics Services. 245-257 - Gursimran Singh, Lingyang Chu, Lanjun Wang

, Jian Pei
, Qi Tian, Yong Zhang
:
Mining Minority-Class Examples with Uncertainty Estimates. 258-271 - Siyuan Chen:

Conditional Context-Aware Feature Alignment for Domain Adaptive Detection Transformer. 272-283
Multimedia for Medical Applications (Special Session)
- Vasileios-Rafail Xefteris

, Athina Tsanousa, Thanassis Mavropoulos, Georgios Meditskos, Stefanos Vrochidis, Ioannis Kompatsiaris:
Human Activity Recognition with IMU and Vital Signs Feature Fusion. 287-298 - Zhaohui Zhu

, Marc A. Kastner
, Shin'ichi Satoh
:
On Assisting Diagnoses of Pareidolia by Emulating Patient Behavior. 299-310 - Pooja Prajod

, Tobias Huber
, Elisabeth André:
Using Explainable AI to Identify Differences Between Clinical and Experimental Pain Detection Models Based on Facial Expressions. 311-322
Applications 2
- Xuena Ren, Dongming Zhang, Xiuguo Bao, Lei Shi:

Double Granularity Relation Network with Self-criticism for Occluded Person Re-identification. 325-338 - Haoyuan Zheng, Weihang Wang, Fei Wen, Peilin Liu:

A Complementary Fusion Strategy for RGB-D Face Recognition. 339-351 - Zhibin Xiao

, Pengwei Xie
, Guijin Wang
:
Multi-scale Cross-Modal Transformer Network for RGB-D Object Detection. 352-363 - Jian He, Xian Zhong

, Jingling Yuan, Ming Tan, Shilei Zhao
, Luo Zhong:
Joint Re-Detection and Re-Identification for Multi-Object Tracking. 364-376
Multimedia Analytics for Contextual Human Understanding (Special Session)
- Srijith Unni, Sushma Suryanarayana Gowda, Alan F. Smeaton

:
An Investigation into Keystroke Dynamics and Heart Rate Variability as Indicators of Stress. 379-391 - Thao V. Ha, Hoang Nguyen

, Son T. Huynh, Trung T. Nguyen, Binh T. Nguyen:
Fall Detection Using Multimodal Data. 392-403 - Tenzin Palbar, Manoj Kesavulu

, Cathal Gurrin
, Renaat Verbruggen:
Prediction of Blood Glucose Using Contextual LifeLog Data. 404-415 - Liting Zhou

, Cathal Gurrin
:
Multimodal Embedding for Lifelog Retrieval. 416-427
Applications 3
- Yi Li, Dehao Wu, Yuesheng Zhu:

A Multiple Positives Enhanced NCE Loss for Image-Text Retrieval. 431-442 - Xiang Shuai, Xiao Wang, Wei Wang, Xin Yuan

, Xin Xu:
SAM: Self Attention Mechanism for Scene Text Recognition Based on Swin Transformer. 443-454 - Jian Yang, Chi Do-Kim Pham, Jinjia Zhou:

JVCSR: Video Compressive Sensing Reconstruction with Joint In-Loop Reference Enhancement and Out-Loop Super-Resolution. 455-466 - Yingrui Wang, Suyu Wang, Longhua Sun:

Point Cloud Upsampling via a Coarse-to-Fine Network. 467-478
Image Analytics
- Yuzhuo Wang, Yanlin Geng:

Arbitrary Style Transfer with Adaptive Channel Network. 481-492 - Shuang Zheng, Liang Wang

:
Fast Single Image Dehazing Using Morphological Reconstruction and Saturation Compensation. 493-504 - Lulu Zhao, Ling Shen, Richang Hong:

One-Stage Image Inpainting with Hybrid Attention. 505-517 - Jiayao Xu

, Chen Fu
, Zhiqiang Zhang, Jinjia Zhou:
Real-Time FPGA Design for OMP Targeting 8K Image Reconstruction. 518-529
Speech and Music
- Ke Liu

, Chen Wang
, Jiayue Chen
, Jun Feng
:
Time-Frequency Attention for Speech Emotion Recognition with Squeeze-and-Excitation Blocks. 533-543 - Jing Xiao, Jiaqi Liu, Dengshi Li, Lanxin Zhao, Qianrui Wang:

Speech Intelligibility Enhancement By Non-Parallel Speech Style Conversion Using CWT and iMetricGAN Based CycleGAN. 544-556 - Or Goren, Eliya Nachmani, Lior Wolf:

A-Muze-Net: Music Generation by Composing the Harmony Based on the Generated Melody. 557-568 - Abhishek Srivastava, Wei Duan, Rajiv Ratn Shah

, Jianming Wu, Suhua Tang
, Wei Li, Yi Yu:
Melody Generation from Lyrics Using Three Branch Conditional LSTM-GAN. 569-581
Multimodal Analytics
- Pengfei Du

, Yali Gao, Xiaoyong Li:
Bi-attention Modal Separation Network for Multimodal Video Fusion. 585-598 - Qi Zhong, Qian Wang, Ji Liu:

Combining Knowledge and Multi-modal Fusion for Meme Classification. 599-611 - Binqiang Wang

, Gang Dong, Yaqian Zhao, Rengang Li, Qichun Cao, Yinyin Chao:
Non-Uniform Attention Network for Multi-modal Sentiment Analysis. 612-623 - Yanbei Sun, Yao Lu, Haowei Lu, Qingjie Zhao, Shunzhou Wang:

Multimodal Unsupervised Image-to-Image Translation Without Independent Style Encoder. 624-636

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














