


ICMR 2020: Dublin, Ireland
- Cathal Gurrin, Björn Þór Jónsson, Noriko Kando, Klaus Schöffmann, Yi-Ping Phoebe Chen, Noel E. O'Connor:
  Proceedings of the 2020 on International Conference on Multimedia Retrieval, ICMR 2020, Dublin, Ireland, June 8-11, 2020. ACM 2020, ISBN 978-1-4503-7087-5
Keynote Talks
- Ramesh C. Jain:
  What Should I Do? 1
- Henning Müller:
  Medical Image Retrieval: Applications and Resources. 2-3
- Marcel Worring:
  Beyond Relevance Feedback for Searching and Exploring large Multimedia Collections. 4
Tutorials
- Martin Wistuba, Ambrish Rawat, Tejaswini Pedapati:
  Automation of Deep Learning - Theory and Practice. 5-6
- Xavier Giró-i-Nieto:
  One Perceptron to Rule Them All: Language, Vision, Audio and Speech. 7-8
Best Paper Session
- Yutian Guo, Jingjing Chen, Hao Zhang, Yu-Gang Jiang:
  Visual Relations Augmented Cross-modal Retrieval. 9-15
- Eric Müller-Budack, Jonas Theiner, Sebastian Diering, Maximilian Idahl, Ralph Ewerth:
  Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency. 16-25
- Xu Sun, Xinwen Hu, Tongwei Ren, Gangshan Wu:
  Human Object Interaction Detection via Multi-level Conditioned Network. 26-34
- Sadaf Gulshad, Arnold W. M. Smeulders:
  Explaining with Counter Visual Attributes and Examples. 35-43
Oral Session 1: Cross-Modal Analysis
- Dejie Yang, Dayan Wu, Wanqian Zhang, Haisu Zhang, Bo Li, Weiping Wang:
  Deep Semantic-Alignment Hashing for Unsupervised Cross-Modal Retrieval. 44-52
- Po-Yao Huang, Xiaojun Chang, Alexander G. Hauptmann, Eduard H. Hovy:
  Forward and Backward Multimodal NMT for Improved Monolingual and Multilingual Cross-Modal Retrieval. 53-62
- Petr Byvshev, Pascal Mettes, Yu Xiao:
  Heterogeneous Non-Local Fusion for Multimodal Activity Recognition. 63-72
- Pim Dijt, Pascal Mettes:
  Trajectory Prediction Network for Future Anticipation of Ships. 73-81
Oral Session 2: Applications
- Yunshan Ma, Yujuan Ding, Xun Yang, Lizi Liao, Wai Keung Wong, Tat-Seng Chua:
  Knowledge Enhanced Neural Fashion Trend Forecasting. 82-90
- Guolong Wang, Zheng Qin, Junchi Yan, Liu Jiang:
  Learning to Select Elements for Graphic Design. 91-99
- Zhengcong Fei:
  Actor-Critic Sequence Generation for Relative Difference Captioning. 100-107
- Shuo Chen, Pascal Mettes, Tao Hu, Cees G. M. Snoek:
  Interactivity Proposals for Surveillance Videos. 108-116
Oral Session 3: Retrieval
- Zichen Zan, Lin Li, Jianquan Liu, Dong Zhou:
  Sentence-based and Noise-robust Cross-modal Retrieval on Cooking Recipes and Food Images. 117-125
- Arun Zachariah, Mohamed Gharibi, Praveen Rao:
  QIK: A System for Large-Scale Image Retrieval on Everyday Scenes With Common Objects. 126-135
- Zhi Xiong, Dayan Wu, Wen Gu, Haisu Zhang, Bo Li, Weiping Wang:
  Deep Discrete Attention Guided Hashing for Face Image Retrieval. 136-144
- Tianrui Niu, Fangxiang Feng, Lingxuan Li, Xiaojie Wang:
  Image Synthesis from Locally Related Texts. 145-153
Oral Session 4: Semantic Enrichment
- Suzi Kim, Sunghee Choi:
  Automatic Color Scheme Extraction from Movies. 154-163
- Hussam Lawen, Avi Ben-Cohen, Matan Protter, Itamar Friedman, Lihi Zelnik-Manor:
  Compact Network Training for Person ReID. 164-171
- Xinzhe Zhou, Yadong Mu:
  Google Helps YouTube: Learning Few-Shot Video Classification from Historic Tasks and Cross-Domain Sample Transfer. 172-179
- Yash Garg, K. Selçuk Candan:
  iSparse: Output Informed Sparsification of Neural Network. 180-188
Session: Posters (Full Length)
- Yanjie Chen, Likun Cai, Wei Cheng, Hao Wang:
  Super-Resolution Coding Defense Against Adversarial Examples. 189-197
- Fabio Carrara, Giuseppe Amato, Fabrizio Falchi, Claudio Gennaro:
  Continuous ODE-defined Image Features for Adaptive Retrieval. 198-206
- Xavier Favory, Frederic Font, Xavier Serra:
  Search Result Clustering in Collaborative Sound Collections. 207-214
- Pengcheng Gao, Ke Lu, Jian Xue:
  EfficientFAN: Deep Knowledge Transfer for Face Alignment. 215-223
- Qi Sun, Hongyan Liu, Jun He, Zhaoxin Fan, Xiaoyong Du:
  DAGC: Employing Dual Attention and Graph Convolution for Point Cloud based Place Recognition. 224-232
- Roshan Prakash Rane, Edit Szügyi, Vageesh Saxena, André Ofner, Sebastian Stober:
  PredNet and Predictive Coding: A Critical Review. 233-241
- Jia-Hong Huang, Marcel Worring:
  Query-controllable Video Summarization. 242-250
Session: Posters (Short)
- Xuxiao Bu, Bingfeng Li, Yaxiong Wang, Jihua Zhu, Xueming Qian, Marco Zhao:
  Semantic Gated Network for Efficient News Representation. 251-255
- Liviu-Daniel Stefan, Mihai Gabriel Constantin, Bogdan Ionescu:
  System Fusion with Deep Ensembles. 256-260
- Asra Aslam, Edward Curry:
  Reducing Response Time for Multimedia Event Processing using Domain Adaptation. 261-265
- Mahnaz Amiri Parian, Luca Rossetto, Heiko Schuldt, Stéphane Dupont:
  Are You Watching Closely? Content-based Retrieval of Hand Gestures. 266-270
- Takumi Ohkuma, Hideki Nakayama:
  Efficient Base Class Selection Algorithms for Few-Shot Classification. 271-275
- Konstantinos Gkountakos, Konstantinos Ioannidis, Theodora Tsikrika, Stefanos Vrochidis, Ioannis Kompatsiaris:
  A Crowd Analysis Framework for Detecting Violence Scenes. 276-280
- Ladislav Peska, Frantisek Mejzlík, Tomás Soucek, Jakub Lokoc:
  Towards Evaluating and Simulating Keyword Queries for Development of Interactive Known-item Search Systems. 281-285
- Shengxin Chen, Bo-Hao Chen, Zhaojiong Chen, YunBing Wu:
  Itinerary Planning via Deep Reinforcement Learning. 286-290
- Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, Gaël Richard:
  Confidence-based Weighted Loss for Multi-label Classification with Missing Labels. 291-295
- Dawei Zhang, Zhonglong Zheng, Xiaowei He, Liu Su, Liyuan Chen:
  Learning Fine-Grained Similarity Matching Networks for Visual Tracking. 296-300
- Bo Dong, Cristian Lumezanu, Yuncong Chen, Dongjin Song, Takehiko Mizoguchi, Haifeng Chen, Latifur Khan:
  At the Speed of Sound: Efficient Audio Scene Classification. 301-305
- Chihaya Matsuhira, Marc A. Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Keisuke Doman, Daisuke Deguchi, Hiroshi Murase:
  Imageability Estimation using Visual and Language Features. 306-310
- Federico Vaccaro, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo:
  Image Retrieval using Multi-scale CNN Features Pooling. 311-315
- Sabina Hult, Line Bay Kreiberg, Sami Sebastian Brandt, Björn Þór Jónsson:
  Analysis of the Effect of Dataset Construction Methodology on Transferability of Music Emotion Recognition Models. 316-320
- Camilo Vargas, Qianni Zhang, Ebroul Izquierdo:
  One Shot Logo Recognition Based on Siamese Neural Networks. 321-325
- Wei-Rou Lin, Hen-Hsen Huang, Hsin-Hsi Chen:
  Visual Story Ordering with a Bidirectional Writer. 326-330
- Lili Wang, Ruibo Liu, Soroush Vosoughi:
  Salienteye: Maximizing Engagement While Maintaining Artistic Style on Instagram Using Deep Neural Networks. 331-335
- Damianos Galanopoulos, Vasileios Mezaris:
  Attention Mechanisms, Signal Encodings and Fusion Strategies for Improved Ad-hoc Video Search with Dual Encoding Networks. 336-340
- Imam Yogie Susanto, Tse-Yu Pan, Chien-Wen Chen, Min-Chun Hu, Wen-Huang Cheng:
  Emotion Recognition from Galvanic Skin Response Signal Based on Deep Hybrid Neural Networks. 341-345
Session: Brave New Ideas
- Riku Togashi, Sumio Fujita, Tetsuya Sakai:
  Automatic Evaluation of Iconic Image Retrieval based on Colour, Shape, and Texture. 346-354
- Keith Curtis, George Awad, Shahzad Rajput, Ian Soboroff:
  HLVU: A New Challenge to Test Deep Understanding of Movies the Way Humans do. 355-361
- Tomás Skopal:
  On Visualizations in the Role of Universal Data Representation. 362-367
Session: Doctoral Symposium
- Omar Shahbaz Khan:
  An Interactive Learning System for Large-Scale Multimedia Analytics. 368-372
- Asra Aslam:
  Object Detection for Unseen Domains while Reducing Response Time using Knowledge Transfer in Multimedia Event Processing. 373-377
- Negin Ghamsarian:
  Enabling Relevance-Based Exploration of Cataract Videos. 378-382
Session: Demonstrations
- Mariona Caros, Maite Garolera, Petia Radeva, Xavier Giró-i-Nieto:
  Automatic Reminiscence Therapy for Dementia. 383-387
- Markus Schedl, Michael Mayr, Peter Knees:
  Music Tower Blocks: Multi-Faceted Exploration Interface for Web-Scale Music Access. 388-392
- Dinh V. Cuong, Dac H. Nguyen, Son Huynh, Phong Huynh, Cathal Gurrin, Minh-Son Dao, Duc-Tien Dang-Nguyen, Binh T. Nguyen:
  A Framework for Paper Submission Recommendation System. 393-396
- Andreas Leibetseder, Klaus Schöffmann:
  surgXplore: Interactive Video Exploration for Endoscopy. 397-401
- Thinhinane Yebda, Jenny Benois-Pineau, Marion Pech, Hélène Amièva, Cathal Gurrin:
  Detection of Semantic Risk Situations in Lifelog Data for Improving Life of Frail People. 402-406
- Chenhao Lin, Pengwei Hu, Hui Su, Shaochun Li, Jing Mei, Jie Zhou, Henry Leung:
  SenseMood: Depression Detection on Social Media. 407-411
- Quy H. Nguyen, Dac H. Nguyen, Minh-Son Dao, Duc-Tien Dang-Nguyen, Cathal Gurrin, Binh T. Nguyen:
  An Active Learning Framework for Duplicate Detection in SaaS Platforms. 412-415
- Van-Luon Tran, Anh-Vu Mai-Nguyen, Trong-Dat Phan, Anh-Khoa Vo, Minh-Son Dao, Koji Zettsu:
  An Interactive Multimodal Retrieval System for Memory Assistant and Life Organized Support. 416-420
Special Session 1: Human-Centric Cross-Modal Retrieval
- Xian Zhong, Tianyou Lu, Wenxin Huang, Jingling Yuan, Wenxuan Liu, Chia-Wen Lin:
  Visible-infrared Person Re-identification via Colorization-based Siamese Generative Adversarial Network. 421-427
- Zhengxiong Jia, Xirong Li:
  iCap: Interactive Image Captioning with Predictive Text. 428-435
- Taeyong Kim, Bowon Lee:
  Multi-Attention Multimodal Sentiment Analysis. 436-441
- Yongbiao Chen, Sheng Zhang, Zhengwei Qi:
  MAENet: Boosting Feature Representation for Cross-Modal Person Re-Identification with Pairwise Supervision. 442-449
Special Session 2: Activities of Daily Living
- Min-Huan Fu, An-Zi Yen, Hen-Hsen Huang, Hsin-Hsi Chen:
  Incorporating Semantic Knowledge for Visual Lifelog Activity Recognition. 450-456
- Khac-Tuan Nguyen, Dat-Thanh Dinh, Minh N. Do, Minh-Triet Tran:
  Anomaly Detection in Traffic Surveillance Videos with GAN-based Future Frame Prediction. 457-463
- Jiawei Li, Shu-Tao Xia, Qianggang Ding:
  Multi-level Recognition on Falls from Activities of Daily Living. 464-471
- Jonathan Liono, Mohammad Saiedur Rahaman, Flora D. Salim, Yongli Ren, Damiano Spina, Falk Scholer, Johanne R. Trippas, Mark Sanderson, Paul N. Bennett, Ryen W. White:
  Intelligent Task Recognition: Towards Enabling Productivity Assistance in Daily Life. 472-478
- Khanh-An C. Quan, Vinh-Tiep Nguyen, Tan-Cong Nguyen, Tam V. Nguyen, Minh-Triet Tran:
  Flood Level Prediction via Human Pose Estimation from Social Media Images. 479-485
- Vaibhav Pandey, Nitish Nag, Ramesh C. Jain:
  Continuous Health Interface Event Retrieval. 486-494
Special Session 3: Multimedia Information Retrieval for Urban Data
- Shahin Sharifi Noorian, Sihang Qiu, Achilleas Psyllidis, Alessandro Bozzon, Geert-Jan Houben:
  Detecting, Classifying, and Mapping Retail Storefronts Using Street-level Imagery. 495-501
- Naoki Sugimoto, Toru Okubo, Kiyoharu Aizawa:
  Urban Movie Map for Walkers: Route View Synthesis using 360° Videos. 502-508
- Maarten Sukel, Stevan Rudinac, Marcel Worring:
  Urban Object Detection Kit: A System for Collection and Analysis of Street-Level Imagery. 509-516
Special Session 4: Knowledge-Driven Analysis and Retrieval on Multimedia
- Runchen Wei, Ning He, Ke Lu:
  YOLO-mini-tiger: Amur Tiger Detection. 517-524
- Cong Bai, Chao Zeng, Qing Ma, Jinglin Zhang, Shengyong Chen:
  Deep Adversarial Discrete Hashing for Cross-Modal Retrieval. 525-531
- Li Hao, Liping Hou, Yuantao Song, Ke Lu, Jian Xue:
  A Lightweight Gated Global Module for Global Context Modeling in Neural Networks. 532-539
- Youze Wang, Shengsheng Qian, Jun Hu, Quan Fang, Changsheng Xu:
  Fake News Detection via Knowledge-driven Multimodal Graph Convolutional Networks. 540-547
- Jiansheng Dong, Jingling Yuan, Lin Li, Xian Zhong, Weiru Liu:
  Optimizing Queries over Video via Lightweight Keypoint-based Object Detection. 548-554
- Bo Jiang:
  Multi-Graph Group Collaborative Filtering. 555-562
- Haiyan Fu, Ying Li, Hengheng Zhang, Jinfeng Liu, Tao Yao:
  Rank-embedded Hashing for Large-scale Image Retrieval. 563-570
- Yifeng Han, Lin Li, Jianwei Zhang:
  A Coordinated Representation Learning Enhanced Multimodal Machine Translation Approach with Multi-Attention. 571-577
Workshop Summaries
- Ichiro Ide, Yoko Yamakata, Atsushi Hashimoto:
  CEA'20: The 12th Workshop on Multimedia for Cooking and Eating Activities. 578-579
- Minh-Son Dao, Morten Fjeld, Filip Biljecki, Uraz Yavanoglu, Mianxiong Dong:
  ICDAR'20: Intelligent Cross-Data Analysis and Retrieval. 580-581
- Wei-Ta Chu, Ichiro Ide, Naoko Nitta, Norimichi Tsumura, Toshihiko Yamasaki:
  MMArt-ACM'20: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2020. 582-583
- Cathal Gurrin, Tu-Khiem Le, Van-Tu Ninh, Duc-Tien Dang-Nguyen, Björn Þór Jónsson, Jakub Lokoc, Wolfgang Hürst, Minh-Triet Tran, Klaus Schöffmann:
  Introduction to the Third Annual Lifelog Search Challenge (LSC'20). 584-585
