


default search action
23rd MMM 2018: Bangkok, Thailand
- Klaus Schoeffmann, Thanarat H. Chalidabhongse, Chong-Wah Ngo, Supavadee Aramvith, Noel E. O'Connor, Yo-Sung Ho, Moncef Gabbouj

, Ahmed Elgammal:
MultiMedia Modeling - 24th International Conference, MMM 2018, Bangkok, Thailand, February 5-7, 2018, Proceedings, Part I. Lecture Notes in Computer Science 10704, Springer 2018, ISBN 978-3-319-73602-0
Full Papers Accepted for Oral Presentation
- Shurong Sheng, Aparna Nurani Venkitasubramanian, Marie-Francine Moens:

A Markov Network Based Passage Retrieval Method for Multimodal Question Answering in the Cultural Heritage Domain. 3-15 - En Shi, Qian Li

, Daquan Gu, Zhangming Zhao:
A Method of Weather Radar Echo Extrapolation Based on Convolutional Neural Networks. 16-28 - Konstantinos Apostolidis, Evlampios Apostolidis

, Vasileios Mezaris:
A Motion-Driven Approach for Fine-Grained Temporal Segmentation of User-Generated Videos. 29-41 - Lianglei Wei, Yirui Wu, Wenhai Wang, Tong Lu:

A Novel 3D Human Action Recognition Framework for Video Content Analysis. 42-53 - Dorian Michaud, Thierry Urruty, François Lecellier, Philippe Carré:

Adaptive Image Representation Using Information Gain and Saliency: Application to Cultural Heritage Datasets. 54-66 - Peng Yao, Hua Zhang, Yanbing Xue, Shengyong Chen:

AGO: Accelerating Global Optimization for Accurate Stereo Matching. 67-80 - Wanzhao Yang, Weiping Tu, Jiaxi Zheng, Xiong Zhang, Yuhong Yang, Yucheng Song:

An RNN-Based Speech-Music Discrimination Used for Hybrid Audio Coder. 81-92 - Jiang Zhu, Wei Zhai, Yang Cao, Zheng-Jun Zha

:
Co-occurrent Structural Edge Detection for Color-Guided Depth Map Super-Resolution. 93-105 - Kaiping Xu, Zheng Qin, Guolong Wang, Kai Huang, Shuxiong Ye, Huidi Zhang:

Collision-Free LSTM for Human Trajectory Prediction. 106-116 - Tae Kwan Lee, Wissam J. Baddar, Seong Tae Kim

, Yong Man Ro
:
Convolution with Logarithmic Filter Groups for Efficient Shallow CNN. 117-129 - Junjie Zhao, Yuxin Peng:

Cost-Sensitive Deep Metric Learning for Fine-Grained Image Classification. 130-141 - Meng Wei, Yu Kang, Weiguo Song, Yang Cao:

Crowd Distribution Estimation with Multi-scale Recursive Convolutional Neural Network. 142-153 - Yuhua Jia, Liang Bai, Peng Wang, Jinlin Guo, Yuxiang Xie:

Deep Convolutional Neural Network for Correlating Images and Sentences. 154-165 - Weijie Kong, Nannan Li, Thomas H. Li, Ge Li:

Deep Pedestrian Detection Using Contextual Information and Multi-level Features. 166-177 - Hua Yuan, Yuanyuan Zhou, Yun Sheng, Guixu Zhang:

Dual-Way Guided Depth Image Inpainting with RGBD Image Pairs. 178-189 - Ryosuke Furuta, Naoto Inoue, Toshihiko Yamasaki:

Efficient and Interactive Spatial-Semantic Image Retrieval. 190-202 - Sabrina Kletz, Andreas Leibetseder, Klaus Schoeffmann:

Evaluation of Visual Content Descriptors for Supporting Ad-Hoc Video Search Tasks at the Video Browser Showdown. 203-215 - Saumya Rawat, Siddhartha Gairola

, Rajvi Shah, P. J. Narayanan:
Find Me a Sky: A Data-Driven Method for Color-Consistent Sky Search and Replacement. 216-228 - Yizhi Wang, Zhouhui Lian, Yingmin Tang, Jianguo Xiao:

Font Recognition in Natural Images via Transfer Learning. 229-240 - Manfred Jürgen Primus, Doris Putzgruber-Adamitsch, Mario Taschwer, Bernd Münzer, Yosuf El-Shabrawi, László Böszörményi, Klaus Schoeffmann:

Frame-Based Classification of Operation Phases in Cataract Surgery Videos. 241-253 - Jong-Hee Back, Sunho Kim, Yo-Sung Ho:

High-Precision 3D Coarse Registration Using RANSAC and Randomly-Picked Rejections. 254-266 - Huidi Fang, Chaoran Cui, Xiang Deng

, Xiushan Nie, Muwei Jian
, Yilong Yin:
Image Aesthetic Distribution Prediction with Fully Convolutional Network. 267-278 - Laura Pérez-Mayos, Federico M. Sukno

, Leo Wanner:
Improving the Quality of Video-to-Language Models by Optimizing Annotation of the Training Material. 279-290 - Mofei Song

, Zhengxing Sun, Bo Li, Jiagao Hu
:
Iterative Active Classification of Large Image Collection. 291-304 - Amorntip Prayoonwong, Cheng-Hsien Wang, Chih-Yi Chiu

:
Learning to Index in Large-Scale Datasets. 305-316 - Jianshe Zhou, Tuya Naren, Xianyu Chen, Yike Ma, Jie Liu, Feng Dai:

Light Field Foreground Matting Based on Defocus and Correspondence. 317-328 - Peng Cheng, Wu Liu, Yifan Zhang, Huadong Ma:

LOCO: Local Context Based Faster R-CNN for Small Traffic Sign Detection. 329-341 - Yongfei Zhang

, Zhe Li:
Multi-hypothesis-Based Error Concealment for Whole Frame Loss in HEVC. 342-354 - Jinna Lv, Wu Liu, Lili Zhou, Bin Wu, Huadong Ma:

Multi-stream Fusion Model for Social Relation Recognition from Videos. 355-368 - Geert Lugtenberg, Wolfgang Hürst, Nina Rosa, Christian Sandor

, Alexander Plopski, Takafumi Taketomi, Hirokazu Kato
:
Multimodal Augmented Reality - Augmenting Auditory-Tactile Feedback to Change the Perception of Thickness. 369-380 - Jianjun Li, Lanlan Xu, Haojie Li, Chin-Chen Chang, Fuming Sun

:
Parameter Selection for Denoising Algorithms Using NR-IQA with CNN. 381-392 - Itsara Wichakam, Teerapong Panboonyuen

, Can Udomcharoenchaikit, Peerapon Vateekul
:
Real-Time Polyps Segmentation for Colonoscopy Video Frames Using Compressed Fully Convolutional Network. 393-404 - Yuxin Yuan, Yuxin Peng:

Recursive Pyramid Network with Joint Attention for Cross-Media Retrieval. 405-416 - Qi Zheng, Jun Chen, Junjun Jiang

, Ruimin Hu:
Reinforcing Pedestrian Parsing on Small Scale Dataset. 417-427 - Xiangyu Liu, Yunhong Wang, Qingjie Liu:

Remote Sensing Image Fusion Based on Two-Stream Fusion Network. 428-439 - Peng Wu, Di Huang, Yunhong Wang:

REVT: Robust and Efficient Visual Tracking by Region-Convolutional Regression Network. 440-452 - Dongmei Huang, Yan Wang, Wei Song

, Jean Sequeira, Sébastien Mavromatis
:
Shallow-Water Image Enhancement Using Relative Global Histogram Stretching Based on Adaptive Parameter Acquisition. 453-465 - Lintao Guo, Hunter Quant, Nikolas Lamb

, Benjamin Lowit, Sean Banerjee, Natasha Kholgade Banerjee:
Spatiotemporal 3D Models of Aging Fruit from Multi-view Time-Lapse Videos. 466-478 - Kewei Yang, Zhengxing Sun, Shuang Wang

, Bo Li:
Stitch-Based Image Stylization for Thread Art Using Sparse Modeling. 479-492 - Hong Joo Lee, Wissam J. Baddar, Hak Gu Kim, Seong Tae Kim

, Yong Man Ro
:
Teacher and Student Joint Learning for Compact Facial Landmark Detection Network. 493-504 - Zhengcai Qin, Bin Wu

, Meng Li:
Text Image Deblurring via Intensity Extremums Prior. 505-517 - Dries Hulens, Bram Aerts, Punarjay Chakravarty, Ali Diba, Toon Goedemé

, Tom Roussel, Jeroen Zegers
, Tinne Tuytelaars
, Luc Van Eycken, Luc Van Gool, Hugo Van hamme
, Joost Vennekens
:
The CAMETRON Lecture Recording System: High Quality Video Recording and Editing with Minimal Human Supervision. 518-530 - Magzhan Kairanbay, John See

, Lai-Kuan Wong
:
Towards Demographic-Based Photographic Aesthetics Prediction for Portraitures. 531-543 - Xiaoyu Qi, Deshun Yang, Xiaoou Chen:

Triplet Convolutional Network for Music Version Identification. 544-555 - Yujing Chen, Jing Xiao, Gen Zhan, Xu Wang, Zhongyuan Wang:

Two-Level Segment-Based Bitrate Control for Live ABR Streaming. 556-564 - Jianjun Chen, Hongtao Xie, Yue Hu, Chenggang Yan:

Uyghur Text Localization with Fast Component Detection. 565-577
SS: Multimedia Analytics: Perspectives, Techniques and Applications
- Rashmi Gupta

, Cathal Gurrin
:
Approaches for Event Segmentation of Visual Lifelog Data. 581-593 - Masoud Mazloom, Iliana Pappi, Marcel Worring

:
Category Specific Post Popularity Prediction. 594-607 - Feiyan Hu

, Alan F. Smeaton:
Image Aesthetics and Content in Selecting Memorable Keyframes from Lifelogs. 608-619 - Werner Bailer:

On the Traceability of Results from Deep Learning-Based Cloud Services. 620-631 - Stevan Rudinac, Tat-Seng Chua, Nicolás E. Díaz Ferreyra

, Gerald Friedland, Tatjana Gornostaja, Benoit Huet, Rianne Kaptein, Krister Lindén
, Marie-Francine Moens, Jaakko Peltonen
, Miriam Redi, Markus Schedl, David A. Shamma, Alan F. Smeaton, Lexing Xie
:
Rethinking Summarization and Storytelling for Modern Social Multimedia. 632-644

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














