


21st ACM Multimedia 2013: Barcelona, Spain
- Alejandro Jaimes, Nicu Sebe, Nozha Boujemaa, Daniel Gatica-Perez, David A. Shamma, Marcel Worring, Roger Zimmermann:
ACM Multimedia Conference, MM '13, Barcelona, Spain, October 21-25, 2013. ACM 2013, ISBN 978-1-4503-2404-5
Keynote address
- Elizabeth F. Churchill:
Multimedia framed. 1-2
Best paper session
- Luoqi Liu, Hui Xu, Junliang Xing, Si Liu, Xi Zhou, Shuicheng Yan:
"Wow! you are so beautiful today!". 3-12 - Quan Fang, Jitao Sang, Changsheng Xu:
GIANT: geo-informative attributes for location recognition and exploration. 13-22 - Xin Zhao, Xue Li, Chaoyi Pang, Xiaofeng Zhu, Quan Z. Sheng:
Online human gesture recognition from motion data streams. 23-32 - Hanwang Zhang, Zheng-Jun Zha, Yang Yang, Shuicheng Yan, Yue Gao, Tat-Seng Chua:
Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval. 33-42
Experience
- Qianqian Xu, Jiechao Xiong, Qingming Huang, Yuan Yao:
Robust evaluation for quality of experience in crowdsourcing. 43-52 - Wei-Ta Chu, Yu-Kuang Chen, Kuan-Ta Chen:
Size does matter: how image size affects aesthetic perception? 53-62 - Zhonghua Li, Ju-Chiang Wang, Jingli Cai, Zhiyan Duan, Hsin-Min Wang, Ye Wang:
Non-reference audio quality assessment for online live music recordings. 63-72 - Yu-Chuan Su, Tzu-Hsuan Chiu, Yan-Ying Chen, Chun-Yen Yeh, Winston H. Hsu:
Enabling low bitrate mobile visual recognition: a performance versus bandwidth evaluation. 73-82
Music & play
- André Mourão, João Magalhães:
Competitive affective gaming: winning with a smile. 83-92 - Wolfgang Hürst, Joris Dekker:
Tracking-based interaction for object creation in mobile augmented reality. 93-102 - Graham Percival, Nicholas Bailey, George Tzanetakis:
Physical modelling and supervised training of a virtual string quartet. 103-112 - Jordan B. L. Smith, Elaine Chew:
Using quadratic programming to estimate feature relevance in structural analyses of music. 113-122
Similarity search
- Lei Zhang, Yongdong Zhang, Jinhui Tang, Xiaoguang Gu, Jintao Li, Qi Tian:
Topology preserving hashing for similarity search. 123-132 - Jianfeng Wang, Jingdong Wang, Nenghai Yu, Shipeng Li:
Order preserving hashing for approximate nearest neighbor search. 133-142 - Xiaofeng Zhu, Zi Huang, Heng Tao Shen, Xin Zhao:
Linear cross-modal hashing for efficient multimedia search. 143-152 - Pengcheng Wu, Steven C. H. Hoi, Hao Xia, Peilin Zhao, Dayong Wang, Chunyan Miao:
Online multimodal deep similarity learning with application to image retrieval. 153-162
Art, performance, and sports
- Eric Foote, Peter Carr, Patrick Lucey, Yaser Sheikh, Iain A. Matthews:
One-man-band: a touch screen interface for producing live multi-camera sports broadcasts. 163-172 - Hill Hiroki Kobayashi, Michitaka Hirose, Akio Fujiwara, Kazuhiko Nakamura, Kaoru Sezaki, Kaoru Saito:
Tele echo tube: beyond cultural and imaginable boundaries. 173-182 - Min Lin, Zhenzhen Hu, Si Liu, Meng Wang, Richang Hong, Shuicheng Yan:
eHeritage of shadow puppetry: creation and manipulation. 183-192 - Peter Carr, Michael N. Mistry, Iain A. Matthews:
Hybrid robotic/virtual pan-tilt-zoom cameras for autonomous event recording. 193-202
Brave new topics: social and cognitive aspects
- Amarnath Gupta, Ramesh C. Jain:
Social life networks: a multimedia problem? 203-212 - Marco Cristani, Alessandro Vinciarelli, Cristina Segalin, Alessandro Perina:
Unveiling the multimedia unconscious: implicit cognitive processes and multimedia content analysis. 213-222 - Damian Borth, Rongrong Ji, Tao Chen, Thomas M. Breuel, Shih-Fu Chang:
Large-scale visual sentiment ontology and detectors using adjective noun pairs. 223-232 - Xinghai Sun, Changhu Wang, Chao Xu, Lei Zhang:
Indexing billions of images for sketch-based retrieval. 233-242 - Xian-Sheng Hua, Linjun Yang, Jingdong Wang, Jing Wang, Ming Ye, Kuansan Wang, Yong Rui, Jin Li:
Clickage: towards bridging semantic and intent gaps via mining click logs of search engines. 243-252 - Zhaoquan Yuan, Jitao Sang, Yan Liu, Changsheng Xu:
Latent feature learning in social media network. 253-262
Action and event recognition
- Xiaodan Liang, Liang Lin, Liangliang Cao:
Learning latent spatio-temporal compositional model for human action recognition. 263-272 - Xu Zhao, Yuncai Liu, Yun Fu:
Exploring discriminative pose sub-patterns for effective action classification. 273-282 - Raj Kumar Gupta, Alex Yong Sang Chia, Deepu Rajan:
Human activities recognition using depth images. 283-292 - Zhigang Ma, Yi Yang, Zhongwen Xu, Nicu Sebe, Alexander G. Hauptmann:
We are not equally negative: fine-grained labeling for multimedia event detection. 293-302
Streaming and synchronization
- Zhen Wei Zhao, Wei Tsang Ooi:
Joserlin: joint request and service scheduling for peer-to-peer non-linear media access. 303-312 - Moonkyung Ryu, Umakishore Ramachandran:
FlashStream: a multi-tiered storage architecture for adaptive HTTP streaming. 313-322 - Mario Montagud, Fernando Boronat, Hans Stokking:
Early event-driven (EED) RTCP feedback for rapid IDMS. 323-332 - Marian Florin Ursu, Martin Groen, Manolis Falelakis, Michael Frantzis, Vilmos Zsombori, Rene Kaiser:
Orchestration: tv-like mixing grammars applied to video-communication for social groups. 333-342
Keynote address
- Leonidas J. Guibas:
The space between the images. 343-344
Multimedia grand challenge
- Xiaoyin Che, Haojin Yang, Christoph Meinel:
Lecture video segmentation by automatically analyzing the synchronized slides. 345-348 - Chien-Nan (Shannon) Chen, Pengye Xia, Klara Nahrstedt:
Activity-aware adaptive compression: a morphing-based frame synthesis application in 3DTI. 349-352 - Yu-Chuan Su, Tzu-Hsuan Chiu, Guan-Long Wu, Chun-Yen Yeh, Felix Wu, Winston H. Hsu:
Flickr-tag prediction using multi-modal fusion and meta information. 353-356 - Brendan Jou, Hongzhi Li, Joseph G. Ellis, Daniel Morozoff-Abegauz, Shih-Fu Chang:
Structured exploration of who, what, when, and where in heterogeneous multimedia news sources. 357-360 - Subhabrata Bhattacharya, Behnaz Nojavanasghari, Tao Chen, Dong Liu, Shih-Fu Chang, Mubarak Shah:
Towards a comprehensive computational model for aesthetic assessment of videos. 361-364 - Chidansh Amitkumar Bhatt, Andrei Popescu-Belis, Maryam Habibi, Sandy Ingram, Stefano Masneri, Fergus McInnes, Nikolaos Pappas, Oliver Schreer:
Multi-factor segmentation for topic visualization and recommendation: the MUST-VIS system. 365-368 - Yanran Wang, Qi Dai, Rui Feng, Yu-Gang Jiang:
Beauty is here: evaluating aesthetics in videos using multimodal features and free training data. 369-372 - Kong-Wah Wan, Wei-Yun Yau, Sujoy Roy:
Metadata enrichment for news video retrieval: a graph-based propagation approach. 373-376 - Zong Bo Hao, Qianni Zhang, Ebroul Izquierdo, Nan Sang:
Human action recognition by fast dense trajectories. 377-380 - Eleni Mantziou, Symeon Papadopoulos, Yiannis Kompatsiaris:
Scalable training with approximate incremental laplacian eigenmaps and PCA. 381-384 - Gökhan Yildirim, Appu Shaji, Sabine Süsstrunk:
Estimating beauty ratings of videos using supervoxels. 385-388 - Litian Sun, Kiyoharu Aizawa:
Action recognition using invariant features under unexampled viewing conditions. 389-392 - Chun-Che Wu, Kuan-Yu Chu, Yin-Hsi Kuo, Yan-Ying Chen, Wen-Yu Lee, Winston H. Hsu:
Search-based relevance association with auxiliary contextual cues. 393-396 - Yingwei Pan, Ting Yao, Kuiyuan Yang, Houqiang Li, Chong-Wah Ngo, Jingdong Wang, Tao Mei:
Image search by graph-based label propagation with image representation from DNN. 397-400
Demos
- Chia-Ju Lu, Chih-Fan Hsu, Mei-Chen Yeh:
Real-time salient object detection. 401-402 - Kiia Korpi, Kiyoharu Aizawa:
Kanji snap: an OCR-based smartphone application for learning Japanese kanji characters. 403-404 - Marco A. Hudelist, Klaus Schoeffmann, Laszlo Boeszoermenyi:
Mobile video browsing with the ThumbBrowser. 405-406 - Che-Hao Hsu, Kai-Lung Hua, Wen-Huang Cheng:
Physiognomy master: a novel personality analysis system based on facial features. 407-408 - Wu Liu, Feibin Yang, Yongdong Zhang, Qinghua Huang, Tao Mei:
LAVES: an instant mobile video search system based on layered audio-video indexing. 409-410 - Frederic Font, Gerard Roma, Xavier Serra:
Freesound technical demo. 411-412 - Yu You, Ville-Veikko Mattila:
Visualizing web mash-ups for in-situ vision-based mobile AR applications. 413-414 - Oscar Mayor, Quim Llimona, Marco Marchini, Panagiotis Papiotis, Esteban Maestre:
repoVizz: a framework for remote storage, browsing, annotation, and exchange of multi-modal data. 415-416 - Pierre Letessier, Nicolas Hervé, Julien Champ, Alexis Joly, Olivier Buisson, Amel Hamzaoui:
Small objects query suggestion in a large web-image collection. 417-418 - AmirHossein Habibian, Cees G. M. Snoek:
Video2Sentence and vice versa. 419-420 - Christoph Korinke, Mohammad Rabbath, Dennis Lamken, Susanne Boll:
A tool for catching back your preferred videos from physical collages. 421-422 - Hervé Goëau, Pierre Bonnet, Alexis Joly, Vera Bakic, Julien Barbe, Itheri Yahiaoui, Souheil Selmi, Jennifer Carré, Daniel Barthélémy, Nozha Boujemaa, Jean-François Molino, Grégoire Duché, Aurélien Péronnet:
Pl@ntNet mobile app. 423-424 - Benjamin Guthier, Kalun Ho, Stephan Kopf, Wolfgang Effelsberg:
Determining exposure values from HDR histograms for smartphone photography. 425-426 - Julien Law-To, Gregory Grefenstette, Rémi Landais:
Semantic dispatching of multimedia news with MEWS. 427-428 - Peng Wu, Rares Vernica, Qian Lin:
Cloud based multimedia analytic platform. 429-430 - Sandro Hardy, Stefan Göbel, Ralf Steinmetz:
Adaptable and personalized game-based training system for fall prevention. 431-432 - Chao Dong, Shifeng Chen, Xiaoou Tang:
AdVisual: a visual-based advertising system. 433-434 - Yichao Jin, Tian Xie, Yonggang Wen, Haiyong Xie:
Multi-screen cloud social TV: transforming TV experience into 21st century. 435-436 - Luoqi Liu, Hui Xu, Si Liu, Junliang Xing, Xi Zhou, Shuicheng Yan:
"Wow! you are so beautiful today!". 437-438 - Guanfeng Wang, Beomjoo Seo, Yifang Yin, Roger Zimmermann, Zhijie Shen:
OSCOR: an orientation sensor data correction system for mobile generated contents. 439-440 - Nicolas Hervé, Marie-Luce Viaud, Jérôme Thièvre, Agnès Saulnier, Julien Champ, Pierre Letessier, Olivier Buisson, Alexis Joly:
OTMedia: the French TransMedia news observatory. 441-442 - Pengye Xia, Klara Nahrstedt:
TEEVE endpoint: towards the ease of 3D tele-immersive application development. 443-444 - Zhenzhen Hu, Min Lin, Si Liu, Meng Wang, Richang Hong, Shuicheng Yan:
eHeritage of shadow puppetry: creation and manipulation. 445-446 - Jules Françoise, Norbert Schnell, Frédéric Bevilacqua:
Gesture-based control of physical modeling sound synthesis: a mapping-by-demonstration approach. 447-448 - Hongzhi Li, Brendan Jou, Joseph G. Ellis, Daniel Morozoff, Shih-Fu Chang:
News rover: exploring topical structures and serendipity in heterogeneous multimedia news. 449-450 - Marco Bertini, Alberto Del Bimbo, Andrea Ferracani, Francesco Gelli, Daniele Maddaluno, Daniele Pezzatini:
A novel framework for collaborative video recommendation, interest discovery and friendship suggestion based on semantic profiling. 451-452 - Marco Bertini, Alberto Del Bimbo, George Ioannidis, Emile Bijk, Isabel Trancoso, Hugo Meinedo:
euTV: a system for media monitoring and publishing. 453-454 - Giuliano Armano, Alessandro Giuliani, Alberto Messina, Maurizio Montagnuolo:
CAMMA: contextual advertising system for multimodal news aggregations. 455-456 - Alberto Del Bimbo, Andrea Ferracani, Daniele Pezzatini:
Flarty: recommending art routes using check-ins latent topics. 457-458 - Damian Borth, Tao Chen, Rongrong Ji, Shih-Fu Chang:
SentiBank: large-scale ontology and classifiers for detecting sentiment and emotions in visual content. 459-460 - Junsheng Fu, Lixin Fan, Yu You, Kimmo Roimela:
Augmented and interactive video playback based on global camera pose. 461-462 - Matt C. Yu, Peter Vajda, David M. Chen, Sam S. Tsai, Maryam Daneshi, André F. Araújo, Huizhong Chen, Bernd Girod:
EigenNews: a personalized news video delivery platform. 463-464 - André Mourão, João Magalhães:
NovaEmötions: winning with a smile. 465-466 - Jia Chen, Qin Jin, Weipeng Zhang, Shenghua Bao, Zhong Su, Yong Yu:
Tell me what happened here in history. 467-468 - Xiaoyan Wang, Lifeng Sun, Shou Wang:
Group TV: a cloud based social TV for group social experience. 469-470 - Xingyu Gao, Juan Cao, Zhiwei Jin, Xin Li, Jintao Li:
GeSoDeck: a geo-social event detection and tracking system. 471-472 - You Yang, Qiong Liu, Yue Gao, Binbin Xiong, Li Yu, Huan-Bo Luan, Rongrong Ji, Qi Tian:
Stereotime: a wireless 2D and 3D switchable video communication system. 473-474 - Xinghai Sun, Changhu Wang, Avneesh Sud, Chao Xu, Lei Zhang:
MagicBrush: image search by color sketch. 475-476 - Duong-Trung-Dung Nguyen, Mukesh Kumar Saini, Vu-Thanh Nguyen, Wei Tsang Ooi:
Jiku director: a mobile video mashup system. 477-478 - Huijie Lin, Jia Jia, Hanyu Liao, Lianhong Cai:
WeCard: a multimodal solution for making personalized electronic greeting cards. 479-480
Posters
- Xirong Li, Cees G. M. Snoek:
Classifying tag relevance with relevant positive and negative examples. 485-488 - Tao Zhu, Yanning Zhang, Peng Zhang, Wei Huang, Hichem Sahli:
Non-rigid target tracking based on 'flow-cut' in pair-wise frames with online hough forests. 489-492 - Jingjing Chen, Yahong Han, Xiaochun Cao, Qi Tian:
Object coding on the semantic graph for scene classification. 493-496 - Chunjie Zhang, Shuhui Wang, Chao Liang, Jing Liu, Qingming Huang, Haojie Li, Qi Tian:
Beyond bag of words: image representation in sub-semantic space. 497-500 - Darshan Santani, Daniel Gatica-Perez:
Speaking swiss: languages and venues in foursquare. 501-504 - Zhendong Mao, Yongdong Zhang, Qi Tian:
What are the distance metrics for local features? 505-508 - Ye Luo, Junsong Yuan:
Salient object detection in videos by optimal spatio-temporal path discovery. 509-512 - Ali Fakeri-Tabrizi, Massih-Reza Amini, Patrick Gallinari:
Multiview semi-supervised ranking for automatic image annotation. 513-516 - Raynor Vliegendhart, Babak Loni, Martha A. Larson, Alan Hanjalic:
How do we deep-link?: leveraging user-contributed time-links for non-linear video access. 517-520 - Xiaodan Zhuang, Shuang Wu, Pradeep Natarajan:
Compact bag-of-words visual representation for effective linear classification. 521-524 - Do Hang Nga, Keiji Yanai:
Large-scale web video shot ranking based on visual features and tag co-occurrence. 525-528 - Shanmin Pang, Jianru Xue, Nanning Zheng, Qi Tian:
Locality preserving verification for image search. 529-532 - Chunjie Zhang, Yifan Zhang, Shuhui Wang, Junbiao Pang, Chao Liang, Qingming Huang, Qi Tian:
Undo the codebook bias by linear transformation for visual applications. 533-536 - Yan Yan, Zhongwen Xu, Gaowen Liu, Zhigang Ma, Nicu Sebe:
GLocal structural feature selection with sparsity for multimedia data understanding. 537-540 - Jonathan Driedger, Harald Grohganz, Thomas Prätzlich, Sebastian Ewert, Meinard Müller:
Score-informed audio decomposition and applications. 541-544 - Zhixiang Ren, Liang-Tien Chia, Deepu Rajan, Shenghua Gao:
Background subtraction via coherent trajectory decomposition. 545-548 - Xiaojie Guo, Siyuan Li, Xiaochun Cao:
Motion matters: a novel framework for compressing surveillance videos. 549-552 - Viet Anh Nguyen, Shengkui Zhao, Tien Dung Vu, Douglas L. Jones, Minh N. Do:
Spatialized audio multiparty teleconferencing with commodity miniature microphone array. 553-556 - Davide Baltieri, Roberto Vezzani, Rita Cucchiara:
Learning articulated body models for people re-identification. 557-560 - Zhanpeng Zhang, Wei Zhang, Jianzhuang Liu, Xiaoou Tang:
Facial landmark localization based on hierarchical pose regression with cascaded random ferns. 561-564 - Akisato Kimura, Katsuhiko Ishiguro, Makoto Yamada, Alejandro Marcos Alvarez, Kaori Kataoka, Kazuhiko Murasaki:
Image context discovery from socially curated contents. 565-568 - Lu Li, Jianru Xue, Zhiqiang Tian, Nanning Zheng:
Moment feature based forensic detection of resampled digital images. 569-572 - Adrian Popescu, Aymen Shabou:
Towards precise POI localization with social media. 573-576 - Wan-Lei Zhao, Hervé Jégou, Guillaume Gravier:
Sim-min-hash: an efficient matching technique for linking large image collections. 577-580 - Ramya Srinivasan, Amit K. Roy-Chowdhury, Conrad Rudolph, Jeanette Kohl:
Recognizing the royals: leveraging computerized face recognition for identifying subjects in ancient artworks. 581-584 - Asier Marzo Pérez, Oscar Ardaiz:
CollARt: a tool for creating 3D photo collages using mobile augmented reality. 585-588 - Qiang Chen, Yang Cai, Lisa M. Brown, Ankur Datta, Quanfu Fan, Rogério Schmidt Feris, Shuicheng Yan, Alexander G. Hauptmann, Sharath Pankanti:
Spatio-temporal fisher vector coding for surveillance event detection. 589-592 - Lin Wu, Yang Wang, John Shepherd:
Efficient image and tag co-ranking: a bregman divergence optimization method. 593-596