


default search action
21st ACM Multimedia 2013: Barcelona, Spain
- Alejandro Jaimes, Nicu Sebe, Nozha Boujemaa, Daniel Gatica-Perez, David A. Shamma, Marcel Worring, Roger Zimmermann:

ACM Multimedia Conference, MM '13, Barcelona, Spain, October 21-25, 2013. ACM 2013, ISBN 978-1-4503-2404-5
Keynote address
- Elizabeth F. Churchill:

Multimedia framed. 1-2
Best paper session
- Luoqi Liu, Hui Xu, Junliang Xing, Si Liu, Xi Zhou, Shuicheng Yan:

"Wow! you are so beautiful today!". 3-12 - Quan Fang, Jitao Sang, Changsheng Xu:

GIANT: geo-informative attributes for location recognition and exploration. 13-22 - Xin Zhao, Xue Li

, Chaoyi Pang, Xiaofeng Zhu
, Quan Z. Sheng
:
Online human gesture recognition from motion data streams. 23-32 - Hanwang Zhang

, Zheng-Jun Zha
, Yang Yang, Shuicheng Yan, Yue Gao, Tat-Seng Chua:
Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval. 33-42
Experience
- Qianqian Xu, Jiechao Xiong, Qingming Huang, Yuan Yao:

Robust evaluation for quality of experience in crowdsourcing. 43-52 - Wei-Ta Chu

, Yu-Kuang Chen, Kuan-Ta Chen:
Size does matter: how image size affects aesthetic perception? 53-62 - Zhonghua Li, Ju-Chiang Wang, Jingli Cai, Zhiyan Duan, Hsin-Min Wang

, Ye Wang
:
Non-reference audio quality assessment for online live music recordings. 63-72 - Yu-Chuan Su, Tzu-Hsuan Chiu, Yan-Ying Chen, Chun-Yen Yeh, Winston H. Hsu:

Enabling low bitrate mobile visual recognition: a performance versus bandwidth evaluation. 73-82
Music & play
- André Mourão

, João Magalhães:
Competitive affective gaming: winning with a smile. 83-92 - Wolfgang Hürst, Joris Dekker:

Tracking-based interaction for object creation in mobile augmented reality. 93-102 - Graham Percival, Nicholas Bailey, George Tzanetakis:

Physical modelling and supervised training of a virtual string quartet. 103-112 - Jordan B. L. Smith, Elaine Chew

:
Using quadratic programming to estimate feature relevance in structural analyses of music. 113-122
Similarity search
- Lei Zhang

, Yongdong Zhang, Jinhui Tang
, Xiaoguang Gu, Jintao Li, Qi Tian:
Topology preserving hashing for similarity search. 123-132 - Jianfeng Wang, Jingdong Wang

, Nenghai Yu, Shipeng Li
:
Order preserving hashing for approximate nearest neighbor search. 133-142 - Xiaofeng Zhu

, Zi Huang
, Heng Tao Shen, Xin Zhao:
Linear cross-modal hashing for efficient multimedia search. 143-152 - Pengcheng Wu, Steven C. H. Hoi

, Hao Xia, Peilin Zhao, Dayong Wang, Chunyan Miao
:
Online multimodal deep similarity learning with application to image retrieval. 153-162
Art, performance, and sports
- Eric Foote, Peter Carr, Patrick Lucey, Yaser Sheikh, Iain A. Matthews:

One-man-band: a touch screen interface for producing live multi-camera sports broadcasts. 163-172 - Hill Hiroki Kobayashi, Michitaka Hirose, Akio Fujiwara, Kazuhiko Nakamura, Kaoru Sezaki, Kaoru Saito:

Tele echo tube: beyond cultural and imaginable boundaries. 173-182 - Min Lin, Zhenzhen Hu, Si Liu, Meng Wang, Richang Hong, Shuicheng Yan:

eHeritage of shadow puppetry: creation and manipulation. 183-192 - Peter Carr, Michael N. Mistry, Iain A. Matthews:

Hybrid robotic/virtual pan-tilt-zom cameras for autonomous event recording. 193-202
Brave new topics: social and cognitive aspects
- Amarnath Gupta, Ramesh C. Jain:

Social life networks: a multimedia problem? 203-212 - Marco Cristani, Alessandro Vinciarelli, Cristina Segalin

, Alessandro Perina:
Unveiling the multimedia unconscious: implicit cognitive processes and multimedia content analysis. 213-222 - Damian Borth, Rongrong Ji, Tao Chen, Thomas M. Breuel, Shih-Fu Chang:

Large-scale visual sentiment ontology and detectors using adjective noun pairs. 223-232 - Xinghai Sun, Changhu Wang, Chao Xu, Lei Zhang:

Indexing billions of images for sketch-based retrieval. 233-242 - Xian-Sheng Hua, Linjun Yang, Jingdong Wang

, Jing Wang, Ming Ye, Kuansan Wang, Yong Rui, Jin Li:
Clickage: towards bridging semantic and intent gaps via mining click logs of search engines. 243-252 - Zhaoquan Yuan, Jitao Sang, Yan Liu, Changsheng Xu:

Latent feature learning in social media network. 253-262
Action and event recognition
- Xiaodan Liang, Liang Lin, Liangliang Cao:

Learning latent spatio-temporal compositional model for human action recognition. 263-272 - Xu Zhao, Yuncai Liu, Yun Fu:

Exploring discriminative pose sub-patterns for effective action classification. 273-282 - Raj Kumar Gupta, Alex Yong Sang Chia, Deepu Rajan:

Human activities recognition using depth images. 283-292 - Zhigang Ma, Yi Yang, Zhongwen Xu, Nicu Sebe

, Alexander G. Hauptmann:
We are not equally negative: fine-grained labeling for multimedia event detection. 293-302
Streaming and synchronization
- Zhen Wei Zhao, Wei Tsang Ooi

:
Joserlin: joint request and service scheduling for peer-to-peer non-linear media access. 303-312 - Moonkyung Ryu, Umakishore Ramachandran:

FlashStream: a multi-tiered storage architecture for adaptive HTTP streaming. 313-322 - Mario Montagud, Fernando Boronat

, Hans Stokking:
Early event-driven (EED) RTCP feedback for rapid IDMS. 323-332 - Marian Florin Ursu, Martin Groen

, Manolis Falelakis, Michael Frantzis, Vilmos Zsombori, Rene Kaiser
:
Orchestration: tv-like mixing grammars applied to video-communication for social groups. 333-342
Keynote address
- Leonidas J. Guibas:

The space between the images. 343-344
Multimedia grand challenge
- Xiaoyin Che, Haojin Yang, Christoph Meinel:

Lecture video segmentation by automatically analyzing the synchronized slides. 345-348 - Chien-Nan (Shannon) Chen, Pengye Xia, Klara Nahrstedt:

Activity-aware adaptive compression: a morphing-based frame synthesis application in 3DTI. 349-352 - Yu-Chuan Su, Tzu-Hsuan Chiu, Guan-Long Wu, Chun-Yen Yeh, Felix Wu, Winston H. Hsu:

Flickr-tag prediction using multi-modal fusion and meta information. 353-356 - Brendan Jou, Hongzhi Li, Joseph G. Ellis, Daniel Morozoff-Abegauz, Shih-Fu Chang:

Structured exploration of who, what, when, and where in heterogeneous multimedia news sources. 357-360 - Subhabrata Bhattacharya, Behnaz Nojavanasghari, Tao Chen, Dong Liu, Shih-Fu Chang, Mubarak Shah:

Towards a comprehensive computational model foraesthetic assessment of videos. 361-364 - Chidansh Amitkumar Bhatt, Andrei Popescu-Belis

, Maryam Habibi, Sandy Ingram
, Stefano Masneri
, Fergus McInnes, Nikolaos Pappas
, Oliver Schreer
:
Multi-factor segmentation for topic visualization and recommendation: the MUST-VIS system. 365-368 - Yanran Wang, Qi Dai, Rui Feng, Yu-Gang Jiang:

Beauty is here: evaluating aesthetics in videos using multimodal features and free training data. 369-372 - Kong-Wah Wan, Wei-Yun Yau, Sujoy Roy:

Metadata enrichment for news video retrieval: a graph-based propagation approach. 373-376 - Zong Bo Hao, Qianni Zhang

, Ebroul Izquierdo, Nan Sang:
Human action recognition by fast dense trajectories. 377-380 - Eleni Mantziou, Symeon Papadopoulos, Yiannis Kompatsiaris:

Scalable training with approximate incremental laplacian eigenmaps and PCA. 381-384 - Gökhan Yildirim, Appu Shaji, Sabine Süsstrunk:

Estimating beauty ratings of videos using supervoxels. 385-388 - Litian Sun, Kiyoharu Aizawa:

Action recognition using invariant features under unexampled viewing conditions. 389-392 - Chun-Che Wu, Kuan-Yu Chu, Yin-Hsi Kuo, Yan-Ying Chen, Wen-Yu Lee, Winston H. Hsu:

Search-based relevance association with auxiliary contextual cues. 393-396 - Yingwei Pan

, Ting Yao, Kuiyuan Yang, Houqiang Li, Chong-Wah Ngo
, Jingdong Wang
, Tao Mei:
Image search by graph-based label propagation with image representation from DNN. 397-400
Demos
- Chia-Ju Lu, Chih-Fan Hsu, Mei-Chen Yeh

:
Real-time salient object detection. 401-402 - Kiia Korpi, Kiyoharu Aizawa:

Kanji snap: an OCR-based smartphone application for learning Japanese kanji characters. 403-404 - Marco A. Hudelist, Klaus Schoeffmann, Laszlo Boeszoermenyi:

Mobile video browsing with the ThumbBrowser. 405-406 - Che-Hao Hsu, Kai-Lung Hua

, Wen-Huang Cheng:
Physiognomy master: a novel personality analysis system based on facial features. 407-408 - Wu Liu, Feibin Yang, Yongdong Zhang, Qinghua Huang, Tao Mei:

LAVES: an instant mobile video search system based on layered audio-video indexing. 409-410 - Frederic Font

, Gerard Roma
, Xavier Serra
:
Freesound technical demo. 411-412 - Yu You, Ville-Veikko Mattila:

Visualizing web mash-ups for in-situ vision-based mobile AR applications. 413-414 - Oscar Mayor, Quim Llimona, Marco Marchini, Panagiotis Papiotis, Esteban Maestre

:
repoVizz: a framework for remote storage, browsing, annotation, and exchange of multi-modal data. 415-416 - Pierre Letessier, Nicolas Hervé

, Julien Champ
, Alexis Joly, Olivier Buisson, Amel Hamzaoui:
Small objects query suggestion in a large web-image collection. 417-418 - AmirHossein Habibian, Cees G. M. Snoek:

Video2Sentence and vice versa. 419-420 - Christoph Korinke, Mohammad Rabbath, Dennis Lamken, Susanne Boll:

A tool for catching back your preferred videos from physical collages. 421-422 - Hervé Goëau

, Pierre Bonnet
, Alexis Joly, Vera Bakic, Julien Barbe
, Itheri Yahiaoui, Souheil Selmi, Jennifer Carré, Daniel Barthélémy, Nozha Boujemaa, Jean-François Molino
, Grégoire Duché, Aurélien Péronnet:
Pl@ntNet mobile app. 423-424 - Benjamin Guthier, Kalun Ho, Stephan Kopf, Wolfgang Effelsberg:

Determining exposure values from HDR histograms for smartphone photography. 425-426 - Julien Law-To, Gregory Grefenstette

, Rémi Landais:
Semantic dispatching of multimedia news with MEWS. 427-428 - Peng Wu, Rares Vernica, Qian Lin:

Cloud based multimedia analytic platform. 429-430 - Sandro Hardy, Stefan Göbel, Ralf Steinmetz

:
Adaptable and personalized game-based training system for fall prevention. 431-432 - Chao Dong, Shifeng Chen, Xiaoou Tang:

AdVisual: a visual-based advertising system. 433-434 - Yichao Jin, Tian Xie, Yonggang Wen, Haiyong Xie:

Multi-screen cloud social TV: transforming TV experience into 21st century. 435-436 - Luoqi Liu, Hui Xu, Si Liu, Junliang Xing, Xi Zhou, Shuicheng Yan:

"Wow! you are so beautiful today!". 437-438 - Guanfeng Wang, Beomjoo Seo, Yifang Yin

, Roger Zimmermann
, Zhijie Shen:
OSCOR: an orientation sensor data correction system for mobile generated contents. 439-440 - Nicolas Hervé

, Marie-Luce Viaud, Jérôme Thièvre, Agnès Saulnier, Julien Champ
, Pierre Letessier, Olivier Buisson, Alexis Joly:
OTMedia: the French TransMedia news observatory. 441-442 - Pengye Xia, Klara Nahrstedt:

TEEVE endpoint: towards the ease of 3D tele-immersive application development. 443-444 - Zhenzhen Hu, Min Lin, Si Liu, Meng Wang, Richang Hong, Shuicheng Yan:

eHeritage of shadow puppetry: creation and manipulation. 445-446 - Jules Françoise

, Norbert Schnell, Frédéric Bevilacqua:
Gesture-based control of physical modeling sound synthesis: a mapping-by-demonstration approach. 447-448 - Hongzhi Li, Brendan Jou, Joseph G. Ellis, Daniel Morozoff, Shih-Fu Chang:

News rover: exploring topical structures and serendipity in heterogeneous multimedia news. 449-450 - Marco Bertini

, Alberto Del Bimbo
, Andrea Ferracani, Francesco Gelli, Daniele Maddaluno, Daniele Pezzatini:
A novel framework for collaborative video recommendation, interest discovery and friendship suggestion based on semantic profiling. 451-452 - Marco Bertini

, Alberto Del Bimbo
, George Ioannidis, Emile Bijk, Isabel Trancoso
, Hugo Meinedo
:
euTV: a system for media monitoring and publishing. 453-454 - Giuliano Armano, Alessandro Giuliani

, Alberto Messina
, Maurizio Montagnuolo:
CAMMA: contextual advertising system for multimodal news aggregations. 455-456 - Alberto Del Bimbo

, Andrea Ferracani, Daniele Pezzatini:
Flarty: recommending art routes using check-ins latent topics. 457-458 - Damian Borth, Tao Chen, Rongrong Ji, Shih-Fu Chang:

SentiBank: large-scale ontology and classifiers for detecting sentiment and emotions in visual content. 459-460 - Junsheng Fu

, Lixin Fan, Yu You, Kimmo Roimela:
Augmented and interactive video playback based on global camera pose. 461-462 - Matt C. Yu, Peter Vajda, David M. Chen, Sam S. Tsai, Maryam Daneshi, André F. Araújo, Huizhong Chen, Bernd Girod:

EigenNews: a personalized news video delivery platform. 463-464 - André Mourão

, João Magalhães:
NovaEmötions: winning with a smile. 465-466 - Jia Chen, Qin Jin, Weipeng Zhang, Shenghua Bao, Zhong Su, Yong Yu:

Tell me what happened here in history. 467-468 - Xiaoyan Wang, Lifeng Sun, Shou Wang:

Group TV: a cloud based social TV for group social experience. 469-470 - Xingyu Gao

, Juan Cao, Zhiwei Jin, Xin Li, Jintao Li:
GeSoDeck: a geo-social event detection and tracking system. 471-472 - You Yang, Qiong Liu, Yue Gao, Binbin Xiong, Li Yu, Huan-Bo Luan, Rongrong Ji, Qi Tian:

Stereotime: a wireless 2D and 3D switchable video communication system. 473-474 - Xinghai Sun, Changhu Wang, Avneesh Sud, Chao Xu, Lei Zhang:

MagicBrush: image search by color sketch. 475-476 - Duong-Trung-Dung Nguyen, Mukesh Kumar Saini, Vu-Thanh Nguyen, Wei Tsang Ooi

:
Jiku director: a mobile video mashup system. 477-478 - Huijie Lin, Jia Jia, Hanyu Liao, Lianhong Cai:

WeCard: a multimodal solution for making personalized electronic greeting cards. 479-480
Posters
- Xirong Li

, Cees G. M. Snoek:
Classifying tag relevance with relevant positive and negative examples. 485-488 - Tao Zhu, Yanning Zhang, Peng Zhang, Wei Huang, Hichem Sahli:

Non-rigid target tracking based on 'flow-cut' in pair-wise frames with online hough forests. 489-492 - Jingjing Chen

, Yahong Han, Xiaochun Cao, Qi Tian:
Object coding on the semantic graph for scene classification. 493-496 - Chunjie Zhang

, Shuhui Wang, Chao Liang, Jing Liu, Qingming Huang, Haojie Li, Qi Tian:
Beyond bag of words: image representation in sub-semantic space. 497-500 - Darshan Santani, Daniel Gatica-Perez

:
Speaking swiss: languages and venues in foursquare. 501-504 - Zhendong Mao, Yongdong Zhang, Qi Tian:

What are the distance metrics for local features? 505-508 - Ye Luo, Junsong Yuan

:
Salient object detection in videos by optimal spatio-temporal path discovery. 509-512 - Ali Fakeri-Tabrizi, Massih-Reza Amini, Patrick Gallinari:

Multiview semi-supervised ranking for automatic image annotation. 513-516 - Raynor Vliegendhart, Babak Loni, Martha A. Larson, Alan Hanjalic:

How do we deep-link?: leveraging user-contributed time-links for non-linear video access. 517-520 - Xiaodan Zhuang, Shuang Wu, Pradeep Natarajan:

Compact bag-of-words visual representation for effective linear classification. 521-524 - Do Hang Nga, Keiji Yanai

:
Large-scale web video shot ranking based on visual features and tag co-occurrence. 525-528 - Shanmin Pang, Jianru Xue, Nanning Zheng, Qi Tian:

Locality preserving verification for image search. 529-532 - Chunjie Zhang

, Yifan Zhang, Shuhui Wang, Junbiao Pang, Chao Liang, Qingming Huang, Qi Tian:
Undo the codebook bias by linear transformation for visual applications. 533-536 - Yan Yan, Zhongwen Xu, Gaowen Liu, Zhigang Ma, Nicu Sebe

:
GLocal structural feature selection with sparsity for multimedia data understanding. 537-540 - Jonathan Driedger, Harald Grohganz

, Thomas Prätzlich, Sebastian Ewert
, Meinard Müller
:
Score-informed audio decomposition and applications. 541-544 - Zhixiang Ren, Liang-Tien Chia, Deepu Rajan, Shenghua Gao:

Background subtraction via coherent trajectory decomposition. 545-548 - Xiaojie Guo, Siyuan Li, Xiaochun Cao:

Motion matters: a novel framework for compressing surveillance videos. 549-552 - Viet Anh Nguyen, Shengkui Zhao, Tien Dung Vu, Douglas L. Jones, Minh N. Do

:
Spatialized audio multiparty teleconferencing with commodity miniature microphone array. 553-556 - Davide Baltieri, Roberto Vezzani

, Rita Cucchiara
:
Learning articulated body models for people re-identification. 557-560 - Zhanpeng Zhang, Wei Zhang, Jianzhuang Liu, Xiaoou Tang:

Facial landmark localization based on hierarchical pose regression with cascaded random ferns. 561-564 - Akisato Kimura, Katsuhiko Ishiguro, Makoto Yamada

, Alejandro Marcos Alvarez, Kaori Kataoka, Kazuhiko Murasaki:
Image context discovery from socially curated contents. 565-568 - Lu Li

, Jianru Xue, Zhiqiang Tian, Nanning Zheng:
Moment feature based forensic detection of resampled digital images. 569-572 - Adrian Popescu, Aymen Shabou:

Towards precise POI localization with social media. 573-576 - Wan-Lei Zhao, Hervé Jégou, Guillaume Gravier:

Sim-min-hash: an efficient matching technique for linking large image collections. 577-580 - Ramya Srinivasan, Amit K. Roy-Chowdhury, Conrad Rudolph, Jeanette Kohl:

Recognizing the royals: leveraging computerized face recognition for identifying subjects in ancient artworks. 581-584 - Asier Marzo Pérez

, Oscar Ardaiz
:
CollARt: a tool for creating 3D photo collages using mobile augmented reality. 585-588 - Qiang Chen, Yang Cai, Lisa M. Brown, Ankur Datta, Quanfu Fan, Rogério Schmidt Feris, Shuicheng Yan, Alexander G. Hauptmann, Sharath Pankanti:

Spatio-temporal fisher vector coding for surveillance event detection. 589-592 - Lin Wu

, Yang Wang, John Shepherd
:
Efficient image and tag co-ranking: a bregman divergence optimization method. 593-596 - Kuan-Yu Chu, Yin-Hsi Kuo, Winston H. Hsu:

Real-time privacy-preserving moving object detection in the cloud. 597-600 - De-An Huang, Yu-Chiang Frank Wang:

With one look: robust face recognition using single sample per person. 601-604 - Asako Kanezaki, Yasuo Kuniyoshi, Tatsuya Harada:

Weakly-supervised multi-class object detection using multi-type 3D features. 605-608 - Masoud Mazloom, AmirHossein Habibian, Cees G. M. Snoek:

Querying for video events by semantic signatures from few examples. 609-612 - Peter Grosche, Meinard Müller

, Joan Serrà:
Towards cover group thumbnailing. 613-616 - Dihong Gong, Zhifeng Li, Jianzhuang Liu, Yu Qiao:

Multi-feature canonical correlation analysis for face photo-sketch image retrieval. 617-620 - Zhihan Lu

, Muhammad Sikandar Lal Khan, Shafiq ur Réhman:
Hand and foot gesture interaction for handheld devices. 621-624 - Shih-Yao Lin, Chuen-Kai Shie, Shen-Chi Chen, Yi-Ping Hung:

AirTouch panel: a re-anchorable virtual touch panel. 625-628 - Tina Walber, Chantal Neuhaus, Steffen Staab

, Ansgar Scherp
, Ramesh C. Jain:
Creation of individual photo selections: read preferences from the users' eyes. 629-632 - Junqiang Wang, Jinhui Tang

, Yu-Gang Jiang:
Strong geometrical consistency in large scale partial-duplicate image search. 633-636 - Ilseo Kim, Sangmin Oh, Arash Vahdat, Kevin J. Cannons, A. G. Amitha Perera, Greg Mori:

Segmental multi-way local pooling for video recognition. 637-640 - Peng Peng

, Kevin J. Cannons, Ze-Nian Li:
Efficient video quality assessment based on spacetime texture representation. 641-644 - Yu Wang, Sheng Tang, Yalin Zhang, Jintao Li, DanYi Chen:

Fitted spectral hashing. 645-648 - Chih-Ming Chen

, Ming-Feng Tsai
, Jen-Yu Liu, Yi-Hsuan Yang:
Using emotional context from article for contextual music recommendation. 649-652 - Jonathan Delhumeau, Philippe Henri Gosselin, Hervé Jégou, Patrick Pérez:

Revisiting the VLAD image representation. 653-656 - Mohammad Soleymani, Sebastian Kaltwang

, Maja Pantic:
Human behavior sensing for tag relevance assessment. 657-660 - Jinhui Chen, Yasuo Ariki, Tetsuya Takiguchi:

Robust facial expressions recognition using 3D average face and ameliorated adaboost. 661-664 - Amir Roshan Zamir, Afshin Dehghan, Mubarak Shah:

Visual business recognition: a multimodal approach. 665-668 - David Wolinski, Olivier Le Meur, Josselin Gautier:

3D view synthesis with inter-view consistency. 669-672 - Christos Tzelepis

, Nikolaos Gkalelis, Vasileios Mezaris, Ioannis Kompatsiaris:
Improving event detection using related videos and relevance degree support vector machines. 673-676 - Tao Yan

, Shengfeng He
, Rynson W. H. Lau
, Yun Xu:
Consistent stereo image editing. 677-680 - Shuhui Bu, Zhenbao Liu, Junwei Han, Jun Wu:

Superpixel segmentation based structural scene recognition. 681-684 - Song Wu, Michael S. Lew:

Evaluation of salient point methods. 685-688 - Xikui Wang, Yang Liu, Donghui Wang, Fei Wu:

Cross-media topic mining on wikipedia. 689-692 - Yuan Tian, Yin Yang, Xiaohu Guo, Balakrishnan Prabhakaran:

A multigrid approach for bandwidth and display resolution aware streaming of 3D deformations. 693-696 - Shiai Zhu, Xiao-Yong Wei

, Chong-Wah Ngo
:
Error recovered hierarchical classification. 697-700 - Ionut Mironica, Jasper R. R. Uijlings, Negar Rostamzadeh, Bogdan Ionescu, Nicu Sebe

:
Time matters!: capturing variation in time in video using fisher kernels. 701-704 - Jules Françoise

, Norbert Schnell, Frédéric Bevilacqua:
A multimodal probabilistic model for gesture-based control of sound synthesis. 705-708 - Giuseppe Serra

, Costantino Grana
, Marco Manfredi, Rita Cucchiara
:
Modeling local descriptors with multivariate gaussians for object and scene recognition. 709-712 - Shi Qiu, Xiaogang Wang, Xiaoou Tang:

Anchor concept graph distance for web image re-ranking. 713-716 - Esra Acar, Frank Hopfgartner

, Sahin Albayrak
:
Violence detection in hollywood movies by the fusion of visual and mid-level audio cues. 717-720 - Suraj Raghuraman, Karthik Venkatraman, Zhanyu Wang, Balakrishnan Prabhakaran, Xiaohu Guo:

A 3D tele-immersion streaming approach using skeleton-based prediction. 721-724 - Shuhei Tarashima, Go Irie, Ken Tsutsuguchi, Hiroyuki Arai, Yukinobu Taniguchi:

Fast image/video collection summarization with local clustering. 725-728 - H. Emrah Tasli, Jan C. van Gemert, Theo Gevers:

Spot the differences: from a photograph burst to the single best picture. 729-732 - Qian Yu, Jingen Liu, Hui Cheng, Ajay Divakaran, Harpreet S. Sawhney:

Semantic pooling for complex event detection. 733-736 - George Legrady, Danny Bazo, Marco Pinter:

SwarmVision: autonomous aesthetic multi-camera interaction. 737-740 - Johan Pauwels

, Geoffroy Peeters:
Segmenting music through the joint estimation of keys, chords and structural boundaries. 741-744 - Aadhar Jain, Ahsan Arefin, Raoul Rivas, Chien-Nan (Shannon) Chen, Klara Nahrstedt:

3D teleimmersive activity classification based on application-system metadata. 745-748 - Yong Li, Jing Liu, Zechao Li, Yang Liu, Hanqing Lu:

Object co-segmentation via discriminative low rank matrix recovery. 749-752 - Siliang Tang

, Hanqi Wang, Jian Shao, Fei Wu, Ming Chen, Yueting Zhuang:
πLDA: document clustering with selective structural constraints. 753-756 - Sezer Karaoglu, Jan C. van Gemert, Theo Gevers:

Con-text: text detection using background connectivity for fine-grained object classification. 757-760 - Jongpil Kim, Sejong Yoon, Vladimir Pavlovic

:
Relative spatial features for image memorability. 761-764 - Morris Franken, Jan C. van Gemert:

Automatic Egyptian hieroglyph recognition by retrieving images as texts. 765-768 - Jialong Wang, Cheng Deng, Wei Liu

, Rongrong Ji, Xiangyu Chen, Xinbo Gao:
Query-dependent visual dictionary adaptation for image reranking. 769-772 - Mihalis A. Nicolaou, Stefanos Zafeiriou, Maja Pantic:

Correlated-spaces regression for learning continuous emotion dimensions. 773-776 - Chien-Pang Lin, Cheng-Yao Wang, Hou-Ren Chen, Wei-Chen Chu, Mike Y. Chen:

RealSense: directional interaction for proximate mobile sharing using built-in orientation sensors. 777-780 - Tao Chen

, Dongyuan Lu, Min-Yen Kan, Peng Cui:
Understanding and classifying image tweets. 781-784 - Yun Yang, Peng Cui, Wenwu Zhu, Shiqiang Yang:

User interest and social influence based emotion prediction for individuals. 785-788 - Ognjen Rudovic, Stavros Petridis, Maja Pantic:

Bimodal log-linear regression for fusion of audio and visual features. 789-792
Security and forensics
- Ranran Feng, Balakrishnan Prabhakaran:

Facilitating fashion camouflage art. 793-802 - Peijia Zheng, Jiwu Huang:

An efficient image homomorphic encryption scheme with small ciphertext expansion. 803-812 - Ricky J. Sethi

, Yolanda Gil
, Hyunjoon Jo, Andrew Philpot:
Large-scale multimedia content analysis using scientific workflows. 813-822
Open source software
- Michiel Hildebrand, Maarten Brinkerink, Riste Gligorov, Martijn Van Steenbergen, Johan Huijkman, Johan Oomen:

Waisda?: video labeling game. 823-826 - Chun-Ying Huang

, De-Yu Chen, Cheng-Hsin Hsu, Kuan-Ta Chen:
GamingAnywhere: an open-source cloud gaming testbed. 827-830 - Johannes Wagner, Florian Lingenfelser, Tobias Baur

, Ionut Damian, Felix Kistler, Elisabeth André
:
The social signal interpretation (SSI) framework: multimodal signal processing and recognition in real-time. 831-834 - Florian Eyben, Felix Weninger, Florian Groß

, Björn W. Schuller
:
Recent developments in openSMILE, the munich open-source multimedia feature extractor. 835-838 - Ioannis Tsampoulatidis

, Dimitrios Ververidis, Panagiotis Tsarchopoulos
, Spiros Nikolopoulos
, Ioannis Kompatsiaris, Nicos Komninos
:
ImproveMyCity: an open source platform for direct citizen-government communication. 839-842 - Mathias Lux

:
LIRE: open source image retrieval in Java. 843-846 - Lazaros T. Tsochatzidis

, Chryssanthi Iakovidou, Savvas A. Chatzichristofis
, Yiannis S. Boutalis:
Golden retriever: a Java based open source image retrieval engine. 847-850 - Rami Aamulehto, Mikko Kuhna, Jussi Tarvainen, Pirkko Oittinen:

Stage framework: an HTML5 and CSS3 framework for digital publishing. 851-854 - Dmitry Bogdanov

, Nicolas Wack, Emilia Gómez, Sankalp Gulati, Perfecto Herrera, Oscar Mayor, Gerard Roma
, Justin Salamon
, José Ricardo Zapata
, Xavier Serra
:
ESSENTIA: an open-source library for sound and music analysis. 855-858 - Carl Flynn, David S. Monaghan, Noel E. O'Connor

:
SCReen adjusted panoramic effect: SCRAPE. 859-862 - Hervé Yviquel

, Antoine Lorence, Khaled Jerbi, Gildas Cocherel, Alexandre Sanchez, Mickaël Raulet:
Orcc: multimedia development made easy. 863-866
Multimodal analysis
- Jaeyoung Choi, Howard Lei, Venkatesan N. Ekambaram, Pascal Kelm, Luke R. Gottlieb, Thomas Sikora, Kannan Ramchandran, Gerald Friedland:

Human vs machine: establishing a human baseline for multimodal location estimation. 867-876 - Fei Wu, Xinyan Lu, Zhongfei Zhang, Shuicheng Yan, Yong Rui, Yueting Zhuang:

Cross-media semantic representation via bi-directional learning to rank. 877-886 - Wu Liu, Tao Mei, Yongdong Zhang, Jintao Li, Shipeng Li

:
Listen, look, and gotcha: instant video search with mobile phones by layered audio-video indexing. 887-896 - Xiangbo Mao, Binbin Lin, Deng Cai, Xiaofei He, Jian Pei

:
Parallel field alignment for cross media retrieval. 897-906
Social dynamics
- Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel:

Analysis and forecasting of trending topics in online media streams. 907-916 - Sourav S. Bhowmick

, Aixin Sun
, Ba Quan Truong:
Why not, WINE?: towards answering why-not questions in social image search. 917-926 - Wenyuan Yin, Tao Mei, Chang Wen Chen:

Automatic generation of social media snippets for mobile browsing. 927-936 - Tian Gan, Yongkang Wong, Daqing Zhang, Mohan S. Kankanhalli

:
Temporal encoded F-formation system for social interaction detection. 937-946
Annotation
- Junshi Huang, Hairong Liu, Jialie Shen, Shuicheng Yan:

Towards efficient sparse coding for scalable image annotation. 947-956 - Yingming Li, Zhongang Qi, Zhongfei (Mark) Zhang, Ming Yang:

Learning with limited and noisy tagging. 957-966 - Lexing Xie

, Xuming He:
Picture tags and world knowledge: learning tag relations from visual semantic sources. 967-976 - Ting Yao, Tao Mei, Chong-Wah Ngo

, Shipeng Li
:
Annotation for free: video tagging by mining user search behavior. 977-986
Scene understanding
- Tam V. Nguyen

, Mengdi Xu, Guangyu Gao, Mohan S. Kankanhalli
, Qi Tian, Shuicheng Yan:
Static saliency vs. dynamic saliency: a comparative study. 987-996 - Li Liu, Ling Shao

, Xuelong Li
:
Building holistic descriptors for scene recognition: a multi-objective genetic programming approach. 997-1006 - Junhua Mao, Houqiang Li, Wengang Zhou, Shuicheng Yan, Qi Tian:

Scale based region growing for scene text detection. 1007-1016 - Helmut Grabner, Fabian Nater, Michel Druey, Luc Van Gool:

Visual interestingness in image sequences. 1017-1026
Doctoral symposium
- Elisavet Chatzilari:

Using tagged images of low visual ambiguity to boost the learning efficiency of object detectors. 1027-1030 - Michael James Scott

:
Projective identity and procedural rhetoric in educational multimedia: towards the enrichment of programming self-concept and growth mindset with fantasy role-play. 1031-1034 - Subhabrata Bhattacharya:

Recognition of complex events in open-source web-scale videos: a bottom up approach. 1035-1038 - Tanima Dutta:

Motion compensated compressed domain watermarking. 1039-1042 - Tian Gan:

Social interaction detection using a multi-sensor approach. 1043-1046 - Rene Kaiser

:
Virtual director technology for social video communication and live event broadcast production. 1047-1050 - Jules Françoise

:
Gesture-sound mapping by demonstration in interactive music systems. 1051-1054 - Esra Acar:

Learning representations for affective video understanding. 1055-1058 - Álvaro Sarasúa:

Context-aware gesture recognition in classical music conducting. 1059-1062 - Pedro Centieiro:

Bringing the sport stadium atmosphere to remote fans. 1063-1066 - Juan J. Bosch:

Automatic melodic and structural analysis of music material for enriched concert related experiences. 1067-1070 - Mario Montagud:

Design, development and evaluation of an adaptive and standardized RTP/RTCP-based IDMS solution. 1071-1074 - Carles Ventura:

Visual object analysis using regions and interest points. 1075-1078
Art session overview
- Marc Cavazza, Antonio Camurri:

The ACM multimedia 2013 art exhibition. 1079-1082
Workshops overview
- Anastasios D. Doulamis

, Nikolaos D. Doulamis, Marco Bertini, Jordi Gonzàlez, Thomas B. Moeslund
:
4th ACM/IEEE ARTEMIS 2013 international workshop on analysis and retrieval of tracked events and motion in imagery streams. 1083-1084 - Michel F. Valstar, Björn W. Schuller, Jarek Krajewski, Roddy Cowie, Maja Pantic:

Workshop summary for the 3rd international audio/visual emotion challenge and workshop (AVEC'13). 1085-1086 - Kiyoharu Aizawa, Yoko Yamakata, Takuya Funatomi:

Workshop summary for the 5th international workshop on multimedia for cooking and eating activities (CEA'13). 1087-1088 - Kuan-Ta Chen, Wei-Ta Chu, Martha A. Larson:

ACM multimedia 2013 workshop on crowdsourcing for multimedia. 1089-1090 - Liangliang Cao, Gerald Friedland, Pascal Kelm:

Second ACM multimedia workshop on geotagging and its applications in multimedia (GeoMM 2013). 1091-1092 - Albert Ali Salah

, Hayley Hung, Oya Aran
, Hatice Gunes:
Fourth international workshop on human behavior understanding (HBU 2013). 1093-1094 - Teresa Chambel

, V. Michael Bove Jr., Sharon Strover
, Paula Viana
, Graham Thomas
:
Immersive media experiences: immersiveme 2013 workshop at ACM multimedia. 1095-1096 - Jiebo Luo

, Caifeng Shan
, Ling Shao
, Minoru Etoh:
The third ACM international workshop on interactive multimedia on mobile and portable devices (IMMPD'13). 1097-1098 - Pablo César

, Matthew Cooper, David A. Shamma, Doug Williams:
2nd international workshop on socially-aware multimedia (SAM'13). 1099-1100 - Concetto Spampinato, Vasileios Mezaris, Jacco van Ossenbruggen

:
Summary abstract for the 2nd ACM international workshop on multimedia analysis for ecological data. 1101-1102 - Jenny Benois-Pineau

, Alexia Briassouli
, Alexander G. Hauptmann:
ACM MM MIIRH 2013: workshop on multimedia indexing and information retrieval for healthcare. 1103-1104 - Vivek K. Singh, Tat-Seng Chua, Ramesh C. Jain, Alex Pentland:

Summary abstract for the 1st ACM international workshop on personal data meets distributed multimedia. 1105-1106
Technical
- Ansgar Scherp

:
Semantic technologies for multimedia content: foundations and applications. 1107-1108 - Jialie Shen, Xian-Sheng Hua, Emre Sargin:

Towards next generation multimedia recommendation systems. 1109-1110 - Mohammad Soleymani, Martha A. Larson:

Crowdsourcing for multimedia research. 1111-1112 - John R. Smith, Liangliang Cao:

Massive-scale multimedia semantic modeling. 1113-1114 - Roger Zimmermann

, Yi Yu:
Social interactions over geographic-aware multimedia systems. 1115-1116 - Markus Schedl, Emilia Gómez, Masataka Goto

:
Multimedia information retrieval: music and audio. 1117-1118 - George Tzanetakis, Sidney S. Fels

, Michael J. Lyons
:
Blending the physical and the virtual in music technology: from interface design to multi-modal signal processing. 1119-1120 - Gerald Friedland:

Privacy concerns of sharing multimedia in social networks. 1121-1122

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














