default search action
20th ACM Multimedia 2012: Nara, Japan
- Noboru Babaguchi, Kiyoharu Aizawa, John R. Smith, Shin'ichi Satoh, Thomas Plagemann, Xian-Sheng Hua, Rong Yan:
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29 - November 02, 2012. ACM 2012, ISBN 978-1-4503-1089-5
Plenary talk 1
- Masahiro Fujita:
Future direction of digital content: 20th anniversary keynote talk. 1-2
Panel 1: 20th anniversary panel
- Klara Nahrstedt, Malcolm Slaney:
Coulda, woulda, shoulda: 20 years of multimedia opportunities. 3-4
Plenary talk 2
- Yukiyasu Kamitani:
Decoding visual experience from the human brain. 5-6
Panel 2: panel discussion
- Lexing Xie, David Ayman Shamma, Cees Snoek:
Content is dead: long-live content! 7-8
Best paper session
- Heng Liu, Tao Mei, Jiebo Luo, Houqiang Li, Shipeng Li:
Finding perfect rendezvous on the go: accurate mobile visual localization and its applications to routing. 9-18 - Jitao Sang, Changsheng Xu:
Right buddy makes the difference: an early exploration of social relation analysis in multimedia applications. 19-28 - Zhi Wang, Lifeng Sun, Xiangwen Chen, Wenwu Zhu, Jiangchuan Liu, Minghua Chen, Shiqiang Yang:
Propagation-based social-aware replication for social video contents. 29-38 - Shih-Yao Lin, Chuen-Kai Shie, Shen-Chi Chen, Yi-Ping Hung:
Action recognition for human-marionette interaction. 39-48
Full paper session 1: content-based image retrieval
- Yang Yang, Linjun Yang, Gangshan Wu, Shipeng Li:
A bag-of-objects retrieval model for web image search. 49-58 - Liqiang Nie, Shuicheng Yan, Meng Wang, Richang Hong, Tat-Seng Chua:
Harvesting visual concepts for image search with complex queries. 59-68 - Miaojing Shi, Xinghai Sun, Dacheng Tao, Chao Xu:
Exploiting visual word co-occurrence for image retrieval. 69-78 - Hanwang Zhang, Zheng-Jun Zha, Shuicheng Yan, Jingwen Bian, Tat-Seng Chua:
Attribute feedback. 79-88
Full paper session 2: audio and music
- Ju-Chiang Wang, Yi-Hsuan Yang, Hsin-Min Wang, Shyh-Kang Jeng:
The acoustic emotion gaussians model for emotion-based music annotation and retrieval. 89-98 - Xinxi Wang, David S. Rosenblum, Ye Wang:
Context-aware mobile music recommendation for daily activities. 99-108 - Zimu Liu, Yuan Feng, Baochun Li:
MusicScore: mobile music composition for practice and fun. 109-118 - Chien-Nan (Shannon) Chen, Cing-yu Chu, Su-Ling Yeh, Hao-Hua Chu, Polly Huang:
Modeling the qoe of rate changes in SKYPE/SILK VoIP calls. 119-128
Full paper session 3: video applications
- Roman Lissermann, Simon Olberding, Benjamin Petry, Max Mühlhäuser, Jürgen Steimle:
PaperVideo: interacting with videos on multiple paper-like displays. 129-138 - Mukesh Kumar Saini, Raghudeep Gadde, Shuicheng Yan, Wei Tsang Ooi:
MoViMash: online mobile video mashup. 139-148 - Zhebin Zhang, Chen Zhou, Bo Xin, Yizhou Wang, Wen Gao:
An interactive system of stereoscopic video conversion. 149-158 - Ian Kegel, Pablo César, Jack Jansen, Dick C. A. Bulterman, Tim Stevens, Joke Kort, Nikolaus Färber:
Enabling 'togetherness' in high-quality domestic video. 159-168
Full paper session 4: large scale search
- Wengang Zhou, Yijuan Lu, Houqiang Li, Qi Tian:
Scalar quantization for large scale image search. 169-178 - Jingdong Wang, Shipeng Li:
Query-driven iterated neighborhood graph search for large scale indexing. 179-188 - Giorgos Tolias, Yannis Kalantidis, Yannis Avrithis:
SymCity: feature selection by symmetry for large scale image retrieval. 189-198 - Zhen Liu, Houqiang Li, Wengang Zhou, Qi Tian:
Embedding spatial context information into inverted filefor large-scale image retrieval. 199-208
Full paper session 5: person and face analysis
- Hamdi Dibeklioglu, Theo Gevers, Albert Ali Salah, Roberto Valenti:
A smile can reveal your age: enabling facial dynamics in age estimation. 209-218 - Jiajun Bu, Bin Xu, Chenxia Wu, Chun Chen, Jianke Zhu, Deng Cai, Xiaofei He:
Unsupervised face-name association via commute distance. 219-228 - Xin Lu, Poonam Suryanarayan, Reginald B. Adams Jr., Jia Li, Michelle G. Newman, James Z. Wang:
On shape and the computability of emotions. 229-238 - Tam V. Nguyen, Si Liu, Bingbing Ni, Jun Tan, Yong Rui, Shuicheng Yan:
Sense beauty via face, dressing, and/or voice. 239-248
Full paper session 6: video distribution
- Haiying Shen, Ze Li, Hailang Wang, Jin Li:
Leveraging social network concepts for efficient peer-to-peer live streaming systems. 249-258 - Yuan Feng, Baochun Li, Bo Li:
Jetway: minimizing costs on inter-datacenter video traffic. 259-268 - Nesrine Changuel, Bessem Sayadi, Michel Kieffer:
Control of distributed servers for quality-fair delivery of multiple video streams. 269-278 - Xin Li, Mian Dong, Zhan Ma, Felix C. A. Fernandes:
GreenTube: power optimization for mobile videostreaming via dynamic cache management. 279-288
Full paper session 7: visual search
- Marcel Worring, Andreas Engl, Camelia Smeria:
A multimedia analytics framework for browsing image collections in digital forensics. 289-298 - Liangliang Cao, Zhenguo Li, Yadong Mu, Shih-Fu Chang:
Submodular video hashing: a unified framework towards video pooling and indexing. 299-308 - Gregory D. Castañón, André-Louis Caron, Venkatesh Saligrama, Pierre-Marc Jodoin:
Exploratory search of long surveillance videos. 309-318 - Christoph Kofler, Linjun Yang, Martha A. Larson, Tao Mei, Alan Hanjalic, Shipeng Li:
When video search goes wrong: predicting query failure using search engine logs and visual search results. 319-328
Full paper session 8: human-centric media
- Ruchir Srivastava, Jiashi Feng, Sujoy Roy, Shuicheng Yan, Terence Sim:
Don't ask me what i'm like, just watch and listen. 329-338 - Esben Skouboe Poulsen, Hans Jørgen Andersen, Ole B. Jensen, Rikke Gade, Tobias Thyrrestrup, Thomas B. Moeslund:
Controlling urban lighting by human motion patterns results from a full scale experiment. 339-348 - Victoria Yanulevskaya, Jasper R. R. Uijlings, Elia Bruni, Andreza Sartori, Elisa Zamboni, Francesca Bacci, David Melcher, Nicu Sebe:
In the eye of the beholder: employing statistical analysis and eye tracking for analyzing abstract paintings. 349-358 - Qianqian Xu, Qingming Huang, Yuan Yao:
Online crowdsourcing subjective image quality assessment. 359-368
Full paper session 9: presentation and organization
- Raj Kumar Gupta, Alex Yong Sang Chia, Deepu Rajan, Ee Sin Ng, Zhiyong Huang:
Image colorization using similar images. 369-378 - Mikko Kuhna, Ida-Maria Kivelä, Pirkko Oittinen:
Semi-automated magazine layout using content-based image features. 379-388 - Surendar Chandra, Jacob T. Biehl, John S. Boreczky, Scott A. Carter, Lawrence A. Rowe:
Understanding screen contents for building a high performance, real time screen sharing system. 389-398 - Bhojan Anand, Lee Kee Chong, Ee-Chien Chang, Mun Choon Chan, Akkihebbal L. Ananda, Wei Tsang Ooi:
El-pincel: a painter cloud service for greener web pages. 399-408
Full paper session 10: haptics
- Rahul Gopal Chaudhari, Burak Cizmeci, Katherine J. Kuchenbecker, Seungmoon Choi, Eckehard G. Steinbach:
Low bitrate source-filter model based compression of vibrotactile texture signals in haptic teleoperation. 409-418 - Troy McDaniel, Morris Goldberg, Shantanu Bala, Bijan Fakhri, Sethuraman Panchanathan:
Vibrotactile feedback of motor performance errors for enhancing motor learning. 419-428 - Yinsheng Zhou, Khe Chai Sim, Patsy Tan, Ye Wang:
MOGAT: mobile games with auditory training for children with cochlear implants. 429-438
Full paper session 11: event recognition
- Ming-Fang Weng, Yen-Yu Lin, Nick C. Tang, Hong-Yuan Mark Liao:
Visual knowledge transfer among multiple cameras for people counting with occlusion handling. 439-448 - Lu Jiang, Alexander G. Hauptmann, Guang Xiang:
Leveraging high-level and low-level features for multimedia event detection. 449-458 - Chreston A. Miller, Francis K. H. Quek:
Interactive data-driven discovery of temporal behavior models from events in media streams. 459-468 - Zhigang Ma, Yi Yang, Yang Cai, Nicu Sebe, Alexander G. Hauptmann:
Knowledge adaptation for ad hoc multimedia event detection with few exemplars. 469-478
Full paper session 12: semantic tagging
- Zhongang Qi, Ming Yang, Zhongfei (Mark) Zhang, Zhengyou Zhang:
Multi-view learning from imperfect tagging. 479-488 - Albrecht J. Lindner, Appu Shaji, Nicolas Bonnier, Sabine Süsstrunk:
Joint statistical analysis of images and keywords with applications in semantic image enhancement. 489-498 - Zhiwu Lu, Yuxin Peng:
Image annotation by semantic sparse recoding of visual content. 499-508 - Fei Wu, Ying Yuan, Yong Rui, Shuicheng Yan, Yueting Zhuang:
Annotating web images using NOVA: NOn-conVex group spArsity. 509-518
Full paper session 13: image analysis
- Zhenbang Sun, Changhu Wang, Liqing Zhang, Lei Zhang:
Query-adaptive shape topic mining for hand-drawn sketch recognition. 519-528 - Yahong Han, Fei Wu, Xinyan Lu, Qi Tian, Yueting Zhuang, Jiebo Luo:
Correlated attribute transfer with multi-task graph-guided fusion. 529-538 - Lingxi Xie, Qi Tian, Bo Zhang:
Spatial pooling of heterogeneous features for image applications. 539-548 - Yoshitaka Ushiku, Tatsuya Harada, Yasuo Kuniyoshi:
Efficient image annotation for automatic sentence generation. 549-558
Full paper session 14: mobile systems
- Yu-Chuan Tseng, Yi-Ching Huang, Kuan-Ying Wu, Chi-Ping Chin:
Dinner of Luciérnaga: an interactive play with iPhone app in theater. 559-568 - Xin Yang, Kwang-Ting (Tim) Cheng:
Accelerating SURF detector on mobile devices. 569-578 - Lican Dai, Huanjing Yue, Xiaoyan Sun, Feng Wu:
IMShare: instantly sharing your mobile landmark images by search-based reconstruction. 579-588 - Jiajun Liu, Zi Huang, Lei Chen, Heng Tao Shen, Zhixian Yan:
Discovering areas of interest with geo-tagged images and check-ins. 589-598
Full paper session 15: image content analysis
- Pierre Letessier, Olivier Buisson, Alexis Joly:
Scalable mining of small visual objects. 599-608 - Wei Zhang, Lei Pang, Chong-Wah Ngo:
Snap-and-ask: answering multimodal question by naming visual instance. 609-618 - Si Liu, Jiashi Feng, Zheng Song, Tianzhu Zhang, Hanqing Lu, Changsheng Xu, Shuicheng Yan:
Hi, magic closet, tell me what to wear! 619-628 - Chun-Shien Lu, Chao-Yung Hsu:
Constraint-optimized keypoint inhibition/insertion attack: security threat to scale-space image feature extraction. 629-638
Full paper session 16: social media
- Xiao-Yong Wei, Zhen-Qun Yang:
Mining in-class social networks for large-scale pedagogical analysis. 639-648 - Suman Deb Roy, Tao Mei, Wenjun Zeng, Shipeng Li:
SocialTransfer: cross-domain transfer learning from social streams for media applications. 649-658 - Dong Liu, Guangnan Ye, Ching-Ting Chen, Shuicheng Yan, Shih-Fu Chang:
Hybrid social media network. 659-668 - Yan-Ying Chen, Winston H. Hsu, Hong-Yuan Mark Liao:
Discovering informative social subgraphs and predicting pairwise relationships from group photos. 669-678
Poster Session I
- Yuming Fang, Weisi Lin, Zhenzhong Chen, Chia-Ming Tsai, Chia-Wen Lin:
Video saliency detection in the compressed domain. 697-700 - Chi Zhang, Weiqiang Wang:
A robust and efficient shot boundary detection approach based on fisher criterion. 701-704 - Stephan Wenger, Marcus A. Magnor:
A genetic algorithm for audio retargeting. 705-708 - Hyeonwoo Noh, Bohyung Han:
Seam carving with forward gradient difference maps. 709-712 - Chongyu Chen, Jianfei Cai, Weisi Lin, Guangming Shi:
Surveillance video coding via low-rank and sparse decomposition. 713-716 - Jui-Yu Yen, Bo-Hao Chen, Shih-Chia Huang:
Enhanced extraction of moving objects in variable bit-rate video streams. 717-720 - Bing Li, Weihua Xiong, Weiming Hu, Xinmiao Ding:
Context-aware affective images classification based on bilayer sparse representation. 721-724 - Xiuzhuang Zhou, Jiwen Lu, Junlin Hu, Yuanyuan Shang:
Gabor-based gradient orientation pyramid for kinship verification under uncontrolled environments. 725-728 - Tao Lin, Liang Lin, Qing Wang:
Robust stroke-based video animation via layered motion and correspondence. 729-732 - Wenxiu Sun, Oscar C. Au, Lingfeng Xu, Yujun Li, Wei Hu, Zhiding Yu:
Texture optimization for seamless view synthesis through energy minimization. 733-736 - Xin-Shun Xu, Yuan Jiang, Xiangyang Xue, Zhi-Hua Zhou:
Semi-supervised multi-instance multi-label learning for video annotation task. 737-740 - Youjie Zhou, Jiebo Luo:
Geo-location inference on news articles via multimodal pLSA. 741-744 - Zhendong Mao, Yongdong Zhang, Ke Gao, Dongming Zhang:
A method for detecting salient regions using integrated features. 745-748 - Xinyuan Cai, Chunheng Wang, Baihua Xiao, Xue Chen, Ji Zhou:
Deep nonlinear metric learning with independent subspace analysis for face verification. 749-752 - Zhuo Su, Daiguo Deng, Xue Yang, Xiaonan Luo:
Color transfer based on multiscale gradient-aware decomposition and color distribution mapping. 753-756 - Yi-Hsuan Yang:
On sparse and low-rank matrix decomposition for singing voice separation. 757-760 - Xiaoshuai Sun, Hongxun Yao:
Memorable basis: towards human-centralized sparse representation. 761-764 - Trung Quy Phan, Palaiahnakote Shivakumara, Chew Lim Tan:
Detecting text in the real world. 765-768 - Zhen Han, Junjun Jiang, Ruimin Hu, Tao Lu, Kebin Huang:
Face image super-resolution via nearest feature line. 769-772 - Zhenyong Fu, Hongtao Lu, Horace Ho-Shing Ip, Zhiwu Lu:
Modalities consensus for multi-modal constraint propagation. 773-776 - Ping Luo, Xiaogang Wang, Liang Lin, Xiaoou Tang:
Joint semantic segmentation by searching for compatible-competitive references. 777-780 - Tianlong Chen, Chunxi Liu, Qingming Huang:
An effective multi-clue fusion approach for web video topic detection. 781-784 - Hui Liang, Junsong Yuan, Daniel Thalmann:
3D fingertip and palm tracking in depth image sequences. 785-788 - Gelareh Mohammadi, Antonio Origlia, Maurizio Filippone, Alessandro Vinciarelli:
From speech to personality: mapping voice quality and intonation into personality differences. 789-792 - Samuel Kim, Maurizio Filippone, Fabio Valente, Alessandro Vinciarelli:
Predicting the conflict level in television political debates: an approach based on crowdsourcing, nonverbal communication and gaussian processes. 793-796 - Pengyang Bu, Nan Wang, Haizhou Ai:
Using structural patches tiling to guide human head-shoulder segmentation. 797-800 - Bao Zhang, Handong Zhao, Xiaochun Cao:
Video object segmentation with shortest path. 801-804 - Ding-Jie Chen, Hwann-Tzong Chen, Long-Wen Chang:
Video object cosegmentation. 805-808 - Zhineng Chen, Chong-Wah Ngo, Juan Cao, Wei Zhang:
Community as a connector: associating faces with celebrity names in web videos. 809-812 - Stavros Petridis, Sanjay Bilakhia, Maja Pantic:
Comparison of prediction-based fusion and feature-level fusion across different learning models. 813-816 - Richard Rzeszutek, Raymond Phan, Dimitrios Androutsos:
Depth estimation for semi-automatic 2D to 3D conversion. 817-820 - Ting Yao, Chong-Wah Ngo, Shiai Zhu:
Predicting domain adaptivity: redo or recycle? 821-824 - Xi Jiang, Tuo Zhang, Xintao Hu, Lie Lu, Junwei Han, Lei Guo, Tianming Liu:
Music/speech classification using high-level features derived from fmri brain imaging. 825-828 - Jen-Yu Liu, Chin-Chia Michael Yeh, Yi-Hsuan Yang, Yuan-Ching Teng:
Bilingual analysis of song lyrics and audio words. 829-832 - Ye Tang, Yu-Bin Yang, Yang Gao:
Self-paced dictionary learning for image classification. 833-836 - Xixuan Wu, Yu Qiao, Xiaogang Wang, Xiaoou Tang:
Cross matching of music and image. 837-840 - Nils Peters, Howard Lei, Gerald Friedland:
Name that room: room identification using acoustic features in a recording. 841-844 - Lai-Kuan Wong, Kok-Lim Low:
Enhancing visual dominance by semantics-preserving image recomposition. 845-848 - Jie Xiao, Wengang Zhou, Xia Li, Meng Wang, Qi Tian:
Image tag re-ranking by coupled probability transition. 849-852 - Zechao Li, Jing Liu, Yu Jiang, Jinhui Tang, Hanqing Lu:
Low rank metric learning for social image retrieval. 853-856 - Jia Jia, Sen Wu, Xiaohui Wang, Peiyun Hu, Lianhong Cai, Jie Tang:
Can we understand van gogh's mood?: learning to infer affects from images in social networks. 857-860 - Yang Liu, Jing Liu, Zechao Li, Biao Niu, Hanqing Lu:
Social tag alignment with image regions by sparse reconstructions. 861-864 - Yanxiang Wang, Hari Sundaram, Lexing Xie:
Social event detection with interaction graph modeling. 865-868
Poster Session 2
- Jingwen Bian, Zheng-Jun Zha, Hanwang Zhang, Qi Tian, Tat-Seng Chua:
Visual query attributes suggestion. 869-872 - Junjie Cai, Zheng-Jun Zha, Wengang Zhou, Qi Tian:
Attribute-assisted reranking for web image retrieval. 873-876 - Xia Li, Wengang Zhou, Jinhui Tang, Qi Tian:
Query expansion enhancement by fast binary matching. 877-880 - Xianglong Liu, Junfeng He, Di Liu, Bo Lang:
Compact kernel hashing with multiple features. 881-884 - Weiwen Tu, Rong Pan, Jingdong Wang:
Similar image search with a tiny bag-of-delegates representation. 885-888