default search action
ICME 2012: Melbourne, Australia
- Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, ICME 2012, Melbourne, Australia, July 9-13, 2012. IEEE Computer Society 2012, ISBN 978-1-4673-1659-0
Multimedia Content Analysis, Understanding, and Retrieval I
- Yu Kong, Yunde Jia:
A Hierarchical Model for Human Interaction Recognition. 1-6 - Jie Lin, Junsong Yuan, Ling-Yu Duan, Siwei Luo, Wen Gao:
Social Image Tagging by Mining Sparse Tag Patterns from Auxiliary Data. 7-12 - Xi Song, Tianfu Wu, Yi Xie, Yunde Jia:
Learning Global and Reconfigurable Part-Based Models for Object Detection. 13-18 - Chun-Chieh Hsu, Hua-Tsung Chen, Chien-Li Chou, Suh-Yin Lee:
Spiking and Blocking Events Detection and Analysis in Volleyball Videos. 19-24 - Yuji Matsuda, Hajime Hoashi, Keiji Yanai:
Recognition of Multiple-Food Images by Detecting Candidate Regions. 25-30
Special Session - Online Community
- Luca Chiarandini, Michele Trevisiol, Alejandro Jaimes:
Discovering Social Photo Navigation Patterns. 31-36 - Xiaoyan Wang, Lifeng Sun, Zhi Wang, Da Meng:
Group Recommendation Using External Followee for Social TV. 37-42 - Jaeyoung Choi, Gerald Friedland, Venkatesan N. Ekambaram, Kannan Ramchandran:
Multimodal Location Estimation of Consumer Media: Dealing with Sparse Training Data. 43-48 - Suman Deb Roy, Tao Mei, Wenjun Zeng, Shipeng Li:
Empowering Cross-Domain Internet Media with Real-Time Topic Learning from Social Streams. 49-54 - Lexing Xie, Hari Sundaram:
Media Lifecycle and Content Analysis in Social Media Communities. 55-60
Media Coding and Transcoding I
- Mingchao Geng, Xianguo Zhang, Yonghong Tian, Luhong Liang, Tiejun Huang:
A Fast and Performance-Maintained Transcoding Method Based on Background Modeling for Surveillance Video. 61-66 - Jingning Han, Vinay Melkote, Kenneth Rose:
A Unified Estimation-Theoretic Framework for Error-Resilient Scalable Video Coding. 67-72 - Hadi Hadizadeh, Ivan V. Bajic, Gene Cheung:
Saliency-Cognizant Error Concealment in Loss-Corrupted Streaming Video. 73-78 - Ivan Himawan, Wei Song, Dian Tjondronegoro:
Impact of Region-of-Interest Video Coding on Perceived Quality in Mobile Video. 79-84 - Tianwu Yang, Ce Zhu, Xiaojiu Fan, Qiang Peng:
Source Distortion Temporal Propagation Model for Motion Compensated Video Coding Optimization. 85-90
3D Analysis and Scene Synthesis
- Siyuan Fang, Neill W. Campbell:
Multi-perspective Panoramas of Long Scenes. 91-96 - Tuan Q. Pham, Philip Cox:
Multi-hypothesis Projection-Based Shift Estimation for Sweeping Panorama Reconstruction. 97-102 - Xue Wei, Son Lam Phung, Abdesselam Bouzerdoum:
Scene Segmentation and Pedestrian Classification from 3-D Range and Intensity Images. 103-108 - Ilkoo Ahn, Changick Kim:
Depth-Based Disocclusion Filling for Virtual View Synthesis. 109-114 - Shujie Liu, Philip A. Chou, Cha Zhang, Zhengyou Zhang, Chang Wen Chen:
Virtual View Reconstruction Using Temporal Information. 115-120
Multimedia Content Analysis, Understanding, and Retrieval II
- Ibrahim Radwan, Abhinav Dhall, Jyoti Joshi, Roland Goecke:
Regression Based Pose Estimation with Automatic Occlusion Detection and Rectification. 121-127 - Shen-Chi Chen, Chia-Hsiang Wu, Shih-Yao Lin, Yi-Ping Hung:
2D Face Alignment and Pose Estimation Based on 3D Facial Models. 128-133 - Lican Dai, Xin-Jing Wang, Lei Zhang, Nenghai Yu:
Efficient Tag Mining via Mixture Modeling for Real-Time Search-Based Image Annotation. 134-139 - Jana Eggink, Denise Bland:
A Large Scale Experiment for Mood-Based Classification of TV Programmes. 140-145 - Ilseo Kim, Sangmin Oh, A. G. Amitha Perera, Chin-Hui Lee:
Per-Exemplar Fusion Learning for Video Retrieval and Recounting. 146-151
Image / Video Processing
- Yanjie Li, Tianfan Xue, Lifeng Sun, Jianzhuang Liu:
Joint Example-Based Depth Map Super-Resolution. 152-157 - Zhixiang Ren, Shenghua Gao, Deepu Rajan, Liang-Tien Chia, Yun Huang:
Spatiotemporal Saliency Detection via Sparse Representation. 158-163 - De-An Huang, Li-Wei Kang, Min-Chun Yang, Chia-Wen Lin, Yu-Chiang Frank Wang:
Context-Aware Single Image Rain Removal. 164-169 - Pengfei Wan, Oscar C. Au, Ketan Tang, Yuanfang Guo, Lu Fang:
From 2D Extrapolation to 1D Interpolation: Content Adaptive Image Bit-Depth Expansion. 170-175 - Behzad Mirmahboub, Shadrokh Samavi, Nader Karimi, Shahram Shirani:
View-Invariant Fall Detection System Based on Silhouette Area and Orientation. 176-181
Media Streaming
- Takuya Fujihashi, Ziyuan Pan, Takashi Watanabe:
Traffic Reduction for Multiple Users in Multi-view Video Streaming. 182-187 - Qian Liu, Zixuan Zou, Chang Wen Chen:
QoS-driven and Fair Downlink Scheduling for Video Streaming over LTE Networks with Deadline and Hard Hand-off. 188-193 - Attilio Fiandrotti, Valerio Bioglio, Enrico Magli, Marco Grangetto, Rossano Gaeta:
Band Codes: Controlled Complexity Network Coding for Peer-to-Peer Video Streaming. 194-199 - Dejan Vukobratovic, Chadi Khirallah, Vladimir Stankovic, John S. Thompson:
Random Network Coding for Multimedia Delivery over LTE-Advanced. 200-205 - Yahui Hu, Guofeng Lv, Song Ci, Hui Tang:
A Cross-Layer Video Transmission Scheme with Guaranteed End-to-End QoS over MIMO OFDM Systems. 206-211
Multimedia Security and Privacy
- Junjun Jiang, Ruimin Hu, Zhen Han, Tao Lu, Kebin Huang:
Position-Patch Based Face Hallucination via Locality-Constrained Representation. 212-217 - Truyen Tran, Dinh Q. Phung, Svetha Venkatesh:
Learning Boltzmann Distance Metric for Face Recognition. 218-223 - Hamed Kiani Galoogahi, Terence Sim:
Inter-modality Face Sketch Recognition. 224-229 - Yongdong Wu, Robert H. Deng:
A Pollution Attack to Public-key Watermarking Schemes. 230-235 - Pravin Kakar, Natarajan Sudha:
Authenticating Image Metadata Elements Using Geolocation Information and Sun Direction Estimation. 236-241
Poster Session I
- Tao Xu, Hong Liu, Yueliang Qian, Zhe Wang:
A Fast and Robust Pedestrian Detection Framework Based on Static and Dynamic Information. 242-247 - David Philippou-Hübner, Bogdan Vlasenko, Ronald Böck, Andreas Wendemuth:
The Performance of the Speaking Rate Parameter in Emotion Recognition from Speech. 248-253 - Itheri Yahiaoui, Olfa Mzoughi, Nozha Boujemaa:
Leaf Shape Descriptor for Tree Species Identification. 254-259 - Quan Fang, Jitao Sang, Changsheng Xu:
Saliency Aware Locality-preserving Coding for Image Classification. 260-265 - Yang Liu, Jing Liu, Zechao Li, Hanqing Lu:
Noisy Tag Alignment with Image Regions. 266-271 - Yuki Sugiyama, Makoto P. Kato, Hiroaki Ohshima, Katsumi Tanaka:
Relative Relevance Feedback in Image Retrieval. 272-277 - Jiangen Zhang, Benjamin Z. Yao, Yongtian Wang:
Modelling Atomic Actions for Activity Classification. 278-283 - Rogério Schmidt Feris, Sharath Pankanti, Behjat Siddiquie:
Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance. 284-289 - Ishrat Jahan Sumana, Guojun Lu, Dengsheng Zhang:
Comparison of Curvelet and Wavelet Texture Features for Content Based Image Retrieval. 290-295 - Wei Fu, Jinqiao Wang, Zechao Li, Hanqing Lu, Songde Ma:
Learning Semantic Motion Patterns for Dynamic Scenes by Improved Sparse Topical Coding. 296-301 - Yu-Hsiang Huang, Tzu-Kuei Huang, Yan-Hsiang Huang, Wei-Chao Chen, Yung-Yu Chuang:
Warping-Based Novel View Synthesis from a Binocular Image for Autostereoscopic Displays. 302-307 - Zhenyu Wang, Ronggang Wang, Shengfu Dong, Wei Wu, Longshe Huo, Wen Gao:
Depth Template Based 2D-to-3D Video Conversion and Coding System. 308-313 - Po-Chen Wu, Jui-Hsin Lai, Ja-Ling Wu, Shao-Yi Chien:
Stable Pose Estimation with a Motion Model in Real-Time Application. 314-319 - John Judnich, Nam Ling:
Symmetric Cluster Set Level of Detail for Real-Time Terrain Rendering. 320-324 - Fumio Okura, Masayuki Kanbara, Naokazu Yokoya:
Full Spherical High Dynamic Range Imaging from the Sky. 325-332 - Yuan Lin, Shengjin Wang, Qian Lin, Feng Tang:
Face Swapping under Large Pose Variations: A 3D Model Based Approach. 333-338 - Xiao-han Lu, Fang Wei, Fang-min Chen:
Foreground-Object-Protected Depth Map Smoothing for DIBR. 339-343 - Yue Ming, Qiuqi Ruan, Alexander G. Hauptmann:
Activity Recognition from RGB-D Camera with 3D Local Spatio-temporal Features. 344-349 - Peijiang Liu, Yunhong Wang, Di Huang, Zhaoxiang Zhang:
Recognizing Occluded 3D Faces Using an Efficient ICP Variant. 350-355 - Xuran Zhao, Nicholas W. D. Evans, Jean-Luc Dugelay:
CO-LDA: A Semi-supervised Approach to Audio-Visual Person Recognition. 356-361 - Deok-Yeon Kim, Joon Young Kwak, ByoungChul Ko, Jae-Yeal Nam:
Human Detection Using Wavelet-Based CS-LBP and a Cascade of Random Forests. 362-367 - Tomonari Yoshida, Tomokazu Takahashi, Daisuke Deguchi, Ichiro Ide, Hiroshi Murase:
Robust Face Super-Resolution Using Free-Form Deformations for Low-Quality Surveillance Video. 368-373 - Menglin Jiang, Yonghong Tian, Tiejun Huang:
Video Copy Detection Using a Soft Cascade of Multimodal Features. 374-379 - Li Weng, Geert Braeckman, Ann Dooms, Bart Preneel, Peter Schelkens:
Robust Image Content Authentication with Tamper Location. 380-385 - Wei Sun, Shantanu Rane:
A Distance-Sensitive Attribute Based Cryptosystem for Privacy-Preserving Querying. 386-391 - Guanshuo Xu, Yun-Qing Shi:
Camera Model Identification Using Local Binary Patterns. 392-397 - Chen Gong, Yang Liu, Tianyu Li, Jie Yang, Xiangjian He:
The Extended Co-learning Framework for Robust Object Tracking. 398-403 - W. L. Warner Hong, Fei Long, Pengye Xia, Shueng-Han Gary Chan:
Distributed Joint Channel and Routing Assignment for Multimedia Wireless Mesh Networks. 404-409 - Yousef O. Sharrab, Nabil J. Sarhan:
Accuracy and Power Consumption Tradeoffs in Video Rate Adaptation for Computer Vision Applications. 410-415 - Yongfei Zhang, Yunsheng Zhang, Shiyin Qin, Zhihai He:
Resource-Distortion Modeling for Video Streaming over Mesh Networks with Priority-Based Packet Scheduling. 416-421
Expert Talk: Time Machine Plenary Session
- Xavier Anguera:
Expert Talk for Time Machine Session: Dynamic Time Warping New Youth. 422 - John N. A. Brown:
Expert Talk for Time Machine Session: Designing Calm Technology "as Refreshing as Taking a Walk in the Woods". 423 - Mohammad Soleymani:
Expert Talk for Time Machine Session: Affective Multimedia Analysis: Introduction, Background and Perspectives. 424 - Wenjun Zeng:
Expert Talk for Time Machine Session: High Order Entropy Coding - From Conventional Video Coding to Distributed Video Coding. 425
Multimedia Content Analysis, Understanding and Retrieval III
- Shen-Fu Tsai, Hao Tang, Feng Tang, Thomas S. Huang:
Ontological Inference Framework with Joint Ontology Construction and Learning for Image Understanding. 426-431 - Yuxuan Lan, Barry-John Theobald, Richard W. Harvey:
View Independent Computer Lip-Reading. 432-437 - Junyong You:
Video Gaze Prediction: Minimizing Perceptual Information Loss. 438-443 - Costantino Grana, Daniele Borghesani, Rita Cucchiara:
Class-Based Color Bag of Words for Fashion Retrieval. 444-449
Acoustic Signal Analysis and Processing
- Talal Bin Amin, Pina Marziliano, James Sneed German:
Nine Voices, One Artist: Linguistic and Acoustic Analysis. 450-454 - Xavier Anguera, Antonio Garzon, Tomasz Adamek:
MASK: Robust Local Features for Audio Fingerprinting. 455-460 - Pierre Guillon, Reza Zolfaghari, Nicolas Epain, André van Schaik, Craig T. Jin, Carl Hetherington, Jonathan Thorpe, Anthony I. Tew:
Creating the Sydney York Morphological and Acoustic Recordings of Ears Database. 461-466 - Xulei Bao, Jie Zhu, Zhen Huang:
Blind Speech Dereverberation Based on a Statistical Model. 467-472
Media Coding and Transcoding II
- Huanjing Yue, Xiaoyan Sun, Feng Wu, Jingyu Yang:
SIFT-Based Image Compression. 473-478 - R. M. Thilini P. Rajakaruna, Warnakulasuriya Anil Chandana Fernando, Janko Calic:
Lagrange-based Video Encoder Optimisation to Enhance Motion Representation in the Compressed-Domain. 479-484 - Bruno Boessio Vizzotto, Bruno Zatt, Muhammad Shafique, Sergio Bampi, Jörg Henkel:
A Model Predictive Controller for Frame-Level Rate Control in Multiview Video Coding. 485-490 - Dung Vu, Jilong Kuang, Laxmi N. Bhuyan:
An Adaptive Dynamic Scheduling Scheme for H.264/AVC Decoding on Multicore Architecture. 491-496
Special Session - Perceptual Visual Signal Coding and Display
- Abdul Rehman, Zhou Wang:
SSIM-Inspired Perceptual Video Coding for HEVC. 497-502 - Shuai Wan, Yanchao Gong, Fuzheng Yang:
Perception of Temporal Pumping Artifact in Video Coding with the Hierarchical Prediction Structure. 503-508 - Guan-Lin Wu, Yu-Jie Fu, Shao-Yi Chien:
System Design of Perceptual Quality-Regulable H.264 Video Encoder. 509-514 - Liyuan Xing, Jie Xu, Kim Skildheim, Andrew Perkis, Touradj Ebrahimi:
Subjective Crosstalk Assessment Methodology for Auto-stereoscopic Displays. 515-520
Poster Session II
- Wei-Ho Tsai, Yu-Ming Tu:
An Efficient Query-by-Singing/Humming System Based on Fast Fourier Transforms of Note Sequences. 521-525 - Anh-Phuong Ta, Guillaume Gravier:
Unsupervised Mining of Multiple Audiovisually Consistent Clusters for Video Structure Analysis. 526-531 - Hongda Tian, Wanqing Li, Lei Wang, Philip Ogunbona:
A Novel Video-Based Smoke Detection Method Using Image Separation. 532-537 - Jin-Woo Jeong, Hyun-Ki Hong, Jee-Uk Heu, Iqbal Qasim, Dong-Ho Lee:
Visual Summarization of the Social Image Collection Using Image Attractiveness Learned from Social Behaviors. 538-543 - Pei Dong, Yong Xia, David Dagan Feng:
Real-Time Storyboard Generation for H.264/AVC Compressed Videos. 544-549 - Junbin Gao, Manoranjan Paul, Jun Liu:
The Image Matting Method with Regularized Matte. 550-555 - Ruben Gonzalez:
Radon-based Audio Classification Features. 556-561 - Hyun-seok Min, Semin Kim, Wesley De Neve, Yong Man Ro:
Video Copy Detection Using Inclined Video Tomography and Bag-of-Visual-Words. 562-567 - Yanan Liu:
Image Classification with Group Fusion Sparse Representation. 568-573 - Min-Chun Yang, De-An Huang, Chih-Yun Tsai, Yu-Chiang Frank Wang:
Self-Learning of Edge-Preserving Single Image Super-Resolution via Contourlet Transform. 574-579 - Bing Yang, Zhiyong Gao, Xiaoyun Zhang:
Principal Components Analysis-Based Edge-Directed Image Interpolation. 580-585 - Qing Yan, Yi Xu, Xiaokang Yang:
A Robust Homography Estimation Method Based on Keypoint Consensus and Appearance Similarity. 586-591 - Jian Zhang, Ruiqin Xiong, Chen Zhao, Siwei Ma, Debin Zhao:
Exploiting Image Local and Nonlocal Consistency for Mixed Gaussian-Impulse Noise Removal. 592-597 - Qingchun Lu, Xiangzhong Fang, Chong Xu, Yongzhe Wang:
Frame Rate Up-Conversion for Depth-Based 3D Video. 598-603 - Ting-Chun Wang, Yi-Nung Liu, Shao-Yi Chien:
Color Filter Array Demosaicking Using Self-validation Framework. 604-609 - Junjun Jiang, Ruimin Hu, Zhen Han, Kebin Huang, Tao Lu:
Efficient Single Image Super-Resolution via Graph Embedding. 610-615 - Haichao Zhang, Yanning Zhang, Thomas S. Huang:
Exploiting Structured Sparsity for Image Deblurring. 616-621 - Ke Chen, Zhong Zhou, Wei Wu:
Clustering Based Search Algorithm for Motion Estimation. 622-627 - Shi Dong, Ruimin Hu, Weiping Tu, Xiang Zheng, Junjun Jiang, Song Wang:
Enhanced Principal Component Using Polar Coordinate PCA for Stereo Audio Coding. 628-633 - Gerhard Tech, Heiko Schwarz, Karsten Müller, Thomas Wiegand:
Synthesized View Distortion Based 3D Video Coding for Extrapolation and Interpolation of Views. 634-639 - Filippo Speranza, Ron Renaud, André Vincent, Wa James Tam:
Perceived Picture Quality of Frame-Compatible 3DTV Video Formats. 640-645 - Huiping Deng, Li Yu, Jinbo Qiu, Juntao Zhang:
A Joint Texture/Depth Edge-Directed Up-sampling Algorithm for Depth Map Coding. 646-650 - Dong Zhang, Bin Li, Jizheng Xu, Houqiang Li:
Fast Transcoding from H.264 AVC to High Efficiency Video Coding. 651-656 - Felipe Sampaio, Sergio Bampi, Mateus Grellert, Luciano Volcan Agostini, Júlio C. B. de Mattos:
Motion Vectors Merging: Low Complexity Prediction Unit Decision Heuristic for the Inter-prediction of HEVC Encoders. 657-662 - Guan-Ju Peng, Wen-Liang Hwang, Sao-Jie Chen:
Optimal Bit-allocation for Wavelet-based Scalable Video Coding. 663-668 - Jincao Yao, Huimin Yu, Roland Hu:
Pooling Search: Serum Samples Test Simulated Video Fingerprint Search. 669-674 - Sumit Negi, Santanu Chaudhury:
Finding Subgroups in a Flickr Group. 675-680