IEEE Transactions on Multimedia, Volume 22
Volume 22, Number 1, January 2020
- Wenwu Zhu: Message From the Outgoing Editor-in-Chief. 1
- Jiebo Luo: Editorial. 2
- S. Chandrakala, S. L. Jayalakshmi: Generative Model Driven Representation Learning in a Hybrid Framework for Environmental Audio Scene and Sound Event Recognition. 3-14
- Kuo-Wei Chen, Ying-Sheng Luo, Yu-Chi Lai, Yan-Lin Chen, Chih-Yuan Yao, Hung-Kuo Chu, Tong-Yee Lee: Image Vectorization With Real-Time Thin-Plate Spline. 15-29
- Joongchol Shin, Minseo Kim, Joonki Paik, Sangkeun Lee: Radiance-Reflectance Combined Optimization and Structure-Guided ℓ0-Norm for Single Image Dehazing. 30-44
- Linwei Zhu, Sam Kwong, Yun Zhang, Shiqi Wang, Xu Wang: Generative Adversarial Network-Based Intra Prediction for Video Coding. 45-58
- Wei Xiao, Xiaolin Huang, Fan He, Jorge Silva, Saba Emrani, Arin Chaudhuri: Online Robust Principal Component Analysis With Change Point Detection. 59-68
- Javier Cubelos, Pablo Carballeira, Jesús Gutiérrez, Narciso García: QoE Analysis of Dense Multiview Video With Head-Mounted Devices. 69-81
- Lixiang Li, Guoqian Wen, Zeming Wang, Yixian Yang: Efficient and Secure Image Communication System Based on Compressed Sensing for IoT Monitoring Applications. 82-95
- Yufei Zha, Tao Ku, Yunqiang Li, Peng Zhang: Deep Position-Sensitive Tracking. 96-107
- Sarala Ghimire, Jae Young Choi, Bumshik Lee: Using Blockchain for Improved Video Integrity Verification. 108-121
- Sijie Mai, Songlong Xing, Haifeng Hu: Locally Confined Modality Fusion Network With a Global Perspective for Multimodal Human Affective Computing. 122-137
- Laura Cabrera Quiros, David M. J. Tax, Hayley Hung: Gestures In-The-Wild: Detecting Conversational Hand Gestures in Crowded Scenes Using a Multimodal Fusion of Bags of Video Trajectories and Body Worn Acceleration. 138-147
- Guoyun Tu, Yanwei Fu, Boyang Li, Jiarui Gao, Yu-Gang Jiang, Xiangyang Xue: A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization. 148-159
- Zhengzheng Tu, Tian Xia, Chenglong Li, Xiaoxiao Wang, Yan Ma, Jin Tang: RGB-T Image Saliency Detection via Collaborative Graph Learning. 160-173
- Jian Zhang, Yuxin Peng: Multi-Pathway Generative Adversarial Hashing for Unsupervised Cross-Modal Retrieval. 174-187
- Yanbin Hao, Chong-Wah Ngo, Benoit Huet: Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking. 188-200
- Xin-Lin Huang, Xiao-Wei Tang, Fei Hu: Dynamic Spectrum Access for Multimedia Transmission Over Multi-User, Multi-Channel Cognitive Radio Networks. 201-214
- Jiale Bai, Zefan Li, Bingbing Ni, Minsi Wang, Xiaokang Yang, Chuanping Hu, Wen Gao: Loopy Residual Hashing: Filling the Quantization Gap for Image Retrieval. 215-228
- Chenggang Yan, Yunbin Tu, Xingzheng Wang, Yongbing Zhang, Xinhong Hao, Yongdong Zhang, Qionghai Dai: STAT: Spatial-Temporal Attention Mechanism for Video Captioning. 229-241
- Shafin Rahman, Salman H. Khan, Nick Barnes: Deep0Tag: Deep Multiple Instance Learning for Zero-Shot Image Tagging. 242-255
- Songtao Wu, Sheng-hua Zhong, Yan Liu: A Novel Convolutional Neural Network for Image Steganalysis With Shared Normalization. 256-270
- Silvia Cascianelli, Gabriele Costante, Alessandro Devo, Thomas A. Ciarfuglia, Paolo Valigi, Mario Luca Fravolini: The Role of the Input in Natural Language Video Description. 271-283
Volume 22, Number 2, February 2020
- Bin Xiao, Ge Ou, Han Tang, Xiuli Bi, Weisheng Li: Multi-Focus Image Fusion by Hessian Matrix Based Decomposition. 285-297
- Bo-Kyeong Kim, Geon-min Kim, Soo-Young Lee: Style-Controlled Synthesis of Clothing Segments for Fashion Image Manipulation. 298-310
- Ke Gu, Zhifang Xia, Junfei Qiao, Weisi Lin: Deep Dual-Channel Neural Network for Image-Based Smoke Detection. 311-323
- Guangxiao Ma, Chenglizhao Chen, Shuai Li, Chong Peng, Aimin Hao, Hong Qin: Salient Object Detection via Multiple Instance Joint Re-Learning. 324-336
- Haijun Liu, Shiguang Wang, Wen Wang, Jian Cheng: Multi-Scale Based Context-Aware Net for Action Detection. 337-348
- Congxuan Zhang, Liyue Ge, Zhen Chen, Ming Li, Wen Liu, Hao Chen: Refined TV-L1 Optical Flow Estimation Using Joint Filtering. 349-364
- Youtian Du, Xue Wang, Yunbo Cui, Hang Wang, Chang Su: Kernel-Based Mixture Mapping for Image and Text Association. 365-379
- Shifeng Zhang, Yiliang Xie, Jun Wan, Hansheng Xia, Stan Z. Li, Guodong Guo: WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild. 380-393
- Ke Xu, Tanfeng Sun, Xinghao Jiang: Video Anomaly Detection and Localization Based on an Adaptive Intra-Frame Classification Network. 394-406
- Ming Cheung, James She: Detecting Social Signals in User-Shared Images for Connection Discovery Using Deep Learning. 407-420
- Yeqiang Qian, Ming Yang, Xu Zhao, Chunxiang Wang, Bing Wang: Oriented Spatial Transformer Network for Pedestrian Detection Using Fish-Eye Camera. 421-431
- Lixing Chen, Linqi Song, Jacob Chakareski, Jie Xu: Collaborative Content Placement Among Wireless Edge Caching Stations With Time-to-Live Cache. 432-444
- Zeyu Xu, Yang Cao, Wei Wang, Tao Jiang, Qian Zhang: Incentive Mechanism for Cooperative Scalable Video Coding (SVC) Multicast Based on Contract Theory. 445-458
- Hao Chen, Xu Zhang, Yiling Xu, Zhan Ma, Wenjun Zhang: Efficient Mobile Video Streaming via Context-Aware RaptorQ-Based Unequal Error Protection. 459-473
- Kefan Xiao, Shiwen Mao, Jitendra K. Tugnait: Robust QoE-Driven DASH Over OFDMA Networks. 474-486
- Cheng Shi, Chi-Man Pun: Multiscale Superpixel-Based Hyperspectral Image Classification Using Recurrent Neural Networks With Stacked Autoencoders. 487-501
- Pau Rodríguez, Diego Velazquez Dorta, Guillem Cucurull, Josep M. Gonfaus, F. Xavier Roca, Jordi Gonzàlez: Pay Attention to the Activations: A Modular Attention Mechanism for Fine-Grained Image Recognition. 502-514
- Wei Zhang, Xuanyu He, Weizhi Lu: Exploring Discriminative Representations for Image Emotion Recognition With CNNs. 515-523
- Lihua Lu, Yao Lu, Ruizhe Yu, Huijun Di, Lin Zhang, Shunzhou Wang: GAIM: Graph Attention Interaction Model for Collective Activity Recognition. 524-539
- Zheng Zhang, Qin Zou, Yuewei Lin, Long Chen, Song Wang: Improved Deep Hashing With Soft Pairwise Similarity for Multi-Label Image Retrieval. 540-553
- Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli: Video Storytelling: Textual Summaries for Events. 554-565
Volume 22, Number 3, March 2020
- Xianjun Xia, Roberto Togneri, Ferdous Sohel, Yuanjun Zhao, David Huang: Multi-Task Learning for Acoustic Event Detection Using Event and Frame Position Information. 569-578
- Ruben Verhack, Thomas Sikora, Glenn Van Wallendael, Peter Lambert: Steered Mixture-of-Experts for Light Field Images and Video: Representation and Coding. 579-593
- Qianru Jiang, Sheng Li, Zhihui Zhu, Huang Bai, Xiongxiong He, Rodrigo C. de Lamare: Design of Compressed Sensing System With Probability-Based Prior Information. 594-609
- Pan Gao, Manoranjan Paul: Rate-Distortion Optimal Joint Texture and Depth Map Coding for 3-D Video Streaming. 610-625
- Zhaoqiang Xia, Xiaopeng Hong, Xingyu Gao, Xiaoyi Feng, Guoying Zhao: Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions. 626-640
- Fan Tang, Weiming Dong, Yiping Meng, Chongyang Ma, Fuzhang Wu, Xinrui Li, Tong-Yee Lee: Image Retargetability. 641-654
- Xiaoting Fan, Jianjun Lei, Yuming Fang, Qingming Huang, Nam Ling, Chunping Hou: Stereoscopic Image Stitching via Disparity-Constrained Warping and Blending. 655-665
- Qiao Liu, Zhenyu He, Xin Li, Yuan Zheng: PTB-TIR: A Thermal Infrared Pedestrian Tracking Benchmark. 666-675
- Bo Yan, Xuejing Niu, Bahetiyaer Bare, Weimin Tan: Semantic Segmentation Guided Pixel Fusion for Image Retargeting. 676-687
- Konstantina Fotiadou, Grigorios Tsagkatakis, Panagiotis Tsakalides: Snapshot High Dynamic Range Imaging via Sparse Representations and Feature Learning. 688-703
- Chongyi Li, Chunle Guo, Jichang Guo, Ping Han, Huazhu Fu, Runmin Cong: PDR-Net: Perception-Inspired Single Image Dehazing Network With Refinement. 704-716
- Wooyoung Jang: MLC STT-MRAM-Aware Memory Subsystem for Smart Image Applications. 717-729
- Jianwen Lou, Yiming Wang, Charles Nduka, Mahyar Hamedi, Ifigeneia Mavridou, Fei-Yue Wang, Hui Yu: Realistic Facial Expression Reconstruction for VR HMD Users. 730-743
- Ching-Ling Fan, Shou-Cheng Yen, Chun-Ying Huang, Cheng-Hsin Hsu: Optimizing Fixation Prediction Using Recurrent Neural Networks for 360° Video Streaming in Head-Mounted Virtual Reality. 744-759
- Chao Ma, Chen Gong, Xiang Li, Xiaolin Huang, Wei Liu, Jie Yang: Toward Making Unsupervised Graph Hashing Discriminative. 760-774
- Lingling Zhang, Minnan Luo, Jun Liu, Xiaojun Chang, Yi Yang, Alexander G. Hauptmann: Deep Top-k Ranking for Image-Sentence Matching. 775-785
- Liping Zhao, Tao Lin, Dongyu Zhang, Kailun Zhou, Shuhui Wang: An Ultra-Low Complexity and High Efficiency Approach for Lossless Alpha Channel Coding. 786-794
- Cheng Zhan, Han Hu, Zhi Wang, Rongfei Fan, Dusit Niyato: Unmanned Aircraft System Aided Adaptive Video Streaming: A Joint Optimization Approach. 795-807
- Lingxiang Wu, Min Xu, Jinqiao Wang, Stuart W. Perry: Recall What You See Continually Using GridLSTM in Image Captioning. 808-818
- Sebastian Agethen, Winston H. Hsu: Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos. 819-829
- Chenggang Yan, Yunbin Tu, Xingzheng Wang, Yongbing Zhang, Xinhong Hao, Yongdong Zhang, Qionghai Dai: Corrections to "STAT: Spatial-Temporal Attention Mechanism for Video Captioning". 830
Volume 22, Number 4, April 2020
- Dayong Wang, Yu Sun, Ce Zhu, Weisheng Li, Frédéric Dufaux: Fast Depth and Inter Mode Prediction for Quality Scalable High Efficiency Video Coding. 833-845
- Deyang Liu, Ping An, Ran Ma, Wenfa Zhan, Xinpeng Huang, Ali Abdullah Yahya: Content-Based Light Field Image Compression Method With Gaussian Process Regression. 846-859
- Zhengxue Cheng, Heming Sun, Masaru Takeuchi, Jiro Katto: Energy Compaction-Based Image Compression Using Convolutional AutoEncoder. 860-873
- Zhaoxia Yin, Youzhi Xiang, Xinpeng Zhang: Reversible Data Hiding in Encrypted Images Based on Multi-MSB Prediction and Huffman Coding. 874-884
- Cheng Deng, Xu Yang, Feiping Nie, Dapeng Tao: Saliency Detection via a Multiple Self-Weighted Graph-Based Manifold Ranking. 885-896
- Chandramani Chaudhary, Poonam Goyal, Dhanashree Nellayi Prasad, Yi-Ping Phoebe Chen: Enhancing the Quality of Image Tagging Using a Visio-Textual Knowledge Base. 897-911
- Badri Narayan Subudhi, Thangaraj Veerakumar, Esakkirajan Sankaralingam, Ashish Ghosh: Kernelized Fuzzy Modal Variation for Local Change Detection From Video Scenes. 912-920
- Xun Liu, Mischa Dohler, Yansha Deng: Vibrotactile Quality Assessment: Hybrid Metric Design Based on SNR and SSIM. 921-933
- Yang Liu, Volkan Kiliç, Jian Guan, Wenwu Wang: Audio-Visual Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking. 934-948
- Xiwen Liu, Xiaoming Tao, Mai Xu, Yafeng Zhan, Jianhua Lu: An EEG-Based Study on Perception of Video Distortion Under Various Content Motion Conditions. 949-960
- Lukas Krasula, Yoann Baveye, Patrick Le Callet: Training Objective Image and Video Quality Estimators Using Multiple Databases. 961-969
- Muwei Jian, Junyu Dong, Maoguo Gong, Hui Yu, Liqiang Nie, Yilong Yin, Kin-Man Lam: Learning the Traditional Art of Chinese Calligraphy via Three-Dimensional Reconstruction and Assessment. 970-979
- Hyunmin Jung, Hyuk-Jae Lee, Chae-Eun Rhee: Flexibly Connectable Light Field System For Free View Exploration. 980-991
- Thanh-Toan Do, Tuan Hoang, Dang-Khoa Le Tan, Anh-Dzung Doan, Ngai-Man Cheung: Compact Hash Code Learning With Binary Deep Neural Network. 992-1004
- Riza Arda Kirmizioglu, A. Murat Tekalp: Multi-Party WebRTC Services Using Delay and Bandwidth Aware SDN-Assisted IP Multicasting of Scalable Video Over 5G Networks. 1005-1015
- Chung-Chi Tsai, Kuang-Jui Hsu, Yen-Yu Lin, Xiaoning Qian, Yung-Yu Chuang: Deep Co-Saliency Detection via Stacked Autoencoder-Enabled Fusion and Self-Trained CNNs. 1016-1031
- Wenqiao Zhang, Siliang Tang, Yanpeng Cao, Shiliang Pu, Fei Wu, Yueting Zhuang: Frame Augmented Alternating Attention Network for Video Question Answering. 1032-1041
- Zewei He, Yanpeng Cao, Lei Du, Baobei Xu, Jiangxin Yang, Yanlong Cao, Siliang Tang, Yueting Zhuang: MRFN: Multi-Receptive-Field Network for Fast and Accurate Single Image Super-Resolution. 1042-1054
- Zhi Jin, Muhammad Zafar Iqbal, Dmytro Bobkov, Wenbin Zou, Xia Li, Eckehard G. Steinbach: A Flexible Deep CNN Framework for Image Restoration. 1055-1068
- Zhe Zhang, Chung-Horng Lung, Marc St-Hilaire, Ioannis Lambadaris: An SDN-Based Caching Decision Policy for Video Caching in Information-Centric Networking. 1069-1083
- Shangfei Wang, Longfei Hao, Qiang Ji: Knowledge-Augmented Multimodal Deep Regression Bayesian Networks for Emotion Video Tagging. 1084-1097
- Tianliang Liu, Junwei Wan, Xiubin Dai, Feng Liu, Quanzeng You, Jiebo Luo: Sentiment Recognition for Short Annotated GIFs Using Visual-Textual Fusion. 1098-1110
- Zhaoqiang Xia, Xiaopeng Hong, Xingyu Gao, Xiaoyi Feng, Guoying Zhao: Corrections to "Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions". 1111
Volume 22, Number 5, May 2020
- Minxiang Ye, Cheng Yang, Vladimir Stankovic, Lina Stankovic, Samuel Cheng: Distinct Feature Extraction for Video-Based Gait Phase Classification. 1113-1125
- Tilo Strutz, Phillip Möller: Screen Content Compression Based on Enhanced Soft Context Formation. 1126-1138
- Chieh-Chi Kao, Yu-Xiang Wang, Jonathan Waltman, Pradeep Sen: Patch-Based Image Hallucination for Super Resolution With Detail Reconstruction From Similar Sample Images. 1139-1152
- Yunxiao Li, Shuai Li, Chenglizhao Chen, Aimin Hao, Hong Qin: Accurate and Robust Video Saliency Detection via Self-Paced Diffusion. 1153-1167
- Yongqing Liang, Xin Li: Reassembling Shredded Document Stripes Using Word-Path Metric and Greedy Composition Optimal Matching Solver. 1168-1181
- Lin Xie, Feifei Lee, Li Liu, Zhong Yin, Qiu Chen: Hierarchical Coding of Convolutional Features for Scene Recognition. 1182-1192
- Ying Wang, Yifan Dong, Songtao Guo, Yuanyuan Yang, Xiaofeng Liao: Latency-Aware Adaptive Video Summarization for Mobile Edge Clouds. 1193-1207
- Xiongli Chai, Feng Shao, Qiuping Jiang, Yo-Sung Ho: MSTGAR: Multioperator-Based Stereoscopic Thumbnail Generation With Arbitrary Resolution. 1208-1219
- Wenfeng Song, Shuai Li, Ji Liu, Aimin Hao, Qinping Zhao, Hong Qin: Contextualized CNN for Scene-Aware Depth Estimation From Single RGB Image. 1220-1233
- Weipeng Hu, Haifeng Hu: Disentangled Spectrum Variations Networks for NIR-VIS Face Recognition. 1234-1248
- Alexandra Covaci, Estêvão Bissoli Saleme, Gebremariam Mesfin, Nadia Hussain, Elahe Kani-Zabihi, Gheorghita Ghinea: How Do We Experience Crossmodal Correspondent Mulsemedia Content? 1249-1258
- Tao Xiang, Ying Yang, Shangwei Guo: Blind Night-Time Image Quality Assessment: Subjective and Objective Approaches. 1259-1272
- Luming Zhang, Jianwei Yin, Ping Li, Yongheng Shang, Roger Zimmermann, Ling Shao: Flickr Image Community Analytics by Deep Noise-Refined Matrix Factorization. 1273-1284
- Yehao Li, Ting Yao, Yingwei Pan, Hongyang Chao, Tao Mei: Deep Metric Learning With Density Adaptivity. 1285-1297
- Yujuan Ding, Wai Keung Wong, Zhihui Lai, Yudong Chen: Study on 2D Feature-Based Hash Learning. 1298-1309
- Yiling Wu, Shuhui Wang, Qingming Huang: Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval. 1310-1322
- Zhiyang Xia, Ping Yi, Yunyu Liu, Bo Jiang, Wei Wang, Ting Zhu: GENPass: A Multi-Source Deep Learning Model for Password Guessing. 1323-1332
- Shuang Qiu, Yao Zhao, Jianbo Jiao, Yunchao Wei, Shikui Wei: Referring Image Segmentation by Generative Adversarial Learning. 1333-1344
- Yabin Zhang, Kui Jia, Zhixin Wang: Part-Aware Fine-Grained Object Categorization Using Weakly Supervised Part Detection Network. 1345-1357
- Dongyu She, Jufeng Yang, Ming-Ming Cheng, Yu-Kun Lai, Paul L. Rosin, Liang Wang: WSCNet: Weakly Supervised Coupled Networks for Visual Sentiment Classification and Detection. 1358-1371
- Ning Xu, Hanwang Zhang, An-An Liu, Weizhi Nie, Yuting Su, Jie Nie, Yongdong Zhang: Multi-Level Policy and Reward-Based Deep Reinforcement Learning Framework for Image Captioning. 1372-1383
Volume 22, Number 6, June 2020
- Yanxiong Li, Mingle Liu, Wucheng Wang, Yuhan Zhang, Qianhua He: Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration. 1385-1394
- Miaohui Wang, Jian Xiong, Long Xu, Wuyuan Xie, King Ngi Ngan, Jing Qin: Rate Constrained Multiple-QP Optimization for HEVC. 1395-1406