


default search action
IEEE Transactions on Multimedia, Volume 23
Volume 23, 2021
- Fei Tao

, Carlos Busso
:
End-to-End Audiovisual Speech Recognition System With Multitask Learning. 1-11 - Hadi Hadizadeh

, Ivan V. Bajic
:
Soft Video Multicasting Using Adaptive Compressed Sensing. 12-25 - Angeliki V. Katsenou

, Goce Dimitrov, Di Ma
, David R. Bull
:
BVI-SynTex: A Synthetic Video Texture Dataset for Video Compression and Quality Assessment. 26-38 - Chuanmin Jia

, Falei Luo
, Xinfeng Zhang
, Shiqi Wang
, Shanshe Wang
, Siwei Ma
:
Fast Non-Local Adaptive In-Loop Filter Optimization on GPU. 39-51 - Wenguang He

, Zhanchuan Cai
, Yaomin Wang
:
High-Fidelity Reversible Image Watermarking Based on Effective Prediction Error-Pairs Modification. 52-63 - Kai Liu

, Lei Gao
, Naimul Mefraz Khan
, Lin Qi
, Ling Guan
:
A Multi-Stream Graph Convolutional Networks-Hidden Conditional Random Field Model for Skeleton-Based Action Recognition. 64-76 - André F. R. Guarda

, Nuno M. M. Rodrigues
, Fernando Pereira
:
Constant Size Point Cloud Clustering: A Compact, Non-Overlapping Solution. 77-91 - Ji Zhang

, Kuizhi Mei
, Yu Zheng, Jianping Fan
:
Integrating Part of Speech Guidance for Image Captioning. 92-104 - Meihui Li

, Lingbing Peng, Tianfu Wu
, Zhenming Peng
:
A Bottom-Up and Top-Down Integration Framework for Online Object Tracking. 105-119 - Shengjing Tian

, Xiuping Liu
, Meng Liu
, Shuhua Li
, Baocai Yin:
Siamese Tracking Network With Informative Enhanced Loss. 120-132 - Huijing Zhan

, Chenyu Yi
, Boxin Shi
, Jie Lin
, Ling-Yu Duan
, Alex C. Kot
:
Pose-Normalized and Appearance-Preserved Street-to-Shop Clothing Image Generation and Feature Learning. 133-144 - Weipeng Hu

, Haifeng Hu
:
Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition. 145-160 - Qian Bao

, Wu Liu
, Yuhao Cheng, Boyan Zhou, Tao Mei
:
Pose-Guided Tracking-by-Detection: Robust Multi-Person Pose Tracking. 161-175 - Yifei Huang

, Sheng Qiu, Changbo Wang
, Chenhui Li
:
Learning Representations for High-Dynamic-Range Image Color Transfer in a Self-Supervised Way. 176-188 - Qing Zhang, Yongwei Nie, Lei Zhu

, Chunxia Xiao
, Wei-Shi Zheng
:
Enhancing Underexposed Photos Using Perceptually Bidirectional Similarity. 189-202 - Nanjun Li, Faliang Chang

, Chunsheng Liu
:
Spatial-Temporal Cascade Autoencoder for Video Anomaly Detection in Crowded Scenes. 203-215 - Boyue Wang

, Yongli Hu
, Junbin Gao
, Yanfeng Sun
, Fujiao Ju, Baocai Yin
:
Learning Adaptive Neighborhood Graph on Grassmann Manifolds for Video/Image-Set Subspace Clustering. 216-227 - Rui Wang

, Xiao-Jun Wu
, Josef Kittler
:
Graph Embedding Multi-Kernel Metric Learning for Image Set Classification With Grassmannian Manifold-Valued Features. 228-242 - Luca Rossetto

, Ralph Gasser
, Jakub Lokoc
, Werner Bailer
, Klaus Schoeffmann, Bernd Münzer
, Tomás Soucek, Phuong Anh Nguyen
, Paolo Bolettieri, Andreas Leibetseder, Stefanos Vrochidis
:
Interactive Video Retrieval in the Age of Deep Learning - Detailed Evaluation of VBS 2019. 243-256 - Ke Li

, Yuxia Wu, Yao Xue
, Xueming Qian
:
Viewpoint Recommendation Based on Object-Oriented 3D Scene Reconstruction. 257-267 - Haoran An

, Hai-Miao Hu
, Yuanfang Guo
, Qianli Zhou, Bo Li
:
Hierarchical Reasoning Network for Pedestrian Attribute Recognition. 268-280 - Shizhou Zhang

, Qi Zhang, Yifei Yang, Xing Wei
, Peng Wang
, Bingliang Jiao, Yanning Zhang
:
Person Re-Identification in Aerial Imagery. 281-291 - Li Liu

, Gang Feng, Denis Beautemps
, Xiao-Ping Zhang
:
Re-Synchronization Using the Hand Preceding Model for Multi-Modal Fusion in Automatic Continuous Cued Speech Recognition. 292-305 - Zhuo Li, Hai-Miao Hu

, Wei Zhang
, Shiliang Pu, Bo Li
:
Spectrum Characteristics Preserved Visible and Near-Infrared Image Fusion Algorithm. 306-319 - Leida Li

, Yu Zhou
, Jinjian Wu
, Fu Li
, Guangming Shi
:
Quality Index for View Synthesis by Measuring Instance Degradation and Global Appearance. 320-332 - Conor Keighrey

, Ronan Flynn
, Siobhan Murray, Niall Murray:
A Physiology-Based QoE Comparison of Interactive Augmented Reality, Virtual Reality and Tablet-Based Applications. 333-341 - Yi Xu, Xianglong Liu

, Binshuai Wang, Renshuai Tao
, Ke Xia, Xianbin Cao
:
Fast Nearest Subspace Search via Random Angular Hashing. 342-352 - Yan Wu, Xianglong Liu

, Haotong Qin
, Ke Xia, Sheng Hu, Yuqing Ma
, Meng Wang
:
Boosting Temporal Binary Coding for Large-Scale Video Search. 353-364 - Shintami Chusnul Hidayati, Ting Wei Goh, Ji-Sheng Gary Chan

, Cheng-Chun Hsu, John See
, Lai-Kuan Wong
, Kai-Lung Hua
, Yu Tsao
, Wen-Huang Cheng
:
Dress With Style: Learning Style From Joint Deep Embedding of Clothing Styles and Body Shapes. 365-377 - Xueming Qian

, Yuxia Wu, Mingdi Li, Yayun Ren, Shuhui Jiang
, Zhetao Li
:
LAST: Location-Appearance-Semantic-Temporal Clustering Based POI Summarization. 378-390 - Hajar Emami

, Majid Moradi Aliabadi
, Ming Dong, Ratna Babu Chinnam
:
SPA-GAN: Spatial Attention GAN for Image-to-Image Translation. 391-401 - Diego Valsesia

, Giulia Fracastoro
, Enrico Magli
:
Learning Localized Representations of Point Clouds With Graph-Convolutional Generative Adversarial Networks. 402-414 - Inwoong Lee

, Doyoung Kim
, Sanghoon Lee
:
3-D Human Behavior Understanding Using Generalized TS-LSTM Networks. 415-428 - Qiang Wang

, Huijie Fan
, Gan Sun
, Weihong Ren
, Yandong Tang
:
Recurrent Generative Adversarial Network for Face Completion. 429-442 - Xiaoheng Jiang

, Li Zhang
, Tianzhu Zhang
, Pei Lv
, Bing Zhou
, Yanwei Pang
, Mingliang Xu
, Changsheng Xu
:
Density-Aware Multi-Task Learning for Crowd Counting. 443-453 - Sheng Zhang, Yuliang Liu

, Lianwen Jin
, Zhongrong Wei, Chunhua Shen
:
OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection. 454-467 - Mengyan Li, Zhaoyu Zhang, Jun Yu

, Chang Wen Chen
:
Learning Face Image Super-Resolution Through Facial Semantic Attribute Transformation and Self-Attentive Structure Enhancement. 468-483 - Xusong Chen

, Dong Liu
, Zhiwei Xiong
, Zheng-Jun Zha
:
Learning and Fusing Multiple User Interest Representations for Micro-Video and Movie Recommendations. 484-496 - Guofei Sun

, Yongkang Wong
, Zhiyong Cheng
, Mohan S. Kankanhalli
, Weidong Geng
, Xiangdong Li:
DeepDance: Music-to-Dance Motion Choreography With Adversarial Learning. 497-509 - Yuqi Gao, Jitao Sang

, Chengpeng Fu
, Zhengjia Wang
, Tongwei Ren
, Changsheng Xu
:
Metadata Connector: Exploiting Hashtag and Tag for Cross-OSN Event Search. 510-523 - Jingcai Guo

, Song Guo
:
A Novel Perspective to Zero-Shot Learning: Towards an Alignment of Manifold Structures via Semantic Feature Expansion. 524-537 - Guangyu Li

, Lina Qiu, Chenguang Yu, Houwei Cao
, Yong Liu
, Can Yang
:
IPTV Channel Zapping Recommendation With Attention Mechanism. 538-549 - Qiubin Lin

, Wenming Cao
, Zhiquan He
, Zhihai He:
Mask Cross-Modal Hashing Networks. 550-558 - Yiling Wu, Shuhui Wang

, Guoli Song
, Qingming Huang
:
Augmented Adversarial Training for Cross-Modal Retrieval. 559-571 - Hao Yang

, Li Liu
, Weidong Min
, Xiaosong Yang, Xin Xiong
:
Driver Yawning Detection Based on Subtle Facial Action Recognition. 572-583 - Hao Chen

, Ming Lu
, Zhan Ma
, Xu Zhang
, Yiling Xu
, Qiu Shen, Wenjun Zhang
:
Learned Resolution Scaling Powered Gaming-as-a-Service at Scale. 584-596 - Qiaokang Xie

, Wengang Zhou
, Guo-Jun Qi
, Qi Tian, Houqiang Li
:
Progressive Unsupervised Person Re-Identification by Tracklet Association With Spatio-Temporal Regularization. 597-610 - Xiaodan Zhang, Xinbo Gao

, Wen Lu
, Lihuo He
, Jie Li:
Beyond Vision: A Multimodal Recurrent Attention Convolutional Neural Network for Unified Image Aesthetic Prediction Tasks. 611-623 - Chang Tang

, Xinwang Liu
, Shan An
, Pichao Wang
:
BR$^2$Net: Defocus Blur Detection Via a Bidirectional Channel Attention Residual Refining Network. 624-635 - Pauline Puteaux

, William Puech
:
A Recursive Reversible Data Hiding in Encrypted Images Method With a Very High Payload. 636-650 - Laizhong Cui

, Dongyuan Su
, Shu Yang
, Zhi Wang
, Zhong Ming
:
TCLiVi: Transmission Control in Live Video Streaming Based on Deep Reinforcement Learning. 651-663 - Xiao Lin

, Lizhuang Ma, Bin Sheng, Zhi-Jie Wang
, Wansheng Chen:
Utilizing Two-Phase Processing With FBLS for Single Image Deraining. 664-676 - Bogdan Ionescu

, Maia Rohm
, Bogdan Boteanu, Alexandru-Lucian Gînsca, Mihai Lupu
, Henning Müller
:
Benchmarking Image Retrieval Diversification Techniques for Social Media. 677-691 - Xuejin Wang

, Qiuping Jiang
, Feng Shao
, Ke Gu
, Guangtao Zhai
, Xiaokang Yang:
Exploiting Local Degradation Characteristics and Global Statistical Properties for Blind Quality Assessment of Tone-Mapped HDR Images. 692-705 - Ohini Kafui Toffa

, Max Mignotte:
A Hierarchical Visual Feature-Based Approach For Image Sonification. 706-715 - Xueshi Hou

, Sujit Dey, Jianzhong Zhang, Madhukar Budagavi:
Predictive Adaptive Streaming to Enable Mobile 360-Degree and VR Experiences. 716-731 - Shaohui Mei

, Mingyang Ma
, Shuai Wan
, Junhui Hou
, Zhiyong Wang
, David Dagan Feng
:
Patch Based Video Summarization With Block Sparse Representation. 732-747 - Minglang Qiao

, Mai Xu
, Zulin Wang, Ali Borji
:
Viewport-Dependent Saliency Prediction in 360° Video. 748-760 - Yijun Cao

, Chuan Lin
, Yong-Jie Li
:
Learning Crisp Boundaries Using Deep Refinement Network and Adaptive Weighting Loss. 761-771 - Yifan Zuo

, Yuming Fang
, Ping An
, Xiwu Shang
, Junnan Yang
:
Frequency-Dependent Depth Map Enhancement via Iterative Depth-Guided Affine Transformation and Intensity-Guided Refinement. 772-783 - Yuan Gao

, Maoguo Gong
, Yu Xie
, Alex Kai Qin
:
An Attention-Based Unsupervised Adversarial Model for Movie Review Spam Detection. 784-796 - Jiachen Yang

, Tianlin Liu
, Bin Jiang
, Wen Lu, Qinggang Meng
:
Panoramic Video Quality Assessment Based on Non-Local Spherical CNN. 797-809 - Yiming Li

, Changhong Fu
, Ziyuan Huang
, Yinqiang Zhang
, Jia Pan
:
Intermittent Contextual Learning for Keyfilter-Aware UAV Object Tracking Using Deep Convolutional Feature. 810-822 - Longyu Yang

, Hanli Wang
, Pengjie Tang, Qinyu Li:
CaptionNet: A Tailor-made Recurrent Neural Network for Generating Image Descriptions. 835-845 - Jiajun Deng

, Yingwei Pan
, Ting Yao
, Wengang Zhou
, Houqiang Li
, Tao Mei
:
Single Shot Video Object Detector. 846-858 - Shiquan Zhang, Xu Zhao

, Liangji Fang:
CAT: Corner Aided Tracking With Deep Regression Network. 859-870 - Zhengguang Zhou

, Wengang Zhou
, Xutao Lv
, Xuan Huang, Xiaoyu Wang
, Houqiang Li
:
Progressive Learning of Low-Precision Networks for Image Classification. 871-882 - Jianyu Yang

, Wu Liu
, Junsong Yuan
, Tao Mei
:
Hierarchical Soft Quantization for Skeleton-Based Human Action Recognition. 883-898 - Shaobo Min

, Xuejin Chen
, Hongtao Xie
, Zheng-Jun Zha
, Yongdong Zhang
:
A Mutually Attentive Co-Training Framework for Semi-Supervised Recognition. 899-910 - Philipp Schulz

, Henrik Klessig
, Meryem Simsek
, Gerhard P. Fettweis
:
Modeling QoE for Buffered Video Streaming in Interference-Limited Cellular Networks. 911-925 - Pengcheng Gao, Ke Lu

, Jian Xue
, Ling Shao
, Jiayi Lyu:
A Coarse-to-Fine Facial Landmark Detection Method Based on Self-attention Mechanism. 926-938 - Lingchen Gu, Ju Liu

, Xiaoxi Liu, Jiande Sun
:
Deep Loss Driven Multi-Scale Hashing Based on Pyramid Connected Network. 939-954 - Yuming Fang

, Jiebin Yan, Rengang Du, Yifan Zuo
, Wenying Wen, Yan Zeng, Leida Li
:
Blind Quality Assessment for Tone-Mapped Images by Analysis of Gradient and Chromatic Statistics. 955-966 - Di Liu, Kao Zhang

, Zhenzhong Chen
:
Attentive Cross-Modal Fusion Network for RGB-D Saliency Detection. 967-981 - Hong Zhong

, Fei Wu, Yan Xu, Jie Cui
:
QoS-Aware Multicast for Scalable Video Streaming in Software-Defined Networks. 982-994 - Hengcan Shi

, Hongliang Li
, Qingbo Wu
, King Ngi Ngan
:
Query Reconstruction Network for Referring Expression Image Segmentation. 995-1007 - Weiling Chen

, Ke Gu
, Tiesong Zhao, Gangyi Jiang, Patrick Le Callet:
Semi-Reference Sonar Image Quality Assessment Based on Task and Visual Perception. 1008-1020 - Weizhi Nie

, Wen-Wu Jia, Wenhui Li
, An-An Liu
, Sicheng Zhao
:
3D Pose Estimation Based on Reinforce Learning for 2D Image-Based 3D Model Retrieval. 1021-1034 - Lei Zhou

, Chen Gong
, Zhi Liu
, Keren Fu
:
SAL: Selection and Attention Losses for Weakly Supervised Semantic Segmentation. 1035-1048 - Jia-Li Yin, Bo-Hao Chen

, Yan-Tsung Peng
, Chung-Chi Tsai
:
Deep Battery Saver: End-to-End Learning for Power Constrained Contrast Enhancement. 1049-1059 - Lei Liu

, Jie Jiang
, Wenjing Jia
, Saeed Amirgholipour, Yi Wang, Michelle Zeibots, Xiangjian He
:
DENet: A Universal Network for Counting Crowd With Varying Densities and Scales. 1060-1068 - Yumo Zhang

, Zhanchuan Cai
, Gangqiang Xiong:
A New Image Compression Algorithm Based on Non-Uniform Partition and U-System. 1069-1082 - Erik Quintanilla, Yogesh S. Rawat

, Andrey Sakryukin, Mubarak Shah
, Mohan S. Kankanhalli
:
Adversarial Learning for Personalized Tag Recommendation. 1083-1094 - Gebremariam Mesfin

, Estêvão Bissoli Saleme
, Oluwakemi Adewunmi Ademoye, Elahe Kani-Zabihi
, Celso A. S. Santos
, Gheorghita Ghinea
:
Less is (Just as Good as) More - an Investigation of Odor Intensity and Hedonic Valence in Mulsemedia QoE using Heart Rate and Eye Tracking. 1095-1105 - Mingliang Zhou

, Xuekai Wei
, Sam Kwong
, Weijia Jia, Bin Fang
:
Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC. 1106-1121 - Huiyu Mo

, Leibo Liu
, Wenping Zhu
, Qiang Li, Shouyi Yin
, Shaojun Wei
:
A 460 GOPS/W Improved Mnemonic Descent Method-Based Hardwired Accelerator for Face Alignment. 1122-1135 - Reza Ghazalian

, Ali Aghagolzadeh
, Seyed Mehdi Hosseini Andargoli
:
Energy Optimization and QoE Satisfaction for Wireless Visual Sensor Networks in Multi Target Tracking Scenario. 823-834 - Ya Lu, Thomai Stathopoulou, Maria F. Vasiloglou

, Stergios Christodoulidis
, Zeno Stanga, Stavroula G. Mougiakakou
:
An Artificial Intelligence-Based System to Assess Nutrient Intake for Hospitalised Patients. 1136-1147 - Jinjian Wu

, Chuanwei Ma, Leida Li
, Weisheng Dong
, Guangming Shi
:
Probabilistic Undirected Graph Based Denoising Method for Dynamic Vision Sensor. 1148-1159 - Xiaoguang Tu

, Jian Zhao
, Mei Xie, Zihang Jiang, Akshaya Balamurugan, Yao Luo, Yang Zhao
, Lingxiao He
, Zheng Ma, Jiashi Feng
:
3D Face Reconstruction From A Single Image Assisted by 2D Face Images in the Wild. 1160-1172 - Xuejin Wang

, Feng Shao
, Qiuping Jiang
, Xiangchao Meng
, Yo-Sung Ho
:
Measuring Coarse-to-Fine Texture and Geometric Distortions for Quality Assessment of DIBR-Synthesized Images. 1173-1186 - Xiangtao Zheng

, Lei Qi, Yutao Ren
, Xiaoqiang Lu
:
Fine-Grained Visual Categorization by Localizing Object Parts With Single Image. 1187-1199 - Yaohui Zhu, Weiqing Min

, Shuqiang Jiang
:
Attribute-Guided Feature Learning for Few-Shot Image Recognition. 1200-1209 - Ruotao Xu

, Yong Xu
, Yuhui Quan
:
Factorized Tensor Dictionary Learning for Visual Tensor Data Completion. 1225-1238 - Min Cao

, Chen Chen
, Hao Dou, Xiyuan Hu
, Silong Peng
, Arjan Kuijper
:
Progressive Bilateral-Context Driven Model for Post-Processing Person Re-Identification. 1239-1251 - Xin Fan

, Shichao Cheng
, Kang Huyan, Minjun Hou, Risheng Liu
, Zhongxuan Luo:
Dual Neural Networks Coupling Data Regression With Explicit Priors for Monocular 3D Face Reconstruction. 1252-1263 - Huasong Zhong

, Jingyuan Chen, Chen Shen
, Hanwang Zhang
, Jianqiang Huang, Xian-Sheng Hua:
Self-Adaptive Neural Module Transformer for Visual Question Answering. 1264-1273 - Zijian Wang

, Zheng Zhang
, Yadan Luo
, Zi Huang
, Heng Tao Shen
:
Deep Collaborative Discrete Hashing With Semantic-Invariant Structure Construction. 1274-1286 - Le Wang

, Xin Lv, Qilin Zhang
, Zhenxing Niu, Nanning Zheng, Gang Hua
:
Object Cosegmentation in Noisy Videos With Multilevel Hypergraph. 1287-1300 - Ting Lan

, Zhanchuan Cai
:
A Novel Image Representation Method Under a Non-Standard Positional Numeral System. 1301-1315 - Yuxin Wang

, Hongtao Xie
, Zhengjun Zha
, Youliang Tian
, Zilong Fu, Yongdong Zhang
:
R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection. 1316-1329 - Aouaidjia Kamel

, Bin Sheng
, Ping Li
, Jinman Kim
, David Dagan Feng
:
Hybrid Refinement-Correction Heatmaps for Human Pose Estimation. 1330-1342 - Bo Jiang

, Zitai Zhou
, Xiao Wang
, Jin Tang, Bin Luo
:
cmSalGAN: RGB-D Salient Object Detection With Cross-View Generative Adversarial Networks. 1343-1353 - Yang Li

, Zhiqun Zhao
, Hao Sun
, Yigang Cen
, Zhihai He:
Snowball: Iterative Model Evolution and Confident Sample Discovery for Semi-Supervised Learning on Very Small Labeled Datasets. 1354-1366 - Thanh Tuan Nguyen

, Thanh Phuong Nguyen
, Frédéric Bouchara:
Prominent Local Representation for Dynamic Textures Based on High-Order Gaussian-Gradients. 1367-1382 - Jing Li

, Hongtao Huo
, Chang Li
, Renhua Wang, Qi Feng:
AttentionFGAN: Infrared and Visible Image Fusion Using Attention-Based Generative Adversarial Networks. 1383-1396 - Junxia Li

, Zefeng Pan
, Qingshan Liu
, Ziyang Wang:
Stacked U-Shape Network With Channel-Wise Attention for Salient Object Detection. 1397-1409 - Fangbing Zhang, Tao Yang

, Linfeng Liu
, Bang Liang, Yi Bai, Jing Li
:
Image-Only Real-Time Incremental UAV Image Mosaic for Multi-Strip Flight. 1410-1425 - Xunxiang Yao

, Qiang Wu
, Peng Zhang
, Fangxun Bao
:
Weighted Adaptive Image Super-Resolution Scheme Based on Local Fractal Feature and Image Roughness. 1426-1441 - Qinghua Ren

, Shijian Lu
, Jinxia Zhang, Renjie Hu
:
Salient Object Detection by Fusing Local and Global Contexts. 1442-1453 - Jianmin Jiang

, Ahmed Fares
, Sheng-Hua Zhong
:
A Brain-Media Deep Framework Towards Seeing Imaginations Inside Brains. 1454-1465 - Yaomin Wang

, Zhanchuan Cai
, Wenguang He
:
High Capacity Reversible Data Hiding in Encrypted Image Based on Intra-Block Lossless Compression. 1466-1473 - Pingyu Wang

, Zhicheng Zhao
, Fei Su
, Yanyun Zhao, Haiying Wang, Lei Yang
, Yang Li:
Deep Multi-Patch Matching Network for Visible Thermal Person Re-Identification. 1474-1488 - Chunwei Tian

, Yong Xu
, Wangmeng Zuo
, Bob Zhang
, Lunke Fei
, Chia-Wen Lin
:
Coarse-to-Fine CNN for Image Super-Resolution. 1489-1502 - Haisheng Su

, Xu Zhao
, Tianwei Lin, Shuming Liu, Zhilan Hu:
Transferable Knowledge-Based Multi-Granularity Fusion Network for Weakly Supervised Temporal Action Detection. 1503-1515 - Ziqing Huang, Shiguang Liu

:
Perceptual Image Hashing With Texture and Invariant Vector Distance for Copy Detection. 1516-1529 - Junquan Liu, Weizhan Zhang

, Shouqin Huang, Haipeng Du
, Qinghua Zheng:
QoE-driven HAS Live Video Channel Placement in the Media Cloud. 1530-1541 - Zongyi Xu

, Wei Chang, Yindi Zhu, Le Dong
, Huiyu Zhou, Qianni Zhang
:
Building High-Fidelity Human Body Models From User-Generated Data. 1542-1556 - Chao Yang

, Xinfeng Zhang
, Ping An
, Liquan Shen
, C.-C. Jay Kuo
:
Blind Image Quality Assessment Based on Multi-scale KLT. 1557-1566 - Ping-Jung Duh, Yu-Cheng Sung, Liang-Yu Fan Chiang, Yung-Ju Chang, Kuan-Wen Chen

:
V-Eye: A Vision-Based Navigation System for the Visually Impaired. 1567-1580 - Shisong Lin

, Mengchao Bai
, Feng Liu
, Linlin Shen
, Yicong Zhou
:
Orthogonalization-Guided Feature Fusion Network for Multimodal 2D+3D Facial Expression Recognition. 1581-1591 - Zihan Zhou, Jing Li

, Yuhui Quan
, Ruotao Xu
:
Image Quality Assessment Using Kernel Sparse Coding. 1592-1604 - Yi-Hsun Lin

, Homer H. Chen
:
Tag Propagation and Cost-Sensitive Learning for Music Auto-Tagging. 1605-1616 - Xinxin Zuo

, Sen Wang
, Jiangbin Zheng
, Weiwei Yu, Minglun Gong
, Ruigang Yang
, Li Cheng
:
SparseFusion: Dynamic Human Avatar Modeling From Sparse RGBD Images. 1617-1629 - Guangliang Zhou

, Yi Yan
, Deming Wang
, Qijun Chen
:
A Novel Depth and Color Feature Fusion Framework for 6D Object Pose Estimation. 1630-1639 - Xingxu Yao, Dongyu She

, Haiwei Zhang
, Jufeng Yang
, Ming-Ming Cheng
, Liang Wang:
Adaptive Deep Metric Learning for Affective Image Retrieval and Classification. 1640-1653 - Jialu Huang

, Jing Liao
, Sam Kwong
:
Semantic Example Guided Image-to-Image Translation. 1654-1665 - Huaxi Huang

, Junjie Zhang
, Jian Zhang
, Jingsong Xu
, Qiang Wu
:
Low-Rank Pairwise Alignment Bilinear Network For Few-Shot Fine-Grained Image Classification. 1666-1680 - Fan Yang

, Ke Yan, Shijian Lu
, Huizhu Jia
, Don Xie, Zongqiao Yu, Xiaowei Guo, Feiyue Huang, Wen Gao:
Part-aware Progressive Unsupervised Domain Adaptation for Person Re-Identification. 1681-1695 - Jiahao Xu

, Hongda Tian
, Zhiyong Wang
, Yang Wang
, Wenxiong Kang
, Fang Chen
:
Joint Input and Output Space Learning for Multi-Label Image Classification. 1696-1707 - Ge Song

, Xiaoyang Tan
:
Real-world Cross-modal Retrieval via Sequential Learning. 1708-1721 - Baoxin Zhao

, Haoyi Xiong
, Jiang Bian, Zhishan Guo
, Cheng-Zhong Xu
, Dejing Dou
:
COMO: Efficient Deep Neural Networks Expansion With COnvolutional MaxOut. 1722-1730 - Wen-Li Wei

, Jen-Chun Lin
, Tyng-Luh Liu
, Hsiao-Rong Tyan, Hsin-Min Wang
, Hong-Yuan Mark Liao:
Learning to Visualize Music Through Shot Sequence for Automatic Concert Video Mashup. 1731-1743 - Zhaoquan Yuan, Siyuan Sun, Lixin Duan

, Changsheng Li, Xiao Wu
, Changsheng Xu:
Adversarial Multimodal Network for Movie Story Question Answering. 1744-1756 - Wen Ji

, H. Vincent Poor
:
Risk Optimization for Revenue-Driven Wireless Video Broadcasting Systems: A Copula-Based Framework. 1757-1771 - Wanru Xu

, Jian Yu, Zhenjiang Miao, Lili Wan
, Yi Tian
, Qiang Ji
:
Deep Reinforcement Polishing Network for Video Captioning. 1772-1784 - Wenya Guo

, Ying Zhang, Xiangrui Cai, Lei Meng
, Jufeng Yang
, Xiaojie Yuan:
LD-MAN: Layout-Driven Multimodal Attention Network for Online News Sentiment Recognition. 1785-1798 - Zhiwang Zhang

, Dong Xu
, Wanli Ouyang
, Luping Zhou
:
Dense Video Captioning Using Graph-Based Sentence Summarization. 1799-1810 - Fang-Yi Chao

, Lu Zhang
, Wassim Hamidouche
, Olivier Déforges:
A Multi-FoV Viewport-Based Visual Saliency Model Using Adaptive Weighting Losses for 360$^\circ$ Images. 1811-1826 - Zhao-Min Chen

, Quan Cui
, Xiu-Shen Wei
, Xin Jin
, Yanwen Guo:
Disentangling, Embedding and Ranking Label Cues for Multi-Label Image Recognition. 1827-1840 - Xin Liu

, Guoying Zhao
:
3D Skeletal Gesture Recognition via Discriminative Coding on Time-Warping Invariant Riemannian Trajectories. 1841-1854 - Zhan Wang

, Lizhi Wang
, Jun Wan
, Hua Huang
:
Shared Low-Rank Correlation Embedding for Multiple Feature Fusion. 1855-1867 - Zhenyu Weng

, Yuesheng Zhu
:
Online Hashing With Bit Selection for Image Retrieval. 1868-1881 - Guoli Song

, Shuhui Wang
, Qingming Huang
, Qi Tian
:
Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data. 1882-1894 - Sanchita Ghose

, John J. Prevost
:
AutoFoley: Artificial Synthesis of Synchronized Sound Tracks for Silent Videos With Deep Learning. 1895-1907 - Chi Ho Cheung

, Lu Sheng
, King Ngi Ngan
:
Motion Compensated Virtual View Synthesis Using Novel Particle Cell. 1908-1923 - Xinyan Zhang

, Peng Gao
, Sunxiangyu Liu
, Kongya Zhao
, Guitao Li
, Liuguo Yin
, Chang Wen Chen
:
Accurate and Efficient Image Super-Resolution via Global-Local Adjusting Dense Network. 1924-1937 - Menglei Zhang, Qiang Ling

:
Supervised Pixel-Wise GAN for Face Super-Resolution. 1938-1950 - Xin Zhong

, Pei-Chi Huang, Spyridon Mastorakis
, Frank Y. Shih:
An Automated and Robust Image Watermarking Scheme Based on Deep Neural Networks. 1951-1961 - Wei-Zhi Nie

, Min-Jie Ren
, An-An Liu
, Zhendong Mao
, Jie Nie
:
M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval. 1962-1976 - Zhaopeng Li

, Qianqian Xu
, Yangbangyan Jiang
, Ke Ma, Xiaochun Cao
, Qingming Huang
:
Neural Collaborative Preference Learning With Pairwise Comparisons. 1977-1989 - Hayeon Kim

, Eun-Cheol Lee
, Yongseok Seo
, Dong-Hyuck Im, In-Kwon Lee
:
Character Detection in Animated Movies Using Multi-Style Adaptation and Visual Attention. 1990-2004 - Maryam Sultana, Arif Mahmood

, Soon Ki Jung
:
Unsupervised Moving Object Detection in Complex Scenes Using Adversarial Regularizations. 2005-2018 - Lei Sang, Min Xu, Shengsheng Qian

, Matt Martin, Peter Li, Xindong Wu
:
Context-Dependent Propagating-Based Video Recommendation in Multimodal Heterogeneous Information Networks. 2019-2032 - Haimin Zhang

, Min Xu:
Weakly Supervised Emotion Intensity Prediction for Recomi/tmi40.htmlgnition of Emotions in Images. 2033-2044 - Hao Liu

, Yulan Guo
, Yanni Ma, Yinjie Lei
, Gongjian Wen:
Semantic Context Encoding for Accurate 3D Point Cloud Segmentation. 2045-2055 - Yuwu Lu

, Wenjing Wang, Chun Yuan
, Xuelong Li
, Zhihui Lai
:
Manifold Transfer Learning via Discriminant Regression Analysis. 2056-2070 - Cigdem Beyan

, Muhammad Shahid
, Vittorio Murino
:
RealVAD: A Real-World Dataset and A Method for Voice Activity Detection by Body Motion Analysis. 2071-2085 - Qiuxia Lai

, Salman H. Khan
, Yongwei Nie
, Hanqiu Sun
, Jianbing Shen
, Ling Shao
:
Understanding More About Human and Machine Attention in Deep Neural Networks. 2086-2099 - Zhenqi Fu

, Feng Shao
, Qiuping Jiang
, Xiangchao Meng
, Yo-Sung Ho
:
Subjective and Objective Quality Assessment for Stereoscopic Image Retargeting. 2100-2113 - Qiao Liu

, Xin Li
, Zhenyu He
, Nana Fan, Di Yuan
, Hongpeng Wang
:
Learning Deep Multi-Level Similarity for Thermal Infrared Object Tracking. 2114-2126 - Yuting Su

, Yuqian Li
, Dan Song
, Anan Liu
, Jie Nie
:
Joint Intermediate Domain Generation and Distribution Alignment for 2D Image-Based 3D Objects Retrieval. 2127-2138 - Yong Du

, Guoqiang Han
, Yinjie Tan
, Chufeng Xiao
, Shengfeng He
:
Blind Image Denoising via Dynamic Dual Learning. 2139-2152 - Vinayak Abrol

, Pulkit Sharma
, Arijit Patra:
Improving Generative Modelling in VAEs Using Multimodal Prior. 2153-2161 - Bo Jiang

, Yuan Zhang, Bin Luo
, Xiaochun Cao
, Jin Tang:
STGL: Spatial-Temporal Graph Representation and Learning for Visual Tracking. 2162-2171 - Dongyang Zhang, Jie Shao

, Zhenwen Liang, Lianli Gao
, Heng Tao Shen:
Large Factor Image Super-Resolution With Cascaded Convolutional Neural Networks. 2172-2184 - Raouf Hamzaoui

, Huansheng Ning, Chonggang Wang, Reza Malekian, Wei Ding:
Guest Editorial Special Section on Hybrid Human-Artificial Intelligence for Multimedia Computing. 2185-2187 - Shuai Liu

, Shuai Wang, Xinyu Liu, Amir H. Gandomi
, Mahmoud Daneshmand
, Khan Muhammad
, Victor Hugo C. de Albuquerque
:
Human Memory Update Strategy: A Multi-Layer Template Update Mechanism for Remote Visual Monitoring. 2188-2198 - Cong Bai

, Hongkai Li, Jinglin Zhang
, Ling Huang, Lu Zhang
:
Unsupervised Adversarial Instance-Level Image Retrieval. 2199-2207 - Dapeng Wu

, Ruili Bao
, Zhidu Li
, Honggang Wang
, Hong Zhang
, Ruyan Wang
:
Edge-Cloud Collaboration Enabled Video Service Enhancement: A Hybrid Human-Artificial Intelligence Scheme. 2208-2221 - Jun Xu

, Yuanyuan Pu, Rencan Nie
, Dan Xu
, Zhengpeng Zhao, Wenhua Qian:
Virtual Try-on Network With Attribute Transformation and Local Rendering. 2222-2234 - Yihao Chen, Bin Tan, Jun Wu

, Zhifeng Zhang, Haoqi Ren:
A Deep Image Coding Scheme With Generative Network to Learn From Correlated Images. 2235-2244 - John Jethro Virtusio

, Jose Jaena Mari Ople, Daniel Stanley Tan
, Muhammad Tanveer
, Neeraj Kumar
, Kai-Lung Hua
:
Neural Style Palette: A Multimodal and Interactive Style Transfer From a Single Style Image. 2245-2258 - Peiguang Jing

, Yuechen Shang, Liqiang Nie
, Yuting Su
, Jing Liu
, Meng Wang
:
Learning Low-Rank Sparse Representations With Robust Relationship Inference for Image Memorability Prediction. 2259-2272 - John Jethro Virtusio, Daniel Stanley Tan

, Wen-Huang Cheng
, Mohammad Tanveer, Kai-Lung Hua
:
Enabling Artistic Control Over Pattern Density and Stroke Strength. 2273-2285 - Yuan-Yu Tsai

:
Separable Reversible Data Hiding for Encrypted Three-Dimensional Models Based on Spatial Subdivision and Space Encoding. 2286-2296 - Sree Ramya S. P. Malladi

, Sundaresh Ram
, Jeffrey J. Rodríguez
:
Image Denoising Using Superpixel-Based PCA. 2297-2309 - Fang Yan

, Yuanjie Zheng
, Jinyu Cong, Liu Liu, Dacheng Tao
, Sujuan Hou
:
Solving Jigsaw Puzzles via Nonconvex Quadratic Programming With the Projected Power Method. 2310-2320 - Jun Hu, Shengsheng Qian

, Quan Fang, Changsheng Xu
:
Heterogeneous Community Question Answering via Social-Aware Multi-Modal Co-Attention Convolutional Matching. 2321-2334 - Chaofan Chen

, Shengsheng Qian
, Quan Fang, Changsheng Xu
:
HAPGN: Hierarchical Attentive Pooling Graph Network for Point Cloud Segmentation. 2335-2346 - Fan Yang

, Yang Wu, Zheng Wang
, Xiang Li, Sakriani Sakti
, Satoshi Nakamura
:
Instance-Level Heterogeneous Domain Adaptation for Limited-Labeled Sketch-to-Photo Retrieval. 2347-2360 - Xiaoling Gu

, Jun Yu
, Yongkang Wong
, Mohan S. Kankanhalli
:
Toward Multi-Modal Conditioned Fashion Image Translation. 2361-2371 - Junxin Chen

, Lei Chen
, Yicong Zhou
:
Universal Chosen-Ciphertext Attack for a Family of Image Encryption Schemes. 2372-2385 - Wei Wang, Junyu Gao, Xiaoshan Yang, Changsheng Xu

:
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval. 2386-2397 - Xiao-Wei Tang

, Xin-Lin Huang
, Fei Hu
:
QoE-Driven UAV-Enabled Pseudo-Analog Wireless Video Broadcast: A Joint Optimization of Power and Trajectory. 2398-2412 - Jie Wu, Tianshui Chen

, Hefeng Wu
, Zhi Yang, Guangchun Luo, Liang Lin
:
Fine-Grained Image Captioning With Global-Local Discriminative Objective. 2413-2427 - Nianchang Huang

, Yi Liu, Qiang Zhang
, Jungong Han
:
Joint Cross-Modal and Unimodal Features for RGB-D Salient Object Detection. 2428-2441 - Jihoon Sung

, Dujeong Lee:
Efficient Design and Control for Network-Assisted Device-to-Device Content Delivery Network. 2442-2456 - Dongxu Wei

, Xiaowei Xu, Haibin Shen
, Kejie Huang
:
GAC-GAN: A General Method for Appearance-Controllable Human Video Motion Transfer. 2457-2470 - Jianshu Zhang

, Jun Du
, Yongxin Yang, Yi-Zhe Song
, Lirong Dai:
SRD: A Tree Structure Based Decoder for Online Handwritten Mathematical Expression Recognition. 2471-2480 - Yinglong Wang

, Dong Gong
, Jie Yang, Qinfeng Shi
, Anton van den Hengel
, Dehua Xie
, Bing Zeng
:
Deep Single Image Deraining via Modeling Haze-Like Effect. 2481-2492 - Jie Wen

, Ke Yan
, Zheng Zhang
, Yong Xu
, Junqian Wang, Lunke Fei
, Bob Zhang
:
Adaptive Graph Completion Based Incomplete Multi-View Clustering. 2493-2504 - Sergio Pezzulli, Maria G. Martini

, Nabajeet Barman
:
Estimation of Quality Scores From Subjective Tests-Beyond Subjects' MOS. 2505-2519 - Changmeng Zheng

, Zhiwei Wu
, Tao Wang
, Yi Cai
, Qing Li
:
Object-Aware Multimodal Named Entity Recognition in Social Media Posts With Adversarial Learning. 2520-2532 - Youqing Xiao, Zhanchuan Cai

, Xixi Yuan
:
YuvConv: Multi-Scale Non-Uniform Convolution Structure Based on YUV Color Model. 2533-2544 - Xiaoyan Zhang

, Zhuopeng Li, Jianmin Jiang
:
Emotion Attention-Aware Collaborative Deep Reinforcement Learning for Image Cropping. 2545-2560 - Siyeong Lee

, So Yeon Jo, Gwon Hwan An
, Suk-Ju Kang
:
Learning to Generate Multi-Exposure Stacks With Cycle Consistency for High Dynamic Range Imaging. 2561-2574 - Jiaming Zhang

, Jitao Sang
, Kaiyuan Xu
, Shangxi Wu
, Xian Zhao
, Yanfeng Sun
, Yongli Hu
, Jian Yu:
Robust CAPTCHAs Towards Malicious OCR. 2575-2587 - Abdelhak Bentaleb

, Ali C. Begen
, Saad Harous
, Roger Zimmermann
:
Data-Driven Bandwidth Prediction Models and Automated Model Selection for Low Latency. 2588-2601 - Xiang Jiang, Shikui Wei

, Ting Liu
, Ruizhen Zhao, Yao Zhao
, Heng Huang:
Blind Image Clustering for Camera Source Identification via Row-Sparsity Optimization. 2602-2613 - Ji Zhu

, Hua Yang
, Weiyao Lin
, Nian Liu
, Jia Wang, Wenjun Zhang
:
Group Re-Identification With Group Context Graph Neural Networks. 2614-2626 - Siwang Zhou

, Yan He, Yonghe Liu
, Chengqing Li, Jianming Zhang
:
Multi-Channel Deep Networks for Block-Based Image Compressive Sensing. 2627-2640 - Yuechi Jiang

, Frank H. F. Leung
:
Vector-Based Feature Representations for Speech Signals: From Supervector to Latent Vector. 2641-2655 - Bo Zhang

, Di Xiao
, Yong Xiang
:
Robust Coding of Encrypted Images via 2D Compressed Sensing. 2656-2671 - Guang Chen

, Can Zhang
, Yuexian Zou
:
AFNet: Temporal Locality-Aware Network With Dual Structure for Accurate and Fast Action Detection. 2672-2682 - Zhedong Zheng

, Tao Ruan
, Yunchao Wei, Yi Yang, Tao Mei
:
VehicleNet: Learning Robust Visual Representation for Vehicle Re-Identification. 2683-2693 - Zeyu Li, Cheng Deng

, Erkun Yang, Dacheng Tao
:
Staged Sketch-to-Image Synthesis via Semi-supervised Generative Adversarial Networks. 2694-2705 - Minglong Xue, Palaiahnakote Shivakumara

, Chao Zhang, Yao Xiao, Tong Lu
, Umapada Pal, Daniel Lopresti
, Zhibo Yang:
Arbitrarily-Oriented Text Detection in Low Light Natural Scene Images. 2706-2720 - Dan Song

, Tianbao Li, Wenhui Li
, Wei-Zhi Nie
, Wu Liu
, An-An Liu
:
Universal Cross-Domain 3D Model Retrieval. 2721-2731 - Tasfia Shermin

, Guojun Lu
, Shyh Wei Teng
, M. Manzur Murshed
, Ferdous Sohel
:
Adversarial Network With Multiple Classifiers for Open Set Domain Adaptation. 2732-2744 - Fan Zhao

, Wenda Zhao
:
Learning Specific and General Realm Feature Representations for Image Fusion. 2745-2756 - Leida Li

, Yixuan Li
, Jinjian Wu
, Lin Ma
, Yuming Fang
:
Quality Evaluation for Image Retargeting With Instance Semantics. 2757-2769 - Bin Fan

, Hongmin Liu
, Hui Zeng
, Jiyong Zhang
, Xin Liu
, Junwei Han
:
Deep Unsupervised Binary Descriptor Learning Through Locality Consistency and Self Distinctiveness. 2770-2781 - Yuhui Wang

, Francesco Gelli, Christian von der Weth, Mohan S. Kankanhalli
:
A Matrix Factorization Based Framework for Fusion of Physical and Social Sensors. 2782-2793 - Yingying Deng, Fan Tang, Weiming Dong

, Chongyang Ma, Feiyue Huang, Oliver Deussen, Changsheng Xu
:
Exploring the Representativity of Art Paintings. 2794-2805 - Li Li

, Zhu Li
, Shan Liu, Houqiang Li
:
Efficient Projected Frame Padding for Video-Based Point Cloud Compression. 2806-2819 - Xiaoxi Gong

, Yuanpeng Liu
, Qiaoyun Wu, Jiayi Huang, Hua Zong, Jun Wang
:
An Accurate, Robust Visual Odometry and Detail-Preserving Reconstruction System. 2820-2832 - Titir Dutta, Anurag Singh, Soma Biswas

:
StyleGuide: Zero-Shot Sketch-Based Image Retrieval Using Style-Guided Image Generation. 2833-2842 - Caixia Liu, Dehui Kong

, Shaofan Wang
, Jinghua Li, Baocai Yin:
DLGAN: Depth-Preserving Latent Generative Adversarial Network for 3D Reconstruction. 2843-2856 - Lichun Wang, Shuang Li

, Shaofan Wang
, Dehui Kong
, Baocai Yin
:
Hardness-Aware Dictionary Learning: Boosting Dictionary for Recognition. 2857-2867 - Navid Mahmoudian Bidgoli

, Thomas Maugey
, Aline Roumy
:
Fine Granularity Access in Interactive Compression of 360-Degree Images Based on Rate-adaptive Channel Codes. 2868-2882 - Zhengzhi Lu

, Guoan Yang
, Junjie Yang
, Yuhao Wang:
An Adaptive Arbitrary Multiresolution Decomposition for Multiscale Geometric Analysis. 2883-2893 - Xin Liu

, Yongbin Sun, Ziwei Liu
, Dahua Lin:
Learning Diverse Fashion Collocation by Neural Graph Filtering. 2894-2901 - Mehmood Nawaz

, Hong Yan
:
Saliency Detection Using Deep Features and Affinity-Based Robust Background Subtraction. 2902-2916 - Yanchao Zhang, Weiqing Min

, Liqiang Nie
, Shuqiang Jiang
:
Hybrid-Attention Enhanced Two-Stream Fusion Network for Video Venue Prediction. 2917-2929 - Lunke Fei

, Bob Zhang
, Lin Zhang
, Wei Jia
, Jie Wen
, Jigang Wu
:
Learning Compact Multifeature Codes for Palmprint Recognition From a Single Training Image per Palm. 2930-2942 - Junpeng Tan, Yukai Shi

, Zhijing Yang
, Caizhen Wen, Liang Lin
:
Unsupervised Multi-View Clustering by Squeezing Hybrid Knowledge From Cross View and Each View. 2943-2956 - Shuai Yang

, Yueyu Hu
, Wenhan Yang
, Lingyu Duan
, Jiaying Liu
:
Towards Coding for Human and Machine Vision: Scalable Face Image Coding. 2957-2971 - Nader Bakir

, Wassim Hamidouche
, Sid Ahmed Fezza
, Khouloud Samrouth, Olivier Déforges
:
Light Field Image Coding Using VVC Standard and View Synthesis Based on Dual Discriminator GAN. 2972-2985 - Jichun Li

, Bo Yan, Qing Lin, Ang Li, Chenxi Ma
:
Motion Blur Removal With Quality Assessment Guidance. 2986-2997 - Xiaoyu Chai

, Jun Chen
, Chao Liang
, Dongshu Xu, Chia-Wen Lin
:
Expression-Aware Face Reconstruction via a Dual-Stream Network. 2998-3012 - Mohammad Akbari

, Jie Liang
, Jingning Han
, Chengjie Tu:
Learned Multi-Resolution Variable-Rate Image Compression With Octave-Based Residual Blocks. 3013-3021 - Zeqing Fu

, Wei Hu
:
Dynamic Point Cloud Inpainting via Spatial-Temporal Graph Learning. 3022-3034 - Gang Li, Xiaochen Wang

, Ruimin Hu
, Huyin Zhang
, Shanfa Ke:
Intelligibility Enhancement Via Normal-to-Lombard Speech Conversion With Long Short-Term Memory Network and Bayesian Gaussian Mixture Model. 3035-3047 - Xu Ma

, Jingda Guo
, Andrew Sansom
, Mara McGuire, Andrew Kalaani, Qi Chen
, Sihai Tang, Qing Yang
, Song Fu
:
Spatial Pyramid Attention for Deep Convolutional Neural Networks. 3048-3058 - Yue Que

, Suli Li
, Hyo Jong Lee:
Attentive Composite Residual Network for Robust Rain Removal from Single Images. 3059-3072 - Feiyu Chen, Jie Shao

, Yonghui Zhang, Xing Xu
, Heng Tao Shen
:
Interclass-Relativity-Adaptive Metric Learning for Cross-Modal Matching and Beyond. 3073-3084 - Jialiang Zhang

, Lixiang Lin
, Jianke Zhu
, Yang Li
, Yun-chen Chen, Yao Hu, Steven C. H. Hoi
:
Attribute-Aware Pedestrian Detection in a Crowd. 3085-3097 - Yu Chen

, Jieyu Zhao, Congwei Shi, Dongdong Yuan:
Mesh Convolution: A Novel Feature Extraction Method for 3D Nonrigid Object Classification. 3098-3111 - Jingchun Cheng

, Yuhui Yuan, Yali Li
, Jingdong Wang
, Shengjin Wang
:
Learning to Segment Video Object With Accurate Boundaries. 3112-3123 - Shijie Yang

, Liang Li
, Shuhui Wang
, Weigang Zhang
, Qingming Huang
, Qi Tian
:
Graph Regularized Encoder-Decoder Networks for Image Representation Learning. 3124-3136 - Bin Wang

, Huifang Niu, Jianchao Zeng
, Guifeng Bai
, Suzhen Lin, Yanbo Wang
:
Latent Representation Learning Model for Multi-Band Images Fusion via Low-Rank and Sparse Embedding. 3137-3152 - Jiaqian Li, Juncheng Li

, Faming Fang
, Fang Li
, Guixu Zhang
:
Luminance-Aware Pyramid Network for Low-Light Image Enhancement. 3153-3165 - Ahmed Khalid

, Ahmed H. Zahran
, Cormac J. Sreenan
:
Optimizing Video QoE for Mobile eMBMS Users in Cellular Networks. 3166-3178 - Peisong He

, Haoliang Li
, Hongxia Wang, Shiqi Wang
, Xinghao Jiang
, Ruimei Zhang
:
Frame-Wise Detection of Double HEVC Compression by Learning Deep Spatio-Temporal Representations in Compression Domain. 3179-3192 - Bo Jiang

, Xingyue Jiang, Jin Tang, Bin Luo
:
Co-Saliency Detection via a General Optimization Model and Adaptive Graph Learning. 3193-3202 - Junyu Gao, Xiaoshan Yang, Yingying Zhang, Changsheng Xu

:
Unsupervised Video Summarization via Relation-Aware Assignment Learning. 3203-3214 - Xinxin Zhang

, Ronggang Wang
, Da Chen
, Yang Zhao, Wen Gao:
Handling Outliers by Robust M-Estimation in Blind Image Deblurring. 3215-3226 - Jianyi Wang

, Mai Xu
, Lai Jiang, Yuhang Song
:
Attention-Based Deep Reinforcement Learning for Virtual Cinematography of 360$^{\circ}$ Videos. 3227-3238 - Tao Wang

, Zexuan Ji
, Jian Yang, Quansen Sun
, Peng Fu:
Global Manifold Learning for Interactive Image Segmentation. 3239-3249 - Fan Wu

, Wang Yang
, Ju Ren
, Feng Lyu
, Peng Yang
, Yaoxue Zhang
, Xuemin Shen
:
NDN-MMRA: Multi-Stage Multicast Rate Adaptation in Named Data Networking WLAN. 3250-3263 - Yuxuan Shi

, Zhen Wei
, Hefei Ling
, Ziyang Wang, Pengfei Zhu
, Jialie Shen
, Ping Li:
Adaptive and Robust Partition Learning for Person Retrieval With Policy Gradient. 3264-3277 - Qi Liu

, Hui Yuan
, Junhui Hou
, Raouf Hamzaoui
, Honglei Su
:
Model-Based Joint Bit Allocation Between Geometry and Color for Video-Based 3D Point Cloud Compression. 3278-3291 - Che Sun

, Yunde Jia
, Hao Song
, Yuwei Wu
:
Adversarial 3D Convolutional Auto-Encoder for Abnormal Event Detection in Videos. 3292-3305 - Zijian Zhang

, Zhou Zhao
, Zhu Zhang
, Zhijie Lin
, Qi Wang, Richang Hong:
Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks. 3306-3317 - Mohamed Azzam

, Wenhao Wu, Wen-ming Cao
, Si Wu
, Hau-San Wong
:
KTransGAN: Variational Inference-Based Knowledge Transfer for Unsupervised Conditional Generative Learning. 3318-3331 - Zan Gao

, Li-Shuai Gao, Hua Zhang, Zhiyong Cheng
, Richang Hong
, Shengyong Chen
:
DCR: A Unified Framework for Holistic/Partial Person ReID. 3332-3345 - Jiawen Liao

, Chun Qi, Jianzhong Cao:
Temporal Constraint Background-Aware Correlation Filter With Saliency Map. 3346-3361 - Yaxiong Wang

, Hao Yang, Xiuxiu Bai
, Xueming Qian
, Lin Ma
, Jing Lu, Biao Li, Xin Fan:
PFAN++: Bi-Directional Image-Text Retrieval With Position Focused Attention Network. 3362-3376 - Yanxiong Li

, Wucheng Wang, Mingle Liu, Zhongjie Jiang, Qianhua He:
Speaker Clustering by Co-Optimizing Deep Representation Learning and Cluster Estimation. 3377-3387 - Wujie Zhou

, Junwei Wu
, Jingsheng Lei, Jenq-Neng Hwang
, Lu Yu
:
Salient Object Detection in Stereoscopic 3D Images Using a Deep Convolutional Residual Autoencoder. 3388-3399 - Haofeng Zhang

, Yifan Gu, Yazhou Yao
, Zheng Zhang
, Li Liu, Jian Zhang
, Ling Shao
:
Deep Unsupervised Self-Evolutionary Hashing for Image Retrieval. 3400-3413 - Zhen-Tao Liu

, Abdul Rehman
, Min Wu
, Weihua Cao
, Man Hao:
Speech Personality Recognition Based on Annotation Classification Using Log-Likelihood Distance and Extraction of Essential Audio Features. 3414-3426 - Xiangping Wu

, Qingcai Chen
, Yulun Xiao, Wei Li, Xin Liu
, Baotian Hu:
LCSegNet: An Efficient Semantic Segmentation Network for Large-Scale Complex Chinese Character Recognition. 3427-3440 - Peisen Zhao

, Lingxi Xie
, Ya Zhang
, Qi Tian
:
Universal-to-Specific Framework for Complex Action Recognition. 3441-3453 - Jun Xiao

, Lin Li
, Dejing Xu
, Chengjiang Long
, Jian Shao, Shifeng Zhang, Shiliang Pu
, Yueting Zhuang:
Explore Video Clip Order With Self-Supervised and Curriculum Learning for Video Applications. 3454-3466 - Arnaud Delmotte

, Kenichiro Tanaka
, Hiroyuki Kubo
, Takuya Funatomi
, Yasuhiro Mukaigawa
:
Blind 3D-Printing Watermarking Using Moment Alignment and Surface Norm Distribution. 3467-3482 - Qianqian Wang

, Jiafeng Cheng, Quanxue Gao
, Guoshuai Zhao
, Licheng Jiao
:
Deep Multi-View Subspace Clustering With Unified and Discriminative Learning. 3483-3493 - Ting Bi

, Roisin Lyons
, Grace Fox
, Gabriel-Miro Muntean
:
Improving Student Learning Satisfaction by Using an Innovative DASH-Based Multiple Sensorial Media Delivery Solution. 3494-3505 - Beijing Chen

, Weijin Tan, Gouenou Coatrieux
, Yuhui Zheng
, Yun Qing Shi:
A Serial Image Copy-Move Forgery Localization Scheme With Source/Target Distinguishment. 3506-3517 - Fei Liu

, Jing Liu
, Zhiwei Fang, Richang Hong
, Hanqing Lu:
Visual Question Answering With Dense Inter- and Intra-Modality Interactions. 3518-3529 - Kai Xu

, Longyin Wen, Guorong Li
, Qingming Huang
:
Self-Supervised Deep TripleNet for Video Object Segmentation. 3530-3539 - Ercheng Pei

, Meshia Cédric Oveneke
, Yong Zhao
, Dongmei Jiang
, Hichem Sahli
:
Monocular 3D Facial Expression Features for Continuous Affect Recognition. 3540-3550 - Haozan Liang

, Guihua Wen
, Yang Hu
, Mingnan Luo, Pei Yang
, Yingxue Xu
:
MVANet: Multi-Task Guided Multi-View Attention Network for Chinese Food Recognition. 3551-3561 - Peng Zhang

, Jingsong Xu
, Qiang Wu
, Yan Huang
, Xianye Ben
:
Learning Spatial-Temporal Representations Over Walking Tracklet for Long-Term Person Re-Identification in the Wild. 3562-3576 - Shuai Zheng

, Jian Chen
, Xiao-Ping Zhang
, Yonghong Kuo
:
A New Multihypothesis-Based Compressed Video Sensing Reconstruction System. 3577-3589 - Ying Zheng

, Hongxun Yao
, Xiaoshuai Sun
:
Deep Semantic Parsing of Freehand Sketches With Homogeneous Transformation, Soft-Weighted Loss, and Staged Learning. 3590-3602 - Peiqin Zhuang

, Yali Wang
, Yu Qiao
:
Wildfish++: A Comprehensive Fish Benchmark for Multimedia Research. 3603-3617 - Peng Lu

, Hao Zhang, Xujun Peng, Xiaofu Jin:
Learning the Relation Between Interested Objects and Aesthetic Region for Image Cropping. 3618-3630 - Wanxin Shi

, Chao Wang, Yong Jiang, Qing Li
, Gengbiao Shen, Gabriel-Miro Muntean
:
CoLEAP: Cooperative Learning-Based Edge Scheme With Caching and Prefetching for DASH Video Delivery. 3631-3645 - Jianbo Ouyang

, Wengang Zhou
, Min Wang
, Qi Tian
, Houqiang Li
:
Collaborative Image Relevance Learning for Visual Re-Ranking. 3646-3656 - Yucheng Lu

, Jin-Hyuck Cha, Sekyoung Youm
, Seung-Won Jung
:
Parametric Shape Estimation of Human Body Under Wide Clothing. 3657-3669 - Kaixuan Long, Ying Cui

, Chencheng Ye
, Zhi Liu
:
Optimal Wireless Streaming of Multi-Quality 360 VR Video By Exploiting Natural, Relative Smoothness-Enabled, and Transcoding-Enabled Multicast Opportunities. 3670-3683 - Jixin Liu

, Rong Tan
, Guang Han, Ning Sun
, Sam Kwong
:
Privacy-Preserving In-Home Fall Detection Using Visual Shielding Sensing and Private Information-Embedding. 3684-3699 - Guangtao Zhai

, Yucheng Zhu
, Xiongkuo Min
:
Comparative Perceptual Assessment of Visual Signals Using Free Energy Features. 3700-3713 - Zhangyu Chang

, S.-H. Gary Chan
:
An Approximation Algorithm to Maximize User Capacity for an Auto-Scaling VoD System. 3714-3725 - Kai Zhu

, Yang Cao
, Wei Zhai, Zheng-Jun Zha
:
One-Shot Texture Retrieval Using Global Grouping Metric. 3726-3737 - Zhangxuan Gu

, Li Niu
, Haohua Zhao
, Liqing Zhang
:
Hard Pixel Mining for Depth Privileged Semantic Segmentation. 3738-3751 - Ning Xie

, Qiqi Zhang, Yicong Chen, Ji Hu, Gang Luo, Changsheng Chen
:
Low-Cost Anti-Copying 2D Barcode by Exploiting Channel Noise Characteristics. 3752-3767 - Qiuying Huang

, Zhanchuan Cai
, Ting Lan
:
A New Approach for Character Recognition of Multi-Style Vehicle License Plates. 3768-3777 - Yang Shi

, Xiushan Nie
, Meng Chen
, Li Lian, Yilong Yin
:
Deep Hashing With Weighted Spatial Importance. 3778-3792 - Weizhi Nie

, Minjie Ren
, Jie Nie
, Sicheng Zhao
:
C-GCN: Correlation Based Graph Convolutional Network for Audio-Video Emotion Recognition. 3793-3804 - Selin Nacakli

, A. Murat Tekalp
:
Controlling P2P-CDN Live Streaming Services at SDN-Enabled Multi-Access Edge Datacenters. 3805-3816 - Junghyuk Lee

, Toinon Vigier, Patrick Le Callet
, Jong-Seok Lee
:
Wide Color Gamut Image Content Characterization: Method, Evaluation, and Applications. 3817-3827 - Huibing Wang

, Yang Wang
, Zhao Zhang
, Xianping Fu
, Li Zhuo, Mingliang Xu
, Meng Wang
:
Kernelized Multiview Subspace Analysis By Self-Weighted Learning. 3828-3840 - Mingyang Guan

, Changyun Wen
:
Adaptive Multi-Feature Reliability Re-Determinative Correlation Filter for Visual Tracking. 3841-3852 - Jinyu Chen

, Xianzhuo Luo, Miao Hu
, Di Wu
, Yipeng Zhou
:
Sparkle: User-Aware Viewport Prediction in 360-Degree Video Streaming. 3853-3866 - Qingyang Zhou, Liping Zhao

, Kailun Zhou
, Tao Lin
, Huihui Wang, Shuhui Wang, Mengcao Jiao:
String Prediction for 4: 2: 0 Format Screen Content Coding and Its Implementation in AVS3. 3867-3876 - Qi Yang

, Hao Chen
, Zhan Ma
, Yiling Xu
, Rongjun Tang
, Jun Sun:
Predicting the Perceptual Quality of Point Cloud: A 3D-to-2D Projection-Based Exploration. 3877-3891 - Katsuya Fujii

, Daisuke Sugimura
, Takayuki Hamamoto
:
Hierarchical Group-Level Emotion Recognition. 3892-3906 - Yuan Cao

, Heng Qi
, Jie Gui
, Keqiu Li, Yuan Yan Tang, James Tin-Yau Kwok
:
Learning to Hash With Dimension Analysis Based Quantizer for Image Retrieval. 3907-3918 - Shaobo Min

, Hantao Yao
, Hongtao Xie
, Zheng-Jun Zha
, Yongdong Zhang
:
Domain-Oriented Semantic Embedding for Zero-Shot Learning. 3919-3930 - Mahsa Mesgaran, A. Ben Hamza

:
Anisotropic Graph Convolutional Network for Semi-Supervised Learning. 3931-3942 - Cheng Ma, Jiwen Lu

, Jie Zhou:
Rank-Consistency Deep Hashing for Scalable Multi-Label Image Search. 3943-3956 - Xiaolin Chen

, Xuemeng Song
, Siwei Cui
, Tian Gan
, Zhiyong Cheng
, Liqiang Nie
:
User Identity Linkage Across Social Media via Attentive Time-Aware User Modeling. 3957-3967 - Xianming Lin

, Run Li, Xiawu Zheng, Pai Peng, Yongjian Wu, Feiyue Huang, Rongrong Ji
:
Aggregating Global and Local Visual Representation for Vehicle Re-IDentification. 3968-3977 - Ohini Kafui Toffa

, Max Mignotte
:
Environmental Sound Classification Using Local Binary Pattern and Audio Features Collaboration. 3978-3985 - Na-Young Kim, Je-Won Kang

:
Dynamic Motion Estimation and Evolution Video Prediction Network. 3986-3998 - Fan Qi, Xiaoshan Yang, Changsheng Xu

:
Emotion Knowledge Driven Video Highlight Detection. 3999-4013 - Xiaocui Yang

, Shi Feng, Daling Wang, Yifei Zhang:
Image-Text Multimodal Emotion Classification via Multi-View Attentional Network. 4014-4026 - Zheng Wang

, Jianguo Li, Yu-Gang Jiang
:
Story-driven Video Editing. 4027-4036 - Kyohoon Sim

, Jiachen Yang
, Wen Lu, Xinbo Gao
:
MaD-DLS: Mean and Deviation of Deep and Local Similarity for Image Quality Assessment. 4037-4048 - Alireza Javaheri

, Catarina Brites
, Fernando Pereira
, João Ascenso
:
Point Cloud Rendering After Coding: Impacts on Subjective and Objective Quality. 4049-4064 - Jun Xu

, Zhi-Ang Liu
, Yingkun Hou
, Xiantong Zhen
, Ling Shao
, Ming-Ming Cheng
:
Pixel-Level Non-local Image Smoothing With Objective Evaluation. 4065-4078 - Chaoqun Zheng

, Lei Zhu
, Zhiyong Cheng
, Jingjing Li
, An-An Liu
:
Adaptive Partial Multi-View Hashing for Efficient Social Image Retrieval. 4079-4092 - Kun Lu

, Lihong Zhang
:
TBEFN: A Two-Branch Exposure-Fusion Network for Low-Light Image Enhancement. 4093-4105 - Zhiwen Fang

, Joey Tianyi Zhou
, Yang Xiao
, Yanan Li
, Feng Yang
:
Multi-Encoder Towards Effective Anomaly Detection in Videos. 4106-4116 - Hansung Kim

, Luca Remaggi
, Sam Fowler, Philip J. B. Jackson
, Adrian Hilton
:
Acoustic Room Modelling Using 360 Stereo Cameras. 4117-4130 - Zhao Ren

, Qiuqiang Kong
, Jing Han
, Mark D. Plumbley
, Björn W. Schuller
:
CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification. 4131-4142 - Ziad Al-Halah

, Kristen Grauman:
Modeling Fashion Influence From Photos. 4143-4157 - Libo Zhang

, Dawei Du
, Congcong Li, Yanjun Wu, Tiejian Luo:
Iterative Knowledge Distillation for Automatic Check-Out. 4158-4170 - Haifeng Chen

, Dongmei Jiang
, Hichem Sahli
:
Transformer Encoder With Multi-Modal Multi-Head Attention for Continuous Affect Recognition. 4171-4183 - Chen Zhao

, Wu Gao
, Feiping Nie
:
A Resource-Efficient Parallel Connected Component Labeling Algorithm and Its Hardware Implementation. 4184-4197 - Kai Lv

, Hao Sheng
, Zhang Xiong, Wei Li, Liang Zheng
:
Improving Driver Gaze Prediction With Reinforced Attention. 4198-4207 - Xin Yang

, Zikang Yuan, Dongfu Zhu, Cheng Chi, Kun Li, Chunyuan Liao:
Robust and Efficient RGB-D SLAM in Dynamic Environments. 4208-4219 - M. A. Tugtekin Turan

, Engin Erzin
:
Domain Adaptation for Food Intake Classification With Teacher/Student Learning. 4220-4231 - Yuping Zhang

, Bo Ma, Jiahao Wu, Lianghua Huang
, Jianbing Shen
:
Capturing Relevant Context for Visual Tracking. 4232-4244 - Suiyi Ling

, Jing Li
, Zhaohui Che
, Wei Zhou
, Junle Wang, Patrick Le Callet
:
Re-Visiting Discriminator for Blind Free-Viewpoint Image Quality Assessment. 4245-4258 - Yongqiang Bai

, Zhongjie Zhu
, Gangyi Jiang
, Huifang Sun:
Blind Quality Assessment of Screen Content Images Via Macro-Micro Modeling of Tensor Domain Dictionary. 4259-4271 - Seokjae Lim

, Wonjun Kim
:
DSLR: Deep Stacked Laplacian Restorer for Low-Light Image Enhancement. 4272-4284 - Yufan Hu

, Junyu Gao, Changsheng Xu
:
Learning Dual-Pooling Graph Neural Networks for Few-Shot Video Classification. 4285-4296 - Lvran Chen, Huicheng Zheng

, Zhiwei Yan, Ye Li:
Discriminative Region Mining for Object Detection. 4297-4310 - Yao Chiang

, Chih-Ho Hsu, Hung-Yu Wei
:
Collaborative Social-Aware and QoE-Driven Video Caching and Adaptation in Edge Network. 4311-4325 - Xiaohan Yang

, Fan Li
, Hantao Liu
:
TTL-IQA: Transitive Transfer Learning Based No-Reference Image Quality Assessment. 4326-4340 - Ian Blanes

, Miguel Hernández-Cabronero
, Joan Serra-Sagristà
, Michael W. Marcellin
:
Redundancy and Optimization of tANS Entropy Encoders. 4341-4350 - Xirong Li

, Fangming Zhou
, Chaoxi Xu
, Jiaqi Ji, Gang Yang:
SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries. 4351-4362 - Yuan Zhou

, Ruolin Wang
, Hongru Li, Sun-Yuan Kung
:
Temporal Action Localization Using Long Short-Term Dependency. 4363-4375 - Yuxuan Shi

, Zhen Wei
, Hefei Ling
, Ziyang Wang, Jialie Shen
, Ping Li:
Person Retrieval in Surveillance Videos Via Deep Attribute Mining and Reasoning. 4376-4387 - Sanghyo Park

, Je-Won Kang
:
Fast Multi-Type Tree Partitioning for Versatile Video Coding Using a Lightweight Neural Network. 4388-4399 - Xing Tian

, Wing W. Y. Ng
, Hui Wang
, Sam Kwong
:
Complementary Incremental Hashing With Query-Adaptive Re-Ranking for Image Retrieval. 1210-1224 - Hang Wang

, Youtian Du
, Guangxun Zhang, Zhongmin Cai
, Chang Su:
Learning Fundamental Visual Concepts Based on Evolved Multi-Edge Concept Graph. 4400-4413 - Haijun Liu

, Xiaoheng Tan, Xichuan Zhou
:
Parameter Sharing Exploration and Hetero-Center Triplet Loss for Visible-Thermal Person Re-Identification. 4414-4425 - Yanyuan Qiao

, Chaorui Deng
, Qi Wu
:
Referring Expression Comprehension: A Survey of Methods and Datasets. 4426-4440 - Huaiwen Zhang

, Shengsheng Qian
, Quan Fang, Changsheng Xu
:
Multimodal Disentangled Domain Adaption for Social Media Event Rumor Detection. 4441-4454 - Rania Hassen

, Basak Güleçyüz
, Eckehard G. Steinbach
:
PVC-SLP: Perceptual Vibrotactile-Signal Compression Based-on Sparse Linear Prediction. 4455-4468 - Xingxu Yao, Sicheng Zhao

, Yu-Kun Lai
, Dongyu She
, Jie Liang
, Jufeng Yang
:
APSE: Attention-Aware Polarity-Sensitive Embedding for Emotion-Based Image Retrieval. 4469-4482 - Weidong Zhang

, Qian Zhang
, Wei Zhang
, Jianjun Gu, Yibin Li
:
From Edge to Keypoint: An End-to-End Framework For Indoor Layout Estimation. 4483-4490 - Liangming Pan

, Jingjing Chen
, Shaoteng Liu, Chong-Wah Ngo, Min-Yen Kan
, Tat-Seng Chua
:
A Hybrid Approach for Detecting Prerequisite Relations in Multi-Modal Food Recipes. 4491-4501 - Huisi Wu

, Wei Yan, Ping Li
, Zhenkun Wen:
Deep Texture Exemplar Extraction Based on Trimmed T-CNN. 4502-4514 - An-An Liu

, Yanhui Wang
, Ning Xu
, Weizhi Nie
, Jie Nie
, Yongdong Zhang
:
Adaptively Clustering-Driven Learning for Visual Relationship Detection. 4515-4525 - Pierre R. Lebreton

, Kazuhisa Yamagishi
:
Predicting User Quitting Ratio in Adaptive Bitrate Video Streaming. 4526-4540 - Xu Lu

, Li Liu
, Liqiang Nie
, Xiaojun Chang
, Huaxiang Zhang
:
Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval. 4541-4554 - Xiaolin Xiao

, Yue-Jiao Gong
, Zhongyun Hua
, Wei-Neng Chen
:
On Reliable Multi-View Affinity Learning for Subspace Clustering. 4555-4566

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














