Stop the war!
Остановите войну!
for scientists:
default search action
ICME 2019: Shanghai, China
- IEEE International Conference on Multimedia and Expo, ICME 2019, Shanghai, China, July 8-12, 2019. IEEE 2019, ISBN 978-1-5386-9552-4
Oral Sessions
Best Paper Session
- Yu Hao, Yanwei Fu, Yu-Gang Jiang, Qi Tian:
An End-to-End Architecture for Class-Incremental Object Detection with Knowledge Distillation. 1-6 - Zunjie Zhu, Feng Xu, Chenggang Yan, Xinhong Hao, Xiangyang Ji, Yongdong Zhang, Qionghai Dai:
Real-time Indoor Scene Reconstruction with RGBD and Inertial Input. 7-12 - Changde Du, Changying Du, Huiguang He:
Doubly Semi-Supervised Multimodal Adversarial Learning for Classification, Generation and Retrieval. 13-18 - Yihang Lou, Ling-Yu Duan, Yong Luo, Ziqian Chen, Tongliang Liu, Shiqi Wang, Wen Gao:
Towards Digital Retina in Smart Cities: A Model Generation, Utilization and Communication Paradigm. 19-24
O-01: Content Recommendation and Cross-modal Hashing
- Zhenhua Tan, Danke Wu, Liangliang He, Qiuyun Chang, Bin Zhang:
SDP: An Improved Baseline Estimation Model Based On Standard Deviation Proportion. 25-30 - Jie Chen, Yang Liu, Shu Zhao, Yanping Zhang:
Citation Recommendation Based on Weighted Heterogeneous Information Network Containing Semantic Linking. 31-36 - Li Wang, Lei Zhu, En Yu, Jiande Sun, Huaxiang Zhang:
Fusion-Supervised Deep Cross-Modal Hashing. 37-42 - Wei Chen, Nan Pu, Yu Liu, Erwin M. Bakker, Michael S. Lew:
Domain Uncertainty Based On Information Theory for Cross-Modal Hash Retrieval. 43-48
O-02: Development of Multimedia Standards and Related Research
- Eurico Lopes, João Ascenso, Catarina Brites, Fernando Pereira:
Adaptive Plane Projection for Video-Based Point Cloud Coding. 49-54 - Ting Fu, Hao Zhang, Fan Mu, Huanbang Chen:
Fast CU Partitioning Algorithm for H.266/VVC Intra-Frame Coding. 55-60 - Ting Fu, Hao Zhang, Fan Mu, Huanbang Chen:
Two-Stage Fast Multiple Transform Selection Algorithm for VVC Intra Coding. 61-66 - Junru Li, Meng Wang, Li Zhang, Kai Zhang, Hongbin Liu, Shiqi Wang, Siwei Ma, Wen Gao:
History-Based Motion Vector Prediction for Future Video Coding. 67-72
O-03: Classification and Low Shot Learning
- Jingcai Guo, Song Guo:
AMS-SFE: Towards an Alignment of Manifold Structures via Semantic Feature Expansion for Zero-shot Learning. 73-78 - Xuefeng Du, Dexing Zhong, Pengna Li:
Low-Shot Palmprint Recognition Based on Meta-Siamese Network. 79-84 - Zihan Ye, Fan Lyu, Linyan Li, Qiming Fu, Jinchang Ren, Fuyuan Hu:
SR-GAN: Semantic Rectifying Generative Adversarial Network for Zero-shot Learning. 85-90 - Huaxi Huang, Junjie Zhang, Jian Zhang, Qiang Wu, Jingsong Xu:
Compare More Nuanced: Pairwise Alignment Bilinear Network for Few-Shot Fine-Grained Learning. 91-96
O-04: 3D Media Computing
- Gerasimos Arvanitis, Aris S. Lalos, Konstantinos Moustakas:
Feature-Aware and Content-wise Denoising of 3D Static and Dynamic Meshes using Deep Autoencoders. 97-102 - Xinyu Wei, Jun Huang, Xiaoyuan Ma:
Real-Time Monocular Visual SLAM by Combining Points and Lines. 103-108 - Chuanpu Li, Xin Jin, Junke Li, Qionghai Dai:
F-Number Adaptation for Maximizing the Sensor Usage of Light Field Cameras. 109-114 - Xufu Sun, Xin Jin, Pei Wang, Yanqin Chen, Qionghai Dai:
Blind Calibration for Focused Plenoptic Cameras. 115-120
O-05: Special Session "Pedestrian Detection, Tracking and Reidentification in Videos"
- Peizhen Zhang, Feng Zheng, Junlong Du, Jun Zhang, Xiaowei Guo, Wei-Shi Zheng:
Particle Swarm Loss for Lightweight Object Detection. 121-126 - Qiang Fu, Linsen Dong, Ziyuan Liu, Yong Luo, Yonggang Wen, Ying Li, Ling-Yu Duan:
Incorporating Category Taxonomy in Deep Reinforcement Learning Based Image Hashing. 127-132 - Ji Hu, Chenggang Yan, Xin Liu, Jiyong Zhang, Dongliang Peng, Yi Yang:
Truncated Gradient Confidence-Weighted Based Online Learning for Imbalance Streaming Data. 133-138 - Mohamed A. Kassab, Ali Maher, Fathy Elkazzaz, Baochang Zhang:
UAV Target Tracking By Detection via Deep Neural Networks. 139-144
O-06: Special Session "Multimedia Technologies Empowering Retail Experiences"
- Shan An, Zhibiao Huang, Guangfu Che, Xianglong Liu, Xin Ma, Yu Chen:
Quarter-Point Codeword Expansion for Product Quantization. 145-150 - Minghui Zhang, Yumeng Liang, Huadong Ma:
Context-Aware Affective Graph Reasoning for Emotion Recognition. 151-156 - Weibo Zhang, Fuqing Zhu, Jiao Dai, Songlin Hu, Jizhong Han, Tao Guo:
SPL: Exploiting Unlabeled Data for Multi-label Image Classification. 157-162 - Yu Zhou, Shancheng Fang, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang:
MLTS: A Multi-Language Scene Text Spotter. 163-168
O-07: 3D and Low Level Vision
- Xinchen Ye, Mingliang Zhang, Rui Xu, Wei Zhong, Xin Fan, Zhu Liu, Jiaao Zhang:
Unsupervised Monocular Depth Estimation Based on Dual Attention Mechanism and Depth-Aware Loss. 169-174 - Gang Fu, Qing Zhang, Chunxia Xiao:
Towards High-Quality Intrinsic Images in the Wild. 175-180 - Shuosen Guan, Haoxin Li, Wei-Shi Zheng:
Unsupervised Learning for Optical Flow Estimation Using Pyramid Convolution LSTM. 181-186 - Yuan Gao, Robert Bregovic, Atanas P. Gotchev, Reinhard Koch:
MAST: Mask-Accelerated Shearlet Transform for Densely-Sampled Light Field Reconstruction. 187-192
O-08: Object Detection I
- Li Wang, Yongbo Li, Xiangyang Xue:
CODA: Counting Objects via Scale-Aware Adversarial Density Adaption. 193-198 - Chunbiao Zhu, Xing Cai, Kan Huang, Thomas H. Li, Ge Li:
PDNet: Prior-Model Guided Depth-Enhanced Network for Salient Object Detection. 199-204 - Qi Yuan, Bingwang Zhang, Haojie Li, Zhihui Wang, Zhongxuan Luo, Wei Zhong:
Continuous Scale Adaption for Efficient Box-Based Scene Text Detection. 205 - Xiaobao Guo, Jinxing Li, Bingzhi Chen, Guangming Lu:
Mask-Most Net: Mask Approximation Based Multi-oriented Scene Text Detection Network. 206-211
O-09: Emerging Applications of Deep Learning
- Junhao Huang, Lin Zhang, Ying Shen, Huijuan Zhang, Shengjie Zhao, Yukai Yang:
DMPR-PS: A Novel Approach for Parking-Slot Detection Using Directional Marking-Point Regression. 212-217 - Yong-Xiang Lin, Daniel Stanley Tan, Wen-Huang Cheng, Kai-Lung Hua:
Adapting Semantic Segmentation of Urban Scenes via Mask-Aware Gated Discriminator. 218-223 - Maomao Li, Chun Yuan, Zhihui Lin, Zhuobin Zheng, Yangyang Cheng:
Stochastic Video Generation with Disentangled Representations. 224-229 - Jianjin Zhang, Yunbo Wang, Mingsheng Long, Jianmin Wang, Philip S. Yu:
Z-Order Recurrent Neural Networks for Video Prediction. 230-235
O-10: Multimedia Quality Assessment and Enhancement
- Yingru Liu, Dongliang Xie, Xin Wang:
Energy-Based Recurrent Model for Stochastic Modeling of Music. 236-241 - Huaixuan Zhang, Yuhai Lan, Tao Dai, Ruizhi Qiao, Ying Xu, Yao Yao, Shu-Tao Xia:
Residual Frame for Noisy Video Classification According to Perceptual Quality in Convolutional Neural Networks. 242-247 - Guanqun Hou, Yujiu Yang, Jing-Hao Xue:
Residual Dilated Network with Attention for Image Blind Denoising. 248-253 - Zhuopeng Li, Xiaoyan Zhang:
Collaborative Deep Reinforcement Learning for Image Cropping. 254-259
O-11: Multimedia for Society and Health
- Penghui Sun, Hao Liu, Xin Wang, Zhenhua Yu, Suping Wu:
Similarity-Aware Deep Adversarial Learning for Facial Age Estimation. 260-265 - Yinghong Liao, Bin Qiu, Zhuo Su, Ruomei Wang, Xiangjian He:
Learning Transmission Filtering Network for Image-Based Pm2.5 Estimation. 266-271 - Yuan Tian, Xiongkuo Min, Guangtao Zhai, Zhiyong Gao:
Video-Based Early ASD Detection via Temporal Pyramid Networks. 272-277 - Ying Zhang, Yinjia Zhang, Qinpei Zhao, Weixiong Rao:
Automatic User Categorization Through Large Transaction Data. 278-283
O-12: Immersive Media
- Junkun Qi, Wei Hu, Zongming Guo:
Feature Preserving and Uniformity-Controllable Point Cloud Simplification on Graph. 284-289 - Jun Fu, Xiaoming Chen, Zhizheng Zhang, Shilin Wu, Zhibo Chen:
360SRL: A Sequential Reinforcement Learning Approach for ABR Tile-Based 360 Video Streaming. 290-295 - Falah Jabar, João Ascenso, Maria Paula Queluz:
Content-Aware Perspective Projection Optimization for Viewport Rendering of 360° Images. 296-301 - Ziming Wu, Jiabin Guo, Shuangli Zhang, Chen Zhao, Xiaojuan Ma:
An AR Benchmark System for Indoor Planar Object Tracking. 302-307
O-13: 3D and Stereo Computing
- Zhenchao Wu, Kun Li, Yu-Kun Lai, Jingyu Yang:
Global as-Conformal-as-Possible Non-Rigid Registration of Multi-view Scans. 308-313 - Zhengning Wang, Longfei Feng, Fanwei Zeng, Guang Hu, Xiang Zhang, Xia Lv, Fengjun Zhang:
A Light-Weighted Network for Facial Landmark Detection via Combined Heatmap and Coordinate Regression. 314-319 - Xianzhe Xu, Yonghong Hou, Pichao Wang, Zhongyu Jiang, Wanqing Li:
Light Weight Stereo Matching via Deep Extraction and Integration of Low and High Level Information. 320-325 - Hongxin Lin, Zelin Xiao, Yang Tan, Hongyang Chao, Shengyong Ding:
Justlookup: One Millisecond Deep Feature Extraction for Point Clouds By Lookup Tables. 326-331
O-14: Machine Learning Applications in Image and Video Coding I
- Bo Jiang, Xingyue Jiang, Jin Tang, Bin Luo, Shilei Huang:
Multiple Graph Convolutional Networks for Co-Saliency Detection. 332-337 - Lahiru D. Chamain, Sen-ching Samson Cheung, Zhi Ding:
Quannet: Joint Image Compression and Classification Over Channels with Limited Bandwidth. 338-343 - Jiawen Gu, Bichuan Guo, Jiangtao Wen:
High Efficiency Light Field Compression via Virtual Reference and Hierarchical MV-HEVC. 344-349 - Youfa Liu, Bo Du, Lefei Zhang:
Self-Paced Subspace Clustering. 350-355
O-15: Vison, Language and Text Processing
- Xuri Ge, Fuhai Chen, Chen Shen, Rongrong Ji:
Colloquial Image Captioning. 356-361 - Yike Wu, Shiwan Zhao, Jia Chen, Ying Zhang, Xiaojie Yuan, Zhong Su:
Improving Captioning for Low-Resource Languages by Cycle Consistency. 362-367 - Zhuo Lei, Chao Zhang, Qian Zhang, Guoping Qiu:
FrameRank: A Text Processing Approach to Video Summarization. 368-373 - Anna Zhu, Qiyang Zhang, Xiongbo Lu, Shengwu Xiong:
Character Image Synthesis Based on Selected Content and Referenced Style Embedding. 374-379
O-16: Media Classification and Segmentation II
- Yujia Liu, Weiming Zhang, Nenghai Yu:
Query-Free Embedding Attack Against Deep Learning. 380-386 - Zongmin Li, Jun Zhang, Guanlin Li, Yujie Liu, Siyuan Li:
Graph Attention Neural Networks for Point Cloud Recognition. 387-392 - Lu Li, Yang Li, Xiangxiang Xu, Shao-Lun Huang, Lin Zhang:
Maximal Correlation Embedding Network for Multilabel Learning with Missing Labels. 393-398 - Zengyuan Guo, Xinzhu Ma, Haojie Li, Zhihui Wang, Pengbo Zhang:
Self-Adaption Multi-classifier Fusion Networks for Image Recognition. 399-405
O-17: AI for Human Understanding
- Baohan Xu, Yingbin Zheng, Hao Ye, Caili Wu, Heng Wang, Gufei Sun:
Video Emotion Recognition with Concept Selection. 406-411 - Han Zhang, Yonghong Song, Yuanlin Zhang:
Graph Convolutional LSTM Model for Skeleton-Based Action Recognition. 412-417 - Zhongwei Qiu, Kai Qiu, Jianlong Fu, Dongmei Fu:
Learning Recurrent Structure-Guided Attention Network for Multi-person Pose Estimation. 418-423 - Zhenying Fang, Suguo Zhu, Jun Yu, Qi Tian:
PCPCAD: Proposal Complementary Action Detector. 424-429
O-18: Image Quality Metrics
- Leida Li, Hancheng Zhu, Sicheng Zhao, Guiguang Ding, Hongyan Jiang, Allen Tan:
Personality Driven Multi-task Learning for Image Aesthetic Assessment. 430-435 - Chen Bai, Amy R. Reibman:
Video Quality Temporal Pooling using a Visibility Measure. 436-441 - Yuming Fang, Yan Zeng, Hanwei Zhu, Guangtao Zhai:
Image Quality Assessment of Multi-exposure Image Fusion for Both Static and Dynamic Scenes. 442-447 - Sumei Li, Jianwei Xue, Yongtian Han:
No-Reference Stereoscopic Image Quality Assessment Based on Local to Global Feature Regression. 448-453
O-19: Multimedia Recommendations
- Wenmian Yang, Wenyuan Gao, Xiaojie Zhou, Weijia Jia, Shaohua Zhang, Yutao Luo:
Herding Effect Based Attention for Personalized Time-Sync Video Recommendation. 454-459 - Shang Liu, Zhenzhong Chen:
Sequential Behavior Modeling for Next Micro-Video Recommendation with Collaborative Transformer. 460-465 - Dawei Liu, Ying Cao, Rynson W. H. Lau, Antoni B. Chan:
ButtonTips: Design Web Buttons with Suggestions. 466-471 - Shengjie Ma, Zheng-Jun Zha, Feng Wu:
Knowing User Better: Jointly Predicting Click-Through and Playtime for Micro-Video. 472-477
O-20: Search and Retrieval
- Xin Wen, Zhizhong Han, Xinyu Yin, Yu-Shen Liu:
Adversarial Cross-Modal Retrieval via Learning and Transferring Single-Modal Similarities. 478-483 - Zekun Li, Zeyu Cui, Shu Wu, Xiaoyu Zhang, Liang Wang:
Semi-Supervised Compatibility Learning Across Categories for Clothing Matching. 484-489 - Kevin Lin, Fan Yang, Qiaosong Wang, Robinson Piramuthu:
Adversarial Learning for Fine-Grained Image Search. 490-495 - Lei Qi, Jing Huo, Lei Wang, Yinghuan Shi, Yang Gao:
A Mask Based Deep Ranking Neural Network for Person Retrieval. 496-501
O-21: Media Understanding
- Kunal Swami, Kaushik Raghavan, Nikhilanj Pelluri, Rituparna Sarkar, Pankaj Bajpai:
DISCO: Depth Inference from Stereo using Context. 502-507 - Yunian Chen, Yanjie Wang, Yang Zhang, Yanwen Guo:
PANet: A Context Based Predicate Association Network for Scene Graph Generation. 508-513 - Aming Wu, Yahong Han, Quanxin Zhang, Xiaohui Kuang:
Untargeted Adversarial Attack via Expanding the Semantic Gap. 514-519 - Yen-Wei Chang, Wen-Hsiao Peng:
Learning Goal-Oriented Visual Dialog Agents: Imitating and Surpassing Analytic Experts. 520-525
O-22: Super-resolution and Enhancement
- Kui Jiang, Zhongyuan Wang, Peng Yi, Junjun Jiang, Guangcheng Wang, Zhen Han, Tao Lu:
GAN-Based Multi-level Mapping Network for Satellite Imagery Super-Resolution. 526-531 - Ren Yang, Xiaoyan Sun, Mai Xu, Wenjun Zeng:
Quality-Gated Convolutional Lstm for Enhancing Compressed Video. 532-537 - Risheng Liu, Minjun Hou, Jinyuan Liu, Xin Fan, Zhongxuan Luo:
Compounded Layer-Prior Unrolling: A Unified Transmission-Based Image Enhancement Framework. 538-543 - Qiang Fu, Wenhan Yang, Ying Li, Jiaying Liu:
Deep Pyramid Variation Learning for Image Interpolation. 544-549
O-23: Pose and Action Recognition II
- Zhangxuan Gu, Jianfu Zhang, Ziqi Pan, Haohua Zhao, Liqing Zhang:
Clothes Keypoints Localization and Attribute Recognition via Prior Knowledge. 550-555 - Yong Su, Zhiyong Feng:
Spatio-Temporal Multi-Factor Discriminant Analysis for Individual Identification. 556-561 - Jianjun Lei, Yalong Jia, Bo Peng, Qingming Huang:
Channel-wise Temporal Attention Network for Video Action Recognition. 562-567 - Qichao Xu, John See, Weiyao Lin:
Localization Guided Fight Action Detection in Surveillance Videos. 568-573
O-24: Image and Video Enhancements I
- Yue Lu, Zhuqing Jiang, Guodong Ju, Liangheng Shen, Aidong Men:
Recursive Multi-Stage Upscaling Network with Discriminative Fusion for Super-Resolution. 574-579 - Yuanfei Huang, Jie Li, Xinbo Gao, Wen Lu, Yanting Hu:
Improving Image Super-Resolution via Feature Re-Balancing Fusion. 580-585 - Jinghui Qin, Ziwei Xie, Yukai Shi, Wushao Wen:
Difficulty-Aware Image Super Resolution via Deep Adaptive Dual-Network. 586-591 - Xiaoting Du, Yuan Zhou, Yanfang Chen, Yeda Zhang, Jianxing Yang, Dou Jin:
Dense-Connected Residual Network for Video Super-Resolution. 592-597
O-25: Face and Person Analysis
- Zhihao Zhang, Liansheng Zhuang, Wengang Zhou, Houqiang Li:
Dynamic Cascaded Regression Network with Reinforcement Learning for Robust Face Alignment. 598-603 - Mengyan Li, Yuechuan Sun, Zhaoyu Zhang, Haonian Xie, Jun Yu:
Deep Learning Face Hallucination via Attributes Transfer and Enhancement. 604-609 - Junjie Zhu, Xibin Zhao, Han Hu, Yue Gao:
Emotion Recognition from Physiological Signals using Multi-Hypergraph Neural Networks. 610-615 - Yue Liao, Si Liu, Tianrui Hui, Chen Gao, Yao Sun, Hefei Ling, Bo Li:
GPS: Group People Segmentation with Detailed Part Inference. 616-621
O-26: Media Classification and Segmentation III
- Zhao-Min Chen, Xiu-Shen Wei, Xin Jin, Yanwen Guo:
Multi-Label Image Recognition with Joint Class-Aware Map Disentangling and Label Correlation Embedding. 622-627 - Zhengtao Tan, Bin Liu, Weihai Li, Nenghai Yu:
Real Time Compressed Video Object Segmentation. 628-633 - Zhihui Wang, Shijie Wang, Pengbo Zhang, Haojie Li, Bo Liu:
Accurate And Fast Fine-Grained Image Classification via Discriminative Learning. 634-639 - Zhong Li, Xin Chen, Wangyiteng Zhou, Yingliang Zhang, Jingyi Yu:
Pose2Body: Pose-Guided Human Parts Segmentation. 640-645
O-27: Image and Video Enhancements II
- Zhan Shu, Mengcheng Cheng, Biao Yang, Zhuo Su, Xiangjian He:
Residual Magnifier: A Dense Information Flow Network for Super Resolution. 646-651 - Xinyu Li, Wei Zhang, Tong Shen, Tao Mei:
Everyone is a Cartoonist: Selfie Cartoonization with Attentive Adversarial Networks. 652-657 - Jichun Li, Ke Li, Bo Yan:
Scale-Aware Deep Network with Hole Convolution for Blind Motion Deblurring. 658-663 - Tie Liu, Mai Xu, Zulin Wang:
Removing Rain in Videos: A Large-Scale Database and a Two-Stream ConvLSTM Approach. 664-669
O-28: Multimedia Learning and Adaptation
- Zhengyuan Pang, Lifeng Sun, Tianchi Huang, Zhi Wang, Shiqiang Yang:
Towards QoS-Aware Cloud Live Transcoding: A Deep Reinforcement Learning Approach. 670-675 - Ding Ma, Xiangqian Wu:
High Speed Recurrent Regression Network for Visual Tracking. 676-681 - Yanmin Shang, Zhezhou Kang, Yanan Cao, Dongjie Zhang, Yang Li, Yangxi Li, Yanbing Liu:
PAAE: A Unified Framework for Predicting Anchor Links with Adversarial Embedding. 682-687 - Ying Li, Lin Cheng, Yaxin Peng, Zhijie Wen, Shihui Ying:
Manifold Alignment and Distribution Adaptation for Unsupervised Domain Adaptation. 688-693
O-29: Person (Re-)Identification and People Detection
- Hui Li, Meng Yang, Zhihui Lai, Weishi Zheng, Zitong Yu:
Pedestrian re-Identification Based on Tree Branch Network with Local and Global Learning. 694-699