default search action
IEEE Transactions on Multimedia, Volume 26
Volume 26, 2024
- Qinwei Xu, Ruipeng Zhang, Ya Zhang, Yiyan Wu, Yanfeng Wang:
Federated Adversarial Domain Hallucination for Privacy-Preserving Domain Generalization. 1-14 - Xingxing Zhang, Shupeng Gui, Jian Jin, Zhenfeng Zhu, Yao Zhao:
ATZSL: Defensive Zero-Shot Recognition in the Presence of Adversaries. 15-27 - Weiqing Lu, Hai-Miao Hu, Jinzuo Yu, Yibo Zhou, Hanzi Wang, Bo Li:
Orientation-Aware Pedestrian Attribute Recognition Based on Graph Convolution Network. 28-40 - Zhuang Li, Leilei Cao, Hongbin Wang, Lihong Xu:
End-to-End Instance-Level Human Parsing by Segmenting Persons. 41-50 - Xun Cai, Qingjie Shi, Yanbo Gao, Shuai Li, Wei Hua, Tian Xie:
A Structure-Preserving and Illumination-Consistent Cycle Framework for Image Harmonization. 51-64 - Saizhe Ding, Jinze Chen, Yang Wang, Yu Kang, Weiguo Song, Jie Cheng, Yang Cao:
E-MLB: Multilevel Benchmark for Event-Based Camera Denoising. 65-76 - Jiang Li, Xiaoping Wang, Guoqing Lv, Zhigang Zeng:
GraphCFC: A Directed Graph Based Cross-Modal Feature Complementation Approach for Multimodal Conversational Emotion Recognition. 77-89 - Zhe Li, Xinyu Wang, Yuliang Liu, Lianwen Jin, Yichao Huang, Kai Ding:
Improving Handwritten Mathematical Expression Recognition via Similar Symbol Distinguishing. 90-102 - Zheng Li, Caili Guo, Zerun Feng, Jenq-Neng Hwang, Zhongtian Du:
Integrating Language Guidance Into Image-Text Matching for Correcting False Negatives. 103-116 - Muqing Deng, Zhuyao Fan, Peng Lin, Xiaoreng Feng:
Human Gait Recognition Based on Frontal-View Sequences Using Gait Dynamics and Deep Learning. 117-126 - Huimin Zeng, Jie Huang, Jiacheng Li, Zhiwei Xiong:
Region-Aware Portrait Retouching With Sparse Interactive Guidance. 127-140 - Jiawei Liu, Weining Wang, Sihan Chen, Xinxin Zhu, Jing Liu:
Sounding Video Generator: A Unified Framework for Text-Guided Sounding Video Generation. 141-153 - Yan-Bo Liu, Guo Cao, Boshan Shi, Yingxiang Hu:
CCANet: A Collaborative Cross-Modal Attention Network for RGB-D Crowd Counting. 154-165 - Wenhan Wu, Wenfeng Yi, Jinghai Li, Maoyin Chen, Xiaoping Zheng:
Automatic Identification of Human Subgroups in Time-Dependent Pedestrian Flow Networks. 166-177 - Ali Köksal, Kenan E. Ak, Ying Sun, Deepu Rajan, Joo Hwee Lim:
Controllable Video Generation With Text-Based Instructions. 190-201 - Bosheng Ding, Ruiheng Zhang, Lixin Xu, Guanyu Liu, Shuo Yang, Yumeng Liu, Qi Zhang:
U2D2Net: Unsupervised Unified Image Dehazing and Denoising Network for Single Hazy Image Enhancement. 202-217 - Zhiwu Qing, Shiwei Zhang, Ziyuan Huang, Xiang Wang, Yuehuan Wang, Yiliang Lv, Changxin Gao, Nong Sang:
MAR: Masked Autoencoders for Efficient Action Recognition. 218-233 - Jinguang Wang, Shengsheng Qian, Jun Hu, Richang Hong:
Positive Unlabeled Fake News Detection via Multi-Modal Masked Transformer Network. 234-244 - Jianxun Lou, Hanhe Lin, Philippa Young, Richard White, Zelei Yang, Susan Cheng Shelmerdine, David Marshall, Emiliano Spezi, Marco Palombo, Hantao Liu:
Predicting Radiologists' Gaze With Computational Saliency Models in Mammogram Reading. 256-269 - Md. Moniruzzaman, Zhaozheng Yin:
Feature Weakening, Contextualization, and Discrimination for Weakly Supervised Temporal Action Localization. 270-283 - Zehui Chen, Chenhongyi Yang, Jiahao Chang, Feng Zhao, Zheng-Jun Zha, Feng Wu:
DDOD: Dive Deeper into the Disentanglement of Object Detector. 284-298 - Kai Zeng, Kejiang Chen, Weiming Zhang, Yaofei Wang:
Upward Robust Steganography Based on Overflow Alleviation. 299-312 - Yukun Su, Jingliang Deng, Ruizhou Sun, Guosheng Lin, Hanjing Su, Qingyao Wu:
A Unified Transformer Framework for Group-Based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection. 313-325 - Jun Wang, Peng Yin, Yuanyun Wang, Wenhui Yang:
CMAT: Integrating Convolution Mixer and Self-Attention for Visual Tracking. 326-338 - Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto Del Bimbo:
Perceptual Quality Improvement in Videoconferencing Using Keyframes-Based GAN. 339-352 - Wanting Zhou, Longteng Kong, Yushan Han, Jie Qin, Zhenan Sun:
Contextualized Relation Predictive Model for Self-Supervised Group Activity Representation Learning. 353-366 - Hua Li, Junyan Liang, Ruiqi Wu, Runmin Cong, Wenhui Wu, Sam Tak-Wu Kwong:
Stereo Superpixel Segmentation via Decoupled Dynamic Spatial-Embedding Fusion Network. 367-378 - Peipei Zhu, Xiao Wang, Lin Zhu, Zhenglong Sun, Wei-Shi Zheng, Yaowei Wang, Changwen Chen:
Prompt-Based Learning for Unpaired Image Captioning. 379-393 - Zhiyu Wang, Chao Yang, Bin Jiang, Junsong Yuan:
A Dual Reinforcement Learning Framework for Weakly Supervised Phrase Grounding. 394-405 - Andong Lu, Zhang Zhang, Yan Huang, Yifan Zhang, Chenglong Li, Jin Tang, Liang Wang:
Illumination Distillation Framework for Nighttime Person Re-Identification and a New Benchmark. 406-419 - Parham Hadikhani, Daphne Teck Ching Lai, Wee-Hong Ong:
Human Activity Discovery With Automatic Multi-Objective Particle Swarm Optimization Clustering With Gaussian Mutation and Game Theory. 420-435 - Quanpeng Song, Jiaxin Li, Si Wu, Hau-San Wong:
A Graph-Based Discriminator Architecture for Multi-Attribute Facial Image Editing. 436-446 - Jianping Gou, Xin He, Lan Du, Baosheng Yu, Wenbai Chen, Zhang Yi:
Hierarchical Locality-Aware Deep Dictionary Learning for Classification. 447-461 - Senmao Ye, Huan Wang, Mingkui Tan, Fei Liu:
Recurrent Affine Transformation for Text-to-Image Synthesis. 462-473 - Weitao You, Juntao Ji, Lingyun Sun, Changyuan Yang, Mi Yu, Shi Chen, Jiayi Yao:
Automatic Generation of Interactive Nonlinear Video for Online Apparel Shopping Navigation. 474-486 - Aijia Yang, Sihao Lin, Chung-Hsing Yeh, Minglei Shu, Yi Yang, Xiaojun Chang:
Context Matters: Distilling Knowledge Graph for Enhanced Object Detection. 487-500 - Qinghua Ren, Qirong Mao, Shijian Lu:
Prototypical Bidirectional Adaptation and Learning for Cross-Domain Semantic Segmentation. 501-513 - Weizhi Nie, Yuru Bao, Yue Zhao, Anan Liu:
Long Dialogue Emotion Detection Based on Commonsense Knowledge Graph Guidance. 514-528 - Ziqi Yuan, Yihe Liu, Hua Xu, Kai Gao:
Noise Imitation Based Adversarial Training for Robust Multimodal Sentiment Analysis. 529-539 - Arbind Agrahari Baniya, Tsz-Kwan Lee, Peter W. Eklund, Sunil Aryal:
Omnidirectional Video Super-Resolution Using Deep Learning. 540-554 - Yufan Hu, Junyu Gao, Changsheng Xu:
Learning Multi-Expert Distribution Calibration for Long-Tailed Video Classification. 555-567 - Yanshan Li, Huajie Liang, Rui Yu:
BI-CAM: Generating Explanations for Deep Neural Networks Using Bipolar Information. 568-580 - Rongtao Xu, Changwei Wang, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang:
Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation. 581-592 - Hezhen Hu, Junfu Pu, Wengang Zhou, Hang Fang, Houqiang Li:
Prior-Aware Cross Modality Augmentation Learning for Continuous Sign Language Recognition. 593-606 - Xiongli Chai, Feng Shao, Qiuping Jiang, Xuejin Wang, Long Xu, Yo-Sung Ho:
Blind Quality Evaluator of Light Field Images by Group-Based Representations and Multiple Plane-Oriented Perceptual Characteristics. 607-622 - Yuxuan Liu, Hongwei Ge, Zhen Wang, Yaqing Hou, Mingde Zhao:
Discriminative Identity-Feature Exploring and Differential Aware Learning for Unsupervised Person Re-Identification. 623-636 - Chunyang Xie, Dongheng Zhang, Zhi Wu, Cong Yu, Yang Hu, Yan Chen:
RPM: RF-Based Pose Machines. 637-649 - Mingliang Zhou, Xingtai Wu, Xuekai Wei, Tao Xiang, Bin Fang, Sam Kwong:
Low-Light Enhancement Method Based on a Retinex Model for Structure Preservation. 650-662 - Zilong Yu, Yunyun Yang, Yongbin Zhu, Bixue Guo, Chun Li:
CS-IntroVAE: Cauchy-Schwarz Divergence-Based Introspective Variational Autoencoder. 663-672 - Shentong Mo, Miao Xin:
BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-Term Pose Forecasting. 673-686 - Songhan He, Dawen Xu, Lin Yang, Weipeng Liang:
Adaptive HEVC Video Steganography With High Performance Based on Attention-Net and PU Partition Modes. 687-700 - Xueping Wang, Min Liu, Fei Wang, Jianhua Dai, An-An Liu, Yaonan Wang:
Relation-Preserving Feature Embedding for Unsupervised Person Re-Identification. 714-723 - Shiqi Lin, Tao Yu, Ruoyu Feng, Xin Li, Xiaoyuan Yu, Lei Xiao, Zhibo Chen:
Local Patch AutoAugment With Multi-Agent Collaboration. 724-736 - Kaipeng Zhang, Yoichi Sato:
Semantic Image Segmentation by Dynamic Discriminative Prototypes. 737-749 - Shaowei Weng, Tangguo Zhu, Tiancong Zhang, Chunyu Zhang:
UCM-Net: A U-Net-Like Tampered-Region-Related Framework for Copy-Move Forgery Detection. 750-763 - Dandan Zhu, Kaiwei Zhang, Nana Zhang, Qiangqiang Zhou, Xiongkuo Min, Guangtao Zhai, Xiaokang Yang:
Unified Audio-Visual Saliency Model for Omnidirectional Videos With Spatial Audio. 764-775 - Vladimir Frants, Sos S. Agaian, Karen Panetta:
QSAM-Net: Rain Streak Removal by Quaternion Neural Network With Self-Attention Module. 789-798 - Renshuai Liu, Yao Cheng, Sifei Huang, Chengyang Li, Xuan Cheng:
Transformer-Based High-Fidelity Facial Displacement Completion for Detailed 3D Face Reconstruction. 799-810 - Jinfu Liu, Xinshun Wang, Can Wang, Yuan Gao, Mengyuan Liu:
Temporal Decoupling Graph Convolutional Network for Skeleton-Based Gesture Recognition. 811-823 - Yuan Sun, Zhenwen Ren, Peng Hu, Dezhong Peng, Xu Wang:
Hierarchical Consensus Hashing for Cross-Modal Retrieval. 824-836 - Yaguang Song, Xiaoshan Yang, Yaowei Wang, Changsheng Xu:
Recovering Generalization via Pre-Training-Like Knowledge Distillation for Out-of-Distribution Visual Question Answering. 837-851 - Ruimin Li, Jiajun Xiang, Feixiang Sun, Ye Yuan, Longwu Yuan, Shuiping Gou:
Multiscale Cross-Modal Homogeneity Enhancement and Confidence-Aware Fusion for Multispectral Pedestrian Detection. 852-863 - Wenjie Li, Juncheng Li, Guangwei Gao, Weihong Deng, Jiantao Zhou, Jian Yang, Guo-Jun Qi:
Cross-Receptive Focused Inference Network for Lightweight Image Super-Resolution. 864-877 - Kedeng Tong, Xin Jin, Yuqing Yang, Chen Wang, Jinshi Kang, Fan Jiang:
Learned Focused Plenoptic Image Compression With Microimage Preprocessing and Global Attention. 890-903 - Ke Zhang, Hanliang Jiang, Jian Zhang, Qingming Huang, Jianping Fan, Jun Yu, Weidong Han:
Semi-Supervised Medical Report Generation via Graph-Guided Hybrid Feature Consistency. 904-915 - Gangjian Zhang, Shikui Wei, Huaxin Pang, Shuang Qiu, Yao Zhao:
Enhance Composed Image Retrieval via Multi-Level Collaborative Localization and Semantic Activeness Perception. 916-928 - Jianjun Xiang, Peng Chen, Yuanjie Dang, Ronghua Liang, Gangyi Jiang:
Pseudo Light Field Image and 4D Wavelet-Transform-Based Reduced-Reference Light Field Image Quality Assessment. 929-943 - Jinyu Wen, Feiwei Qin, Jiao Du, Meie Fang, Xinhua Wei, C. L. Philip Chen, Ping Li:
MsgFusion: Medical Semantic Guided Two-Branch Network for Multimodal Brain Image Fusion. 944-957 - Yuxin Xiang, Dongjie Tang, Rui Huang, Yong Yao, Chao Xie, Qiming Shi, Randy Xu, Mohammad Reza Haghighat, Cathy Bao, Yicheng Gu, Zhengwei Qi, Haibing Guan:
CARE: Cloudified Android With Optimized Rendering Platform. 958-971 - Tuan T. Nguyen, Hoang H. Nguyen, Mina Sartipi, Marco Fisichella:
Multi-Vehicle Multi-Camera Tracking With Graph-Based Tracklet Features. 972-983 - Geng Chen, Huazhu Fu, Tao Zhou, Guobao Xiao, Keren Fu, Yong Xia, Yanning Zhang:
Fusion-Embedding Siamese Network for Light Field Salient Object Detection. 984-994 - Bing Cao, Haifang Cao, Jiaxu Liu, Pengfei Zhu, Changqing Zhang, Qinghua Hu:
Autoencoder-Based Collaborative Attention GAN for Multi-Modal Image Synthesis. 995-1010 - Jiesheng Wu, Fangwei Hao, Weiyun Liang, Jing Xu:
Transformer Fusion and Pixel-Level Contrastive Learning for RGB-D Salient Object Detection. 1011-1026 - Tao Xie, Li Wang, Ke Wang, Ruifeng Li, Xinyu Zhang, Haoming Zhang, Linqi Yang, Huaping Liu, Jun Li:
FARP-Net: Local-Global Feature Aggregation and Relation-Aware Proposals for 3D Object Detection. 1027-1040 - Shuyue Lan, Zhilu Wang, Ermin Wei, Amit K. Roy-Chowdhury, Qi Zhu:
Collaborative Multi-Agent Video Fast-Forwarding. 1041-1054 - Shulei Ji, Xinyu Yang:
EmoMusicTV: Emotion-Conditioned Symbolic Music Generation With Hierarchical Transformer VAE. 1076-1088 - Yuqi Zhang, Qi Qian, Hongsong Wang, Chong Liu, Weihua Chen, Fan Wang:
Graph Convolution Based Efficient Re-Ranking for Visual Retrieval. 1089-1101 - Zhenguo Yang, Zhuopan Yang, Zhiwei Guo, Zehang Lin, Haizhong Zhu, Qing Li, Wenyin Liu:
Towards Temporal Event Detection: A Dataset, Benchmarks and Challenges. 1102-1113 - Chengrui Zhang, Junxin Chen, Dongming Chen, Wei Wang, Yushu Zhang, Yicong Zhou:
Exploiting Substitution Box for Cryptanalyzing Image Encryption Schemes With DNA Coding and Nonlinear Dynamics. 1114-1128 - Weizhi Nie, Xin Wen, Jing Liu, Jiawei Chen, Jiancan Wu, Guoqing Jin, Jing Lu, An-An Liu:
Knowledge-Enhanced Causal Reinforcement Learning Model for Interactive Recommendation. 1129-1142 - Wei Zhou, Weitao Jiang, Dihu Chen, Haifeng Hu, Tao Su:
Mining Semantic Information With Dual Relation Graph Network for Multi-Label Image Classification. 1143-1157 - Lin Zhao, Hui Zhou, Xinge Zhu, Xiao Song, Hongsheng Li, Wenbing Tao:
LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation. 1158-1168 - Zhaoyi Li, Ping Zhong, Jiawei Huang, Feng Gao, Jian-Xin Wang:
Achieving QoE Fairness in Bitrate Allocation of 360° Video Streaming. 1169-1178 - Feifei Ding, Jianjun Li, Wanyong Tian, Shanqing Zhang, Wenqiang Yuan:
Unsupervised Domain Adaptation via Risk-Consistent Estimators. 1179-1187 - Jian Xiao, Xiaojun Bi:
Model-Guided Generative Adversarial Networks for Unsupervised Fine-Grained Image Generation. 1188-1199 - Jiayuan Sun, Luping Ji, Jiewen Zhu:
Shared Coupling-Bridge Scheme for Weakly Supervised Local Feature Learning. 1200-1212 - Kangle Wu, Jun Huang, Yong Ma, Fan Fan, Jiayi Ma:
Cycle-Retinex: Unpaired Low-Light Image Enhancement via Retinex-Inline CycleGAN. 1213-1228 - Yuanyuan Shi, Xiaolong Fu, Yunan Li, Kaibin Miao, Xiangzeng Liu, Bocheng Zhao, Qiguang Miao:
A Semi-Supervised Underexposed Image Enhancement Network With Supervised Context Attention and Multi-Exposure Fusion. 1229-1243 - Theyab A. Alotaibi, Ishtiaq Rasool Khan, Farid Bourennani:
Quality Assessment of Tone-Mapped Images Using Fundamental Color and Structural Features. 1244-1254 - Bowen Yuan, Yefei Sheng, Bing-Kun Bao, Yi-Ping Phoebe Chen, Changsheng Xu:
Semantic Distance Adversarial Learning for Text-to-Image Synthesis. 1255-1266 - Weitao Feng, Lei Bai, Yongqiang Yao, Weihao Gan, Wei Wu, Wanli Ouyang:
Similarity- and Quality-Guided Relation Learning for Joint Detection and Tracking. 1267-1280 - Inske Groenen, Stevan Rudinac, Marcel Worring:
PanorAMS: Automatic Annotation for Detecting Objects in Urban Context. 1281-1294 - Jian Zhu, Hanli Wang, Bin He:
Multi-Modal Structure-Embedding Graph Transformer for Visual Commonsense Reasoning. 1295-1305 - Lei Ma, Hanyu Hong, Fanman Meng, Qingbo Wu, Jinmeng Wu:
Deep Progressive Asymmetric Quantization Based on Causal Intervention for Fine-Grained Image Retrieval. 1306-1318 - Jianping Gou, Nannan Xie, Yunhao Yuan, Lan Du, Weihua Ou, Zhang Yi:
Reconstructed Graph Constrained Auto-Encoders for Multi-View Representation Learning. 1319-1332 - Shuai Xiao, Guipeng Lan, Jiachen Yang, Wen Lu, Qinggang Meng, Xinbo Gao:
MCS-GAN: A Different Understanding for Generalization of Deep Forgery Detection. 1333-1345 - Yanxiong Li, Wenchang Cao, Wei Xie, Jialong Li, Emmanouil Benetos:
Few-Shot Class-Incremental Audio Classification Using Dynamically Expanded Classifier With Self-Attention Modified Prototypes. 1346-1360 - Jiamin Zhuang, Jing Yu, Yang Ding, Xiangyan Qu, Yue Hu:
Towards Fast and Accurate Image-Text Retrieval With Self-Supervised Fine-Grained Alignment. 1361-1372 - Quan Wang, Sheng Li, Zichi Wang, Xinpeng Zhang, Guorui Feng:
Multi-Source Style Transfer via Style Disentanglement Network. 1373-1383 - Yan Dai, Xiaojia Chen, Xuanhan Wang, Minghui Pang, Lianli Gao, Heng Tao Shen:
ReSParser: Fully Convolutional Multiple Human Parsing With Representative Sets. 1384-1394 - Tianli Sun, Haonan Chen, Guosheng Hu, Lianghua He, Cairong Zhao:
Explainability of Speech Recognition Transformers via Gradient-Based Attention Visualization. 1395-1406 - Zhongyu Bai, Hongli Xu, Xiangyue Zhang, Qichuan Ding:
GCSANet: Arbitrary Style Transfer With Global Context Self-Attentional Network. 1407-1420 - Ruixuan Cong, Hao Sheng, Da Yang, Zhenglong Cui, Rongshan Chen:
Exploiting Spatial and Angular Correlations With Deep Efficient Transformers for Light Field Image Super-Resolution. 1421-1435 - Lei Jin, Xiaojuan Wang, Xuecheng Nie, Wendong Wang, Yandong Guo, Shuicheng Yan, Jian Zhao:
Rethinking the Person Localization for Single-Stage Multi-Person Pose Estimation. 1436-1447 - Lvlong Lai, Jian Chen, Qingyao Wu:
Zero-Shot Single-View Point Cloud Reconstruction via Cross-Category Knowledge Transferring. 1448-1459 - Liyun Zuo, Baoyan Wang, Lei Zhang, Jun Xu, Xiantong Zhen:
Variational Neuron Shifting for Few-Shot Image Classification Across Domains. 1460-1473 - Qing Yu, Go Irie, Kiyoharu Aizawa:
Self-Labeling Framework for Open-Set Domain Adaptation With Few Labeled Samples. 1474-1487 - Xiaotian Wu, Xinjie Feng:
Size Invariant Visual Cryptography Schemes With Evolving Threshold Access Structures. 1488-1503 - Bo Jiang, Shuxian Luo, Xiao Wang, Chuanfu Li, Jin Tang:
AMatFormer: Efficient Feature Matching via Anchor Matching Transformer. 1504-1515