default search action

combined dblp search
author search
venue search
publication search

ask others

32nd MM 2024: Melbourne, VIC, Australia

> Home > Conferences and Workshops > MM

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

- view
  authority control:
- export record
  dblp key:
  - conf/mm/2024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/2024
Jianfei Cai, Mohan S. Kankanhalli, Balakrishnan Prabhakaran, Susanne Boll, Ramanathan Subramanian, Liang Zheng, Vivek K. Singh, Pablo César, Lexing Xie, Dong Xu:
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024 - 1 November 2024. ACM 2024, ISBN 979-8-4007-0686-8

Keynote Talks

- view
  authority control:
- export record
  dblp key:
  - conf/mm/Fung24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Fung24
Pascale Fung:
From Assistants to Agents in the LLM Era. 1
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Huet24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Huet24
Benoit Huet:
Revolutionizing Lung Cancer Diagnostics with eyonis ^TM LCS: Cutting-edge AI/ML Technology-based SaMD for Enhanced Patient Care. 2-3
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Kay24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Kay24
Judy Kay:
Empowering People to Harness and Control their Multimodal Data in Scrutable User models. 4-5
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Luo24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Luo24
Jiebo Luo:
Large Multimodal Models as Social Multimedia Analysis Engines. 6-7

Oral Session 1: Large Language Models & Applications 1

- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiaoLWGTT00L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiaoLWGTT00L24
Haicheng Liao, Yongkang Li, Chengyue Wang, Yanchen Guan, Kahou Tam, Chunlin Tian, Li Li, Chengzhong Xu, Zhenning Li:
When, Where, and What? A Benchmark for Accident Anticipation and Localization with Large Language Models. 8-17
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengD0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengD0L24
Haonan Zheng, Xinyang Deng, Wen Jiang, Wenrui Li:
A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models. 18-27
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FangFLQD0LXCZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FangFLQD0LXCZ024
Xiang Fang, Wanlong Fang, Daizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Renfu Li, Zichuan Xu, Lixing Chen, Panpan Zheng, Yu Cheng:
Not All Inputs Are Valid: Towards Open-Set Video Moment Retrieval using Language. 28-37
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiS0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiS0024
Huishan Ji, Qingyi Si, Zheng Lin, Weiping Wang:
Towards Flexible Evaluation for Generative Visual Question Answering. 38-47
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuCDOW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuCDOW24
Jiaqi Zhu, Shaofeng Cai, Fang Deng, Beng Chin Ooi, Junran Wu:
Do LLMs Understand Visual Anomalies? Uncovering LLM's Capabilities in Zero-shot Anomaly Detection. 48-57
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiHZS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiHZS024
Yudong Li, Xianxu Hou, Dezhi Zheng, Linlin Shen, Zhe Zhao:
FLIP-80M: 80 Million Visual-Linguistic Pairs for Facial Language-Image Pre-Training. 58-67

Oral Session 2: Large Language Models & Applications 2

- view
  authority control:
- export record
  dblp key:
  - conf/mm/HaasLHB0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HaasLHB0L24
Esmée Henrieke Anne de Haas, Lik-Hang Lee, Yiming Huang, Carlos Bermejo, Pan Hui, Zijun Lin:
Towards Trustworthy MetaShopping: Studying Manipulative Audiovisual Designs in Virtual-Physical Commercial Platforms. 68-77
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiZCCLZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiZCCLZZ24
Weiqi Li, Shijie Zhao, Bin Chen, Xinhua Cheng, Junlin Li, Li Zhang, Jian Zhang:
ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images. 78-87
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PeiZYTTT0L000S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PeiZYTTT0L000S24
Yunqiang Pei, Kaiyue Zhang, Hongrong Yang, Yong Tao, Qihang Tang, Jialei Tang, Guoqing Wang, Zhitao Liu, Ning Xie, Peng Wang, Yang Yang, Hengtao Shen:
Improving Interaction Comfort in Authoring Task in AR-HRI through Dynamic Dual-Layer Interaction Adjustment. 88-97
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuLCHLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuLCHLL24
Yang Lu, Junxian Li, Zhitong Cui, Jiapeng Hu, Yanna Lin, Shijian Luo:
Designing Spatial Visualization and Interactions of Immersive Sankey Diagram in Virtual Reality. 98-107
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WanTWZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WanTWZ024
Zhang Wan, Sheng Tang, Jiawei Wei, Ruize Zhang, Juan Cao:
DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships. 108-116
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShigyoCT0Q24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShigyoCT0Q24
Kento Shigyo, Yifan Cao, Kentaro Takahira, Mingming Fan, Huamin Qu:
VR-Mediated Cognitive Defusion: A Comparative Study for Managing Negative Thoughts. 117-126

Oral Session 3: Novel Multimedia Applications 1

- view
  authority control:
- export record
  dblp key:
  - conf/mm/Gui0CNJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Gui0CNJ24
Yinxuan Gui, Bin Zhu, Jingjing Chen, Chong Wah Ngo, Yu-Gang Jiang:
Navigating Weight Prediction with Diet Diary. 127-136
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0005X0WLZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0005X0WLZW24
Feiyu Chen, Cong Xu, Qi Jia, Yihua Wang, Yuhan Liu, Haotian Zhang, Endong Wang:
Egocentric Vehicle Dense Video Captioning. 137-146
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenKWLGZSH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenKWLGZSH024
Jinyue Chen, Lingyu Kong, Haoran Wei, Chenglong Liu, Zheng Ge, Liang Zhao, Jianjian Sun, Chunrui Han, Xiangyu Zhang:
OneChart: Purify the Chart Structural Extraction via One Auxiliary Token. 147-155
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinJGS00L024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinJGS00L024
Jiawei Lin, Zhaoyun Jiang, Jiaqi Guo, Shizhao Sun, Ting Liu, Zijiang Yang, Jian-Guang Lou, Dongmei Zhang:
IconDM: Text-Guided Icon Set Expansion Using Diffusion Models. 156-165
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouW0XMLW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouW0XMLW024
Haipeng Zhou, Hongqiu Wang, Tian Ye, Zhaohu Xing, Jun Ma, Ping Li, Qiong Wang, Lei Zhu:
Timeline and Boundary Guided Diffusion Network for Video Shadow Detection. 166-175
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QuLHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QuLHZ24
Yichang Qu, Bing Li, Jie Huang, Feng Zhao:
Training Pansharpening Networks at Full Resolution Using Degenerate Invariance. 176-185

Oral Session 4: Graph and Diffusion Models

- view
  authority control:
- export record
  dblp key:
  - conf/mm/Lu0CCW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Lu0CCW24
Jielong Lu, Zhihao Wu, Zhaoliang Chen, Zhiling Cai, Shiping Wang:
Towards Multi-view Consistent Graph Diffusion. 186-195
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaFQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaFQ24
Liyuan Ma, Xueji Fang, Guo-Jun Qi:
Equilibrated Diffusion: Frequency-aware Textual Embedding for Equilibrated Image Customization. 196-204
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FengYAHD0X24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FengYAHD0X24
Weilun Feng, Chuanguang Yang, Zhulin An, Libo Huang, Boyu Diao, Fei Wang, Yongjun Xu:
Relational Diffusion Distillation for Efficient Image Generation. 205-213
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuHZ0LLZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuHZ0LLZ0024
Hongjie Wu, Linchao He, Mingqin Zhang, Dongdong Chen, Kunming Luo, Mengting Luo, Jizhe Zhou, Hu Chen, Jiancheng Lv:
Diffusion Posterior Proximal Sampling for Image Restoration. 214-223
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangYLWX00P24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangYLWX00P24
Yiheng Huang, Hui Yang, Chuanchen Luo, Yuxi Wang, Shibiao Xu, Zhaoxiang Zhang, Man Zhang, Junran Peng:
StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework. 224-232
- view
  authority control:
- export record
  dblp key:
  - conf/mm/00090GX0C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/00090GX0C24
Yichi Zhang, Zhuo Chen, Lingbing Guo, Yajing Xu, Wen Zhang, Huajun Chen:
Making Large Language Models Perform Better in Knowledge Graph Completion. 233-242

Oral Session 5: Multimodal Models and Applications

- view
  authority control:
- export record
  dblp key:
  - conf/mm/DevanathanSP024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DevanathanSP024
Rishikesh Devanathan, Apoorva Singh, A. S. Poornash, Sriparna Saha:
Seeing Beyond Words: Multimodal Aspect-Level Complaint Detection in Ecommerce Videos. 243-252
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HungDLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HungDLH24
Hsiang-Hui Hung, Huu-Phu Do, Yung-Hui Li, Ching-Chun Huang:
TimeNeRF: Building Generalizable Neural Radiance Fields across Time from Few-Shot Input Views. 253-262
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShenYLLWYS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShenYLLWYS24
Xiaoxuan Shen, Fenghua Yu, Yaqi Liu, Ruxia Liang, Qian Wan, Kai Yang, Jianwen Sun:
Revisiting Knowledge Tracing: A Simple and Powerful Model. 263-272
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiWLL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiWLL024
Peiming Li, Ziyi Wang, Mengyuan Liu, Hong Liu, Chen Chen:
ClickDiff: Click to Induce Semantic Contact Map for Controllable Grasp Generation with Diffusion Models. 273-281
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWGLZWG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWGLZWG24
Bochao Liu, Pengju Wang, Weijia Guo, Yong Li, Liansheng Zhuang, Weiping Wang, Shiming Ge:
Private Gradient Estimation is Useful for Generative Modeling. 282-290
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuZG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuZG024
Ke Zhu, Liang Zhao, Zheng Ge, Xiangyu Zhang:
Self-Supervised Visual Preference Alignment. 291-300

Oral Session 6: Innovations in Medical Imaging and Physiological Measurement

- view
  authority control:
- export record
  dblp key:
  - conf/mm/Hong0ZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Hong0ZZ24
Yuxin Hong, Xiao Zhang, Xin Zhang, Joey Tianyi Zhou:
Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification. 301-310
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangHZLZ0ZC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangHZLZ0ZC024
Ruiqi Wang, Jinyang Huang, Jie Zhang, Xin Liu, Xiang Zhang, Zhi Liu, Peng Zhao, Sigui Chen, Xiao Sun:
FacialPulse: An Efficient RNN-based Depression Detection via Temporal Facial Landmarks. 311-320
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZCL24
Wei Zhang, En Zhu, Juan Chen, Yunpeng Li:
MDDR: Multi-modal Dual-Attention aggregation for Depression Recognition. 321-329
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Qian0GHW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Qian0GHW24
Wei Qian, Kun Li, Dan Guo, Bin Hu, Meng Wang:
Cluster-Phys: Facial Clues Clustering Towards Efficient Remote Physiological Measurement. 330-339
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SongQRLG0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SongQRLG0Z24
Zhenxi Song, Ruihan Qin, Huixia Ren, Zhen Liang, Yi Guo, Min Zhang, Zhiguo Zhang:
EEG-MACS: Manifold Attention and Confidence Stratification for EEG-based Cross-Center Brain Disease Diagnosis under Unreliable Annotations. 340-349
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xu0LW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xu0LW24
Xueyuan Xu, Li Zhuo, Jinxin Lu, Xia Wu:
WSEL: EEG Feature Selection with Weighted Self-expression Learning for Incomplete Multi-dimensional Emotion Recognition. 350-359

Oral Session 7: Imaging, Computer Vision & Graphics

- view
  authority control:
- export record
  dblp key:
  - conf/mm/WenGC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WenGC24
Yuanbo Wen, Tao Gao, Ting Chen:
Unpaired Photo-realistic Image Deraining with Energy-informed Diffusion Model. 360-369
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiGLWLZ0YZZP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiGLWLZ0YZZP24
Zeyu Li, Ruitong Gan, Chuanchen Luo, Yuxi Wang, Jiaheng Liu, Ziwei Zhu, Qing Li, Xucheng Yin, Man Zhang, Zhaoxiang Zhang, Junran Peng:
MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets. 370-379
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanRCSWXM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanRCSWXM24
Xiao Han, Yiming Ren, Peishan Cong, Yujing Sun, Jingya Wang, Lan Xu, Yuexin Ma:
Gait Recognition in Large-scale Free Environment via Single LiDAR. 380-389
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TaoGWLCZHLSY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TaoGWLCZHLSY24
Tang Tao, Longfei Gao, Guangrun Wang, Yixing Lao, Peng Chen, Hengshuang Zhao, Dayang Hao, Xiaodan Liang, Mathieu Salzmann, Kaicheng Yu:
LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields. 390-398
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenZY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenZY24
Mu Chen, Zhedong Zheng, Yi Yang:
Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation. 399-408
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MoWZHHHWY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MoWZHHHWY24
Yujian Mo, Yan Wu, Junqiao Zhao, Zhenjie Hou, Weiquan Huang, Yinghao Hu, Jijun Wang, Jun Yan:
Sparse Query Dense: Enhancing 3D Object Detection with Pseudo Points. 409-418

Oral Session 8: Multimodal Reasoning & Inference

- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengLZWC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengLZWC024
Changmeng Zheng, Dayong Liang, Wengyu Zhang, Xiaoyong Wei, Tat-Seng Chua, Qing Li:
A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning. 419-428
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoLQC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoLQC024
Qian Guo, Xinyan Liang, Yuhua Qian, Zhihua Cui, Jie Wen:
A Progressive Skip Reasoning Fusion Method for Multi-Modal Classification. 429-437
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuJL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuJL24
Wenxin Xu, Hexin Jiang, Xuefeng Liang:
Leveraging Knowledge of Modality Experts for Incomplete Multimodal Learning. 438-446
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0008ZHSLZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0008ZHSLZ024
Bo Xu, Junzhe Zheng, Jiayuan He, Yuxuan Sun, Hongfei Lin, Liang Zhao, Feng Xia:
Generating Multimodal Metaphorical Features for Meme Understanding. 447-455
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShiSS0YY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShiSS0YY24
Junjie Shi, Caozhi Shang, Zhaobin Sun, Li Yu, Xin Yang, Zengqiang Yan:
PASSION: Towards Effective Incomplete Multi-Modal Medical Image Segmentation with Imbalanced Missing Rates. 456-465
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001HXLWZMZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001HXLWZMZC24
Mengze Li, Kairong Han, Jiahe Xu, Yueying Li, Tao Wu, Zhou Zhao, Jiaxu Miao, Shengyu Zhang, Jingyuan Chen:
Cross-modal Observation Hypothesis Inference. 466-475

Oral Session 9: Image, Video, and Multimedia Processing

- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiCWMH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiCWMH24
Jiyang Li, Lechao Cheng, Zhangye Wang, Tingting Mu, Jingxuan He:
LoopGaussian: Creating 3D Cinemagraph with Multi-view Images via Eulerian Motion Field. 476-485
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenY0LZWSYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenY0LZWSYL24
Chaofeng Chen, Sensen Yang, Haoning Wu, Liang Liao, Zicheng Zhang, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin:
Q-Ground: Image Quality Grounding with Large Multi-modality Models. 486-495
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YeCL0M24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YeCL0M24
Cheng Ye, Weidong Chen, Jingyu Li, Lei Zhang, Zhendong Mao:
Dual-path Collaborative Generation Network for Emotional Video Captioning. 496-505
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinLFXYY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinLFXYY024
Hu Lin, Chengjiang Long, Yifeng Fei, Qianchen Xia, Erwei Yin, Baocai Yin, Xin Yang:
Exploring Matching Rates: From Keypoint Selection to Camera Relocalization. 506-514
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuCCCZ00X24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuCCCZ00X24
Zhihong Zhu, Xuxin Cheng, Zhaorun Chen, Yuyan Chen, Yunyan Zhang, Xian Wu, Yefeng Zheng, Bowen Xing:
InMu-Net: Advancing Multi-modal Intent Detection via Information Bottleneck and Multi-sensory Processing. 515-524
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangJDYXYZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangJDYXYZZ24
Chaoya Jiang, Hongrui Jia, Mengfan Dong, Wei Ye, Haiyang Xu, Ming Yan, Ji Zhang, Shikun Zhang:
Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models. 525-534

Oral Session 10: Speech and Audio in Multimedia Processing

- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangWLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangWLH24
Zhongxu Wang, Yujia Wang, Mingzhu Li, Hua Huang:
ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations. 535-544
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuHCY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuHCY24
Shuai Yu, Xiaoliang He, Ke Chen, Yi Yu:
HKDSME: Heterogeneous Knowledge Distillation for Semi-supervised Singing Melody Extraction Using Harmonic Supervision. 545-553
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0002QJZLZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0002QJZLZ0024
Yixuan Zhou, Xiaoyu Qin, Zeyu Jin, Shuoyi Zhou, Shun Lei, Songtao Zhou, Zhiyong Wu, Jia Jia:
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling. 554-563
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MajumderHGHMP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MajumderHGHMP24
Navonil Majumder, Chia-Yu Hung, Deepanway Ghosal, Wei-Ning Hsu, Rada Mihalcea, Soujanya Poria:
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization. 564-572
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0WS0CXS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0WS0CXS24
Xihua Wang, Yuyue Wang, Yihan Wu, Ruihua Song, Xu Tan, Zehua Chen, Hongteng Xu, Guodong Sui:
TiVA: Time-Aligned Video-to-Audio Generation. 573-582
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Galan-CuencaVMH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Galan-CuencaVMH24
Alejandro Galán-Cuenca, Jose J. Valero-Mas, Juan C. Martinez-Sevilla, Antonio Hidalgo-Centeno, Antonio Pertusa, Jorge Calvo-Zaragoza:
MUSCAT: A Multimodal mUSic Collection for Automatic Transcription of Real Recordings and Image Scores. 583-591

Oral Session 11: Emotion & Sentiment

- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoWJLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoWJLZ24
Jianing Zhao, Jingjing Wang, Yujie Jin, Jiamin Luo, Guodong Zhou:
Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanced Video Large Language Model. 592-601
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuY0M24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuY0M24
Daiqing Wu, Dongbao Yang, Yu Zhou, Can Ma:
Bridging Visual Affective Gap: Borrowing Textual Knowledge by Learning from Noisy Image-Text Pairs. 602-611
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuWWLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuWWLZ24
Tan Yu, Jingjing Wang, Jiawen Wang, Jiamin Luo, Guodong Zhou:
Towards Emotion-enriched Text-to-Motion Generation via LLM-guided Limb-level Emotion Manipulating. 612-621
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengYX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengYX24
Wenjie Zheng, Jianfei Yu, Rui Xia:
A Unimodal Valence-Arousal Driven Contrastive Learning Framework for Multimodal Multi-Label Emotion Recognition. 622-631
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaiLWTWYTYWZ0GZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaiLWTWYTYWZ0GZ24
Xinji Mai, Junxiong Lin, Haoran Wang, Zeng Tao, Yan Wang, Shaoqi Yan, Xuan Tong, Jiawen Yu, Boyang Wang, Ziheng Zhou, Qing Zhao, Shuyong Gao, Wenqiang Zhang:
All rivers run into the sea: Unified Modality Brain-Inspired Emotional Central Mechanism. 632-641
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiWH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiWH24
Xin Li, Shangfei Wang, Xuandong Huang:
Temporal Enhancement for Video Affective Content Analysis. 642-650

Poster Session 1

- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeJ0000YS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeJ0000YS24
Pei He, Licheng Jiao, Lingling Li, Xu Liu, Fang Liu, Wenping Ma, Shuyuan Yang, Ronghua Shang:
Domain Generalization-Aware Uncertainty Introspective Learning for 3D Point Clouds Segmentation. 651-660
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaDHZZRS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaDHZZRS24
Yi Ma, Peiqi Duan, Yuchen Hong, Chu Zhou, Yu Zhang, Jimmy S. J. Ren, Boxin Shi:
Color4E: Event Demosaicing for Full-color Event Guided Image Deblurring. 661-670
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuDZPXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuDZPXL24
Jiajie Zhu, Xia Du, Jizhe Zhou, Chi-Man Pun, Qizhen Xu, Xiaoyuan Liu:
DP-RAE: A Dual-Phase Merging Reversible Adversarial Example for Image Privacy Protection. 671-680
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangCBYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangCBYL24
Xinyi Zhang, Qinpeng Cui, Qiqi Bao, Wenming Yang, Qingmin Liao:
Geometry-Guided Diffusion Model with Masked Transformer for Robust Multi-View 3D Human Pose Estimation. 681-690
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Cao0SDYX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Cao0SDYX24
Meiqi Cao, Rui Yan, Xiangbo Shu, Guangzhao Dai, Yazhou Yao, Guo-Sen Xie:
AdaFPP: Adapt-Focused Bi-Propagating Prototype Learning for Panoramic Activity Recognition. 691-700
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangG024
Junsheng Wang, Tiantian Gong, Yan Yan:
Partially Aligned Cross-modal Retrieval via Optimal Transport-based Prototype Alignment Learning. 701-709
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoMZYYD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoMZYYD24
Hu Gao, Bowen Ma, Ying Zhang, Jingfan Yang, Jing Yang, Depeng Dang:
Learning Enriched Features via Selective State Spaces Model for Efficient Image Deblurring. 710-718
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChePO024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChePO024
Hangjun Che, Xinyu Pu, Deqiang Ouyang, Beibei Li:
Enhanced Tensorial Self-representation Subspace Learning for Incomplete Multi-view Clustering. 719-728
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QiaoD0S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QiaoD0S24
Jian-Jun Qiao, Meng-Yu Duan, Xiao Wu, Yu-Pei Song:
CartoonNet: Cartoon Parsing with Semantic Consistency and Structure Correlation. 729-737
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoRW0GZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoRW0GZ24
Qianyu Guo, Jieji Ren, Haofen Wang, Tianxing Wu, Weifeng Ge, Wenqiang Zhang:
Visual-Language Collaborative Representation Network for Broad-Domain Few-Shot Image Classification. 738-747
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xu0GWCJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xu0GWCJ24
Wenzhuo Xu, Kai Chen, Ziyi Gao, Zhipeng Wei, Jingjing Chen, Yu-Gang Jiang:
Highly Transferable Diffusion-based Unrestricted Adversarial Attack on Pre-trained Vision-Language Models. 748-757
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLZGG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLZGG24
Hongzhi Wang, Xiubo Liang, Tao Zhang, Yue Gu, Weidong Geng:
PSSD-Transformer: Powerful Sparse Spike-Driven Transformer for Image Semantic Segmentation. 758-767
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KuangDY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KuangDY24
Zengsheng Kuang, Changxing Ding, Huan Yao:
Learning Context with Priors for 3D Interacting Hand-Object Pose Estimation. 768-777
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenGHL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenGHL024
Yang Chen, Jingcai Guo, Tian He, Xiaocheng Lu, Ling Wang:
Fine-Grained Side Information Guided Dual-Prompts for Zero-Shot Skeleton Action Recognition. 778-786
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZM024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZM024
Shuo Zhang, Yupeng Zhai, Jilin Mei, Yu Hu:
FusionOcc: Multi-Modal Fusion for 3D Occupancy Prediction. 787-796
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangYHG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangYHG24
Shaokun Wang, Yifan Yu, Yuhang He, Yihong Gong:
Enhancing Pre-trained ViTs for Downstream Task Adaptation: A Locality-Aware Prompt Learning Method. 797-806
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CuiYWXT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CuiYWXT24
Fangming Cui, Xun Yang, Chao Wu, Liang Xiao, Xinmei Tian:
Advancing Prompt Learning through an External Layer. 807-816
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangRDRJCF024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangRDRJCF024
Hanzi Wang, Jiamin Ren, Yifeng Ding, Lei Ren, Huixing Jiang, Wei Chen, Fangxiang Feng, Xiaojie Wang:
Q-MoE: Connector for MLLMs with Text-Driven Routing. 817-825
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PengWZZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PengWZZL24
Guozhen Peng, Yunhong Wang, Yuwei Zhao, Shaoxiong Zhang, Annan Li:
GLGait: A Global-Local Temporal Receptive Field Network for Gait Recognition in the Wild. 826-835
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0LRZR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0LRZR24
Qiang Wang, Yuning Cui, Yawen Li, Yaping Ruan, Ben Zhu, Wenqi Ren:
RFFNet: Towards Robust and Flexible Fusion for Low-Light Image Denoising. 836-845
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoCPYDZ0TZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoCPYDZ0TZC24
Minghe Gao, Shuang Chen, Liang Pang, Yuan Yao, Jisheng Dang, Wenqiao Zhang, Juncheng Li, Siliang Tang, Yueting Zhuang, Tat-Seng Chua:
Fact : Teaching MLLMs with Faithful, Concise and Transferable Rationales. 846-855
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0004K24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0004K24
Yue Zhang, Parisa Kordjamshidi:
Narrowing the Gap between Vision and Action in Navigation. 856-865
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZengS0WSXW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZengS0WSXW024
Zequn Zeng, Jianqiao Sun, Hao Zhang, Tiansheng Wen, Yudi Su, Yan Xie, Zhengjue Wang, Bo Chen:
HICEScore: A Hierarchical Metric for Image Captioning Evaluation. 866-875
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FengTP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FengTP24
Chen Feng, Georgios Tzimiropoulos, Ioannis Patras:
CLIPCleaner: Cleaning Noisy Labels with CLIP. 876-885
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoMYXWL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoMYXWL024
Haochen Zhao, Hui Meng, Deqian Yang, Xiaozheng Xie, Xiaoze Wu, Qingfeng Li, Jianwei Niu:
GuidedNet: Semi-Supervised Multi-Organ Segmentation via Labeled Data Guide Unlabeled Data. 886-895
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Chan0G024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Chan0G024
Kin-Chung Chan, Jun Xiao, Hana Lebeta Goshu, Kin-Man Lam:
Point Cloud Densification for 3D Gaussian Splatting from Sparse Input Views. 896-904
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangLZTZSJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangLZTZSJ24
Xiaorui Huang, Gen Luo, Chaoyang Zhu, Bo Tong, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
Deep Instruction Tuning for Segment Anything Model. 905-914
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangRJWZX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangRJWZX24
Ziyi Wang, Yiming Rong, Deyang Jiang, Haoran Wu, Shiyu Zhou, Bo Xu:
CIEASR: Contextual Image-Enhanced Automatic Speech Recognition for Improved Homophone Discrimination. 915-924
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangYZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangYZ24
Jinxu Zhang, Yongqi Yu, Yu Zhang:
CREAM: Coarse-to-Fine Retrieval and Multi-modal Efficient Tuning for Document VQA. 925-934
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0YXF024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0YXF024
Hebaixu Wang, Hao Zhang, Xunpeng Yi, Xinyu Xiang, Leyuan Fang, Jiayi Ma:
TeRF: Text-driven and Region-aware Flexible Visible and Infrared Image Fusion. 935-944
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangSWYCCA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangSWYCCA24
Ruonan Zhang, Ziwei Shang, Fengjuan Wang, Zhaoqilin Yang, Shan Cao, Yigang Cen, Gaoyun An:
Synergetic Prototype Learning Network for Unbiased Scene Graph Generation. 945-954
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuLZLJ0C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuLZLJ0C24
Jiawei Zhu, Yishu Liu, Huanjia Zhu, Hui Lin, Yuncheng Jiang, Zheng Zhang, Bingzhi Chen:
Combating Visual Question Answering Hallucinations via Robust Multi-Space Co-Debias Learning. 955-964
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CaoCSWHR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CaoCSWHR24
Qian Cao, Xu Chen, Ruihua Song, Xiting Wang, Xinting Huang, Yuchen Ren:
See or Guess: Counterfactually Regularized Image Captioning. 965-974
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiQ0X24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiQ0X24
Shuai Li, Fan Qi, Zixin Zhang, Changsheng Xu:
Cross-Modal Meta Consensus for Heterogeneous Federated Learning. 975-984
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeLLZSKY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeLLZSKY024
Xiang He, Xiangxi Liu, Yang Li, Dongcheng Zhao, Guobin Shen, Qingqun Kong, Xin Yang, Yi Zeng:
CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization. 985-993
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoLLHZZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoLLHZZ0024
Jiabao Guo, Huan Liu, Yizhi Luo, Xueli Hu, Hang Zou, Yuan Zhang, Hui Liu, Bo Zhao:
Style-conditional Prompt Token Learning for Generalizable Face Anti-spoofing. 994-1003
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenKD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenKD24
Bowen Chen, Yun Sing Koh, Gillian Dobbie:
SSAT-Adapter: Enhancing Vision-Language Model Few-shot Learning with Auxiliary Tasks. 1004-1013
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Tong0J0W024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Tong0J0W024
Haoyu Tong, Xiaoyu Zhang, Yulin Jin, Jian Lou, Kai Wu, Xiaofeng Chen:
Balancing Generalization and Robustness in Adversarial Training via Steering through Clean and Adversarial Gradient Directions. 1014-1023
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengD0HZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengD0HZL24
Shuo Zheng, Yuanjie Dang, Peng Chen, Ruohong Huan, Dongdong Zhao, Ronghua Liang:
Saliency-Guided Fine-Grained Temporal Mask Learning for Few-Shot Action Recognition. 1024-1033
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Liu0RY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Liu0RY24
Mengyin Liu, Chao Zhu, Shiqi Ren, Xu-Cheng Yin:
Unsupervised Multi-view Pedestrian Detection. 1034-1042
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangY0QZZZ0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangY0QZZZ0Y24
Zhilin Huang, Yijie Yu, Ling Yang, Chujun Qin, Bing Zheng, Xiawu Zheng, Zikun Zhou, Yaowei Wang, Wenming Yang:
Motion-aware Latent Diffusion Models for Video Frame Interpolation. 1043-1052
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YeLLQD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YeLLQD24
Zongxin Ye, Wenyu Li, Sidun Liu, Peng Qiao, Yong Dou:
AbsGS: Recovering Fine Details in 3D Gaussian Splatting. 1053-1061
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZMWD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZMWD024
Ziming Wang, Boxiang Zhang, Ming Ma, Yue Wang, Taoli Du, Wenhui Li:
Multi-fineness Boundaries and the Shifted Ensemble-aware Encoding for Point Cloud Semantic Segmentation. 1062-1071
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLQCJX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLQCJX24
Yubo Wang, Chaohu Liu, Yanqiu Qu, Haoyu Cao, Deqiang Jiang, Linli Xu:
Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models. 1072-1081
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiWZY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiWZY24
Wenhao Li, Qiangchang Wang, Peng Zhao, Yilong Yin:
KNN Transformer with Pyramid Prompts for Few-Shot Learning. 1082-1091
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangYD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangYD24
Lu Zhang, Ke Yan, Shouhong Ding:
AlignCLIP: Align Multi Domains of Texts Input for CLIP models with Object-IoU Loss. 1092-1100
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YueLZHLNDZJCJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YueLZHLNDZJCJ24
Pengfei Yue, Jianghang Lin, Shengchuan Zhang, Jie Hu, Yilin Lu, Hongwei Niu, Haixin Ding, Yan Zhang, Guannan Jiang, Liujuan Cao, Rongrong Ji:
Adaptive Selection based Referring Image Segmentation. 1101-1110
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0008AYXT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0008AYXT024
Shanshan Wang, ALuSi, Xun Yang, Ke Xu, Huibin Tan, Xingyi Zhang:
Dual-stream Feature Augmentation for Domain Generalization. 1111-1119
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuHQLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuHQLW24
Yang Liu, Xiang Huang, Minghan Qin, Qinwei Lin, Haoqian Wang:
Animatable 3D Gaussian: Fast and High-Quality Reconstruction of Multiple Human Avatars. 1120-1129
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0010W00G24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0010W00G24
Wei Feng, Dongyuan Wei, Qianqian Wang, Bo Dong, Quanxue Gao:
Multi-View Clustering Based on Deep Non-negative Tensor Factorization. 1130-1138
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiHWCH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiHWCH24
Aoqi Li, Saihui Hou, Chenye Wang, Qingyuan Cai, Yongzhen Huang:
AerialGait: Bridging Aerial and Ground Views for Gait Recognition. 1139-1147
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZL024
Zefan Zhang, Weiqi Zhang, Yanhui Li, Tian Bai:
Caption-Aware Multimodal Relation Extraction with Mutual Information Maximization. 1148-1157
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Li0XCSDT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Li0XCSDT24
Xiaochen Li, Jian Cheng, Ziying Xia, Zichong Chen, Junhao Shi, Zhicheng Dong, Nyima Tashi:
TS-ILM: Class Incremental Learning for Online Action Detection. 1158-1167
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CaiSY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CaiSY24
Yuxiang Cai, Yongheng Shang, Jianwei Yin:
MultiDAN: Unsupervised, Multistage, Multisource and Multitarget Domain Adaptation for Semantic Segmentation of Remote Sensing Images. 1168-1177
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TongL0LS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TongL0LS24
Yu Tong, Weihai Lu, Zhe Zhao, Song Lai, Tong Shi:
MMDFND: Multi-modal Multi-Domain Fake News Detection. 1178-1186
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengZCP024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengZCP024
Minghang Zheng, Jiahua Zhang, Qingchao Chen, Yuxin Peng, Yang Liu:
ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding. 1187-1196
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Jia0FZZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Jia0FZZL24
Shilong Jia, Tingting Wu, Yingying Fang, Tieyong Zeng, Guixu Zhang, Zhi Li:
Purified Distillation: Bridging Domain Shift and Category Gap in Incremental Object Detection. 1197-1205
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZGSS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZGSS24
Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen:
MPT: Multi-grained Prompt Tuning for Text-Video Retrieval. 1206-1214
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengZWS0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengZWS0024
Ziwei Zheng, Zechuan Zhang, Yulin Wang, Shiji Song, Gao Huang, Le Yang:
Rethinking the Architecture Design for Efficient Generic Event Boundary Detection. 1215-1224
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiZJHGCGZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiZJHGCGZ24
Jinglun Li, Xinyu Zhou, Kaixun Jiang, Lingyi Hong, Pinxue Guo, Zhaoyu Chen, Weifeng Ge, Wenqiang Zhang:
TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning. 1225-1234
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CaoWDZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CaoWDZ24
Zihan Cao, Xiao Wu, Liang-Jian Deng, Yu Zhong:
A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion. 1235-1244
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangQXW0DX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangQXW0DX24
Zhenyu Yang, Shengsheng Qian, Dizhan Xue, Jiahong Wu, Fan Yang, Weiming Dong, Changsheng Xu:
Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval. 1245-1254
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Jin0W0ZZQ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Jin0W0ZZQ024
Zeyu Jin, Jia Jia, Qixin Wang, Kehan Li, Shuoyi Zhou, Songtao Zhou, Xiaoyu Qin, Zhiyong Wu:
SpeechCraft: A Fine-Grained Expressive Speech Dataset with Natural Language Description. 1255-1264
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuCDW00LSA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuCDW00LSA24
Lihao Liu, Yanqi Cheng, Zhongying Deng, Shujun Wang, Dongdong Chen, Xiaowei Hu, Pietro Liò, Carola-Bibiane Schönlieb, Angelica I. Avilés-Rivero:
TrafficMOT: A Challenging Dataset for Multi-Object Tracking in Complex Traffic Scenarios. 1265-1273
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangJGYY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangJGYY24
Jing Yang, Xiaowen Jiang, Yuan Gao, Laurence T. Yang, Jieming Yang:
Generalize to Fully Unseen Graphs: Learn Transferable Hyper-Relation Structures for Inductive Link Prediction. 1274-1282
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Liu0WZX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Liu0WZX24
Panjun Liu, Jiacheng Li, Lizhi Wang, Zheng-Jun Zha, Zhiwei Xiong:
MLP Embedded Inverse Tone Mapping. 1283-1291
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinLHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinLHL24
Mingkai Lin, Wenzhong Li, Xiaobin Hong, Sanglu Lu:
Scalable Multi-Source Pre-training for Graph Neural Networks. 1292-1301
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhao0X0JLL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhao0X0JLL024
Xiaole Zhao, Linze Li, Chengxing Xie, Xiaoming Zhang, Ting Jiang, Wenjie Lin, Shuaicheng Liu, Tianrui Li:
Efficient Single Image Super-Resolution with Entropy Attention and Receptive Field Augmentation. 1302-1310
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KimYPRR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KimYPRR24
Minsu Kim, Jeong Hun Yeo, Se Jin Park, Hyeongseop Rha, Yong Man Ro:
Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation. 1311-1320
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuoSWSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuoSWSZ24
Shoutong Luo, Zhengxing Sun, Yi Wang, Yunhan Sun, Chendi Zhu:
LDCNet: Long-Distance Context Modeling for Large-Scale 3D Point Cloud Scene Semantic Segmentation. 1321-1330
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Cui0Z0WWJW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Cui0Z0WWJW24
Yiming Cui, Liang Li, Jiehua Zhang, Chenggang Yan, Hongkui Wang, Shuai Wang, Heng Jin, Li Wu:
Stochastic Context Consistency Reasoning for Domain Adaptive Object Detection. 1331-1340
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Li0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Li0L24
Zhuoling Li, Yong Wang, Kaitong Li:
FewVS: A Vision-Semantics Integration Framework for Few-Shot Image Classification. 1341-1350
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Bu000W024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Bu000W024
Yuyan Bu, Qiang Sheng, Juan Cao, Peng Qi, Danding Wang, Jintao Li:
FakingRecipe: Detecting Fake News on Short Video Platforms from the Perspective of Creative Process. 1351-1360
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KhanalXSDXAJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KhanalXSDXAJ24
Subash Khanal, Eric Xing, Srikumar Sastry, Aayush Dhakal, Zhexiao Xiong, Adeel Ahmad, Nathan Jacobs:
PSM: Learning Probabilistic Embeddings for Multi-scale Zero-Shot Soundscape Mapping. 1361-1369
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuLCYGW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuLCYGW24
Zizhao Wu, Haohan Li, Gongyi Chen, Zhou Yu, Xiaoling Gu, Yigang Wang:
3D Question Answering with Scene Graph Reasoning. 1370-1378
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0009WWZD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0009WWZD024
Liang He, Hongke Wang, Zhen Wu, Jianbing Zhang, Xinyu Dai, Jiajun Chen:
Focus & Gating: A Multimodal Approach for Unveiling Relations in Noisy Social Media. 1379-1388
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuLLYZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuLLYZZ24
Yuanchen Wu, Xiaoqiang Li, Jide Li, Kequan Yang, Pinpin Zhu, Shaohua Zhang:
DINO is Also a Semantic Guider: Exploiting Class-aware Affinity for Weakly Supervised Semantic Segmentation. 1389-1397
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinHLF024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinHLF024
Dongshuo Yin, Xueting Han, Bin Li, Hao Feng, Jing Bai:
Parameter-efficient is not Sufficient: Exploring Parameter, Memory, and Time Efficient Adapter Tuning for Dense Predictions. 1398-1406
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiHDC0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiHDC0Z24
Rongwen Li, Haiyang Hu, Liang Du, Jiarong Chen, Bingbing Jiang, Peng Zhou:
One-Stage Fair Multi-View Spectral Clustering. 1407-1416
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TanPZ0ZKDLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TanPZ0ZKDLL24
Jingfan Tan, Hyunhee Park, Ying Zhang, Tao Wang, Kaihao Zhang, Xiangyu Kong, Pengwen Dai, Zikun Liu, Wenhan Luo:
Blind Face Video Restoration with Temporal Consistent Generative Prior and Degradation-Aware Prompt. 1417-1426
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Sun0SZR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Sun0SZR24
Yinghui Sun, Xingfeng Li, Quansen Sun, Min-Ling Zhang, Zhenwen Ren:
Improved Weighted Tensor Schatten p-Norm for Fast Multi-view Graph Clustering. 1427-1436
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangZXLZZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangZXLZZH24
Xinjie Jiang, Chenxi Zheng, Xuemiao Xu, Bangzhen Liu, Weiying Zheng, Huaidong Zhang, Shengfeng He:
VrdONE: One-stage Video Visual Relation Detection. 1437-1446
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaTZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaTZ024
Chenxi Ma, Weimin Tan, Shili Zhou, Bo Yan:
Learning Cross-Spectral Prior for Image Super-Resolution. 1447-1455
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuLWZWHZT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuLWZWHZT024
Dayu Hu, Suyuan Liu, Jun Wang, Junpu Zhang, Siwei Wang, Xingchen Hu, Xinzhong Zhu, Chang Tang, Xinwang Liu:
Reliable Attribute-missing Multi-view Clustering with Instance-level and feature-level Cooperative Imputation. 1456-1466
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TranKL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TranKL24
Duc Dang Trung Tran, Byeongkeun Kang, Yeejin Lee:
MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation. 1467-1475
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuGLSY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuGLSY024
Jingjing Hu, Dan Guo, Kun Li, Zhan Si, Xun Yang, Meng Wang:
Maskable Retentive Network for Video Moment Retrieval. 1476-1485
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HouCZLCLCHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HouCZLCLCHZ24
Junming Hou, Zihan Cao, Naishan Zheng, Xuan Li, Xiaoyu Chen, Xinyang Liu, Xiaofeng Cong, Danfeng Hong, Man Zhou:
Linearly-evolved Transformer for Pan-sharpening. 1486-1494
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangLODZHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangLODZHL24
Zhenhao Yang, Xin Liu, Deqiang Ouyang, Guiduo Duan, Dongyang Zhang, Tao He, Yuan-Fang Li:
Towards Open-vocabulary HOI Detection with Calibrated Vision-language Models and Locality-aware Queries. 1495-1504
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZengSLLCW0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZengSLLCW0024
Kang Zeng, Hao Shi, Jiacheng Lin, Siyu Li, Jintao Cheng, Kaiwei Wang, Zhiyong Li, Kailun Yang:
MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model. 1505-1513
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TangLYWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TangLYWL24
Tao Tang, Hong Liu, Yingxuan You, Ti Wang, Wenhao Li:
ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos. 1514-1523
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuJH0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuJH0Z24
Xudong Lu, Yuqi Jiang, Haiwen Hong, Qi Sun, Cheng Zhuo:
DCAFuse: Dual-Branch Diffusion-CNN Complementary Feature Aggregation Network for Multi-Modality Image Fusion. 1524-1533
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZouGYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZouGYL24
Wenbin Zou, Hongxia Gao, Weipeng Yang, Tongtong Liu:
Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement. 1534-1543
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0003XJWSH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0003XJWSH24
Junwei He, Qianqian Xu, Yangbangyan Jiang, Zitai Wang, Yuchen Sun, Qingming Huang:
HGOE: Hybrid External and Internal Graph Outlier Exposure for Graph Out-of-Distribution Detection. 1544-1553
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0006M00WLTW0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0006M00WLTW0024
Ke Liang, Lingyuan Meng, Yue Liu, Meng Liu, Wei Wei, Suyuan Liu, Wenxuan Tu, Siwei Wang, Sihang Zhou, Xinwang Liu:
Simple Yet Effective: Structure Guided Pre-trained Transformer for Multi-modal Knowledge Graph Reasoning. 1554-1563
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DingZLZCDDS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DingZLZCDDS24
Yuning Ding, Sifan Zhang, Shenglan Liu, Jinrong Zhang, Wenyue Chen, Haifei Duan, Bingcheng Dong, Tao Sun:
2M-AF: A Strong Multi-Modality Framework For Human Action Quality Assessment with Self-supervised Representation Learning. 1564-1572
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenHLZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenHLZ024
Liqiu Chen, Yuqing Huang, Hengyu Li, Zikun Zhou, Zhenyu He:
Simplifying Cross-modal Interaction via Modality-Shared Features for RGBT Tracking. 1573-1582
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CuiHSDZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CuiHSDZW24
Can Cui, Siteng Huang, Wenxuan Song, Pengxiang Ding, Min Zhang, Donglin Wang:
ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification. 1583-1592
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wei0H024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wei0H024
Tianqi Wei, Zhi Chen, Zi Huang, Xin Yu:
Benchmarking In-the-Wild Multimodal Disease Recognition and A Versatile Baseline. 1593-1601
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Lei0WX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Lei0WX024
Jiaming Lei, Lin Li, Chunping Wang, Jun Xiao, Long Chen:
Seeing Beyond Classes: Zero-Shot Grounded Situation Recognition via Language Explainer. 1602-1611
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wen24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wen24
Jinyong Wen:
Gaussian Mutual Information Maximization for Efficient Graph Self-Supervised Learning: Bridging Contrastive-based to Decorrelation-based. 1612-1621
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KuangMYG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KuangMYG024
Haowei Kuang, Yiyang Ma, Wenhan Yang, Zongming Guo, Jiaying Liu:
Consistency Guided Diffusion Model with Neural Syntax for Perceptual Image Compression. 1622-1631
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FengZN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FengZN24
Zhangchi Feng, Richong Zhang, Zhijie Nie:
Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives. 1632-1641
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DingLCC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DingLCC24
Guanchen Ding, Lingbo Liu, Zhenzhong Chen, Changwen Chen:
Domain-Agnostic Crowd Counting via Uncertainty-Guided Style Diversity Augmentation. 1642-1651
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FanZZX0LYSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FanZZX0LYSL24
Cunhang Fan, Jingjing Zhang, Hongyu Zhang, Wang Xiang, Jianhua Tao, Xinhui Li, Jiangyan Yi, Dianbo Sui, Zhao Lv:
MSFNet: Multi-Scale Fusion Network for Brain-Controlled Speaker Extraction. 1652-1661
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiMZWPH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiMZWPH24
Zhong Ji, Changxu Meng, Yan Zhang, Haoran Wang, Yanwei Pang, Jungong Han:
Eliminate Before Align: A Remote Sensing Image-Text Retrieval Framework with Keyword Explicit Reasoning. 1662-1671
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangLLWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangLLWL24
Jinyan Zhang, Mengyuan Liu, Hong Liu, Guoquan Wang, Wenhao Li:
APP: Adaptive Pose Pooling for 3D Human Pose Estimation from Videos. 1672-1681
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Bi0SVNX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Bi0SVNX24
Jing Bi, Yunlong Tang, Luchuan Song, Ali Vosoughi, Nguyen Nguyen, Chenliang Xu:
EAGLE: Egocentric AGgregated Language-video Engine. 1682-1691
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinS24
Kai Yin, Jie Shen:
Expanded Convolutional Neural Network Based Look-Up Tables for High Efficient Single-Image Super-Resolution. 1692-1700
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Han0YZQY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Han0YZQY24
Zheng Han, Xiaobin Zhu, Chun Yang, Hongyang Zhou, Jingyan Qin, Xu-Cheng Yin:
Exploring Stable Meta-Optimization Patterns via Differentiable Reinforcement Learning for Few-Shot Classification. 1701-1710
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Guo0L0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Guo0L0024
Yixin Guo, Yu Liu, Jianghao Li, Weimin Wang, Qi Jia:
Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection. 1711-1720
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengZX0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengZX0L24
Jiangbin Zheng, Han Zhang, Qianqing Xu, An-Ping Zeng, Stan Z. Li:
MetaEnzyme: Meta Pan-Enzyme Learning for Task-Adaptive Redesign. 1721-1730
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhongZ0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhongZ0W24
Yiming Zhong, Xiaolin Zhang, Yao Zhao, Yunchao Wei:
DreamLCM: Towards High Quality Text-to-3D Generation via Latent Consistency Model. 1731-1740
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuXZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuXZW24
Anna Zhu, Ke Xiao, Bo Zhou, Runmin Wang:
Trust Prophet or Not? Taking a Further Verification Step toward Accurate Scene Text Recognition. 1741-1750
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xi0YZQW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xi0YZQW24
Gongli Xi, Ye Tian, Mengyu Yang, Lanshan Zhang, Xirong Que, Wendong Wang:
Global Patch-wise Attention is Masterful Facilitator for Masked Image Modeling. 1751-1760
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DengXCXTD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DengXCXTD024
Chenghao Deng, Haote Xu, Xiaolu Chen, Haodi Xu, Xiaotong Tu, Xinghao Ding, Yue Huang:
SimCLIP: Refining Image-Text Alignment with Simple Prompts for Zero-/Few-shot Anomaly Detection. 1761-1770
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TianX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TianX024
Yuanhe Tian, Fei Xia, Yan Song:
Diffusion Networks with Task-Specific Noise Control for Radiology Report Generation. 1771-1780
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XingG0TM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XingG0TM24
Yun Xing, Qing Guo, Xiaofeng Cao, Ivor W. Tsang, Lei Ma:
MetaRepair: Learning to Repair Deep Neural Networks from Repairing Experiences. 1781-1790
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZCXFZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZCXFZ24
Xingtao Wang, Xianqi Zhang, Wenxue Cui, Ruiqin Xiong, Xiaopeng Fan, Debin Zhao:
Mesh Denoising Using Filtering Coefficients Jointly Aware of Noise and Geometry. 1791-1799
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuangZHZDR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuangZHZDR24
Yan Zhuang, Yanru Zhang, Zheng Hu, Xiaoyue Zhang, Jiawen Deng, Fuji Ren:
GLoMo: Global-Local Modal Fusion for Multimodal Sentiment Analysis. 1800-1809
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wu0W0LZLS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wu0W0LZLS24
Yuhui Wu, Guoqing Wang, Zhiwen Wang, Yang Yang, Tianyu Li, Malu Zhang, Chongyi Li, Heng Tao Shen:
JoReS-Diff: Joint Retinex and Semantic Priors in Diffusion Model for Low-light Image Enhancement. 1810-1818
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WenW0LCP024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WenW0LCP024
Zichen Wen, Tianyi Wu, Yazhou Ren, Yawen Ling, Chenhang Cui, Xiaorong Pu, Lifang He:
Dual-Optimized Adaptive Graph Reconstruction for Multi-View Graph Clustering. 1819-1828
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuH0ZRR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuH0ZRR24
Xiaobin Lu, Xiaobin Hu, Jun Luo, Ben Zhu, Yaping Ruan, Wenqi Ren:
3D Priors-Guided Diffusion for Blind Face Restoration. 1829-1838
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuZLX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuZLX024
Hao Wu, Likun Zhang, Shucheng Li, Fengyuan Xu, Sheng Zhong:
CoAst: Validation-Free Contribution Assessment for Federated Learning based on Cross-Round Valuation. 1839-1847
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiaLSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiaLSL24
Kang Xia, Wenzhong Li, Yimiao Shao, Sanglu Lu:
Vi2ACT: Video-enhanced Cross-modal Co-learning with Representation Conditional Discriminator for Few-shot Human Activity Recognition. 1848-1856
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KoKC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KoKC24
Seonggwan Ko, Yeong Jun Koh, Donghyeon Cho:
Reference-based Burst Super-resolution. 1857-1865
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang0HDZHHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang0HDZHHS24
Yi Zhang, Zhefeng Wang, Rui Hu, Xinyu Duan, Yi Zheng, Baoxing Huai, Jiarun Han, Jitao Sang:
Poisoning for Debiasing: Fair Recognition via Eliminating Bias Uncovered in Data Poisoning. 1866-1874
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XueQX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XueQX24
Dizhan Xue, Shengsheng Qian, Changsheng Xu:
Few-Shot Multimodal Explanation for Visual Question Answering. 1875-1884
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangL24
Jingtao Wang, Zechao Li:
3DPCP-Net: A Lightweight Progressive 3D Correspondence Pruning Network for Accurate and Efficient Point Cloud Registration. 1885-1894
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GeCZZLW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GeCZZLW024
Jiawei Ge, Jiuxin Cao, Xuelin Zhu, Xinyu Zhang, Chang Liu, Kun Wang, Bo Liu:
Consistencies are All You Need for Semi-supervised Vision-Language Tracking. 1895-1904
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZouYHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZouYHZ24
Zhen Zou, Hu Yu, Jie Huang, Feng Zhao:
FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining. 1905-1914
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoLWWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoLWWL24
Zhida Zhao, Jia Li, Lijun Wang, Yifan Wang, Huchuan Lu:
MaskMentor: Unlocking the Potential of Masked Self-Teaching for Missing Modality RGB-D Semantic Segmentation. 1915-1923
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YaoZWHGJ0J24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YaoZWHGJ0J24
Linli Yao, Yuanmeng Zhang, Ziheng Wang, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Xu Sun, Qin Jin:
Edit As You Wish: Video Caption Editing with Multi-grained User Control. 1924-1933
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiXZHWS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiXZHWS24
Wenlin Li, Yucheng Xu, Xiaoqing Zheng, Suoya Han, Jun Wang, Xiaobo Sun:
Dual Advancement of Representation Learning and Clustering for Sparse and Noisy Images. 1934-1942
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HaoX0G0S024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HaoX0G0S024
Zhiwei Hao, Zhongyu Xiao, Yong Luo, Jianyuan Guo, Jing Wang, Li Shen, Han Hu:
PrimKD: Primary Modality Guided Multimodal Fusion for RGB-D Semantic Segmentation. 1943-1951
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShenQZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShenQZ0024
Kaixin Shen, Ruijie Quan, Linchao Zhu, Jun Xiao, Yi Yang:
Neural Interaction Energy for Multi-Agent Trajectory Prediction. 1952-1960
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuYWR0YC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuYWR0YC024
Hao Gu, Jiangyan Yi, Chenglong Wang, Yong Ren, Jianhua Tao, Xinrui Yan, Yujie Chen, Xiaohui Zhang:
Utilizing Speaker Profiles for Impersonation Audio Detection. 1961-1970
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiWDLWZZF0CW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiWDLWZZF0CW024
Zejun Li, Ye Wang, Mengfei Du, Qingwen Liu, Binhao Wu, Jiwen Zhang, Chengxing Zhou, Zhihao Fan, Jie Fu, Jingjing Chen, Zhongyu Wei, Xuanjing Huang:
ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks. 1971-1980
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenDG0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenDG0W24
Jiankang Chen, Ling Deng, Zhiyong Gan, Wei-Shi Zheng, Ruixuan Wang:
FodFoM: Fake Outlier Data by Foundation Models Creates Stronger Visual Out-of-Distribution Detector. 1981-1990
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangRCFTH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangRCFTH24
Xudong Wang, Weihong Ren, Xi'ai Chen, Huijie Fan, Yandong Tang, Zhi Han:
Uni-YOLO: Vision-Language Model-Guided YOLO for Robust and Fast Universal Detection in the Open World. 1991-2000
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhongLXHLG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhongLXHLG24
Junliu Zhong, Zhiyi Li, Dan Xiang, Maotang Han, Changsheng Li, Yanfen Gan:
A Lightweight Multi-domain Multi-attention Progressive Network for Single Image Deraining. 2001-2010
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangL0M24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangL0M24
Weijia Zhang, Dongnan Liu, Weidong Cai, Chao Ma:
Cross-View Consistency Regularisation for Knowledge Distillation. 2011-2020
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SongTLMYC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SongTLMYC024
Zikai Song, Ying Tang, Run Luo, Lintao Ma, Junqing Yu, Yi-Ping Phoebe Chen, Wei Yang:
Autogenic Language Embedding for Coherent Point Tracking. 2021-2030
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PanSWZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PanSWZ024
Yuwen Pan, Rui Sun, Yuan Wang, Tianzhu Zhang, Yongdong Zhang:
Rethinking the Implicit Optimization Paradigm with Dual Alignments for Referring Remote Sensing Image Segmentation. 2031-2040
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuZZCL0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuZZCL0W24
Zhaopeng Gu, Bingke Zhu, Guibo Zhu, Yingying Chen, Hao Li, Ming Tang, Jinqiao Wang:
FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization. 2041-2049
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LeiZYXZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LeiZYXZH24
Yi Lei, Huilin Zhu, Jingling Yuan, Guangli Xiang, Xian Zhong, Shengfeng He:
DenseTrack: Drone-Based Crowd Tracking via Density-Aware Motion-Appearance Synergy. 2050-2058
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangWG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangWG24
Fengze Jiang, Shuling Wang, Xiaojin Gong:
Task-Conditional Adapter for Multi-Task Dense Prediction. 2059-2068
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinWZLDWSX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinWZLDWSX024
Yitai Lin, Zhijie Wei, Wanfa Zhang, Xiping Lin, Yudi Dai, Chenglu Wen, Siqi Shen, Lan Xu, Cheng Wang:
HmPEAR: A Dataset for Human Pose Estimation and Action Recognition. 2069-2078
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoH00LHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoH00LHS24
Deji Zhao, Donghong Han, Ye Yuan, Bo Ning, Mengxiang Li, Zhongjiang He, Shuangyong Song:
AutoGraph: Enabling Visual Context via Graph Alignment in Open Domain Multi-Modal Dialogue Generation. 2079-2088
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang0Y0FSRZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang0Y0FSRZ024
Jiaxin Zhang, Yiqi Wang, Xihong Yang, Siwei Wang, Yu Feng, Yu Shi, Ruichao Ren, En Zhu, Xinwang Liu:
Test-Time Training on Graphs with Large Language Models (LLMs). 2089-2098
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xiao000ZZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xiao000ZZ024
Yujia Xiao, Xi Wang, Xu Tan, Lei He, Xinfa Zhu, Sheng Zhao, Tan Lee:
Contrastive Context-Speech Pretraining for Expressive Text-to-Speech Synthesis. 2099-2107
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinZC0P024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinZC0P024
Junyu Lin, Yan Zheng, Xinyue Chen, Yazhou Ren, Xiaorong Pu, Jing He:
Cross-view Contrastive Unification Guides Generative Pretraining for Molecular Property Prediction. 2108-2116
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuanZ0LL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuanZ0LL24
Bo Yuan, Danpei Zhao, Zhuoran Liu, Wentao Li, Tian Li:
Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images. 2117-2126
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenWLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenWLL24
Shidi Chen, Lili Wei, Liqian Liang, Congyan Lang:
Joint Homophily and Heterophily Relational Knowledge Distillation for Efficient and Compact 3D Object Detection. 2127-2135
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangW0WL00S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangW0WL00S24
Zhiwen Wang, Yuhui Wu, Zheng Wang, Jiwei Wei, Tianyu Li, Guoqing Wang, Yang Yang, Hengtao Shen:
Cascaded Adversarial Attack: Simultaneously Fooling Rain Removal and Semantic Segmentation Networks. 2136-2145
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Yan0MH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Yan0MH024
Jiexuan Yan, Sheng Huang, Nankun Mu, Luwen Huangfu, Bo Liu:
Category-Prompt Refined Feature Learning for Long-Tailed Multi-Label Image Classification. 2146-2155
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunSLYWL0C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunSLYWL0C24
Penglei Sun, Yaoxian Song, Xiang Liu, Xiaofei Yang, Qiang Wang, Tiefeng Li, Yang Yang, Xiaowen Chu:
3D Question Answering for City Scene Understanding. 2156-2165
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KongC0RK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KongC0RK24
Qiuyu Kong, Jiangming Chen, Jie Jiang, Zanxi Ruan, Lai Kang:
Dual-Branch Fusion with Style Modulation for Cross-Domain Few-Shot Semantic Segmentation. 2166-2174
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLCC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLCC24
Jiaqi Wang, Lu Lu, Mingmin Chi, Jian Chen:
MDR: Multi-stage Decoupled Relational Knowledge Distillation with Adaptive Stage Selection. 2175-2183
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoLLLDP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoLLLDP24
Xiongjun Zhao, Zhengyu Liu, Fen Liu, Guanting Li, Yutao Dou, Shaoliang Peng:
Report-Concept Textual-Prompt Learning for Enhancing X-ray Diagnosis. 2184-2193
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuHZT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuHZT024
Jianzhi Lu, Ruian He, Shili Zhou, Weimin Tan, Bo Yan:
FacialFlowNet: Advancing Facial Optical Flow Estimation with a Diverse Dataset and a Decomposed Model. 2194-2203
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangLL24
Wei-Bang Jiang, Yu-Ting Lan, Bao-Liang Lu:
REmoNet: Reducing Emotional Label Noise via Multi-regularized Self-supervision. 2204-2213
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLZLLYLLGH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLZLLYLLGH24
Shuxun Wang, Yunfei Lei, Ziqi Zhang, Wei Liu, Haowei Liu, Li Yang, Bing Li, Wenjuan Li, Jin Gao, Weiming Hu:
NFT1000: A Cross-Modal Dataset For Non-Fungible Token Retrieval. 2214-2222
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SuD0N24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SuD0N24
Haoyang Su, Wenzhe Du, Xiaoliang Wang, Cam-Tu Nguyen:
Sample Efficiency Matters: Training Multimodal Conversational Recommendation Systems in a Small Data Setting. 2223-2232
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JuZZLLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JuZZLLZ24
Xincheng Ju, Dong Zhang, Suyang Zhu, Junhui Li, Shoushan Li, Guodong Zhou:
ECFCON: Emotion Consequence Forecasting in Conversations. 2233-2241
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinS0L00Q24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinS0L00Q24
Xiangbo Yin, Jiangming Shi, Yachao Zhang, Yang Lu, Zhizhong Zhang, Yuan Xie, Yanyun Qu:
Robust Pseudo-label Learning with Neighbor Relation for Unsupervised Visible-Infrared Person Re-Identification. 2242-2251
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiCFJ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiCFJ0024
Yubo Li, De Cheng, Chaowei Fang, Changzhe Jiao, Nannan Wang, Xinbo Gao:
Disentangling Identity Features from Interference Factors for Cloth-Changing Person Re-identification. 2252-2261
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0LG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0LG024
Bing Wang, Shengsheng Wang, Changchun Li, Renchu Guan, Ximing Li:
Harmfully Manipulated Images Matter in Multimodal Misinformation Detection. 2262-2271
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangCJGCZYWY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangCJGCZYWY24
Wuliang Huang, Yiqiang Chen, Xinlong Jiang, Chenlong Gao, Qian Chen, Teng Zhang, Bingjie Yan, Yifan Wang, Jianrong Yang:
Correlation-Driven Multi-Modality Graph Decomposition for Cross-Subject Emotion Recognition. 2272-2281
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0S00T24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0S00T24
Wenbin Wang, Liang Ding, Li Shen, Yong Luo, Han Hu, Dacheng Tao:
WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge. 2282-2291
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenZXZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenZXZ0024
Zhanpeng Chen, Zhihong Zhu, Wanshi Xu, Yunyan Zhang, Xian Wu, Yefeng Zheng:
Aspects are Anchors: Towards Multimodal Aspect-based Sentiment Analysis via Aspect-driven Alignment and Refinement. 2292-2300
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenHDZS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenHDZS24
Haodong Chen, Haojian Huang, Junhao Dong, Mingzhe Zheng, Dian Shao:
FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs. 2301-2310
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiSZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiSZ024
Honghao Li, Lei Sang, Yi Zhang, Yiwen Zhang:
SimCEN: Simple Contrast-enhanced Network for CTR Prediction. 2311-2320
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShiLLCM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShiLLCM24
Yuanyuan Shi, Yunan Li, Siyu Liang, Huizhou Chen, Qiguang Miao:
MGR-Dark: A Large Multimodal Video Dataset and RGB-IR Benchmark for Gesture Recognition in Darkness. 2321-2330
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Yan0D0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Yan0D0024
Shuanglin Yan, Jun Liu, Neng Dong, Liyan Zhang, Jinhui Tang:
Prototypical Prompting for Text-to-image Person Re-identification. 2331-2340
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FengJM024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FengJM024
Kexiang Feng, Chuanmin Jia, Siwei Ma, Wen Gao:
Unifying Spike Perception and Prediction: A Compact Spike Representation Model Using Multi-scale Correlation. 2341-2349
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangQSX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangQSX24
Feifei Zhang, Sijia Qu, Fan Shi, Changsheng Xu:
Overcoming the Pitfalls of Vision-Language Model for Image-Text Retrieval. 2350-2359
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ToniniDVB024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ToniniDVB024
Francesco Tonini, Nicola Dall'Asen, Lorenzo Vaquero, Cigdem Beyan, Elisa Ricci:
AL-GTD: Deep Active Learning for Gaze Target Detection. 2360-2369
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouSLCZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouSLCZ24
Yuxiang Zhou, Zhe Sun, Rui Liu, Yong Chen, Dell Zhang:
AVHash: Joint Audio-Visual Hashing for Video Retrieval. 2370-2378
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Jiang000L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Jiang000L24
Xin Jiang, Hao Tang, Rui Yan, Jinhui Tang, Zechao Li:
DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines. 2379-2388
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiZJLGW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiZJLGW024
Qian Li, Yucheng Zhou, Cheng Ji, Feihong Lu, Jianian Gong, Shangguang Wang, Jianxin Li:
Multi-Modal Inductive Framework for Text-Video Retrieval. 2389-2398
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuSSY00L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuSSY00L24
Hancheng Zhu, Ju Shi, Zhiwen Shao, Rui Yao, Yong Zhou, Jiaqi Zhao, Leida Li:
Attribute-Driven Multimodal Hierarchical Prompts for Image Aesthetic Quality Assessment. 2399-2408
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiaoKZ0X24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiaoKZ0X24
Zeyu Xiao, Dachun Kai, Yueyi Zhang, Xiaoyan Sun, Zhiwei Xiong:
Asymmetric Event-Guided Video Super-Resolution. 2409-2418
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PanSN0Z024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PanSN0Z024
Yuanfeng Pan, Wenkang Su, Jiangqun Ni, Qingliang Liu, Yulin Zhang, Donghua Jiang:
Model-Based Non-Independent Distortion Cost Design for Effective JPEG Steganography. 2419-2427
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YueZCZLZQ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YueZCZLZQ024
Xianghu Yue, Xueyi Zhang, Yiming Chen, Chengwei Zhang, Mingrui Lao, Huiping Zhuang, Xinyuan Qian, Haizhou Li:
MMAL: Multi-Modal Analytic Learning for Exemplar-Free Audio-Visual Class Incremental Tasks. 2428-2437
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangCZYG0LSZQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangCZYG0LSZQ24
Yuzheng Wang, Zhaoyu Chen, Jie Zhang, Dingkang Yang, Zuhao Ge, Yang Liu, Siao Liu, Yunquan Sun, Wenqiang Zhang, Lizhe Qi:
Sampling to Distill: Knowledge Transfer from Open-World Data. 2438-2447
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuHLZR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuHLZR24
Xi Wu, Chuang Huang, Xinliu Liu, Fei Zhou, Zhenwen Ren:
Multiple Kernel Clustering with Shifted Laplacian on Grassmann Manifold. 2448-2456
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiJ0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiJ0W24
Guangyao Li, Yajun Jian, Yan Yan, Hanzi Wang:
GLATrack: Global and Local Awareness for Open-Vocabulary Multiple Object Tracking. 2457-2466
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HaoNJTY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HaoNJTY24
Xuze Hao, Wenqian Ni, Xuhao Jiang, Weimin Tan, Bo Yan:
Addressing Imbalance for Class Incremental Learning in Medical Image Classification. 2467-2476
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiPZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiPZ24
Qiwei Li, Yuxin Peng, Jiahuan Zhou:
Progressive Prototype Evolving for Dual-Forgetting Mitigation in Non-Exemplar Online Continual Learning. 2477-2486
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhou0YZLML24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhou0YZLML24
Fengfan Zhou, Qianyu Zhou, Bangjie Yin, Hui Zheng, Xuequan Lu, Lizhuang Ma, Hefei Ling:
Rethinking Impersonation and Dodging Attacks on Face Recognition Systems. 2487-2496
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenWJZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenWJZ024
Xin Chen, Bin Wang, Jinzheng Jiang, Kunkun Zhang, Yongsheng Gao:
SDePR: Fine-Grained Leaf Image Retrieval with Structural Deep Patch Representation. 2497-2505
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuHHFZWLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuHHFZWLW24
Yuhan Liu, Qianxin Huang, Siqi Hui, Jingwen Fu, Sanping Zhou, Kangyi Wu, Pengna Li, Jinjun Wang:
Semantic-aware Representation Learning for Homography Estimation. 2506-2514
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuiZYLJZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuiZYLJZ24
Chen Hui, Haiqi Zhu, Shuya Yan, Shaohui Liu, Feng Jiang, Debin Zhao:
S²-CSNet: Scale-Aware Scalable Sampling Network for Image Compressive Sensing. 2515-2524
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZengZWY00Q024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZengZWY00Q024
Gangyan Zeng, Yuan Zhang, Jin Wei, Dongbao Yang, Peng Zhang, Yiwen Gao, Xugong Qin, Yu Zhou:
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval. 2525-2534
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuLBGHO024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuLBGHO024
Hua Yu, Weiming Liu, Jiapeng Bai, Xu Gui, Yaqing Hou, Yew-Soon Ong, Qiang Zhang:
Towards Efficient and Diverse Generative Model for Unconditional Human Motion Synthesis. 2535-2544
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0002ZLZS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0002ZLZS024
Dan Zeng, Yu Zhu, Shuiwang Li, Qijun Zhao, Qiaomu Shen, Bo Tang:
Towards Labeling-free Fine-grained Animal Pose Estimation. 2545-2553
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XieMHXM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XieMHXM24
Rui Xie, Anlong Ming, Shuai He, Yi Xiao, Huadong Ma:
"Special Relativity" of Image Aesthetics Assessment: a Preliminary Empirical Perspective. 2554-2563
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinMLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinMLZ24
Zhengwei Yin, Mingze Ma, Guixu Lin, Yinqiang Zheng:
Exploring Data Efficiency in Image Restoration: A Gaussian Denoising Case Study. 2564-2573
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZWGW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZWGW24
Yuntao Wang, Jinpu Zhang, Ruonan Wei, Wenbo Gao, Yuehuan Wang:
MFRGN: Multi-scale Feature Representation Generalization Network for Ground-to-Aerial Geo-localization. 2574-2583
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuQHLLYLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuQHLLYLY24
Chang Wu, Guancheng Quan, Gang He, Xin-Quan Lai, Yunsong Li, Wenxin Yu, Xianmeng Lin, Cheng Yang:
QS-NeRV: Real-Time Quality-Scalable Decoding with Neural Representation for Videos. 2584-2592
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanZLWSM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanZLWSM24
Xiaoyu Han, Shunyuan Zheng, Zonglin Li, Chenyang Wang, Xin Sun, Quanling Meng:
Shape-Guided Clothing Warping for Virtual Try-On. 2593-2602
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWWCLK024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWWCLK024
Richen Liu, Hansheng Wang, Hailong Wang, Siru Chen, Chufan Lai, Ayush Kumar, Siming Chen:
ScaleTraversal: Creating Multi-Scale Biomedical Animation with Limited Hardware Resources. 2603-2612
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuWZFB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuWZFB24
Chenrui Wu, Haishuai Wang, Xiang Zhang, Zhen Fang, Jiajun Bu:
Spatio-temporal Heterogeneous Federated Learning for Time Series Classification with Multi-view Orthogonal Training. 2613-2622
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PengSC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PengSC24
Yaopeng Peng, Milan Sonka, Danny Z. Chen:
Group Vision Transformer. 2623-2631
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0013L0WD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0013L0WD24
Zhichao Yang, Leida Li, Pengfei Chen, Jinjian Wu, Weisheng Dong:
Semantics-Aware Image Aesthetics Assessment using Tag Matching and Contrastive Ranking. 2632-2641
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang00ZN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang00ZN24
Pengcheng Zhang, Xiaohan Yu, Xiao Bai, Jin Zheng, Xin Ning:
Prompting Continual Person Search. 2642-2651
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoZYSL0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoZYSL0Z24
Xiao Zhao, Xukun Zhang, Dingkang Yang, Mingyang Sun, Mingcheng Li, Shunli Wang, Lihua Zhang:
MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation. 2652-2661
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangZHWF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangZHWF24
Yong Yang, Aoqi Zhao, Shuying Huang, Xiaozheng Wang, Yajing Fan:
SCPSN: Spectral Clustering-based Pyramid Super-resolution Network for Hyperspectral Images. 2662-2670
- view
  authority control:
- export record
  dblp key:
  - conf/mm/00060PZ00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/00060PZ00024
Xiangyu Chen, Yihao Liu, Yuandong Pu, Wenlong Zhang, Jiantao Zhou, Yu Qiao, Chao Dong:
Learning A Low-Level Vision Generalist via Visual Task Prompt. 2671-2680
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShiZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShiZ24
Wenxu Shi, Bochuan Zheng:
Alleviating the Equilibrium Challenge with Sample Virtual Labeling for Adversarial Domain Adaptation. 2681-2689
- view
  authority control:
- export record
  dblp key:
  - conf/mm/EspositiB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/EspositiB24
Federico Espositi, Andrea Bonarini:
The Room: Design and Embodiment of Spaces as Social Beings. 2690-2699
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaDG0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaDG0024
Chunjie Ma, Lina Du, Zan Gao, Li Zhuo, Meng Wang:
A Coarse to Fine Detection Method for Prohibited Object in X-ray Images Based on Progressive Transformer Decoder. 2700-2708
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XieYQWSZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XieYQWSZZ24
Qizhi Xie, Kun Yuan, Yunpeng Qu, Mingda Wu, Ming Sun, Chao Zhou, Jihong Zhu:
QPT-V2: Masked Image Modeling Advances Visual Scoring. 2709-2718
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuC024
Shengguang Wu, Zhenglun Chen, Qi Su:
Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and Multi-Source Supervision. 2719-2728
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FengTZHLZS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FengTZHLZS24
Yu Feng, Zhen Tian, Yifan Zhu, Zongfu Han, Haoran Luo, Guangwei Zhang, Meina Song:
CP-Prompt: Composition-Based Cross-modal Prompting for Domain-Incremental Continual Learning. 2729-2738
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WenYCXZZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WenYCXZZ024
Huixiang Wen, Shizong Yan, Shan Chang, Jie Xu, Hongzi Zhu, Yanting Zhang, Bo Li:
DepthCloak: Projecting Optical Camouflage Patches for Erroneous Monocular Depth Estimation of Vehicles. 2739-2747
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuYCQYXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuYCQYXL24
Keming Wu, Man Yao, Yuhong Chou, Xuerui Qiu, Rui Yang, Bo Xu, Guoqi Li:
RSC-SNN: Exploring the Trade-off Between Adversarial Robustness and Accuracy in Spiking Neural Networks via Randomized Smoothing Coding. 2748-2756
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaoHPGQ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaoHPGQ0024
Xueying Mao, Xiaoxiao Hu, Wanli Peng, Zhenliang Gan, Zhenxing Qian, Xinpeng Zhang, Sheng Li:
From Covert Hiding To Visual Editing: Robust Generative Video Steganography. 2757-2765
- view
  authority control:
- export record
  dblp key:
  - conf/mm/RanMH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/RanMH024
Wu Ran, Peirong Ma, Zhiquan He, Hong Lu:
Rainmer: Learning Multi-view Representations for Comprehensive Image Deraining and Beyond. 2766-2775
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiYMB0C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiYMB0C24
Haoxuan Li, Zhengmao Yang, Yunshan Ma, Yi Bin, Yang Yang, Tat-Seng Chua:
MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models. 2776-2785
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WenHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WenHL24
Shuyuan Wen, Bingrui Hu, Wenchao Li:
CDEA: Context- and Detail-Enhanced Unsupervised Learning for Domain Adaptive Semantic Segmentation. 2786-2794
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LingOWCYCCGTLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LingOWCYCCGTLH24
Xitong Ling, Minxi Ouyang, Yizhi Wang, Xinrui Chen, Renao Yan, Hongbo Chu, Junru Cheng, Tian Guan, Sufang Tian, Xiaoping Liu, Yonghong He:
Agent Aggregator with Mask Denoise Mechanism for Histopathology Whole Slide Image Analysis. 2795-2803
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuM00LYHY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuM00LYHY24
Kepeng Xu, Zijia Ma, Li Xu, Gang He, Yunsong Li, Wenxin Yu, Taichu Han, Cheng Yang:
An End-to-End Real-World Camera Imaging Pipeline. 2804-2813
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Yang0SMH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Yang0SMH024
Lijian Yang, Weisheng Li, Yucheng Shu, Jian-Xun Mi, Yuping Huang, Bin Xiao:
ShiftMorph: A Fast and Robust Convolutional Neural Network for 3D Deformable Medical Image Registration. 2814-2823
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuZ0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuZ0L24
Ximing Wu, Kongyange Zhao, Xu Chen, Teng Liang:
Edge-assisted Real-time Dynamic 3D Point Cloud Rendering for Multi-party Mobile Virtual Reality. 2824-2832
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuMZZBWY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuMZZBWY24
Nannan Yu, Tao Ma, Jiqing Zhang, Yuji Zhang, Qirui Bao, Xiaopeng Wei, Xin Yang:
Adaptive Vision Transformer for Event-Based Human Pose Estimation. 2833-2841
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang00ZLH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang00ZLH024
Litian Zhang, Xiaoming Zhang, Chaozhuo Li, Ziyi Zhou, Jiacheng Liu, Feiran Huang, Xi Zhang:
Mitigating Social Hazards: Early Detection of Fake News via Diffusion-Guided Propagation Path Generation. 2842-2851
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DuHYM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DuHYM24
Yuzhen Du, Teng Hu, Ran Yi, Lizhuang Ma:
LD-BFR: Vector-Quantization-Based Face Restoration Model with Latent Diffusion Enhancement. 2852-2860
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangCZGYZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangCZGYZ024
Jie Huang, Zhao-Min Chen, Xiaoqin Zhang, Yisu Ge, Lusi Ye, Guodao Zhang, Huiling Chen:
Label Decoupling and Reconstruction: A Two-Stage Training Framework for Long-tailed Multi-label Medical Image Recognition. 2861-2869
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuF0JZ0AL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuF0JZ0AL024
Chengpei Xu, Hao Fu, Long Ma, Wenjing Jia, Chengqi Zhang, Feng Xia, Xiaoyu Ai, Binghao Li, Wenjie Zhang:
Seeing Text in the Dark: Algorithm and Benchmark. 2870-2878
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0027WSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0027WSZ24
Ye Tian, Zhe Wang, Jianguo Sun, Liguo Zhang:
Time-Frequency Domain Fusion Enhancement for Audio Super-Resolution. 2879-2887
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLC24
Lei Liu, Li Liu, Yawen Cui:
Prior-free Balanced Replay: Uncertainty-guided Reservoir Sampling for Long-Tailed Continual Learning. 2888-2897
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuCZGG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuCZGG24
Tianjiao Xu, Aoxuan Chen, Yuxi Zhao, Jinfei Gao, Tian Gan:
A Chinese Multimodal Social Video Dataset for Controversy Detection. 2898-2907
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiHZXW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiHZXW24
Zhe Ji, Qiansiqi Hu, Yicheng Zheng, Liyao Xiang, Xinbing Wang:
A Principled Approach to Natural Language Watermarking. 2908-2916
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuX000024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuX000024
Hao Wu, Fan Xu, Chong Chen, Xian-Sheng Hua, Xiao Luo, Haixin Wang:
PastNet: Introducing Physical Inductive Biases for Spatio-temporal Video Prediction. 2917-2926
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YaoLKWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YaoLKWL24
Jiawei Yao, Yingxin Lai, Hongrui Kou, Tong Wu, Ruixi Liu:
QE-BEV: Query Evolution for Bird's Eye View Object Detection in Varied Contexts. 2927-2935
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWZ0LK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWZ0LK24
Xiangrui Liu, Xinju Wu, Pingping Zhang, Shiqi Wang, Zhu Li, Sam Kwong:
CompGS: Efficient 3D Scene Representation via Compressed Gaussian Splatting. 2936-2944
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HaoCZSHZZL0LW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HaoCZSHZZL0LW24
Shengyu Hao, Wenhao Chai, Zhonghan Zhao, Meiqi Sun, Wendi Hu, Jieyang Zhou, Yixian Zhao, Qi Li, Yizhou Wang, Xi Li, Gaoang Wang:
Ego3DT: Tracking Every 3D Object in Ego-centric Videos. 2945-2954
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuSLLLG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuSLLLG24
Junkang Liu, Fanhua Shang, Yuanyuan Liu, Hongying Liu, Yuangang Li, YunXiang Gong:
FedBCGD: Communication-Efficient Accelerated Block Coordinate Gradient Descent for Federated Learning. 2955-2963
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChengH0H24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChengH0H24
Yiran Cheng, Bintao He, Fa Zhang, Renmin Han:
Serial Section Microscopy Image Inpainting Guided by Axial Optical Flow. 2964-2972
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FangCQM0C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FangCQM0C24
Han Fang, Kejiang Chen, Yupeng Qiu, Zehua Ma, Weiming Zhang, Ee-Chien Chang:
DERO: Diffusion-Model-Erasure Robust Watermarking. 2973-2981
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0CKZD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0CKZD24
Yin Wang, Hao Lu, Ying-Cong Chen, Li Kuang, Mengchu Zhou, Shuiguang Deng:
rPPG-HiBa: Hierarchical Balanced Framework for Remote Physiological Measurement. 2982-2991
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenXCLB024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenXCLB024
Huan Chen, Tingfa Xu, Zhenxiang Chen, Peifu Liu, Huiyan Bai, Jianan Li:
Multi-scale Change-Aware Transformer for Remote Sensing Image Change Detection. 2992-3000
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PengWHCR024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PengWHCR024
Yinyin Peng, Yaofei Wang, Donghui Hu, Kejiang Chen, Xianjin Rong, Weiming Zhang:
LDStega: Practical and Robust Generative Image Steganography based on Latent Diffusion Models. 3001-3009
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuXJWLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuXJWLW24
Lei Lu, Yanyue Xie, Wei Jiang, Wei Wang, Xue Lin, Yanzhi Wang:
HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression. 3010-3018
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Li00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Li00024
Linfei Li, Lin Zhang, Zhong Wang, Ying Shen:
GS³LAM: Gaussian Semantic Splatting SLAM. 3019-3027
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangHWB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangHWB24
Shuang Wang, Pengyi Hao, Fuli Wu, Cong Bai:
Live on the Hump: Self Knowledge Distillation via Virtual Teacher-Students Mutual Learning. 3028-3036
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuX00L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuX00L24
Xuhan Zhu, Yifei Xing, Ruiping Wang, Yaowei Wang, Xiangyuan Lan:
Calibration for Long-tailed Scene Graph Generation. 3037-3046
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuZDSL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuZDSL024
Minjing Yu, Lingzhi Zeng, Xinxin Du, Jenny Sheng, Qiantian Liao, Yong-Jin Liu:
VisHanfu: An Interactive System for the Promotion of Hanfu Knowledge via Cross-Shaped Flat Structure. 3047-3055
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DuCZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DuCZ24
Xiuquan Du, Jiajia Chen, Xuejun Zhang:
CBNet: Cooperation-Based Weakly Supervised Polyp Detection. 3056-3064
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiaoLMXW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiaoLMXW24
Zeyu Xiao, Zhihe Lu, Michael Bi Mi, Zhiwei Xiong, Xinchao Wang:
Unraveling Motion Uncertainty for Local Motion Deblurring. 3065-3074
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZCWG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZCWG24
Yi Wang, Ningze Zhong, Minglin Chen, Longguang Wang, Yulan Guo:
Tangram-Splatting: Optimizing 3D Gaussian Splatting Through Tangram-inspired Shape Priors. 3075-3083
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Chen0XWX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Chen0XWX024
Jiali Chen, Yi Cai, Ruohang Xu, Jiexin Wang, Jiayuan Xie, Qing Li:
Deconfounded Emotion Guidance Sticker Selection with Causal Inference. 3084-3093
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wu0HH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wu0HH24
Zhijian Wu, Jun Li, Yang Hu, Dingjiang Huang:
Compacter: A Lightweight Transformer for Image Restoration. 3094-3103
- view
  authority control:
- export record
  dblp key:
  - conf/mm/BiHL0C024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/BiHL0C024
Xiuli Bi, Yang Hu, Bo Liu, Weisheng Li, Pamela C. Cosman, Bin Xiao:
PriFU: Capturing Task-Relevant Information Without Adversarial Learning. 3104-3112
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenYF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenYF24
Zan Chen, Xiao Yu, Yuanjing Feng:
Connectivity-based Cerebrovascular Segmentation in Time-of-Flight Magnetic Resonance Angiography. 3113-3121
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0012YJLWHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0012YJLWHZ24
Jiawei Chen, Dingkang Yang, Yue Jiang, Mingcheng Li, Jinjie Wei, Xiaolu Hou, Lihua Zhang:
Efficiency in Focus: LayerNorm as a Catalyst for Fine-tuning Medical Visual Language Models. 3122-3130
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TangWPH0ZWT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TangWPH0ZWT24
Keke Tang, Zhensu Wang, Weilong Peng, Lujie Huang, Le Wang, Peican Zhu, Wenping Wang, Zhihong Tian:
SymAttack: Symmetry-aware Imperceptible Adversarial Attacks on 3D Point Clouds. 3131-3140
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiangWPZXW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiangWPZXW24
Jie Liang, Rongjie Wang, Rui Peng, Zhe Zhang, Kaiqiang Xiong, Ronggang Wang:
High Fidelity Aggregated Planar Prior Assisted PatchMatch Multi-View Stereo. 3141-3150
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0017OYHGHX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0017OYHGHX24
Tao Huang, Xinjia Ou, Huali Yang, Shengze Hu, Jing Geng, Junjie Hu, Zhuoran Xu:
Remembering is Not Applying: Interpretable Knowledge Tracing for Problem-solving Processes. 3151-3159
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PhamCC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PhamCC24
Kien T. Pham, Jingye Chen, Qifeng Chen:
TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization. 3160-3169
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiongCTWLZMLXH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiongCTWLZMLXH24
Lingyu Xiong, Xize Cheng, Jintao Tan, Xianjia Wu, Xiandong Li, Lei Zhu, Fei Ma, Minglei Li, Huang Xu, Zhihui Hu:
SegTalker: Segmentation-based Talking Face Generation with Mask-guided Local Editing. 3170-3179
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0003YWMLM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0003YWMLM24
Changshuo Wang, Mingzhe Yu, Lei Wu, Lei Meng, Xiang Li, Xiangxu Meng:
InstantAS: Minimum Coverage Sampling for Arbitrary-Size Image Generation. 3180-3188
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenZ0024
Du Chen, Zhengqiang Zhang, Jie Liang, Lei Zhang:
SSL: A Self-similarity Loss for Improving Generative Image Super-resolution. 3189-3198
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuCWXZSLXG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuCWXZSLXG24
Zhengze Xu, Mengting Chen, Zhao Wang, Linyu Xing, Zhonghua Zhai, Nong Sang, Jinsong Lan, Shuai Xiao, Changxin Gao:
Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos. 3199-3208
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TanSZDWRLZX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TanSZDWRLZX24
Lixing Tan, Shuang Song, Kangneng Zhou, Chengbo Duan, Lanying Wang, Huayang Ren, Linlin Liu, Wei Zhang, Ruoxiu Xiao:
Multi-view X-ray Image Synthesis with Multiple Domain Disentanglement from CT Scans. 3209-3218
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLQLTCS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLQLTCS24
Zecheng Wang, Xinye Li, Zhanyue Qin, Chunshan Li, Zhiying Tu, Dianhui Chu, Dianbo Sui:
Can We Debias Multimodal Large Language Models via Model Editing? 3219-3228
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Dai0VG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Dai0VG24
Shuqi Dai, Ming-Yu Liu, Rafael Valle, Siddharth Gururani:
ExpressiveSinger: Multilingual and Multi-Style Score-based Singing Voice Synthesis with Expressive Performance Control. 3229-3238
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YingY0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YingY0024
Dehao Ying, Fengchang Yu, Haihua Chen, Wei Lu:
DIG: Complex Layout Document Image Generation with Authentic-looking Text for Enhancing Layout Analysis. 3239-3247
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Hong0DCWY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Hong0DCWY24
Shibo Hong, Xuhong Zhang, Tianyu Du, Sheng Cheng, Xun Wang, Jianwei Yin:
Cons2Plan: Vector Floorplan Generation from Various Conditions via a Learning Framework based on Conditional Diffusion Models. 3248-3256
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Pan0WL0JLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Pan0WL0JLL24
Qihe Pan, Zhen Zhao, Zicheng Wang, Sifan Long, Yiming Wu, Wei Ji, Haoran Liang, Ronghua Liang:
Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach. 3257-3265
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Mao0WFZWWW0C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Mao0WFZWWW0C24
Xiaofeng Mao, Zhengkai Jiang, Qilin Wang, Chencan Fu, Jiangning Zhang, Jiafu Wu, Yabiao Wang, Chengjie Wang, Wei Li, Mingmin Chi:
MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation. 3266-3274
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LeeMKA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LeeMKA24
Jihoon Lee, Yunhong Min, Hwidong Kim, Sangtae Ahn:
DAFT-GAN: Dual Affine Transformation Generative Adversarial Network for Text-Guided Image Inpainting. 3275-3283
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeJTW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeJTW24
Boyong He, Yuxiang Ji, Zhuoyue Tan, Liaoni Wu:
Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector. 3284-3293
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLL0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLL0024
Weizhi Liu, Yue Li, Dongdong Lin, Hui Tian, Haizhou Li:
GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis. 3294-3302
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuWLZSXSGLSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuWLZSXSGLSL24
Feihong Lu, Weiqi Wang, Yangyifei Luo, Ziqin Zhu, Qingyun Sun, Baixuan Xu, Haochen Shi, Shiqi Gao, Qian Li, Yangqiu Song, Jianxin Li:
Miko: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discovery. 3303-3312
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhongGYZG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhongGYZG024
Guojin Zhong, Yihu Guo, Jin Yuan, Qianjun Zhang, Weili Guan, Long Chen:
PROMOTE: Prior-Guided Diffusion Model with Global-Local Contrastive Learning for Exemplar-Based Image Translation. 3313-3322
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaiJXHJ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaiJXHJ024
Xiangcheng Zhai, Yingqi Jie, Xueguang Xie, Aimin Hao, Na Jiang, Yang Gao:
ANFluid: Animate Natural Fluid Photos base on Physics-Aware Simulation and Dual-Flow Texture Learning. 3323-3331
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuFZSOPB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuFZSOPB24
Shoubin Yu, Jacob Zhiyuan Fang, Jian Zheng, Gunnar A. Sigurdsson, Vicente Ordonez, Robinson Piramuthu, Mohit Bansal:
Zero-Shot Controllable Image-to-Video Animation via Motion Decomposition. 3332-3341
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChakrabartyCHA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChakrabartyCHA24
Goirik Chakrabarty, Aditya Chandrasekar, Ramya Hebbalaguppe, Prathosh AP:
LoMOE: Localized Multi-Object Editing via Multi-Diffusion. 3342-3351
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenYZLX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenYZLX24
Yuyan Chen, Songzhou Yan, Zhihong Zhu, Zhixu Li, Yanghua Xiao:
XMeCap: Meme Caption Generation with Sub-Image Adaptability. 3352-3361
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiLCWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiLCWL24
Zhenqiang Li, Jie Li, Yangjie Cao, Jiayi Wang, Runfeng Lv:
ImageBind3D: Image as Binding Step for Controllable 3D Generation. 3362-3371
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CaiLZNW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CaiLZNW24
Pengxiang Cai, Zhiwei Liu, Guibo Zhu, Yunfang Niu, Jinqiao Wang:
Auto DragGAN: Editing the Generative Image Manifold in an Autoregressive Manner. 3372-3380
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZYLJWZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZYLJWZC24
Chengwei Zhang, Xueyi Zhang, Xianghu Yue, Mingrui Lao, Tao Jiang, Jiawei Wang, Fubo Zhang, Longyong Chen:
PD-Refiner: An Underlying Surface Inheritance Refiner with Adaptive Edge-Aware Supervision for Point Cloud Denoising. 3381-3390
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangLH0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangLH0024
Yue Jiang, Yueming Lyu, Ziwen He, Bo Peng, Jing Dong:
Mitigating Social Biases in Text-to-Image Diffusion Models via Linguistic-Aligned Attention Guidance. 3391-3400
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0010CDZNQQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0010CDZNQQ24
Peng Zhou, Dunbo Cai, Yujian Du, Runqing Zhang, Bingbing Ni, Jie Qin, Ling Qian:
Edit3D: Elevating 3D Scene Editing with Attention-Driven Multi-Turn Interactivity. 3401-3410
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001CH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001CH24
Ziyu Yao, Xuxin Cheng, Zhiqi Huang:
FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model. 3411-3420
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiJWDGLHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiJWDGLHL24
Xiaomin Li, Xu Jia, Qinghe Wang, Haiwen Diao, Mengmeng Ge, Pengxiang Li, You He, Huchuan Lu:
MoTrans: Customized Motion Transfer with Text-driven Video Diffusion Models. 3421-3430
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuLFSZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuLFSZ024
Qi Xu, Yaxin Li, Xuanye Fang, Jiangrong Shen, Qiang Zhang, Gang Pan:
Reversing Structural Pattern Learning with Biologically Inspired Knowledge Distillation for Spiking Neural Networks. 3431-3439
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001CFX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001CFX24
Xiaogang Wang, Yuhang Cheng, Ziyang Fan, Kai Xu:
Learning to Transfer Heterogeneous Translucent Materials from a 2D Image to 3D Models. 3440-3448
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LyuLJ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LyuLJ024
Zonglin Lyu, Ming Li, Jianbo Jiao, Chen Chen:
Frame Interpolation with Consecutive Brownian Bridge Diffusion. 3449-3458
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuZYWWHWM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuZYWWHWM24
Teng Hu, Jiangning Zhang, Ran Yi, Yating Wang, Jieyu Weng, Hongrui Huang, Yabiao Wang, Lizhuang Ma:
COMD: Training-free Video Motion Transfer With Camera-Object Motion Disentanglement. 3459-3468
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuXMZMS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuXMZMS24
Yihao Liu, Feng Xue, Anlong Ming, Mingshuai Zhao, Huadong Ma, Nicu Sebe:
SM⁴Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One Model. 3469-3478
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiSQX0DCWY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiSQX0DCWY24
Qinfeng Li, Zhiqiang Shen, Zhenghan Qin, Yangfan Xie, Xuhong Zhang, Tianyu Du, Sheng Cheng, Xun Wang, Jianwei Yin:
TransLinkGuard: Safeguarding Transformer Models Against Model Stealing in Edge Deployment. 3479-3488
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wu0C0LGKZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wu0C0LGKZ024
Tao Wu, Mengze Li, Jingyuan Chen, Wei Ji, Wang Lin, Jinyang Gao, Kun Kuang, Zhou Zhao, Fei Wu:
Semantic Alignment for Multimodal Large Language Models. 3489-3498
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangTS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangTS024
Wenxuan Yang, Weimin Tan, Yuqi Sun, Bo Yan:
A Medical Data-Effective Learning Benchmark for Highly Efficient Pre-training of Foundation Models. 3499-3508
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuH0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuH0024
Jin Liu, Huaibo Huang, Jie Cao, Ran He:
ZePo: Zero-Shot Portrait Stylization with Faster Sampling. 3509-3518
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Li0WX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Li0WX24
Yiding Li, Lingyun Yu, Li Wang, Hongtao Xie:
Control-Talker: A Rapid-Customization Talking Head Generation Method for Multi-Condition Control and High-Texture Enhancement. 3519-3527
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiTZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiTZ024
Zhaoyang Li, Zhu Teng, Baopeng Zhang, Jianping Fan:
Boosting Non-causal Semantic Elimination: An Unconventional Harnessing of LVM for Open-World Deepfake Interpretation. 3528-3537
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunF0ZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunF0ZW24
Zhihao Sun, Haipeng Fang, Juan Cao, Xinying Zhao, Danding Wang:
Rethinking Image Editing Detection in the Era of Generative AI Revolution. 3538-3547
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuQYC0C0X0LY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuQYC0C0X0LY24
Hongyun Yu, Zhan Qu, Qihang Yu, Jianchuan Chen, Zhonghua Jiang, Zhiwen Chen, Shengyu Zhang, Jimin Xu, Fei Wu, Chengfei Lv, Gang Yu:
GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting. 3548-3557
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangY0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangY0024
Xingqi Wang, Xiaoyuan Yi, Xing Xie, Jia Jia:
Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization. 3558-3567
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZengYZCCZY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZengYZCCZY24
Weili Zeng, Yichao Yan, Qi Zhu, Zhuo Chen, Pengzhi Chu, Weiming Zhao, Xiaokang Yang:
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting. 3568-3577
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuC0Y024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuC0Y024
Yi Liu, Chengjun Cai, Xiaoli Zhang, Xingliang Yuan, Cong Wang:
Arondight: Red Teaming Large Vision Language Models with Auto-generated Multi-modal Jailbreak Prompts. 3578-3586
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuAZWG0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuAZWG0024
Yisu Liu, Jinyang An, Wanqian Zhang, Dayan Wu, Jingzi Gu, Zheng Lin, Weiping Wang:
Disrupting Diffusion: Token-Level Attention Erasure Attack against Diffusion-based Customization. 3587-3596
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuM024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuM024
Yiren Lu, Jing Ma, Yu Yin:
View-consistent Object Removal in Radiance Fields. 3597-3606
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Long0LLY0MY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Long0LLY0MY24
Shaocong Long, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Chenhao Ying, Yuan Luo, Lizhuang Ma, Shuicheng Yan:
DGMamba: Domain Generalization via Generalized State Space Model. 3607-3616
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengXCSSXD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengXCSSXD24
Wangguandong Zheng, Haifeng Xia, Rui Chen, Libo Sun, Ming Shao, Siyu Xia, Zhengming Ding:
Sketch3D: Style-Consistent Guidance for Sketch-to-3D Generation. 3617-3626
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouSCKSJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouSCKSJ24
Ziyin Zhou, Ke Sun, Zhongxi Chen, Huafeng Kuang, Xiaoshuai Sun, Rongrong Ji:
StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model. 3627-3636
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Chen00ZZT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Chen00ZZT024
Hong Chen, Xin Wang, Yipeng Zhang, Yuwei Zhou, Zeyang Zhang, Siao Tang, Wenwu Zhu:
DisenStudio: Customized Multi-Subject Text-to-Video Generation with Disentangled Spatial Control. 3637-3646
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuZBFHLX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuZBFHLX24
Ziqi Yu, Jing Zhou, Zhongyun Bao, Gang Fu, Weilei He, Chao Liang, Chunxia Xiao:
CFDiffusion: Controllable Foreground Relighting in Image Compositing via Diffusion Model. 3647-3656
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0003GHCZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0003GHCZ0024
Hao Wang, Shangwei Guo, Jialing He, Kangjie Chen, Shudong Zhang, Tianwei Zhang, Tao Xiang:
EvilEdit: Backdooring Text-to-Image Diffusion Models in One Second. 3657-3665
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangSWSLDZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangSWSLDZ24
Haiyan Jiang, Leiyu Song, Dongdong Weng, Zhe Sun, Huiying Li, Xiaonuo Dongye, Zhenliang Zhang:
In Situ 3D Scene Synthesis for Ubiquitous Embodied Interfaces. 3666-3675
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001WLZC0ZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001WLZC0ZL24
Haoning Wu, Xiele Wu, Chunyi Li, Zicheng Zhang, Chaofeng Chen, Xiaohong Liu, Guangtao Zhai, Weisi Lin:
T2I-Scorer: Quantitative Evaluation on Text-to-Image Generation via Fine-Tuned Large Multi-Modal Models. 3676-3685
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiCW0XL0L0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiCW0XL0L0024
Shiwei Li, Yingyi Cheng, Haozhao Wang, Xing Tang, Shijie Xu, Weihong Luo, Yuhua Li, Dugang Liu, Xiuqiang He, Ruixuan Li:
Masked Random Noise for Communication-Efficient Federated Learning. 3686-3694
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YanKLDZX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YanKLDZX24
Sa Yan, Nuowen Kan, Chenglin Li, Wenrui Dai, Junni Zou, Hongkai Xiong:
Task-Oriented Multi-Bitstream Optimization for Image Compression and Transmission via Optimal Transport. 3695-3703
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Li0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Li0Y24
Tingting Li, Ziming Zhao, Jianwei Yin:
Minerva: Enhancing Quantum Network Performance for High-Fidelity Multimedia Transmission. 3704-3712
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuC24
Xiaotong Yu, Chang-Wen Chen:
Semantic-aware Next-Best-View for Multi-DoFs Mobile System in Search-and-Acquisition based Visual Perception. 3713-3721
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenWHFC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenWHFC024
Yu Chen, Yanan Wu, Na Han, Xiaozhao Fang, Bingzhi Chen, Jie Wen:
Partial Multi-label Learning Based On Near-Far Neighborhood Label Enhancement And Nonlinear Guidance. 3722-3731
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiaX0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiaX0L24
Ruofan Jia, Weiying Xie, Jie Lei, Yunsong Li:
Adaptive Hierarchical Aggregation for Federated Object Detection. 3732-3740
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XieGZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XieGZ024
Liang Xie, Wei Gao, Huiming Zheng, Ge Li:
ROI-Guided Point Cloud Geometry Compression Towards Human and Machine Vision. 3741-3750

Oral Session 12: Human-centric and Interactive Multimedia

- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangWTLWK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangWTLWK24
Xiyu Wang, Yufei Wang, Satoshi Tsutsui, Weisi Lin, Bihan Wen, Alex C. Kot:
Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models. 3751-3760
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuZZZHW0XLG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuZZZHW0XLG24
Shiyu Liu, Zibo Zhao, Yihao Zhi, Yiqun Zhao, Binbin Huang, Shuo Wang, Ruoyu Wang, Michael Xuan, Zhengxin Li, Shenghua Gao:
HeroMaker: Human-centric Video Editing with Motion Priors. 3761-3770
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuCD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuCD024
Yunze Liu, Changxi Chen, Chenjing Ding, Li Yi:
PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation. 3771-3780
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001B0WYQPL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001B0WYQPL24
Wenxuan Wang, Haonan Bai, Jen-tse Huang, Yuxuan Wan, Youliang Yuan, Haoyi Qiu, Nanyun Peng, Michael R. Lyu:
New Job, New Gender? Measuring the Social Bias in Image Generation Models. 3781-3789
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Liu0DXZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Liu0DXZW24
Mengzhen Liu, Mengyu Wang, Henghui Ding, Yilong Xu, Yao Zhao, Yunchao Wei:
Segment Anything with Precise Interaction. 3790-3799
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuCYQSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuCYQSL24
Zhihua Xu, Tianshui Chen, Zhijing Yang, Chunmei Qing, Yukai Shi, Liang Lin:
Self-Supervised Emotion Representation Disentanglement for Speech-Preserving Facial Expression Manipulation. 3800-3808

Oral Session 13: Machine Learning for Multimedia

- view
  authority control:
- export record
  dblp key:
  - conf/mm/XieQLWLLLW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XieQLWLLLW024
Dongyu Xie, Chaofan Qiao, Lanyue Liang, Zhiwen Wang, Tianyu Li, Qiao Liu, Chongyi Li, Guoqing Wang, Yang Yang:
Generalizing ISP Model by Unsupervised Raw-to-raw Mapping. 3809-3817
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLG024
Yang Liu, Daizong Liu, Zongming Guo, Wei Hu:
Cross-Task Knowledge Transfer for Semi-supervised Joint 3D Grounding and Captioning. 3818-3827
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuXWDH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuXWDH24
Yang Liu, Qianqian Xu, Peisong Wen, Siran Dai, Qingming Huang:
Not All Pairs are Equal: Hierarchical Learning for Average-Precision-Oriented Video Retrieval. 3828-3837
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FuCYWZJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FuCYWZJ24
Dongjie Fu, Xize Cheng, Xiaoda Yang, Hanting Wang, Zhou Zhao, Tao Jin:
Boosting Speech Recognition Robustness to Modality-Distortion with Contrast-Augmented Prompts. 3838-3847
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuZTWHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuZTWHZ24
Xingyu Zhu, Beier Zhu, Yi Tan, Shuo Wang, Yanbin Hao, Hanwang Zhang:
Selective Vision-Language Subspace Projection for Few-shot CLIP. 3848-3857
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWWFM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWWFM24
Jin Liu, Bo Wang, Chuanming Wang, Huiyuan Fu, Huadong Ma:
Learning Exposure Correction in Dynamic Scenes. 3858-3866

Oral Session 14: Multimodal Datasets, Models & Analytics

- view
  authority control:
- export record
  dblp key:
  - conf/mm/NiuCFPDCH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/NiuCFPDCH024
Fuqiang Niu, Zebang Cheng, Xianghua Fu, Xiaojiang Peng, Genan Dai, Yin Chen, Hu Huang, Bowen Zhang:
Multimodal Multi-turn Conversation Stance Detection: A Challenge Dataset and Effective Model. 3867-3876
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YaoXZR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YaoXZR24
Ruilin Yao, Shengwu Xiong, Yichen Zhao, Yi Rong:
Visual Grounding with Multi-modal Conditional Adaptation. 3877-3886
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuCSHSJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuCSHSJ24
Junhao Xu, Jingjing Chen, Xue Song, Feng Han, Haijun Shan, Yu-Gang Jiang:
Identity-Driven Multimedia Forgery Detection via Reference Assistance. 3887-3896
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoCZ0FZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoCZ0FZ24
Bowen Zhao, Tianhao Cheng, Yuejie Zhang, Ying Cheng, Rui Feng, Xiaobo Zhang:
CT²C-QA: Multimodal Question Answering over Chinese Text, Table and Chart. 3897-3906
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangW0WLL0Z0T24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangW0WLL0Z0T24
Zhanyu Wang, Longyue Wang, Zhen Zhao, Minghao Wu, Chenyang Lyu, Huayang Li, Deng Cai, Luping Zhou, Shuming Shi, Zhaopeng Tu:
GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation. 3907-3916
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuWPYSFN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuWPYSFN24
Linmei Hu, Duokang Wang, Yiming Pan, Jifan Yu, Yingxia Shao, Chong Feng, Liqiang Nie:
NovaChart: A Large-scale Dataset towards Chart Understanding and Generation of Multimodal Large Language Models. 3917-3925

Oral Session 15: Video Applications

- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiYWWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiYWWL24
Jiaxu Li, Songsong Yu, Yifan Wang, Lijun Wang, Huchuan Lu:
SelM: Selective Mechanism based Audio-Visual Segmentation. 3926-3935
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangMMWHM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangMMWHM24
Yuqing Wang, Lei Meng, Haokai Ma, Yuqing Wang, Haibei Huang, Xiangxu Meng:
Modeling Event-level Causal Representation for Video Classification. 3936-3944
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangJWC0HC0L00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangJWC0HC0L00024
Te Yang, Jian Jia, Bo Wang, Yanhua Cheng, Yan Li, Dongze Hao, Xipeng Cao, Quan Chen, Han Li, Peng Jiang, Xiangyu Zhu, Zhen Lei:
Spatiotemporal Fine-grained Video Description for Short Videos. 3945-3954
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Li0GL0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Li0GL0024
Yili Li, Jing Yu, Keke Gai, Bang Liu, Gang Xiong, Qi Wu:
T2VIndexer: A Generative Video Indexer for Efficient Text-Video Retrieval. 3955-3963
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangZ0Q024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangZ0Q024
Haijie Yang, Zhenyu Zhang, Hao Tang, Jianjun Qian, Jian Yang:
ConsistentAvatar: Learning to Diffuse Fully Consistent Talking Head Avatar with Temporal Guidance. 3964-3973
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangLLCT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangLLCT024
Zhiyu Zhang, Guo Lu, Huanxiong Liang, Zhengxue Cheng, Anni Tang, Li Song:
Rate-aware Compression for NeRF-based Volumetric Video. 3974-3983

Oral Session 16: Biological and Health Applications

- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiZZSCSZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiZZSCSZ0024
Jingxiong Li, Sunyi Zheng, Chenglu Zhu, Yuxuan Sun, Pingyi Chen, Zhongyi Shui, Yunlong Zhang, Honglin Li, Lin Yang:
PathUp: Patch-wise Timestep Tracking for Multi-class Large Pathology Image Synthesising Diffusion Model. 3984-3993
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XieZZWNX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XieZZWNX24
Dian Xie, Peiang Zhao, Jiarui Zhang, Kangqi Wei, Xiaobao Ni, Jiong Xia:
BrainRAM: Cross-Modality Retrieval-Augmented Image Reconstruction from Human Brain Activity. 3994-4003
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaZZCWJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaZZCWJ24
Shuo Ma, Yingwei Zhang, Qiqi Zhang, Yiqiang Chen, Haoran Wang, Ziyu Jia:
SleepMG: Multimodal Generalizable Sleep Staging with Inter-modal Balance of Classification and Domain Discrimination. 4004-4013
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GongZBZZLH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GongZBZZLH024
Zixuan Gong, Qi Zhang, Guangyin Bao, Lei Zhu, Yu Zhang, Ke Liu, Liang Hu, Duoqian Miao:
Lite-Mind: Towards Efficient and Robust Brain Representation Learning. 4014-4023
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DongXNL0LQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DongXNL0LQ24
Kun Dong, Jian Xue, Zehai Niu, Xing Lan, Ke Lu, Qingyuan Liu, Xiaoyu Qin:
Realistic Full-Body Motion Generation from Sparse Tracking with State Space Model. 4024-4033
- view
  authority control:
- export record
  dblp key:
  - conf/mm/NaseemDKK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/NaseemDKK24
Usman Naseem, Adam G. Dunn, Matloob Khushi, Jinman Kim:
Vaccine Misinformation Detection in X using Cooperative Multimodal Framework. 4034-4042

Oral Session 17: Person Modeling and Tracking

- view
  authority control:
- export record
  dblp key:
  - conf/mm/YanWCZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YanWCZZ24
Shizong Yan, Huixiang Wen, Shan Chang, Hongzi Zhu, Luo Zhou:
Fooling 3D Face Recognition with One Single 2D Image. 4043-4052
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuY024
Fangyi Liu, Mang Ye, Bo Du:
Cloth-aware Augmentation for Cloth-generalized Person Re-identification. 4053-4062
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PangZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PangZW24
Zhiqi Pang, Lingling Zhao, Chunyu Wang:
Dual-Resolution Fusion Modeling for Unsupervised Cross-Resolution Person Re-Identification. 4063-4072
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TianM0LYZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TianM0LYZ24
Huilin Tian, Jingke Meng, Wei-Shi Zheng, Yuan-Ming Li, Junkai Yan, Yunong Zhang:
Loc4Plan: Locating Before Planning for Outdoor Vision and Language Navigation. 4073-4081
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiaoCLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiaoCLL24
Changcheng Xiao, Qiong Cao, Zhigang Luo, Long Lan:
MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model. 4082-4091
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiYYX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiYYX024
Ling Li, Wenrui Yang, Xinchun Yu, Junliang Xing, Xiao-Ping Zhang:
Translating Motion to Notation: Hand Labanotation for Intuitive and Comprehensive Hand Movement Documentation. 4092-4100

Poster Session 2

- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoL24
Xiang Gao, Jiaying Liu:
FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation. 4101-4109
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinZX0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinZX0024
Wen Yin, Bin Benjamin Zhu, Yulai Xie, Pan Zhou, Dan Feng:
Backdoor Attacks on Bimodal Salient Object Detection with RGB-Thermal Data. 4110-4119
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShenH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShenH024
Zhixiang Shen, Haolan He, Zhao Kang:
Balanced Multi-Relational Graph Clustering. 4120-4128
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLNLS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLNLS024
Jiyuan Wang, Chunyu Lin, Lang Nie, Kang Liao, Shuwei Shao, Yao Zhao:
Digging into Contrastive Learning for Robust Depth Estimation with Diffusion Models. 4129-4137
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenWL0H24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenWL0H24
Zhuoxiao Chen, Zixin Wang, Yadan Luo, Sen Wang, Zi Huang:
DPO: Dual-Perturbation Optimization for Test-time Adaptation in 3D Object Detection. 4138-4147
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangWWQXN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangWWQXN24
Xian Zhang, Haokun Wen, Jianlong Wu, Pengda Qin, Hui Xue', Liqiang Nie:
Differential-Perceptive and Retrieval-Augmented MLLM for Change Captioning. 4148-4157
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Liu00J24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Liu00J24
Bingyan Liu, Chengyu Wang, Jun Huang, Kui Jia:
Attentive Linguistic Tracking in Diffusion Models for Training-free Text-guided Image Editing. 4158-4166
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeZ0024
Changhao He, Hongyuan Zhu, Peng Hu, Xi Peng:
Robust Variational Contrastive Learning for Partially View-unaligned Clustering. 4167-4176
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenLZSJJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenLZSJJ24
Shengxin Chen, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Rongrong Ji:
QueryMatch: A Query-based Contrastive Learning Framework for Weakly Supervised Visual Grounding. 4177-4186
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0008H00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0008H00024
Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Generative Expressive Conversational Speech Synthesis. 4187-4196
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DaiTZTPX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DaiTZTPX24
Zhien Dai, Zhaohui Tang, Hu Zhang, Can Tian, Mingjun Pan, Yongfang Xie:
Eglcr: Edge Structure Guidance and Scale Adaptive Attention for Iterative Stereo Matching. 4197-4206
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhong0LWTCY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhong0LWTCY24
Humen Zhong, Zhibo Yang, Zhaohai Li, Peng Wang, Jun Tang, Wenqing Cheng, Cong Yao:
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer. 4207-4216
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GanTLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GanTLL24
Chaofan Gan, Yuanpeng Tu, Yuxi Li, Weiyao Lin:
DAC: 2D-3D Retrieval with Noisy Labels via Divide-and-Conquer Alignment and Correction. 4217-4226
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HouG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HouG24
Zhenyu Hou, Junjun Guo:
Virtual Visual-Guided Domain-Shadow Fusion via Modal Exchanging for Domain-Specific Multi-Modal Neural Machine Translation. 4227-4235
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangWZXWZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangWZXWZW24
Yuxiang Yang, Lu Wen, Xinyi Zeng, Yuanyuan Xu, Xi Wu, Jiliu Zhou, Yan Wang:
Learning with Alignments: Tackling the Inter- and Intra-domain Shifts for Cross-multidomain Facial Expression Recognition. 4236-4245
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenFCYHY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenFCYHY24
Shuhuang Chen, Dingjie Fu, Shiming Chen, Shuo Ye, Wenjin Hou, Xinge You:
Causal Visual-semantic Correlation for Zero-shot Learning. 4246-4255
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SteinertWFH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SteinertWFH24
Patrick Steinert, Stefan Wagenpfeil, Ingo Frommholz, Matthias L. Hemmje:
256 Metaverse Records Dataset. 4256-4263
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XieZCCH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XieZCCH24
Yifeng Xie, Zhihong Zhu, Xin Chen, Zhanpeng Chen, Zhiqi Huang:
MoBA: Mixture of Bi-directional Adapter for Multi-modal Sarcasm Detection. 4264-4272
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiYTZLLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiYTZLLW24
Jiulin Li, Mengyu Yang, Ye Tian, Lanshan Zhang, Yongchun Lu, Jice Liu, Wendong Wang:
WaveDN: A Wavelet-based Training-free Zero-shot Enhancement for Vision-Language Models. 4273-4282
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhao0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhao0024
Runkai Zhao, Heng Wang, Weidong Cai:
LaneCMKT: Boosting Monocular 3D Lane Detection with Cross-Modal Knowledge Transfer. 4283-4291
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunLZ0G24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunLZ0G24
Wenju Sun, Qingyong Li, Siyu Zhang, Wen Wang, Yangli-ao Geng:
Incremental Learning via Robust Parameter Posterior Fusion. 4292-4301
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0004YWCSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0004YWCSZ24
Tao Jin, Weicai Yan, Ye Wang, Sihang Cai, Qifan Shuai, Zhou Zhao:
Calibrating Prompt from History for Continual Vision-Language Retrieval and Grounding. 4302-4311
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinLJYFMW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinLJYFMW24
Pengyue Lin, Ruifan Li, Yuzhe Ji, Zhihan Yu, Fangxiang Feng, Zhanyu Ma, Xiaojie Wang:
Triple Alignment Strategies for Zero-shot Phrase Grounding under Weak Supervision. 4312-4321
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Yu00BX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Yu00BX24
Zhenni Yu, Xiaoqin Zhang, Li Zhao, Yi Bin, Guobao Xiao:
Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection. 4322-4330
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0025CLMXC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0025CLMXC24
Jiawei Wang, Da Cao, Shaofei Lu, Zhanchang Ma, Junbin Xiao, Tat-Seng Chua:
Causal-driven Large Language Models with Faithful Reasoning for Knowledge Question Answering. 4331-4340
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Yi0SZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Yi0SZ24
Zijian Yi, Ziming Zhao, Zhishu Shen, Tiehua Zhang:
Multimodal Fusion via Hypergraph Autoencoder and Contrastive Learning for Emotion Recognition in Conversation. 4341-4348
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShenSLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShenSLY24
Cheng Shen, Liquan Shen, Mengyao Li, Meng Yu:
EPL-UFLSID: Efficient Pseudo Labels-Driven Underwater Forward-Looking Sonar Images Object Detection. 4349-4357
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GouWWC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GouWWC24
Shuiping Gou, Xin Wang, Xinlin Wang, Yunzhi Chen:
Interpretable Matching of Optical-SAR Image via Dynamically Conditioned Diffusion Models. 4358-4367
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DingGSHXY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DingGSHXY24
Xiaohuan Ding, Yangrui Gong, Tianyi Shi, Zihang Huang, Gangwei Xu, Xin Yang:
Masked Snake Attention for Fundus Image Restoration with Vessel Preservation. 4368-4376
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangHHWWT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangHHWWT24
Yajie Zhang, Zhi-An Huang, Zhiliang Hong, Songsong Wu, Jibin Wu, Kay Chen Tan:
Mixed Prototype Correction for Causal Inference in Medical Image Classification. 4377-4386
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangYAJTH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangYAJTH24
Yi Zhang, Ke Yu, Angelica I. Avilés-Rivero, Jiyuan Jia, Yushun Tang, Zhihai He:
Training-Free Feature Reconstruction with Sparse Optimization for Vision-Language Models. 4387-4396
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangDHJL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangDHJL24
Nan Wang, Zonglin Di, Houlin He, Qingchao Jiang, Xiaoxiao Li:
A Simple and Provable Approach for Learning on Noisy Labeled Medical Images. 4397-4405
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShengSP0LY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShengSP0LY24
Mengmeng Sheng, Zeren Sun, Gensheng Pei, Tao Chen, Haonan Luo, Yazhou Yao:
Enhancing Robustness in Learning with Noisy Labels: An Asymmetric Co-Training Approach. 4406-4415
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiZ0XLQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiZ0XLQ24
Muquan Li, Dongyang Zhang, Tao He, Xiurui Xie, Yuan-Fang Li, Ke Qin:
Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation. 4416-4425
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenH24
Qiuhui Chen, Yi Hong:
SMART: Self-Weighted Multimodal Fusion for Diagnostics of Neurodegenerative Disorders. 4426-4435
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SuSWZXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SuSWZXL24
Taoyu Su, Jiawei Sheng, Shicheng Wang, Xinghua Zhang, Hongbo Xu, Tingwen Liu:
IBMEA: Exploring Variational Information Bottleneck for Multi-modal Entity Alignment. 4436-4445
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiaXP024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiaXP024
Zhijun Jia, Huaying Xue, Xiulian Peng, Yan Lu:
Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision. 4446-4454
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoXCBLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoXCBLZ24
Yihan Zhao, Wei Xi, Yuhang Cui, Gairui Bai, Xinhui Liu, Jizhong Zhao:
CoPL: Parameter-Efficient Collaborative Prompt Learning for Audio-Visual Tasks. 4455-4464
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Hu024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Hu024
Junbo Hu, Zhixin Li:
Distilled Cross-Combination Transformer for Image Captioning with Dual Refined Visual Features. 4465-4474
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuLSW0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuLSW0L24
Siyuan Xu, Guannan Li, Haofei Song, Jiansheng Wang, Yan Wang, Qingli Li:
GeNSeg-Net: A General Segmentation Framework for Any Nucleus in Immunohistochemistry Images. 4475-4484
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Gao0WMCTLJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Gao0WMCTLJ24
Ziyi Gao, Kai Chen, Zhipeng Wei, Tingshu Mou, Jingjing Chen, Zhiyu Tan, Hao Li, Yu-Gang Jiang:
ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack. 4485-4494
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PengSR00DZSS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PengSR00DZSS24
Kunyu Peng, David Schneider, Alina Roitberg, Kailun Yang, Jiaming Zhang, Chen Deng, Kaiyu Zhang, M. Saquib Sarfraz, Rainer Stiefelhagen:
Towards Video-based Activated Muscle Group Estimation in the Wild. 4495-4504
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuLLYLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuLLYLC24
Rui Xu, Gaolei Li, Changze Li, Zhaohui Yang, Yuchen Liu, Mingzhe Chen:
OSNeRF: On-demand Semantic Neural Radiance Fields for Fast and Robust 3D Object Reconstruction. 4505-4514
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Li0LLHM024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Li0LLHM024
Wenjie Li, Heng Guo, Xuannan Liu, Kongming Liang, Jiani Hu, Zhanyu Ma, Jun Guo:
Efficient Face Super-Resolution via Wavelet-based Feature Enhancement Network. 4515-4523
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DengYLZCH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DengYLZCH24
Ruoxi Deng, Bin Yu, Jinxuan Lu, Caixia Zhou, Zhao-Min Chen, Jie Hu:
Advancing Semantic Edge Detection through Cross-Modal Knowledge Learning. 4524-4532
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangWKZRCZXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangWKZRCZXL24
Jiacheng Zhang, Jie Wu, Huafeng Kuang, Haiming Zhang, Yuxi Ren, Weifeng Chen, Manlin Zhang, Xuefeng Xiao, Guanbin Li:
TreeReward: Improve Diffusion Model via Tree-Structured Feedback Learning. 4533-4542
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShenHZFZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShenHZFZ24
Chaomin Shen, Yaomin Huang, Haokun Zhu, Jinsong Fan, Guixu Zhang:
Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation. 4543-4552
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouLYX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouLYX024
Yanshan Zhou, Pingrui Lai, Jiaqi Yu, Yingjie Xiong, Hua Yang:
Hydrodynamics-Informed Neural Network for Simulating Dense Crowd Motion Patterns. 4553-4561
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuSL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuSL024
Zhidong Yu, Zhenbo Shi, Xiaoman Liu, Wei Yang:
PFFAA: Prototype-based Feature and Frequency Alteration Attack for Semantic Segmentation. 4562-4571
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangZQW0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangZQW0024
Wenbo Huang, Jinghui Zhang, Xuwei Qian, Zhen Wu, Meng Wang, Lei Zhang:
SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition. 4572-4580
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Qu0GZT0G024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Qu0GZT0G024
Xiangyan Qu, Jing Yu, Keke Gai, Jiamin Zhuang, Yuanmin Tang, Gang Xiong, Gaopeng Gou, Qi Wu:
Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning. 4581-4590
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanC0P24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanC0P24
Weixiang Han, Chengjun Cai, Yu Guo, Jialiang Peng:
ERL-MR: Harnessing the Power of Euler Feature Representations for Balanced Multi-modal Learning. 4591-4600
- view
  authority control:
- export record
  dblp key:
  - conf/mm/RossettoSB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/RossettoSB24
Luca Rossetto, Cristina Sarasua, Abraham Bernstein:
Estimating the Semantic Density of Visual Media. 4601-4609
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangWZW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangWZW024
Shaokun Zhang, Yiran Wu, Zhonghua Zheng, Qingyun Wu, Chi Wang:
HyperTime: Hyperparameter Optimization for Combating Temporal Distribution Shifts. 4610-4619
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChuDYDLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChuDYDLZ24
Xiaomeng Chu, Jiajun Deng, Guoliang You, Yifan Duan, Yao Li, Yanyong Zhang:
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies. 4620-4629
- view
  authority control:
- export record
  dblp key:
  - conf/mm/BinLDL0NS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/BinLDL0NS24
Yi Bin, Junrong Liao, Yujuan Ding, Haoxuan Li, Yang Yang, See-Kiong Ng, Heng Tao Shen:
Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning. 4630-4639
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiaLCD0WDD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiaLCD0WDD024
Chengyou Jia, Minnan Luo, Xiaojun Chang, Zhuohang Dang, Mingfei Han, Mengmeng Wang, Guang Dai, Sizhe Dang, Jingdong Wang:
Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition. 4640-4649
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0003WYR024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0003WYR024
Jialu Zhang, Xinyi Wang, Chenglin Yao, Jianfeng Ren, Xudong Jiang:
Visual-linguistic Cross-domain Feature Learning with Group Attention and Gamma-correct Gated Fusion for Extracting Commonsense Knowledge. 4650-4659
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuZY0DL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuZY0DL24
Wenhan Wu, Ce Zheng, Zihao Yang, Chen Chen, Srijan Das, Aidong Lu:
Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer. 4660-4669
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuangCZCLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuangCZCLZ24
Xianwei Zhuang, Xuxin Cheng, Zhihong Zhu, Zhanpeng Chen, Hongxiang Li, Yuexian Zou:
Towards Multimodal-augmented Pre-trained Language Models via Self-balanced Expectation-Maximization Iteration. 4670-4679
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuXH0GWS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuXH0GWS24
Hongze Zhu, Guoyang Xie, Chengbin Hou, Tao Dai, Can Gao, Jinbao Wang, Linlin Shen:
Towards High-resolution 3D Anomaly Detection via Group-Level Feature Contrastive Learning. 4680-4689
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangD024
Kaixiang Wang, Xiaojian Ding, Fan Yang:
Non-Overlapped Multi-View Weak-Label Learning Guided by Multiple Correlations. 4690-4698
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Mei0CYC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Mei0CYC24
Xin Mei, Rui Mao, Xiaoyan Cai, Libin Yang, Erik Cambria:
Medical Report Generation via Multimodal Spatio-Temporal Fusion. 4699-4708
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FanQSM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FanQSM24
Guofan Fan, Zekun Qi, Wenkai Shi, Kaisheng Ma:
Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast. 4709-4718
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang00R0ZWZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang00R0ZWZL24
Menghao Zhang, Jingyu Wang, Qi Qi, Pengfei Ren, Haifeng Sun, Zirui Zhuang, Huazheng Wang, Lei Zhang, Jianxin Liao:
Video Anomaly Detection via Progressive Learning of Multiple Proxy Tasks. 4719-4728
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZSGZZQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZSGZZQ24
Xingyu Zhang, Siyu Zhao, Zeen Song, Huijie Guo, Jianqi Zhang, Changwen Zheng, Wenwen Qiang:
Not All Frequencies Are Created Equal: Towards a Dynamic Fusion of Frequencies in Time-Series Forecasting. 4729-4737
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenZLLWC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenZLLWC024
Shijie Chen, Junbao Zhuo, Xin Li, Haizhuang Liu, Rongquan Wang, Jiansheng Chen, Huimin Ma:
CMT: Co-training Mean-Teacher for Unsupervised Domain Adaptation on 3D Object Detection. 4738-4747
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PanLWTW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PanLWTW24
Tianrui Pan, Jie Liu, Bohan Wang, Jie Tang, Gangshan Wu:
RAVSS: Robust Audio-Visual Speech Separation in Multi-Speaker Scenarios with Missing Visual Cues. 4748-4756
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLGLLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLGLLW24
Siqi Wang, Chao Liang, Yunfan Gao, Yang Liu, Jing Li, Haofen Wang:
Decoding Urban Industrial Complexity: Enhancing Knowledge-Driven Insights via IndustryScopeGPT. 4757-4765
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FuYL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FuYL024
Yuanbin Fu, Jie Ying, Houlei Lv, Xiaojie Guo:
Semi-supervised Camouflaged Object Detection from Noisy Data. 4766-4775
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenK0LS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenK0LS024
Bolei Chen, Jiaxu Kang, Ping Zhong, Yixiong Liang, Yu Sheng, Jianxin Wang:
Embodied Contrastive Learning with Geometric Consistency and Behavioral Awareness for Object Navigation. 4776-4785
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinCHCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinCHCL24
Jia-Li Yin, Menghao Chen, Jin Han, Bo-Hao Chen, Ximeng Liu:
Adversarial Example Quality Assessment: A Large-scale Dataset and Strong Baseline. 4786-4794
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JingZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JingZ24
Ye Jing, Xinpei Zhao:
DQ-Former: Querying Transformer with Dynamic Modality Priority for Cognitive-aligned Multimodal Emotion Recognition in Conversation. 4795-4804
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangFW0ZM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangFW0ZM24
Xicong Wang, Huiyuan Fu, Jiaxuan Wang, Xin Wang, Heng Zhang, Huadong Ma:
Exploring in Extremely Dark: Low-Light Video Enhancement with Real Events. 4805-4813
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangLLCD0HX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangLLCD0HX24
Qing Zhang, Haocheng Lv, Jie Liu, Zhiyun Chen, Jianyong Duan, Hao Wang, Li He, Mingying Xu:
An Entailment Tree Generation Approach for Multimodal Multi-Hop Question Answering with Mixture-of-Experts and Iterative Feedback Mechanism. 4814-4822
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuSS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuSS024
Kangpeng Hu, Quansen Sun, Yinghui Sun, Tao Wang:
Interactive Segmentation by Considering First-Click Intentional Ambiguity. 4823-4831
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShenZZ0ZLBD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShenZZ0ZLBD24
Leqi Shen, Sicheng Zhao, Yifeng Zhang, Hui Chen, Jundong Zhou, Pengzhang Liu, Yongjun Bao, Guiguang Ding:
Multi-Label Learning with Block Diagonal Labels. 4832-4840
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeRB024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeRB024
Wentao He, Jianfeng Ren, Ruibin Bai, Xudong Jiang:
Hierarchical Perceptual and Predictive Analogy-Inference Network for Abstract Visual Reasoning. 4841-4850
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiGZLM0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiGZLM0Y24
Wenxi Li, Yuchen Guo, Jilai Zheng, Haozhe Lin, Chao Ma, Lu Fang, Xiaokang Yang:
SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer. 4851-4860
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLW24
Bo Liu, Zexin Lu, Yan Wang:
Towards Medical Vision-Language Contrastive Pre-training via Study-Oriented Semantic Exploration. 4861-4870
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWWQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWWQ24
Zihao Liu, Xiaoyu Wu, Shengjin Wang, Jiayao Qian:
Adaptively Building a Video-language Model for Video Captioning and Retrieval without Massive Video Pretraining. 4871-4880
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoLPZQD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoLPZQD24
Wenhao Guo, Peng Lu, Xujun Peng, Zhaoran Zhao, Ji Qiu, Xiangtao Dong:
BCSCN: Reducing Domain Gap through Bézier Curve basis-based Sparse Coding Network for Single-Image Super-Resolution. 4881-4889
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TuZGCTZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TuZGCTZ024
Yi Tu, Chong Zhang, Ya Guo, Huan Chen, Jinyang Tang, Huijia Zhu, Qi Zhang:
UNER: A Unified Prediction Head for Named Entity Recognition in Visually-rich Documents. 4890-4898
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LingSWHW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LingSWHW24
Tao Ling, Siping Shi, Hao Wang, Chuang Hu, Dan Wang:
Federated Morozov Regularization for Shortcut Learning in Privacy Preserving Learning with Watermarked Image Data. 4899-4908
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Liu0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Liu0L24
Jinfu Liu, Chen Chen, Mengyuan Liu:
Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition. 4909-4918
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DuHZJM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DuHZJM24
Zewen Du, Zhenjiang Hu, Guiyu Zhao, Ying Jin, Hongbin Ma:
LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention. 4919-4927
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuGWZYL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuGWZYL024
Shichen Lu, Longteng Guo, Wenxuan Wang, Zijia Zhao, Tongtian Yue, Jing Liu, Si Liu:
Collaborative Training of Tiny-Large Vision Language Models. 4928-4937
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouC24
Xudong Zhou, Tianxiang Chen:
BSBP-RWKV: Background Suppression with Boundary Preservation for Efficient Medical Image Segmentation. 4938-4946
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangMCPGH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangMCPGH24
Yuxing Zhang, Siyuan Meng, Chunchun Chen, Mengyao Peng, Hongyan Gu, Xinli Huang:
LinkThief: Combining Generalized Structure Knowledge with Node Similarity for Link Stealing Attack against GNN. 4947-4956
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShenLS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShenLS24
Yeqing Shen, Shang Li, Kun Song:
Restoring Real-World Degraded Events Improves Deblurring Quality. 4957-4966
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiangZWZLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiangZWZLW24
Xiao Liang, Yanlei Zhang, Di Wang, Haodi Zhong, Ronghan Li, Quan Wang:
Divide and Conquer: Isolating Normal-Abnormal Attributes in Knowledge Graph-Enhanced Radiology Report Generation. 4967-4975
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLLZJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLLZJ24
Zhen Wang, Dongyuan Li, Guang Li, Ziqing Zhang, Renhe Jiang:
Multimodal Low-light Image Enhancement with Depth Information. 4976-4985
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZXP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZXP24
Zishuo Wang, Wenhao Zhou, Jinglin Xu, Yuxin Peng:
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection. 4986-4994
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanTWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanTWL24
Xu Han, Yuan Tang, Zhaoxuan Wang, Xianzhi Li:
Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model. 4995-5004
- view
  authority control:
- export record
  dblp key:
  - conf/mm/RenX0WTS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/RenX0WTS24
Wenqi Ren, Ruihao Xia, Meng Zheng, Ziyan Wu, Yang Tang, Nicu Sebe:
Cross-Class Domain Adaptive Semantic Segmentation with Visual Language Models. 5005-5014
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinZQLXYY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinZQLXYY24
Xuefeng Yin, Chenyang Zhu, Shanglai Qu, Yuqi Li, Kai Xu, Baocai Yin, Xin Yang:
CSO: Constraint-Guided Space Optimization for Active Scene Mapping. 5015-5024
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunXWX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunXWX24
Luoyi Sun, Xuenan Xu, Mengyue Wu, Weidi Xie:
Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning. 5025-5034
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWLZ24
Xinyue Liu, Jianyuan Wang, Biao Leng, Shuo Zhang:
Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection. 5035-5044
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Ma0YLH00Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Ma0YLH00Z24
Huimin Ma, Siwei Wang, Shengju Yu, Suyuan Liu, Junjie Huang, Huijun Wu, Xinwang Liu, En Zhu:
Automatic and Aligned Anchor Learning Strategy for Multi-View Clustering. 5045-5054
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunHFWLG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunHFWLG24
Shengyang Sun, Jiashen Hua, Junyi Feng, Dongxu Wei, Baisheng Lai, Xiaojin Gong:
TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection. 5055-5064
- view
  authority control:
- export record
  dblp key:
  - conf/mm/00040J24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/00040J24
Yang Xin, Yu Zhou, Jianmin Jiang:
RobustFace: Adaptive Mining of Noise and Hard Samples for Robust Face Recognitions. 5065-5073
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Ma0F024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Ma0F024
Xiang Ma, Xuemei Li, Lexin Fang, Caiming Zhang:
Bridging the Modality Gap: Dimension Information Alignment and Sparse Spatial Constraint for Image-Text Matching. 5074-5082
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Peng0CLD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Peng0CLD024
Chunli Peng, Xuan Dong, Tiantian Cao, Zhengqing Li, Kun Dong, Weixin Li:
ReWiTe: Realistic Wide-angle and Telephoto Dual Camera Fusion Dataset via Beam Splitter Camera Rig. 5083-5091
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FangR00M24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FangR00M24
Yang Fang, Xuefeng Rao, Xinbo Gao, Weisheng Li, Zijian Min:
MTSNet: Joint Feature Adaptation and Enhancement for Text-Guided Multi-view Martian Terrain Segmentation. 5092-5101
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Jiang0XX00W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Jiang0XX00W24
Le Jiang, Yan Huang, Lianxin Xie, Wen Xue, Cheng Liu, Si Wu, Hau-San Wong:
Hunting Blemishes: Language-guided High-fidelity Face Retouching Transformer with Limited Paired Data. 5102-5111
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoBHGLC0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoBHGLC0024
Yijia Guo, Yuanxi Bai, Liwen Hu, Ziyi Guo, Mianzhi Liu, Yu Cai, Tiejun Huang, Lei Ma:
PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting. 5112-5120
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiangTY0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiangTY0L24
Mingcan Xiang, Jiaxun Tang, Qizheng Yang, Hui Guan, Tongping Liu:
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model. 5121-5130
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangL0L024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangL0L024
Xinwei Zhang, Aishan Liu, Tianyuan Zhang, Siyuan Liang, Xianglong Liu:
Towards Robust Physical-world Backdoor Attacks on Lane Detection. 5131-5140
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangWLFZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangWLFZL24
Longtao Jiang, Min Wang, Zecheng Li, Yao Fang, Wengang Zhou, Houqiang Li:
SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval. 5141-5150
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoLHHZCLJZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoLHHZCLJZZ24
Pinxue Guo, Wanyun Li, Hao Huang, Lingyi Hong, Xinyu Zhou, Zhaoyu Chen, Jinglun Li, Kaixun Jiang, Wei Zhang, Wenqiang Zhang:
X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation. 5151-5160
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangD0QYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangD0QYL24
Ling Huang, Wenqian Dong, Song Xiao, Jiahui Qu, Yuanbo Yang, Yunsong Li:
Language-Guided Visual Prompt Compensation for Multi-Modal Remote Sensing Image Classification with Modality Absence. 5161-5170
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinWLLHXJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinWLLHXJ24
Zening Lin, Jiapeng Wang, Teng Li, Wenhui Liao, Dayi Huang, Longfei Xiong, Lianwen Jin:
PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction. 5171-5180
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangQCCLSC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangQCCLSC024
Haojian Huang, Xiaozhen Qiao, Zhuo Chen, Haodong Chen, Bingyu Li, Zhe Sun, Mulin Chen, Xuelong Li:
CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning. 5181-5190
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoDCJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoDCJ24
Shuai Zhao, Yongkun Du, Zhineng Chen, Yu-Gang Jiang:
Decoder Pre-Training with only Text for Scene Text Recognition. 5191-5200
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangD0FYN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangD0FYN24
Naibo Wang, Yuchen Deng, Wenjie Feng, Shichen Fan, Jianwei Yin, See-Kiong Ng:
One-Shot Sequential Federated Learning for Non-IID Data by Enhancing Local Model Diversity. 5201-5210
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangHB024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangHB024
Wendong Huang, Jinwu Hu, Xiuli Bi, Bin Xiao:
Anatomical Prior Guided Spatial Contrastive Learning for Few-Shot Medical Image Segmentation. 5211-5220
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LongH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LongH024
Libo Long, Xiao Hu, Jochen Lang:
Learning to Handle Large Obstructions in Video Frame Interpolation. 5221-5229
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangJZLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangJZLL24
Hefei Huang, Xu Jia, Xinyu Zhang, Shengming Li, Huchuan Lu:
Event-Guided Rolling Shutter Correction with Time-Aware Cross-Attentions. 5230-5239
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangGWP0L0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangGWP0L0W24
Xibiao Wang, Hang Gao, Xindian Wei, Liang Peng, Rui Li, Cheng Liu, Si Wu, Hau-San Wong:
Contrastive Graph Distribution Alignment for Partially View-Aligned Clustering. 5240-5249
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CaiWLW0XG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CaiWLW0XG024
Xudong Cai, Yongcai Wang, Lun Luo, Minhang Wang, Deying Li, Jintao Xu, Weihao Gu, Rui Ai:
PRISM: PRogressive dependency maxImization for Scale-invariant image Matching. 5250-5259
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Du0J24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Du0J24
Yang Du, Yuqi Liu, Qin Jin:
Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrieval. 5260-5269
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuoXSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuoXSL24
Wen Luo, Yu Xia, Tianshu Shen, Sujian Li:
Shapley Value-based Contrastive Alignment for Multimodal Information Extraction. 5270-5279
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Yu0GFWK024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Yu0GFWK024
Hao Yu, Xin Yang, Xin Gao, Yihui Feng, Hao Wang, Yan Kang, Tianrui Li:
Overcoming Spatial-Temporal Catastrophic Forgetting for Federated Class-Incremental Learning. 5280-5288
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLSG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLSG24
Haibo Wang, Chenghang Lai, Yixuan Sun, Weifeng Ge:
Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering. 5289-5298
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangCDF024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangCDF024
Shudong Huang, Hecheng Cai, Hao Dai, Wentao Feng, Jiancheng Lv:
Adaptive Instance-wise Multi-view Clustering. 5299-5307
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuanGAWZLCXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuanGAWZLCXL24
Ze Yuan, Jinyang Guo, Dakai An, Junran Wu, He Zhu, Jianhao Li, Xueyuan Chen, Ke Xu, Jiaheng Liu:
VRDistill: Vote Refinement Distillation for Efficient Indoor 3D Object Detection. 5308-5317
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KimUC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KimUC024
Sunoh Kim, Daeho Um, Hyunjun Choi, Jin Young Choi:
Learnable Negative Proposals Using Dual-Signed Cross-Entropy Loss for Weakly Supervised Video Moment Localization. 5318-5327
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QuDLLCZJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QuDLLCZJ24
Yansong Qu, Shaohui Dai, Xinyang Li, Jianghang Lin, Liujuan Cao, Shengchuan Zhang, Rongrong Ji:
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane. 5328-5337
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YaoDXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YaoDXL24
Huan Yao, Changxing Ding, Xuanda Xu, Zhifeng Lin:
Decoupling Heterogeneous Features for Robust 3D Interacting Hand Poses Estimation. 5338-5346
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuJZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuJZC24
Zhiyu Zhu, Zhibo Jin, Jiayu Zhang, Huaming Chen:
Enhancing Model Interpretability with Local Attribution over Global Exploration. 5347-5355
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YanGLL0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YanGLL0Y24
Ruxue Yan, Wenya Guo, Xubo Liu, Xumeng Liu, Ying Zhang, Xiaojie Yuan:
Tracking-forced Referring Video Object Segmentation. 5356-5364
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0056ZJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0056ZJ24
Xin Zhang, Shenghua Zhong, Jianmin Jiang:
Effective Optimization of Root Selection Towards Improved Explanation of Deep Classifiers. 5365-5373
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShiZWZZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShiZWZZL24
Guangchen Shi, Wei Zhu, Yirui Wu, Danhuai Zhao, Kang Zheng, Tong Lu:
Few-shot Semantic Segmentation via Perceptual Attention and Spatial Control. 5374-5383
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaZZLW0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaZZLW0W24
Zibo Ma, Bo Zhang, Zheng Zhang, Wu Liu, Wufan Wang, Hui Gao, Wendong Wang:
ADDG: An Adaptive Domain Generalization Framework for Cross-Plane MRI Segmentation. 5384-5392
- view
  authority control:
- export record
  dblp key:
  - conf/mm/RuG0ZLWC0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/RuG0ZLWC0024
Lixiang Ru, Xin Guo, Lei Yu, Yingying Zhang, Jiangwei Lao, Jian Wang, Jingdong Chen, Yansheng Li, Ming Yang:
Parameter-Efficient Complementary Expert Learning for Long-Tailed Visual Recognition. 5393-5402
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0004WLXLL0T24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0004WLXLL0T24
Tianyuan Zhang, Lu Wang, Hainan Li, Yisong Xiao, Siyuan Liang, Aishan Liu, Xianglong Liu, Dacheng Tao:
LanEvil: Benchmarking the Robustness of Lane Detection to Environmental Illusions. 5403-5412
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangLLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangLLH24
Xinyue Zhang, Tingjin Luo, Yueying Liu, Chenping Hou:
Imbalanced Multi-instance Multi-label Learning via Coding Ensemble and Adaptive Thresholds. 5413-5422
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenLDLTY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenLDLTY24
Pengxu Chen, Huazhong Liu, Jihong Ding, Jiawen Luo, Peng Tan, Laurence T. Yang:
Holistic-CAM: Ultra-lucid and Sanity Preserving Visual Interpretation in Holistic Stage of CNNs. 5423-5431
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangY024
Yihao Wang, Meng Yang, Rui Cao:
Fine-grained Semantic Alignment with Transferred Person-SAM for Text-based Person Retrieval. 5432-5441
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangL024
Qijie Wang, Guandu Liu, Bin Wang:
CapS-Adapter: Caption-based MultiModal Adapter in Zero-Shot Classification. 5442-5450
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangCYLGONKCDD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangCYLGONKCDD24
Rongyu Zhang, Zefan Cai, Huanrui Yang, Zidong Liu, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Baobao Chang, Yuan Du, Li Du, Shanghang Zhang:
VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness. 5451-5459
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiaoYP0X24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiaoYP0X24
Linhui Xiao, Xiaoshan Yang, Fang Peng, Yaowei Wang, Changsheng Xu:
HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual Grounding. 5460-5469
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Fan0WL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Fan0WL024
Yunfeng Fan, Wenchao Xu, Haozhao Wang, Junhong Liu, Song Guo:
Detached and Interactive Multimodal Learning. 5470-5478
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangLZLZWS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangLZLZWS024
Chenglong Zhang, Xinyan Liang, Peng Zhou, Zhaolong Ling, Yingwei Zhang, Xingyu Wu, Weiguo Sheng, Bingbing Jiang:
Scalable Multi-view Unsupervised Feature Selection with Structure Learning and Fusion. 5479-5488
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangDZQZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangDZQZ24
Chengyi Yang, Mingda Dong, Xiaoyue Zhang, Jiayin Qi, Aimin Zhou:
Introducing Common Null Space of Gradients for Gradient Projection Methods in Continual Learning. 5489-5497
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZareapoorS0L024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZareapoorS0L024
Masoumeh Zareapoor, Pourya Shamsolmoali, Huiyu Zhou, Yue Lu, Salvador García:
Fractional Correspondence Framework in Detection Transformer. 5498-5506
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LimKKC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LimKKC24
Geuntaek Lim, Hyunwoo Kim, Joonsoo Kim, Yukyung Choi:
Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization. 5507-5516
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangM000000Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangM000000Z24
Xihong Yang, Erxue Min, Ke Liang, Yue Liu, Siwei Wang, Sihang Zhou, Huijun Wu, Xinwang Liu, En Zhu:
GraphLearner: Graph Node Clustering with Fully Learnable Augmentation. 5517-5526
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangWZXWZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangWZXWZ24
Hongqiu Wang, Wei Wang, Haipeng Zhou, Huihui Xu, Shaozhi Wu, Lei Zhu:
Language-Driven Interactive Shadow Detection. 5527-5536
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CaiZLGN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CaiZLGN24
Jinyu Cai, Yunhe Zhang, Zhoumin Lu, Wenzhong Guo, See-Kiong Ng:
Towards Effective Federated Graph Anomaly Detection via Self-boosted Knowledge Distillation. 5537-5546
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Huo0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Huo0W24
Chaofan Huo, Ye Shi, Jingya Wang:
Monocular Human-Object Reconstruction in the Wild. 5547-5555
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoSZQ0ZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoSZQ0ZW24
Baoqi Gao, Daoxu Sheng, Lei Zhang, Qi Qi, Bo He, Zirui Zhuang, Jingyu Wang:
STAR-VP: Improving Long-term Viewport Prediction in 360° Videos via Space-aligned and Time-varying Fusion. 5556-5565
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoYZYMD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoYZYMD24
Hu Gao, Jing Yang, Ying Zhang, Jingfan Yang, Bowen Ma, Depeng Dang:
Learning Optimal Combination Patterns for Lightweight Stereo Image Super-Resolution. 5566-5574
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangHLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangHLY24
Yifan Wang, Wuliang Huang, Lei Li, Chun Yuan:
Semantic Distillation from Neighborhood for Composed Image Retrieval. 5575-5583
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeXQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeXQL24
Zhentao He, Changqun Xia, Shengye Qiao, Jia Li:
Text-prompt Camouflaged Instance Segmentation with Graduated Camouflage Learning. 5584-5593
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangLS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangLS24
Zuyu Zhang, Yan Li, Byung-Seok Shin:
Embracing Domain Gradient Conflicts: Domain Generalization Using Domain Gradient Equilibrium. 5594-5603
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhe0LLHT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhe0LLHT24
Ting Zhe, Jing Zhang, Yongqian Li, Yong Luo, Han Hu, Dacheng Tao:
Multi-Granularity Hand Action Detection. 5604-5613
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaoLQD0ZDBZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaoLQD0ZDBZ24
Xingyuan Mao, Yuwen Liu, Lianyong Qi, Li Duan, Xiaolong Xu, Xuyun Zhang, Wanchun Dou, Amin Beheshti, Xiaokang Zhou:
Cluster-driven Personalized Federated Recommendation with Interest-aware Graph Convolution Network for Multimedia. 5614-5622
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0016LLRDP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0016LLRDP24
Yuan Sun, Kaiming Liu, Yongxiang Li, Zhenwen Ren, Jian Dai, Dezhong Peng:
Distribution Consistency Guided Hashing for Cross-Modal Retrieval. 5623-5632
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Dai0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Dai0024
Luanyuan Dai, Xiaoyu Du, Jinhui Tang:
TrGa: Reconsidering the Application of Graph Neural Networks in Two-View Correspondence Pruning. 5633-5642
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangTYZXHZN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangTYZXHZN24
Han Jiang, Haoyu Tang, Ming Yan, Ji Zhang, Mingzhu Xu, Yupeng Hu, Jihua Zhu, Liqiang Nie:
Revisiting Unsupervised Temporal Action Localization: The Primacy of High-Quality Actionness and Pseudolabels. 5643-5652
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiaoZYTLHWZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiaoZYTLHWZ24
Yu Liao, Xinfeng Zhang, Rui Yang, Jianwei Tao, Bai Liu, Zhipeng Hu, Shuang Wang, Zeng Zhao:
Selection and Reconstruction of Key Locals: A Novel Specific Domain Image-Text Retrieval Method. 5653-5662
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangY24
Wei Yang, Qingchen Yang:
Multimodal-aware Multi-intention Learning for Recommendation. 5663-5672
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiZLXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiZLXL24
Liupeng Li, Yuhua Zheng, Shupeng Liu, Xiaoyin Xu, Taihao Li:
Domain Knowledge Enhanced Vision-Language Pretrained Model for Dynamic Facial Expression Recognition. 5673-5682
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZWSZYHLGAX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZWSZYHLGAX24
Yuting Zhang, Zhao Zhang, Yiqing Wu, Ying Sun, Fuzhen Zhuang, Wenhui Yu, Lantao Hu, Han Li, Kun Gai, Zhulin An, Yongjun Xu:
Tag Tree-Guided Multi-grained Alignment for Multi-Domain Short Video Recommendation. 5683-5691
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShaoWHHCJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShaoWHHCJ24
Kai Shao, Rui Wang, Yixue Hao, Long Hu, Min Chen, Hans-Arno Jacobsen:
Multimodal Physiological Signals Representation Learning via Multiscale Contrasting for Depression Recognition. 5692-5701
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiYZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiYZ024
Xinyu Li, Wenqing Ye, Yueyi Zhang, Xiaoyan Sun:
GRACE: GRadient-based Active Learning with Curriculum Enhancement for Multimodal Sentiment Analysis. 5702-5711
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PanJJL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PanJJL24
Yuchen Pan, Junjun Jiang, Kui Jiang, Xianming Liu:
Disentangled-Multimodal Privileged Knowledge Distillation for Depression Recognition with Incomplete Multimodal Data. 5712-5721
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuHLZCC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuHLZCC24
Yuanyuan Liu, Yuxuan Huang, Shuyang Liu, Yibing Zhan, Zijing Chen, Zhe Chen:
Open-Set Video-based Facial Expression Recognition with Human Expression-sensitive Prompting. 5722-5731
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuHWY0R24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuHWY0R24
Aoqiang Zhu, Min Hu, Xiaohua Wang, Jiaoyun Yang, Yiming Tang, Fuji Ren:
KEBR: Knowledge Enhanced Self-Supervised Balanced Representation for Multimodal Sentiment Analysis. 5732-5741
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangGGYLHL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangGGYLHL024
Zining Wang, Jinyang Guo, Ruihao Gong, Yang Yong, Aishan Liu, Yushi Huang, Jiaheng Liu, Xianglong Liu:
PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and Models. 5742-5751
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangQ0P0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangQ0P0024
Longan Wang, Yang Qin, Yuan Sun, Dezhong Peng, Xi Peng, Peng Hu:
Robust Contrastive Cross-modal Hashing with Noisy Labels. 5752-5760
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengZ0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengZ0W24
Xiying Zheng, Yukang Zhang, Yang Lu, Hanzi Wang:
Semi-supervised Visible-Infrared Person Re-identification via Modality Unification and Confidence Guidance. 5761-5770
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouWLZB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouWLZB24
Ziyang Zhou, Pinghui Wang, Zi Liang, Ruofei Zhang, Haitao Bai:
PAIR: Pre-denosing Augmented Image Retrieval Model for Defending Adversarial Patches. 5771-5779
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuYZM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuYZM24
Daiqing Wu, Dongbao Yang, Yu Zhou, Can Ma:
Robust Multimodal Sentiment Analysis of Image-Text Pairs by Distribution-Based Feature Recovery and Fusion. 5780-5789
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuZLPZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuZLPZ24
Kunlun Xu, Haozhuo Zhang, Yu Li, Yuxin Peng, Jiahuan Zhou:
Mitigate Catastrophic Remembering via Continual Knowledge Purification for Noisy Lifelong Person Re-Identification. 5790-5799
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShenYH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShenYH24
Wei Shen, Mang Ye, Wenke Huang:
Resisting Over-Smoothing in Graph Neural Networks via Dual-Dimensional Decoupling. 5800-5809
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FangWLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FangWLL24
Junlin Fang, Wenya Wang, Guosheng Lin, Fengmao Lv:
Sentiment-oriented Sarcasm Integration for Video Sentiment Analysis Enhancement with Sarcasm Assistance. 5810-5819
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangMSYX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangMSYX24
Fanfan Wang, Heqing Ma, Xiangqing Shen, Jianfei Yu, Rui Xia:
Observe before Generate: Emotion-Cause aware Video Caption for Multimodal Emotion Cause Generation in Conversations. 5820-5828
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0121CSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0121CSZ24
Yang Yang, Liyuan Cao, Haoyu Shi, Huaiwen Zhang:
Multi-Instance Multi-Label Learning for Text-motion Retrieval. 5829-5837
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Su0L0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Su0L0024
Hongzu Su, Jingjing Li, Fengling Li, Ke Lu, Lei Zhu:
SOIL: Contrastive Second-Order Interest Learning for Multimodal Recommendation. 5838-5846
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QiHZZTTMGC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QiHZZTTMGC24
Jiansong Qi, Yaping Huang, Ying Zhang, Sihui Zhang, Mei Tian, Yi Tian, Fanchao Meng, Lin Guan, Tianyi Chang:
Visual Question Answering Driven Eye Tracking Paradigm for Identifying Children with Autism Spectrum Disorder. 5847-5855
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeZWG0WM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeZWG0WM24
Dongxiao He, Jinghan Zhang, Xiaobao Wang, Meng Ge, Zhiyong Feng, Longbiao Wang, Xiaoke Ma:
TUT4CRS: Time-aware User-preference Tracking for Conversational Recommendation System. 5856-5864
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangLGLYHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangLGLYHL24
Guoqing Yang, Zhiming Luo, Jianzhe Gao, Yingxin Lai, Kun Yang, Yifan He, Shaozi Li:
A Multilevel Guidance-Exploration Network and Behavior-Scene Matching Method for Human Behavior Anomaly Detection. 5865-5873
- view
  authority control:
- export record
  dblp key:
  - conf/mm/AiLQ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/AiLQ024
Zekun Ai, Xiaotong Luo, Yanyun Qu, Yuan Xie:
SkipVSR: Adaptive Patch Routing for Video Super-Resolution with Inter-Frame Mask. 5874-5882
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangP0YP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangP0YP24
Qianxin Huang, Siyao Peng, Xiaobo Shen, Yunhao Yuan, Shirui Pan:
Similarity Preserving Transformer Cross-Modal Hashing for Video-Text Retrieval. 5883-5891
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang0Y024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang0Y024
Wenxiao Zhang, Hossein Rahmani, Xun Yang, Jun Liu:
Reverse2Complete: Unpaired Multimodal Point Cloud Completion via Guided Diffusion. 5892-5901
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunHW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunHW24
Yitong Sun, Yao Huang, Xingxing Wei:
Embodied Laser Attack: Leveraging Scene Priors to Achieve Agent-based Robust Non-contact Attacks. 5902-5910
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangS0YDCLLS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangS0YDCLLS24
Yipo Huang, Xiangfei Sheng, Zhichao Yang, Quan Yuan, Zhichao Duan, Pengfei Chen, Leida Li, Weisi Lin, Guangming Shi:
AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception. 5911-5920
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QiuLPGZD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QiuLPGZD24
Ji Qiu, Peng Lu, Xujun Peng, Wenhao Guo, Zhaoran Zhao, Xiangtao Dong:
Learning Realistic Sketching: A Dual-agent Reinforcement Learning Approach. 5921-5929
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001YCYZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001YCYZ24
Xiaobo Shen, Gaoyao Yu, Yinfan Chen, Xichen Yang, Yuhui Zheng:
Graph Convolutional Semi-Supervised Cross-Modal Hashing. 5930-5938
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0002G0NK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0002G0NK24
Harry Cheng, Yangyang Guo, Tianyi Wang, Liqiang Nie, Mohan S. Kankanhalli:
Diffusion Facial Forgery Detection. 5939-5948
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLG24
Hengxing Liu, Mingjia Li, Xiaojie Guo:
Regional Attention For Shadow Removal. 5949-5957
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FangZSZWC0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FangZSZWC0L24
Hao Fang, Haoyuan Zhao, Jianxin Shi, Miao Zhang, Guanzhen Wu, Yi Ching Chou, Feng Wang, Jiangchuan Liu:
Robust Live Streaming over LEO Satellite Constellations: Measurement, Analysis, and Handover-Aware Adaptation. 5958-5966
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZangWZHQLSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZangWZHQLSZ24
Qi Zang, Shuang Wang, Dong Zhao, Yang Hu, Dou Quan, Jinlong Li, Nicu Sebe, Zhun Zhong:
Generalized Source-Free Domain-adaptive Segmentation via Reliable Knowledge Propagation. 5967-5976
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PeiTTZXWLX00S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PeiTTZXWLX00S24
Yunqiang Pei, Jialei Tang, Qihang Tang, Mingfeng Zha, Dongyu Xie, Guoqing Wang, Zhitao Liu, Ning Xie, Peng Wang, Yang Yang, Hengtao Shen:
Emotion Recognition in HMDs: A Multi-task Approach Using Physiological Signals and Occluded Faces. 5977-5986
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PanYKWX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PanYKWX24
Xiaochao Pan, Jiawei Yao, Hongrui Kou, Tong Wu, Canran Xiao:
HarmonicNeRF: Geometry-Informed Synthetic View Augmentation for 3D Scene Reconstruction in Driving Scenarios. 5987-5996
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiD024
Guangyao Li, Henghui Du, Di Hu:
Boosting Audio Visual Question Answering via Key Semantic-Aware Cues. 5997-6005
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Qin0CXX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Qin0CXX24
Jiongming Qin, Fei Luo, Tuo Cao, Wenju Xu, Chunxia Xiao:
HS-Surf: A Novel High-Frequency Surface Shell Radiance Field to Improve Large-Scale Scene Rendering. 6006-6014
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0010JJL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0010JJL24
Gang Wu, Junjun Jiang, Kui Jiang, Xianming Liu:
Harmony in Diversity: Improving All-in-One Image Restoration via Multi-Task Collaboration. 6015-6023
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuHLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuHLW24
Meichen Liu, Shuting He, Songnan Lin, Bihan Wen:
Dual-head Genre-instance Transformer Network for Arbitrary Style Transfer. 6024-6032
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouZ00MZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouZ00MZ24
Yingjie Zhou, Zicheng Zhang, Wei Sun, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai:
Subjective and Objective Quality-of-Experience Assessment for 3D Talking Heads. 6033-6042
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouZH24
Zhi Zhou, Junke Zhu, Zhangjin Huang:
Gaussian Splatting with Neural Basis Extension. 6043-6052
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangCZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangCZ0024
Zhenyu Zhang, Guangyao Chen, Yixiong Zou, Yuhua Li, Ruixuan Li:
Learning Unknowns from Unknowns: Diversified Negative Prototypes Generator for Few-shot Open-Set Recognition. 6053-6062
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangDZCZZF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangDZCZZF24
Jinxiao Zhang, Runmin Dong, Juepeng Zheng, Mengxuan Chen, Lixian Zhang, Yi Zhao, Haohuan Fu:
Spatial-Temporal Context Model for Remote Sensing Imagery Compression. 6063-6072
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XieYML24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XieYML24
Weiying Xie, Mei Yuan, Jitao Ma, Yunsong Li:
Adaptive Pruning of Channel Spatial Dependability in Convolutional Neural Networks. 6073-6082
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Fang0THL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Fang0THL24
Heng Fang, Sheng Huang, Wenhao Tang, Luwen Huangfu, Bo Liu:
SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification. 6083-6092
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShenY0WC0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShenY0WC0L24
Wenhao Shen, Wanqi Yin, Hao Wang, Chen Wei, Zhongang Cai, Lei Yang, Guosheng Lin:
HMR-Adapter: A Lightweight Adapter with Dual-Path Cross Augmentation for Expressive Human Mesh Recovery. 6093-6102
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SirejidingBLYAL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SirejidingBLYAL24
Shalayiding Sirejiding, Bayram Bayramli, Yuxiang Lu, Yuwen Yang, Tamam Alsarhan, Hongtao Lu, Yue Ding:
Task-Interaction-Free Multi-Task Learning with Efficient Hierarchical Feature Representation. 6103-6112
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiaoSZYCWG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiaoSZYCWG24
Yiyong Xiao, Kai Shu, Haoyi Zhang, Baohua Yin, Wai Seng Cheang, Haoyang Wang, Jiechao Gao:
EGGesture: Entropy-Guided Vector Quantized Variational AutoEncoder for Co-Speech Gesture Generation. 6113-6122
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunLT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunLT024
Yuqi Sun, Qing Lin, Weimin Tan, Bo Yan:
Audio-Driven Identity Manipulation for Face Inpainting. 6123-6132
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaXWFS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaXWFS024
Leilei Ma, Hongxing Xie, Lei Wang, Yanping Fu, Dengdi Sun, Haifeng Zhao:
Text-Region Matching for Multi-Label Image Recognition with Missing Labels. 6133-6142
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinLHZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinLHZZ24
Zhengwei Yin, Guixu Lin, Mengshun Hu, Hao Zhang, Yinqiang Zheng:
FlexIR: Towards Flexible and Manipulable Image Restoration. 6143-6152
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Alimohammadzadeh24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Alimohammadzadeh24
Hamed Alimohammadzadeh, Shahram Ghandeharizadeh:
Swarical: An Integrated Hierarchical Approach to Localizing Flying Light Specks. 6153-6161
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CaiTL0QDT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CaiTL0QDT024
Xiaowen Cai, Yunbo Tao, Daizong Liu, Pan Zhou, Xiaoye Qu, Jianfeng Dong, Keke Tang, Lichao Sun:
Frequency-Aware GAN for Imperceptible Transfer Attack on 3D Point Clouds. 6162-6171
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangLO0TL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangLO0TL24
Mingjin Zhang, Shilong Liu, Yuanjun Ouyang, Jie Guo, Zhihong Tang, Yunsong Li:
Explore Hybrid Modeling for Moving Infrared Small Target Detection. 6172-6181
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QuanT00J24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QuanT00J24
Yuhui Quan, Xiaoheng Tan, Yan Huang, Yong Xu, Hui Ji:
Enhancing Underwater Images via Asymmetric Multi-Scale Invertible Networks. 6182-6191
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhanYGG0Q24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhanYGG0Q24
Lishuang Zhan, Enting Ying, Jiabao Gan, Shihui Guo, Boyu Gao, Yipeng Qin:
SATPose: Improving Monocular 3D Pose Estimation with Spatial-aware Ground Tactility. 6192-6201
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhanLX0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhanLX0024
Hongjian Zhan, Yangfu Li, Yu-Jie Xiong, Umapada Pal, Yue Lu:
Free Lunch: Frame-level Contrastive Learning with Text Perceiver for Robust Scene Text Recognition in Lightweight Models. 6202-6211
- view
  authority control:
- export record
  dblp key:
  - conf/mm/01190MCCJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/01190MCCJ24
Xin Wang, Kai Chen, Xingjun Ma, Zhineng Chen, Jingjing Chen, Yu-Gang Jiang:
AdvQDet: Detecting Query-Based Adversarial Attacks with Adversarial Contrastive Prompt Tuning. 6212-6221
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LvHY0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LvHY0024
Xudong Lv, Zhiwei He, Yuxiang Yang, Jiahao Nie, Jing Zhang:
SAR-SLAM: Self-Attentive Rendering-based SLAM with Neural Point Cloud Encoding. 6222-6231
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangHYZLLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangHYZLLZ24
Shao-Kui Zhang, Junkai Huang, Liang Yue, Jia-Tong Zhang, Jia-Hong Liu, Yu-Kun Lai, Song-Hai Zhang:
SceneExpander: Real-Time Scene Synthesis for Interactive Floor Plan Editing. 6232-6240
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TianZLWWWHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TianZLWWWHL24
Long Tian, Hongyi Zhao, Ruiying Lu, Rongrong Wang, Yujie Wu, Liming Wang, Xiongpeng He, Xiyang Liu:
FOCT: Few-shot Industrial Anomaly Detection with Foreground-aware Online Conditional Transport. 6241-6249
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuCSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuCSZ24
Chuang Liu, Yichao Cao, Xiu Su, Haogang Zhu:
Universal Frequency Domain Perturbation for Single-Source Domain Generalization. 6250-6259
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TangCJZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TangCJZH24
Yushun Tang, Shuoshuo Chen, Jiyuan Jia, Yi Zhang, Zhihai He:
Domain-Conditioned Transformer for Fully Test-time Adaptation. 6260-6269
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangXPW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangXPW24
Zhiru Wang, Shiyun Xie, Chengwei Pan, Guoping Wang:
SpecGaussian with Latent Features: A High-quality Modeling of the View-dependent Appearance for 3D Gaussian Splatting. 6270-6278
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanZZL00S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanZZL00S24
Wencheng Han, Chen Zhang, Yang Zhou, Wentao Liu, Chen Qian, Chengzhong Xu, Jianbing Shen:
Prior Metadata-Driven RAW Reconstruction: Eliminating the Need for Per-Image Metadata. 6279-6287
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuoLGNG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuoLGNG24
Fulin Luo, Yi Liu, Xiuwen Gong, Zhixiong Nan, Tan Guo:
EMVCC: Enhanced Multi-View Contrastive Clustering for Hyperspectral Images. 6288-6296
- view
  authority control:
- export record
  dblp key:
  - conf/mm/NieNZZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/NieNZZZ24
Fan Nie, Jiangqun Ni, Jian Zhang, Bin Zhang, Weizhe Zhang:
FRADE: Forgery-aware Audio-distilled Multimodal Learning for Deepfake Detection. 6297-6306
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhongHYZSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhongHYZSL24
Siru Zhong, Xixuan Hao, Yibo Yan, Ying Zhang, Yangqiu Song, Yuxuan Liang:
UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation. 6307-6315
- view
  authority control:
- export record
  dblp key:
  - conf/mm/NiuYXLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/NiuYXLC24
Yuzhen Niu, Lifen Yang, Rui Xu, Yuezhou Li, Yuzhong Chen:
MiNet: Weakly-Supervised Camouflaged Object Detection through Mutual Interaction between Region and Edge Cues. 6316-6325
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangP0W024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangP0W024
Delong Zhang, Yi-Xing Peng, Xiao-Ming Wu, Ancong Wu, Weishi Zheng:
PixelFade: Privacy-preserving Person Re-identification with Noise-guided Progressive Replacement. 6326-6334
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeLXCSKL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeLXCSKL24
Wei He, Xiang Li, Shengtian Xu, Yuzheng Chen, Chan-In Sio, Ge Lin Kan, Lik-Hang Lee:
MetaDragonBoat: Exploring Paddling Techniques of Virtual Dragon Boating in a Metaverse Campus. 6335-6344
- view
  authority control:
- export record
  dblp key:
  - conf/mm/000700GL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/000700GL24
Yuxuan Lu, Jiahao Nie, Zhiwei He, Hongjie Gu, Xudong Lv:
VoxelTrack: Exploring Multi-level Voxel Representation for 3D Point Cloud Object Tracking. 6345-6354
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuFJLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuFJLC24
Yu Liu, Longhan Feng, Qi Jia, Zezheng Liu, Zi-Huang Cao:
Two Teachers Are Better Than One: Semi-supervised Elliptical Object Detection by Dual-Teacher Collaborative Guidance. 6355-6363
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuoY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuoY024
Yao Luo, Ming Yang, Jinhui Tang:
Dual-view Pyramid Network for Video Frame Interpolation. 6364-6373
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinTTMWWW0YLYGZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinTTMWWW0YLYGZ24
Junxiong Lin, Zen Tao, Xuan Tong, Xinji Mai, Haoran Wang, Boyang Wang, Yan Wang, Qing Zhao, Jiawen Yu, Yuxuan Lin, Shaoqi Yan, Shuyong Gao, Wenqiang Zhang:
Suppressing Uncertainties in Degradation Estimation for Blind Super-Resolution. 6374-6383
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangWXY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangWXY024
Wenxiao Zhang, Ziqi Wang, Li Xu, Xun Yang, Jun Liu:
Informative Point cloud Dataset Extraction for Classification via Gradient-based Points Moving. 6384-6393
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuZZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuZZZ24
Jia-Hong Liu, Shao-Kui Zhang, Chuyue Zhang, Song-Hai Zhang:
Controllable Procedural Generation of Landscapes. 6394-6403
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiaoZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiaoZW24
Fangjian Liao, Xingxing Zou, Waikeung Wong:
Uni-DlLoRA: Style Fine-Tuning for Fashion Image Translation. 6404-6413
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZZX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZZX24
Yusen Wang, Kaixuan Zhou, Wenxiao Zhang, Chunxia Xiao:
MegaSurf: Scalable Large Scene Neural Surface Reconstruction. 6414-6423
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QiuRSZYZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QiuRSZYZ24
Zherui Qiu, Chenqu Ren, Kaiwen Song, Xiaoyi Zeng, Leyuan Yang, Juyong Zhang:
Deformable NeRF using Recursively Subdivided Tetrahedra. 6424-6432
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MamtaSKE24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MamtaSKE24
Mamta, Gopendra Vikram Singh, Deepak Raju Kori, Asif Ekbal:
Aspect-Based Multimodal Mining: Unveiling Sentiments, Complaints, and Beyond in User-Generated Content. 6433-6442
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuPZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuPZ24
Zichen Liu, Yuxin Peng, Jiahuan Zhou:
InsVP: Efficient Instance Visual Prompting from Image Itself. 6443-6452
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0YZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0YZ024
Zidu Wang, Xiangyu Zhu, Jiang Yu, Tianshuo Zhang, Zhen Lei:
S2TD-Face: Reconstruct a Detailed 3D Face with Controllable Texture from a Single Sketch. 6453-6462
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Kosugi24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Kosugi24
Satoshi Kosugi:
Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement. 6463-6471
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001WL0SS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001WL0SS24
Xun Jiang, Zhuoyuan Wei, Shenshen Li, Xing Xu, Jingkuan Song, Heng Tao Shen:
Counterfactually Augmented Event Matching for De-biased Temporal Sentence Grounding. 6472-6481
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenLLFPLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenLLFPLZ24
Bingzhi Chen, Ruihan Liu, Yishu Liu, Xiaozhao Fang, Jiahui Pan, Guangming Lu, Zheng Zhang:
Stay Focused is All You Need for Adversarial Robustness. 6482-6491
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZengLKLGYMZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZengLKLGYMZ24
Zhi Zeng, Minnan Luo, Xiangzheng Kong, Huan Liu, Hao Guo, Hao Yang, Zihan Ma, Xiang Zhao:
Mitigating World Biases: A Multimodal Multi-View Debiasing Framework for Fake News Video Detection. 6492-6500
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuGSLYY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuGSLYY24
Zibin Liu, Banglei Guan, Yang Shang, Shunkun Liang, Zhenbao Yu, Qifeng Yu:
Optical Flow-Guided 6DoF Object Pose Tracking with an Event Camera. 6501-6509
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuCL24
Junran Wu, Xueyuan Chen, Shangzhe Li:
Uncovering Capabilities of Model Pruning in Graph Contrastive Learning. 6510-6519
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WeiCTZQXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WeiCTZQXL24
Zheng Wei, Yuzheng Chen, Wai Tong, Xuan Zong, Huamin Qu, Xian Xu, Lik-Hang Lee:
Hearing the Moment with MetaEcho! From Physical to Virtual in Synchronized Sound Recording. 6520-6529
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangYMW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangYMW24
Cong Wang, Chengjin Yu, Jie Mu, Wei Wang:
PercepLIE: A New Path to Perceptual Low-Light Image Enhancement. 6530-6539
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Cheng0WL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Cheng0WL024
Xin Cheng, Hao Wang, Jinwei Wang, Xiangyang Luo, Bin Ma:
Advancing Quantization Steps Estimation: A Two-Stream Network Approach for Enhancing Robustness. 6540-6548
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangLS0L024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangLS0L024
Mingjin Zhang, Longyi Li, Wenxuan Shi, Jie Guo, Yunsong Li, Xinbo Gao:
VmambaSCI: Dynamic Deep Unfolding Network with Mamba for Compressive Spectral Imaging. 6549-6558
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengAL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengAL24
Rui-Chen Zheng, Yang Ai, Zhen-Hua Ling:
Speech Reconstruction from Silent Lip and Tongue Articulation by Diffusion Models and Text-Guided Pseudo Target Generation. 6559-6568
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoTWW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoTWW24
Junyuan Guo, Hao Tang, Teng Wang, Chao Wang:
R4D-planes: Remapping Planes For Novel View Synthesis and Self-Supervised Decoupling of Monocular Videos. 6569-6577
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenFJH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenFJH024
Wu Chen, Hehe Fan, Qiuping Jiang, Chao Huang, Yi Yang:
Progressive Point Cloud Denoising with Cross-Stage Cross-Coder Adaptive Edge Graph Convolution Network. 6578-6587
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunYLKYYZLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunYLKYYZLZ24
Mingyang Sun, Qipeng Yan, Zhuoer Liang, Dongliang Kou, Dingkang Yang, Ruisheng Yuan, Xiao Zhao, Mingcheng Li, Lihua Zhang:
IF-Garments: Reconstructing Your Intersection-Free Multi-Layered Garments from Monocular Videos. 6588-6597
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DongWL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DongWL024
Bo Dong, Pichao Wang, Hao Luo, Fan Wang:
Adaptive Query Selection for Camouflaged Instance Segmentation. 6598-6606
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaoSZQZXZD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaoSZQZXZD24
Yuxin Mao, Xuyang Shen, Jing Zhang, Zhen Qin, Jinxing Zhou, Mochu Xiang, Yiran Zhong, Yuchao Dai:
TAVGBench: Benchmarking Text to Audible-Video Generation. 6607-6616
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TangHLYHHC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TangHLYHHC24
Yuan Tang, Xu Han, Xianzhi Li, Qiao Yu, Yixue Hao, Long Hu, Min Chen:
MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors. 6617-6626
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuoXLFZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuoXLFZZ24
Guan Luo, Tian-Xing Xu, Ying-Tian Liu, Xiaoxiong Fan, Fang-Lue Zhang, Song-Hai Zhang:
3D Gaussian Editing with A Single Image. 6627-6636
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Sun0TDMLG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Sun0TDMLG24
Zhenhong Sun, Junyan Wang, Zhiyu Tan, Daoyi Dong, Hailan Ma, Hao Li, Dong Gong:
EGGen: Image Generation with Multi-entity Prior Learning through Entity Guidance. 6637-6645
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KuangLHHZZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KuangLHHZZ0024
Zhengzhong Kuang, Jianan Lu, Chenhui Hong, Haobin Huang, Suguo Zhu, Xiaowei Zhao, Jun Yu, Jianping Fan:
Latent Representation Reorganization for Face Privacy Protection. 6646-6655
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XieLLL0Z024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XieLLL0Z024
Wulin Xie, Xiaohuan Lu, Yadong Liu, Jiang Long, Bob Zhang, Shuping Zhao, Jie Wen:
Uncertainty-Aware Pseudo-Labeling and Dual Graph Driven Network for Incomplete Multi-View Multi-Label Classification. 6656-6665
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangS0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangS0024
Mingzhao Yang, Shangchao Su, Bin Li, Xiangyang Xue:
FedDEO: Description-Enhanced One-Shot Federated Learning with Diffusion Models. 6666-6675
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xia0LYW00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xia0LYW00024
Ruiyang Xia, Dawei Zhou, Decheng Liu, Lin Yuan, Shuodi Wang, Jie Li, Nannan Wang, Xinbo Gao:
Advancing Generalized Deepfake Detector with Forgery Perception Guidance. 6676-6685
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HouGL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HouGL024
Hongye Hou, Xuehao Gao, Zhan Liu, Yang Yang:
Dig into Detailed Structures: Key Context Encoding and Semantic-based Decoding for Point Cloud Completion. 6686-6695
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuC0DC0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuC0DC0024
Tao Liu, Feilong Chen, Shuai Fan, Chenpeng Du, Qi Chen, Xie Chen, Kai Yu:
AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding. 6696-6705
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenLD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenLD24
Qi Chen, Wenjie Liu, Hu Ding:
A Novel Confidence Guided Training Method for Conditional GANs with Auxiliary Classifier. 6706-6714
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinHGX0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinHGX0024
Yukang Lin, Haonan Han, Chaoqun Gong, Zunnan Xu, Yachao Zhang, Xiu Li:
Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors. 6715-6724
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangHSWM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangHSWM24
Zhaoyu Zhang, Yang Hua, Guanxiong Sun, Hui Wang, Seán F. McLoone:
Improving the Training of the GANs with Limited Data via Dual Adaptive Noise Injection. 6725-6734
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenYYCHWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenYYCHWL24
Changgu Chen, Libing Yang, Xiaoyan Yang, Lianggangxu Chen, Gaoqi He, Changbo Wang, Yang Li:
FIND: Fine-tuning Initial Noise Distribution with Policy Optimization for Diffusion Models. 6735-6744
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Lu0GPXMXW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Lu0GPXMXW24
Tianyi Lu, Xing Zhang, Jiaxi Gu, Renjing Pei, Songcen Xu, Xingjun Ma, Hang Xu, Zuxuan Wu:
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models. 6745-6754
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiaoPHLMFFZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiaoPHLMFFZ24
Zhichao Liao, Fengyuan Piao, Di Huang, Xinghui Li, Yue Ma, Pingfa Feng, Heming Fang, Long Zeng:
Freehand Sketch Generation from Mechanical Components. 6755-6764
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangWH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangWH24
Qishan Zhang, Shuangbing Wen, Tao Hu:
Audio Deepfake Detection with Self-Supervised XLS-R and SLS Classifier. 6765-6773
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0004LDS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0004LDS024
Bohong Chen, Yumeng Li, Yao-Xiang Ding, Tianjia Shao, Kun Zhou:
Enabling Synergistic Full-Body Control in Prompt-Based Co-Speech Motion Generation. 6774-6783
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DuZWWWZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DuZWWWZ024
Xiangcheng Du, Zhao Zhou, Xingjiao Wu, Yanlong Wang, Zhuoyao Wang, Yingbin Zheng, Cheng Jin:
MultiColor: Image Colorization by Learning from Multiple Color Spaces. 6784-6792
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiaLCXWY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiaLCXWY24
Haozhe Jia, Yan Li, Hengfei Cui, Di Xu, Yuwang Wang, Tao Yu:
DisControlFace: Adding Disentangled Control to Diffusion Autoencoder for One-shot Explicit Facial Image Editing. 6793-6802
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangLW24
Lutao Jiang, Hangyu Li, Lin Wang:
A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness. 6803-6812
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WeiT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WeiT24
Yiluo Wei, Gareth Tyson:
Understanding the Impact of AI-Generated Content on Social Media: The Pixiv Case. 6813-6822
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang024
Ruiqi Zhang, Jie Chen:
Mesh-Centric Gaussian Splatting for Human Avatar Modelling with Real-time Dynamic Mesh Reconstruction. 6823-6832
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiongSLCZCY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiongSLCZCY024
Bo Xiong, Changqing Su, Zihan Lin, Yanqin Chen, You Zhou, Zhen Cheng, Zhaofei Yu, Tiejun Huang:
Real-time Parameter Evaluation of High-speed Microfluidic Droplets using Continuous Spike Streams. 6833-6841
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaoCGFS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaoCGFS24
Qi Mao, Lan Chen, Yuchao Gu, Zhen Fang, Mike Zheng Shou:
MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance. 6842-6850
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenS24
Guan-Yuan Chen, Von-Wun Soo:
Controllable Music Loops Generation with MIDI and Text via Multi-Stage Cross Attention and Instrument-Aware Reinforcement Learning. 6851-6859
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangY0SY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangY0SY24
Weitian Zhang, Yichao Yan, Yunhui Liu, Xingdong Sheng, Xiaokang Yang:
E³Gen: Efficient, Expressive and Editable Avatars Generation. 6860-6869
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0002CPYCN024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0002CPYCN024
Haibo Yang, Yang Chen, Yingwei Pan, Ting Yao, Zhineng Chen, Chong-Wah Ngo, Tao Mei:
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models. 6870-6879
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangSWQXZWZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangSWQXZWZ024
Shuo Huang, Shikun Sun, Zixuan Wang, Xiaoyu Qin, Yanmin Xiong, Yuan Zhang, Pengfei Wan, Di Zhang, Jia Jia:
PlacidDreamer: Advancing Harmony in Text-to-3D Generation. 6880-6889
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Li24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Li24
Xiaodi Li:
Streamable Portrait Video Editing with Probabilistic Pixel Correspondence. 6890-6899
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Hai0TLLNZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Hai0TLLNZZ24
Xuan Hai, Xin Liu, Yuan Tan, Gang Liu, Song Li, Weina Niu, Rui Zhou, Xiaokang Zhou:
What's the Real: A Novel Design Philosophy for Robust AI-Synthesized Voice Detection. 6900-6909
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuoZXTYCMY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuoZXTYCMY24
Xiangyang Luo, Xin Zhang, Yifan Xie, Xinyi Tong, Weijiang Yu, Heng Chang, Fei Ma, Fei Richard Yu:
CodeSwap: Symmetrically Face Swapping Based on Prior Codebook. 6910-6919
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangMZJYJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangMZJYJ24
Ruofan Wang, Xingjun Ma, Hanxu Zhou, Chuanjun Ji, Guangnan Ye, Yu-Gang Jiang:
White-box Multimodal Jailbreaks Against Large Vision-Language Models. 6920-6928
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuSXYYYL00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuSXYYYL00024
Anwen Hu, Yaya Shi, Haiyang Xu, Jiabo Ye, Qinghao Ye, Ming Yan, Chenliang Li, Qi Qian, Ji Zhang, Fei Huang:
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model. 6929-6938
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenGXC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenGXC24
Weifeng Chen, Tao Gu, Yuhao Xu, Arlene Chen:
Magic Clothing: Controllable Garment-Driven Image Synthesis. 6939-6948
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WeiZ0T24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WeiZ0T24
Yiluo Wei, Yiming Zhu, Pan Hui, Gareth Tyson:
Exploring the Use of Abusive Generative AI Models on Civitai. 6949-6958
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DuanTFZHCWCG0G24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DuanTFZHCWCG0G24
Xiuliang Duan, Dating Tan, Liangda Fang, Yuyu Zhou, Chaobo He, Ziliang Chen, Lusheng Wu, Guanliang Chen, Zhiguo Gong, Weiqi Luo, Quanlong Guan:
Reason-and-Execute Prompting: Enhancing Multi-Modal Large Language Models for Solving Geometry Questions. 6959-6968
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuWZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuWZL24
Weiye Xu, Min Wang, Wengang Zhou, Houqiang Li:
P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task. 6969-6978
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuanX0WL0T24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuanX0WL0T24
Wenjie Xuan, Yufei Xu, Shanshan Zhao, Chaoyue Wang, Juhua Liu, Bo Du, Dacheng Tao:
When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability. 6979-6988
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenXZH0L024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenXZH0L024
Wenshuo Chen, Hongru Xiao, Erhang Zhang, Lijie Hu, Lei Wang, Mengyuan Liu, Chen Chen:
SATO: Stable Text-to-Motion Framework. 6989-6997
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YeJL0CLSPBHXLG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YeJL0CLSPBHXLG24
Zhen Ye, Zeqian Ju, Haohe Liu, Xu Tan, Jianyi Chen, Yiwen Lu, Peiwen Sun, Jiahao Pan, Weizhen Bian, Shulin He, Wei Xue, Qifeng Liu, Yike Guo:
FlashSpeech: Efficient Zero-Shot Speech Synthesis. 6998-7007
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuHLCWCZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuHLCWCZZ24
Huadai Liu, Rongjie Huang, Yang Liu, Hengyuan Cao, Jialei Wang, Xize Cheng, Siqi Zheng, Zhou Zhao:
AudioLCM: Efficient and High-Quality Text-to-Audio Generation with Minimal Inference Steps. 7008-7017
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangC0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangC0024
Jiaxu Zhang, Xin Chen, Gang Yu, Zhigang Tu:
Generative Motion Stylization of Cross-structure Characters within Canonical Motion Space. 7018-7026
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWGY0LLM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWGY0LLM24
Fengqi Liu, Hexiang Wang, Jingyu Gong, Ran Yi, Qianyu Zhou, Xuequan Lu, Jiangbo Lu, Lizhuang Ma:
Emphasizing Semantic Consistency of Salient Posture for Speech-Driven Gesture Generation. 7027-7035
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengGJWZC0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengGJWZC0024
Tianyi Zheng, Cong Geng, Peng-Tao Jiang, Ben Wan, Hao Zhang, Jinwei Chen, Jia Wang, Bo Li:
Non-uniform Timestep Sampling: Towards Faster Diffusion Model Training. 7036-7045
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YeZ0TH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YeZ0TH24
Miaoxin Ye, Saixing Zhou, Weiqi Luo, Shunquan Tan, Jiwu Huang:
GAN-based Symmetric Embedding Costs Adjustment for Enhancing Image Steganographic Security. 7046-7054
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiFFMBZZHCHSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiFFMBZZHCHSZ24
Yaqi Li, Han Fang, Zerun Feng, Kaijing Ma, Chao Ban, Xianghao Zang, Lanxiang Zhou, Zhongjiang He, Jingyan Chen, Jiani Hu, Hao Sun, Huayu Zhang:
GOAL: Grounded text-to-image Synthesis with Joint Layout Alignment Tuning. 7055-7064
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WeiZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WeiZ24
Jinfeng Wei, Xiaofeng Zhang:
DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer. 7065-7074
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuoZQYCJM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuoZQYCJM24
Yang Luo, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Zhineng Chen, Yu-Gang Jiang, Tao Mei:
FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process. 7075-7084
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuX0WT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuX0WT24
Wenquan Lu, Yufei Xu, Jing Zhang, Chaoyue Wang, Dacheng Tao:
HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting. 7085-7093
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWQ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWQ024
Miao Liu, Jing Wang, Xinyuan Qian, Haizhou Li:
ListenFormer: Responsive Listening Head Generation with Non-autoregressive Transformers. 7094-7103
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuLMCZZJJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuLMCZZJJ24
Jie Hu, Jie Li, Yue Ma, Liujuan Cao, Songan Zhang, Wei Zhang, Guannan Jiang, Rongrong Ji:
Prompting to Adapt Foundational Segmentation Models. 7104-7112
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0005JQZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0005JQZ24
Zhiyuan Ma, Guoli Jia, Biqing Qi, Bowen Zhou:
Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger for Invisible Generative Watermarking. 7113-7122
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunSWXS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunSWXS024
Jin Sun, Xiaoshuang Shi, Zhiyuan Wang, Kaidi Xu, Heng Tao Shen, Xiaofeng Zhu:
Caterpillar: A Pure-MLP Architecture with Shifted-Pillars-Concatenation. 7123-7132
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangDCZZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangDCZZ024
Yuanbin Wang, Weilun Dai, Long Chan, Huanyu Zhou, Aixi Zhang, Si Liu:
GPD-VVTO: Preserving Garment Details in Video Virtual Try-On. 7133-7142
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZCC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZCC24
Hengfei Wang, Zhongqun Zhang, Yihua Cheng, Hyung Jin Chang:
TextGaze: Gaze-Controllable Face Generation with Natural Language. 7143-7151
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengGYZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengGYZ024
Huiming Zheng, Wei Gao, Zhuozhen Yu, Tiesong Zhao, Ge Li:
ViewPCGC: View-Guided Learned Point Cloud Geometry Compression. 7152-7161
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeHL0W0C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeHL0W0C24
Liyang He, Zhenya Huang, Chenglong Liu, Rui Li, Runze Wu, Qi Liu, Enhong Chen:
One-bit Deep Hashing: Towards Resource-Efficient Hashing Model with Binary Neural Network. 7162-7171
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wu00W0ZS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wu00W0ZS24
Xinghao Wu, Xuefeng Liu, Jianwei Niu, Haolin Wang, Shaojie Tang, Guogang Zhu, Hao Su:
Decoupling General and Personalized Knowledge in Federated Learning via Additive and Low-rank Decomposition. 7172-7181
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangXMLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangXMLL24
Hengyi Wang, Weiying Xie, Jitao Ma, Daixun Li, Yunsong Li:
FedSLS: Exploring Federated Aggregation in Saliency Latent Space. 7182-7190
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangSZ24
Zhongchi Wang, Hailong Sun, Zhengyang Zhao:
FedEvalFair: A Privacy-Preserving and Statistically Grounded Federated Fairness Evaluation Framework. 7191-7199
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TangLDHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TangLDHL24
Weitao Tang, Jianqiang Li, Meijie Du, Die Hu, Qingyun Liu:
Zenith: Real-time Identification of DASH Encrypted Video Traffic with Distortion. 7200-7209
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoBC0H24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoBC0H24
Beizhang Guo, Juntao Bao, Baili Chai, Di Wu, Miao Hu:
Lumos: Optimizing Live 360-degree Video Upstreaming via Spatial-Temporal Integrated Neural Enhancement. 7210-7219
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiWYSX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiWYSX24
Zhongnian Li, Meng Wei, Peng Ying, Tongfeng Sun, Xinzheng Xu:
Learning from Concealed Labels. 7220-7228
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DaiZYX0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DaiZYX0L24
Xiangxiang Dai, Zeyu Zhang, Peng Yang, Yuedong Xu, Xutong Liu, John C. S. Lui:
AxiomVision: Accuracy-Guaranteed Adaptive Visual Model Selection for Perspective-Aware Video Analytics. 7229-7238
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangWXGLHB024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangWXGLHB024
Shuo Wang, Yongcai Wang, Zhimin Xu, Yongyu Guo, Wanting Li, Zhe Huang, Xuewei Bai, Deying Li:
GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System. 7239-7248
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangZZWC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangZZWC024
Yiyang Jiang, Wengyu Zhang, Xulu Zhang, Xiaoyong Wei, Chang Wen Chen, Qing Li:
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval. 7249-7258

Oral Session 18: Fairness, Trust, Explainability & Inperpretability in Multimedia

- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunZ024
Peiwen Sun, Honggang Zhang, Di Hu:
Unveiling and Mitigating Bias in Audio Visual Segmentation. 7259-7268
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLXSG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLXSG024
Ying Liu, Lihong Liu, Cai Xu, Xiangyu Song, Ziyu Guan, Wei Zhao:
Dynamic Evidence Decoupling for Trusted Multi-view Learning. 7269-7277
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Liu0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Liu0Y24
Wei Liu, Yufei Chen, Xiaodong Yue:
Building Trust in Decision with Conformalized Multi-view Deep Classification. 7278-7287
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZongDC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZongDC24
Daoming Zong, Chaoyue Ding, Kaitao Chen:
Toward Explainable Physical Audiovisual Commonsense Reasoning. 7288-7297
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZengYYYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZengYYYL24
Jingjie Zeng, Zhihao Yang, Qi Yang, Liang Yang, Hongfei Lin:
Peeling Back the Layers: Interpreting the Storytelling of ViT. 7298-7306
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Matsuhira0KHI24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Matsuhira0KHI24
Chihaya Matsuhira, Marc A. Kastner, Takahiro Komamizu, Takatsugu Hirayama, Ichiro Ide:
Investigating Conceptual Blending of a Diffusion Model for Improving Nonword-to-Image Generation. 7307-7315

Oral Session 19: Multimodal Applications

- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuZSD0AHGMYW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuZSD0AHGMYW24
Minghui Wu, Chenxu Zhao, Anyang Su, Donglin Di, Tianyu Fu, Da An, Min He, Ya Gao, Meng Ma, Kun Yan, Ping Wang:
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding. 7316-7325
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DengXCWK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DengXCWK24
Yanglin Deng, Tianyang Xu, Chunyang Cheng, Xiao-Jun Wu, Josef Kittler:
MMDRFuse: Distilled Mini-Model with Dynamic Refresh for Multi-Modality Image Fusion. 7326-7335
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiYYWYX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiYYWYX24
Ziyan Li, Jianfei Yu, Jia Yang, Wenya Wang, Li Yang, Rui Xia:
Generative Multimodal Data Augmentation for Low-Resource Multimodal Named Entity Recognition. 7336-7345
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GeHZ0WTZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GeHZ0WTZ24
Zhiqi Ge, Hongzhe Huang, Mingze Zhou, Juncheng Li, Guoming Wang, Siliang Tang, Yueting Zhuang:
WorldGPT: Empowering LLM as Multimodal World Model. 7346-7355
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiGW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiGW024
Yiming Li, Zhifang Guo, Xiangdong Wang, Hong Liu:
Advancing Multi-grained Alignment for Contrastive Language-Audio Pre-training. 7356-7365
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiHAM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiHAM24
Yingxuan Li, Ryota Hinami, Kiyoharu Aizawa, Yusuke Matsui:
Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion. 7366-7374

Oral Session 20: Datasets & Algorithms for Multimedia Analysis

- view
  authority control:
- export record
  dblp key:
  - conf/mm/Li0HZKC00LZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Li0HZKC00LZ24
Chunyi Li, Haoning Wu, Hongkun Hao, Zicheng Zhang, Tengchuan Kou, Chaofeng Chen, Lei Bai, Xiaohong Liu, Weisi Lin, Guangtao Zhai:
G-Refine: A General Quality Refiner for Text-to-Image Generation. 7375-7384
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuDZLZX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuDZLZX24
Wenqiang Xu, Wenrui Dai, Ziyang Zheng, Chenglin Li, Junni Zou, Hongkai Xiong:
Point Cloud Upsampling with Geometric Algebra Driven Inverse Heat Dissipation. 7385-7394
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuLLY0C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuLLY0C24
Junyan Wu, Wei Lu, Xiangyang Luo, Rui Yang, Qian Wang, Xiaochun Cao:
Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization. 7395-7403
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanYD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanYD024
Fujun Han, Peng Ye, Shukai Duan, Lidan Wang:
Ada-iD: Active Domain Adaptation for Intrusion Detection. 7404-7413
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Cai0AHDGS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Cai0AHDGS24
Zhixi Cai, Shreya Ghosh, Aman Pankaj Adatia, Munawar Hayat, Abhinav Dhall, Tom Gedeon, Kalin Stefanov:
AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset. 7414-7423
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YanagiT0H24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YanagiT0H24
Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama:
DQG: Database Question Generation for Exact Text-based Image Retrieval. 7424-7433

Oral Session 21: Image Enhancement and Super-Resolution

- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangLZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangLZL24
Tongshun Zhang, Pingping Liu, Ming Zhao, Haotian Lv:
DMFourLLIE: Dual-Stage and Multi-Branch Fourier Network for Low-Light Image Enhancement. 7434-7443
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0006LSQ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0006LSQ024
Fei Gao, Yuhao Lin, Jiaqi Shi, Maoying Qiao, Nannan Wang:
AesMamba: Universal Image Aesthetic Assessment with State Space Models. 7444-7453
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DongWFOL0RXH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DongWFOL0RXH24
Yi Dong, Yuxi Wang, Zheng Fang, Wenqi Ouyang, Xianhui Lin, Zhiqi Shen, Peiran Ren, Xuansong Xie, Qingming Huang:
MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement. 7454-7463
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiGZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiGZ024
Ruibin Li, Jingcai Guo, Qihua Zhou, Song Guo:
FreePIH: Training-Free Painterly Image Harmonization with Diffusion Model. 7464-7473
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangXLWLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangXLWLH24
Qian Huang, Cheng Xu, Guiqing Li, Ziheng Wu, Shengxin Liu, Shengfeng He:
Portrait Shadow Removal via Self-Exemplar Illumination Equalization. 7474-7482
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuWCCZY0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuWCCZY0Z24
Qiwen Zhu, Yanjie Wang, Shilv Cai, Liqun Chen, Jiahuan Zhou, Luxin Yan, Sheng Zhong, Xu Zou:
Perceptual-Distortion Balanced Image Super-Resolution is a Multi-Objective Optimization Problem. 7483-7492

Oral Session 22: Audio-visual Datasets and Applications

- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangYNL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangYNL24
Han Wang, Tan Rui Yang, Usman Naseem, Roy Ka-Wei Lee:
MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili. 7493-7502
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuZT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuZT024
Jiale Yu, Baopeng Zhang, Zhu Teng, Jianping Fan:
OpenAVE: Moving towards Open Set Audio-Visual Event Localization. 7503-7512
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuTWHX00Z024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuTWHX00Z024
Xinfa Zhu, Wenjie Tian, Xinsheng Wang, Lei He, Yujia Xiao, Xi Wang, Xu Tan, Sheng Zhao, Lei Xie:
UniStyle: Unified Style Modeling for Speaking Style Captioning and Stylistic Speech Synthesis. 7513-7522
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang0CYG0HQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang0CYG0HQ24
Zhedong Zhang, Liang Li, Gaoxiang Cong, Haibing Yin, Yuhan Gao, Chenggang Yan, Anton van den Hengel, Yuankai Qi:
From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning. 7523-7532
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoQNQYSXY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoQNQYSXY24
Ruohao Guo, Liao Qu, Dantong Niu, Yanyu Qi, Wenzhen Yue, Ji Shi, Bowei Xing, Xianghua Ying:
Open-Vocabulary Audio-Visual Semantic Segmentation. 7533-7541

Oral Session 23: Multimodal Learning and Recommendation Systems

- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiZGLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiZGLW24
Hongcheng Li, Yucan Zhou, Xiaoyan Gu, Bo Li, Weiping Wang:
Diversified Semantic Distribution Matching for Dataset Distillation. 7542-7550
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangLLWW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangLLWW24
Jinghao Zhang, Guofan Liu, Qiang Liu, Shu Wu, Liang Wang:
Modality-Balanced Learning for Multimedia Recommendation. 7551-7560
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YeZA0RLR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YeZA0RLR24
Ziyi Ye, Jingtao Zhan, Qingyao Ai, Yiqun Liu, Maarten de Rijke, Christina Lioma, Tuukka Ruotsalo:
Query Augmentation with Brain Signals. 7561-7570
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShiYLYK0X24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShiYLYK0X24
Lei Shi, Jiapeng Yang, Pengtao Lv, Lu Yuan, Feifei Kou, Jia Luo, Mingying Xu:
Self-derived Knowledge Graph Contrastive Learning for Recommendation. 7571-7580
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Lin0XGJXZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Lin0XGJXZZ24
Jiaye Lin, Qing Li, Guorui Xie, Zhongxu Guan, Yong Jiang, Ting Xu, Zhong Zhang, Peilin Zhao:
Mitigating Sample Selection Bias with Robust Domain Adaption in Multimedia Recommendation. 7581-7590
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangX0LLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangX0LLH24
Yangqin Jiang, Lianghao Xia, Wei Wei, Da Luo, Kangyi Lin, Chao Huang:
DiffMM: Multi-Modal Diffusion Model for Recommendation. 7591-7599

Oral Session 24: Novel Multimedia Applications 2

- view
  authority control:
- export record
  dblp key:
  - conf/mm/Feng0HZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Feng0HZ024
Tongtong Feng, Xin Wang, Feilin Han, Leping Zhang, Wenwu Zhu:
U2UData: A Large-scale Cooperative Perception Dataset for Swarm UAVs Autonomous Flight. 7600-7608
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Niu0ZWLLL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Niu0ZWLLL024
Chaoqun Niu, Dongdong Chen, Jizhe Zhou, Jian Wang, Xiang Luo, Quan-Hui Liu, Yuan Li, Jiancheng Lv:
Neural Boneprint: Person Identification from Bones Using Generative Contrastive Deep Learning. 7609-7618
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuLYFLZZGZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuLYFLZZGZ24
Xueli Hu, Huan Liu, Haocheng Yuan, Zhiyang Fu, Yizhi Luo, Ning Zhang, Hang Zou, Jianwen Gan, Yuan Zhang:
Fine-Grained Prompt Learning for Face Anti-Spoofing. 7619-7628
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanRYSM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanRYSM24
Xiao Han, Yiming Ren, Yichen Yao, Yujing Sun, Yuexin Ma:
Towards Practical Human Motion Prediction with LiDAR Point Clouds. 7629-7638
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HongWHWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HongWHWL24
Haodong Hong, Sen Wang, Zi Huang, Qi Wu, Jiajun Liu:
Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments. 7639-7648
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Gao00P0WLZTZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Gao00P0WLZTZ24
Minghe Gao, Juncheng Li, Hao Fei, Liang Pang, Wei Ji, Guoming Wang, Zheqi Lv, Wenqiao Zhang, Siliang Tang, Yueting Zhuang:
De-fine: Decomposing and Refining Visual Programs with Auto-Feedback. 7649-7657

Oral Session 25: Media and Communication Technologies

- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuZ024
Jingjing Liu, Youyi Zheng, Kun Zhou:
Virtual Agent Positioning Driven by Personal Characteristics. 7658-7666
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Luo0LWLPCLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Luo0LWLPCLH24
Meng Luo, Hao Fei, Bobo Li, Shengqiong Wu, Qian Liu, Soujanya Poria, Erik Cambria, Mong-Li Lee, Wynne Hsu:
PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis. 7667-7676
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuoSSHYP024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuoSSHYP024
Yawen Luo, Min Shi, Liao Shen, Yachuan Huang, Zixuan Ye, Juewen Peng, Zhiguo Cao:
Video Bokeh Rendering: Make Casual Videography Cinematic. 7677-7685
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangCZH0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangCZH0L24
Zhenyu Zhang, Guangyao Chen, Yixiong Zou, Zhimeng Huang, Yuhua Li, Ruixuan Li:
MICM: Rethinking Unsupervised Pretraining for Enhanced Few-shot Learning. 7686-7695
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang0Z024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang0Z024
Zejun Zhang, Xiao Zhu, Anlan Zhang, Feng Qian:
An In-depth Study of Bandwidth Allocation across Media Sources in Video Conferencing. 7696-7704
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangZWHX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangZWHX024
Zixuan Yang, Yushu Zhang, Tao Wang, Zhongyun Hua, Zhihua Xia, Jian Weng:
Once-for-all: Efficient Visual Face Privacy Protection via Person-specific Veils. 7705-7713

Oral Session 26: Cultural Heritage & Media Analysis

- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhu0NZLF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhu0NZLF24
Shipeng Zhu, Hui Xue, Na Nie, Chenjie Zhu, Haiyue Liu, Pengfei Fang:
Reproducing the Past: A Dataset for Benchmarking Inscription Restoration. 7714-7723
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Pan0YHTBBT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Pan0YHTBBT24
Jiao Pan, Liang Li, Hiroshi Yamaguchi, Kyoko Hasegawa, Fadjar Ibnu Thufail, Brahmantara, Xiaojuan Ban, Satoshi Tanaka:
Reconstructing, Understanding, and Analyzing Relief Type Cultural Heritage from a Single Old Photo. 7724-7733
- view
  authority control:
- export record
  dblp key:
  - conf/mm/BinSDH00NS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/BinSDH00NS24
Yi Bin, Wenhao Shi, Yujuan Ding, Zhiqiang Hu, Zheng Wang, Yang Yang, See-Kiong Ng, Heng Tao Shen:
GalleryGPT: Analyzing Paintings with Large Multimodal Models. 7734-7743
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaR24
Jun Ma, Tuukka Ruotsalo:
Cognition-Supervised Saliency Detection: Contrasting EEG Signals and Visual Stimuli. 7744-7753
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuZLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuZLZ24
Yizhang Liu, Weiwei Zhou, Yanping Li, Shengjie Zhao:
RoSe: Rotation-Invariant Sequence-Aware Consensus for Robust Correspondence Pruning. 7754-7763
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZD24
Yujia Wang, Fang-Lue Zhang, Neil A. Dodgson:
ScanTD: 360° Scanpath Prediction based on Time-Series Diffusion. 7764-7773

Oral Session 27: Security & Quality in Multimedia Systems

- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenLWC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenLWC24
Dunyun Chen, Xin Liao, Xiaoshuai Wu, Shiwei Chen:
SafePaint: Anti-forensic Image Inpainting with Domain Adaptation. 7774-7782
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang0ZL0CM0LZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang0ZL0CM0LZ24
Zicheng Zhang, Haoning Wu, Yingjie Zhou, Chunyi Li, Wei Sun, Chaofeng Chen, Xiongkuo Min, Xiaohong Liu, Weisi Lin, Guangtao Zhai:
LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM. 7783-7792
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KouLZL0MZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KouLZL0MZL24
Tengchuan Kou, Xiaohong Liu, Zicheng Zhang, Chunyi Li, Haoning Wu, Xiongkuo Min, Guangtao Zhai, Ning Liu:
Subjective-Aligned Dataset and Metric for Text-to-Video Quality Assessment. 7793-7802
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0ZJJZMZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0ZJJZMZ24
Puyi Wang, Wei Sun, Zicheng Zhang, Jun Jia, Yanwei Jiang, Zhichao Zhang, Xiongkuo Min, Guangtao Zhai:
Large Multi-modality Model Assisted AI-Generated Image Quality Assessment. 7803-7812
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhou0CPC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhou0CPC24
Xuemei Zhou, Irene Viola, Yunlu Chen, Jiahuan Pei, Pablo César:
Deciphering Perceptual Quality in Colored Point Cloud: Prioritizing Geometry or Texture Distortion? 7813-7822
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuanW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuanW24
Desen Yuan, Lei Wang:
Dual-Criterion Quality Loss for Blind Image Quality Assessment. 7823-7832

Oral Session 28: Complex Scene Processing

- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangWWL0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangWWL0024
Zhe Huang, Shuo Wang, Yongcai Wang, Wanting Li, Deying Li, Lei Wang:
RoCo: Robust Cooperative Perception By Iterative Object Matching and Pose Adjustment. 7833-7842
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZCCPCYZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZCCPCYZ24
Shao-Kui Zhang, Hanxi Zhu, Xuebin Chen, Jinghuan Chen, Zhike Peng, Ziyang Chen, Yong-Liang Yang, Song-Hai Zhang:
ScenePhotographer: Object-Oriented Photography for Residential Scenes. 7843-7851
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuLJM0LDSJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuLJM0LDSJ24
Changli Wu, Yihang Liu, Jiayi Ji, Yiwei Ma, Haowei Wang, Gen Luo, Henghui Ding, Xiaoshuai Sun, Rongrong Ji:
3D-GRES: Generalized 3D Referring Expression Segmentation. 7852-7861
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanZY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanZY24
Xuan Han, Yihao Zhao, Mingyu You:
Scene Diffusion: Text-driven Scene Image Synthesis Conditioning on a Single 3D Model. 7862-7870
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YanPTW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YanPTW24
Jinbo Yan, Rui Peng, Luyang Tang, Ronggang Wang:
4D Gaussian Splatting with Scale-aware Residual Field and Adaptive Optimization for Real-time Rendering of Temporally Complex Dynamic Scenes. 7871-7880
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuYX0ZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuYX0ZZ24
Hongtao Wu, Yijun Yang, Huihui Xu, Weiming Wang, Jinni Zhou, Lei Zhu:
RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining. 7881-7890

Oral Session 29: Enhancements in Video Streaming and Compression

- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuLLYWDX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuLLYWDX24
Bo Wu, Tong Li, Cheng Luo, Xu Yan, Fuyu Wang, Xinle Du, Ke Xu:
Toward Timeliness-Enhanced Loss Recovery for Large-Scale Live Streaming. 7891-7899
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouHZWWZY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouHZWWZY24
Fangtao Zhou, Xiaofeng Huang, Peng Zhang, Meng Wang, Zhao Wang, Yang Zhou, Haibing Yin:
Enhanced Screen Content Image Compression: A Synergistic Approach for Structural Fidelity and Text Integrity Preservation. 7900-7908
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangLZSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangLZSL24
Miao Zhang, Jiaxing Li, Haoyuan Zhao, Linfeng Shen, Jiangchuan Liu:
StarStream: Live Video Analytics over Space Networking. 7909-7917
- view
  authority control:
- export record
  dblp key:
  - conf/mm/BiZXYLLX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/BiZXYLLX24
Pengqiang Bi, Yifei Zou, Mengbai Xiao, Dongxiao Yu, Yijun Li, Zhixiong Liu, Qun Xie:
LiteQUIC: Improving QoE of Video Streams by Reducing CPU Overhead of QUIC. 7918-7927
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JinD0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JinD0024
Yili Jin, Xize Duan, Fangxin Wang, Xue Liu:
HeadsetOff: Enabling Photorealistic Video Conferencing on Economical VR Headsets. 7928-7936
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengZHZ00W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengZHZ00W24
Zihan Zheng, Houqiang Zhong, Qiang Hu, Xiaoyun Zhang, Li Song, Ya Zhang, Yanfeng Wang:
HPC: Hierarchical Progressive Coding Framework for Volumetric Video. 7937-7946

Poster Session 3

- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuZLH0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuZLH0W24
Lianghui Zhu, Junwei Zhou, Yan Liu, Xin Hao, Wenyu Liu, Xinggang Wang:
WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition. 7947-7956
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunLRKAP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunLRKAP24
Xiangyu Sun, Joo Chan Lee, Daniel Rho, Jong Hwan Ko, Usman Ali, Eunbyung Park:
F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting. 7957-7965
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuLYD0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuLYD0Z24
Sijing Wu, Yunhao Li, Yichao Yan, Huiyu Duan, Ziwei Liu, Guangtao Zhai:
MMHead: Towards Fine-grained Multi-modal 3D Facial Animation. 7966-7975
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiWKM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiWKM24
Chunxiao Li, Shuyang Wang, Xuejing Kang, Anlong Ming:
Thinking Temporal Automatic White Balance: Datasets, Models and Benchmarks. 7976-7984
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuoF0ASB024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuoF0ASB024
Zhe Luo, Weina Fu, Shuai Liu, Saeed Anwar, Muhammad Saqib, Sambit Bakshi, Khan Muhammad:
Cefdet: Cognitive Effectiveness Network Based on Fuzzy Inference for Action Detection. 7985-7994
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangL024
Wencan Huang, Daizong Liu, Wei Hu:
Advancing 3D Object Grounding Beyond a Single 3D Scene. 7995-8004
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangHWC0F0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangHWC0F0024
Bin Huang, Feng He, Qi Wang, Hong Chen, Guohao Li, Zhifan Feng, Xin Wang, Wenwu Zhu:
Neighbor Does Matter: Curriculum Global Positive-Negative Sampling for Vision-Language Pre-training. 8005-8014
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JinNYCZQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JinNYCZQ24
Haoyuan Jin, Xuesong Nie, Yunfeng Yan, Xi Chen, Zhihang Zhu, Donglian Qi:
Object-Level Pseudo-3D Lifting for Distance-Aware Tracking. 8015-8023
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuJXLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuJXLC24
Xinwei Liu, Xiaojun Jia, Yuan Xun, Siyuan Liang, Xiaochun Cao:
Multimodal Unlearnable Examples: Protecting Data against Multimodal Contrastive Learning. 8024-8033
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0003MZH0Q024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0003MZH0Q024
Ge Luo, Yuchen Ma, Manman Zhang, Junqiang Huang, Sheng Li, Zhenxing Qian, Xinpeng Zhang:
Engaging Live Video Comments Generation. 8034-8042
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenWLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenWLY24
Lu Chen, Qiangchang Wang, Zhaohui Li, Yilong Yin:
Hypergraph-guided Intra- and Inter-category Relation Modeling for Fine-grained Visual Recognition. 8043-8052
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xie0YZZSZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xie0YZZSZ024
Yuan Xie, Yichen Zhang, Yifang Yin, Sheng Zhang, Ying Zhang, Rajiv Ratn Shah, Roger Zimmermann, Guoqing Xiao:
Traj2Former: A Local Context-aware Snapshot and Sequential Dual Fusion Transformer for Trajectory Classification. 8053-8061
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiZZCWSZW00SJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiZZCWSZW00SJ24
Guilin Li, Mengdan Zhang, Xiawu Zheng, Peixian Chen, Zihan Wang, Yunhang Shen, Mingchen Zhuge, Chenglin Wu, Fei Chao, Ke Li, Xing Sun, Rongrong Ji:
Multimodal Inplace Prompt Tuning for Open-set Object Detection. 8062-8071
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChengMP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChengMP24
Shengran Cheng, Chuhang Ma, Ye Pan:
StylizedFacePoint: Facial Landmark Detection for Stylized Characters. 8072-8080
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangY24
Sheng Zhang, Xi Yang:
Information Fusion with Knowledge Distillation for Fine-grained Remote Sensing Object Detection. 8081-8089
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoWT0G24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoWT0G24
Bowen Zhao, Qianqian Wang, Zhiqiang Tao, Wei Feng, Quanxue Gao:
DFMVC: Deep Fair Multi-view Clustering. 8090-8099
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLZZ0SSLJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLZZ0SSLJ24
Ruyu Liu, Zhengzhe Liu, Haoyu Zhang, Guodao Zhang, Jianhua Zhang, Bo Sun, Weiguo Sheng, Xiufeng Liu, Yaochu Jin:
ColVO: Colonoscopic Visual Odometry Considering Geometric and Photometric Consistency. 8100-8109
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinYYMZLLWT0K24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinYYMZLLWT0K24
Xun Lin, Yi Yu, Zitong Yu, Ruohan Meng, Jiale Zhou, Ajian Liu, Yizhong Liu, Shuai Wang, Wenzhong Tang, Zhen Lei, Alex C. Kot:
HideMIA: Hidden Wavelet Mining for Privacy-Enhancing Medical Image Analysis. 8110-8119
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuCR0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuCR0Y24
Shuyuan Liu, Jiawei Chen, Shouwei Ruan, Hang Su, Zhaoxia Yin:
Exploring the Robustness of Decision-Level Through Adversarial Attacks on LLM-Based Embodied Models. 8120-8128
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TianYWCXHC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TianYWCXHC24
Jiahe Tian, Cai Yu, Xi Wang, Peng Chen, Zihao Xiao, Jizhong Han, Yesheng Chai:
Dynamic Mixed-Prototype Model for Incremental Deepfake Detection. 8129-8138
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Liu0B24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Liu0B24
Tianshan Liu, Kin-Man Lam, Bing-Kun Bao:
Label Text-aided Hierarchical Semantics Mining for Panoramic Activity Recognition. 8139-8148
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangCF0ZJZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangCF0ZJZ024
Xiaoda Yang, Xize Cheng, Dongjie Fu, Minghui Fang, Jialong Zuo, Shengpeng Ji, Zhou Zhao, Tao Jin:
SyncTalklip: Highly Synchronized Lip-Readable Speaker Generation with Multi-Task Learning. 8149-8158
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YiBZZJHL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YiBZZJHL024
Jingjun Yi, Qi Bi, Hao Zheng, Haolan Zhan, Wei Ji, Yawen Huang, Yuexiang Li, Yefeng Zheng:
Learning Spectral-Decomposited Tokens for Domain Generalized Semantic Segmentation. 8159-8168
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinZSGS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinZSGS24
Peng Yin, Xiaosu Zhu, Jingkuan Song, Lianli Gao, Heng Tao Shen:
SI-BiViT: Binarizing Vision Transformers with Spatial Interaction. 8169-8178
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiLSCG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiLSCG24
Ao Li, Huijun Liu, Jinrong Sheng, Zhongming Chen, Yongxin Ge:
Efficient Dual-Confounding Eliminating for Weakly-supervised Temporal Action Localization. 8179-8188
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GeFCASJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GeFCASJ24
Xuri Ge, Junchen Fu, Fuhai Chen, Shan An, Nicu Sebe, Joemon M. Jose:
Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning. 8189-8198
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WooRJCC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WooRJCC24
Jongbhin Woo, Hyeonggon Ryu, Youngjoon Jang, Jae-Won Cho, Joon Son Chung:
Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding. 8199-8208
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenHXWX0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenHXWX0024
Jiali Chen, Xusen Hei, Yuqi Xue, Yuancheng Wei, Jiayuan Xie, Yi Cai, Qing Li:
Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor. 8209-8218
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SongL0HYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SongL0HYL24
Yu-Pei Song, Yuantong Liu, Xiao Wu, Qi He, Zhaoquan Yuan, Ao Luo:
MagicCartoon: 3D Pose and Shape Estimation for Bipedal Cartoon Characters. 8219-8227
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuMZYYLE0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuMZYYLE0024
Ajian Liu, Hui Ma, Junze Zheng, Haocheng Yuan, Xiaoyuan Yu, Yanyan Liang, Sergio Escalera, Jun Wan, Zhen Lei:
FM-CLIP: Flexible Modal CLIP for Face Anti-Spoofing. 8228-8237
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoGZZLS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoGZZLS24
Jiaqi Guo, Lianli Gao, Junchen Zhu, Jiaxin Zhang, Siyang Li, Jingkuan Song:
MagicVFX: Visual Effects Synthesis in Just Minutes. 8238-8246
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Liu00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Liu00024
Kangzheng Liu, Feng Zhao, Yu Yang, Guandong Xu:
DySarl: Dynamic Structure-Aware Representation Learning for Multimodal Knowledge Graph Reasoning. 8247-8256
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YanWLGZJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YanWLGZJ24
Weicai Yan, Ye Wang, Wang Lin, Zirun Guo, Zhou Zhao, Tao Jin:
Low-rank Prompt Interaction for Continual Vision-Language Retrieval. 8257-8266
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouYBFHLX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouYBFHLX24
Jing Zhou, Ziqi Yu, Zhongyun Bao, Gang Fu, Weilei He, Chao Liang, Chunxia Xiao:
Foreground Harmonization and Shadow Generation for Composite Image. 8267-8276
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaCZ0Z0X24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaCZ0Z0X24
Zhen-Xiang Ma, Zhen-Duo Chen, Li-Jun Zhao, Zi-Chao Zhang, Tai Zheng, Xin Luo, Xin-Shun Xu:
Bi-directional Task-Guided Network for Few-Shot Fine-Grained Image Classification. 8277-8286
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeTLLAL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeTLLAL24
Xiao He, Chang Tang, Xinwang Liu, Chuankun Li, Shan An, Zhenglai Li:
Heterogeneous Graph Guided Contrastive Learning for Spatially Resolved Transcriptomics Data. 8287-8295
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangWZWL0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangWZWL0024
Yabing Wang, Le Wang, Qiang Zhou, Zhibin Wang, Hao Li, Gang Hua, Wei Tang:
Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval. 8296-8305
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Yang0ZWS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Yang0ZWS024
Zhiwen Yang, Liang Li, Jiehua Zhang, Tingyu Wang, Yaoqi Sun, Chenggang Yan:
Domain Shared and Specific Prompt Learning for Incremental Monocular Depth Estimation. 8306-8315
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeD24
Shuting He, Henghui Ding:
RefMask3D: Language-Guided Transformer for 3D Referring Segmentation. 8316-8325
- view
  authority control:
- export record
  dblp key:
  - conf/mm/BaiCTZ0C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/BaiCTZ0C24
Yunwei Bai, Bill Yang Cai, Ying Kiat Tan, Zangwei Zheng, Shiming Chen, Tsuhan Chen:
FSL-QuickBoost: Minimal-Cost Ensemble for Few-Shot Learning. 8326-8335
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PangLH0WZHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PangLH0WZHS24
Jinhui Pang, Changqing Lin, Xiaoshuai Hao, Rong Yin, Zixuan Wang, Zhihui Zhang, Jinglin He, Huang Tai Sheng:
FTF-ER: Feature-Topology Fusion-Based Experience Replay Method for Continual Graph Learning. 8336-8344
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LvNZYLW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LvNZYLW024
Fengmao Lv, Changru Nie, Jianyang Zhang, Guowu Yang, Guosheng Lin, Xiao Wu, Tianrui Li:
Rethinking the Effect of Uninformative Class Name in Prompt Learning. 8345-8354
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangS24
Yuhan Wang, Mofei Song:
UniL: Point Cloud Novelty Detection through Multimodal Pre-training. 8355-8364
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiaoLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiaoLW24
Zeyu Xiao, Zhihe Lu, Xinchao Wang:
P-BiC: Ultra-High-Definition Image Moiré Patterns Removal via Patch Bilateral Compensation. 8365-8373
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangYGYY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangYGYY24
Jing Yang, Shundong Yang, Yuan Gao, Jieming Yang, Laurence T. Yang:
Multimodal Contextual Interactions of Entities: A Modality Circular Fusion Approach for Link Prediction. 8374-8382
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TanLPQPQWS0H24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TanLPQPQWS0H24
Chaolei Tan, Zihang Lin, Junfu Pu, Zhongang Qi, Wei-Yi Pei, Zhi Qu, Yexin Wang, Ying Shan, Wei-Shi Zheng, Jian-Fang Hu:
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses. 8383-8392
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWLB0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWLB0024
Buyu Liu, Kai Wang, Yansong Liu, Jun Bao, Tingting Han, Jun Yu:
MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability. 8393-8401
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWAYTSCC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWAYTSCC24
Junzhang Liu, Zhecan Wang, Hammad A. Ayyubi, Haoxuan You, Chris Thomas, Rui Sun, Shih-Fu Chang, Kai-Wei Chang:
Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions. 8402-8411
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangG0L0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangG0L0Z24
Yingchun Wang, Jingcai Guo, Song Guo, Yi Liu, Jie Zhang, Weizhan Zhang:
SFP: Spurious Feature-Targeted Pruning for Out-of-Distribution Generalization. 8412-8420
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiDXWCJZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiDXWCJZ24
Yao Li, Jiajun Deng, Yuxuan Xiao, Yingjie Wang, Xiaomeng Chu, Jianmin Ji, Yanyong Zhang:
FARFusion V2: A Geometry-based Radar-Camera Fusion Method on the Ground for Roadside Far-Range 3D Object Detection. 8421-8430
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangJDYF0Z0LZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangJDYF0Z0LZ24
Fangdi Wang, Jiaqi Jin, Zhibin Dong, Xihong Yang, Yu Feng, Xinwang Liu, Xinzhong Zhu, Siwei Wang, Tianrui Liu, En Zhu:
View Gap Matters: Cross-view Topology and Information Decoupling for Multi-view Clustering. 8431-8440
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WeiLBXCRWZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WeiLBXCRWZ024
Wenjie Wei, Yu Liang, Ammar Belatreche, Yichen Xiao, Honglin Cao, Zhenbang Ren, Guoqing Wang, Malu Zhang, Yang Yang:
Q-SNNs: Quantized Spiking Neural Networks. 8441-8450
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang024a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang024a
Shihua Zhang, Jiayi Ma:
DiffGlue: Diffusion-Aided Image Feature Matching. 8451-8460
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiSLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiSLZ24
Xueyang Li, Yu Song, Yunzhong Lou, Xiangdong Zhou:
CAD Translator: An Effective Drive for Text to 3D Parametric Computer-Aided Design Generative Modeling. 8461-8470
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuCFRHCZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuCFRHCZ24
Weichen Xu, Jian Cao, Tianhao Fu, Ruilong Ren, Zicong Hu, Xixin Cao, Xing Zhang:
Point Cloud Reconstruction Is Insufficient to Learn 3D Representations. 8471-8479
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuCZFYSQWY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuCZFYSQWY24
Xiao Yu, Kejiang Chen, Kai Zeng, Han Fang, Zijin Yang, Xiuwei Shang, Yuang Qi, Weiming Zhang, Nenghai Yu:
SemGIR: Semantic-Guided Image Regeneration Based Method for AI-generated Image Detection and Attribution. 8480-8488
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiaoLZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiaoLZ024
Jiahua Xiao, Yang Liu, Shizhou Zhang, Xing Wei:
Bridging Fourier and Spatial-Spectral Domains for Hyperspectral Image Denoising. 8489-8497
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiaXZCWY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiaXZCWY24
Heng Jia, Yunqiu Xu, Linchao Zhu, Guang Chen, Yufei Wang, Yi Yang:
MoS²: Mixture of Scale and Shift Experts for Text-Only Video Captioning. 8498-8507
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangHZLF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangHZLF24
Qi Zhang, Chi Huang, Qian Zhang, Nan Li, Wei Feng:
Learning Geometry Consistent Neural Radiance Fields from Sparse and Unposed Views. 8508-8517
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FangDCW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FangDCW24
Zihan Fang, Shide Du, Yuhong Chen, Shiping Wang:
Beyond the Known: Ambiguity-Aware Multi-view Learning. 8518-8526
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangDL0LL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangDL0LL24
Jingchao Wang, Zhengnan Deng, Tongxu Lin, Wenyuan Li, Shaobin Ling, Junyu Lin:
Beyond Direct Relationships: Exploring Multi-Order Label Pair Dependencies for Knowledge Distillation. 8527-8535
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiJYD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiJYD024
Yuhang Li, Jincen Jiang, Xiaosong Yang, Youdong Ding, Jian Jun Zhang:
Harmony Everything! Masked Autoencoders for Video Harmonization. 8536-8545
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TangDYYY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TangDYYY024
Linfeng Tang, Yuxin Deng, Xunpeng Yi, Qinglong Yan, Yixuan Yuan, Jiayi Ma:
DRMF: Degradation-Robust Multi-Modal Image Fusion via Composable Diffusion Prior. 8546-8555
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenWPTCZXY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenWPTCZXY24
Jintao Chen, Fan Wang, Shengye Pang, Siwei Tan, Mingshuai Chen, Tiancheng Zhao, Meng Xi, Jianwei Yin:
UniGM: Unifying Multiple Pre-trained Graph Models via Adaptive Knowledge Aggregation. 8556-8565
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wu0X24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wu0X24
Ziyue Wu, Junyu Gao, Changsheng Xu:
Open-Vocabulary Video Scene Graph Generation via Union-aware Semantic Alignment. 8566-8575
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengC0LWLJT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengC0LWLJT24
Li Zheng, Boyu Chen, Hao Fei, Fei Li, Shengqiong Wu, Lizi Liao, Donghong Ji, Chong Teng:
Self-Adaptive Fine-grained Multi-modal Data Augmentation for Semi-supervised Muti-modal Coreference Resolution. 8576-8585
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuoFN024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuoFN024
Daqin Luo, Chengjian Feng, Yuxuan Nong, Yiqing Shen:
AutoM³L: An Automated Multimodal Machine Learning Framework with Large Language Models. 8586-8594
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangXY00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangXY00024
Xu Zhang, Zhipeng Xie, Haiyang Yu, Qitong Wang, Peng Wang, Wei Wang:
Enhancing Adaptive Deep Networks for Image Classification via Uncertainty-aware Decision Fusion. 8595-8603
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZ0024
Ran Wang, Hua Zuo, Zhen Fang, Jie Lu:
Towards Robustness Prompt Tuning with Fully Test-Time Adaptation for CLIP's Zero-Shot Generalization. 8604-8612
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangSWZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangSWZ24
Lijun Zhang, Wei Suo, Peng Wang, Yanning Zhang:
A Plug-and-Play Method for Rare Human-Object Interactions Detection by Bridging Domain Gap. 8613-8622
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WeiY0DC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WeiY0DC24
Haojie Wei, Jun Yuan, Rui Zhang, Quanyu Dai, Yueguo Chen:
MAJL: A Model-Agnostic Joint Learning Framework for Music Source Separation and Pitch Estimation. 8623-8632
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuYZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuYZ24
Binbin Xu, Jun Yin, Nan Zhang:
Graph based Consistency Learning for Contrastive Multi-View Clustering. 8633-8641
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoL24a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoL24a
Jiaxin Gao, Yaohua Liu:
Enhancing Images with Coupled Low-Resolution and Ultra-Dark Degradations: A Tri-level Learning Framework. 8642-8651
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QuWL0FXLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QuWL0FXLZ24
Qian Qu, Xinhang Wan, Weixuan Liang, Jiyuan Liu, Yu Feng, Huiying Xu, Xinwang Liu, En Zhu:
A Lightweight Anchor-Based Incremental Framework for Multi-view Clustering. 8652-8661
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuX00Q24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuX00Q24
Yao Wu, Mingwei Xing, Yachao Zhang, Yuan Xie, Yanyun Qu:
CLIP2UDA: Making Frozen CLIP Reward Unsupervised Domain Adaptation in 3D Semantic Segmentation. 8662-8671
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuLZH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuLZH024
Zongqian Wu, Yujing Liu, Mengmeng Zhan, Ping Hu, Xiaofeng Zhu:
Adaptive Multi-Modality Prompt Learning. 8672-8680
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang0LHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang0LHZ24
Shiwei Zhang, Wei Ke, Shuai Liu, Xiaopeng Hong, Tong Zhang:
Boosting Semi-supervised Crowd Counting with Scale-based Active Learning. 8681-8690
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoZHLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoZHLH24
Yingjie Gao, Yanan Zhang, Ziyue Huang, Nanqing Liu, Di Huang:
PS-TTL: Prototype-based Soft-labels and Test-Time Learning for Few-shot Object Detection. 8691-8700
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Yuan0H24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Yuan0H24
Li Yuan, Yi Cai, Junsheng Huang:
Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model. 8701-8710
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangXJDH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangXJDH24
Yijia Wang, Qianqian Xu, Yangbangyan Jiang, Siran Dai, Qingming Huang:
Regularized Contrastive Partial Multi-view Outlier Detection. 8711-8720
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLZCC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLZCC024
Rui Liu, Mingjie Li, Shen Zhao, Ling Chen, Xiaojun Chang, Lina Yao:
In-Context Learning for Zero-shot Medical Report Generation. 8721-8730
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZouYCH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZouYCH24
Guoliang Zou, Yangdong Ye, Tongji Chen, Shizhe Hu:
Learning Dual Enhanced Representation for Contrastive Multi-view Clustering. 8731-8739
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoXW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoXW24
Yang Zhao, Gangwei Xu, Gang Wu:
Hybrid Cost Volume for Memory-Efficient Optical Flow. 8740-8749
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLC0X24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLC0X24
Xiao-Qian Liu, Minghui Liu, Zhen-Duo Chen, Xin Luo, Xin-Shun Xu:
Hierarchical Multi-label Learning for Incremental Multilingual Text Recognition. 8750-8758
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0024
Yuzhuo Wang, Junwei He, Hongzhi Wang:
RHKH: Relational Hypergraph Neural Network for Link Prediction on N-ary Knowledge Hypergraph. 8759-8767
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LanC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LanC24
Fengbo Lan, Chang Wen Chen:
Understanding and Tackling Scattering and Reflective Flare for Mobile Camera Systems. 8768-8776
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoCZLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoCZLW24
Ziyu Zhao, Pingping Cai, Canyu Zhang, Xiaoguang Li, Song Wang:
Crossmodal Few-shot 3D Point Cloud Semantic Segmentation via View Synthesis. 8777-8785
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengLZ0ZL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengLZ0ZL024
Jinkai Zheng, Xinchen Liu, Boyue Zhang, Chenggang Yan, Jiyong Zhang, Wu Liu, Yongdong Zhang:
It Takes Two: Accurate Gait Recognition in the Wild via Cross-granularity Alignment. 8786-8794
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangZWSH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangZWSH024
Kenan Huang, Junbao Zhuo, Shuhui Wang, Chi Su, Qingming Huang, Huimin Ma:
Unsupervised Image-to-Video Adaptation via Category-aware Flow Memory Bank and Realistic Video Generation. 8795-8804
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TangJSZC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TangJSZC024
Lv Tang, Peng-Tao Jiang, Zhihao Shen, Hao Zhang, Jin-Wei Chen, Bo Li:
Chain of Visual Perception: Harnessing Multimodal Large Language Models for Zero-shot Camouflaged Object Detection. 8805-8814
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Liao0CF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Liao0CF24
Xinyao Liao, Wei Wei, Dangyang Chen, Yuanyuan Fu:
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation. 8815-8824
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZHZ24
Siyang Wang, Jinghao Zhang, Jie Huang, Feng Zhao:
Image-free Pre-training for Low-Level Vision. 8825-8834
- view
  authority control:
- export record
  dblp key:
  - conf/mm/RuanGXXYLFQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/RuanGXXYLFQ24
Jiacheng Ruan, Jingsheng Gao, Mingye Xie, Suncheng Xiang, Zefang Yu, Ting Liu, Yuzhuo Fu, Xiaoye Qu:
GIST: Improving Parameter Efficient Fine-Tuning via Knowledge Interaction. 8835-8844
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoCLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoCLW24
Xuechen Guo, Wenhao Chai, Shiyan Li, Gaoang Wang:
LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound. 8845-8854
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanZWZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanZWZW24
Xiao Han, Zhenduo Zhang, Yiling Wu, Xinfeng Zhang, Zhe Wu:
Event Traffic Forecasting with Sparse Multimodal Data. 8855-8864
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuMTCWM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuMTCWM24
Wanru Xu, Zhenjiang Miao, Yi Tian, Yigang Cen, Lili Wan, Xiaole Ma:
Probabilistic Distillation Transformer: Modelling Uncertainties for Visual Abductive Reasoning. 8865-8873
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLTGYW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLTGYW24
Shiye Wang, Changsheng Li, Jialin Tang, Xing Gong, Ye Yuan, Guoren Wang:
Importance-aware Shared Parameter Subspace Learning for Domain Incremental Learning. 8874-8883
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZ24
Chengshun Wang, Na Zhao:
GS²-GNeSF: Geometry-Semantics Synergy for Generalizable Neural Semantic Fields. 8884-8892
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DuSCZQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DuSCZQ24
Liang Du, Yukai Shi, Yan Chen, Peng Zhou, Yuhua Qian:
Fast and Scalable Incomplete Multi-View Clustering with Duality Optimal Graph Filtering. 8893-8902
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeZMGG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeZMGG24
Zhilin He, Yawei Zhang, Jingchang Mu, Xiaoyue Gu, Tianhao Gu:
LiteGfm: A Lightweight Self-supervised Monocular Depth Estimation Framework for Artifacts Reduction via Guided Image Filtering. 8903-8912
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangLCQZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangLCQZ24
Chengyi Yang, Wentao Liu, Shisong Chen, Jiayin Qi, Aimin Zhou:
Generating Prompts in Latent Space for Rehearsal-free Continual Learning. 8913-8922
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DingP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DingP24
Choubo Ding, Guansong Pang:
Improving Out-of-Distribution Detection with Disentangled Foreground and Background Features. 8923-8931
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuRSC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuRSC24
Yi Lu, Shenghao Ren, Qiu Shen, Xun Cao:
Leveraging RGB-Pressure for Whole-body Human-to-Humanoid Motion Imitation. 8932-8941
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangHZYWWW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangHZYWWW24
Li Zhang, Zean Han, Yan Zhong, Qiaojun Yu, Xingyu Wu, Xue Wang, Rujing Wang:
VoCAPTER: Voting-based Pose Tracking for Category-level Articulated Object via Inter-frame Priors. 8942-8951
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0002HZLTG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0002HZLTG24
Jinpeng Yu, Binbin Huang, Yuxuan Zhang, Huaxia Li, Xu Tang, Shenghua Gao:
GeoFormer: Learning Point Cloud Completion with Tri-Plane Integrated Transformer. 8952-8961
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wu0YHFJYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wu0YHFJYL24
Sifan Wu, Haipeng Chen, Yifang Yin, Sihao Hu, Runyang Feng, Yingying Jiao, Ziqi Yang, Zhenguang Liu:
Joint-Motion Mutual Learning for Pose Estimation in Video. 8962-8971
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangWFLGJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangWFLGJ24
Jiaqi Wang, Pichao Wang, Yi Feng, Huafeng Liu, Chang Gao, Liping Jing:
Align2Concept: Language Guided Interpretable Image Recognition by Visual Prototype and Textual Concept Alignment. 8972-8981
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xiao0HL0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xiao0HL0Z24
Siying Xiao, Mao Ye, Qichen He, Shuaifeng Li, Song Tang, Xiatian Zhu:
Adversarial Experts Model for Black-box Domain Adaptation. 8982-8991
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WeiCLD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WeiCLD24
Yayun Wei, Lei Cao, Hao Li, Yilin Dong:
MB2C: Multimodal Bidirectional Cycle Consistency for Learning Robust Visual Neural Representations. 8992-9000
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangYD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangYD24
Qiang Wang, Ke Yan, Shouhong Ding:
Bilateral Adaptive Cross-Modal Fusion Prompt Learning for CLIP. 9001-9009
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoWLS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoWLS24
Yifei Gao, Jiaqi Wang, Zhiyu Lin, Jitao Sang:
AIGCs Confuse AI Too: Investigating and Explaining Synthetic Image-induced Hallucinations in Large Vision-Language Models. 9010-9018
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuZLC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuZLC024
Haizhuang Liu, Junbao Zhuo, Chen Liang, Jiansheng Chen, Huimin Ma:
Affinity3D: Propagating Instance-Level Semantic Affinity for Zero-Shot Point Cloud Semantic Segmentation. 9019-9028
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiZ024
Zhaojian Li, Bin Zhao, Yuan Yuan:
TAS: Personalized Text-guided Audio Spatialization. 9029-9037
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CaoZYLMZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CaoZYLMZ24
Congqi Cao, Yueran Zhang, Yating Yu, Qinyi Lv, Lingtong Min, Yanning Zhang:
Task-Adapter: Task-specific Adaptation of Image Models for Few-shot Action Recognition. 9038-9047
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiLJLJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiLJLJ24
Quanjiang Li, Tingjin Luo, Mingdie Jiang, Jiahui Liao, Zhangqi Jiang:
Deep Incomplete Multi-View Network Semi-Supervised Multi-Label Learning with Unbiased Loss. 9048-9056
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWZ024
Xinyue Liu, Jiahui Wan, Linlin Zong, Bo Xu:
Conditional Diffusion Model for Open-ended Video Question Answering. 9057-9066
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeW0XT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeW0XT24
Yulin He, Siqi Wang, Wei Chen, Tianci Xun, Yusong Tan:
Sniffing Threatening Open-World Objects in Autonomous Driving by Open-Vocabulary Models. 9067-9076
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunLLM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunLLM24
Haosen Sun, Yiming Li, Xixiang Lyu, Jing Ma:
Learning from Distinction: Mitigating Backdoors Using a Low-Capacity Model. 9077-9086
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0006ZS0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0006ZS0024
Shen Lin, Xiaoyu Zhang, Willy Susilo, Xiaofeng Chen, Jun Liu:
GDR-GMA: Machine Unlearning via Direction-Rectified and Magnitude-Adjusted Gradients. 9087-9095
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoCZFSZZZSCJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoCZFSZZZSCJ24
Timin Gao, Peixian Chen, Mengdan Zhang, Chaoyou Fu, Yunhang Shen, Yan Zhang, Shengchuan Zhang, Xiawu Zheng, Xing Sun, Liujuan Cao, Rongrong Ji:
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM. 9096-9105
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiTXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiTXL24
Shijie Li, Yunbin Tu, Qingyuan Xiang, Zheng Li:
MAGIC: Rethinking Dynamic Convolution Design for Medical Image Segmentation. 9106-9115
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZHLCD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZHLCD24
Chao Wang, Yang Zhou, Liangtian He, Fenglai Lin, Hongming Chen, Liang-Jian Deng:
Illumination Distribution Prior for Low-light Image Enhancement. 9116-9125
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FuLQGWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FuLQGWL24
Pinhan Fu, Xinyan Liang, Yuhua Qian, Qian Guo, Zhifang Wei, Wen Li:
CoMO-NAS: Core-Structures-Guided Multi-Objective Neural Architecture Search for Multi-Modal Classification. 9126-9135
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLMXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLMXL24
Yi Liu, Jiachen Li, Yanchun Ma, Qing Xie, Yongjian Liu:
HcaNet: Haze-concentration-aware Network for Real-scene Dehazing with Codebook Priors. 9136-9144
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiaoQLCWLYHP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiaoQLCWLYHP24
Wenlong Liao, Sunyuan Qiang, Xianfei Li, Xiaolei Chen, Haoyu Wang, Yanyan Liang, Junchi Yan, Tao He, Pai Peng:
CalibRBEV: Multi-Camera Calibration via Reversed Bird's-eye-view Representations for Autonomous Driving. 9145-9154
- view
  authority control:
- export record
  dblp key:
  - conf/mm/IslamRASB024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/IslamRASB024
Md Tanvir Islam, Nasir Rahim, Saeed Anwar, Muhammad Saqib, Sambit Bakshi, Khan Muhammad:
HazeSpace2M: A Dataset for Haze Aware Single Image Dehazing. 9155-9164
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenLHW0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenLHW0Y24
Xiaojun Chen, Jimeng Lou, Wenxi Huang, Ting Wan, Qin Zhang, Min Yang:
ReCoS: A Novel Benchmark for Cross-Modal Image-Text Retrieval in Complex Real-Life Scenarios. 9165-9174
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangLCMX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangLCMX24
Shicheng Yang, Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Jing-Hao Xue:
Channel-Spatial Support-Query Cross-Attention for Fine-Grained Few-Shot Image Classification. 9175-9183
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangMFLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangMFLZ24
Xiaorui Jiang, Zhongyi Ma, Yulin Fu, Yong Liao, Pengyuan Zhou:
Heterogeneity-Aware Federated Deep Multi-View Clustering towards Diverse Feature Representations. 9184-9193
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangCCZ0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangCCZ0Y24
Jiyuan Zhang, Kang Chen, Shiyan Chen, Yajing Zheng, Tiejun Huang, Zhaofei Yu:
SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion. 9194-9203
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangCZ0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangCZ0Y24
Jiangyi Wang, Zhongyao Cheng, Na Zhao, Jun Cheng, Xulei Yang:
On-the-fly Point Feature Representation for Point Clouds Analysis. 9204-9213
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLJLHN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLJLHN24
Kun Wang, Hao Liu, Lirong Jie, Zixu Li, Yupeng Hu, Liqiang Nie:
Explicit Granularity and Implicit Scale Correspondence Learning for Point-Supervised Video Moment Localization. 9214-9223
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuJLLSY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuJLLSY024
Shaoqing Xu, Shengyin Jiang, Fang Li, Li Liu, Ziying Song, Bo Yang, Zhixin Yang:
SparseInteraction: Sparse Semantic Guidance for Radar and Camera 3D Object Detection. 9224-9233
- view
  authority control:
- export record
  dblp key:
  - conf/mm/UkaiK0UI24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/UkaiK0UI24
Mahiro Ukai, Shuhei Kurita, Atsushi Hashimoto, Yoshitaka Ushiku, Nakamasa Inoue:
AdaCoder: Adaptive Prompt Compression for Programmatic Visual Question Answering. 9234-9243
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoXLD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoXLD24
Shengwei Zhao, Linhai Xu, Yuying Liu, Shaoyi Du:
Multi-grained Correspondence Learning of Audio-language Models for Few-shot Audio Recognition. 9244-9252
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuWC00P24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuWC00P24
Song Wu, Xiaoyu Wei, Xinyue Chen, Yazhou Ren, Jing He, Xiaorong Pu:
Cross-View Mutual Learning for Semi-Supervised Medical Image Segmentation. 9253-9261
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QiZ0BL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QiZ0BL24
Yunshan Qi, Lin Zhu, Yifan Zhao, Nan Bao, Jia Li:
Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment. 9262-9270
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xiu00CZ0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xiu00CZ0Z24
Jingqiao Xiu, Mengze Li, Wei Ji, Jingyuan Chen, Hanbin Zhao, Shin'ichi Satoh, Roger Zimmermann:
Hierarchical Debiasing and Noisy Correction for Cross-domain Video Tube Retrieval. 9271-9280
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinL0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinL0W24
Wenyu Yin, Shuyuan Lin, Yang Lu, Hanzi Wang:
Diverse Consensuses Paired with Motion Estimation-Based Multi-Model Fitting. 9281-9290
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuZ0X024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuZ0X024
Andong Lu, Jiacong Zhao, Chenglong Li, Yun Xiao, Bin Luo:
Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation. 9291-9300
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuZPYYWZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuZPYYWZ24
Peng Wu, Xuerong Zhou, Guansong Pang, Zhiwei Yang, Qingsen Yan, Peng Wang, Yanning Zhang:
Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts. 9301-9310
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Luo0LZXLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Luo0LZXLC24
Pengfei Luo, Tong Xu, Che Liu, Suojuan Zhang, Linli Xu, Minglei Li, Enhong Chen:
Bridging Gaps in Content and Knowledge for Multimodal Entity Linking. 9311-9320
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TangLWWLSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TangLWWLSL24
Shiyu Tang, Zhaofan Luo, Yifan Wang, Lijun Wang, Huchuan Lu, Weibo Su, Libo Liu:
LOVD: Large-and-Open Vocabulary Object Detection. 9321-9329
- view
  authority control:
- export record
  dblp key:
  - conf/mm/NguyenLML24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/NguyenLML24
Cam-Van Thi Nguyen, The-Son Le, Anh-Tuan Mai, Duc-Trong Le:
Ada2I: Enhancing Modality Balance for Multimodal Conversational Emotion Recognition. 9330-9339
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiWZMWZP024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiWZMWZP024
Xinpeng Li, Teng Wang, Jian Zhao, Shuyi Mao, Jinbao Wang, Feng Zheng, Xiaojiang Peng, Xuelong Li:
Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer. 9340-9349
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangTMWDTD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangTMWDTD024
Jingjia Huang, Jingyan Tu, Ge Meng, Yingying Wang, Yuhang Dong, Xiaotong Tu, Xinghao Ding, Yue Huang:
Efficient Perceiving Local Details via Adaptive Spatial-Frequency Information Integration for Multi-focus Image Fusion. 9350-9359
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChoKCC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChoKCC24
Wonwoo Cho, Kangyeol Kim, Saemee Choi, Jaegul Choo:
Training Spatial-Frequency Visual Prompts and Probabilistic Clusters for Accurate Black-Box Transfer Learning. 9360-9368
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0003GZTL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0003GZTL24
Ning Xu, Yifei Gao, Ting-Ting Zhang, Hongshuo Tian, An-An Liu:
Cross-Modal Coherence-Enhanced Feedback Prompting for News Captioning. 9369-9377
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiDCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiDCL24
Yuzhen Li, Zehang Deng, Yuxin Cao, Lihua Liu:
GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution. 9378-9386
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PuLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PuLC24
Muxin Pu, Mei Kuan Lim, Chun Yong Chong:
Siformer: Feature-isolated Transformer for Efficient Skeleton-based Sign Language Recognition. 9387-9396
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DuanGY0MS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DuanGY0MS24
Yue Duan, Zhangxuan Gu, Zhenzhe Ying, Lei Qi, Changhua Meng, Yinghuan Shi:
PC²: Pseudo-Classification Based Pseudo-Captioning for Noisy Correspondence Learning in Cross-Modal Retrieval. 9397-9406
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0010W0DG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0010W0DG24
Wei Feng, Zhenwei Wu, Qianqian Wang, Bo Dong, Quanxue Gao:
Federated Fuzzy C-means with Schatten-p Norm Minimization. 9407-9416
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WanXLGFDW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WanXLGFDW24
Tianjiao Wan, Kele Xu, Long Lan, Zijian Gao, Dawei Feng, Bo Ding, Huaimin Wang:
Tracing Training Progress: Dynamic Influence Based Selection for Active Learning. 9417-9425
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoNQQSYXCY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoNQQSYXCY24
Ruohao Guo, Dantong Niu, Liao Qu, Yanyu Qi, Ji Shi, Wenzhen Yue, Bowei Xing, Taiyan Chen, Xianghua Ying:
Instance-Level Panoramic Audio-Visual Saliency Detection and Ranking. 9426-9434
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinYXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinYXL24
Shenglin Yin, Kelu Yao, Zhen Xiao, Jieyi Long:
Embracing Adaptation: An Effective Dynamic Defense Strategy Against Adversarial Examples. 9435-9444
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangCLDZ0GFZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangCLDZ0GFZ24
Zitong Huang, Ze Chen, Yuanze Li, Bowen Dong, Erjin Zhou, Yong Liu, Rick Siow Mong Goh, Chun-Mei Feng, Wangmeng Zuo:
Class Balance Matters to Active Class-Incremental Learning. 9445-9454
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangKF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangKF24
Hao Zhang, Ee Yeo Keat, Basura Fernando:
RCA: Region Conditioned Adaptation for Visual Abductive Reasoning. 9455-9464
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Jiang-LinHLHLWS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Jiang-LinHLHLWS24
Jian-Yu Jiang-Lin, Kang-Yang Huang, Ling Lo, Yi-Ning Huang, Terence Lin, Jhih-Ciang Wu, Hong-Han Shuai, Wen-Huang Cheng:
ReCorD: Reasoning and Correcting Diffusion for HOI Generation. 9465-9474
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangDJC0F24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangDJC0F24
Xiaze Zhang, Ziheng Ding, Qi Jing, Ying Cheng, Wenchao Ding, Rui Feng:
DeepPointMap2: Accurate and Robust LiDAR-Visual SLAM with Neural Descriptors. 9475-9484
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiHDZ0WH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiHDZ0WH024
Hongyu Li, Tianrui Hui, Zihan Ding, Jing Zhang, Bin Ma, Xiaoming Wei, Jizhong Han, Si Liu:
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding. 9485-9494
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuK0HSLGS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuK0HSLGS24
Hengde Zhu, Xiangyu Kong, Weicheng Xie, Xin Huang, Linlin Shen, Lu Liu, Hatice Gunes, Siyang Song:
PerFRDiff: Personalised Weight Editing for Multiple Appropriate Facial Reaction Generation. 9495-9504
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLZZXB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLZZXB24
Shiqin Liu, Chaozhuo Li, Xi Zhang, Minjun Zhao, Yuanbo Xu, Jiajun Bu:
Deeply Fusing Semantics and Interactions for Item Representation Learning via Topology-driven Pre-training. 9505-9514
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengWLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengWLL24
Yongsen Zheng, Guohua Wang, Yang Liu, Liang Lin:
Diversity Matters: User-Centric Multi-Interest Learning for Conversational Movie Recommendation. 9515-9524
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Shi024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Shi024
Yuanchen Shi, Fang Kong:
Integrating Stickers into Multimodal Dialogue Summarization: A Novel Dataset and Approach for Enhancing Social Media Interaction. 9525-9534
- view
  authority control:
- export record
  dblp key:
  - conf/mm/OncescuHK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/OncescuHK24
Andreea-Maria Oncescu, João F. Henriques, A. Sophia Koepke:
Dissecting Temporal Understanding in Text-to-Audio Retrieval. 9535-9543
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SuHZX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SuHZX24
Yuhang Su, Wei Hu, Fan Zhang, Qiming Xu:
AMG-Embedding: A Self-Supervised Embedding Approach for Audio Identification. 9544-9553
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiYLLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiYLLY24
Xue Li, Jiong Yu, Ziyang Li, Hongchun Lu, Ruifeng Yuan:
Dr. CLIP: CLIP-Driven Universal Framework for Zero-Shot Sketch Image Retrieval. 9554-9562
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuangCZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuangCZ024
Yan Zhuang, Yanlu Cai, Weizhong Zhang, Cheng Jin:
Future Motion Dynamic Modeling via Hybrid Supervision for Multi-Person Motion Prediction Uncertainty Reduction. 9563-9572
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZHFHSFW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZHFHSFW24
Yupeng Zhang, Shuqi Zheng, Ruize Han, Yuzhong Feng, Junhui Hou, Linqi Song, Wei Feng, Liang Wan:
Rethinking the One-shot Object Detection: Cross-Domain Object Search. 9573-9581
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuMHZZDL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuMHZZDL24
Yuhan Wu, Xiyu Meng, Yang He, Junru Zhang, Haowen Zhang, Yabo Dong, Dongming Lu:
Multi-view Self-Supervised Contrastive Learning for Multivariate Time Series. 9582-9590
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinWLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinWLL24
Dongding Lin, Jian Wang, Chak Tou Leong, Wenjie Li:
SCREEN: A Benchmark for Situated Conversational Recommendation. 9591-9600
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuCLWWCLJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuCLWWCLJ24
Xiaowan Hu, Yiyi Chen, Yan Li, Minquan Wang, Haoqian Wang, Quan Chen, Han Li, Peng Jiang:
Spatiotemporal Graph Guided Multi-modal Network for Livestreaming Product Retrieval. 9601-9610
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LvHZZZCZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LvHZZZCZ024
Zheqi Lv, Shaoxuan He, Tianyu Zhan, Shengyu Zhang, Wenqiao Zhang, Jingyuan Chen, Zhou Zhao, Fei Wu:
Semantic Codebook Learning for Dynamic Recommendation Models. 9611-9620
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TuXLWZX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TuXLWZX24
Geng Tu, Feng Xiong, Bin Liang, Hui Wang, Xi Zeng, Ruifeng Xu:
Multimodal Emotion Recognition Calibration in Conversations. 9621-9630
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiaLRJPY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiaLRJPY24
Wuyou Xia, Shengzhe Liu, Rong Qin, Guoli Jia, Eunil Park, Jufeng Yang:
Perceive before Respond: Improving Sticker Response Selection by Emotion Distillation and Hard Mining. 9631-9640
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaHZWZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaHZWZC24
Yunshan Ma, Yingzhi He, Wenjun Zhong, Xiang Wang, Roger Zimmermann, Tat-Seng Chua:
CIRP: Cross-Item Relational Pre-training for Multimodal Product Bundling. 9641-9649
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoH00SX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoH00SX24
Zixian Gao, Disen Hu, Xun Jiang, Huimin Lu, Heng Tao Shen, Xing Xu:
Enhanced Experts with Uncertainty-Aware Routing for Multimodal Sentiment Analysis. 9650-9659
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiLW0NK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiLW0NK24
Zhenyang Li, Fan Liu, Yinwei Wei, Zhiyong Cheng, Liqiang Nie, Mohan S. Kankanhalli:
Attribute-driven Disentangled Representation Learning for Multimodal Recommendation. 9660-9669
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FuZZ0C0YX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FuZZ0C0YX24
Ting Fu, Yu-Wei Zhan, Chong-Yu Zhang, Xin Luo, Zhen-Duo Chen, Yongxin Wang, Xun Yang, Xin-Shun Xu:
FedCAFE: Federated Cross-Modal Hashing with Adaptive Feature Enhancement. 9670-9679
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0011YLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0011YLZ24
Feng Zhu, Xinxing Yang, Longfei Li, Jun Zhou:
An Active Masked Attention Framework for Many-to-Many Cross-Domain Recommendations. 9680-9689
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QiZHLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QiZHLW24
Zehao Qi, Ruixu Zhang, Xinyi Hu, Wenxuan Liu, Zheng Wang:
Predicting the Unseen: A Novel Dataset for Hidden Intention Localization in Pre-abnormal Analysis. 9690-9698
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZH24
Ding Wang, Wei Zhou, Songlin Hu:
Information Diffusion Prediction with Graph Neural Ordinary Differential Equation Network. 9699-9708
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenWHCLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenWHCLH24
Jian Chen, Wei Wang, Yuzhu Hu, Junxin Chen, Han Liu, Xiping Hu:
TGCA-PVT: Topic-Guided Context-Aware Pyramid Vision Transformer for Sticker Emotion Recognition. 9709-9718
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Yang0THLGHJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Yang0THLGHJ24
Rui Yang, Shuang Wang, Jianwei Tao, Yingping Han, Qiaoling Lin, Yanhe Guo, Biao Hou, Licheng Jiao:
Accurate and Lightweight Learning for Specific Domain Image-Text Retrieval. 9719-9728
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoQF0T24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoQF0T24
Xianbing Zhao, Lizhen Qu, Tao Feng, Jianfei Cai, Buzhou Tang:
Learning in Order! A Sequential Strategy to Learn Invariant Features for Multimodal Sentiment Analysis. 9729-9738
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZXL24
Yutong Wang, Sidan Zhu, Hongteng Xu, Dixin Luo:
An Inverse Partial Optimal Transport Framework for Music-guided Trailer Generation. 9739-9748
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zheng0DL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zheng0DL24
Haonan Zheng, Wen Jiang, Xinyang Deng, Wenrui Li:
Sample-agnostic Adversarial Perturbation for Vision-Language Pre-training Models. 9749-9758
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenWSLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenWSLY24
Jiade Chen, Jin Wang, Yunhui Shi, Nam Ling, Baocai Yin:
MVP-Net: Multi-View Depth Image Guided Cross-Modal Distillation Network for Point Cloud Upsampling. 9759-9768
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhao0FZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhao0FZ24
Zuoyan Zhao, Hui Xue, Pengfei Fang, Shipeng Zhu:
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution. 9769-9778
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangLL0XLHDTY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangLL0XLHDTY24
Yuzhi Huang, Chenxin Li, Zixu Lin, Hengyu Liu, Haote Xu, Yifan Liu, Yue Huang, Xinghao Ding, Xiaotong Tu, Yixuan Yuan:
P²SAM: Probabilistically Prompted SAMs Are Efficient Segmentator for Ambiguous Medical Images. 9779-9788
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YiZHLR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YiZHLR24
Ran Yi, Haokun Zhu, Teng Hu, Yu-Kun Lai, Paul L. Rosin:
AesStyler: Aesthetic Guided Universal Style Transfer. 9789-9798
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangWQYQWZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangWQYQWZ24
Wenxuan Wang, Chenglei Wang, Huihui Qi, Menghao Ye, Xuelin Qian, Peng Wang, Yanning Zhang:
Sustainable Self-evolution Adversarial Training. 9799-9808
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QiaoD0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QiaoD0024
Jian-Jun Qiao, Meng-Yu Duan, Xiao Wu, Wei Li:
CAPNet: Cartoon Animal Parsing with Spatial Learning and Structural Modeling. 9809-9817
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangXLYLXZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangXLYLXZ24
Xuanyu Zhang, Youmin Xu, Runyi Li, Jiwen Yu, Weiqi Li, Zhipei Xu, Jian Zhang:
V²A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection. 9818-9827
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhongHLHDY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhongHLHDY024
Xian Zhong, Shengwang Hu, Wenxuan Liu, Wenxin Huang, Jianhao Ding, Zhaofei Yu, Tiejun Huang:
Towards Low-latency Event-based Visual Recognition with Hybrid Step-wise Distillation Spiking Neural Networks. 9828-9836
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShiJL0CM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShiJL0CM24
Junqi Shi, Mingyi Jiang, Ming Lu, Tong Chen, Xun Cao, Zhan Ma:
HINER: Neural Representation for Hyperspectral Image. 9837-9846
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuXDWZLHJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuXDWZLHJ24
Yaqiang Wu, Zhen Xu, Yong Duan, Yanlai Wu, Qinghua Zheng, Hui Li, Xiaochen Hu, Lianwen Jin:
RDLNet: A Novel and Accurate Real-world Document Localization Method. 9847-9855
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TengSXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TengSXL24
Xiao Teng, Xingyu Shen, Kele Xu, Long Lan:
Enhancing Unsupervised Visible-Infrared Person Re-Identification with Bidirectional-Consistency Gradual Matching. 9856-9865
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang0LW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang0LW24
Zhen Zhang, Jing Xiao, Liang Liao, Mi Wang:
RefScale: Multi-temporal Assisted Image Rescaling in Repetitive Observation Scenarios. 9866-9874
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeB0ZHF0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeB0ZHF0024
Chaoxiang He, Xiaofan Bai, Xiaojing Ma, Bin B. Zhu, Pingyi Hu, Jiayun Fu, Hai Jin, Dongmei Zhang:
Towards Stricter Black-box Integrity Verification of Deep Neural Network Models. 9875-9884
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenZD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenZD24
Peibin Chen, Xijin Zhang, Daniel Kang Du:
SimpliGuard: Robust Mesh Simplification In the Wild. 9885-9893
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoZYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoZYL24
Shixuan Gao, Pingping Zhang, Tianyu Yan, Huchuan Lu:
Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection. 9894-9903
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DuanZCJZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DuanZCJZW24
Panjun Duan, Yang Zhao, Yuan Chen, Wei Jia, Zhao Zhang, Ronggang Wang:
Blind Video Bit-Depth Expansion. 9904-9912
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TanZQLWB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TanZQLWB24
Xiaoheng Tan, Jiabin Zhang, Yuhui Quan, Jing Li, Yajing Wu, Zilin Bian:
Highly Efficient No-reference 4K Video Quality Assessment with Full-Pixel Covering Sampling and Training Strategy. 9913-9922
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangWH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangWH24
Yujia Wang, Zhongxu Wang, Hua Huang:
AutoSFX: Automatic Sound Effect Generation for Videos. 9923-9932
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang0H0GG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang0H0GG24
Weiguang Zhang, Qiufeng Wang, Kaizhu Huang, Xiaowei Huang, Fengjun Guo, Xiaomeng Gu:
Document Registration: Towards Automated Labeling of Pixel-Level Alignment Between Warped-Flat Documents. 9933-9942
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangWYZL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangWYZL024
Hao Yang, Min Wang, Zhengfei Yu, Zhi Zeng, Mingrui Lao, Yun Zhou:
Maximizing Feature Distribution Variance for Robust Neural Networks. 9943-9951
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanWSLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanWSLY24
Kai Han, Jin Wang, Yunhui Shi, Nam Ling, Baocai Yin:
D³U-Net: Dual-Domain Collaborative Optimization Deep Unfolding Network for Image Compressive Sensing. 9952-9960
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuYSFX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuYSFX24
Jiangtong Zhu, Zhao Yang, Yinan Shi, Jianwu Fang, Jianru Xue:
IC-Mapper: Instance-Centric Spatio-Temporal Modeling for Online Vectorized Map Construction. 9961-9969
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiangDCLHG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiangDCLHG24
Jianjun Xiang, Yuanjie Dang, Peng Chen, Ronghua Liang, Ruohong Huan, Nan Gao:
Semantic-Aware and Quality-Aware Interaction Network for Blind Video Quality Assessment. 9970-9979
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangYCLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangYCLL24
Zerui Zhang, Jun Yu, Liangxian Cui, Qiang Ling, Tianyu Liu:
Part-level Reconstruction for Self-Supervised Category-level 6D Object Pose Estimation with Coarse-to-Fine Correspondence Optimization. 9980-9988
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MiSLHZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MiSLHZL24
Yachun Mi, Yan Shu, Yu Li, Chen Hui, Puchao Zhou, Shaohui Liu:
CLiF-VQA: Enhancing Video Quality Assessment by Incorporating High-Level Semantic Information related to Human Feelings. 9989-9998
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuYWYQZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuYWYQZ024
Xuntao Liu, Yuzhou Yang, Haoyue Wang, Qichao Ying, Zhenxing Qian, Xinpeng Zhang, Sheng Li:
Multi-view Feature Extraction via Tunable Prompts is Enough for Image Manipulation Localization. 9999-10007
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangFZLLZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangFZLLZC24
Junfeng Yang, Jing Fu, Zhen Zhang, Limei Liu, Qin Li, Wei Zhang, Wenzhi Cao:
Align-IQA: Aligning Image Quality Assessment Models with Diverse Human Preferences via Customizable Guidance. 10008-10017
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinXYY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinXYY024
Zehang Lin, Jiayuan Xie, Zhenguo Yang, Yi Yu, Qing Li:
Generalized News Event Discovery via Dynamic Augmentation and Entropy Optimization. 10018-10026
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CuiJPP024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CuiJPP024
Jiahao Cui, Wei Jiang, Zhan Peng, Zhiyu Pan, Zhiguo Cao:
Exposure Completing for Temporally Consistent Neural High Dynamic Range Video Rendering. 10027-10035
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanZ24
Lei Han, Xuesong Zhang:
Scalable Super-Resolution Neural Operator. 10036-10045
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangMJHBFXX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangMJHBFXX24
Ling Zhang, Yidong Ma, Zhi Jiang, Weilei He, Zhongyun Bao, Gang Fu, Wenju Xu, Chunxia Xiao:
HighlightRemover: Spatially Valid Pixel Learning for Image Specular Highlight Removal. 10046-10054
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhou0ZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhou0ZH24
Yuhang Zhou, Yushu Zhang, Leo Yu Zhang, Zhongyun Hua:
DERD: Data-free Adversarial Robustness Distillation through Self-adversarial Teacher Group. 10055-10064
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuangH0C0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuangH0C0L24
Shuman Zhuang, Sujia Huang, Wei Huang, Yuhong Chen, Zhihao Wu, Ximeng Liu:
Enhancing Multi-view Graph Neural Network with Cross-view Confluent Message Passing. 10065-10074
- view
  authority control:
- export record
  dblp key:
  - conf/mm/RongPLZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/RongPLZZ24
Fu Rong, Wenjin Peng, Meng Lan, Qian Zhang, Lefei Zhang:
Driving Scene Understanding with Traffic Scene-Assisted Topology Graph Transformer. 10075-10084
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YiCZXZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YiCZXZC24
Chang'an Yi, Haotian Chen, Yifan Zhang, Yonghui Xu, Yan Zhou, Lizhen Cui:
From Question to Exploration: Can Classic Test-Time Adaptation Strategies Be Effectively Applied in Semantic Segmentation? 10085-10094
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenLMT0Z024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenLMT0Z024
Zehao Chen, Zhan Lu, De Ma, Huajin Tang, Xudong Jiang, Qian Zheng, Gang Pan:
Event-ID: Intrinsic Decomposition Using an Event Camera. 10095-10104
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangNDZWNL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangNDZWNL24
Xu Zhang, Fan Ni, Guannan Dong, Aichun Zhu, Jianhui Wu, Mingcheng Ni, Hui Liu:
TVPR: Text-to-Video Person Retrieval and a New Benchmark. 10105-10113
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShiZ24a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShiZ24a
Haoyu Shi, Huaiwen Zhang:
Modal-Enhanced Semantic Modeling for Fine-Grained 3D Human Motion Retrieval. 10114-10123
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0004LH0JW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0004LH0JW24
Hongyu Zhu, Sichu Liang, Wentao Hu, Fangqi Li, Ju Jia, Shi-Lin Wang:
Reliable Model Watermarking: Defending against Theft without Compromising on Evasion. 10124-10133
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QiaoXGWHFCW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QiaoXGWHFCW024
Qian Qiao, Yu Xie, Jun Gao, Tianxiang Wu, Shaoyao Huang, Jiaqing Fan, Ziqiang Cao, Zili Wang, Yue Zhang:
DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training. 10134-10143
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLS24
Yi Liu, Xinyi Li, Wenjing Shuai:
3D Scene De-occlusion in Neural Radiance Fields: A Framework for Obstacle Removal and Realistic Inpainting. 10144-10153
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLH0CLQDH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLH0CLQDH24
Xuannan Liu, Peipei Li, Huaibo Huang, Zekun Li, Xing Cui, Jiahao Liang, Lixiong Qin, Weihong Deng, Zhaofeng He:
FKA-Owl: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs. 10154-10163
- view
  authority control:
- export record
  dblp key:
  - conf/mm/QinQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/QinQ24
Yalan Qin, Li Qian:
Fast Elastic-Net Multi-view Clustering: A Geometric Interpretation Perspective. 10164-10172
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Guo0LWP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Guo0LWP24
Xiaojiao Guo, Xuhang Chen, Shenghong Luo, Shuqiang Wang, Chi-Man Pun:
Dual-Hybrid Attention Network for Specular Highlight Removal. 10173-10181
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Luo0G24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Luo0G24
Yiyang Luo, Ke Lin, Chao Gu:
Context-Aware Indoor Point Cloud Object Generation through User Instructions. 10182-10190
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuCZLKN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuCZLKN24
Zhangli Hu, Ye Chen, Zhongyin Zhao, Jinfan Liu, Bilian Ke, Bingbing Ni:
Towards Artist-Like Painting Agents with Multi-Granularity Semantic Alignment. 10191-10199
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangLQSZ0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangLQSZ0L24
Zixuan Wang, Jiayi Li, Xiaoyu Qin, Shikun Sun, Songtao Zhou, Jia Jia, Jiebo Luo:
DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis. 10200-10209
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KimHPCS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KimHPCS24
Sooho Kim, Soyeon Hong, Kyungsoo Park, Hyunsouk Cho, Kyung-Ah Sohn:
OmniStitch: Depth-Aware Stitching Framework for Omnidirectional Vision with Multiple Cameras. 10210-10219
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiLLWGJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiLLWGJ24
Kaijiang Li, Hao Li, Haining Li, Peisen Wang, Chunyi Guo, Wenfeng Jiang:
SIRLUT: Simulated Infrared Fusion Guided Image-adaptive 3D Lookup Tables for Lightweight Image Enhancement. 10220-10228
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JiangXLLCX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JiangXLLCX24
Bolin Jiang, Yuqiu Xie, Jiawei Li, Naiqi Li, Bin Chen, Shu-Tao Xia:
IGSPAD: Inverting 3D Gaussian Splatting for Pose-agnostic Anomaly Detection. 10229-10237
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Li0Q024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Li0Q024
Guobiao Li, Sheng Li, Zhenxing Qian, Xinpeng Zhang:
Cover-separable Fixed Neural Network Steganography via Deep Generative Models. 10238-10247
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaLZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaLZH24
Baorui Ma, Yu-Shen Liu, Matthias Zwicker, Zhizhong Han:
Inferring 3D Occupancy Fields through Implicit Reasoning on Silhouette Images. 10248-10257
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiLLLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiLLLL24
Rui Li, Yishu Liu, Huafeng Li, Jinxing Li, Guangming Lu:
Prototype-Guided Dual-Transformer Reasoning for Video Individual Counting. 10258-10267
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZXYX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZXYX024
Tao Wang, Yushu Zhang, Xiangli Xiao, Lin Yuan, Zhihua Xia, Jian Weng:
Make Privacy Renewable! Generating Privacy-Preserving Faces Supporting Cancelable Biometric Recognition. 10268-10276
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SPBM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SPBM24
Green Rosh K. S, B. H. Pawan Prasad, Lokesh R. Boregowda, Kaushik Mitra:
R²SFD: Improving Single Image Reflection Removal using Semantic Feature Dictionary. 10277-10286
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShenH0C024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShenH0C024
Jiaming Shen, Kun Hu, Wei Bao, Chang Wen Chen, Zhiyong Wang:
Bridging the Gap: Sketch-Aware Interpolation Network for High-Quality Animation Sketch Inbetweening. 10287-10295
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SuZXZ0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SuZXZ0Y24
Yanghao Su, Jie Zhang, Ting Xu, Tianwei Zhang, Weiming Zhang, Nenghai Yu:
Model X-ray: Detecting Backdoored Models via Decision Boundary. 10296-10305
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouWXL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouWXL024
Lize Zhou, Xiaoqi Wang, Jian Xiong, Xianzhong Long, Hao Gao:
Towards Distortion-Debiased Blind Image Quality Assessment. 10306-10315
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang0Y24
Benhui Zhang, Junyu Gao, Yuan Yuan:
A Descriptive Basketball Highlight Dataset for Automatic Commentary Generation. 10316-10325
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangWMYW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangWMYW24
Cong Wang, Liyan Wang, Jie Mu, Chengjin Yu, Wei Wang:
Progressive Local and Non-Local Interactive Networks with Deeply Discriminative Training for Image Deraining. 10326-10335
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangZG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangZG24
Kaifang Yang, Xinrong Zhao, Yanchao Gong:
Semantic Aware Just Noticeable Differences for VVC Compressed Text Screen Content Images. 10336-10344
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuWXWP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuWXWP24
Jiaxuan Wu, Zhengxian Wu, Yiming Xue, Juan Wen, Wanli Peng:
Generative Text Steganography with Large Language Model. 10345-10353
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0YZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0YZW24
Yuchen Wang, Xingyu Zhu, Guanhui Ye, Shiyao Zhang, Xuetao Wei:
Achieving Resolution-Agnostic DNN-based Image Watermarking: A Novel Perspective of Implicit Neural Representation. 10354-10362
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuZSGX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuZSGX024
Renshu Gu, Jiajun Zhu, Yixuan Si, Fei Gao, Jiamin Xu, Gang Xu:
3D Human Pose Estimation from Multiple Dynamic Views via Single-view Pretraining with Procrustes Alignment. 10363-10372
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DingDWFCZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DingDWFCZ24
Yang Ding, Yi Dai, Xin Wang, Ling Feng, Lei Cao, Huijun Zhang:
Integrating Content-Semantics-World Knowledge to Detect Stress from Videos. 10373-10381
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaoWXLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaoWXLW24
Xintian Mao, Jiansheng Wang, Xingran Xie, Qingli Li, Yan Wang:
LoFormer: Local Frequency Transformer for Image Deblurring. 10382-10391
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZZL0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZZL0024
Mingjin Zhang, Chi Zhang, Qiming Zhang, Yunsong Li, Xinbo Gao, Jing Zhang:
Unleashing the Power of Generic Segmentation Model: A Simple Baseline for Infrared Small Target Detection. 10392-10401
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuanL0D0R24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuanL0D0R24
Honglin Yuan, Shiyun Lai, Xingfeng Li, Jian Dai, Yuan Sun, Zhenwen Ren:
Robust Prototype Completion for Incomplete Multi-view Clustering. 10402-10411
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PengG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PengG24
Changhao Peng, Wei Gao:
Laplacian Matrix Learning for Point Cloud Attribute Compression with Ternary Search-Based Adaptive Block Partition. 10412-10420
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuanZWYWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuanZWYWL24
Zhongwei Xuan, Zunjie Zhu, Shuai Wang, Haibing Yin, Hongkui Wang, Ming Lu:
Superpixel-based Efficient Sampling for Learning Neural Fields from Large Input. 10421-10430
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WanY0FZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WanY0FZZ24
Zhaolin Wan, Qiushuang Yang, Zhiyang Li, Xiaopeng Fan, Wangmeng Zuo, Debin Zhao:
Dual-stream Perception-driven Blind Quality Assessment for Stereoscopic Omnidirectional Images. 10431-10439
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TangYRZP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TangYRZP24
Weixuan Tang, Haoyu Yang, Yuan Rao, Zhili Zhou, Fei Peng:
Dig a Hole and Fill in Sand: Adversary and Hiding Decoupled Steganography. 10440-10448
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZ0ZLW0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZ0ZLW0Z24
Bin Wang, Meishan Zhang, Hao Fei, Yu Zhao, Bobo Li, Shengqiong Wu, Wei Ji, Min Zhang:
SpeechEE: A Novel Benchmark for Speech Event Extraction. 10449-10458
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenHYLZLNSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenHYLZLNSZ24
Shouyu Chen, Liang Hu, Tangwei Ye, Zhongyuan Lai, Qi Zhang, Ke Liu, Usman Naseem, Ke Sun, Nengjun Zhu:
VR-DiagNet: Medical Volumetric and Radiomic Diagnosis Networks with Interpretable Clinician-like Optimizing Visual Inspection. 10459-10467
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuPKSLSYWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuPKSLSYWL24
Minjing Yu, Delong Pang, Ziwen Kang, Zhiyao Sun, Tian Lv, Jenny Sheng, Ran Yi, Yu-Hui Wen, Yong-Jin Liu:
ECAvatar: 3D Avatar Facial Animation with Controllable Identity and Emotion. 10468-10476
- view
  authority control:
- export record
  dblp key:
  - conf/mm/BaoLZLLQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/BaoLZLLQ24
Zhenyu Bao, Guibiao Liao, Zhongyuan Zhao, Kanglin Liu, Qing Li, Guoping Qiu:
3D Reconstruction and Novel View Synthesis of Indoor Environments Based on a Dual Neural Radiance Field. 10477-10486
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLGZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLGZW24
Zimo Liu, Kangjun Liu, Mingyue Guo, Shiliang Zhang, Yaowei Wang:
CoTuning: A Large-Small Model Collaborating Distillation Framework for Better Model Generalization. 10487-10496
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DengLXZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DengLXZ24
Yanbin Deng, Zheng Li, Ning Xie, Wei Zhang:
PIMT: Physics-Based Interactive Motion Transition for Hybrid Character Animation. 10497-10505
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShenXGGXD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShenXGGXD24
Kang Shen, Haifeng Xia, Guangxing Geng, Guangyue Geng, Siyu Xia, Zhengming Ding:
DEITalk: Speech-Driven 3D Facial Animation with Dynamic Emotional Intensity Modeling. 10506-10514
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangH0Z024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangH0Z024
Tianyi Wang, Mengxiao Huang, Harry Cheng, Xiao Zhang, Zhiqi Shen:
LampMark: Proactive Deepfake Detection via Training-Free Landmark Perceptual Watermarks. 10515-10524
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DongZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DongZZ24
Lintao Dong, Wei Zhai, Zheng-Jun Zha:
UniDense: Unleashing Diffusion Models with Meta-Routers for Universal Few-Shot Dense Prediction. 10525-10534
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LvX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LvX024
Henglei Lv, Jiayu Xiao, Liang Li:
Pick-and-Draw: Training-free Semantic Guidance for Text-to-Image Personalization. 10535-10543
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuP0TY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuP0TY024
Guoqing Zhu, Honghu Pan, Qiang Wang, Chao Tian, Chao Yang, Zhenyu He:
Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model. 10544-10553
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiFWLGDH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiFWLGDH24
Qiao Li, Xiaomeng Fu, Xi Wang, Jin Liu, Xingyu Gao, Jiao Dai, Jizhong Han:
Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models. 10554-10562
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YeZLP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YeZLP24
Zhaoda Ye, Xinhan Zheng, Yang Liu, Yuxin Peng:
RelScene: A Benchmark and baseline for Spatial Relations in text-driven 3D Scene Generation. 10563-10571
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Tian0LLG0LYX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Tian0LLG0LYX24
Shilong Tian, Hong Chen, Chengtao Lv, Yu Liu, Jinyang Guo, Xianglong Liu, Shengxi Li, Hao Yang, Tao Xie:
QVD: Post-training Quantization for Video Diffusion Models. 10572-10581
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xie0LCJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xie0LCJ24
Jingjing Xie, Yuxin Zhang, Mingbao Lin, Liujuan Cao, Rongrong Ji:
Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation. 10582-10591
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouFLLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouFLLW24
Pengfei Zhou, Fangxiang Feng, Guang Liu, Ruifan Li, Xiaojie Wang:
DiffHarmony++: Enhancing Image Harmonization with Harmony-VAE and Inverse Harmonization Model. 10592-10601
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuFLSMX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuFLSMX024
Qi Xu, Xuanye Fang, Yaxin Li, Jiangrong Shen, De Ma, Yi Xu, Gang Pan:
RSNN: Recurrent Spiking Neural Networks for Dynamic Spatial-Temporal Information Processing. 10602-10610
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangHL24
Wei Yang, Tengfei Huo, Zhiqiang Liu:
Enhancing Transformer-based Semantic Matching for Few-shot Learning through Weakly Contrastive Pre-training. 10611-10620
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FrolovMP024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FrolovMP024
Stanislav Frolov, Brian B. Moser, Sebastian Palacio, Andreas Dengel:
ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation. 10621-10629
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangWHXHYC00YL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangWHXHYC00YL24
Rongjie Huang, Yongqi Wang, Ruofan Hu, Xiaoshan Xu, Zhiqing Hong, Dongchao Yang, Xize Cheng, Zehan Wang, Ziyue Jiang, Zhenhui Ye, Luping Liu, Siqi Zheng, Zhou Zhao:
VoiceTuner: Self-Supervised Pre-training and Efficient Fine-tuning For Voice Generation. 10630-10639
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangWQW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangWQW24
Yuran Wang, Zhijing Wan, Yansheng Qiu, Zheng Wang:
Devil is in Details: Locality-Aware 3D Abdominal CT Volume Generation for Self-Supervised Organ Segmentation. 10640-10648
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiWZZHP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiWZZHP24
Minghui Li, Jiangxiong Wang, Hao Zhang, Ziqi Zhou, Shengshan Hu, Xiaobing Pei:
Transferable Adversarial Facial Images for Privacy Protection. 10649-10658
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TaoBTWX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TaoBTWX24
Ming Tao, Bing-Kun Bao, Hao Tang, Yaowei Wang, Changsheng Xu:
CoIn: A Lightweight and Effective Framework for Story Visualization and Continuation. 10659-10668
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZWWZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZWWZ0024
Xulu Zhang, Wengyu Zhang, Xiaoyong Wei, Jinlin Wu, Zhaoxiang Zhang, Zhen Lei, Qing Li:
Generative Active Learning for Image Synthesis Personalization. 10669-10677
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaiWLZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaiWLZ024
Zhijun Zhai, Zengmao Wang, Xiaoxiao Long, Kaixuan Zhou, Bo Du:
SAT3D: Image-driven Semantic Attribute Transfer in 3D. 10678-10687
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangSHBDY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangSHBDY024
Zihan Huang, Xinyu Shi, Zecheng Hao, Tong Bu, Jianhao Ding, Zhaofei Yu, Tiejun Huang:
Towards High-performance Spiking Transformers from ANN to SNN Conversion. 10688-10697
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiW0Q0V24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiW0Q0V24
Jialiang Li, Haoyue Wang, Sheng Li, Zhenxing Qian, Xinpeng Zhang, Athanasios V. Vasilakos:
Are handcrafted filters helpful for attributing AI-generated images? 10698-10706
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DingWKMCCCCH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DingWKMCCCCH24
Peng Ding, Jingyu Wu, Jun Kuang, Dan Ma, Xuezhi Cao, Xunliang Cai, Shi Chen, Jiajun Chen, Shujian Huang:
Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs. 10707-10715
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangGCZWC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangGCZWC024
Shaodong Wang, Yunyang Ge, Liuhan Chen, Haiyang Zhou, Qian Wang, Xinhua Cheng, Li Yuan:
Prompt2Poster: Automatically Artistic Chinese Poster Creation from Prompt Only. 10716-10724
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZLLXSSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZLLXSSL24
Weijie Wang, Jichao Zhang, Chang Liu, Xia Li, Xingqian Xu, Humphrey Shi, Nicu Sebe, Bruno Lepri:
UVMap-ID: A Controllable and Personalized UV Map Generative Model. 10725-10734
- view
  authority control:
- export record
  dblp key:
  - conf/mm/PengLZ0W024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/PengLZ0W024
Tianshuo Peng, Zuchao Li, Lefei Zhang, Hai Zhao, Ping Wang, Bo Du:
Multi-modal Auto-regressive Modeling via Visual Tokens. 10735-10744
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0007LZWSF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0007LZWSF24
Haining Wang, Na Li, Huijie Zhao, Yan Wen, Yi Su, Yuqiang Fang:
MappingFormer: Learning Cross-modal Feature Mapping for Visible-to-infrared Image Translation. 10745-10754
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhengHWBZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhengHWBZ0024
Xiangping Zheng, Xiuxin Hao, Bo Wu, Xigang Bao, Xuan Zhang, Wei Li, Xun Liang:
A Sample-driven Selection Framework: Towards Graph Contrastive Networks with Reinforcement Learning. 10755-10764
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangXHG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangXHG024
Peiyong Wang, Bohan Xiao, Qisheng He, Carri Glide-Hurst, Ming Dong:
Score-Based Image-to-Image Brownian Bridge. 10765-10773
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CaoKZYDZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CaoKZYDZZ24
Tingfeng Cao, Junsheng Kong, Xue Zhao, Wenqing Yao, Junwei Ding, Jinhui Zhu, Jiandong Zhang:
Product2IMG: Prompt-Free E-commerce Product Background Generation with Diffusion Model and Self-Improved LMM. 10774-10783
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XieDGML24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XieDGML24
Zhenyu Xie, Haoye Dong, Yufei Gao, Zehua Ma, Xiaodan Liang:
DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models. 10784-10793
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FuWZJMWCWG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FuWZJMWCWG024
Chencan Fu, Yabiao Wang, Jiangning Zhang, Zhengkai Jiang, Xiaofeng Mao, Jiafu Wu, Weijian Cao, Chengjie Wang, Yanhao Ge, Yong Liu:
MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion. 10794-10803
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LouLWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LouLWL24
Wei Lou, Guanbin Li, Xiang Wan, Haofeng Li:
Multi-modal Denoising Diffusion Pre-training for Whole-Slide Image Classification. 10804-10813
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiWCPWXWCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiWCPWXWCL24
Xingyi Li, Yizheng Wu, Jun Cen, Juewen Peng, Kewei Wang, Ke Xian, Zhe Wang, Zhiguo Cao, Guosheng Lin:
iControl3D: An Interactive System for Controllable 3D Scene Generation. 10814-10823
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangZ0024a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangZ0024a
Yibin Wang, Weizhong Zhang, Jianwei Zheng, Cheng Jin:
PrimeComposer: Faster Progressively Combined Diffusion for Image Composition with Attention Steering. 10824-10832
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangYCHC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangYCHC24
Jiancheng Huang, Mingfu Yan, Songyan Chen, Yi Huang, Shifeng Chen:
MagicFight: Personalized Martial Arts Combat Video Generation. 10833-10842
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuG0ZHWX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuG0ZHWX24
Longfei Lu, Huachen Gao, Tao Dai, Yaohua Zha, Zhi Hou, Junta Wu, Shu-Tao Xia:
Large Point-to-Gaussian Model for Image-to-3D Generation. 10843-10852
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunWQSQGZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunWQSQGZL24
Mingzhen Sun, Weining Wang, Yanyuan Qiao, Jiahui Sun, Zihan Qin, Longteng Guo, Xinxin Zhu, Jing Liu:
MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation. 10853-10861
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangL0MXZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangL0MXZZ24
Ruowei Wang, Jiaqi Li, Dan Zeng, Xueqi Ma, Zixiang Xu, Jianwei Zhang, Qijun Zhao:
GenUDC: High Quality 3D Mesh Generation With Unsigned Dual Contouring Representation. 10862-10871
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhuXZD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhuXZD024
Xiaopei Zhu, Peiyang Xu, Guanning Zeng, Yinpeng Dong, Xiaolin Hu:
Natural Language Induced Adversarial Images. 10872-10881
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuZLW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuZLW024
Xin Lu, Chuanqing Zhuang, Zhengda Lu, Yiqun Wang, Jun Xiao:
FC-4DFS: Frequency-controlled Flexible 4D Facial Expression Synthesizing. 10882-10890
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiZWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiZWL24
Jiaxing Li, Hongbo Zhao, Yijun Wang, Jianxin Lin:
Towards Photorealistic Video Colorization via Gated Color-Guided Image Diffusion Models. 10891-10900
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GeJILWM0WLTSB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GeJILWM0WLTSB24
Mengmeng Ge, Xu Jia, Takashi Isobe, Xiaomin Li, Qinghe Wang, Jing Mu, Dong Zhou, Li Wang, Huchuan Lu, Lu Tian, Ashish Sirasao, Emad Barsoum:
Customizing Text-to-Image Generation with Inverted Interaction. 10901-10909
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XuZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XuZ024
Yunqiu Xu, Linchao Zhu, Yi Yang:
GG-Editor: Locally Editing 3D Avatars with Multimodal Large Language Model Guidance. 10910-10919
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Lyu0H24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Lyu0H24
Xianqiang Lyu, Hui Liu, Junhui Hou:
RainyScape: Unsupervised Rainy Scene Reconstruction using Decoupled Neural Rendering. 10920-10929
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinZXWWDDC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinZXWWDDC24
Jingyu Lin, Guiqin Zhao, Jing Xu, Guoli Wang, Zejin Wang, Antitza Dantcheva, Lan Du, Cunjian Chen:
DiffTV: Identity-Preserved Thermal-to-Visible Face Translation via Feature Alignment and Dual-Stage Conditions. 10930-10938
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiB0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiB0024
Yifan Li, Yuhang Bai, Shuai Yang, Jiaying Liu:
COCO-LC: Colorfulness Controllable Language-based Colorization. 10939-10947
- view
  authority control:
- export record
  dblp key:
  - conf/mm/BaoZPXSC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/BaoZPXSC24
Yiying Bao, Hao Zhou, Chao Peng, Chenyang Xu, Shuo Shi, Kecheng Cai:
Boundary-Aware Periodicity-based Sparsification Strategy for Ultra-Long Time Series Forecasting. 10948-10956
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DongXWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DongXWL24
Ziyi Dong, Yao Xiao, Pengxu Wei, Liang Lin:
Decoder-Only LLMs are Better Controllers for Diffusion Models. 10957-10965
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DaiLZWZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DaiLZWZ24
Zhenqi Dai, Ting Liu, Xingxing Zhang, Yunchao Wei, Yanning Zhang:
One-shot In-context Part Segmentation. 10966-10975
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuanCWQYS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuanCWQYS24
Ziyang Yuan, Mingdeng Cao, Xintao Wang, Zhongang Qi, Chun Yuan, Ying Shan:
CustomNet: Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models. 10976-10984
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChoLYHKAK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChoLYHKAK24
Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn, Seungryong Kim:
GaussianTalker: Real-Time Talking Head Synthesis with 3D Gaussian Splatting. 10985-10994
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Chu0ZY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Chu0ZY24
Huanpeng Chu, Wei Wu, Chengjie Zang, Kun Yuan:
QNCD: Quantization Noise Correction for Diffusion Models. 10995-11003
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0011C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0011C24
Dan Wang, Xinrui Cui:
InNeRF: Learning Interpretable Radiance Fields for Generalizable 3D Scene Representation and Rendering. 11004-11012
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FanYLZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FanYLZZ24
Zhongyi Fan, Zixin Yin, Gang Li, Yibing Zhan, Heliang Zheng:
DreamBooth++: Boosting Subject-Driven Generation via Region-Level References Packing. 11013-11021
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenZH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenZH024
Zhenghao Chen, Luping Zhou, Zhihao Hu, Dong Xu:
Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression. 11022-11031
- view
  authority control:
- export record
  dblp key:
  - conf/mm/RenHWXLWZHH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/RenHWXLWZHH24
Lingfei Ren, Ruimin Hu, Zheng Wang, Yilin Xiao, Dengshi Li, Junhang Wu, Yilong Zang, Jinzhang Hu, Zijun Huang:
Heterophilic Graph Invariant Learning for Out-of-Distribution of Fraud Detection. 11032-11040
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiaoSSWTTL0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiaoSSWTTL0L24
Haicheng Liao, Haoyu Sun, Huanming Shen, Chengyue Wang, Chunlin Tian, KaHou Tam, Li Li, Chengzhong Xu, Zhenning Li:
CRASH: Crash Recognition and Anticipation System Harnessing with Context-Aware and Temporal Focus Attentions. 11041-11050
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinKS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinKS024
Lehao Lin, Hong Kang, Xinyao Sun, Wei Cai:
SemNFT: A Semantically Enhanced Decentralized Middleware for Digital Asset Immortality. 11051-11059
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhu000WZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhu000WZ24
Guogang Zhu, Xuefeng Liu, Jianwei Niu, Shaojie Tang, Xinghao Wu, Jiayuan Zhang:
DualFed: Enjoying both Generalization and Personalization in Federated Learning via Hierachical Representations. 11060-11069
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZengXZW0CN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZengXZW0CN24
Hui Zeng, Minrui Xu, Tongqing Zhou, Xinyi Wu, Jiawen Kang, Zhiping Cai, Dusit Niyato:
One-shot-but-not-degraded Federated Learning. 11070-11079
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CaoWWWY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CaoWWWY24
Miao Cao, Lishun Wang, Huan Wang, Guoqing Wang, Xin Yuan:
Towards Real-time Video Compressive Sensing on Mobile Devices. 11080-11088
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YinSZHL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YinSZHL024
Daheng Yin, Jianxin Shi, Miao Zhang, Zhaowu Huang, Jiangchuan Liu, Fang Dong:
FSVFG: Towards Immersive Full-Scene Volumetric Video Streaming with Adaptive Feature Grid. 11089-11098
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangzLZWM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangzLZWM24
Huanhuan Zhang, Liu zhuo, Haotian Li, Anfu Zhou, Chuanming Wang, Huadong Ma:
AraLive: Automatic Reward Adaption for Learning-based Live Video Streaming. 11099-11108
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Dan0LXDMTX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Dan0LXDMTX24
Jun Dan, Weiming Liu, Mushui Liu, Chunfeng Xie, Shunjie Dong, Guofang Ma, Yanchao Tan, Jiazheng Xing:
HOGDA: Boosting Semi-supervised Graph Domain Adaptation via High-Order Structure-Guided Adaptive Feature Alignment. 11109-11118

Reproducibility

- view
  authority control:
- export record
  dblp key:
  - conf/mm/JinJZLGD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JinJZLGD24
Xin Jin, Longteng Jiang, Yihao Zhang, Lihua Lu, Xiaobo Gao, Boyan Dong:
Reproducibility Companion Paper: Aesthetics-Driven Virtual Time-Lapse Photography Generation. 11119-11122

Panel

- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangCY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangCY24
Zi Helen Huang, Phoebe Chen, Shuicheng Yan:
Generative AI in Multimedia: Challenges and Opportunities for Academic and Industrial Impact. 11123-11124

Industry Session

- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuAY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuAY024
Jianquan Liu, Balu Adsumilli, Yukiko Yanagawa, Haiwei Dong:
An Innovative Industry Program in A New Era of Multimedia with Generative AI. 11125-11126

Doctoral Symposium

- view
  authority control:
- export record
  dblp key:
  - conf/mm/Hu24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Hu24
Wenmiao Hu:
Utilizing Very High-resolution Optical RGB Satellite Imagery in Geo-information Extraction for Fine-scale Map-making. 11127-11131
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhang24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhang24
Cheng Zhang:
Practical Deep Learning Models for QIM-based VoIP Steganalysis. 11132-11136

Brave New Ideas

- view
  authority control:
- export record
  dblp key:
  - conf/mm/0002YLWL0WL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0002YLWL0WL24
Jie An, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Lijuan Wang, Jiebo Luo:
OpenLEAF: A Novel Benchmark for Open-Domain Interleaved Image-Text Generation. 11137-11145
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Torre-OrtizR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Torre-OrtizR24
Carlos de la Torre-Ortiz, Tuukka Ruotsalo:
Perceptual Visual Similarity from EEG: Prediction and Image Generation. 11146-11155
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoSMWJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoSMWJ24
Yifeng Gao, Yuhua Sun, Xingjun Ma, Zuxuan Wu, Yu-Gang Jiang:
ModelLock: Locking Your Model With a Spell. 11156-11165
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangFC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangFC24
Jiyi Zhang, Han Fang, Ee-Chien Chang:
Finding Input Data Domains of Image Classification Models with Hard-Label Black-Box Access. 11166-11174
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangXCS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangXCS024
Yudong Zhang, Ruobing Xie, Jiansheng Chen, Xingwu Sun, Yu Wang:
PIP: Detecting Adversarial Examples in Large Vision-Language Models via Attention Patterns of Irrelevant Probe Questions. 11175-11183
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhou0ZJXHXY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhou0ZJXHXY24
Taotao Zhou, Teng Xu, Dong Zhang, Yuyang Jiao, Peijun Xu, Yaoyu He, Lan Xu, Jingyi Yu:
Sophia-in-Audition: Virtual Production with a Robot Performer. 11184-11193

Open-Source

- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenHLLZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenHLLZ024
Xiaodong Chen, Kunlang He, Wu Liu, Xinchen Liu, Zheng-Jun Zha, Tao Mei:
CLaM: An Open-Source Library for Performance Evaluation of Text-driven Human Motion Generation. 11194-11197
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DuanYQFCLDZZWL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DuanYQFCLDZZWL024
Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
VLMEvalKit: An Open-Source ToolKit for Evaluating Large Multi-Modality Models. 11198-11201
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoZZZYLYZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoZZZYLYZ24
Wei Gao, Huiming Zheng, Chenhao Zhang, Kaiyu Zheng, Zhuozhen Yu, Yuan Li, Hua Ye, Yongchi Zhang:
OpenDIC: An Open-Source Library and Performance Evaluation for Deep-learning-based Image Compression. 11202-11205
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoKK024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoKK024
Hung-Jui Guo, Hiranya Garbha Kumar, Minhas Kamal, Balakrishnan Prabhakaran:
Room2XR: Virtual Interactive Collaboration in Real-world Scenes. 11206-11209
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001R00C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001R00C24
Jack Jansen, Thomas Röggla, Silvia Rossi, Irene Viola, Pablo César:
Open-Sourcing VR2Gather: A Collaborative Social VR System for Adaptive Multi-Party Real Time Communication. 11210-11213
- view
  authority control:
- export record
  dblp key:
  - conf/mm/RasanenTMV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/RasanenTMV24
Joni Räsänen, Heikki Tampio, Alexandre Mercat, Jarno Vanne:
uvgComm: Open Software for Low-Latency Multi-party Video Communication. 11214-11217
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SoucekL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SoucekL24
Tomás Soucek, Jakub Lokoc:
TransNet V2: An Effective Deep Network Architecture for Fast Shot Transition Detection. 11218-11221
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TangCGS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TangCGS24
Jingyuan Tang, Yangang Cai, Xuesong Gao, Songlin Sun:
Generalized Sampling of Non-Local Textural Clues Multi-View Stereo Framework. 11222-11225
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TongHW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TongHW24
Yuan Tong, Mengshun Hu, Zheng Wang:
NNVISR: Bring Neural Network Video Interpolation and Super Resolution into Video Processing Framework. 11226-11229
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ViitanenSSMV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ViitanenSSMV24
Marko Viitanen, Joose Sainio, Kari Siivonen, Alexandre Mercat, Jarno Vanne:
uvg266: Open-Source VVC Intra Encoder. 11230-11233
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xie024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xie024
Liang Xie, Wei Gao:
LearningPCC: A PyTorch Library for Learning-Based Point Cloud Compression. 11234-11238
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Xie024a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Xie024a
Liang Xie, Wei Gao:
PCHMVision: An Open-Source Library of Point Cloud Compression for Human and Machine Vision. 11239-11243
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YeZJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YeZJ24
Feng Ye, Li Zhang, Chuanmin Jia:
Deep Video Compression with Scaled Hierarchical Bi-directional Motion Model. 11244-11247
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuanGG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuanGG24
Hang Yuan, Wei Gao, Wenxu Gao:
OpenSEP: An Open Source Subjective Experiment Platform. 11248-11251

Technical Demonstrations

- view
  authority control:
- export record
  dblp key:
  - conf/mm/BlumeNWCSJKZLHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/BlumeNWCSJKZLHS24
Ansel Blume, Khanh Duy Nguyen, Zhenhailong Wang, Yangyi Chen, Michal Shlapentokh-Rothman, Xiaomeng Jin, Jeonghwan Kim, Zhen Zhu, Jiateng Liu, Kuan-Hao Huang, Mankeerat Sidhu, Xuanming Zhang, Vivian Liu, Raunak Sinha, Te-Lin Wu, Abhay Zala, Elias Stengel-Eskin, Da Yin, Yao Xiao, Utkarsh Mall, Zhou Yu, Kai-Wei Chang, Camille Cobb, Karrie Karahalios, Lydia B. Chilton, Mohit Bansal, Nanyun Peng, Carl Vondrick, Derek Hoiem, Heng Ji:
MIRACLE: An Online, Explainable Multimodal Interactive Concept Learning System. 11252-11254
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoHBLS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoHBLS24
Difei Gao, Siyuan Hu, Zechen Bai, Qinghong Lin, Mike Zheng Shou:
AssistEditor: Multi-Agent Collaboration for GUI Workflow Automation in Video Creation. 11255-11257
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanZ0ZZSF024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanZ0ZZSF024
Feilin Han, Leping Zhang, Xin Wang, Ke-Ao Zhao, Ying Zhong, Ziyi Su, Tongtong Feng, Wenwu Zhu:
U2USim - A UAV Telepresence Simulation Platform with Multi-agent Sensing and Dynamic Environment. 11258-11260
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuHPZFZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuHPZFZ24
Zhanbin Hu, Xiaodong He, Renzhou Pan, Xianzhou Zeng, Chenming Fan, Qiang Zhu:
MAF-ID: Multi-Agent Framework for Interactive Dubbing through Deep Video Understanding. 11261-11263
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JinZJL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JinZJL24
Xin Jin, Liaoruxing Zhang, Longteng Jiang, Dandan Li:
Unlimited Vision: Professional Composition by Yourself. 11264-11266
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KimHPKL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KimHPKL24
Seongjean Kim, Jungwoo Huh, Yeseung Park, Jungsu Kim, Sanghoon Lee:
DanceMimic: Awaken Your Dancing Instinct through a Real-time Dance Imitation Capture System. 11267-11269
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaYWZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaYWZL24
Ying Ma, Xinyan Yang, Aiqi Wang, Jianglin Zeng, Shaofei Liu:
Video Editing Chatbot: Language-Driven Video Compositing System. 11270-11272
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangYMA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangYMA24
Liangyu Wang, Yoko Yamakata, Ryoma Maeda, Kiyoharu Aizawa:
Measure and Improve Your Food: Ingredient Estimation Based Nutrition Calculator. 11273-11275
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuJZLT0ZCZSN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuJZLT0ZCZSN24
Mingyuan Wu, Ruifan Ji, Haozhen Zheng, Jiaxi Li, Beitong Tian, Bo Chen, Ruixiao Zhang, Jacob Chakareski, Michael Zink, Ramesh K. Sitaraman, Klara Nahrstedt:
Scene Graph Driven Hybrid Interactive VR Teleconferencing. 11276-11278
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuSYTQLHB0J24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuSYTQLHB0J24
Yuning Wu, Jiatong Shi, Yifeng Yu, Yuxun Tang, Tao Qian, Yueqian Lin, Jionghao Han, Xinyi Bai, Shinji Watanabe, Qin Jin:
Muskits-ESPnet: A Comprehensive Toolkit for Singing Voice Synthesis in New Paradigm. 11279-11281
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YiMYY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YiMYY24
Shengzhou Yi, Junichiro Matsugami, Takuya Yamamoto, Toshihiko Yamasaki:
Enhancing Speaking and Slide Design Skills with Deep Learning: An Online Presentation Assessment System. 11282-11284

Tutorial Presentations

- view
  authority control:
- export record
  dblp key:
  - conf/mm/ArnoldBG0KS0V24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ArnoldBG0KS0V24
Rahel Arnold, Werner Bailer, Ralph Gasser, Björn Þór Jónsson, Omar Shahbaz Khan, Heiko Schuldt, Florian Spiess, Lucia Vadicamo:
Multimedia Information Retrieval in XR. 11285-11286
- view
  authority control:
- export record
  dblp key:
  - conf/mm/BiondiRPB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/BiondiRPB24
Niccolò Biondi, Simone Ricci, Federico Pernici, Alberto Del Bimbo:
Learning Backward Compatible Representations. 11287-11288
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001LLLZZY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001LLLZZY24
Hao Fei, Xiangtai Li, Haotian Liu, Fuxiao Liu, Zhuosheng Zhang, Hanwang Zhang, Shuicheng Yan:
From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond. 11289-11291
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoL24b
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoL24b
Wei Gao, Ge Li:
Point Cloud Compression, Enhancement and Applications: From 3D Perception to Large Models. 11292-11293
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HanCPN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HanCPN24
Soyeon Caren Han, Feiqi Cao, Josiah Poon, Roberto Navigli:
Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond. 11294-11295
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0019ZC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0019ZC024
Xin Wang, Yuwei Zhou, Hong Chen, Wenwu Zhu:
Curriculum Learning for Multimedia in the Era of Large Language Models. 11296-11297
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuSQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuSQL24
Kaicheng Yu, Zhuang Shao, Siyuan Qi, Dongfang Liu:
Tutorial: Large Language-Vision Model in Society. 11298-11299
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhaoJHZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhaoJHZ024
Sicheng Zhao, Guoli Jia, Xiaopeng Hong, Yanyan Zhao, Jianhua Tao:
Label-Efficient Emotion and Sentiment Analysis. 11300-11301

Grand Challenges

- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001XLW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001XLW024
Yicheng Wu, Yutong Xie, Xiangde Luo, Qi Wu, Jianfei Cai:
Dataset, Challenge, and Evaluation for Tumor Segmentation Variability. 11302-11303
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuoLLCHZYW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuoLLCHZYW24
Dan Guo, Xiaobai Li, Kun Li, Haoyu Chen, Jingjing Hu, Guoying Zhao, Yi Yang, Meng Wang:
MAC 2024: Micro-Action Analysis Grand Challenge. 11304-11305
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuJZLWZSLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuJZLWZSLL24
Jun Yu, Mohan Jing, Guopeng Zhao, Keda Lu, Yifan Wang, Feng Zhao, Jiaqing Sun, Qingsong Liu, Jiaen Liang:
End-to-end Spatio-Temporal Information Aggregation For Micro-Action Detection. 11306-11312
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiHCHCW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiHCHCW24
Qiankun Li, Xiaolong Huang, Huabao Chen, Feng He, Qiupu Chen, Zengfu Wang:
Advancing Micro-Action Recognition with Multi-Auxiliary Heads and Hybrid Loss Optimization. 11313-11319
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangMZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangMZ24
Chen Wang, Xun Mei, Feng Zhang:
Instance-aware Fine-grained Micro-action Recognition. 11320-11326
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GongCZB0GX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GongCZB0GX24
Fan Gong, Jialiang Chen, Jiajun Zhu, Qijian Bao, Fei Gao, Renshu Gu, Gang Xu:
Micro-Action Recognition via Hierarchical Fusion and Inference. 11327-11332
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SaeedNMDTZLKNYS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SaeedNMDTZLKNYS24
Muhammad Saad Saeed, Shah Nawaz, Marta Moscati, Rohan Kumar Das, Muhammad Salman Tahir, Muhammad Zaigham Zaheer, Muhammad Irzam Liaqat, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf, Markus Schedl:
A Synopsis of FAME 2024 Challenge: Associating Faces with Voices in Multilingual Environments. 11333-11334
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TangWXLLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TangWXLLH24
Jiehui Tang, Xiaofei Wang, Zhen Xiao, Jiayi Liu, Xueliang Liu, Richang Hong:
Exploring Robust Face-Voice Matching in Multilingual Environments. 11335-11341
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TaoSJTCA024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TaoSJTCA024
Ruijie Tao, Zhan Shi, Yidi Jiang, Duc-Tuan Truong, Eng Siong Chng, Massimo Alioto, Haizhou Li:
Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization. 11342-11347
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ChenSXD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ChenSXD24
Wuyang Chen, Yanjie Sun, Kele Xu, Yong Dou:
Contrastive Learning-based Chaining-Cluster for Multilingual Voice-Face Association. 11348-11354
- view
  authority control:
- export record
  dblp key:
  - conf/mm/CaiD0HKST24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/CaiD0HKST24
Zhixi Cai, Abhinav Dhall, Shreya Ghosh, Munawar Hayat, Dimitrios Kollias, Kalin Stefanov, Usman Tariq:
1M-Deepfakes Detection Challenge. 11355-11359
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Perez-VieitesMA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Perez-VieitesMA24
Diego Pérez-Vieites, Juan José Moreira-Pérez, Ángel Aragón-Kifute, Raquel Román-Sarmiento, Rubén Castro-González:
Vigo: Audiovisual Fake Detection and Segment Localization. 11360-11364
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangMLLDYLHFG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangMLLDYLHFG024
Yi Zhang, Changtao Miao, Man Luo, Jianshu Li, Wenzhong Deng, Weibin Yao, Zhe Li, Bingyu Hu, Weiwei Feng, Tao Gong, Qi Chu:
MFMS: Learning Modality-Fused and Modality-Specific Features for Deepfake Detection and Localization Tasks. 11365-11369
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangWZJLYSGLSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangWZJLYSGLSL24
Yifan Wang, Xuecheng Wu, Jia Zhang, Mohan Jing, Keda Lu, Jun Yu, Wen Su, Fang Gao, Qingsong Liu, Jianqing Sun, Jiaen Liang:
Building Robust Video-Level Deepfake Detection via Audio-Visual Local-Global Interactions. 11370-11376
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001B0DHPSBAAB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001B0DHPSBAAB24
Philipp Müller, Michal Balazia, Tobias Baur, Michael Dietz, Alexander Heimerl, Anna Penzkofer, Dominik Schiller, François Brémond, Jan Alexandersson, Elisabeth André, Andreas Bulling:
MultiMediate'24: Multi-Domain Engagement Estimation. 11377-11382
- view
  authority control:
- export record
  dblp key:
  - conf/mm/KumarMSDR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/KumarMSDR24
Deepak Kumar, Surbhi Madan, Pradeep Singh, Abhinav Dhall, Balasubramanian Raman:
Towards Engagement Prediction: A Cross-Modality Dual-Pipeline Approach using Visual and Audio Features. 11383-11389
- view
  authority control:
- export record
  dblp key:
  - conf/mm/MaH0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/MaH0L24
Fuyan Ma, Yiran He, Bin Sun, Shutao Li:
Less is More: Adaptive Feature Selection and Fusion for Eye Contact Detection. 11390-11396
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiYCZJXLWH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiYCZJXLWH24
Jia Li, Yangchen Yu, Yin Chen, Yu Zhang, Peng Jia, Yunbo Xu, Ziqiang Li, Meng Wang, Richang Hong:
DAT: Dialogue-Aware Transformer with Modality-Group Fusion for Human Engagement Estimation. 11397-11403
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Zhao0LZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Zhao0LZZ24
Yu Zhao, Hao Fei, Bobo Li, Meishan Zhang, Min Zhang:
The ACM Multimedia 2024 Viual Spatial Description Grand Challenge. 11404-11406
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuZZYZSZLSLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuZZYZSZLSLZ24
Jun Yu, Yunxiang Zhang, Zerui Zhang, Zhao Yang, Gongpeng Zhao, Fengzhao Sun, Fanrui Zhang, Qingsong Liu, Jianqing Sun, Jiaen Liang, Yaohui Zhang:
RAG-Guided Large Language Models for Visual Spatial Description with Adaptive Hallucination Corrector. 11407-11413
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0TLZMS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0TLZMS024
Jiabao Wang, Fang Gao, Jingfeng Tang, Shaodong Li, Hanbo Zheng, Shengheng Ma, Feng Shuang, Jun Yu:
A Method for Visual Spatial Description Based on Large Language Model Fine-tuning. 11414-11419
- view
  authority control:
- export record
  dblp key:
  - conf/mm/JinLZHGTLWWM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/JinLZHGTLWWM24
Yizhang Jin, Jian Li, Jiangning Zhang, Jianlong Hu, Zhenye Gan, Xin Tan, Yong Liu, Yabiao Wang, Chengjie Wang, Lizhuang Ma:
LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description. 11420-11425
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Ge0YZTZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Ge0YZTZ24
Zhiqi Ge, Juncheng Li, Qifan Yu, Wei Zhou, Siliang Tang, Yueting Zhuang:
DEMON24: ACM MM24 Demonstrative Instruction Following Challenge. 11426-11428
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Fu24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Fu24
Xian Fu:
Enhancing Multimodal Large Language Models on Demonstrative Multi-Image Instructions. 11429-11434
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WeiSXZLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WeiSXZLW24
Jingyu Wei, Yi Su, Kele Xu, Lingbin Zeng, Bo Liu, Huaimin Wang:
Demonstrative Instruction Following in Multimodal LLMs via Integrating Low-Rank Adaptation with Ensemble Learning. 11435-11441
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WuLHZW0LC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WuLHZW0LC24
Bo Wu, Peiye Liu, Qiushi Huang, Zhaoyang Zeng, Jia Wang, Bei Liu, Jiebo Luo, Wen-Huang Cheng:
SMP Challenge Summary: Social Media Prediction Challenge. 11442-11444
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinL24
Yu-Shi Lin, Anthony J. T. Lee:
MMF: Winning Solution to Social Media Popularity Prediction Challenge 2024. 11445-11449
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuCYWCZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuCYWCZ24
Wenhao Hu, Weilong Chen, Weimin Yuan, Yan Wang, Shimin Cai, Yanru Zhang:
Dual-Stream Pre-Training Transformer to Enhance Multimodal Learning for Social Media Prediction. 11450-11456
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TuWXJXY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TuWXJXY24
Mingsheng Tu, Tianjiao Wan, Qisheng Xu, Xinhao Jiang, Kele Xu, Cheng Yang:
Higher-Order Vision-Language Alignment for Social Media Prediction. 11457-11463
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HsuLLCJT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HsuLLCJT24
Chih-Chung Hsu, Chia-Ming Lee, Yu-Fan Lin, Yi-Shiuan Chou, Chih-Yu Jian, Chi-Han Tsai:
Revisiting Vision-Language Features Adaptation and Inconsistency for Social Media Popularity Prediction. 11464-11469
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SongYCQXLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SongYCQXLY24
Shien Song, Jie Yang, Jin Chen, Han Qi, Yifei Xue, Yizhen Lao, Yi Yu:
ACM Multimedia 2024 Grand Challenge Report for Artificial Intelligence Generated Image Detection. 11470-11471
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Fu24a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Fu24a
Huihui Fu:
Optimizing AIGC Image Detection: Strategies in Data Augmentation and Model Architecture. 11472-11474
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiWW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiWW24
ShiHang Li, Haishan Wu, Biao Wang:
A Solution to ACMMM 2024 on Artificial Intelligence Generated Image Detection. 11475-11477
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Chen24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Chen24
Jin Chen:
Optimizing the Baseline Approach for the 2024 ACM Multimedia Grand Challenge in Artificial Intelligence Generated Image Detection. 11478-11481
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SeeLDLYCLHW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SeeLDLYCLHW24
John See, Jingting Li, Adrian K. Davison, Gen-Bing Liong, Moi Hoon Yap, Wen-Huang Cheng, Xiaobai Li, Xiaopeng Hong, Su-Jing Wang:
MEGC2024: ACM Multimedia 2024 Facial Micro-Expression Grand Challenge. 11482-11483
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuZZHZYLSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuZZHZYLSL24
Jun Yu, Gongpeng Zhao, Yaohui Zhang, Peng He, Zerui Zhang, Zhao Yang, Qingsong Liu, Jianqing Sun, Jiaen Liang:
Temporal-Informative Adapters in VideoMAE V2 and Multi-Scale Feature Fusion for Micro-Expression Spotting-then-Recognize. 11484-11489
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0001ZZHZCLSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001ZZHZCLSL24
Jun Yu, Yaohui Zhang, Gongpeng Zhao, Peng He, Zerui Zhang, Zhongpeng Cai, Qingsong Liu, Jianqing Sun, Jiaen Liang:
Micro-Expression Spotting Based on Optical Flow Feature with Boundary Calibration. 11490-11496
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZMLWXC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZMLWXC24
Zhengye Zhang, Sirui Zhao, Xinglong Mao, Shifeng Liu, Hao Wang, Tong Xu, Enhong Chen:
A Multi-scale Feature Learning Network with Optical Flow Correction for Micro- and Macro-expression Spotting. 11497-11502
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HeLW0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HeLW0024
Yuhong He, Wenchao Liu, Guangyu Wang, Lin Ma, Haifeng Li:
Enhancing Micro-Expression Analysis Performance by Effectively Addressing Data Imbalance. 11503-11507

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.