


MMM 2024, Amsterdam, The Netherlands - Part I
- Stevan Rudinac, Alan Hanjalic, Cynthia C. S. Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata:
MultiMedia Modeling - 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 - February 2, 2024, Proceedings, Part I. Lecture Notes in Computer Science 14554, Springer 2024, ISBN 978-3-031-53304-4
- Chi-Yu Chen, Pu Ching, Pei-Hsin Huang, Min-Chun Tien:
Where Are Biases? Adversarial Debiasing with Spurious Feature Visualization. 1-14
- Mengying Xu, Hanjiang Lai, Jian Yin:
Cross-Modal Hash Retrieval with Category Semantics. 15-27
- Min Li, Fengfa Li, Bo Meng, Ruwen Bai, Junxing Ren, Zihao Huang, Chenghua Gao:
Spatiotemporal Representation Enhanced ViT for Video Recognition. 28-40
- Kedi Qiu, Shoudong Shi, Tianxiang Zhao, Yongfang Ye:
SCFormer: A Vision Transformer with Split Channel in Sitting Posture Recognition. 41-52
- Zebin Li, Jianping Luo:
Dive into Coarse-to-Fine Strategy in Single Image Deblurring. 53-65
- Yuhang Yang, Xiao Yan, Sanyuan Zhang:
TICondition: Expanding Control Capabilities for Text-to-Image Generation with Multi-Modal Conditions. 66-79
- Zhe Kong, Neng Gao, Yifei Zhang, Yuhan Liu:
Enhancing Generative Generalized Zero Shot Learning via Multi-Space Constraints and Adaptive Integration. 80-93
- Chen-Hsiu Huang, Ja-Ling Wu:
Joint Image Data Hiding and Rate-Distortion Optimization in Neural Compressed Latent Representations. 94-108
- Jixuan Hong, Jingjing Xie, Xueqin He, Chenhui Yang:
GSUNet: A Brain Tumor Segmentation Method Based on 3D Ghost Shuffle U-Net. 109-120
- Youkai Wang, Yue Hu, Wansen Wu, Ting Liu, Yong Peng:
ACT: Action-assoCiated and Target-Related Representations for Object Navigation. 121-133
- Die Yu, Zhaoyan Fang, Yong Jiang:
Foreground Feature Enhancement and Peak & Background Suppression for Fine-Grained Visual Classification. 134-146
- Jinyu Shi, Wenjie Wu:
YOLOv5-SRR: Enhancing YOLOv5 for Effective Underwater Target Detection. 147-158
- Yongqi Liu, Jiashuang Zhou, Xiaoqin Du:
Image Clustering and Generation with HDGMVAE-I. 159-171
- Anqi Zhang, Guangyu Gao, Zhuocheng Lv, Yukun An:
"Car or Bus?" CLearSeg: CLIP-Enhanced Discrimination Among Resembling Classes for Few-Shot Semantic Segmentation. 172-186
- Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin:
PANDA: Prompt-Based Context- and Indoor-Aware Pretraining for Vision and Language Navigation. 187-200
- Wenjun Gan, Jiawei Liu, Yangchun Zhu, Yong Wu, Guozhi Zhao, Zheng-Jun Zha:
Cross-Modal Semantic Alignment Learning for Text-Based Person Search. 201-215
- Lisa Liu, William Y. Wang, Pingping Cai:
Point Cloud Classification via Learnable Memory Bank. 216-229
- William Y. Wang, Lisa Liu, Pingping Cai:
Adversarially Regularized Low-Light Image Enhancement. 230-243
- Yuan Zhou, Xin Chen, Yanrong Guo, Jun Yu, Richang Hong, Qi Tian:
Advancing Incremental Few-Shot Semantic Segmentation via Semantic-Guided Relation Alignment and Adaptation. 244-257
- Zhengye Shen, Guangtong Lu, Qian Qiao, Fanzhang Li:
PMGCN: Preserving Measuring Mapping Prototype Graph Calibration Network for Few-Shot Learning. 258-272
- Zituo Li, Jianbin Sun, Yuqi Qin, Lunhao Ju, Ke-Wei Yang:
ARE-CAM: An Interpretable Approach to Quantitatively Evaluating the Adversarial Robustness of Deep Models Based on CAM. 273-285
- Bei Liu, Jian Zhang, Tianwen Yuan, Peng Huang, Chengwei Feng, Minghe Li:
SSK-Yolo: Global Feature-Driven Small Object Detection Network for Images. 286-299
- Zixuan Hong, Weipeng Cao, Zhiwu Xu, Zhenru Chen, Xi Tao, Zhong Ming, Chuqing Cao, Liang Zheng:
MetaVSR: A Novel Approach to Video Super-Resolution for Arbitrary Magnification. 300-313
- Yehong Pan, Jian Wang, Guihong Liu, Qiushuo Wu, Yazi Zheng, Xin Lan, Weibo Liang, Jiancheng Lv, Yuan Li:
From Skulls to Faces: A Deep Generative Framework for Realistic 3D Craniofacial Reconstruction. 314-326
- Wei Liu, Jiahuan Wang, Chao Wang, Yan Peng, Shaorong Xie:
Structure-Aware Adaptive Hybrid Interaction Modeling for Image-Text Matching. 327-341
- Vaibhav Mudgal, Qingyang Wang, Lorin Sweeney, Alan F. Smeaton:
Using Saliency and Cropping to Improve Video Memorability. 342-355
- Shuaiwei Wang, Zhao Liu, Jie Lei, Zunlei Feng, Juan Xu, Xuan Li, Ronghua Liang:
Contextual Augmentation with Bias Adaptive for Few-Shot Video Object Segmentation. 356-369
- Feng Chen, Xin Song, Liang Zhu:
A Lightweight Local Attention Network for Image Super-Resolution. 370-384
- Qiulin Li, Junhao Qiang, Qun Yang:
Domain Adaptation for Speaker Verification Based on Self-supervised Learning with Adversarial Training. 385-395
- Qian Cao, Dongdong Zhang, Chengyu Sun:
Quality Scalable Video Coding Based on Neural Representation. 396-409
- Zijian Lin, Jianping Luo:
Hierarchical Bi-directional Temporal Context Mining for Improved Video Compression. 410-421
- Yongyu Liu, Guoliang Lin, Hanjiang Lai, Yan Pan:
MAMixer: Multivariate Time Series Forecasting via Multi-axis Mixing. 422-435
- Kun Zhang, Chunling Gao, Shuangyuan Yang:
A Custom GAN-Based Robust Algorithm for Medical Image Watermarking. 436-447
- Xiaoting Li, Shouhong Wan, Hantao Zhang, Peiquan Jin:
A Detail-Guided Multi-source Fusion Network for Remote Sensing Object Detection. 448-461
- Qiuxian Li, Quanxing Zhou, Hongfa Ding:
A Secure and Fair Federated Learning Protocol Under the Universal Composability Framework. 462-474
- Kang Yi, Haoran Tang, Hongyu Bai, Yinjie Wang, Jing Xu, Ping Li:
Bi-directional Interaction and Dense Aggregation Network for RGB-D Salient Object Detection. 475-489
- Sizheng Guo, Haozhe Yang, Xianming Lin:
Face Forgery Detection via Texture and Saliency Enhancement. 490-502
