


default search action
18th ECCV 2024: Milan, Italy - Part XV
- Ales Leonardis

, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XV. Lecture Notes in Computer Science 15073, Springer 2025, ISBN 978-3-031-72632-3 - Yinghao Xu, Zifan Shi, Yifan Wang, Hansheng Chen, Ceyuan Yang, Sida Peng, Yujun Shen, Gordon Wetzstein:

GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation. 1-20 - Yidan Zhang, Ting Zhang, Dong Chen, Yujing Wang, Qi Chen, Xing Xie

, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Mao Yang, Qingmin Liao, Jingdong Wang, Baining Guo:
IRGen: Generative Modeling for Image Retrieval. 21-41 - Kyu Ri Park

, Hong Joo Lee
, Jung Uk Kim
:
Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality. 42-59 - Florian Langer, Jihong Ju, Georgi Dikov, Gerhard Reitmayr, Mohsen Ghafoorian:

FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos. 60-77 - Wouter Van Gansbeke, Bert De Brabandere:

A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting. 78-97 - Cilin Yan, Haochen Wang, Shilin Yan, Xiaolong Jiang, Yao Hu, Guoliang Kang, Weidi Xie, Efstratios Gavves:

VISA: Reasoning Video Object Segmentation via Large Language Models. 98-115 - Saman Motamed

, Danda Pani Paudel
, Luc Van Gool
:
Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models. 116-133 - Yuanhao Zhai

, Kevin Lin
, Linjie Li
, Chung-Ching Lin
, Jianfeng Wang
, Zhengyuan Yang
, David S. Doermann
, Junsong Yuan
, Zicheng Liu
, Lijuan Wang
:
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation. 134-152 - Ryo Nakamura

, Ryu Tadokoro
, Ryosuke Yamada
, Yuki M. Asano
, Iro Laina
, Christian Rupprecht
, Nakamasa Inoue
, Rio Yokota
, Hirokatsu Kataoka
:
Scaling Backwards: Minimal Synthetic Pre-Training? 153-171 - Ekkasit Pinyoanuntapong

, Muhammad Usama Saleem
, Pu Wang
, Minwoo Lee
, Srijan Das
, Chen Chen
:
BAMM: Bidirectional Autoregressive Motion Model. 172-190 - Jiahui Yuan, Hebei Li, Yansong Peng, Jin Wang, Yuheng Jiang, Yueyi Zhang, Xiaoyan Sun:

Event-Based Head Pose Estimation: Benchmark and Method. 191-208 - Ekta Prashnani, Koki Nagano, Shalini De Mello, David Luebke, Orazio Gallo:

Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos. 209-228 - Guangyu Sun

, Matías Mendieta
, Aritra Dutta
, Xin Li
, Chen Chen
:
Towards Multi-modal Transformers in Federated Learning. 229-246 - Wenke Huang

, Mang Ye
, Zekun Shi
, Bo Du
, Dacheng Tao
:
Fisher Calibration for Backdoor-Robust Heterogeneous Federated Learning. 247-265 - Pengbo Guo

, Chengxu Liu
, Xingsong Hou
, Xueming Qian
:
QueryCDR: Query-Based Controllable Distortion Rectification Network for Fisheye Images. 266-284 - Shishira R. Maiya

, Anubhav Gupta
, Matthew Gwilliam
, Max Ehrlich
, Abhinav Shrivastava
:
Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics. 285-302 - Shrey Singh

, Prateek Keserwani
, Masakazu Iwamura
, Partha Pratim Roy
:
DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution. 303-320 - Jeongmin Bae

, Seoha Kim
, Youngsik Yun
, Hahyun Lee
, Gun Bang
, Youngjung Uh
:
Per-Gaussian Embedding-Based Deformation for Deformable 3D Gaussian Splatting. 321-335 - Liao Shen

, Tianqi Liu
, Huiqiang Sun
, Xinyi Ye
, Baopu Li
, Jianming Zhang
, Zhiguo Cao
:
DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion. 336-353 - Shuang Hao

, Chunlin Zhong
, He Tang
:
CoLA: Conditional Dropout and Language-Driven Robust Dual-Modal Salient Object Detection. 354-371 - Zhiyu Wu, Jinshi Cui:

Image-Feature Weak-to-Strong Consistency: An Enhanced Paradigm for Semi-supervised Learning. 372-388 - Qingtian Zhu, Zizhuang Wei, Zhongtian Zheng, Yifan Zhan, Zhuyu Yao, Jiawang Zhang, Kejian Wu, Yinqiang Zheng:

RPBG: Towards Robust Neural Point-Based Graphics in the Wild. 389-406 - Jiahao Chang, Yinglin Xu, Yihao Li, Yuantao Chen, Wensen Feng, Xiaoguang Han:

GaussReg: Fast 3D Registration with Gaussian Splatting. 407-423 - Yifan Pu

, Zhuofan Xia, Jiayi Guo
, Dongchen Han
, Qixiu Li
, Duo Li, Yuhui Yuan, Ji Li, Yizeng Han, Shiji Song, Gao Huang, Xiu Li:
Efficient Diffusion Transformer with Step-Wise Dynamic Attention Mediators. 424-441 - Pengfei Wang

, Yuxi Wang
, Shuai Li
, Zhaoxiang Zhang, Zhen Lei, Lei Zhang
:
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation. 442-460 - Kihwan Yoon

, Yong Han Kim
, Sungjei Kim
, Jinwoo Jeong
:
IAM-VFI : Interpolate Any Motion for Video Frame Interpolation with Motion Complexity Map. 461-477 - Siyi Du

, Shaoming Zheng
, Yinsong Wang
, Wenjia Bai
, Declan P. O'Regan
, Chen Qin
:
TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data. 478-496

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














