


18th ECCV 2024: Milan, Italy - Part LVI
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
  Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LVI. Lecture Notes in Computer Science 15114, Springer 2025, ISBN 978-3-031-72991-1
- Nina Shvetsova, Anna Kukleva, Xudong Hong, Christian Rupprecht, Bernt Schiele, Hilde Kuehne:
  HowToCaption: Prompting LLMs to Transform Video Annotations at Scale. 1-18
- Sanmin Kim, Youngseok Kim, Sihwan Hwang, Hyeonjun Jeong, Dongsuk Kum:
  LabelDistill: Label-Guided Cross-Modal Knowledge Distillation for Camera-Based 3D Object Detection. 19-37
- Hyeong-Seok Jeon, Sanmin Kim, Abi Rahman Syamil, Junsoo Kim, Dongsuk Kum:
  Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction. 38-53
- Hasan Abed Al Kader Hammoud, Tuhin Das, Fabio Pizzati, Philip H. S. Torr, Adel Bibi, Bernard Ghanem:
  On Pretraining Data Diversity for Self-Supervised Learning. 54-71
- Gianluca Scarpellini, Stefano Rosa, Pietro Morerio, Lorenzo Natale, Alessio Del Bue:
  Look Around and Learn: Self-training Object Detection by Exploration. 72-88
- Ozan Unal, Christos Sakaridis, Luc Van Gool:
  Bayesian Self-training for Semi-supervised 3D Segmentation. 89-107
- Zhongyang Ren, Bangyan Liao, Delei Kong, Jinghang Li, Peidong Liu, Laurent Kneip, Guillermo Gallego, Yi Zhou:
  Motion and Structure from Event-Based Normal Flow. 108-125
- Qiran Zou, Shangyuan Yuan, Shian Du, Yu Wang, Chang Liu, Yi Xu, Jie Chen, Xiangyang Ji:
  ParCo: Part-Coordinating Text-to-Motion Synthesis. 126-143
- Zheng Zhang, Wenjie Ai, Kevin Wells, David Rosewarne, Thanh-Toan Do, Gustavo Carneiro:
  Learning to Complement and to Defer to Multiple Users. 144-162
- Qingyuan Wang, Barry Cardiff, Antoine Frappé, Benoit Larras, Deepu John:
  Tiny Models are the Computational Saver for Large Models. 163-182
- Yufan Deng, Ruida Wang, Yuhao Zhang, Yu-Wing Tai, Chi-Keung Tang:
  DragVideo: Interactive Drag-Style Video Editing. 183-199
- Zeqian Li, Qirui Chen, Tengda Han, Ya Zhang, Yanfeng Wang, Weidi Xie:
  Multi-sentence Grounding for Long-Term Instructional Video. 200-216
- Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Ayan Kumar Bhunia, Yi-Zhe Song:
  Do Generalised Classifiers Really Work on Human Drawn Sketches? 217-235
- Zhihao Xu, Shengjie Gong, Jiapeng Tang, Lingyu Liang, Yining Huang, Haojie Li, Shuangping Huang:
  KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding. 236-253
- Yuxiao He, Yiyu Zhuang, Yanwen Wang, Yao Yao, Siyu Zhu, Xiaoyu Li, Qi Zhang, Xun Cao, Hao Zhu:
  Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°. 254-272
- Rui Zhao, Yuchao Gu, Jay Zhangjie Wu, David Junhao Zhang, Jia-Wei Liu, Weijia Wu, Jussi Keppo, Mike Zheng Shou:
  MotionDirector: Motion Customization of Text-to-Video Diffusion Models. 273-290
- Yang Wu, Kaihua Zhang, Jianjun Qian, Jin Xie, Jian Yang:
  Text2LiDAR: Text-Guided LiDAR Point Cloud Generation via Equirectangular Transformer. 291-310
- Sungjune Kim, Hadam Baek, Seunggwan Lee, Hyung-Gun Chi, Hyerin Lim, Jinkyu Kim, Sangpil Kim:
  Enhanced Motion Forecasting with Visual Relation Reasoning. 311-328
- Jinming Liu, Ruoyu Feng, Yunpeng Qi, Qiuyu Chen, Zhibo Chen, Wenjun Zeng, Xin Jin:
  Rate-Distortion-Cognition Controllable Versatile Neural Image Compression. 329-348
- Zixuan Fu, Lanqing Guo, Chong Wang, Yufei Wang, Zhihao Li, Bihan Wen:
  Temporal As a Plugin: Unsupervised Video Denoising with Pre-trained Image Denoisers. 349-367
- Yujeong Chae, Hyeonseong Kim, Changgyoon Oh, Minseok Kim, Kuk-Jin Yoon:
  LiDAR-Based All-Weather 3D Object Detection via Prompting and Distilling 4D Radar. 368-385
- Xin Liu, Yichen Zhu, Jindong Gu, Yunshi Lan, Chao Yang, Yu Qiao:
  MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models. 386-403
- Siao Tang, Xin Wang, Hong Chen, Chaoyu Guan, Zewen Wu, Yansong Tang, Wenwu Zhu:
  Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models. 404-420
- Eric Brachmann, Jamie Wynn, Shuai Chen, Tommaso Cavallari, Áron Monszpart, Daniyar Turmukhambetov, Victor Adrian Prisacariu:
  Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer. 421-440
- Ruicheng Wang, Jianfeng Xiang, Jiaolong Yang, Xin Tong:
  Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-trained Diffusion Priors. 441-458
- Xinyu Yang, Hossein Rahmani, Sue Black, Bryan M. Williams:
  Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation. 459-478
- Ming Tao, Bing-Kun Bao, Hao Tang, Yaowei Wang, Changsheng Xu:
  StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion. 479-495
