


default search action
18th ECCV 2024: Milan, Italy - Part V
- Ales Leonardis

, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part V. Lecture Notes in Computer Science 15063, Springer 2025, ISBN 978-3-031-72651-4 - Zhengdi Yu

, Shaoli Huang
, Yongkang Cheng, Tolga Birdal
:
SignAvatars: A Large-Scale 3D Sign Language Holistic Motion Dataset and Benchmark. 1-19 - Lujun Li

, Zimian Wei, Peijie Dong, Wenhan Luo, Wei Xue, Qifeng Liu, Yike Guo
:
AttnZero: Efficient Attention Discovery for Vision Transformers. 20-37 - Lujun Li, Haosen Sun

, Shiwen Li, Peijie Dong, Wenhan Luo, Wei Xue, Qifeng Liu, Yike Guo
:
Auto-GAS: Automated Proxy Discovery for Training-Free Generative Architecture Search. 38-55 - Haosen Sun

, Lujun Li, Peijie Dong, Zimian Wei, Shitong Shao:
Auto-DAS: Automated Proxy Discovery for Training-Free Distillation-Aware Architecture Search. 56-73 - Zexiang Liu, Yangguang Li

, Youtian Lin, Xin Yu, Sida Peng, Yan-Pei Cao, Xiaojuan Qi, Xiaoshui Huang
, Ding Liang, Wanli Ouyang:
UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation. 74-91 - Huabin Liu

, Xiao Ma, Cheng Zhong, Yang Zhang, Weiyao Lin
:
TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning. 92-107 - Haejoon Lee

, Aswin C. Sankaranarayanan
:
Spectral Subsurface Scattering for Material Classification. 108-124 - Benjin Zhu

, Zhe Wang, Hongsheng Li
:
nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding. 125-141 - Xianrui Luo

, Huiqiang Sun
, Juewen Peng
, Zhiguo Cao
:
Dynamic Neural Radiance Field from Defocused Monocular Video. 142-159 - Yang Liu

, Pengxiang Ding, Siteng Huang, Min Zhang, Han Zhao, Donglin Wang:
PiTe: Pixel-Temporal Alignment for Large Video-Language Model. 160-176 - Shadi Hamdan

, Fatma Güney
:
CarFormer: Self-driving with Learned Object-Centric Representations. 177-193 - Wei Wu

, Qingnan Fan, Shuai Qin, Hong Gu, Ruoyu Zhao, Antoni B. Chan
:
FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models. 194-209 - Cheng Shi, Yuchen Zhu

, Sibei Yang:
Plain-Det: A Plain Multi-dataset Object Detector. 210-226 - Zhen Zhao

, Zicheng Wang
, Longyue Wang
, Dian Yu
, Yixuan Yuan
, Luping Zhou
:
Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation. 227-243 - Wei Cong

, Yang Cong
, Yuyang Liu
, Gan Sun
:
Cs2K: Class-Specific and Class-Shared Knowledge Guidance for Incremental Semantic Segmentation. 244-261 - Dongliang Cao

, Zorah Lähner
, Florian Bernard:
Synchronous Diffusion for Unsupervised Smooth Non-rigid 3D Shape Matching. 262-281 - David Fan

, Jue Wang
, Shuai Liao, Zhikang Zhang
, Vimal Bhat, Xinyu Li
:
Text-Guided Video Masked Autoencoder. 282-298 - Laurynas Karazija, Iro Laina, Andrea Vedaldi, Christian Rupprecht:

Diffusion Models for Open-Vocabulary Segmentation. 299-317 - Peixi Xiong

, Michael Kozuch, Nilesh Jain
:
Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation. 318-334 - Pengyu Zhang, Hao Yin, Zeren Wang, Wenyue Chen, Shengming Li, Dong Wang, Huchuan Lu, Xu Jia:

EvSign: Sign Language Recognition and Translation with Streaming Events. 335-351 - Pengxiang Ding, Han Zhao, Wenjie Zhang, Wenxuan Song, Min Zhang, Siteng Huang, Ningxi Yang, Donglin Wang:

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots. 352-367 - Huilin Zhu

, Jingling Yuan, Zhengwei Yang
, Yu Guo, Zheng Wang, Xian Zhong, Shengfeng He
:
Zero-Shot Object Counting with Good Exemplars. 368-385 - Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei:

TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering. 386-402 - Yanbo Wang

, Wentao Zhao
, Chuan Cao, Tianchen Deng
, Jingchuan Wang
, Weidong Chen:
SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds. 403-421 - Hyunjin Kim

, Minhyuk Sung
:
PartSTAD: 2D-to-3D Part Segmentation Task Adaptation. 422-439 - Rajeev Yasarla

, Manish Kumar Singh
, Hong Cai
, Yunxiao Shi, Jisoo Jeong
, Yinhao Zhu, Shizhong Han
, Risheek Garrepalli, Fatih Porikli
:
FutureDepth: Learning to Predict the Future Improves Video Depth Estimation. 440-458 - Yanyuan Qiao

, Qianyi Liu
, Jiajun Liu
, Jing Liu
, Qi Wu
:
LLM as Copilot for Coarse-Grained Vision-and-Language Navigation. 459-476

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














