


default search action
18th ECCV 2024: Milan, Italy - Part LXXVI
- Ales Leonardis

, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXXVI. Lecture Notes in Computer Science 15134, Springer 2025, ISBN 978-3-031-73115-0 - Quan Kong, Yuki Kawana, Rajat Saini, Ashutosh Kumar, Jingjing Pan, Ta Gu, Yohei Ozao, Balazs Opra

, Yoichi Sato, Norimasa Kobori:
WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-Grained Spatial-Temporal Understanding. 1-18 - Yuetong Fang

, Ziqing Wang
, Lingfeng Zhang, Jiahang Cao, Honglei Chen, Renjing Xu
:
Spiking Wavelet Transformer. 19-37 - Yutang Feng, Sicheng Gao, Yuxiang Bao, Xiaodi Wang, Shumin Han, Juan Zhang

, Baochang Zhang, Angela Yao:
WAVE: Warping DDIM Inversion Features for Zero-Shot Text-to-Video Editing. 38-55 - Mingle Zhou

, Rui Xing
, Delong Han
, Zhiyong Qi, Gang Li
:
PDT: Uav Target Detection Dataset for Pests and Diseases Tree. 56-72 - Fazilet Gokbudak

, Alejandro Sztrajman
, Chenliang Zhou
, Fangcheng Zhong
, Rafal Mantiuk
, Cengiz Öztireli
:
Hypernetworks for Generalizable BRDF Representation. 73-89 - Lucas J. Koerner

, Shantanu Gupta
, Atul Ingle
, Mohit Gupta
:
Photon Inhibition for Energy-Efficient Single-Photon Imaging. 90-107 - Haoran Yang, Chuan-Xian Ren

, You-Wei Luo
:
COD: Learning Conditional Invariant Representation for Domain Adaptation Regression. 108-125 - Benno Buschmann, Andreea Dogaru, Elmar Eisemann, Michael Weinmann

, Bernhard Egger:
RANRAC: Robust Neural Scene Representations via Random Ray Consensus. 126-143 - Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu:

LayerDiff: Exploring Text-Guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model. 144-160 - Adrián Rodríguez-Muñoz

, Tongzhou Wang, Antonio Torralba:
Characterizing Model Robustness via Natural Input Gradients. 161-178 - Bharath Raj Nagoor Kani, Hsin-Ying Lee, Sergey Tulyakov, Shubham Tulsiani:

UpFusion: Novel View Diffusion from Unposed Sparse View Observations. 179-195 - Ozan Unal

, Christos Sakaridis, Suman Saha, Luc Van Gool:
Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding. 196-213 - Abhishek Singh, Vivek Sharma, Rohan Sukumaran, John Mose, Jeffrey Chiu, Justin Yu, Ramesh Raskar:

SIMBA: Split Inference - Mechanisms, Benchmarks and Attacks. 214-232 - Pengzhi Li

, Qiang Nie, Ying Chen, Xi Jiang, Kai Wu, Yuhuan Lin, Yong Liu, Jinlong Peng, Chengjie Wang
, Feng Zheng:
Tuning-Free Image Customization with Image and Text Guidance. 233-250 - Yu Tian

, Congcong Wen
, Min Shi
, Muhammad Muneeb Afzal, Hao Huang
, Muhammad Osama Khan, Yan Luo
, Yi Fang
, Mengyu Wang
:
FairDomain: Achieving Fairness in Cross-Domain Medical Image Segmentation and Classification. 251-271 - Hyesong Choi, Hunsang Lee, Seyoung Joung, Hyejin Park

, Jiyeong Kim, Dongbo Min:
Emerging Property of Masked Token for Effective Pre-training. 272-289 - Yi-Xin Huang, Hou-I Liu, Hong-Han Shuai, Wen-Huang Cheng:

DQ-DETR: DETR with Dynamic Query for Tiny Object Detection. 290-305 - Homanga Bharadhwaj, Roozbeh Mottaghi, Abhinav Gupta, Shubham Tulsiani:

Track2Act: Predicting Point Tracks from Internet Videos Enables Generalizable Robot Manipulation. 306-324 - Hiba Dahmani

, Moussâb Bennehar
, Nathan Piasco
, Luis Roldão, Dzmitry Tsishkou:
SWAG: Splatting in the Wild Images with Appearance-Conditioned Gaussians. 325-340 - Dongbin Zhang, Chuming Wang, Weitao Wang, Peihao Li, Minghan Qin

, Haoqian Wang:
Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections. 341-359 - Qingfeng Shi

, Jing Wei
, Fei Shen
, Zhengtao Zhang
:
Few-Shot Defect Image Generation Based on Consistency Modeling. 360-376 - Ada-Astrid Balauca

, Danda Pani Paudel
, Kristina Toutanova, Luc Van Gool
:
Taming CLIP for Fine-Grained and Structured Visual Understanding of Museum Exhibits. 377-394 - Yassine Ouali, Adrian Bulat, Brais Martínez, Georgios Tzimiropoulos:

CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs. 395-413 - Yuehui Han, Can Xu, Rui Xu, Jianjun Qian, Jin Xie:

Masked Motion Prediction with Semantic Contrast for Point Cloud Sequence Learning. 414-431 - Zixuan Chen

, Zewei He
, Ziqian Lu
, Xuecheng Sun
, Zhe-Ming Lu
:
Prompt-Based Test-Time Real Image Dehazing: A Novel Pipeline. 432-449 - Uriel Singer, Amit Zohar, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Devi Parikh, Yaniv Taigman:

Video Editing via Factorized Diffusion Distillation. 450-466 - Benjamin Gallusser

, Martin Weigert
:
TRACKASTRA: Transformer-Based Cell Tracking for Live-Cell Microscopy. 467-484

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














