


18th ECCV 2024: Milan, Italy - Part L
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol: Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part L. Lecture Notes in Computer Science 15108, Springer 2025, ISBN 978-3-031-72972-0
- Xinpeng Liu, Haowen Hou, Yanchao Yang, Yong-Lu Li, Cewu Lu: Revisit Human-Scene Interaction via Space Occupancy. 1-19
- Yue Han, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang, Chengjie Wang, Yong Liu: Face-Adapter for Pre-trained Diffusion Models with Fine-Grained ID and Attribute Control. 20-36
- Haisheng Fu, Jie Liang, Zhenman Fang, Jingning Han, Feng Liang, Guohe Zhang: WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model. 37-53
- Pengyu Li, Tianchu Guo, Biao Wang, Xian-Sheng Hua: Grid-Attention: Enhancing Computational Efficiency of Large Vision Models Without Fine-Tuning. 54-70
- Gilhan Park, WonJun Moon, SuBeen Lee, Tae-Young Kim, Jae-Pil Heo: Mitigating Background Shift in Class-Incremental Semantic Segmentation. 71-88
- Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei, Badong Chen, Xuguang Lan: Relation DETR: Exploring Explicit Position Relation Prior for Object Detection. 89-105
- Zekai Xu, Kang You, Qinghai Guo, Xiang Wang, Zhezhi He: BKDSNN: Enhancing the Performance of Learning-Based Spiking Neural Networks Training with Blurred Knowledge Distillation. 106-123
- Dongchen Han, Tianzhu Ye, Yizeng Han, Zhuofan Xia, Siyuan Pan, Pengfei Wan, Shiji Song, Gao Huang: Agent Attention: On the Integration of Softmax and Linear Attention. 124-140
- Quoc-Huy Tran, Muhammad Ahmed, Murad Popattia, M. Hassan Ahmed, Andrey Konin, M. Zeeshan Zia: Learning by Aligning 2D Skeleton Sequences and Multi-modality Fusion. 141-161
- Kohei Ashida, Hiroaki Santo, Fumio Okura, Yasuyuki Matsushita: Resolving Scale Ambiguity in Multi-view 3D Reconstruction Using Dual-Pixel Sensors. 162-178
- Shibin Mei, Bingbing Ni, Hang Wang, Chenglong Zhao, Fengfa Hu, Zhiming Pi, Bilian Ke: Object-Oriented Anchoring and Modal Alignment in Multimodal Learning. 179-196
- Jiabao Wang, Qiang Meng, Guochao Liu, Liujiang Yan, Ke Wang, Ming-Ming Cheng, Qibin Hou: Towards Stable 3D Object Detection. 197-213
- Byunggwan Son, Youngmin Oh, Donghyeon Baek, Bumsub Ham: FYI: Flip Your Images for Dataset Distillation. 214-230
- Hyeonseong Kim, Sung-Hoon Yoon, Minseok Kim, Kuk-Jin Yoon: On-the-Fly Category Discovery for LiDAR Semantic Segmentation. 231-249
- Renlong Wu, Zhilu Zhang, Yu Yang, Wangmeng Zuo: Dual-Camera Smooth Zoom on Mobile Phones. 250-269
- Xumin Yu, Yanbo Wang, Jie Zhou, Jiwen Lu: ProtoComp: Diverse Point Cloud Completion with Controllable Prototype. 270-286
- Long Li, Nian Liu, Dingwen Zhang, Zhongyu Li, Salman Khan, Rao Muhammad Anwer, Hisham Cholakkal, Junwei Han, Fahad Shahbaz Khan: CONDA: Condensed Deep Association Learning for Co-salient Object Detection. 287-303
- Ge Wu, Xin Zhang, Zheng Li, Zhaowei Chen, Jiajun Liang, Jian Yang, Xiang Li: Cascade Prompt Learning for Vision-Language Model Adaptation. 304-321
- Yuzhou Liu, Lingjie Zhu, Xiaodong Ma, Hanqiao Ye, Xiang Gao, Xianwei Zheng, Shuhan Shen: PolyRoom: Room-Aware Transformer for Floorplan Reconstruction. 322-339
- Rizhao Cai, Zirui Song, Dayan Guan, Zhenhao Chen, Yaohang Li, Xing Luo, Chenyu Yi, Alex C. Kot: BenchLMM: Benchmarking Cross-Style Visual Capability of Large Multimodal Models. 340-358
- Mingjun Zheng, Long Sun, Jiangxin Dong, Jinshan Pan: SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution. 359-375
- Zhongyu Xia, Zhiwei Lin, Xinhao Wang, Yongtao Wang, Yun Xing, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang: HENet: Hybrid Encoding for End-to-End Multi-task 3D Perception from Multi-view Cameras. 376-392
- Bowei Xing, Xianghua Ying, Ruibin Wang, Ruohao Guo, Ji Shi, Wenzhen Yue: Hierarchical Unsupervised Relation Distillation for Source Free Domain Adaptation. 393-409
- Jian Jin, Yang Shen, Zhenyong Fu, Jian Yang: Customized Generation Reimagined: Fidelity and Editability Harmonized. 410-426
- Kaishen Yuan, Zitong Yu, Xin Liu, Weicheng Xie, Huanjing Yue, Jingyu Yang: AUFormer: Vision Transformers Are Parameter-Efficient Facial Action Unit Detectors. 427-445
- Yikang Zhou, Tao Zhang, Shunping Ji, Shuicheng Yan, Xiangtai Li: Improving Video Segmentation via Dynamic Anchor Queries. 446-463
- Shunqi Mao, Chaoyi Zhang, Hang Su, Hwanjun Song, Igor Shalyminov, Weidong Cai: Controllable Contextualized Image Captioning: Directing the Visual Narrative Through User-Defined Highlights. 464-481
