


CVPR 2023: Vancouver, BC, Canada
- IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. IEEE 2023, ISBN 979-8-3503-0129-8
- Shikhar Bahl, Russell Mendonca, Lili Chen, Unnat Jain, Deepak Pathak:
Affordances from Human Videos as a Versatile Representation for Robotics. 1-13 - Lei Jin, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji:
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension. 1-10 - Adithya Pediredla, Srinivasa G. Narasimhan, Maysamreza Chamanzar, Ioannis Gkioulekas:
Megahertz Light Steering Without Moving Parts. 1-12 - Yu-Lun Liu, Chen Gao, Andreas Meuleman, Hung-Yu Tseng, Ayush Saraf, Changil Kim, Yung-Yu Chuang, Johannes Kopf, Jia-Bin Huang:
Robust Dynamic Radiance Fields. 13-23 - Yu Chen, Gim Hee Lee:
DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields. 24-34 - Bingfan Zhu, Yanchao Yang, Xulong Wang, Youyi Zheng, Leonidas J. Guibas:
VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization. 35-45 - Yifan Jiang, Peter Hedman, Ben Mildenhall, Dejia Xu, Jonathan T. Barron, Zhangyang Wang, Tianfan Xue:
AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training. 46-55 - Deborah Levy, Amit Peleg, Naama Pearl, Dan Rosenbaum, Derya Akkaynak, Simon Korman, Tali Treibitz:
SeaThru-NeRF: Neural Radiance Fields in Scattering Media. 56-65 - Brian K. S. Isaac-Medina, Chris G. Willcocks, Toby P. Breckon:
Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields. 66-75 - Liao Wang, Qiang Hu, Qihan He, Ziyu Wang, Jingyi Yu, Tinne Tuytelaars, Lan Xu, Minye Wu:
Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos. 76-87 - Han Yan, Celong Liu, Chao Ma, Xing Mei:
Plen-VDB: Memory Efficient VDB-Based Radiance Fields for Fast Training and Rendering. 88-96 - Xin Huang, Qi Zhang, Ying Feng, Xiaoyu Li, Xuan Wang, Qing Wang:
Local Implicit Ray Function for Generalizable Radiance Field Representation. 97-107 - Yiming Gao, Yan-Pei Cao, Ying Shan:
SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes. 108-118 - Yi Zhang, Xiaoyang Huang, Bingbing Ni, Wenjun Zhang, Teng Li:
Frequency-Modulated Point Cloud Rendering with Easy Editing. 119-129 - Ang Cao, Justin Johnson:
HexPlane: A Fast Representation for Dynamic Scenes. 130-141 - Markus Worchel, Marc Alexa:
Differentiable Shadow Mapping for Efficient Inverse Graphics. 142-153 - Peng Dai, Yinda Zhang, Xin Yu, Xiaoyang Lyu, Xiaojuan Qi:
Hybrid Neural Rendering for Large-Scale Scenes with Motion Blur. 154-164 - Haian Jin, Isabella Liu, Peijia Xu, Xiaoshuai Zhang, Songfang Han, Sai Bi, Xiaowei Zhou, Zexiang Xu, Hao Su:
TensoIR: Tensorial Inverse Rendering. 165-174 - Jingwang Ling, Zhibo Wang, Feng Xu:
ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision. 175-185 - S. Mahdi H. Miangoleh, Zoya Bylinskii, Eric Kee, Eli Shechtman, Yagiz Aksoy:
Realistic Saliency Guided Image Enhancement. 186-194 - Yiqun Mei, He Zhang, Xuaner Zhang, Jianming Zhang, Zhixin Shu, Yilin Wang, Zijun Wei, Shi Yan, Hyunjoon Jung, Vishal M. Patel:
LightPainter: Interactive Portrait Relighting with Freehand Scribble. 195-205 - Xianmin Xu, Yuxin Lin, Haoyang Zhou, Chong Zeng, Yaxin Yu, Kun Zhou, Hongzhi Wu:
A Unified Spatial-Angular Structured Light for Single-View Acquisition of Shape and Reflectance. 206-215 - Ruichen Zheng, Peng Li, Haoqian Wang, Tao Yu:
Learning Visibility Field for Detailed 3D Human Reconstruction and Relighting. 216-226 - Junbong Jang, Kwonmoo Lee, Tae-Kyun Kim:
Unsupervised Contour Tracking of Live Cells by Mechanical and Cycle Consistency Losses. 227-236 - Yu-Tao Liu, Li Wang, Jie Yang, Weikai Chen, Xiaoxu Meng, Bo Yang, Lin Gao:
NeUDF: Leaning Neural Unsigned Distance Fields with Volume Rendering. 237-247 - Xiaoxu Meng, Weikai Chen, Bo Yang:
NeAT: Learning Neural Implicit Surfaces with Arbitrary Topologies from Multi-View Images. 248-258 - Zhen Wang, Shijie Zhou, Jeong Joon Park, Despoina Paschalidou, Suya You, Gordon Wetzstein, Leonidas J. Guibas, Achuta Kadambi:
ALTO: Alternating Latent Topologies for Implicit 3D Reconstruction. 259-270 - Zhaoyang Lyu, Jinyi Wang, Yuwei An, Ya Zhang, Dahua Lin, Bo Dai:
Controllable Mesh Generation Through Sparse Latent Point Diffusion Models. 271-280 - Ke Li, Kaiyue Pang, Yi-Zhe Song:
Photo Pre-Training, But for Sketch. 275-285 - Simon Weber, Nikolaus Demmel, Tin Chon Chan, Daniel Cremers:
Power Bundle Adjustment for Large-Scale 3D Reconstruction. 281-289 - Aayush Bansal, Michael Zollhöfer:
Neural Pixel Composition for 3D-4D View Synthesis from Multi-Views. 290-299 - Chen-Hsuan Lin, Jun Gao, Luming Tang, Towaki Takikawa, Xiaohui Zeng, Xun Huang, Karsten Kreis, Sanja Fidler, Ming-Yu Liu, Tsung-Yi Lin:
Magic3D: High-Resolution Text-to-3D Content Creation. 300-309 - Li Ma, Xiaoyu Li, Jing Liao, Pedro V. Sander:
3D Video Loops from Asynchronous Input. 310-320 - Jiaxin Xie, Hao Ouyang, Jingtan Piao, Chenyang Lei, Qifeng Chen:
High-fidelity 3D GAN Inversion by Pseudo-multi-view Optimization. 321-331 - Leheng Li, Qing Lian, Luozhou Wang, Ningning Ma, Yingcong Chen:
Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field. 332-341 - Fei Yin, Yong Zhang, Xuan Wang, Tengfei Wang, Xiaoyu Li, Yuan Gong, Yanbo Fan, Xiaodong Cun, Ying Shan, Cengiz Öztireli, Yujiu Yang:
3D GAN Inversion with Facial Symmetry Prior. 342-351 - Diqiong Jiang, Dan Song, Ruofeng Tong, Min Tang:
StyleIPSB: Identity-Preserving Semantic Basis of StyleGAN for High Fidelity Face Swapping. 352-361 - Haoran Bai, Di Kang, Haoxian Zhang, Jinshan Pan, Linchao Bao:
FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction. 362-371 - Chunlu Li, Andreas Morel-Forster, Thomas Vetter, Bernhard Egger, Adam Kortylewski:
Robust Model-based Face Reconstruction through Weakly-Supervised Outlier Segmentation. 372-381 - Zhenyu Zhang, Renwang Chen, Weijian Cao, Ying Tai, Chengjie Wang:
Learning Neural Proto-Face Field for Disentangled 3D Face Modeling in the Wild. 382-393 - Biwen Lei, Jianqiang Ren, Mengyang Feng, Miaomiao Cui, Xuansong Xie:
A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images. 394-403 - Kacper Kania, Stephan J. Garbin, Andrea Tagliasacchi, Virginia Estellers, Kwang Moo Yi, Julien Valentin, Tomasz Trzcinski, Marek Kowalski:
BlendFields: Few-Shot Example-Driven Facial Modeling. 404-415 - Chuhan Chen, Matthew O'Toole, Gaurav Bharaj, Pablo Garrido:
Implicit Neural Head Synthesis via Controllable Local Deformation Fields. 416-426 - Youxin Pang, Yong Zhang, Weize Quan, Yanbo Fan, Xiaodong Cun, Ying Shan, Dong-Ming Yan:
DPE: Disentanglement of Pose and Expression for General Video Portrait Editing. 427-436 - Sijing Wu, Yichao Yan, Yunhao Li, Yuhao Cheng, Wenhan Zhu, Ke Gao, Xiaobo Li, Guangtao Zhai:
GANHead: Towards Generative Animatable Neural Head Avatars. 437-447 - Jonathan Tseng, Rodrigo Castellon, C. Karen Liu:
EDGE: Editable Dance Generation From Music. 448-458 - Aliaksandr Siarohin, Willi Menapace, Ivan Skorokhodov, Kyle Olszewski, Jian Ren, Hsin-Ying Lee, Menglei Chai, Sergey Tulyakov:
Unsupervised Volumetric Animation. 458-469 - Hugo Bertiche, Niloy J. Mitra, Kuldeep Kulkarni, Chun-Hao Paul Huang, Tuanfeng Y. Wang, Meysam Madadi, Sergio Escalera, Duygu Ceylan:
Blowing in the Wind: CycleNet for Human Cinemagraphs from Still Images. 459-468 - Hongwei Yi, Hualin Liang, Yifei Liu, Qiong Cao, Yandong Wen, Timo Bolkart, Dacheng Tao, Michael J. Black:
Generating Holistic 3D Human Motion from Speech. 469-480 - Yuming Du, Robin Kips, Albert Pumarola, Sebastian Starke, Ali K. Thabet, Artsiom Sanakoyeu:
Avatars Grow Legs: Generating Smooth Human Motion from Sparse Tracking Inputs with Diffusion Model. 481-490 - Fang Zhao, Zekun Li, Shaoli Huang, Junwu Weng, Tianfei Zhou, Guo-Sen Xie, Jue Wang, Ying Shan:
Learning Anchor Transformations for 3D Garment Animation. 491-500 - Hongwen Zhang, Siyou Lin, Ruizhi Shao, Yuxiang Zhang, Zerong Zheng, Han Huang, Yandong Guo, Yebin Liu:
CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition. 501-511 - Yuliang Xiu, Jinlong Yang, Xu Cao, Dimitrios Tzionas, Michael J. Black:
ECON: Explicit Clothed humans Optimized via Normal integration. 512-523 - Chung-Yi Weng, Pratul P. Srinivasan, Brian Curless, Ira Kemelmacher-Shlizerman:
PersonNeRF: Personalized Reconstruction from Photo Collections. 524-533 - Xiaoxuan Ma, Jiajun Su, Chunyu Wang, Wentao Zhu, Yizhou Wang:
3D Human Mesh Estimation from Virtual Markers. 534-543 - Ziwei Yu, Chen Li, Linlin Yang, Xiaoxu Zheng, Michael Bi Mi, Gim Hee Lee, Angela Yao:
Overcoming the TradeOff between Accuracy and Plausibility in 3D Hand Shape Reconstruction. 544-553 - Yeonguk Oh, JoonKyu Park, Jaeha Kim, Gyeongsik Moon, Kyoung Mu Lee:
Recovering 3D Hand Mesh Sequence from a Single Blurry Image: A New Dataset and Temporal Unfolding. 554-563 - Congyi Wang, Feida Zhu, Shilei Wen:
MeMaHand: Exploiting Mesh-Mano Interaction for Single Image Two-Hand Reconstruction. 564-573 - Karthik Shetty, Annette Birkhold, Srikrishna Jaganathan, Norbert Strobel, Markus Kowarschik, Andreas K. Maier, Bernhard Egger:
PLIKS: A Pseudo-Linear Inverse Kinematic Solver for 3D Human Body Estimation. 574-584 - Juntian Zheng, Qingyuan Zheng, Lixing Fang, Yun Liu, Li Yi:
CAMS: CAnonicalized Manipulation Spaces for Category-Level Functional Hand-Object Manipulation Synthesis. 585-594 - Yuheng Jiang, Kaixin Yao, Zhuo Su, Zhehao Shen, Haimin Luo, Lan Xu:
Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream. 595-605 - Bowen Wen, Jonathan Tremblay, Valts Blukis, Stephen Tyree, Thomas Müller, Alex Evans, Dieter Fox, Jan Kautz, Stan Birchfield:
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects. 606-617 - Xuan Ju, Ailing Zeng, Jianan Wang, Qiang Xu, Lei Zhang:
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes. 618-629 - Mohammed Suhail, Erika Lu, Zhengqi Li, Noah Snavely, Leonid Sigal, Forrester Cole:
Omnimatte3D: Associating Objects and Their Effects in Unconstrained Monocular Video. 630-639 - Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Christoph Feichtenhofer, Jitendra Malik:
On the Benefits of 3D Pose and Tracking for Human Action Recognition. 640-649 - Li'an Zhuo, Jian Cao, Qi Wang, Bang Zhang, Liefeng Bo:
Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization. 650-659 - Zigang Geng, Chunyu Wang, Yixuan Wei, Ze Liu, Houqiang Li, Han Hu:
Human Pose as Compositional Tokens. 660-671 - Qihao Liu, Adam Kortylewski, Alan L. Yuille:
PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation. 672-681 - Yudi Dai, Yitai Lin, Xiping Lin, Chenglu Wen, Lan Xu, Hongwei Yi, Siqi Shen, Yuexin Ma, Cheng Wang:
SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments. 682-692 - Linzhi Huang, Yulong Li, Hongbo Tian, Yue Yang, Xiangang Li, Weihong Deng, Jieping Ye:
Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module. 693-703 - Sohyun Lee, Jaesung Rim, Boseung Jeong, Geonu Kim, Byungju Woo, Haechan Lee, Sunghyun Cho, Suha Kwak:
Human Pose Estimation in Extremely Low-Light Conditions. 704-714 - Riqiang Gao, Bin Lou, Zhoubing Xu, Dorin Comaniciu, Ali Kamen:
Flexible-Cm GAN: Towards Precise 3D Dose Prediction in Radiotherapy. 715-725 - Antyanta Bangunharcana, Ahmed Magd, Kyung-Soo Kim:
DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium. 726-738 - Yijia He, Bo Xu, Zhanpeng Ouyang, Hongdong Li:
A Rotation-Translation-Decoupled Solution for Robust and Efficient Visual-Inertial Initialization. 739-748 - Linus Härenstam-Nielsen, Niclas Zeller, Daniel Cremers:
Semidefinite Relaxations for Robust Multiview Triangulation. 749-757 - Zheheng Jiang, Hossein Rahmani, Sue Black, Bryan M. Williams:
A Probabilistic Attention Model with Occlusion-aware Texture Regression for 3D Hand Reconstruction from a Single RGB Image. 758-768 - Timo Bolkart, Tianye Li, Michael J. Black:
Instant Multi-View Head Capture through Learnable Registration. 768-779 - HyunJun Jung, Patrick Ruhkamp, Guangyao Zhai, Nikolas Brasch, Yitong Li, Yannick Verdie, Jifei Song, Yiren Zhou, Anil Armagan, Slobodan Ilic, Ales Leonardis, Nassir Navab, Benjamin Busam:
On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks. 780-791 - Yinyu Nie, Angela Dai, Xiaoguang Han, Matthias Nießner:
Learning 3D Scene Priors with 2D Supervision. 792-802 - Tong Wu, Jiarui Zhang, Xiao Fu, Yuxin Wang, Jiawei Ren, Liang Pan, Wayne Wu, Lei Yang, Jiaqi Wang, Chen Qian, Dahua Lin, Ziwei Liu:
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation. 803-814 - Songyou Peng, Kyle Genova, Chiyu Max Jiang, Andrea Tagliasacchi, Marc Pollefeys, Thomas A. Funkhouser:
OpenScene: 3D Scene Understanding with Open Vocabularies. 815-824 - Xu Cao, Hiroaki Santo, Fumio Okura, Yasuyuki Matsushita:
Multi-View Azimuth Stereo via Tangent Space Consistency. 825-834 - Yi-Ting Shen, Hyungtae Lee, Heesung Kwon, Shuvra S. Bhattacharyya:
Progressive Transformation Learning for Leveraging Virtual Images in Training. 835-844 - Yuanwen Yue, Theodora Kontogianni, Konrad Schindler, Francis Engelmann:
Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries. 845-854 - Fabio Tosi, Alessio Tonioni, Daniele De Gregorio, Matteo Poggi:
NeRF-Supervised Deep Stereo. 855-866 - Fengyun Wang, Dong Zhang, Hanwang Zhang, Jinhui Tang, Qianru Sun:
Semantic Scene Completion with Cleaner Self. 867-877 - Haozheng Yu, Lu He, Bing Jian, Weiwei Feng, Shan Liu:
PanelNet: Understanding 360 Indoor Environment via Panel Representation. 878-887 - Avinash Paliwal, Andrii Tsarov, Nima Khademi Kalantari:
Implicit View-Time Interpolation of Stereo Videos Using Multi-Plane Disparities and Non-Uniform Coordinates. 888-898 - Wenjie Chang, Yueyi Zhang, Zhiwei Xiong:
Depth Estimation from Indoor Panoramas with Neural Scene Representation. 899-908 - Zehan Zheng, Danni Wu, Ruisi Lu, Fan Lu, Guang Chen, Changjun Jiang:
NeuralPCI: Spatio-Temporal Neural Field for 3D Point Cloud Multi-Frame Non-Linear Interpolation. 909-918 - Changjiang Cai, Pan Ji, Qingan Yan, Yi Xu:
RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo. 919-928 - Shitao Tang, Sicong Tang, Andrea Tagliasacchi, Ping Tan, Yasutaka Furukawa:
NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization. 929-939 - Antoine Guédon, Tom Monnier, Pascal Monasse, Vincent Lepetit:
MACARONS: Mapping and Coverage Anticipation with RGB Online Self-Supervision. 940-951 - Xin Kong, Shikun Liu, Marwan Taher, Andrew J. Davison:
vMAP: Vectorised Object Mapping for Neural Field SLAM. 952-961 - Yunzhi Zhang, Shangzhe Wu, Noah Snavely, Jiajun Wu:
Seeing a Rose in Five Thousand Ways. 962-971 - Yihao Wang, Zhigang Wang, Bin Zhao, Dong Wang, Mulin Chen, Xuelong Li:
Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking. 972-981 - Praneeth Chakravarthula, Jim Aldon D'Souza, Ethan Tseng, Joe Bartusek, Felix Heide:
Seeing With Sound: Long-Range Acoustic Beamforming for Multimodal Scene Understanding. 982-991 - Jia Zeng, Li Chen, Hanming Deng, Lewei Lu, Junchi Yan, Yu Qiao, Hongyang Li:
Distilling Focal Knowledge from Imperfect Expert for 3D Object Detection. 992-1001 - Zechuan Li, Hongshan Yu, Zhengeng Yang, Tom Tongjia Chen, Naveed Akhtar:
AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers. 1012-1021 - Yinpeng Dong, Caixin Kang, Jinlai Zhang, Zijian Zhu, Yikai Wang, Xiao Yang, Hang Su, Xingxing Wei, Jun Zhu:
Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving. 1022-1032 - Hang Xu, Xinyuan Liu, Qiang Zhao, Yike Ma, Chenggang Yan, Feng Dai:
Gaussian Label Distribution Learning for Spherical Image Object Detection. 1033-1042 - Ukcheol Shin, Jinsun Park, In So Kweon:
Deep Depth Estimation from Thermal Image. 1043-1053 - Chuanfu Shen, Fan Chao, Wei Wu, Rui Wang, George Q. Huang, Shiqi Yu:
LidarGait: Benchmarking 3D Gait Recognition with Point Clouds. 1054-1063 - Kunyu Wang, Xueyang Fu, Yukun Huang, Chengzhi Cao, Gege Shi, Zheng-Jun Zha:
Generalized UAV Object Detection via Frequency Domain Disentanglement. 1064-1073 - Yuwen Xiong, Wei-Chiu Ma, Jingkang Wang, Raquel Urtasun:
Learning Compact Representations for LiDAR Completion and Generation. 1074-1083 - Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang:
CXTrack: Improving 3D Point Cloud Tracking with Contextual Information. 1084-1093 - Wei Ji, Jingjing Li, Cheng Bian, Zongwei Zhou, Jiaying Zhao, Alan L. Yuille, Li Cheng:
Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline. 1094-1104 - Tao Lu, Xiang Ding, Haisong Liu, Gangshan Wu, Limin Wang:
LinK: Linear Kernel for LiDAR-based 3D Perception. 1105-1115 - Tarasha Khurana, Peiyun Hu, David Held, Deva Ramanan:
Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting. 1116-1124 - Ziyue Zhu, Qiang Meng, Xiao Wang, Ke Wang, Liujiang Yan, Jian Yang:
Curricular Object Manipulation in LiDAR-based Object Detection. 1125-1135 - Jiaming Zhang, Ruiping Liu, Hao Shi, Kailun Yang, Simon Reiß, Kunyu Peng, Haodong Fu, Kaiwei Wang, Rainer Stiefelhagen:
Delivering Arbitrary-Modal Semantic Segmentation. 1136-1147 - Haobo Jiang, Zheng Dang, Zhen Wei, Jin Xie, Jian Yang, Mathieu Salzmann:
Robust Outlier Rejection for 3D Registration with Variational Bayes. 1148-1157 - Zhenzhen Weng, Alexander S. Gorban, Jingwei Ji, Mahyar Najibi, Yin Zhou, Dragomir Anguelov:
3D Human Keypoints Estimation from Point Clouds in the Wild without Human Labels. 1158-1167 - Li Jiang, Zetong Yang, Shaoshuai Shi, Vladislav Golyanik, Dengxin Dai, Bernt Schiele:
Self-Supervised Pre-Training with Masked Shape Prediction for 3D Scene Understanding. 1168-1178 - Le Xue, Mingfei Gao, Chen Xing, Roberto Martín-Martín, Jiajun Wu, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese:
ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding. 1179-1189 - Yuheng Lu, Chenfeng Xu, Xiaobao Wei, Xiaodong Xie, Masayoshi Tomizuka, Kurt Keutzer, Shanghang Zhang:
Open-Vocabulary Point-Cloud Object Detection without 3D Annotation. 1190-1199 - Zhijian Liu, Xinyu Yang, Haotian Tang, Shang Yang, Song Han:
FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer. 1200-1211 - Zhiqiang Shen, Xiaoxiao Sheng, Longguang Wang, Yulan Guo, Qiong Liu, Xi Zhou:
PointCMP: Contrastive Mask Prediction for Self-supervised Learning on Point Cloud Videos. 1212-1222 - Minghan Zhu, Maani Ghaffari, William A. Clark, Huei Peng:
E2PN: Efficient SE(3)-Equivariant Point Network. 1223-1232 - Tao Xie, Shiguang Wang, Ke Wang, Linqi Yang, Zhiqiang Jiang, Xingcheng Zhang, Kun Dai, Ruifeng Li, Jian Cheng:
Poly-PC: A Polyhedral Network for Multiple Point Cloud Tasks at Once. 1233-1243 - Nan Zhang, Zhiyi Pan, Thomas H. Li, Wei Gao, Ge Li:
Improving Graph Representation for Point Cloud Segmentation via Attentive Filtering. 1244-1254 - Sheng Ao, Qingyong Hu, Hanyun Wang, Kai Xu, Yulan Guo:
BUFFER: Balancing Accuracy, Efficiency, and Generalizability in Point Cloud Registration. 1255-1264 - Bingnan Yang, Mi Zhang, Zhan Zhang, Zhili Zhang, Xiangyun Hu:
TopDiG: Class-agnostic Topological Directional Graph Extraction from Remote Sensing Images. 1265-1274 - Daniel Widdowson, Vitaliy Kurlin:
Recognizing Rigid Patterns of Unlabeled Point Clouds by Complete and Continuous Isometry Invariants with no False Negatives and no False Positives. 1275-1284 - Xu Zheng, Jinjing Zhu, Yexin Liu, Zidong Cao, Chong Fu, Lin Wang:
Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation. 1285-1295 - Harshil Bhatia, Edith Tretschk, Zorah Lähner, Marcel Seelbach Benkner, Michael Moeller, Christian Theobalt, Vladislav Golyanik:
CCuantuMM: Cycle-Consistent Quantum-Hybrid Matching of Multiple Shapes. 1296-1305 - Guilherme A. Potje, Felipe Cadar, André Araújo, Renato Martins, Erickson R. Nascimento:
Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints. 1306-1315 - Souhaib Attaiki, Maks Ovsjanikov:
Understanding and Improving Features Learned in Deep Functional Maps. 1316-1326 - Haoliang Zhao, Huizhou Zhou, Yongjun Zhang, Jie Chen, Yitong Yang, Yong Zhao:
High-Frequency Stereo Matching Network. 1327-1336 - Qiaole Dong, Chenjie Cao, Yanwei Fu:
Rethinking Optical Flow from Geometric Matching Consistent Perspective. 1337-1347 - Shun Fang, Zhengqin Xu, Shiqian Wu, Shoulie Xie:
Efficient Robust Principal Component Analysis via Block Krylov Iteration and CUR Decomposition. 1348-1357 - Bingchen Yang, Haiyong Jiang, Hao Pan, Jun Xiao:
VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation. 1358-1367 - Shaoheng Fang, Zi Wang, Yiqi Zhong, Junhao Ge, Siheng Chen:
TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving. 1368-1378 - Ben Agro, Quinlan Sykora, Sergio Casas, Raquel Urtasun:
Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving. 1379-1388 - Ze Yang, Yun Chen, Jingkang Wang, Sivabalan Manivasagam, Wei-Chiu Ma, Anqi Joyce Yang, Raquel Urtasun:
UniSim: A Neural Closed-Loop Sensor Simulator. 1389-1399 - Yuning Wang, Pu Zhang, Lei Bai, Jianru Xue:
FEND: A Future Enhanced Distribution-Aware Contrastive Learning Framework for Long-Tail Trajectory Prediction. 1400-1409 - Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Yu Guang Wang, Xinchao Wang, Yanfeng Wang:
EqMotion: Equivariant Multi-Agent Motion Prediction with Invariant Interaction Reasoning. 1410-1420 - Guoqiang Zhang, Kenta Niwa, W. Bastiaan Kleijn:
Lookahead Diffusion Probabilistic Models for Refining Mean Estimation. 1421-1429 - Ruihan Yang, Ge Yang, Xiaolong Wang:
Neural Volumetric Memory for Visual Locomotion Control. 1430-1440 - Sounak Mondal, Zhibo Yang, Seoyoung Ahn, Dimitris Samaras, Gregory J. Zelinsky, Minh Hoai:
Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention. 1441-1450 - Luca De Luigi, Ren Li, Benoît Guillard, Mathieu Salzmann, Pascal Fua:
DrapeNet: Garment Generation and Self-Supervised Draping. 1451-1460 - Mingzhen Huang, Xiaoxing Li, Jun Hu, Honghong Peng, Siwei Lyu:
Tracking Multiple Deformable Objects in Egocentric Videos. 1461-1471 - Zhengwei Yang, Meng Lin, Xian Zhong, Yu Wu, Zheng Wang:
Good is Bad: Causality Inspired Cloth-debiasing for Cloth-changing Person Re-identification. 1472-1481 - Xuan-Bac Nguyen, Chi Nhan Duong, Xin Li, Susan Gauch, Han-Seok Seo, Khoa Luu:
Micron-BERT: BERT-Based Facial Micro-Expression Recognition. 1482-1492 - Zhixi Cai, Shreya Ghosh, Kalin Stefanov, Abhinav Dhall, Jianfei Cai, Hamid Rezatofighi, Reza Haffari, Munawar Hayat:
MARLIN: Masked Autoencoder for facial video Representation LearnINg. 1493-1504 - Jiazhi Guan, Zhanwang Zhang, Hang Zhou, Tianshu Hu, Kaisiyuan Wang, Dongliang He, Haocheng Feng, Jingtuo Liu, Errui Ding, Ziwei Liu, Jingdong Wang:
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator. 1505-1515 - Samuel Clarke, Ruohan Gao, Mason L. Wang, Mark Rau, Julia Xu, Jui-Hsien Wang, Doug L. James, Jiajun Wu:
REALIMPACT: A Dataset of Impact Sound Fields for Real Objects. 1516-1525 - Xiaoyu Zhu, Po-Yao Huang, Junwei Liang, Celso M. de Melo, Alexander G. Hauptmann:
STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition. 1526-1536 - Xueyan Huang, Yueyi Zhang, Zhiwei Xiong:
Progressive Spatio-temporal Alignment for Efficient Event-based Motion Estimation. 1537-1546 - Manasi Muglikar, Leonard Bauersfeld, Diederik Paul Moeys, Davide Scaramuzza:
Event-Based Shape from Polarization. 1547-1556 - Yunfan Lu, Zipeng Wang, Minjie Liu, Hongjian Wang, Lin Wang:
Learning Spatial-Temporal Implicit Neural Representations for Event-Guided Video Super-Resolution. 1557-1567 - Junheum Park, Jintae Kim, Chang-Su Kim:
BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation. 1568-1577 - Xin Jin, Longhai Wu, Jie Chen, Youxin Chen, Jayoon Koo, Cheul-Hee Hahm:
A Unified Pyramid Recurrent Network for Video Frame Interpolation. 1578-1587 - Wenming Weng, Yueyi Zhang, Zhiwei Xiong:
Event-based Blurry Frame Interpolation under Blind Exposure. 1588-1598 - Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation. 1599-1610 - Ce Zheng, Xianpeng Liu, Guo-Jun Qi, Chen Chen:
POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery. 1611-1620 - Yuesong Wang, Zhaojie Zeng, Tao Guan, Wei Yang, Zhuo Chen, Wenkai Liu, Luoyuan Xu, Yawei Luo:
Adaptive Patch Deformation for Textureless-Resilient Multi-View Stereo. 1621-1630 - Zhenjie Yu, Shuang Li, Yirui Shen, Chi Harold Liu, Shuigen Wang:
On the Difficulty of Unpaired Infrared-to-Visible Video Translation: Fine-Grained Content-Rich Patches Transfer. 1631-1640 - Aniket Dashpute, Vishwanath Saragadam, Emma Alexander, Florian Willomitzer, Aggelos K. Katsaggelos, Ashok Veeraraghavan, Oliver Cossairt:
Thermal Spread Functions (TSF): Physics-Guided Material Classification. 1641-1650 - Xuhai Chen, Jiangning Zhang, Chao Xu, Yabiao Wang, Chengjie Wang, Yong Liu:
Better "CMOS" Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution. 1651-1661 - Yuhui Wu, Chen Pan, Guoqing Wang, Yang Yang, Jiwei Wei, Chongyi Li, Heng Tao Shen:
Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement. 1662-1671 - Zeyu Xiao, Yutong Liu, Ruisheng Gao, Zhiwei Xiong:
CutMIB: Boosting Light Field Super-Resolution via Multi-View Image Blending. 1672-1682 - Zixuan Fu, Lanqing Guo, Bihan Wen:
sRGB Real Noise Synthesizing with Neighboring Correlation-Aware Noise Model. 1683-1691 - Haoyu Chen, Jinjin Gu, Yihao Liu, Salma Abdel Magid, Chao Dong, Qiong Wang, Hanspeter Pfister, Lei Zhu:
Masked Image Training for Generalizable Deep Image Denoising. 1692-1703 - Zhixin Wang, Ziying Zhang, Xiaoyun Zhang, Huangjie Zheng, Mingyuan Zhou, Ya Zhang, Yanfeng Wang:
DR2: Diffusion-Based Robust Degradation Remover for Blind Face Restoration. 1704-1713 - Xin Li, Bingchen Li, Xin Jin, Cuiling Lan, Zhibo Chen:
Learning Distortion Invariant Representation for Image Restoration from a Causality Perspective. 1714-1724 - Seung Ho Park, Young-Su Moon, Nam Ik Cho:
Perception-Oriented Single Image Super-Resolution using Optimal Objective Estimation. 1725-1735 - Xinmiao Lin, Yikang Li, Jenhao Hsiao, Chiuman Ho, Yu Kong:
Catch Missing Details: Image Reconstruction with Frequency Augmented Variational Autoencoder. 1736-1745 - Zicheng Zhang, Wei Wu, Wei Sun, Danyang Tu, Wei Lu, Xiongkuo Min, Ying Chen, Guangtao Zhai:
MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos. 1746-1755 - Senmao Tian, Ming Lu, Jiaming Liu, Yandong Guo, Yurong Chen, Shunli Zhang:
CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input. 1756-1765 - Ann-Christin Woerl, Jan Disselhoff, Michael Wand:
Initialization Noise in Image Gradients and Saliency Maps. 1766-1775 - Jie-En Yao, Li-Yuan Tsao, Yi-Chen Lo, Roy Tseng, Chia-Che Chang, Chun-Yi Lee:
Local Implicit Normalizing Flow for Arbitrary-Scale Image Super-Resolution. 1776-1785 - Xiaohang Wang, Xuanhong Chen, Bingbing Ni, Hang Wang, Zhengyan Tong, Yutian Liu:
Deep Arbitrary-Scale Image Super-Resolution via Scale-Equivariance Pursuit. 1786-1795 - Jiezhang Cao, Qin Wang, Yongqin Xian, Yawei Li, Bingbing Ni, Zhiming Pi, Kai Zhang, Yulun Zhang, Radu Timofte, Luc Van Gool:
CiaoSR: Continuous Implicit Attention-in-Attention Network for Arbitrary-Scale Image Super-Resolution. 1796-1807 - Yishun Dou, Zhong Zheng, Qiaoqiao Jin, Bingbing Ni:
Multiplicative Fourier Level of Detail. 1808-1817 - Ling Zhang, Yinghao He, Qing Zhang, Zheng Liu, Xiaolong Zhang, Chunxia Xiao:
Document Image Shadow Removal Guided by Color-Aware Background. 1818-1827 - Hamza Pehlivan, Yusuf Dalva, Aysegul Dundar:
StyleRes: Transforming the Residuals for Real Image Editing with StyleGAN. 1828-1837 - Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen:
TopNet: Transformer-Based Object Placement Network for Image Compositing. 1838-1847 - Zeqing Xia, Bojun Xiong, Zhouhui Lian:
VecFontSDF: Learning to Reconstruct and Synthesize High-Quality Vector Fonts via Signed Distance Functions. 1848-1857 - Chi Wang, Min Zhou, Tiezheng Ge, Yuning Jiang, Hujun Bao, Weiwei Xu:
CF-Font: Content Fusion for Few-Shot Font Generation. 1858-1867 - Wuyang Luo, Su Yang, Xinjian Zhang, Weishan Zhang:
SIEDOB: Semantic Image Editing by Disentangling Object and Background. 1868-1878 - Dina Bashkirova, José Lezama, Kihyuk Sohn, Kate Saenko, Irfan Essa:
MaskSketch: Unpaired Structure-guided Masked Image Generation. 1879-1889 - Inwoo Hwang, Hyeonwoo Kim, Young Min Kim:
Text2Scene: Text-driven Indoor Scene Stylization with Part-Aware Details. 1890-1899 - Qiucheng Wu, Yujian Liu, Handong Zhao, Ajinkya Kale, Trung Bui, Tong Yu, Zhe Lin, Yang Zhang, Shiyu Chang:
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models. 1900-1910 - Ajay Jain, Amber Xie, Pieter Abbeel:
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models. 1911-1920 - Narek Tumanyan, Michal Geyer, Shai Bagon, Tali Dekel:
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation. 1921-1930 - Nupur Kumari, Bingliang Zhang, Richard Zhang, Eli Shechtman, Jun-Yan Zhu:
Multi-Concept Customization of Text-to-Image Diffusion. 1931-1941 - Mude Hui, Zhizheng Zhang, Xiaoyi Zhang, Wenxuan Xie, Yuwang Wang, Yan Lu:
Unifying Layout Generation with a Decoupled Diffusion Model. 1942-1951 - Bo Li, Kaitao Xue, Bin Liu, Yu-Kun Lai:
BBDM: Image-to-Image Translation with Brownian Bridge Diffusion Models. 1952-1961 - Hyojun Go, Yunsung Lee, Jin Young Kim, Seunghyun Lee, Myeongho Jeong, Hyun Seung Lee, Seungtaek Choi:
Towards Practical Plug-and-Play Diffusion Models. 1962-1971 - Yuzhang Shang, Zhihang Yuan, Bin Xie, Bingzhe Wu, Yan Yan:
Post-Training Quantization on Diffusion Models. 1972-1981 - Shuai Shen, Wenliang Zhao, Zibin Meng, Wanhua Li, Zheng Zhu, Jie Zhou, Jiwen Lu:
DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation. 1982-1991 - Kwanyong Park, Sanghyun Woo, Seoung Wug Oh, In So Kweon, Joon-Young Lee:
Mask-Guided Matting in the Wild. 1992-2001 - Mengqi Huang, Zhendong Mao, Quan Wang, Yongdong Zhang:
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation. 2002-2011 - Yingwei Wang, Takashi Isobe, Xu Jia, Xin Tao, Huchuan Lu, Yu-Wing Tai:
Compression-Aware Video Super-Resolution. 2012-2021 - Nilesh A. Ahuja, Parual Datta, Bhavya Kanzariya, V. Srinivasa Somayazulu, Omesh Tickoo:
Neural Rate Estimator and Unsupervised Learning for Efficient Distributed Image Analytics in Split-DNN models. 2022-2030 - Qi Zhao, M. Salman Asif, Zhan Ma:
DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos. 2031-2040 - Rajhans Singh, Ankita Shukla, Pavan K. Turaga:
Polynomial Implicit Neural Representations For Large Diverse Datasets. 2041-2051 - Yutaro Shigeto, Masashi Shimbo, Yuya Yoshikawa, Akikazu Takeuchi:
Learning Decorrelated Representations Efficiently Using Fast Fourier Transform. 2052-2060 - Xuanyao Chen, Zhijian Liu, Haotian Tang, Li Yi, Hang Zhao, Song Han:
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer. 2061-2070 - Haram Choi, Jeongmin Lee, Jihoon Yang:
N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution. 2071-2081 - Xuran Pan, Tianzhu Ye, Zhuofan Xia, Shiji Song, Gao Huang:
Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention. 2082-2091 - Siyuan Wei, Tianzhu Ye, Shen Zhang, Yao Tang, Jiajun Liang:
Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers. 2092-2101 - Baifeng Shi, Trevor Darrell, Xin Wang:
Top-Down Visual Attention from Analysis by Synthesis. 2102-2112 - Markus Frey, Christian F. Doeller, Caswell Barry:
Probing Neural Representations of Scene Perception in a Hippocampally Dependent Task Using Artificial Neural Networks. 2113-2121 - Haoqing Wang, Yehui Tang, Yunhe Wang, Jianyuan Guo, Zhi-Hong Deng, Kai Han:
Masked Image Modeling with Local Multi-Scale Reconstruction. 2122-2131 - Chenxin Tao, Xizhou Zhu, Weijie Su, Gao Huang, Bin Li, Jie Zhou, Yu Qiao, Xiaogang Wang, Jifeng Dai:
Siamese Image Modeling for Self-Supervised Vision Representation Learning. 2132-2141 - Tianhong Li, Huiwen Chang, Shlok Kumar Mishra, Han Zhang, Dina Katabi, Dilip Krishnan:
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis. 2142-2152 - Yukang Zhang, Hanzi Wang:
Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-identification. 2153-2162 - Suhang Ye, Yingyi Zhang, Jie Hu, Liujuan Cao, Shengchuan Zhang, Lei Shen, Jun Wang, Shouhong Ding, Rongrong Ji:
DistilPose: Tokenized Pose Regression with Heatmap Distillation. 2163-2172 - Hao Tang, Zhenyu Zhang, Humphrey Shi, Bo Li, Ling Shao, Nicu Sebe, Radu Timofte, Luc Van Gool:
Graph Transformer GANs for Graph-Constrained House Generation. 2173-2182 - Mang Tik Chiu, Xuaner Zhang, Zijun Wei, Yuqian Zhou, Eli Shechtman, Connelly Barnes, Zhe Lin, Florian Kainz, Sohrab Amirghodsi, Humphrey Shi:
Automatic High Resolution Wire Segmentation and Removal. 2183-2192 - Adnan Firoze, Cameron Wingren, Raymond A. Yeh, Bedrich Benes, Daniel G. Aliaga:
Tree Instance Segmentation with Temporal Contour Graph. 2193-2202 - Jungin Park, Jiyoung Lee, Kwanghoon Sohn:
Dual-Path Adaptation from Image to Video Transformers. 2203-2213 - A. J. Piergiovanni, Weicheng Kuo, Anelia Angelova:
Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning. 2214-2224 - Heng Zhang, Daqing Liu, Qi Zheng, Bing Su:
Modeling Video as Stochastic Processes for Fine-Grained Video Representation Learning. 2225-2234 - Xinyu Sun, Peihao Chen, Liangwei Chen, Changhao Li, Thomas H. Li, Mingkui Tan, Chuang Gan:
Masked Motion Encoding for Self-Supervised Video Representation Learning. 2235-2245 - Yurong Zhang, Liulei Li, Wenguan Wang, Rong Xie, Li Song, Wenjun Zhang:
Boosting Video Object Segmentation via Space-Time Correspondence Learning. 2246-2256 - Kun Yan, Xiao Li, Fangyun Wei, Jinglu Wang, Chenbin Zhang, Ping Wang, Yan Lu:
Two-shot Video Object Segmentation. 2257-2267 - Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang:
Look Before You Match: Instance Understanding Matters in Video Object Segmentation. 2268-2278 - Rui Li, Dong Liu:
Spatial-then-Temporal Self-Supervised Learning for Video Correspondence. 2279-2288 - Yogesh Kumar, Anand Mishra:
Few-Shot Referring Relationships in Videos. 2289-2298 - Yan-Bo Lin, Yi-Lin Sung, Jie Lei, Mohit Bansal, Gedas Bertasius:
Vision Transformers are Parameter-Efficient Audio-Visual Learners. 2299-2309 - Zihui Xue, Yale Song, Kristen Grauman, Lorenzo Torresani:
Egocentric Video Task Translation. 2310-2320 - Sicheng Yang, Zhiyong Wu, Minglei Li, Zhensong Zhang, Lei Hao, Weihong Bao, Haolin Zhuang:
QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation. 2321-2330 - Mingyang Sun, Mengchen Zhao, Yaqing Hou, Minglei Li, Huang Xu, Songcen Xu, Jianye Hao:
Co-speech Gesture Synthesis by Reinforcement Learning with Contrastive Pretrained Rewards. 2331-2340 - Ishan Rajendrakumar Dave, Mamshad Nayeem Rizve, Chen Chen, Mubarak Shah:
TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition. 2341-2352 - Xingyi Zhou, Anurag Arnab, Chen Sun, Cordelia Schmid:
How can objects help action recognition? 2353-2362 - Lilang Lin, Jiahang Zhang, Jiaying Liu:
Actionlet-Dependent Contrastive Learning for Unsupervised Skeleton-Based Action Recognition. 2363-2372 - Pilhyeon Lee, Taeoh Kim, Minho Shim, Dongyoon Wee, Hyeran Byun:
Decomposed Cross-Modal Distillation for RGB-based Temporal Action Detection. 2373-2383 - Beatrice van Amsterdam, Abdolrahim Kadkhodamohammadi, Imanol Luengo, Danail Stoyanov:
ASPnet: Action Segmentation with Shared-Private Representation of Multiple Data Sources. 2384-2393 - Huan Ren, Wenfei Yang, Tianzhu Zhang, Yongdong Zhang:
Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization. 2394-2404 - Shiyi Zhang, Wenxun Dai, Sujia Wang, Xiangwei Shen, Jiwen Lu, Jie Zhou, Yansong Tang:
LOGO: A Long-Form Video Dataset for Group Action Quality Assessment. 2405-2414 - Toby Perrett, Saptarshi Sinha, Tilo Burghardt, Majid Mirmehdi, Dima Damen:
Use Your Head: Improving Long-Tail Video Recognition. 2415-2425 - Yuexi Du, Ziyang Chen, Justin Salamon, Bryan C. Russell, Andrew Owens:
Conditional Generation of Audio from Video via Foley Analogies. 2426-2436 - Sixun Dong, Huazhang Hu, Dongze Lian, Weixin Luo, Yicheng Qian, Shenghua Gao:
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos. 2437-2447 - Xiang Fang, Daizong Liu, Pan Zhou, Guoshun Nan:
You Can Ground Earlier than See: An Effective and Efficient Pipeline for Temporal Sentence Grounding in Compressed Videos. 2448-2460 - Paul Voigtlaender, Soravit Changpinyo, Jordi Pont-Tuset, Radu Soricut, Vittorio Ferrari:
Connecting Vision and Language with Video Localized Narratives. 2461-2471 - Peng Jin, Jinfa Huang, Pengfei Xiong, Shangxuan Tian, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen:
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning. 2472-2482 - Jiahao Zhang, Anoop Cherian, Yanbin Liu, Yizhak Ben-Shabat, Cristian Rodriguez Opazo, Stephen Gould:
Aligning Step-by-Step Instructional Diagrams to Video Demonstrations. 2483-2492 - Tanzila Rahman, Hsin-Ying Lee, Jian Ren, Sergey Tulyakov, Shweta Mahajan, Leonid Sigal:
Make-A-Story: Visual Memory Conditioned Consistent Story Generation. 2493-2502 - Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek:
Test of Time: Instilling Video-Language Models with a Sense of Time. 2503-2516 - Dhruv Srivastava, Aditya Kumar Singh, Makarand Tapaswi:
How You Feelin'? Learning Emotions and Mental States in Movie Scenes. 2517-2528 - Lianyu Hu, Liqing Gao, Zekang Liu, Wei Feng:
Continuous Sign Language Recognition with Correlation Network. 2529-2539 - Changsong Wen, Guoli Jia, Jufeng Yang:
DIP: Dual Incongruity Perceiving Network for Sarcasm Detection. 2540-2550 - Aoxiong Yin, Tianyun Zhong, Li Tang, Weike Jin, Tao Jin, Zhou Zhao:
Gloss Attention for Gloss-free Sign Language Translation. 2551-2562 - Heming Du, Lincheng Li, Zi Huang, Xin Yu:
Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation States. 2563-2573 - Zijiao Yang, Arjun Majumdar, Stefan Lee:
Behavioral Analysis of Vision-and-Language Navigation Agents. 2574-2582 - Xiangyang Li, Zihan Wang, Jiahao Yang, Yaowei Wang, Shuqiang Jiang:
KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation. 2583-2592 - Mengmeng Xu, Yanghao Li, Cheng-Yang Fu, Bernard Ghanem, Tao Xiang, Juan-Manuel Pérez-Rúa:
Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization. 2593-2603 - Yaowei Li, Ruijie Quan, Linchao Zhu, Yi Yang:
Efficient Multimodal Fusion via Interactive Prompting. 2604-2613 - Joy Hsu, Jiayuan Mao, Jiajun Wu:
NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations. 2614-2623 - Burak Uzkent, Amanmeet Garg, Wentao Zhu, Keval Doshi, Jingru Yi, Xiaolong Wang, Mohamed Omar:
Dynamic Inference with Grounding Based Vision and Language Models. 2624-2633 - Shuquan Ye, Yujia Xie, Dongdong Chen, Yichong Xu, Lu Yuan, Chenguang Zhu, Jing Liao:
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles. 2634-2645 - Wei Suo, Mengyang Sun, Weisong Liu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu:
S3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning. 2646-2656 - Sivan Doveh, Assaf Arbelle, Sivan Harary, Eli Schwartz, Roei Herzig, Raja Giryes, Rogério Feris, Rameswar Panda, Shimon Ullman, Leonid Karlinsky:
Teaching Structured Vision & Language Concepts to Vision & Language Models. 2657-2668 - Xiao Han, Xiatian Zhu, Licheng Yu, Li Zhang, Yi-Zhe Song, Tao Xiang:
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks. 2669-2680 - Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai:
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks. 2691-2700 - Shi Chen, Nachiappan Valliappan, Shaolei Shen, Xinyu Ye, Kai Kohlhoff, Junfeng He:
Learning from Unique Perspectives: User-aware Saliency Modeling. 2701-2710 - Thomas Fel, Agustin Martin Picard, Louis Béthune, Thibaut Boissin, David Vigouroux, Julien Colin, Rémi Cadène, Thomas Serre:
CRAFT: Concept Recursive Activation FacTorization for Explainability. 2711-2721 - Chengzhi Mao, Revant Teotia, Amrutha Sundar, Sachit Menon, Junfeng Yang, Xin Wang, Carl Vondrick:
Doubly Right Object Recognition: A Why Prompt for Visual Rationales. 2722-2732 - Ayan Kumar Bhunia, Subhadeep Koley, Amandeep Kumar, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song:
Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings. 2733-2743 - Meike Nauta, Jörg Schlötterer, Maurice van Keulen, Christin Seifert:
PIP-Net: Patch-Based Intuitive Prototypes for Interpretable Image Classification. 2744-2753 - Aneeshan Sain, Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Subhadeep Koley, Tao Xiang, Yi-Zhe Song:
CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not. 2765-2775 - Yixuan Wei, Yue Cao, Zheng Zhang, Houwen Peng, Zhuliang Yao, Zhenda Xie, Han Hu, Baining Guo:
iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-training for Visual Recognition. 2776-2786 - Ding Jiang, Mang Ye:
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval. 2787-2797 - Jaeyoo Park, Bohyung Han:
Multi-Modal Representation Learning with Text-Driven Soft Masks. 2798-2807 - Zixian Guo, Bowen Dong, Zhilong Ji, Jinfeng Bai, Yiwen Guo, Wangmeng Zuo:
Texts as Images in Prompt Tuning for Multi-Label Image Recognition. 2808-2817 - Mehdi Cherti, Romain Beaumont, Ross Wightman, Mitchell Wortsman, Gabriel Ilharco, Cade Gordon, Christoph Schuhmann, Ludwig Schmidt, Jenia Jitsev:
Reproducible Scaling Laws for Contrastive Language-Image Learning. 2818-2829 - Zheng Wang, Zhenwei Gao, Kangshuai Guo, Yang Yang, Xiaoming Wang, Heng Tao Shen:
Multilateral Semantic Relations Modeling for Image Text Retrieval. 2830-2839 - Rita Ramos, Bruno Martins, Desmond Elliott, Yova Kementchedjhieva:
Smallcap: Lightweight Image Captioning Prompted with Retrieval Augmentation. 2840-2849 - Tinglei Feng, Jiaxuan Liu, Jufeng Yang:
Probing Sentiment-Oriented PreTraining Inspired by Human Sentiment Perception Mechanism. 2850-2860 - Kuniaki Saito, Kihyuk Sohn, Xiang Zhang, Chun-Liang Li, Chen-Yu Lee, Kate Saenko, Tomas Pfister:
Prefix Conditioning Unifies Language and Label Supervision. 2861-2870 - Yuchen Ren, Zhendong Mao, Shancheng Fang, Yan Lu, Tong He, Hao Du, Yongdong Zhang, Wanli Ouyang:
Crossing the Gap: Domain Generalization for Image Captioning. 2871-2880 - Weijie Tu, Weijian Deng, Tom Gedeon, Liang Zheng:
A Bag-of-Prototypes Representation for Dataset-Level Applications. 2881-2892 - Dingkang Liang, Jiahao Xie, Zhikang Zou, Xiaoqing Ye, Wei Xu, Xiang Bai:
CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model. 2893-2903 - Jianfeng He, Yuan Gao, Tianzhu Zhang, Zhe Zhang, Feng Wu:
D2Former: Jointly Learning Hierarchical Detectors and Contextual Descriptors via Agent-Based Transformers. 2904-2914 - Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang Wen Chen:
Learning to Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained Visual-Semantic Space. 2915-2924 - Sanghyun Kim, Deunsol Jung, Minsu Cho:
Relational Context Learning for Human-Object Interaction Detection. 2925-2934 - Jilan Xu, Junlin Hou, Yuejie Zhang, Rui Feng, Yi Wang, Yu Qiao, Weidi Xie:
Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision. 2935-2944 - Mengde Xu, Zheng Zhang, Fangyun Wei, Han Hu, Xiang Bai:
Side Adapter Network for Open-Vocabulary Semantic Segmentation. 2945-2954 - Jiarui Xu, Sifei Liu, Arash Vahdat, Wonmin Byeon, Xiaolong Wang, Shalini De Mello:
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models. 2955-2966 - Sukmin Yun, Seong Hyeon Park, Paul Hongsuck Seo, Jinwoo Shin:
IFSeg: Image-free Semantic Segmentation via Vision-Language Model. 2967-2977 - Haoran Geng, Ziming Li, Yiran Geng, Jiayi Chen, Hao Dong, He Wang:
PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud Observations. 2978-2988 - Jitesh Jain, Jiachen Li, MangTik Chiu, Ali Hassani, Nikita Orlov, Humphrey Shi:
OneFormer: One Transformer to Rule Universal Image Segmentation. 2989-2998 - Xinyu Liu, Beiwen Tian, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao, Guyue Zhou:
Delving into Shape-aware Zero-shot Semantic Segmentation. 2999-3009 - Fabio Cermelli, Matthieu Cord, Arthur Douillard:
CoMFormer: Continual Learning in Semantic and Panoptic Segmentation. 3010-3020 - Mengxue Qu, Yu Wu, Yunchao Wei, Wu Liu, Xiaodan Liang, Yao Zhao:
Learning to Segment Every Referring Object Point by Point. 3021-3030 - Zhizheng Liu, Francesco Milano, Jonas Frey, Roland Siegwart, Hermann Blum, Cesar Cadena:
Unsupervised Continual Semantic Adaptation Through Neural Rendering. 3031-3040 - Feng Li, Hao Zhang, Huaizhe Xu, Shilong Liu, Lei Zhang, Lionel M. Ni, Heung-Yeung Shum:
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation. 3041-3050 - Hengcan Shi, Munawar Hayat, Jianfei Cai:
Transformer Scale Gate for Semantic Segmentation. 3051-3060 - Wei Huang, Chang Chen, Yong Li, Jiacheng Li, Cheng Li, Fenglong Song, Youliang Yan, Zhiwei Xiong:
Style Projected Clustering for Domain Generalized Semantic Segmentation. 3061-3071 - Shiqi Huang, Tingfa Xu, Ning Shen, Feng Mu, Jianan Li:
Rethinking Few-Shot Medical Segmentation: A Vector Quantization View. 3072-3081 - Lanyun Zhu, Tianrun Chen, Jianxiong Yin, Simon See, Jun Liu:
Continual Semantic Segmentation with Automatic Memory Sample Selection. 3082-3092 - Lixiang Ru, Heliang Zheng, Yibing Zhan, Bo Du:
Token Contrast for Weakly-Supervised Semantic Segmentation. 3093-3102 - Rixin Zhou, Jiafu Wei, Qian Zhang, Ruihua Qi, Xi Yang, Chuntao Li:
Multi-Granularity Archaeological Dating of Chinese Bronze Dings Based on a Knowledge-Guided Relation Graph. 3103-3113 - Xiaoyang Wang, Bingfeng Zhang, Limin Yu, Jimin Xiao:
Hunting Sparsity: Density-Guided Contrastive Learning for Semi-Supervised Semantic Segmentation. 3114-3123 - Xudong Wang, Rohit Girdhar, Stella X. Yu, Ishan Misra:
Cut and Learn for Unsupervised Object Detection and Instance Segmentation. 3124-3134 - Zhaozheng Chen, Qianru Sun:
Extracting Class Activation Maps from Non-Discriminative Features as well. 3135-3144 - Tianheng Cheng, Xinggang Wang, Shaoyu Chen, Qian Zhang, Wenyu Liu:
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation. 3145-3154 - Xiao Guo, Xiaohong Liu, Zhiyuan Ren, Steven Grosz, Iacopo Masi, Xiaoming Liu:
Hierarchical Fine-Grained Image Forgery Detection and Localization. 3155-3165 - Pei Wang, Nuno Vasconcelos:
Towards Professional Level Crowd Annotation of Expert Domain Data. 3166-3175 - Oriane Siméoni, Chloé Sekkat, Gilles Puy, Antonín Vobecký, Éloi Zablocki, Patrick Pérez:
Unsupervised Object Localization: Observing the Background to Discover Objects. 3176-3186 - Enrico Fini, Pietro Astolfi, Karteek Alahari, Xavier Alameda-Pineda, Julien Mairal, Moin Nabi, Elisa Ricci:
Semi-supervised learning made simple with self-supervised clustering. 3187-3197 - Henri De Plaen, Pierre-François De Plaen, Johan A. K. Suykens, Marc Proesmans, Tinne Tuytelaars, Luc Van Gool:
Unbalanced Optimal Transport: A Unified Framework for Object Detection. 3198-3207 - Jiawei Ma, Yulei Niu, Jincheng Xu, Shiyuan Huang, Guangxing Han, Shih-Fu Chang:
DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection. 3208-3218 - Vidit Vidit, Martin Engilberge, Mathieu Salzmann:
CLIP the Gap: A Single Domain Generalization Approach for Object Detection. 3219-3229 - Wenteng Liang, Feng Xue, Yihao Liu, Guofeng Zhong, Anlong Ming:
Unknown Sniffer for Object Detection: Don't Turn a Blind Eye to Unknown Objects. 3230-3239 - Xinjiang Wang, Xingyi Yang, Shilong Zhang, Yijiang Li, Litong Feng, Shijie Fang, Chengqi Lyu, Kai Chen, Wayne Zhang:
Consistent-Teacher: Towards Reducing Inconsistent Pseudo-Targets in Semi-Supervised Object Detection. 3240-3249 - Xiaolin Song, Binghui Chen, Pengyu Li, Jun-Yan He, Biao Wang, Yifeng Geng, Xuansong Xie, Honggang Zhang:
Optimal Proposal Learning for Deployable End-to-End Pedestrian Detection. 3250-3260 - Yipeng Gao, Kun-Yu Lin, Junkai Yan, Yaowei Wang, Wei-Shi Zheng:
AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object Detection. 3261-3271 - Chenxi Zheng, Bangzhen Liu, Huaidong Zhang, Xuemiao Xu, Shengfeng He:
Where is My Spot? Few-shot Image Generation via Latent Subspace Optimization. 3272-3281 - Fan Lu, Kai Zhu, Wei Zhai, Kecheng Zheng, Yang Cao:
Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection. 3282-3291 - Ronald Xie, Kuan Pang, Gary D. Bader, Bo Wang:
MAESTER: Masked Autoencoder Guided Segmentation at Pixel Resolution for Accurate, Self-Supervised Subcellular Structure Recognition. 3292-3301 - Heng Cai, Shumeng Li, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao:
Orthogonal Annotation Benefits Barely-supervised Medical Image Segmentation. 3302-3311 - Donghao Zhou, Chunbin Gu, Junde Xu, Furui Liu, Qiong Wang, Guangyong Chen, Pheng-Ann Heng:
RepMode: Learning to Re-Parameterize Diverse Experts for Subcellular Structure Prediction. 3312-3322 - Shahira Abousamra, Rajarsi Gupta, Tahsin M. Kurç, Dimitris Samaras, Joel H. Saltz, Chao Chen:
Topology-Guided Multi-Class Cell Context Generation for Digital Pathology. 3323-3333 - Mingjie Li, Bingqian Lin, Zicong Chen, Haokun Lin, Xiaodan Liang, Xiaojun Chang:
Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report Generation. 3334-3343 - Mingu Kang, Heon Song, Seonwook Park, Donggeun Yoo, Sérgio Pereira:
Benchmarking Self-Supervised Learning on Diverse Pathology Datasets. 3344-3354 - Kangning Liu, Weicheng Zhu, Yiqiu Shen, Sheng Liu, Narges Razavian, Krzysztof J. Geras, Carlos Fernandez-Granda:
Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive Learning. 3355-3365 - Rajshekhar Das, Yonatan Dukler, Avinash Ravichandran, Ashwin Swaminathan:
Learning Expressive Prompting With Residuals for Vision Transformers. 3366-3377 - Bartlomiej Olber, Krystian Radlak, Adam Popowicz, Michal Szczepankiewicz, Krystian Chachula:
Detection of Out-of-Distribution Samples Using Binary Neuron Activation Patterns. 3378-3387 - Zihan Zhang, Xiang Xiang:
Decoupling MaxLogit for Out-of-Distribution Detection. 3388-3397 - Zixuan Ding, Ao Wang, Hui Chen, Qiang Zhang, Pengzhang Liu, Yongjun Bao, Weipeng Yan, Jungong Han:
Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels. 3398-3407 - Youngwook Kim, Jae-Myung Kim, Jieun Jeong, Cordelia Schmid, Zeynep Akata, Jungwoo Lee:
Bridging the Gap Between Model Explanations in Partially Annotated Multi-Label Classification. 3408-3417 - Ioannis Maniadis Metaxas, Georgios Tzimiropoulos, Ioannis Patras:
DivClust: Controlling Diversity in Deep Clustering. 3418-3428 - Furen Zhuang, Pierre Moulin:
Deep Semi-Supervised Metric Learning with Mixed Label Propagation. 3429-3438 - Maria Sofia Bucarelli, Lucas Cassano, Federico Siciliano, Amin Mantrach, Fabrizio Silvestri:
Leveraging Inter-Rater Agreement for Classification in the Presence of Noisy Labels. 3439-3448 - Wenbin Li, Zhichen Fan, Jing Huo, Yang Gao:
Modeling Inter-Class and Intra-Class Constraints in Novel Class Discovery. 3449-3458 - Muli Yang, Liancheng Wang, Cheng Deng, Hanwang Zhang:
Bootstrap Your Own Prior: Towards Distribution-Agnostic Novel Class Discovery. 3459-3468 - Tong Wei, Kai Gan:
Towards Realistic Long-Tailed Semi-Supervised Learning: Consistency is All You Need. 3469-3478 - Sheng Zhang, Salman H. Khan, Zhiqiang Shen, Muzammal Naseer, Guangyi Chen, Fahad Shahbaz Khan:
PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery. 3479-3488 - Jianqing Xu, Shen Li, Ailin Deng, Miao Xiong, Jiaying Wu, Jiaxiang Wu, Shouhong Ding, Bryan Hooi:
Probabilistic Knowledge Distillation of Face Ensembles. 3489-3498 - Zhipeng Zhou, Lanqing Li, Peilin Zhao, Pheng-Ann Heng, Wei Gong:
Class-Conditional Sharpness-Aware Minimization for Deep Long-Tailed Recognition. 3499-3509 - Yuchen Liu, Yaoming Wang, Yabo Chen, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong:
Promoting Semantic Connectivity: Dual Nearest Neighbors Contrastive Learning for Unsupervised Domain Generalization. 3510-3519 - Vibashan VS, Poojan Oza, Vishal M. Patel:
Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection. 3520-3530 - You-Wei Luo, Chuan-Xian Ren:
MOT: Masked Optimal Transport for Partial Domain Adaptation. 3531-3540 - Hao Yu, Xu Cheng, Wei Peng:
TOPLight: Lightweight Neural Networks with Task-Oriented Pretraining for Visible-Infrared Recognition. 3541-3550 - Ye Liu, Lingfeng Qiao, Changchong Lu, Di Yin, Chen Lin, Haoyuan Peng, Bo Ren:
OSAN: A One-Stage Alignment Network to Unify Multimodal Alignment and Unsupervised Domain Adaptation. 3551-3560 - Jinjing Zhu, Haotian Bai, Lin Wang:
Patch-Mix Transformer for Unsupervised Domain Adaptation: A Game Perspective. 3561-3571 - Yizhi Wang, Zeyu Huang, Ariel Shamir, Hui Huang, Hao Zhang, Ruizhen Hu:
ARO-Net: Learning Implicit Fields from Anchored Radial Observations. 3572-3581 - Dhanajit Brahma, Piyush Rai:
A Probabilistic Framework for Lifelong Test-Time Adaptation. 3582-3591 - Runpeng Yu, Songhua Liu, Xingyi Yang, Xinchao Wang:
Distribution Shift Inversion for Out-of-Distribution Prediction. 3592-3602 - Jiali Cui, Ying Nian Wu, Tian Han:
Learning Joint Latent Space EBM Prior Model for Multi-layer Generator. 3603-3612 - Saachi Jain, Hadi Salman, Alaa Khaddaj, Eric Wong, Sung Min Park, Aleksander Madry:
A Data-Based Perspective on Transfer Learning. 3613-3622 - Achin Jain, Gurumurthy Swaminathan, Paolo Favaro, Hao Yang, Avinash Ravichandran, Hrayr Harutyunyan, Alessandro Achille, Onkar Dabeer, Bernt Schiele, Ashwin Swaminathan, Stefano Soatto:
A Meta-Learning Approach to Predicting Performance and Data Requirements. 3623-3632 - Hao Li, Charless C. Fowlkes, Hao Yang, Onkar Dabeer, Zhuowen Tu, Stefano Soatto:
Guided Recommendation for Model Fine-Tuning. 3633-3642 - Peng Liao, Yaochu Jin, Wenli Du:
EMT-NAS: Transferring architectural knowledge between tasks from different datasets. 3643-3653 - Runqi Wang, Xiaoyue Duan, Guoliang Kang, Jianzhuang Liu, Shaohui Lin, Songcen Xu, Jinhu Lv, Baochang Zhang:
AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning. 3654-3663 - Iordanis Fostiropoulos, Jiaye Zhu, Laurent Itti:
Batch Model Consolidation: A Multi-Task Model Consolidation Framework. 3664-3676 - Yinglong Wang, Chao Ma, Jianzhuang Liu:
SmartAssign: Learning A Smart Knowledge Assignment Strategy for Deraining and Desnowing. 3677-3686 - Sucheng Ren, Fangyun Wei, Zheng Zhang, Han Hu:
TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models. 3687-3697 - Ameya Prabhu, Hasan Abed Al Kader Hammoud, Puneet K. Dokania, Philip H. S. Torr, Ser-Nam Lim, Bernard Ghanem, Adel Bibi:
Computationally Budgeted Continual Learning: What Does Matter? 3698-3707 - Kangyang Luo, Xiang Li, Yunshi Lan, Ming Gao:
GradMA: A Gradient-Memory-based Accelerated Federated Learning with Alleviated Catastrophic Forgetting. 3708-3717 - Zhen Zhao, Zhizhong Zhang, Xin Tan, Jun Liu, Yanyun Qu, Yuan Xie, Lizhuang Ma:
Rethinking Gradient Projection Continual Learning: Stability/Plasticity Feature Space Decoupling. 3718-3727 - Yushun Tang, Ce Zhang, Heng Xu, Shuoshuo Chen, Jie Cheng, Luziwei Leng, Qinghai Guo, Zhihai He:
Neuro-Modulated Hebbian Learning for Fully Test-Time Adaptation. 3728-3738 - Jiawei Du, Yidi Jiang, Vincent Y. F. Tan, Joey Tianyi Zhou, Haizhou Li:
Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation. 3749-3758 - Songhua Liu, Jingwen Ye, Runpeng Yu, Xinchao Wang:
Slimmable Dataset Condensation. 3759-3768 - Pengfei Wang, Zhaoxiang Zhang, Zhen Lei, Lei Zhang:
Sharpness-Aware Gradient Matching for Domain Generalization. 3769-3778 - Wonhyeok Choi, Sunghoon Im:
Dynamic Neural Network for Multi-Task Learning Searching across Diverse Network Topologies. 3779-3788 - Ahmed Imtiaz Humayun, Randall Balestriero, Guha Balakrishnan, Richard G. Baraniuk:
SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision Boundaries. 3789-3798 - Jaeill Kim, Suhyun Kang, Duhun Hwang, Jungwook Shin, Wonjong Rhee:
VNE: An Effective Method for Improving Deep Representation by Manipulating Eigenvalue Distribution. 3799-3810 - Yuedong Yang, Guihong Li, Radu Marculescu:
Efficient On-Device Training via Gradient Filtering. 3811-3820 - Tang Li, Fengchun Qiao, Mengmeng Ma, Xi Peng:
Are Data-Driven Explanations Robust Against Out-of-Distribution Data? 3821-3831 - Jongin Lim, Youngdong Kim, Byungjai Kim, Chanho Ahn, Jinwoo Shin, Eunho Yang, Seungju Han:
BiasAdv: Bias-Adversarial Augmentation for Model Debiasing. 3832-3841 - Sheng Xu, Yanjing Li, Mingbao Lin, Peng Gao, Guodong Guo, Jinhu Lü, Baochang Zhang:
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer. 3842-3851 - Juncheol Shin, Junhyuk So, Sein Park, Seungyeop Kang, Sungjoo Yoo, Eunhyeok Park:
NIPQ: Noise proxy-based Integrated Pseudo-Quantization. 3852-3861 - Vinu Sankar Sadasivan, Mahdi Soltanolkotabi, Soheil Feizi:
CUDA: Convolution-Based Unlearnable Datasets. 3862-3871 - Kaiwen Cui, Yingchen Yu, Fangneng Zhan, Shengcai Liao, Shijian Lu, Eric P. Xing:
KD-DLGAN: Data Limited Image Generation via Knowledge Distillation. 3872-3882 - Siddarth Asokan, Chandra Sekhar Seelamantula:
Spider GAN: Leveraging Friendly Neighbors to Accelerate GAN Training. 3883-3893 - Harleen Hanspal, Alessio Lomuscio:
Efficient Verification of Neural Networks Against LVM-Based Specifications. 3894-3903 - Kexin Sun, Zhineng Chen, Gongwei Wang, Jun Liu, Xiongjun Ye, Yu-Gang Jiang:
Bi-directional Feature Fusion Generative Adversarial Network for Ultra-high Resolution Pathological Image Virtual Re-staining. 3904-3913 - Xuan Zhang, Shiyu Li, Xi Li, Ping Huang, Jiulong Shan, Ting Chen:
DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly Detection. 3914-3923 - Ying Zhao:
OmniAL: A Unified CNN Framework for Unsupervised Anomaly Localization. 3924-3933 - Jiahua Dong, Duzhen Zhang, Yang Cong, Wei Cong, Henghui Ding, Dengxin Dai:
Federated Incremental Semantic Segmentation. 3934-3943 - Sangmook Kim, Sangmin Bae, Hwanjun Song, Se-Young Yun:
Re-Thinking Federated Active Learning Based on Inter-Class Diversity. 3944-3953 - Ruipeng Zhang, Qinwei Xu, Jiangchao Yao, Ya Zhang, Qi Tian, Yanfeng Wang:
Federated Domain Generalization with Generalization Adjustment. 3954-3963 - Bo Li, Mikkel N. Schmidt, Tommy S. Alstrøm, Sebastian U. Stich:
On the Effectiveness of Partial Variance Reduction in Federated Learning with Heterogeneous Data. 3964-3973 - Joshua C. Zhao, Ahmed Roushdy Elkordy, Atul Sharma, Yahya H. Ezzeldin, Salman Avestimehr, Saurabh Bagchi:
The Resource Problem of Using Linear Layer Leakage Attack in Federated Learning. 3974-3983 - Jiaming Zhang, Xingjun Ma, Qi Yi, Jitao Sang, Yu-Gang Jiang, Yaowei Wang, Changsheng Xu:
Unlearnable Clusters: Towards Label-Agnostic Unlearnable Examples. 3984-3993 - Shichao Dong, Jin Wang, Renhe Ji, Jiajun Liang, Haoqiang Fan, Zheng Ge:
Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization. 3994-4004 - Kuofeng Gao, Yang Bai, Jindong Gu, Yong Yang, Shu-Tao Xia:
Backdoor Defense via Adaptively Splitting Poisoned Dataset. 4005-4014 - Sheng-Yen Chou, Pin-Yu Chen, Tsung-Yi Ho:
How to Backdoor Diffusion Models? 4015-4024 - Mengxin Zheng, Qian Lou, Lei Jiang:
TrojViT: Trojan Insertion in Vision Transformers. 4025-4034 - Weixin Chen, Dawn Song, Bo Li:
TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets. 4035-4044 - Zikui Cai, Yaoteng Tan, M. Salman Asif:
Ensemble-based Blackbox Attacks on Dense Prediction. 4045-4055 - Yunrui Yu, Cheng-Zhong Xu:
Efficient Loss Function by Minimizing the Detrimental Effect of Floating-Point Errors on Gradient-Based Attacks. 4056-4066 - Iuri Frosio, Jan Kautz:
The Best Defense is a Good Offense: Adversarial Augmentation Against Adversarial Attacks. 4067-4076 - Minjing Dong, Chang Xu:
Adversarial Robustness via Random Projection Filters. 4077-4086 - Bilel Tarchoun, Anouar Ben Khalifa, Mohamed Ali Mahjoub, Nael B. Abu-Ghazaleh, Ihsen Alouani:
Jedi: Entropy-Based Localization and Removal of Adversarial Patches. 4087-4095 - Aishan Liu, Shiyu Tang, Siyuan Liang, Ruihao Gong, Boxi Wu, Xianglong Liu, Dacheng Tao:
Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization. 4096-4107 - Yong Guo, David Stutz, Bernt Schiele:
Improving Robustness of Vision Transformers by Reducing Sensitivity to Patch Corruptions. 4108-4118 - Xiao Yang, Chang Liu, Longlong Xu, Yikai Wang, Yinpeng Dong, Ning Chen, Hang Su, Jun Zhu:
Towards Effective Adversarial Textured 3D Meshes on Physical Face Recognition. 4119-4128 - Zhendong Wang, Jianmin Bao, Wengang Zhou, Weilun Wang, Houqiang Li:
AltFreezing for More General Video Face Forgery Detection. 4129-4138 - Alankar Kotwal, Anat Levin, Ioannis Gkioulekas:
Passive Micron-Scale Time-of-Flight with Sunlight Interferometry. 4139-4149 - Peng Wang, Yuan Liu, Zhaoxi Chen, Lingjie Liu, Ziwei Liu, Taku Komura, Christian Theobalt, Wenping Wang:
F2-NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories. 4150-4159 - Wenjing Bian, Zirui Wang, Kejie Li, Jia-Wang Bian:
NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior. 4160-4169 - Peng Wang, Lingzhe Zhao, Ruijie Ma, Peidong Liu:
BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields. 4170-4179 - Jamie Wynn, Daniyar Turmukhambetov:
DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models. 4180-4189 - Prune Truong, Marie-Julie Rakotosaona, Fabian Manhardt, Federico Tombari:
SPARF: Neural Radiance Fields from Sparse and Noisy Poses. 4190-4200 - Rahul Goel, Dhawal Sirikonda, Saurabh Saini, P. J. Narayanan:
Interactive Segmentation of Radiance Fields. 4201-4211 - Sungheon Park, Minjung Son, Seokhwan Jang, Young Chun Ahn, Ji-Yeon Kim, Nahyup Kang:
Temporal Interpolation is all You Need for Dynamic Neural Radiance Fields. 4212-4221 - Lingzhi Li, Zhen Shen, Zhongshu Wang, Li Shen, Liefeng Bo:
Compressing Volumetric Radiance Fields to 1 MB. 4222-4231 - Kang Han, Wei Xiang:
Multiscale Tensor Decomposition and Rendering Equation Encoding for View Synthesis. 4232-4241 - Yuechen Zhang, Zexin He, Jinbo Xing, Xufeng Yao, Jiaya Jia:
Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization. 4242-4251 - Sida Peng, Yunzhi Yan, Qing Shuai, Hujun Bao, Xiaowei Zhou:
Representing Volumetric Videos as Dynamic MLP Maps. 4252-4262 - Wei Dong, Christopher B. Choy, Charles Loop, Or Litany, Yuke Zhu, Anima Anandkumar:
Fast Monocular Scene Reconstruction with Global-Sparse Local-Dense Grids. 4263-4272 - Zhengqi Li, Qianqian Wang, Forrester Cole, Richard Tucker, Noah Snavely:
DynIBaR: Neural Dynamic Image-Based Rendering. 4273-4284 - Michael Fischer, Tobias Ritschel:
Plateau-Reduced Differentiable Path Tracing. 4285-4294 - Haoqian Wu, Zhipeng Hu, Lincheng Li, Yongqiang Zhang, Changjie Fan, Xin Yu:
NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination. 4295-4304 - Ziang Cheng, Junxuan Li, Hongdong Li:
WildLight: In-the-wild Inverse Rendering with a Flashlight. 4305-4314 - Taotao Zhou, Kai He, Di Wu, Teng Xu, Qixuan Zhang, Kuixiang Shao, Wenzheng Chen, Lan Xu, Jingyi Yu:
Relightable Neural Human Assets from Multi-view Gradient Illuminations. 4315-4327 - Norman Müller, Yawar Siddiqui, Lorenzo Porzi, Samuel Rota Bulò, Peter Kontschieder, Matthias Nießner:
DiffRF: Rendering-Guided 3D Radiance Field Diffusion. 4328-4338 - Tianyuan Zhang, Mark Sheinin, Dorian Chan, Mark Rau, Matthew O'Toole, Srinivasa G. Narasimhan:
Analyzing Physical Impacts Using Transient Surface Wave Imaging. 4339-4348 - Byeongjoo Ahn, Michael DeZeeuw, Ioannis Gkioulekas, Aswin C. Sankaranarayanan:
Neural Kaleidoscopic Space Sculpting. 4349-4358 - Yongqiang Zhang, Zhipeng Hu, Haoqian Wu, Minda Zhao, Lincheng Li, Zhengxia Zou, Changjie Fan:
Towards Unbiased Volume Rendering of Neural Implicit Surfaces with Geometry Priors. 4359-4368 - Jiahui Huang, Zan Gojcic, Matan Atzmon, Or Litany, Sanja Fidler, Francis Williams:
Neural Kernel Surface Reconstruction. 4369-4379 - Mingye Xu, Mutian Xu, Tong He, Wanli Ouyang, Yali Wang, Xiaoguang Han, Yu Qiao:
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency. 4380-4390 - Dario Pavllo, David Joseph Tan, Marie-Julie Rakotosaona, Federico Tombari:
Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion. 4391-4401 - Yinghao Xu, Menglei Chai, Zifan Shi, Sida Peng, Ivan Skorokhodov, Aliaksandr Siarohin, Ceyuan Yang, Yujun Shen, Hsin-Ying Lee, Bolei Zhou, Sergey Tulyakov:
DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis. 4402-4412 - Chi-Chong Wong:
Heat Diffusion Based Multi-Scale and Geometric Structure-Aware Transformer for Mesh Segmentation. 4413-4422 - Yu Deng, Baoyuan Wang, Heung-Yeung Shum:
Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis from Monocular Image. 4423-4433 - Kangle Deng, Gengshan Yang, Deva Ramanan, Jun-Yan Zhu:
3D-aware Conditional Image Synthesis. 4434-4445 - Anna Frühstück, Nikolaos Sarafianos, Yuanlu Xu, Peter Wonka, Tony Tung:
VIVE3D: Viewpoint-Independent Video Editing using 3D-Aware GANs. 4446-4455 - Yen-Chi Cheng, Hsin-Ying Lee, Sergey Tulyakov, Alexander G. Schwing, Liangyan Gui:
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation. 4456-4465 - Konstantinos Tertikas, Despoina Paschalidou, Boxiao Pan, Jeong Joon Park, Mikaela Angelina Uy, Ioannis Z. Emiris, Yannis Avrithis, Leonidas J. Guibas:
Generating Part-Aware Editable 3D Shapes without 3D Supervision. 4466-4478 - Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Yi Wang, Zhangyang Wang:
NeuralLift-360: Lifting an in-the-Wild 2D Photo to A 3D Object with 360° Views. 4479-4489 - Baojin Huang, Zhongyuan Wang, Jifan Yang, Jiaxin Ai, Qin Zou, Qian Wang, Dengpan Ye:
Implicit Identity Driven Deepfake Face Swapping Detection. 4490-4499 - Rohith Agaram, Shaurya Dewan, Rahul Sajnani, Adrien Poulenard, K. Madhava Krishna, Srinath Sridhar:
Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields. 4500-4510 - Xingyu Ren, Jiankang Deng, Chao Ma, Yichao Yan, Xiaokang Yang:
Improving Fairness in Facial Albedo Estimation via Visual-Textual Cues. 4511-4520 - Menghua Wu, Hao Zhu, Linjia Huang, Yiyu Zhuang, Yuanxun Lu, Xun Cao:
High-fidelity 3D Face Generation from Natural Language Descriptions. 4521-4530 - Heyuan Li, Bo Wang, Yu Cheng, Mohan S. Kankanhalli, Robby T. Tan:
DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face Alignment. 4531-4540 - Yunpeng Bai, Yanbo Fan, Xuan Wang, Yong Zhang, Jingxiang Sun, Chun Yuan, Ying Shan:
High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors. 4541-4551 - Rameen Abdal, Hsin-Ying Lee, Peihao Zhu, Menglei Chai, Aliaksandr Siarohin, Peter Wonka, Sergey Tulyakov:
3DAvatarGAN: Bridging Domains for Personalized Editable Avatars. 4552-4562 - Tengfei Wang, Bo Zhang, Ting Zhang, Shuyang Gu, Jianmin Bao, Tadas Baltrusaitis, Jingjing Shen, Dong Chen, Fang Wen, Qifeng Chen, Baining Guo:
RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion. 4563-4573 - Wojciech Zielonka, Timo Bolkart, Justus Thies:
Instant Volumetric Head Avatars. 4574-4584 - Siddarth Ravichandran, Ondrej Texler, Dimitar Dinev, Hyun Jae Kang:
Synthesizing Photorealistic Virtual Humans Through Cross-Modal Disentanglement. 4585-4594 - Xingyi Li, Zhiguo Cao, Huiqiang Sun, Jianming Zhang, Ke Xian, Guosheng Lin:
3D Cinemagraphy from a Single Image. 4595-4605 - Luyang Zhu, Dawei Yang, Tyler Zhu, Fitsum Reda, William Chan, Chitwan Saharia, Mohammad Norouzi, Ira Kemelmacher-Shlizerman:
TryOnDiffusion: A Tale of Two UNets. 4606-4615 - Xingqun Qi, Chen Liu, Muyi Sun, Lincheng Li, Changjie Fan, Xin Yu:
Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement. 4616-4626 - Yasamin Jafarian, Tuanfeng Y. Wang, Duygu Ceylan, Jimei Yang, Nathan Carr, Yi Zhou, Hyun Soo Park:
Normal-guided Garment UV Prediction for Human Re-texturing. 4627-4636 - Lingteng Qiu, Guanying Chen, Jiapeng Zhou, Mutian Xu, Junle Wang, Xiaoguang Han:
REC-MV: REconstructing 3D Dynamic Cloth from Monocular Videos. 4637-4646 - Yukang Cao, Kai Han, Kwan-Yee K. Wong:
SeSDF: Self-Evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction. 4647-4657 - Rolandos Alexandros Potamias, Stylianos Ploumpis, Stylianos Moschoglou, Vasileios Triantafyllou, Stefanos Zafeiriou:
Handy: Towards a High Fidelity 3D Hand Shape and Appearance Model. 4670-4680 - Nikolas Lamb, Cameron Palmer, Benjamin Molloy, Sean Banerjee, Natasha Kholgade Banerjee:
Fantastic Breaks: A Dataset of Paired 3D Scans of Real-World Broken Objects and Their Complete Counterparts. 4681-4691 - Jeff Tan, Gengshan Yang, Deva Ramanan:
Distilling Neural Fields for Real-Time Articulated Shape Reconstruction. 4692-4701 - Rui Guo, Jasmine Collins, Oscar de Lima, Andrew Owens:
GANmouflage: 3D Object Nondetection with Texture Fields. 4702-4712 - Shashank Tripathi, Lea Müller, Chun-Hao P. Huang, Omid Taheri, Michael J. Black, Dimitrios Tzionas:
3D Human Pose Estimation via Intuitive Physics. 4713-4725 - Ilya A. Petrov, Riccardo Marin, Julian Chibane, Gerard Pons-Moll:
Object pop-up: Can we infer 3D objects and their poses from human interactions alone? 4726-4736 - Yinzhen Xu, Weikang Wan, Jialiang Zhang, Haoran Liu, Zikang Shan, Hao Shen, Ruicheng Wang, Haoran Geng, Yijia Weng, Jiayi Chen, Tengyu Liu, Li Yi, He Wang:
UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned Policy. 4737-4746 - Xiongbiao Luo:
Constrained Evolutionary Diffusion Filter for Monocular Endoscope Tracking. 4747-4756 - Xianghui Xie, Bharat Lal Bhatnagar, Gerard Pons-Moll:
Visibility Aware Human-Object Interaction Tracking from Single RGB Camera. 4757-4768 - Hoseong Cho, Chanwoo Kim, Jihyeon Kim, Seongyeong Lee, Elkhan Ismayilzada, Seungryul Baek:
Transformer-based Unified Recognition of Two Hands Manipulating Objects. 4769-4778 - Akash Sengupta, Ignas Budvytis, Roberto Cipolla:
HuManiFlow: Ancestor-Conditioned Normalising Flows on SO(3) Manifolds for Human Pose and Shape Distribution Estimation. 4779-4789 - Zhenhua Tang, Zhaofan Qiu, Yanbin Hao, Richang Hong, Ting Yao:
3D Human Pose Estimation with Spatio-Temporal Criss-Cross Attention. 4790-4799 - Hai Ci, Mingdong Wu, Wentao Zhu, Xiaoxuan Ma, Hao Dong, Fangwei Zhong, Yizhou Wang:
GFPose: Learning 3D Human Pose Prior with Gradient Fields. 4800-4810 - Edward Vendrow, Duy-Tho Le, Jianfei Cai, Hamid Rezatofighi:
JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and Tracking. 4811-4820 - Qiyuan He, Linlin Yang, Kerui Gu, Qiuxia Lin, Angela Yao:
Analyzing and Diagnosing Pose Estimation with Attributions. 4821-4830 - Yang Hai, Rui Song, Jiaojiao Li, Yinlin Hu:
Shape-Constraint Recurrent Flow for 6D Object Pose Estimation. 4831-4840 - Hanzhi Chen, Fabian Manhardt, Nassir Navab, Benjamin Busam:
TexPose: Neural Texture Learning for Self-Supervised 6D Object Pose Estimation. 4841-4852 - Chun-Han Yao, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani:
Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery from Sparse Image Ensemble. 4853-4862 - Bangyan Liao, Delin Qu, Yifei Xue, Huiqing Zhang, Yizhen Lao:
Revisiting Rolling Shutter Bundle Adjustment: Toward Accurate and Fast Solution. 4863-4871 - Yaqing Ding, Jian Yang, Viktor Larsson, Carl Olsson, Kalle Åström:
Revisiting the P3P Problem. 4872-4880 - Samarth Sinha, Roman Shapovalov, Jeremy Reizenstein, Ignacio Rocco, Natalia Neverova, Andrea Vedaldi, David Novotný:
Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories. 4881-4891 - Kejie Li, Jia-Wang Bian, Robert Castle, Philip H. S. Torr, Victor Adrian Prisacariu:
MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices. 4892-4901 - Jiahui Lei, Congyue Deng, Karl Schmeckpeper, Leonidas J. Guibas, Kostas Daniilidis:
EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision. 4902-4912 - Bokui Shen, Xinchen Yan, Charles R. Qi, Mahyar Najibi, Boyang Deng, Leonidas J. Guibas, Yin Zhou, Dragomir Anguelov:
GINA-3D: Learning to Generate Implicit Neural Assets in the Wild. 4913-4926 - Karmesh Yadav, Ram Ramrakhya, Santhosh Kumar Ramakrishnan, Théophile Gervet, John M. Turner, Aaron Gokaslan, Noah Maestre, Angel Xuan Chang, Dhruv Batra, Manolis Savva, Alexander William Clegg, Devendra Singh Chaplot:
Habitat-Matterport 3D Semantics Dataset. 4927-4936 - Tao Chu, Pan Zhang, Qiong Liu, Jiaqi Wang:
BUOL: A Bottom-Up Framework with Occupancy-Aware Lifting for Panoptic 3D Scene Reconstruction From a Single Image. 4937-4946 - Xinhua Cheng, Yanmin Wu, Mengxi Jia, Qian Wang, Jian Zhang:
Panoptic Compositional Feature Field for Editable Scene Rendering with Network-Inferred Labels via Metric Learning. 4947-4957 - Yash Bhalgat, João F. Henriques, Andrew Zisserman:
A Light Touch Approach to Teaching Transformers Multi-view Geometry. 4958-4969 - Yilun Du, Cameron Smith, Ayush Tewari, Vincent Sitzmann:
Learning to Render Novel Views from Wide-Baseline Stereo Pairs. 4970-4980 - Lukas Mehl, Jenny Schmalfuss, Azin Jahedi, Yaroslava Nalivayko, Andrés Bruhn:
Spring: A High-Resolution High-Detail Dataset and Benchmark for Scene Flow, Optical Flow and Stereo. 4981-4991 - Viktor Rudnev, Mohamed Elgharib, Christian Theobalt, Vladislav Golyanik:
EventNeRF: Neural Radiance Fields from a Single Colour Event Camera. 4992-5002 - Shengjie Zhu, Xiaoming Liu:
LightedDepth: Video Depth Estimation in Light of Limited Inference View Angles. 5003-5012 - Ruicheng Feng, Chongyi Li, Huaijin G. Chen, Shuai Li, Jinwei Gu, Chen Change Loy:
Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display Camera. 5013-5022 - Donggun Kim, Hyeonjoong Jang, Inchul Kim, Min H. Kim:
Spatio-Focal Bidirectional Disparity Estimation from a Dual-Pixel Image. 5023-5032 - Chao Ning, Hongping Gan:
Trap Attention: Monocular Depth Estimation with Manual Traps. 5033-5043 - Eric Brachmann, Tommaso Cavallari, Victor Adrian Prisacariu:
Accelerated Coordinate Encoding: Learning to Relocalize in Minutes Using RGB and Poses. 5044-5053 - Brevin Tilmon, Zhanghao Sun, Sanjeev J. Koppal, Yicheng Wu, Georgios Evangelidis, Ramzi Zahreddine, Gurunandan Krishnan, Sizhuo Ma, Jian Wang:
Energy-Efficient Adaptive 3D Sensing. 5054-5063 - Shun-Cheng Wu, Keisuke Tateno, Nassir Navab, Federico Tombari:
Incremental 3D Semantic Scene Graph Prediction from RGB Sequences. 5064-5074 - Zhanghao Sun, Wei Ye, Jinhui Xiong, Gyeongmin Choe, Jialiang Wang, Shuochen Su, Rakesh Ranjan:
Consistent Direct Time-of-Flight Video Depth Super-Resolution. 5075-5085 - Chittesh Thavamani, Mengtian Li, Francesco Ferroni, Deva Ramanan:
Learning to Zoom and Unzoom. 5086-5095 - Yuqi Wang, Yuntao Chen, Zhaoxiang Zhang:
FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D Detection. 5096-5105 - Jiawei He, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang:
3D Video Object Detection with Learnable Object-Centric Global Optimization. 5106-5115 - Shengchao Zhou, Weizhou Liu, Chen Hu, Shuchang Zhou, Chao Ma:
UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye View. 5116-5125 - Haojie Zhao, Junsong Chen, Lijun Wang, Huchuan Lu:
ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data. 5126-5135 - Qi Ming, Lingjuan Miao, Zhe Ma, Lin Zhao, Zhiqiang Zhou, Xuhui Huang, Yuanpei Chen, Yufei Guo:
Deep Dive into Gradients: Better Optimization for 3D Object Detection with Gradient-Corrected IoU Supervision. 5136-5145 - Han Liu, Yuhao Wu, Zhiyuan Yu, Yevgeniy Vorobeychik, Ning Zhang:
SlowLiDAR: Increasing the Latency of LiDAR-Based Detection Using Adversarial Examples. 5146-5155 - Nishant Kumar, Sinisa Segvic, Abouzar Eslami, Stefan Gumhold:
Normalizing Flow based Feature Synthesis for Outlier-Aware Object Detection. 5156-5165 - Chao Zhou, Yanan Zhang, Jiaxin Chen, Di Huang:
OcTr: Octree-Based Transformer for 3D Object Detection. 5166-5175 - Sijie Wang, Qiyu Kang, Rui She, Wei Wang, Kai Zhao, Yang Song, Wee Peng Tay:
HypLiLoc: Towards Effective LiDAR Pose Regression with Hyperbolic Fusion. 5176-5185 - Song Wang, Wentong Li, Wenyu Liu, Xiaolu Liu, Jianke Zhu:
LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation. 5186-5195 - Chenhang He, Ruihuang Li, Yabin Zhang, Shuai Li, Lei Zhang:
MSF: Motion-guided Sequential Fusion for Efficient 3D Object Detection from Point Cloud Sequences. 5196-5205 - Fei Xue, Ignas Budvytis, Roberto Cipolla:
SFD2: Semantic-Guided Feature Detection and Description. 5206-5216 - Lucas Nunes, Louis Wiesmann, Rodrigo Marcuzzi, Xieyuanli Chen, Jens Behley, Cyrill Stachniss:
Temporal Consistent 3D LiDAR Representation Learning for Semantic Perception in Autonomous Driving. 5217-5228 - Bo Pang, Hongchi Xia, Cewu Lu:
Unsupervised 3D Point Cloud Representation Learning by Triangle Constrained Contrast for Autonomous Driving. 5229-5239 - Angelika Ando, Spyros Gidaris, Andrei Bursuc, Gilles Puy, Alexandre Boulch, Renaud Marlet:
RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous Driving. 5240-5250 - Yanhao Wu, Tong Zhang, Wei Ke, Sabine Süsstrunk, Mathieu Salzmann:
Spatiotemporal Self-Supervised Learning for Point Clouds in the Wild. 5251-5260 - Utkarsh Mall, Bharath Hariharan, Kavita Bala:
Change-Aware Sampling and Contrastive Learning for Satellite Images. 5261-5270 - Yaqi Shen, Le Hui, Jin Xie, Jian Yang:
Self-Supervised 3D Scene Flow Estimation Guided by Superpoints. 5271-5280 - Itai Lang, Dror Aiger, Forrester Cole, Shai Avidan, Michael Rubinstein:
SCOOP: Self-Supervised Correspondence and Optimization-Based Scene Flow. 5281-5290 - Anthony Chen, Kevin Zhang, Renrui Zhang, Zihan Wang, Yuheng Lu, Yandong Guo, Shanghang Zhang:
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection. 5291-5301 - Yaomin Huang, Ning Liu, Zhengping Che, Zhiyuan Xu, Chaomin Shen, Yaxin Peng, Guixu Zhang, Xinmei Liu, Feifei Feng, Jian Tang:
CP3: Channel Pruning Plug-in for Point-Based Networks. 5302-5312 - Xiuwei Xu, Ziwei Wang, Jie Zhou, Jiwen Lu:
Binarizing Sparse Convolutional Networks for Efficient Point Cloud Analysis. 5313-5322 - Junming Zhang, Haomeng Zhang, Ram Vasudevan, Matthew Johnson-Roberson:
Hyperspherical Embedding for Point Cloud Completion. 5323-5332 - Chengzhi Wu, Junwei Zheng, Julius Pfrommer, Jürgen Beyerer:
Attention-Based Point Cloud Edge Sampling. 5333-5343 - Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi:
Starting from Non-Parametric Networks for 3D Point Cloud Analysis. 5344-5353 - Yun He, Danhang Tang, Yinda Zhang, Xiangyang Xue, Yanwei Fu:
Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions. 5354-5363 - Jiacheng Deng, Chuxin Wang, Jiahao Lu, Jianfeng He, Tianzhu Zhang, Jiyang Yu, Zhe Zhang:
SE-ORNet: Self-Ensembling Orientation-Aware Network for Unsupervised Point Cloud Shape Correspondence. 5364-5373 - Shengwei Qin, Zhong Li, Ligang Liu:
Robust 3D Shape Classification via Non-local Graph Attention Network. 5374-5383 - Hao Yu, Zheng Qin, Ji Hou, Mahdi Saleh, Dongsheng Li, Benjamin Busam, Slobodan Ilic:
Rotation-Invariant Transformer for Point Cloud Matching. 5384-5393 - Zheng Qin, Hao Yu, Changjian Wang, Yuxing Peng, Kai Xu:
Deep Graph-based Spatial Consistency for Robust Non-rigid Point Cloud Registration. 5394-5403 - Tianlu Zhang, Hongyuan Guo, Qiang Jiao, Qiang Zhang, Jungong Han:
Efficient RGB-T Tracking via Cross-Modality Distillation. 5404-5413 - Daniel Barath, Denys Rozumnyi, Ivan Eichhardt, Levente Hajder, Jiri Matas:
Finding Geometric Models by Clustering in the Consensus Space. 5414-5424 - Dihe Huang, Ying Chen, Yong Liu, Jianlin Liu, Shang Xu, Wenlong Wu, Yikang Ding, Fan Tang, Chengjie Wang:
Adaptive Assignment for Geometry Aware Local Feature Matching. 5425-5434 - Zhibo Rao, Bangshu Xiong, Mingyi He, Yuchao Dai, Renjie He, Zhelun Shen, Xing Li:
Masked Representation Learning for Domain Generalized Stereo Matching. 5435-5444 - Han Ling, Yinghui Sun, Quansen Sun, Zhenwen Ren:
Learning Optical Expansion from Scale Matching. 5445-5454 - Hyunyoung Jung, Zhuo Hui, Lei Luo, Haitao Yang, Feng Liu, Sungjoo Yoo, Rakesh Ranjan, Denis Demandolx:
AnyFlow: Arbitrary Scale Optical Flow with Implicit Neural Representation. 5455-5465 - Mohammad Amin Shabani, Sepidehsadat Hosseini, Yasutaka Furukawa:
HouseDiffusion: Vector Floorplan Generation via a Diffusion Model with Discrete and Continuous Denoising. 5466-5475 - Abdul Hannan Khan, Mohammed Shariq Nawaz, Andreas Dengel:
Localized Semantic Feature Mixers for Efficient Pedestrian Detection in Autonomous Driving. 5476-5485 - Haibao Yu, Wenxian Yang, Hongzhi Ruan, Zhenwei Yang, Yingjuan Tang, Xu Gao, Xin Hao, Yifeng Shi, Yifeng Pan, Ning Sun, Juan Song, Jirui Yuan, Ping Luo, Zaiqing Nie:
V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting. 5486-5495 - Junru Gu, Chenxu Hu, Tianyuan Zhang, Xuanyao Chen, Yilun Wang, Yue Wang, Hang Zhao:
ViP3D: End-to-End Visual Trajectory Prediction via 3D Agent Queries. 5496-5506 - Dekai Zhu, Guangyao Zhai, Yan Di, Fabian Manhardt, Hendrik Berkemeyer, Tuan Tran, Nassir Navab, Federico Tombari, Benjamin Busam:
IPCC-TP: Utilizing Incremental Pearson Correlation Coefficient for Joint Multi-Agent Trajectory Prediction. 5507-5516 - Weibo Mao, Chenxin Xu, Qi Zhu, Siheng Chen, Yanfeng Wang:
Leapfrog Diffusion Model for Stochastic Trajectory Prediction. 5517-5526 - Xiaoning Sun, Huaijiang Sun, Bin Li, Dong Wei, Weiqing Li, Jianfeng Lu:
DeFeeNet: Consecutive 3D Human Motion Prediction with Deviation Feedback. 5527-5536 - Zhehan Kan, Shuoshuo Chen, Ce Zhang, Yushun Tang, Zhihai He:
Self-Correctable and Adaptable Inference for Generalizable Human Pose Estimation. 5537-5546 - Shiwei Jin, Zhen Wang, Lei Wang, Ning Bi, Truong Q. Nguyen:
ReDirTrans: Latent-to-Latent Translation for Gaze and Head Redirection. 5547-5556 - Zhou Huang, Hang Dai, Tian-Zhu Xiang, Shuo Wang, Huai-Xin Chen, Jie Qin, Huan Xiong:
Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers. 5557-5566 - Siyuan Li, Tobias Fischer, Lei Ke, Henghui Ding, Martin Danelljan, Fisher Yu:
OVTrack: Open-Vocabulary Multiple Object Tracking. 5567-5577 - Huanzhang Dou, Pengyi Zhang, Wei Su, Yunlong Yu, Yining Lin, Xi Li:
GaitGCI: Generative Counterfactual Intervention for Gait Recognition. 5578-5588 - Dimitrios Kollias:
Multi-Label Compound Expression Recognition: C-EXPR Database & Network. 5589-5598 - Lianxin Xie, Wen Xue, Zhen Xu, Si Wu, Zhiwen Yu, Hau-San Wong:
Blemish-aware and Progressive Face Retouching with Limited Paired Data. 5599-5608 - Yue Gao, Yuan Zhou, Jinglu Wang, Xiao Li, Xiang Ming, Yan Lu:
High-Fidelity and Freely Controllable Talking Head Video Generation. 5609-5619 - Lei Wang, Piotr Koniusz:
3Mformer: Multi-order Multi-mode Transformer for Skeletal Action Recognition. 5620-5631 - Zixiang Zhou, Baoyuan Wang:
UDE: A Unified Driving Engine for Human Motion Generation. 5632-5641 - Nico Messikommer, Carter Fang, Mathias Gehrig, Davide Scaramuzza:
Data-Driven Feature Tracking for Event Cameras. 5642-5651 - Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny:
MoStGAN-V: Video Generation with Temporal Motion Styles. 5652-5661 - Boyang Zhang, Kehua Ma, Suping Wu, Zhixiang Yuan:
Two-stage Co-segmentation Network Based on Discriminative Representation for Recovering Human Mesh from Videos. 5662-5670 - Bin Fan, Yuxin Mao, Yuchao Dai, Zhexiong Wan, Qi Liu:
Joint Appearance and Motion Learning for Efficient Rolling Shutter Correction. 5671-5681 - Guozhen Zhang, Yuhan Zhu, Haonan Wang, Youxin Chen, Gangshan Wu, Limin Wang:
Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation. 5682-5692 - Zhiliang Wu, Changchang Sun, Hanyu Xuan, Yan Yan:
Deep Stereo Video Inpainting. 5693-5702 - Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Burstormer: Burst Image Restoration and Enhancement Transformer. 5703-5712 - Zhihang Zhong, Mingdeng Cao, Xiang Ji, Yinqiang Zheng, Imari Sato:
Blur Interpolation Transformer for Real-World Motion from Blur. 5713-5723 - Yiheng Chi, Xingguang Zhang, Stanley H. Chan:
HDR Imaging with Spatially Varying Signal-to-Noise Ratios. 5724-5734 - Yusaku Yoshida, Ryo Kawahara, Takahiro Okabe:
Light Source Separation and Intrinsic Image Decomposition under AC Illumination. 5735-5743 - Yue Cao, Ming Liu, Shuai Liu, Xiaotao Wang, Lei Lei, Wangmeng Zuo:
Physics-Guided ISO-Dependent Sensor Noise Modeling for Extreme Low-Light Photography. 5744-5753 - Yuhui Quan, Zicong Wu, Hui Ji:
Neumann Network with Recursive Kernels for Single Image Defocus Deblurring. 5754-5763 - Carlos Rodríguez-Pardo, Henar Dominguez-Elvira, David Pascual-Hernández, Elena Garces:
UMat: Uncertainty-Aware Single Image High Resolution Material Capture. 5764-5774 - Qingsen Yan, Song Zhang, Weiye Chen, Hao Tang, Yu Zhu, Jinqiu Sun, Luc Van Gool, Yanning Zhang:
SMAE: Few-shot Learning for HDR Deghosting with Saturation-Aware Masked Autoencoders. 5775-5784 - Yu Zheng, Jiahui Zhan, Shengfeng He, Junyu Dong, Yong Du:
Curricular Contrastive Regularization for Physics-Aware Single Image Dehazing. 5785-5794 - Gregory Vaksman, Michael Elad:
PatchCraft Self-Supervised Training for Correlated Image Denoising. 5795-5804 - Miaoyu Li, Ji Liu, Ying Fu, Yulun Zhang, Dejing Dou:
Spectral Enhanced Rectangle Transformer for Hyperspectral Image Denoising. 5805-5814 - Dongwon Park, Byung Hyun Lee, Se Young Chun:
All-in-One Image Restoration for Unknown Degradations Using Adaptive Discriminative Filters for Specific Degradations. 5815-5824 - Jinghao Zhang, Jie Huang, Mingde Yao, Zizheng Yang, Hu Yu, Man Zhou, Feng Zhao:
Ingredient-oriented Multi-Degradation Learning for Image Restoration. 5825-5835 - Fadi Boutros, Meiling Fang, Marcel Klemt, Biying Fu, Naser Damer:
CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability. 5836-5845 - Avinab Saha, Sandeep Mishra, Alan C. Bovik:
Re-IQA: Unsupervised Learning for Image Quality Assessment in the Wild. 5846-5855 - Zhijun Tu, Jie Hu, Hanting Chen, Yunhe Wang:
Toward Accurate Post-Training Quantization for Image Super Resolution. 5856-5865 - Jiacheng Li, Chang Chen, Wei Huang, Zhiqiang Lang, Fenglong Song, Youliang Yan, Zhiwei Xiong:
Learning Steerable Function for Efficient Image Resampling. 5866-5875 - Woo Kyoung Han, Byeonghun Lee, Sang Hyun Park, Kyong Hwan Jin:
ABCD: Arbitrary Bitwise Coefficient for De-Quantization. 5876-5885 - Lingshun Kong, Jiangxin Dong, Jianjun Ge, Mingqiang Li, Jinshan Pan:
Efficient Frequency Domain-based Transformers for High-Quality Image Deblurring. 5886-5895 - Xiang Chen, Hao Li, Mingqiang Li, Jinshan Pan:
Learning A Sparse Transformer Network for Effective Image Deraining. 5896-5905 - Zixiang Zhao, Haowen Bai, Jiangshe Zhang, Yulun Zhang, Shuang Xu, Zudi Lin, Radu Timofte, Luc Van Gool:
CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion. 5906-5916 - Julian Jorge Andrade Guerreiro, Mitsuru Nakazawa, Björn Stenger:
PCT-Net: Full Resolution Image Harmonization Using Pixel-Wise Color Transformations. 5917-5926 - Ke Wang, Michaël Gharbi, He Zhang, Zhihao Xia, Eli Shechtman:
Semi-Supervised Parametric Real-World Image Harmonization. 5927-5936 - Chenfan Qu, Chongyu Liu, Yuliang Liu, Xinhong Chen, Dezhi Peng, Fengjun Guo, Lianwen Jin:
Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution. 5937-5946 - Siyu Huang, Jie An, Donglai Wei, Jiebo Luo, Hanspeter Pfister:
QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity. 5947-5956 - Takehiro Aoshima, Takashi Matsubara:
Deep Curvilinear Editing: Commutative and Nonlinear Image Manipulation for Pretrained Deep Generative Model. 5957-5967 - Ankan Kumar Bhunia, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Jorma Laaksonen, Mubarak Shah, Fahad Shahbaz Khan:
Person Image Synthesis via Denoising Diffusion Model. 5968-5976 - Gang Dai, Yifan Zhang, Qingfeng Wang, Qing Du, Zhuliang Yu, Zhuoman Liu, Shuangping Huang:
Disentangling Writer and Character Styles for Handwriting Generation. 5977-5986 - Harsh Rangwani, Lavish Bansal, Kartik Sharma, Tejan Karmali, Varun Jampani, R. Venkatesh Babu:
NoisyTwins: Class-Consistent and Diverse Image Generation Through StyleGANs. 5987-5996 - Jaskirat Singh, Stephen Gould, Liang Zheng:
High-Fidelity Guided Image Synthesis with Latent Diffusion Models. 5997-6006 - Bahjat Kawar, Shiran Zada, Oran Lang, Omer Tov, Huiwen Chang, Tali Dekel, Inbar Mosseri, Michal Irani:
Imagic: Text-Based Real Image Editing with Diffusion Models. 6007-6017 - HsiaoYuan Hsu, Xiangteng He, Yuxin Peng, Hao Kong, Qing Zhang:
PosterLayout: A New Benchmark and Approach for Content-Aware Visual-Textual Presentation Layout. 6018-6026 - Zhixing Zhang, Ligong Han, Arnab Ghosh, Dimitris N. Metaxas, Jian Ren:
SINE: SINgle Image Editing with Text-to-Image Diffusion Models. 6027-6037 - Ron Mokady, Amir Hertz, Kfir Aberman, Yael Pritch, Daniel Cohen-Or:
Null-text Inversion for Editing Real Images using Guided Diffusion Models. 6038-6047 - Gowthami Somepalli, Vasu Singla, Micah Goldblum, Jonas Geiping, Tom Goldstein:
Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models. 6048-6058 - Hyungjin Chung, Jeongsol Kim, Sehui Kim, Jong Chul Ye:
Parallel Diffusion Models of Operator and Image for Blind Inverse Problems. 6059-6069 - Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel:
Unite and Conquer: Plug & Play Multi-Modal Synthesis Using Diffusion Models. 6070-6079 - Ziqi Huang, Kelvin C. K. Chan, Yuming Jiang, Ziwei Liu:
Collaborative Diffusion for Multi-Modal Face Generation and Editing. 6080-6090 - Gyeongman Kim, Hajin Shim, Hyunsu Kim, Yunjey Choi, Junho Kim, Eunho Yang:
Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding. 6091-6100 - Runsen Feng, Zongyu Guo, Weiping Li, Zhibo Chen:
NVTC: Nonlinear Vector Transform Coding. 6101-6110 - Linfeng Qi, Jiahao Li, Bin Li, Houqiang Li, Yan Lu:
Motion Information Propagation for Neural Video Compression. 6111-6120 - Xiaotao Hu, Zhewei Huang, Ailin Huang, Jun Xu, Shuchang Zhou:
A Dynamic Multi-Scale Voxel Flow Network for Video Prediction. 6121-6131 - Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava:
Towards Scalable Neural Representation for Diverse Videos. 6132-6142 - Shaowen Xie, Hao Zhu, Zhen Liu, Qi Zhang, You Zhou, Xun Cao, Zhan Ma:
DINER: Disorder-Invariant Implicit Neural Representation. 6143-6152 - Jiafeng Li, Ying Wen, Lianghua He:
SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy. 6153-6162 - Xuan Shen, Yaohua Wang, Ming Lin, Yilun Huang, Hao Tang, Xiuyu Sun, Yanzhi Wang:
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network. 6163-6173 - Jiechong Song, Chong Mou, Shiqi Wang, Siwei Ma, Jian Zhang:
Optimization-Inspired Cross-Attention Transformer for Compressive Sensing. 6174-6184 - Ali Hassani, Steven Walton, Jiachen Li, Shen Li, Humphrey Shi:
Neighborhood Attention Transformer. 6185-6194 - Shuning Chang, Pichao Wang, Ming Lin, Fan Wang, David Junhao Zhang, Rong Jin, Mike Zheng Shou:
Making Vision Transformers Efficient from A Token Sparsification View. 6195-6205 - Gongjie Zhang, Zhipeng Luo, Zichen Tian, Jingyi Zhang, Xiaoqin Zhang, Shijian Lu:
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors. 6206-6216 - Steffen Czolbe, Adrian V. Dalca:
Neuralizer: General Neuroimage Analysis without Re-Training. 6217-6230 - Saimunur Rahman, Piotr Koniusz, Lei Wang, Luping Zhou, Peyman Moghadam, Changming Sun:
Learning Partial Correlation based Deep Visual Representation for Image Classification. 6231-6240 - Xiangwen Kong, Xiangyu Zhang:
Understanding Masked Image Modeling via Learning Occlusion Invariant Feature. 6241-6251 - Jihao Liu, Xin Huang, Jinliang Zheng, Yu Liu, Hongsheng Li:
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers. 6252-6261 - Lai Wei, Zhengwei Chen, Jun Yin, Changming Zhu, Rigui Zhou, Jin Liu:
Adaptive Graph Convolutional Subspace Clustering. 6262-6271 - Runzhong Wang, Ziao Guo, Shaofei Jiang, Xiaokang Yang, Junchi Yan:
Deep Learning of Partial Graph Matching via Differentiable Top-K. 6272-6281 - Zhihao Lin, Yongtao Wang, Jinhe Zhang, Xiaojie Chu:
DynamicDet: A Unified Dynamic Architecture for Object Detection. 6282-6291 - Sanjoy Kundu, Sathyanarayanan N. Aakur:
IS-GGT: Iterative Scene Graph Generation with Generative Transformers. 6292-6301 - Tianlei Jin, Fangtai Guo, Qiwei Meng, Shiqiang Zhu, Xiangming Xi, Wen Wang, Zonghao Mu, Wei Song:
Fast Contextual Scene Graph Generation with Unbiased Context Augmentation. 6302-6311 - Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang:
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning. 6312-6322 - Rezaul Karim, He Zhao, Richard P. Wildes, Mennatullah Siam:
MED-VT: Multiscale Encoder-Decoder Video Transformer with Application to Object Segmentation. 6323-6333 - Richard E. L. Higgins, David F. Fouhey:
MOVES: Manipulated Objects in Video Enable Segmentation. 6334-6343 - Qihao Liu, Junfeng Wu, Yi Jiang, Xiang Bai, Alan L. Yuille, Song Bai:
InstMove: Instance Motion for Object-centric Video Segmentation. 6344-6354