


default search action
ICCV 2021: Montreal, QC, Canada
- 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021. IEEE 2021, ISBN 978-1-6654-2812-5
- Abdullah Hamdi
, Silvio Giancola, Bernard Ghanem
:
MVTN: Multi-View Transformation Network for 3D Shape Recognition. 1-11 - Boyu Chen, Peixia Li, Chuming Li, Baopu Li, Lei Bai
, Chen Lin, Ming Sun, Junjie Yan, Wanli Ouyang
:
GLiT: Neural Architecture Search for Global and Local Image Transformer. 12-21 - Haiping Wu, Bin Xiao, Noel Codella, Mengchen Liu, Xiyang Dai, Lu Yuan, Lei Zhang:
CvT: Introducing Convolutions to Vision Transformers. 22-31 - Hugo Touvron, Matthieu Cord, Alexandre Sablayrolles, Gabriel Synnaeve, Hervé Jégou:
Going deeper with Image Transformers. 32-42 - Bin Xiao, Haifeng Wu, Xiuli Bi:
DTMNet: A Discrete Tchebichef Moments-based Deep Neural Network for Multi-focus Image Fusion. 43-51 - Zhiqiang Tang, Yunhe Gao, Yi Zhu, Zhi Zhang, Mu Li, Dimitris N. Metaxas:
CrossNorm and SelfNorm for Generalization under Distribution Shifts. 52-61 - Zhi-Fan Wu, Tong Wei, Jianwen Jiang, Chaojie Mao, Mingqian Tang, Yufeng Li:
NGC: A Unified Framework for Learning with Open-World Noisy Data. 62-71 - Xiong Zhou, Xianming Liu, Chenyang Wang
, Deming Zhai, Junjun Jiang, Xiangyang Ji:
Learning with Noisy Labels via Sparse Regularization. 72-81 - Tal Ridnik, Emanuel Ben Baruch, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, Lihi Zelnik-Manor:
Asymmetric Loss For Multi-Label Classification. 82-91 - Han-Jia Ye, De-Chuan Zhan, Wei-Lun Chao:
Procrustean Training for Imbalanced Deep Learning. 92-102 - Yunrui Guo, Guglielmo Camporese, Wenjing Yang, Alessandro Sperduti, Lamberto Ballan
:
Conditional Variational Capsule Network for Open Set Recognition. 103-111 - Jiarui Cai, Yizhou Wang, Jenq-Neng Hwang:
ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot. 112-121 - Shiming Chen
, Wenjie Wang, Beihao Xia, Qinmu Peng, Xinge You, Feng Zheng, Ling Shao:
FREE: Feature Refinement for Generalized Zero-Shot Learning. 122-131 - Jinheng Xie, Cheng Luo, Xiangping Zhu, Ziqi Jin, Weizeng Lu, Linlin Shen:
Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization. 132-141 - Nanyi Fei, Yizhao Gao, Zhiwu Lu, Tao Xiang:
Z-Score Normalization, Hubness, and Few-Shot Learning. 142-151 - Abhishek Aich, Meng Zheng, Srikrishna Karanam, Terrence Chen, Amit K. Roy-Chowdhury, Ziyan Wu:
Spatio-Temporal Representation Factorization for Video-based Person Re-Identification. 152-162 - Jiawei Zhao, Ke Yan, Yifan Zhao, Xiaowei Guo, Feiyue Huang, Jia Li:
Transformer-based Dual Relation Graph for Multi-label Image Recognition. 163-172 - Didik Purwanto, Yie-Tarng Chen, Wen-Hsien Fang:
Dance with Self-Attention: A New Look of Conditional Random Fields on Anomaly Detection in Videos. 173-183 - Ke Zhu, Jianxin Wu:
Residual Attention: A Simple but Effective Method for Multi-Label Recognition. 184-193 - Ming Li, Xinming Huang, Ziming Zhang:
Self-supervised Geometric Features Discovery via Interpretable Attention for Vehicle Re-Identification and Beyond. 194-204 - Jiajian Zhao
, Yifan Zhao, Jia Li, Ke Yan, Yonghong Tian:
Heterogeneous Relational Complement for Vehicle Re-identification. 205-214 - Yukun Huang
, Xueyang Fu
, Zheng-Jun Zha:
Attack-Guided Perceptual Data Generation for Real-world Re-Identification. 215-224 - Ziyu Wei, Xi Yang, Nannan Wang, Xinbo Gao:
Syncretic Modality Collaborative Learning for Visible Infrared Person Re-Identification. 225-234 - Yin-Yin He, Jianxin Wu, Xiu-Shen Wei:
Distilling Virtual Examples for Long-tailed Recognition. 235-244 - Florian Strohm, Ekta Sood, Sven Mayer
, Philipp Müller, Mihai Bâce, Andreas Bulling:
Neural Photofit: Gaze-based Mental Image Reconstruction. 245-254 - Philipp Bomatter, Mengmi Zhang
, Dimitar Karev, Spandan Madan, Claire Tseng, Gabriel Kreiman
:
When Pigs Fly: Contextual Reasoning in Synthetic and Natural Scenes. 255-264 - Juan León Alcázar, Fabian Caba Heilbron, Ali K. Thabet
, Bernard Ghanem
:
MAAS: Multi-modal Assignation for Active Speaker Detection. 265-274 - Sagnik Majumder, Ziad Al-Halah, Kristen Grauman:
Move2Hear: Active Audio-Visual Source Separation. 275-285 - Nikhil Singh, Jeff Mentch, Jerry Ng, Matthew Beveridge, Iddo Drori:
Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis. 286-295 - Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro
:
Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video. 296-306 - Boyu Chen, Peixia Li, Baopu Li, Chen Lin, Chuming Li, Ming Sun, Junjie Yan, Wanli Ouyang:
BN-NAS: Neural Architecture Search with Batch Normalization. 307-316 - Kun Yuan, Quanquan Li, Shaopeng Guo, Dapeng Chen, Aojun Zhou, Fengwei Yu, Ziwei Liu:
Differentiable Dynamic Wirings for Neural Networks. 317-326 - Daquan Zhou
, Xiaojie Jin, Xiaochen Lian, Linjie Yang, Yujing Xue, Qibin Hou, Jiashi Feng:
AutoSpace: Neural Architecture Search with Less Human Interference. 327-336 - Ming Lin, Pichao Wang, Zhenhong Sun, Hesen Chen, Xiuyu Sun, Qi Qian, Hao Li, Rong Jin:
Zen-NAS: A Zero-Shot NAS for High-Performance Image Recognition. 337-346 - Chun-Fu (Richard) Chen, Quanfu Fan, Rameswar Panda:
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification. 347-356 - Zhiliang Peng, Wei Huang, Shanzhi Gu, Lingxi Xie, Yaowei Wang, Jianbin Jiao, Qixiang Ye:
Conformer: Local Features Coupling Global Representations for Visual Recognition. 357-366 - Zizheng Pan, Bohan Zhuang, Jing Liu
, Haoyu He, Jianfei Cai:
Scalable Vision Transformers with Hierarchical Pooling. 367-376 - Xiaoyu Yue, Shuyang Sun, Zhanghui Kuang, Meng Wei, Philip H. S. Torr, Wayne Zhang
, Dahua Lin:
Vision Transformer with Progressive Sampling. 377-386 - Hila Chefer, Shir Gur, Lior Wolf:
Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers. 387-396 - Xin Wei, Yifei Gong, Fudong Wang, Xing Sun
, Jian Sun:
Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views. 397-406 - Cheng Zhang, Tai-Yu Pan, Yandong Li, Hexiang Hu
, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao:
MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection. 407-417 - Xiaoshi Wu, Hadar Averbuch-Elor, Jin Sun, Noah Snavely:
Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. 418-427 - Bo Xu
, Han Huang, Cheng Lu, Ziwen Li, Yandong Guo:
Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction. 428-437 - Ziwei Wang, Yonhon Ng, Cedric Scheerlinck, Robert E. Mahony:
An Asynchronous Kalman Filter for Hybrid Event Cameras. 438-447 - Guangyao Chen, Peixi Peng, Li Ma
, Jia Li, Lin Du, Yonghong Tian:
Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain. 448-457 - Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Lei Zhang, Nuno Vasconcelos
:
MicroNet: Improving Image Recognition with Extremely Low FLOPs. 458-467 - Haozhe Liu, Haoqian Wu, Weicheng Xie, Feng Liu
, Linlin Shen:
Group-wise Inhibition based Feature Regularization for Robust Classification. 468-476 - Yanfu Zhang, Shangqian Gao, Heng Huang:
Exploration and Estimation for Model Compression. 477-486 - Hossein Talebi, Peyman Milanfar:
Learning to Resize Images for Computer Vision Tasks. 487-496 - Zhonghua Wu, Xiangxi Shi, Guosheng Lin, Jianfei Cai:
Learning Meta-class Memory for Few-Shot Semantic Segmentation. 497-506 - Shuyang Sun, Xiaoyu Yue, Xiaojuan Qi, Wanli Ouyang
, Victor Prisacariu, Philip H. S. Torr:
Aggregation with Feature Detection. 507-516 - Chris Dongjoo Kim, Jinseo Jeong, Sangwoo Moon, Gunhee Kim:
Continual Learning on Noisy Data Streams via Self-Purified Replay. 517-527 - Sihyeon Kim, Sanghyeok Lee, Dasol Hwang, Jaewon Lee, Seong Jae Hwang, Hyunwoo J. Kim:
Point Cloud Augmentation with Weighted Local Transformations. 528-537 - Li Yuan, Yunpeng Chen
, Tao Wang
, Weihao Yu
, Yujun Shi, Zihang Jiang, Francis E. H. Tay, Jiashi Feng, Shuicheng Yan:
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet. 538-547 - Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan
, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao:
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. 548-558 - Kun Yuan, Shaopeng Guo, Ziwei Liu, Aojun Zhou, Fengwei Yu, Wei Wu:
Incorporating Convolution Designs into Visual Transformers. 559-568 - Zhengsu Chen, Lingxi Xie, Jianwei Niu
, Xuefeng Liu, Longhui Wei, Qi Tian:
Visformer: The Vision-friendly Transformer. 569-578 - Bichen Wu, Chenfeng Xu, Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Zhicheng Yan, Masayoshi Tomizuka, Joseph Gonzalez
, Kurt Keutzer, Peter Vajda:
Visual Transformers: Where Do Transformers Really Belong in Vision Models? 579-589 - Xuhui Jia, Kai Han, Yukun Zhu, Bradley Green:
Joint Representation Learning and Novel Category Discovery on Single- and Multi-modal Data. 590-599 - Shaoli Huang, Xinchao Wang
, Dacheng Tao:
Stochastic Partial Swap: Enhanced Model Generalization and Interpretability for Fine-grained Recognition. 600-609 - Tianhao Li, Limin Wang, Gangshan Wu:
Self Supervision to Distillation for Long-Tailed Visual Recognition. 610-619 - Avi Ben-Cohen, Nadav Zamir, Emanuel Ben Baruch, Itamar Friedman, Lihi Zelnik-Manor:
Semantic Diversity Learning for Zero-Shot Multi-label Classification. 620-630 - Xueting Zhang, Debin Meng, Henry Gouk, Timothy M. Hospedales:
Shallow Bayesian Meta Learning for Real-World Few-Shot Recognition. 631-640 - Chengzhi Mao, Mia Chiquier, Hao Wang, Junfeng Yang, Carl Vondrick:
Adversarial Attacks are Reversible with Natural Supervision. 641-651 - Jie Hu, Liujuan Cao, Tong Tong, Qixiang Ye, Shengchuan Zhang, Ke Li, Feiyue Huang, Ling Shao, Rongrong Ji:
Architecture Disentanglement for Deep Neural Networks. 652-661 - Xuejun Zhao, Wencan Zhang, Xiaokui Xiao, Brian Y. Lim:
Exploiting Explanations for Model Inversion Attacks. 662-672 - Oran Lang, Yossi Gandelsman, Michal Yarom, Yoav Wald, Gal Elidan, Avinatan Hassidim, William T. Freeman, Phillip Isola, Amir Globerson, Michal Irani, Inbar Mosseri:
Explaining in Style: Training a GAN to explain a classifier in StyleSpace. 673-682 - Stephan J. Lemmer, Jason J. Corso:
Ground-truth or DAER: Selective Re-query of Secondary Information. 683-694 - Jiequan Cui, Zhisheng Zhong, Shu Liu, Bei Yu, Jiaya Jia
:
Parametric Contrastive Learning. 695-704 - Zizhao Zhang, Tomas Pfister:
Learning Fast Sample Re-weighting Without Reward Data. 705-714 - Seulki Park, Jongin Lim
, Younghan Jeon, Jin Young Choi:
Influence-Balanced Loss for Imbalanced Visual Classification. 715-724 - Shunyan Luo, Emre Barut, Fang Jin
:
Statistically Consistent Saliency Estimation. 725-733 - Yunze Liu, Qingnan Fan, Shanghang Zhang, Hao Dong, Thomas A. Funkhouser, Li Yi:
Contrastive Multimodal Fusion with TupleInfoNCE. 734-743 - Xiaofeng Liu, Site Li
, Yubin Ge, Pengyi Ye, Jane You, Jun Lu:
Recursively Conditional Gaussian for Ordinal Unsupervised Domain Adaptation. 744-753 - Samuel G. Müller, Frank Hutter:
TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation. 754-762 - Zequn Qin, Pengyi Zhang, Fei Wu, Xi Li:
FcaNet: Frequency Channel Attention Networks. 763-772 - Md. Amirul Islam, Matthew Kowal, Sen Jia
, Konstantinos G. Derpanis, Neil D. B. Bruce:
Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs. 773-781 - Longwen Zhang, Qixuan Zhang, Minye Wu, Jingyi Yu, Lan Xu
:
Neural Video Portrait Relighting in Real-time via Consistency Modeling. 782-792 - Shu Kong, Deva Ramanan
:
OpenGAN: Open-Set Recognition via Open Data Generation. 793-802 - Alexandre Ramé, Rémy Sun, Matthieu Cord:
MixMo: Mixing Multiple Inputs for Multiple Outputs via Deep Subnetworks. 803-813 - Zijian Wang
, Yadan Luo
, Ruihong Qiu
, Zi Huang
, Mahsa Baktashmotlagh
:
Learning to Diversify for Single Domain Generalization. 814-823 - Hongjoon Ahn, Jihwan Kwak, Subin Lim, Hyeonsu Bang, Hyojun Kim, Taesup Moon:
SS-IL: Separated Softmax for Incremental Learning. 824-833 - Zihui Xue, Sucheng Ren, Zhengqi Gao, Hang Zhao:
Multimodal Knowledge Expansion. 834-843 - Shihua Huang, Zhichao Lu
, Ran Cheng
, Cheng He
:
FaPN: Feature-aligned Pyramid Network for Dense Image Prediction. 844-853 - Hugo Touvron, Alexandre Sablayrolles, Matthijs Douze, Matthieu Cord, Hervé Jégou:
Grafit: Learning fine-grained image representations with coarse labels. 854-864 - Guohao Peng, Jun Zhang, Heshan Li, Danwei Wang:
Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition. 865-874 - Jiaqi Wang, Huafeng Liu, Xinyue Wang
, Liping Jing:
Interpretable Image Recognition by Constructing Transparent Embedding Space. 875-884 - Adria Ruiz, Antonio Agudo
, Francesc Moreno-Noguer:
Generating Attribution Maps with Disentangled Masked Backpropagation. 885-894 - Tiange Xiang
, Chaoyi Zhang, Yang Song, Jianhui Yu, Weidong Cai
:
Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis. 895-904 - Byeong-Ju Han, Kuhyeun Ko, Jae-Young Sim:
End-to-End Trainable Trident Person Search Network Using Adaptive Gradient Propagation. 905-913 - Yijin Li, Han Zhou, Bangbang Yang, Ye Zhang, Zhaopeng Cui, Hujun Bao, Guofeng Zhang:
Graph-based Asynchronous Event Processing for Rapid Object Recognition. 914-923 - Rujiao Long, Wen Wang, Nan Xue, Feiyu Gao, Zhibo Yang, Yongpan Wang, Gui-Song Xia
:
Parsing Table Structures in the Wild. 924-932 - Yonggang Qi, Guoyao Su, Pinaki Nath Chowdhury, Mingkang Li, Yi-Zhe Song
:
SketchLattice: Latticed Representation for Sketch Manipulation. 933-941 - Jian Jia, Xiaotang Chen, Kaiqi Huang:
Spatial and Semantic Consistency Regularizations for Pedestrian Attribute Recognition. 942-951 - Meiqi Guo, Rebecca Hwa
, Adriana Kovashka:
Detecting Persuasive Atypicality by Modeling Contextual Compatibility. 952-962 - Ayan Kumar Bhunia, Aneeshan Sain
, Pinaki Nath Chowdhury, Yi-Zhe Song
:
Text is Text, No Matter What: Unifying Text Recognition using Knowledge Distillation. 963-972 - Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha:
DocFormer: End-to-End Transformer for Document Understanding. 973-983 - Kamal Gupta, Justin Lazarow, Alessandro Achille, Larry Davis, Vijay Mahadevan, Abhinav Shrivastava:
LayoutTransformer: Layout Generation and Completion with Self-attention. 984-994 - Samarth Mishra, Zhongping Zhang, Yuan Shen
, Ranjitha Kumar, Venkatesh Saligrama
, Bryan A. Plummer:
Effectively Leveraging Attributes for Visual Similarity. 995-1004 - Yongming Rao, Guangyi Chen
, Jiwen Lu, Jie Zhou:
Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification. 1005-1014 - Sunghun Joung, Seungryong Kim, Minsu Kim, Ig-Jae Kim, Kwanghoon Sohn:
Learning Canonical 3D Object Representation for Fine-Grained Recognition. 1015-1025 - Liangzhi Li
, Bowen Wang
, Manisha Verma, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara:
SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition. 1026-1035 - Pau Rodríguez
, Massimo Caccia, Alexandre Lacoste, Lee Zamparo, Issam H. Laradji, Laurent Charlin, David Vázquez:
Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations. 1036-1045 - Wei-Lin Hsiao, Kristen Grauman:
From Culture to Clothing: Discovering the World Events Behind A Century of Fashion Images. 1046-1055 - Wataru Shimoda, Daichi Haraguchi, Seiichi Uchida, Kota Yamaguchi
:
De-rendering Stylized Texts. 1056-1065 - Ankan Kumar Bhunia, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Mubarak Shah
:
Handwriting Transformers. 1066-1074 - Xin Wang, Shuyun Lin, Hao Zhang, Yufei Zhu, Quanshi Zhang:
Interpreting Attributions and Interactions of Adversarial Attacks. 1075-1084 - Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang Anh Pham, Bhiksha Raj, Ngan Le
, Khoa Luu:
The Right to Talk: An Audio-Visual Transformer Approach. 1085-1094 - Yue Song, Nicu Sebe
, Wei Wang:
Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling? 1095-1103 - Guile Wu, Shaogang Gong, Pan Li:
Striking a Balance between Stability and Plasticity for Class-Incremental Learning. 1104-1113 - Devin Guillory, Vaishaal Shankar, Sayna Ebrahimi, Trevor Darrell, Ludwig Schmidt:
Predicting with Confidence on Unseen Distributions. 1114-1124 - Canyi Lu:
Transforms based Tensor Robust PCA: Corrupted Low-Rank Tensors Recovery via Convex Optimization. 1125-1132 - Keke Tang
, Dingruibo Miao, Weilong Peng
, Jianpeng Wu, Yawen Shi, Zhaoquan Gu, Zhihong Tian, Wenping Wang:
CODEs: Chamfer Out-of-Distribution Examples against Overconfidence Issue. 1133-1142 - Song Xue, Runqi Wang, Baochang Zhang, Tian Wang, Guodong Guo, David S. Doermann:
IDARTS: Interactive Differentiable Architecture Search. 1143-1152 - Alexander Richard, Michael Zollhöfer, Yandong Wen, Fernando De la Torre, Yaser Sheikh:
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement. 1153-1162 - Senthil Purushwalkam, Sebastia Vicenc Amengual Gari, Vamsi Krishna Ithapu, Carl Schissler, Philip W. Robinson, Abhinav Gupta, Kristen Grauman:
Audio-Visual Floorplan Reconstruction. 1163-1172 - Okan Köpüklü, Maja Taseska, Gerhard Rigoll:
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild. 1173-1183 - Moitreya Chatterjee, Jonathan Le Roux, Narendra Ahuja, Anoop Cherian:
Visual Scene Graphs for Audio Source Separation. 1184-1193 - Divya Shanmugam, Davis W. Blalock, Guha Balakrishnan
, John V. Guttag:
Better Aggregation in Test-Time Augmentation. 1194-1203 - Samuel Lerman, Charles Venuto, Henry A. Kautz, Chenliang Xu:
Explaining Local, Global, And Higher-Order Interactions In Deep Learning. 1204-1213 - Hana Chockler, Daniel Kroening, Youcheng Sun
:
Explanations for Occluded Images. 1214-1223 - Maxime Kayser, Oana-Maria Camburu, Leonard Salewski, Cornelius Emde, Virginie Do, Zeynep Akata, Thomas Lukasiewicz:
e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks. 1224-1234 - Adrià Recasens, Pauline Luc, Jean-Baptiste Alayrac, Luyu Wang, Florian Strub, Corentin Tallec, Mateusz Malinowski, Viorica Patraucean, Florent Altché, Michal Valko, Jean-Bastien Grill, Aäron van den Oord, Andrew Zisserman:
Broaden Your Views for Self-Supervised Video Learning. 1235-1245 - Xiaowei Liao
, Yong Xu, Haibin Ling:
Hypergraph Neural Networks for Hypergraph Matching. 1246-1255 - Pavlo Melnyk
, Michael Felsberg, Mårten Wadenbäck
:
Embed Me If You Can: A Geometric Perceptron. 1256-1264 - Ahyun Seo, Woohyeon Shim, Minsu Cho:
Learning to Discover Reflection Symmetry via Polar Matching Convolution. 1265-1274 - Wenyuan Xue, Baosheng Yu, Wen Wang, Dacheng Tao, Qingyong Li:
TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition. 1275-1284 - Shi-Xue Zhang
, Xiaobin Zhu, Chun Yang, Hongfa Wang, Xu-Cheng Yin:
Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection. 1285-1294 - Maruthi Narayanan, Vickram Rajendran, Benjamin B. Kimia:
Shape-Biased Domain Generalization via Shock Graph Embeddings. 1295-1305 - Chaofei Wang, Jiayu Xiao, Yizeng Han, Qisen Yang, Shiji Song, Gao Huang:
Towards Learning Spatially Discriminative Feature Representations. 1306-1315 - Hyungsik Jung, Youngrock Oh:
Towards Better Explanations of Class Activation Mapping. 1316-1324 - Peter Cho-Ho Lam, Lingyang Chu, Maxim Torgonskiy, Jian Pei
, Yong Zhang
, Lanjun Wang
:
Finding Representative Interpretations on Convolutional Neural Networks. 1325-1334 - Kwang Hee Lee, Chaewon Park, Junghyun Oh, Nojun Kwak:
LFI-CAM: Learning Feature Importance for Better Visual Explanation. 1335-1343 - Cristina González, Nicolás Ayobi
, Isabela Hernández, José Hernández, Jordi Pont-Tuset, Pablo Arbeláez
:
Panoptic Narrative Grounding. 1344-1353 - Claire Yuqing Cui, Apoorv Khandelwal, Yoav Artzi, Noah Snavely, Hadar Averbuch-Elor:
Who's Waldo? Linking People Across Text and Images. 1354-1364 - Yixin Chen, Qing Li, Deqian Kong, Yik Lun Kei, Song-Chun Zhu, Tao Gao, Yixin Zhu, Siyuan Huang:
YouRefIt: Embodied Reference Understanding with Language and Gesture. 1365-1375 - Anindita Ghosh
, Noshaba Cheema, Cennet Oguz, Christian Theobalt
, Philipp Slusallek:
Synthesis of Compositional Animations from Textual Descriptions. 1376-1386 - Kien Nguyen, Subarna Tripathi, Bang Du, Tanaya Guha, Truong Q. Nguyen:
In Defense of Scene Graphs for Image Captioning. 1387-1396 - Damien Teney, Ehsan Abbasnejad, Anton van den Hengel:
Unshuffling Data for Improved Generalization in Visual Question Answering. 1397-1407 - Zhiyuan Fang, Jianfeng Wang, Xiaowei Hu, Lijuan Wang, Yezhou Yang, Zicheng Liu:
Compressing Visual-linguistic Model via Knowledge Distillation. 1408-1418 - Ronghang Hu, Amanpreet Singh:
UniT: Multimodal Multitask Learning with a Unified Transformer. 1419-1429 - Mohammadreza Zolfaghari, Yi Zhu, Peter V. Gehler, Thomas Brox:
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations. 1430-1439 - Mariella Dimiccoli, Lluís Garrido
, Guillem Rodríguez Corominas
, Herwig Wendt:
Graph Constrained Data Representation Learning for Human Motion Segmentation. 1440-1449 - Jinwoo Nam, Daechul Ahn, Dongyeop Kang
, Seong Jong Ha, Jonghyun Choi
:
Zero-shot Natural Language Video Localization. 1450-1459 - Dave Epstein, Jiajun Wu, Cordelia Schmid, Chen Sun:
Learning Temporal Dynamics from Cycles in Narrated Video. 1460-1469 - Tianyu He
, Xin Jin, Xu Shen, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua:
Dense Interaction Learning for Video-based Person Re-identification. 1470-1481 - Ali Diba, Vivek Sharma, Reza Safdari, Dariush Lotfi, M. Saquib Sarfraz, Rainer Stiefelhagen, Luc Van Gool:
Vi2CLR: Video and Image for Visual Contrastive Learning of Representation. 1482-1492 - Yuan Zhi, Zhan Tong, Limin Wang, Gangshan Wu:
MGSampler: An Explainable Sampling Strategy for Video Action Recognition. 1493-1502 - Junyu Gao, Changsheng Xu:
Fast Video Moment Retrieval. 1503-1512 - Rui Su, Qian Yu, Dong Xu:
STVGBert: A Visual-linguistic Transformer based Framework for Spatio-temporal Video Grounding. 1513-1522 - Shaoxiang Chen, Yu-Gang Jiang:
Motion Guided Region Message Passing for Video Captioning. 1523-1532 - Miao Zhang, Jie Liu, Yifei Wang, Yongri Piao, Shunyu Yao, Wei Ji
, Jingjing Li, Huchuan Lu, Zhongxuan Luo:
Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection. 1533-1543 - Shu Yang, Lu Zhang, Jinqing Qi, Huchuan Lu, Shuo Wang, Xiaoxing Zhang:
Learning Motion-Appearance Co-Attention for Zero-Shot Video Object Segmentation. 1544-1553 - Corentin Dancette, Rémi Cadène, Damien Teney, Matthieu Cord:
Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering. 1554-1563 - Xinzhe Han
, Shuhui Wang, Chi Su, Qingming Huang, Qi Tian:
Greedy Gradient Ensemble for Robust Visual Question Answering. 1564-1573 - Yi Zhu, Yue Weng, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Yutong Lu, Jianbin Jiao:
Self-Motivated Communication Agent for Real-World Vision-Dialog Navigation. 1574-1583 - Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal:
Contrast and Classify: Training Robust VQA Models. 1584-1593 - Qingxing Cao, Wentao Wan, Keze Wang, Xiaodan Liang, Liang Lin:
Linguistically Routing Capsule Network for Out-of-distribution Visual Question Answering. 1594-1603 - Yushuang Wu, Zizheng Yan, Xiaoguang Han, Guanbin Li, Changqing Zou, Shuguang Cui
:
LapsCore: Language-guided Person Search via Color Reasoning. 1604-1613 - Pierre-Louis Guhur, Makarand Tapaswi, Shizhe Chen, Ivan Laptev, Cordelia Schmid:
Airbert: In-domain Pretraining for Vision-and-Language Navigation. 1614-1623 - Chong Liu, Fengda Zhu, Xiaojun Chang
, Xiaodan Liang, Zongyuan Ge, Yi-Dong Shen:
Vision-Language Navigation with Random Environmental Mixup. 1624-1634 - Yuankai Qi
, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu:
The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. 1635-1644 - Yining Hong, Qing Li, Song-Chun Zhu, Siyuan Huang:
VLGrammar: Grounded Grammar Induction of Vision and Language. 1645-1654 - Difei Gao, Ruiping Wang, Ziyi Bai
, Xilin Chen:
Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments. 1655-1665 - Antoine Yang, Antoine Miech, Josef Sivic, Ivan Laptev, Cordelia Schmid:
Just Ask: Learning to Answer Questions from Millions of Narrated Videos. 1666-1677 - Fei Liu, Jing Liu, Weining Wang, Hanqing Lu:
HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering. 1678-1687 - Nayoung Kim, Seong Jong Ha, Je-Won Kang:
Video Question Answering Using Language-Guided Deep Compressed-Domain Video Feature. 1688-1697 - Yassir Saquil, Da Chen
, Yuan He, Chuan Li, Yong-Liang Yang:
Multiple Pairwise Ranking Networks for Personalized Video Summarization. 1698-1707 - Max Bain, Arsha Nagrani, Gül Varol, Andrew Zisserman:
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval. 1708-1718 - Huaijia Lin, Ruizheng Wu, Shu Liu, Jiangbo Lu
, Jiaya Jia
:
Video Instance Segmentation with a Propose-Reduce Paradigm. 1719-1728 - Kai-En Lin, Lei Xiao, Feng Liu, Guowei Yang, Ravi Ramamoorthi:
Deep 3D Mask Volume for View Synthesis of Dynamic Scenes. 1729-1738 - Dev Yashpal Sheth, Sreyas Mohan, Joshua L. Vincent, Ramon Manzorro
, Peter A. Crozier, Mitesh M. Khapra, Eero P. Simoncelli, Carlos Fernandez-Granda:
Unsupervised Deep Video Denoising. 1739-1748 - Jiajun Deng, Zhengyuan Yang, Tianlang Chen, Wengang Zhou, Houqiang Li:
TransVG: End-to-End Visual Grounding with Transformers. 1749-1759 - Aishwarya Kamath, Mannat Singh, Yann LeCun, Gabriel Synnaeve, Ishan Misra, Nicolas Carion:
MDETR - Modulated Detection for End-to-End Multi-Modal Understanding. 1760-1770 - Zhihao Yuan, Xu Yan, Yinghong Liao, Ruimao Zhang
, Sheng Wang, Zhen Li, Shuguang Cui
:
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring. 1771-1780 - Assaf Arbelle, Sivan Doveh, Amit Alfassy, Joseph Shtok, Guy Lev, Eli Schwartz, Hilde Kuehne
, Hila Barak Levi, Prasanna Sattigeri, Rameswar Panda, Chun-Fu Chen, Alex M. Bronstein, Kate Saenko
, Shimon Ullman, Raja Giryes, Rogério Feris, Leonid Karlinsky:
Detector-Free Weakly Supervised Grounding by Separation. 1781-1792 - Yun Wang, Tong Zhang, Xueya Zhang, Zhen Cui, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang:
Wasserstein Coupled Graph Learning for Cross-Modal Retrieval. 1793-1802 - Yiwu Zhong, Jing Shi, Jianwei Yang, Chenliang Xu, Yin Li:
Learning to Generate Scene Graph from Natural Language Supervision. 1803-1814 - Guanyu Cai, Jun Zhang, Xinyang Jiang, Yifei Gong, Lianghua He, Fufu Yu, Pai Peng, Xiaowei Guo, Feiyue Huang, Xing Sun
:
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query. 1815-1824 - Shuang Li, Yilun Du, Antonio Torralba, Josef Sivic, Bryan C. Russell:
Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions. 1825-1835 - Zhengyuan Yang, Songyang Zhang
, Liwei Wang, Jiebo Luo
:
SAT: 2D Semantics Assisted Training for 3D Visual Grounding. 1836-1846 - Juncheng Li, Siliang Tang, Linchao Zhu
, Haochen Shi
, Xuanwen Huang, Fei Wu, Yi Yang, Yueting Zhuang:
Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference. 1847-1857 - Zhonghao Wang, Kai Wang, Mo Yu, Jinjun Xiong
, Wen-Mei Hwu, Mark Hasegawa-Johnson, Humphrey Shi
:
Interpretable Visual Reasoning via Induced Symbolic Space. 1858-1867 - Kunal Pratap Singh, Suvaansh Bhambri, Byeonghwi Kim, Roozbeh Mottaghi, Jonghyun Choi
:
Factorizing Perception and Policy for Interactive Instruction Following. 1868-1877 - Shoya Matsumori, Kosuke Shingyouchi, Yuki Abe, Yosuke Fukuchi
, Komei Sugiura, Michita Imai:
Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue. 1878-1887 - Pratyay Banerjee, Tejas Gokhale, Yezhou Yang, Chitta Baral:
Weakly Supervised Relative Spatial Reasoning for Visual Question Answering. 1888-1898 - Ben Saunders
, Necati Cihan Camgöz, Richard Bowden
:
Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives. 1899-1909 - Kranthi Kumar Rachavarapu, Aakanksha, Vignesh Sundaresha, A. N. Rajagopalan:
Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization. 1910-1919 - Shijie Li, Yanying Zhou, Jinhui Yi, Juergen Gall:
Spatial-Temporal Consistency Network for Low-Latency Trajectory Forecasting. 1920-1929 - Zhen Zhong, Guobao Xiao, Linxin Zheng, Yan Lu, Jiayi Ma:
T-Net: Effective Permutation-Equivariant Network for Two-View Correspondence Learning. 1930-1939 - Guangming Zang
, Ramzi Idoughi, Rui Li
, Peter Wonka, Wolfgang Heidrich:
IntraTomo: Self-supervised Learning-based Tomography via Sinogram Synthesis and Prediction. 1940-1950 - Yue Qiu, Shintaro Yamamoto, Kodai Nakashima, Ryota Suzuki, Kenji Iwata, Hirokatsu Kataoka, Yutaka Satoh:
Describing and Localizing Multiple Changes with Transformers. 1951-1960 - Mahmoud Afifi, Jonathan T. Barron, Chloe LeGendre, Yun-Ta Tsai, Francois Bleibel:
Cross-Camera Convolutional Color Constancy. 1961-1970 - Ka Leong Cheng, Yueqi Xie
, Qifeng Chen:
IICNet: A Generic Framework for Reversible Image Conversion. 1971-1980 - Tengfei Wang
, Jiaxin Xie, Wenxiu Sun, Qiong Yan, Qifeng Chen:
Dual-Camera Super-Resolution with Aligned Attention Modules. 1981-1990 - Xiaoyu Li, Bo Zhang, Jing Liao
, Pedro V. Sander:
Let's See Clearly: Contaminant Artifact Removal for Moving Cameras. 1991-2000 - Junwen Chen, Yu Kong Golisano:
Explainable Video Entailment with Grounded Visual Evidence. 2001-2010 - Heeseung Yun, Youngjae Yu, Wonsuk Yang, Kangil Lee, Gunhee Kim:
Pano-AVQA: Grounded Audio-Visual Question Answering on 360° Videos. 2011-2021 - Linjie Li, Jie Lei, Zhe Gan, Jingjing Liu:
Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models. 2022-2031 - Hareesh Ravi, Kushal Kafle, Scott Cohen, Jonathan Brandt, Mubbasir Kapadia:
AESOP: Abstract Encoding of Stories, Objects, and Pictures. 2032-2043 - Deniz Engin, François Schnitzler, Ngoc Q. K. Duong, Yannis Avrithis:
On the hidden treasure of dialog in video question answering. 2044-2053 - Yiyi Zhou, Tianhe Ren, Chaoyang Zhu, Xiaoshuai Sun, Jianzhuang Liu, Xinghao Ding, Mingliang Xu, Rongrong Ji:
TRAR: Routing the Attention Spans in Transformer for Visual Question Answering. 2054-2064 - Or Patashnik, Zongze Wu, Eli Shechtman, Daniel Cohen-Or, Dani Lischinski:
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery. 2065-2074 - Hoeseong Kim, Jongseok Kim, Hyungseok Lee, Hyunsung Park, Gunhee Kim:
Viewpoint-Agnostic Change Captioning with Cycle Consistency. 2075-2084 - Rui Li, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Dong Liu, Tao Mei
:
Motion-Focused Contrastive Learning of Video Representations*. 2085-2094 - Wentao Jiang, Ning Xu, Jiayun Wang, Chen Gao, Jing Shi, Zhe Lin, Si Liu:
Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism. 2095-2104 - Zheyuan Liu, Cristian Rodriguez Opazo, Damien Teney, Stephen Gould:
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models. 2105-2114 - Lin Wang
, Yujeong Chae, Kuk-Jin Yoon:
Dual Transfer Learning for Event-based End-task Prediction via Pluggable Event to Image Translation. 2115-2125 - Junho Kim, Jaehyeok Bae, Gangin Park, Dongsu Zhang, Young Min Kim:
N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras. 2126-2136 - Gregory Vaksman, Michael Elad, Peyman Milanfar:
Patch Craft: Video Denoising by Deep Modeling and Patch Matching. 2137-2146 - Zhijian Liu, Simon Stent, Jie Li, John Gideon, Song Han:
LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision. 2147-2156 - Chengxiang Yin, Kun Wu, Zhengping Che, Bo Jiang, Zhiyuan Xu, Jian Tang:
Hierarchical Graph Attention Network for Few-shot Visual-Semantic Learning. 2157-2166 - Jiahe Shi, Yali Li, Shengjin Wang:
Partial Off-policy Learning: Balance Accuracy and Diversity for Human-Oriented Image Captioning. 2167-2176 - Xu Yang, Chongyang Gao, Hanwang Zhang, Jianfei Cai:
Auto-Parsing Network for Image Captioning and Visual Question Answering. 2177-2187 - Keyu Wen, Jin Xia, Yuanyuan Huang, Linyang Li, Jiayan Xu, Jie Shao:
COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation. 2188-2197 - Chao Li, Shangqian Gao, Cheng Deng, Wei Liu, Heng Huang:
Adversarial Attack on Deep Cross-Modal Hamming Retrieval. 2198-2207 - Shumian Xin, Neal Wadhwa, Tianfan Xue, Jonathan T. Barron, Pratul P. Srinivasan, Jiawen Chen, Ioannis Gkioulekas
, Rahul Garg:
Defocus Map Estimation and Deblurring from a Single Dual-Pixel Image. 2208-2218 - Yicheng Wu, Qiurui He, Tianfan Xue, Rahul Garg, Jiawen Chen, Ashok Veeraraghavan, Jonathan T. Barron:
How to Train Neural Networks for Flare Removal. 2219-2227 - Tao Zhang, Ying Fu, Cheng Li:
Hyperspectral Image Denoising with Realistic Data. 2228-2237 - Albert W. Reed, Hyojin Kim, Rushil Anirudh, K. Aditya Mohan, Kyle Champley, Jingu Kang, Suren Jayasuriya:
Dynamic CT Reconstruction from Limited Views with Implicit Neural Representations and Parametric Motion Fields. 2238-2248 - Bing Li, Chia-Wen Lin, Cheng Zheng, Shan Liu, Junsong Yuan
, Bernard Ghanem
, C.-C. Jay Kuo:
High Quality Disparity Remapping with Two-Stage Warping. 2249-2258 - Zhiyu Zhu
, Hui Liu, Junhui Hou
, Huanqiang Zeng, Qingfu Zhang:
Semantic-embedded Unsupervised Spectral Reconstruction from Single RGB Images in the Wild. 2259-2268 - Abdullah Abuolaim, Mauricio Delbracio, Damien Kelly, Michael S. Brown, Peyman Milanfar:
Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data. 2269-2278 - Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang:
Hybrid Neural Fusion for Full-frame Video Stabilization. 2279-2288 - Kuldeep Purohit, Maitreya Suin, A. N. Rajagopalan, Vishnu Naresh Boddeti:
Spatially-Adaptive Image Restoration using Distortion-Guided Networks. 2289-2299 - Daksh Thapar, Aditya Nigam
, Chetan Arora:
Anonymizing Egocentric Videos. 2300-2309 - Prafull Sharma, Miika Aittala, Yoav Y. Schechner, Antonio Torralba, Gregory W. Wornell
, William T. Freeman, Frédo Durand:
What You Can Learn by Staring at a Blank Wall. 2310-2319 - Aviad Levis, Daeyoung Lee
, Joel A. Tropp, Charles F. Gammie, Katherine L. Bouman:
Inference of Black Hole Fluid-Dynamics from Sparse Interferometric Measurements. 2320-2329 - Geonwoon Jang, Wooseok Lee, Sanghyun Son, Kyoung Mu Lee:
C2N: Practical Generative Noise Modeling for Real-World Denoising. 2330-2339 - Dario Fuoli, Luc Van Gool, Radu Timofte
:
Fourier Space Losses for Efficient Perceptual Image Super-Resolution. 2340-2349 - Bruno Lecouat, Jean Ponce, Julien Mairal:
Lucas-Kanade Reloaded: End-to-End Super-Resolution from Raw Image Bursts. 2350-2359 - Myungseo Song, Jinyoung Choi, Bohyung Han:
Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform. 2360-2369 - B. H. Pawan Prasad, Green Rosh K. S, R. B. Lokesh, Kaushik Mitra
, Sanjoy Chowdhury:
V-DESIRR: Very Fast Deep Embedded Single Image Reflection Removal. 2370-2379 - Lin Zhu
, Jianing Li, Xiao Wang
, Tiejun Huang, Yonghong Tian:
NeuSpike-Net: High Speed Video Reconstruction via Bio-inspired Neuromorphic Cameras. 2380-2389 - Dongyoung Kim
, Jinwoo Kim, Seonghyeon Nam, Dongwoo Lee, Yeonkyung Lee, Nahyup Kang, Hyong-Euk Lee, ByungIn Yoo, Jae-Joon Han, Seon Joo Kim:
Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm under Mixed Illumination. 2390-2399 - Soumyadip Sengupta, Brian Curless, Ira Kemelmacher-Shlizerman, Steven M. Seitz:
A Light Stage on Every Desk. 2400-2409 - Zhihao Xia, Jason Lawrence, Supreeth Achar:
A Dark Flash Normal Camera. 2410-2419 - Julio Marco, Adrián Jarabo
, Ji Hyun Nam, Xiaochun Liu, Miguel Ángel Cosculluela, Andreas Velten, Diego Gutierrez:
Virtual light transport matrices for non-line-of-sight imaging. 2420-2429 - Mantang Guo
, Jing Jin
, Hui Liu, Junhui Hou
:
Learning Dynamic Interpolation for Extremely Sparse Light Fields with Wide Baselines. 2430-2439 - Goutam Bhat, Martin Danelljan, Fisher Yu, Luc Van Gool, Radu Timofte
:
Deep Reparametrization of Multi-Frame Super-Resolution and Denoising. 2440-2450 - Tao Wang, Yong Li, Jingyang Peng, Yipeng Ma, Xian Wang, Fenglong Song, Youliang Yan:
Real-time Image Enhancer via Learnable Spatial-aware 3D Lookup Tables. 2451-2460 - Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan:
Distillation-guided Image Inpainting. 2461-2470 - Prasan A. Shedligeri, Florian Schiffers, Sushobhan Ghosh, Oliver Cossairt, Kaushik Mitra
:
SeLFVi: Self-supervised Light-Field Video Reconstruction from Stereo Video. 2471-2481 - Guanying Chen
, Chaofeng Chen, Shi Guo, Zhetong Liang, Kwan-Yee K. Wong
, Lei Zhang:
HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world Benchmark Dataset. 2482-2491 - Bhavya Goyal, Mohit Gupta:
Photon-Starved Scene Inference using Single Photon Cameras. 2492-2501 - Nianyi Li, Simron Thapa, Cameron Whyte, Albert W. Reed, Suren Jayasuriya, Jinwei Ye:
Unsupervised Non-Rigid Image Distortion Removal via Grid Deformation. 2502-2512 - Jing Zhao, Jiyu Xie, Ruiqin Xiong, Jian Zhang, Zhaofei Yu, Tiejun Huang:
Super Resolve Dynamic Scene from Continuous Spike Streams. 2513-2522 - Yinxiao Li, Pengchong Jin, Feng Yang, Ce Liu, Ming-Hsuan Yang, Peyman Milanfar:
COMISR: Compression-Informed Video Super-Resolution. 2523-2532 - Ziteng Cui, Guo-Jun Qi, Lin Gu, Shaodi You, Zenghui Zhang
, Tatsuya Harada:
Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection. 2533-2542 - Wenming Weng, Yueyi Zhang, Zhiwei Xiong:
Event-based Video Reconstruction Using Transformer. 2543-2552 - Carlos Hinojosa, Juan Carlos Niebles, Henry Arguello:
Learning Privacy-preserving Optics for Human Pose Estimation. 2553-2562 - Fang Xu, Lei Yu, Bishan Wang, Wen Yang, Gui-Song Xia
, Xu Jia, Zhendong Qiao, Jianzhuang Liu:
Motion Deblurring with Real Events. 2563-2572 - Tristan Swedish, Connor Henley
, Ramesh Raskar:
Objects as Cameras: Estimating High-Frequency Illumination from Shadows. 2573-2582 - Yucheng Zheng, Yi Hua, Aswin C. Sankaranarayanan, M. Salman Asif:
A Simple Framework for 3D Lensless Imaging with Programmable Masks. 2583-2592 - Xiu Li, Jinli Suo, Weihang Zhang
, Xin Yuan, Qionghai Dai:
Universal and Flexible Optical Aberration Correction Using Deep-Prior Based Deconvolution. 2593-2601 - Ziyi Meng, Zhenming Yu, Kun Xu, Xin Yuan:
Self-supervised Neural Networks for Spectral Snapshot Compressive Imaging. 2602-2611 - Shiqi Chen, Huajun Feng, Keming Gao, Zhihai Xu, Yueting Chen:
Extreme-Quality Computational Imaging via Degradation Framework. 2612-2621 - Hyeongseok Son
, Junyong Lee
, Sunghyun Cho, Seungyong Lee:
Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions. 2622-2630 - Seung-Hwan Baek
, Hayato Ikoma, Daniel S. Jeon, Yuqi Li, Wolfgang Heidrich, Gordon Wetzstein, Min H. Kim:
Single-shot Hyperspectral-Depth Imaging with Learned Diffractive Optics. 2631-2640 - Wei Fang, Zhaofei Yu, Yanqi Chen, Timothée Masquelier
, Tiejun Huang, Yonghong Tian:
Incorporating Learnable Membrane Time Constant to Enhance Learning of Spiking Neural Networks. 2641-2651 - Yuqi Li, Qiang Fu
, Wolfgang Heidrich
:
Multispectral illumination estimation using deep unrolling network. 2652-2661 - Bintao He
, Fa Zhang, Huanshui Zhang, Renmin Han
:
A Hybrid Frequency-Spatial Domain Model for Sparse Image Reconstruction in Scanning Transmission Electron Microscopy. 2662-2671 - Edwin Vargas, Julien N. P. Martel, Gordon Wetzstein, Henry Arguello:
Time-Multiplexed Coded Aperture Imaging: Learned Coded Aperture and Pixel Exposures for Compressive Imaging Systems. 2672-2682 - Chaoqi Chen, Jiongcheng Li, Zebiao Zheng, Yue Huang, Xinghao Ding, Yizhou Yu:
Dual Bipartite Graph Learning: A General Approach for Domain Adaptive Object Detection. 2683-2692 - Zhikang Zou, Xiaoqing Ye, Liang Du, Xianhui Cheng, Xiao Tan, Li Zhang, Jianfeng Feng, Xiangyang Xue, Errui Ding:
The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection. 2693-2702 - Jiageng Mao
, Minzhe Niu, Haoyue Bai, Xiaodan Liang, Hang Xu, Chunjing Xu:
Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection. 2703-2712 - Yoli Shavit, Ron Ferens, Yosi Keller:
Learning Multi-Scene Absolute Pose Regression with Transformers. 2713-2722 - Hualian Sheng
, Sijia Cai, Yuan Liu, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Min-Jian Zhao:
Improving 3D Object Detection with Channel-wise Transformer. 2723-2732 - Siming Yan, Zhenpei Yang, Chongyang Ma, Haibin Huang, Etienne Vouga, Qixing Huang:
HPNet: Deep Primitive Segmentation Using Hybrid Representations. 2733-2742 - Gangming Zhao, Weifeng Ge, Yizhou Yu:
GraphFPN: Graph Feature Pyramid Network for Object Detection. 2743-2752 - Kai Chen
, Qi Dou:
SGPA: Structure-Guided Prior Adaptation for Category-Level 6D Object Pose Estimation. 2753-2762 - Zhihao Liang, Zhihao Li, Songcen Xu, Mingkui Tan, Kui Jia:
Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks. 2763-2772 - Guangyuan Zhou, Huiqun Wang, Jiaxin Chen, Di Huang:
PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D Pose Estimation. 2773-2782 - Minsong Ki, Youngjung Uh
, Junsuk Choe
, Hyeran Byun:
Contrastive Attention Maps for Self-supervised Co-localization. 2783-2792 - Andreas Panteli, Jonas Teuwen, Hugo M. Horlings, Efstratios Gavves:
Sparse-shot Learning with Exclusive Cross-Entropy for Extremely Many Localisations. 2793-2803 - David Biertimpel, Sindi Shkodrani, Anil S. Baslamisli, Nóra Baka:
Prior to Segment: Foreground Cues for Weakly Annotated Classes in Partially Supervised Instance Segmentation. 2804-2813 - Xiaoyu Zhu, Jeffrey Chen, Xiangrui Zeng, Junwei Liang
, Chengqi Li, Sinuo Liu, Sima Behpour, Min Xu:
Weakly Supervised 3D Semantic Segmentation Using Cross-Image Consensus and Inter-Voxel Affinity Relations. 2814-2824 - Haosen Liu
, Xuan Liu, Jiangbo Lu
, Shan Tan
:
Self-Supervised Image Prior Learning with GMM from a Single Noisy Image. 2825-2834 - Isinsu Katircioglu
, Helge Rhodin, Jörg Spörri, Mathieu Salzmann, Pascal Fua:
Human Detection and Segmentation via Multi-view Consensus. 2835-2844 - Vignesh Ramanathan, Rui Wang, Dhruv Mahajan:
PreDet: Large-scale weakly supervised pre-training for detection. 2845-2855 - Bowen Dong
, Zitong Huang, Yuelin Guo
, Qilong Wang
, Zhenxing Niu, Wangmeng Zuo:
Boosting Weakly Supervised Object Detection via Learning Bounding Box Adjusters. 2856-2865 - Wei Gao, Fang Wan, Xingjia Pan, Zhiliang Peng, Qi Tian, Zhenjun Han, Bolei Zhou, Qixiang Ye:
TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization. 2866-2875 - Jiannan Guo, Haochen Shi
, Yangyang Kang, Kun Kuang, Siliang Tang, Zhuoren Jiang, Changlong Sun, Fei Wu, Yueting Zhuang:
Semi-supervised Active Learning for Semi-supervised Models: Exploit Adversarial Examples with Graph-based Virtual Labels. 2876-2885 - Ishan Misra, Rohit Girdhar, Armand Joulin:
An End-to-End Transformer Model for 3D Object Detection. 2886-2897 - Lue Fan
, Xuan Xiong, Feng Wang, Naiyan Wang, Zhaoxiang Zhang:
RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection. 2898-2907 - Lichen Zhao, Daigang Cai, Lu Sheng
, Dong Xu:
3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds. 2908-2917 - Frank D. Julca-Aguilar, Jason Taylor, Mario Bijelic, Fahim Mannan, Ethan Tseng, Felix Heide:
Gated3D: Monocular 3D Object Detection From Temporal Illumination Cues. 2918-2928 - Ze Liu, Zheng Zhang, Yue Cao, Han Hu, Xin Tong
:
Group-Free 3D Object Detection via Transformers. 2929-2938 - Junfeng Wan, Jiangfan Deng, Xiaosong Qiu, Feng Zhou:
Body-Face Joint Detection via Embedding and Head Hook. 2939-2948 - Haotian Zhang, Yicheng Luo, Fangbo Qin, Yijia He, Xiao Liu:
ELSD: Efficient Line Segment Detector and Descriptor. 2949-2958 - Fanfan Liu, Haoran Wei, Wenzhe Zhao, Guozhen Li, Jingquan Peng, Zihao Li:
WB-DETR: Transformer-Based Detector without Backbone. 2959-2967 - Xiyang Dai, Yinpeng Chen, Jianwei Yang, Pengchuan Zhang, Lu Yuan, Lei Zhang:
Dynamic DETR: End-to-End Object Detection with Dynamic Attention. 2968-2977 - Pengchuan Zhang, Xiyang Dai, Jianwei Yang, Bin Xiao, Lu Yuan, Lei Zhang, Jianfeng Gao:
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding. 2978-2988 - Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan:
Rank & Sort Loss for Object Detection and Instance Segmentation. 2989-2998 - Boxiao Liu, Guanglu Song, Manyuan Zhang, Haihang You, Yu Liu:
Switchable K-class Hyperplanes for Noise-Robust Representation Learning. 2999-3008 - Kun Yuan, Yiming Chen, Xinmeng Huang, Yingya Zhang, Pan Pan, Yinghui Xu, Wotao Yin:
DecentLaM: Decentralized Momentum SGD for Large-batch Deep Training. 3009-3019 - Zhuoning Yuan, Yan Yan, Milan Sonka, Tianbao Yang:
Large-scale Robust Deep AUC Maximization: A New Surrogate Loss and Empirical Studies on Medical Image Classification. 3020-3029 - Jung Uk Kim
, Sungjune Park, Yong Man Ro
:
Robust Small-scale Pedestrian Detection with Cued Recall via Memory Learning. 3030-3039 - Mengde Xu, Zheng Zhang, Han Hu, Jianfeng Wang, Lijuan Wang, Fangyun Wei, Xiang Bai, Zicheng Liu:
End-to-End Semi-Supervised Object Detection with Soft Teacher. 3040-3049 - Tianyue Cao, Lianyu Du, Xiaoyun Zhang, Siheng Chen, Ya Zhang
, Yanfeng Wang:
CaT: Weakly Supervised Object Detection with Category Transfer. 3050-3059 - Yangyu Huang, Hao Yang, Chong Li, Jongyoo Kim, Fangyun Wei:
ADNet: Leveraging Error-Bias Towards Normal Direction in Face Alignment. 3060-3070 - Tan Wang, Chang Zhou, Qianru Sun, Hanwang Zhang:
Causal Attention for Unbiased Visual Recognition. 3071-3080 - Zhoutao Wang, Qian Xie, Yu-Kun Lai, Jing Wu, Kun Long, Jun Wang:
MLVSNet: Multi-level Voting Siamese Network for 3D Visual Tracking. 3081-3090 - Yan Lu
, Xinzhu Ma, Lei Yang, Tianzhu Zhang, Yating Liu, Qi Chu, Junjie Yan, Wanli Ouyang
:
Geometry Uncertainty Projection Network for Monocular 3D Object Detection. 3091-3101 - Rawal Khirodkar, Visesh Chari, Amit Agrawal, Ambrish Tyagi:
Multi-Instance Pose Networks: Rethinking Top-Down Pose Estimation. 3102-3111 - Hao Xu, Shuaicheng Liu
, Guangfu Wang, Guanghui Liu, Bing Zeng:
OMNet: Learning Overlapping Mask for Partial-to-Partial Point Cloud Registration. 3112-3121 - Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, Adrien Gaidon:
Is Pseudo-Lidar needed for Monocular 3D Object detection? 3122-3132 - Xiaoyang Guo, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li
:
LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector. 3133-3143 - Jiageng Mao
, Yujing Xue, Minzhe Niu, Haoyue Bai, Jiashi Feng, Xiaodan Liang, Hang Xu, Chunjing Xu:
Voxel Transformer for 3D Object Detection. 3144-3153 - Tarasha Khurana, Achal Dave, Deva Ramanan:
Detecting Invisible People. 3154-3164 - Heqian Qiu, Hongliang Li, Qingbo Wu, Jianhua Cui, Zichen Song
, Lanxiao Wang, Minjian Zhang
:
CrossDet: Crossline Representation for Object Detection. 3175-3184 - Zhiheng Ma
, Xiaopeng Hong, Xing Wei, Yunfeng Qiu, Yihong Gong:
Towards A Universal Model for Cross-Dataset Crowd Counting. 3185-3194 - Xinyan Liu
, Guorong Li, Zhenjun Han, Weigang Zhang, Yifan Yang, Qingming Huang, Nicu Sebe
:
Exploiting sample correlation for crowd counting with multi-expert network. 3195-3204 - Andrea Simonelli, Samuel Rota Bulò, Lorenzo Porzi, Peter Kontschieder, Elisa Ricci
:
Are we Missing Confidence in Pseudo-LiDAR Methods for Monocular 3D Object Detection? 3205-3213 - Changan Wang, Qingyu Song, Boshen Zhang, Yabiao Wang
, Ying Tai
, Xuyi Hu, Chengjie Wang, Jilin Li, Jiayi Ma, Yang Wu:
Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting. 3214-3222 - Dror Aiger, Simon Lynen, Jan Hosang, Bernhard Zeisl:
Efficient Large Scale Inlier Voting for Geometric Vision Problems. 3223-3231 - Shuzhe Wang
, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Juho Kannala:
Continual Learning for Image-Based Camera Localization. 3232-3242 - Guangxing Han, Yicheng He, Shiyuan Huang, Jiawei Ma
, Shih-Fu Chang:
Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks. 3243-3252 - Xingxu Yao, Sicheng Zhao, Pengfei Xu, Jufeng Yang:
Multi-Source Domain Adaptation for Object Detection. 3253-3262 - Yongming Rao, Benlin Liu, Yi Wei, Jiwen Lu, Cho-Jui Hsieh, Jie Zhou:
RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object Detection. 3263-3272 - Jiaming Sun
, Yiming Xie, Siyu Zhang, Linghao Chen, Guofeng Zhang, Hujun Bao, Xiaowei Zhou:
You Don't Only Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and Tracking. 3265-3174 - Hanxue Liang, Chenhan Jiang, Dapeng Feng, Xin Chen, Hang Xu, Xiaodan Liang, Wei Zhang, Zhenguo Li, Luc Van Gool:
Exploring Geometry-aware Contrast and Clustering Harmonization for Self-supervised 3D Object Detection. 3273-3282 - Shun Iwase, Xingyu Liu, Rawal Khirodkar, Rio Yokota, Kris M. Kitani:
RePOSE: Fast 6D Object Pose Refinement via Deep Texture Rendering. 3283-3292 - Junho Kim, Changwoon Choi, Hojun Jang, Young Min Kim:
PICCOLO: Point Cloud-Centric Omnidirectional Localization. 3293-3303 - Cheng Chi, Shuran Song
:
GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion. 3304-3313 - Jingyi Cao, Bo Liu
, Yunqian Wen, Rong Xie, Li Song:
Personalized and Invertible Face De-identification by Disentangled Identity Information Manipulation. 3314-3322 - Dominik Rivoir, Micha Pfeiffer, Reuben Docea, Fiona R. Kolbinger
, Carina Riediger, Jürgen Weitz, Stefanie Speidel
:
Long-Term Temporally Consistent Unpaired Video Translation from Simulated Surgical 3D Data. 3323-3333 - Dongyang Zhao, Ziyang Song, Zhenghao Ji, Gangming Zhao, Weifeng Ge, Yizhou Yu:
Multi-scale Matching Networks for Semantic Correspondence. 3334-3344 - Qingyu Song, Changan Wang, Zhengkai Jiang, Yabiao Wang
, Ying Tai
, Chengjie Wang, Jilin Li, Feiyue Huang, Yang Wu:
Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework. 3345-3354 - Yuming Du, Yang Xiao, Vincent Lepetit:
Learning to Better Segment Objects from Unseen Classes with Unlabeled Videos. 3355-3364 - Meng Meng, Tianzhu Zhang, Qi Tian, Yongdong Zhang, Feng Wu:
Foreground Activation Maps for Weakly Supervised Object Localization. 3365-3375 - Thomas Hastings Greer, Roland Kwitt, François-Xavier Vialard, Marc Niethammer:
ICON: Learning Regular Maps Through Inverse Consistency. 3376-3385 - Shiyi Lan, Zhiding Yu, Christopher B. Choy, Subhashree Radhakrishnan, Guilin Liu, Yuke Zhu, Larry S. Davis, Anima Anandkumar:
DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision. 3386-3396 - Chengjian Feng, Yujie Zhong, Weilin Huang:
Exploring Classification Equilibrium in Long-Tailed Object Detection. 3397-3406 - Jeesoo Kim, Junsuk Choe
, Sangdoo Yun, Nojun Kwak:
Normalization Matters in Weakly Supervised Object Localization. 3407-3416 - Jaeyoung Yoo, Hojun Lee
, Inseop Chung, Geonseok Seo, Nojun Kwak:
Training Multi-Object Detector by Estimating Bounding Box Distribution for Input Image. 3417-3426 - Siyu Huang
, Tianyang Wang, Haoyi Xiong
, Jun Huan, Dejing Dou:
Semi-Supervised Active Learning with Temporal Output Discrepancy. 3427-3436 - Yuhang Zang, Chen Huang, Chen Change Loy:
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation. 3437-3446 - Yifan Xing, Tong He, Tianjun Xiao, Yongxin Wang, Yuanjun Xiong, Wei Xia, David Wipf, Zheng Zhang, Stefano Soatto:
Learning Hierarchical Graph Neural Networks for Image Clustering. 3447-3457 - Shekoofeh Azizi
, Basil Mustafa, Fiona Ryan, Zachary Beaver, Jan Freyberg, Jonathan Deaton, Aaron Loh, Alan Karthikesalingam, Simon Kornblith, Ting Chen, Vivek Natarajan, Mohammad Norouzi:
Big Self-Supervised Models Advance Medical Image Classification. 3458-3468 - Huisi Wu, Guilian Chen, Zhenkun Wen, Jing Qin:
Collaborative and Adversarial Learning of Focused and Dispersive Representations for Semi-supervised Polyp Segmentation. 3469-3478 - Hong-Yu Zhou
, Chixiang Lu, Sibei Yang, Xiaoguang Han, Yizhou Yu:
Preservational Learning Improves Self-supervised Medical Image Models by Reconstructing Diverse Contexts. 3479-3489 - Chengjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott
, Weilin Huang:
TOOD: Task-aligned One-stage Object Detection. 3490-3499 - Xingxing Xie, Gong Cheng
, Jiabao Wang, Xiwen Yao, Junwei Han:
Oriented R-CNN for Object Detection. 3500-3509 - Agastya Kalra, Guy Stoppi, Bradley Brown, Rishav Agarwal, Achuta Kadambi:
Towards Rotation Invariance in Object Detection. 3510-3520 - Denys Rozumnyi
, Jirí Matas
, Filip Sroubek, Marc Pollefeys
, Martin R. Oswald:
FMODetect: Robust Detection of Fast Moving Objects. 3521-3529 - Qi Dong, Zhuowen Tu, Haofu Liao, Yuting Zhang, Vijay Mahadevan, Stefano Soatto:
Visual Relationship Detection Using Part-and-Sum Transformers with Composite Queries. 3530-3539 - Jiehong Lin, Zewei Wei, Zhihao Li, Songcen Xu, Kui Jia, Yuanqing Li:
DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency. 3540-3549 - Rindra Ramamonjison, Amin Banitalebi-Dehkordi, Xinyu Kang, Xiaolong Bai, Yong Zhang
:
SimROD: A Simple Adaptation Method for Robust Object Detection. 3550-3559 - Lv Tang, Bo Li, Yijie Zhong
, Shouhong Ding, Mofei Song:
Disentangled High Quality Salient Object Detection. 3560-3570 - Lewei Yao, Renjie Pi, Hang Xu, Wei Zhang, Zhenguo Li, Tong Zhang:
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation. 3571-3580 - Fanglei Xue, Qiangchang Wang, Guodong Guo:
TransFER: Learning Relation-aware Facial Expression Representations with Transformers. 3581-3590 - Zhiqing Sun, Shengcao Cao, Yiming Yang, Kris Kitani:
Rethinking Transformer-based Set Prediction for Object Detection. 3591-3600 - Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai
, Hongsheng Li
:
Fast Convergence of DETR with Spatially Modulated Co-Attention. 3601-3610 - Keyang Wang, Lei Zhang:
Reconcile Prediction Consistency for Balanced Object Detection. 3611-3620 - Ziteng Gao, Limin Wang, Gangshan Wu:
Mutual Supervision for Dense Object Detection. 3621-3630 - Depu Meng
, Xiaokang Chen, Zejia Fan, Gang Zeng, Houqiang Li, Yuhui Yuan, Lei Sun, Jingdong Wang:
Conditional DETR for Fast Training Convergence. 3631-3640 - Haoxuanye Ji
, Le Wang, Sanping Zhou, Wei Tang, Nanning Zheng, Gang Hua:
Meta Pairwise Relationship Distillation for Unsupervised Person Re-identification. 3641-3650 - Hardik Uppal, Alireza Sepas-Moghaddam
, Michael A. Greenspan, Ali Etemad:
Teacher-Student Adversarial Depth Hallucination to Improve Face Recognition. 3651-3660 - Erroll Wood, Tadas Baltrusaitis, Charlie Hewitt, Sebastian Dziadzio, Thomas J. Cashman, Jamie Shotton:
Fake it till you make it: face analysis in the wild using synthetic data alone. 3661-3671 - Xuege Hou, Yali Li, Shengjin Wang:
Disentangled Representation for Age-Invariant Face Recognition: A Mutual Information Minimization Perspective. 3672-3681 - Yunjia Sun
, Jiabei Zeng, Shiguang Shan, Xilin Chen:
Cross-Encoder for Unsupervised Gaze Representation Learning. 3682-3691 - Qian Xie, Yu-Kun Lai, Jing Wu, Zhoutao Wang, Dening Lu, Mingqiang Wei, Jun Wang:
VENet: Voting Enhancement Network for 3D Object Detection. 3692-3701 - Mingtao Feng, Zhen Li, Qi Li, Liang Zhang, Xiangdong Zhang, Guangming Zhu, Hui Zhang, Yaonan Wang, Ajmal Mian
:
Free-form Description Guided 3D Visual Graph Network for Object Grounding in Point Cloud. 3702-3711 - Jianping Wu
, Liang Zhang, Ye Liu, Ke Chen:
Real-time Vanishing Point Detector Integrating Under-parameterized RANSAC and Hough Transform. 3712-3721 - Yunhao Li, Wei Shen, Zhongpai Gao
, Yucheng Zhu, Guangtao Zhai, Guodong Guo:
Looking here or there? Gaze Following in 360-Degree Images. 3722-3731 - Yawei Li
, He Chen, Zhaopeng Cui, Radu Timofte
, Marc Pollefeys
, Gregory S. Chirikjian, Luc Van Gool:
Towards Efficient Graph Convolutional Networks for Point Cloud Handling. 3732-3742 - Yunze Man, Xinshuo Weng, Prasanna Kumar Sivakumar, Matthew O'Toole, Kris Kitani:
Multi-Echo LiDAR for 3D Object Detection. 3743-3752 - Lizhe Liu, Xiaohao Chen, Siyu Zhu, Ping Tan:
CondLaneNet: a Top-to-down Lane Detection Framework Based on Conditional Convolution. 3753-3762 - Huajun Liu, Xiangyu Miao, Christoph Mertz, Chengzhong Xu
, Hui Kong
:
CrackFormer: Transformer Network for Fine-Grained Crack Detection. 3763-3772 - Robin Magnet
, Maks Ovsjanikov:
DWKS : A Local Descriptor of Deformations Between Meshes and Point Clouds. 3773-3782 - Colin L. V. Cooke, Fanjie Kong, Amey Chaware, Kevin C. Zhou
, Kanghyun Kim, Rong Xu, D. Michael Ando, Samuel J. Yang, Pavan Chandra Konda, Roarke Horstmeyer
:
Physics-Enhanced Machine Learning for Virtual Fluorescence Microscopy. 3783-3793 - Jiaheng Liu, Yudong Wu, Yichao Wu, Chuming Li, Xiaolin Hu, Ding Liang, Mengyu Wang:
DAM: Discrepancy Alignment Metric for Face Recognition. 3794-3803 - Tianye Li, Shichen Liu, Timo Bolkart, Jiayi Liu, Hao Li
, Yajie Zhao:
Topologically Consistent Multi-View Face Inference Using Volumetric Sampling. 3804-3814 - Yunfei Liu, Ruicong Liu, Haofei Wang, Feng Lu
:
Generalizing Gaze Estimation with Outlier-guided Collaborative Adaptation. 3815-3824 - Junfu Liu, Di Qiu, Pengfei Yan, Xiaolin Wei:
Learn to Cluster Faces via Pairwise Classification. 3825-3833 - Xiangrui Zeng, Gregory Howe, Min Xu:
End-to-end robust joint unsupervised image alignment and clustering. 3834-3846 - Chenxu Zhang, Yifan Zhao, Yifei Huang, Ming Zeng, Saifeng Ni, Madhukar Budagavi, Xiaohu Guo:
FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning. 3847-3856 - Sen He, Wentong Liao, Michael Ying Yang
, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang:
Disentangled Lifespan Face Synthesis. 3857-3866 - Min Jin Chong, Wen-Sheng Chu, Abhishek Kumar, David A. Forsyth:
Retrieve in Style: Unsupervised Facial Feature Transfer and Retrieval. 3867-3876 - Xiao Yang, Yinpeng Dong, Tianyu Pang
, Hang Su, Jun Zhu, Yuefeng Chen, Hui Xue:
Towards Face Encryption by Generating Adversarial Identity Masks. 3877-3887 - Farkhod Makhmudkhujaev
, Sungeun Hong, In Kyu Park:
Re-Aging GAN: Toward Personalized Face Age Transformation. 3888-3897 - Hao Tang, Xingwei Liu, Shanlin Sun, Xiangyi Yan, Xiaohui Xie:
Recurrent Mask Refinement for Few-Shot Medical Image Segmentation. 3898-3908 - Neel Dey, Mengwei Ren, Adrian V. Dalca, Guido Gerig:
Generative Adversarial Registration for Improved Conditional Deformable Templates. 3909-3921 - Shih-Cheng Huang, Liyue Shen, Matthew P. Lungren, Serena Yeung
:
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-efficient Medical Image Recognition. 3922-3931 - Alireza Naghizadeh, Hongye Xu, Mohab Mohamed, Dimitris N. Metaxas, Dongfang Liu:
Semantic Aware Data Augmentation for Cell Nuclei Microscopical Images with Artificial Neural Networks. 3932-3941 - Dong Yang, Andriy Myronenko
, Xiaosong Wang
, Ziyue Xu, Holger R. Roth, Daguang Xu:
T-AutoML: Automated Machine Learning for Lesion Segmentation using Transformers in 3D Medical Imaging. 3942-3954 - Yuhang Ding, Xin Yu
, Yi Yang:
RFNet: Region-aware Fusion Network for Incomplete Multi-modal Brain Tumor Segmentation. 3955-3964 - Yi Zhou, Lei Huang, Tao Zhou, Huazhu Fu
, Ling Shao:
Visual-Textual Attentive Semantic Consistency for Medical Report Generation. 3965-3974 - John Gideon, Simon Stent:
The Way to my Heart is through Contrastive Learning: Remote Photoplethysmography from Unlabelled Video. 3975-3984 - Shahira Abousamra, David Belinsky, John S. Van Arnam, Felicia Allard, Eric Yee, Rajarsi Gupta, Tahsin M. Kurç, Dimitris Samaras, Joel H. Saltz, Chao Chen:
Multi-Class Cell Detection Using Spatial Context Representation. 3985-3994 - Richard J. Chen, Ming Y. Lu, Wei-Hung Weng, Tiffany Y. Chen, Drew F. K. Williamson, Trevor Manz, Maha Shady, Faisal Mahmood:
Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images. 3995-4005 - Hongliang He, Zhongyi Huang, Yao Ding, Guoli Song, Lin Wang, Qian Ren, Pengxu Wei, Zhiqiang Gao, Jie Chen:
CDNet: Centripetal Direction Network for Nuclear Instance Segmentation. 4006-4015 - Zunlei Feng, Zhonghua Wang, Xinchao Wang
, Yining Mao, Thomas Li, Jie Lei, Yuexuan Wang, Mingli Song:
Mutual-Complementing Framework for Nuclei Detection and Segmentation in Pathology Image. 4016-4025 - Michelle Shu, Richard Strong Bowen, Charles Herrmann, Gengmo Qi, Michele Santacatterina, Ramin Zabih:
Deep survival analysis with longitudinal X-rays for COVID-19. 4026-4035 - Zhidong Yang, Fa Zhang, Renmin Han
:
Self-Supervised Cryo-Electron Tomography Volumetric Image Restoration from Single Noisy Volume with Sparsity Constraint. 4036-4045 - Ellen D. Zhong
, Adam Lerer, Joseph H. Davis, Bonnie Berger:
CryoDRGN2: Ab initio neural reconstruction of 3D protein structures from real cryo-EM images. 4046-4055 - Jingyun Liang, Andreas Lugmayr, Kai Zhang
, Martin Danelljan, Luc Van Gool, Radu Timofte
:
Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling. 4056-4065 - Xueyang Fu
, Xi Wang, Aiping Liu, Junwei Han, Zheng-Jun Zha:
Learning Dual Priors for JPEG Compression Artifacts Removal. 4066-4075 - Jingyun Liang, Guolei Sun, Kai Zhang
, Luc Van Gool, Radu Timofte
:
Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution. 4076-4085 - Zhaoyang Zhang, Yitong Jiang, Jun Jiang
, Xiaogang Wang, Ping Luo, Jinwei Gu:
STAR: A Structure-aware Lightweight Transformer for Real-time Image Enhancement. 4086-4095 - Jichun Li
, Weimin Tan, Bo Yan:
Perceptual Variousness Motion Deblurring with Light Global Context Refinement. 4096-4105 - Yuda Song, Hui Qian, Xin Du:
StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement. 4106-4115 - Yongri Piao, Jian Wang, Miao Zhang, Huchuan Lu:
MFNet: Multi-filter Directive Network for Weakly Supervised Salient Object Detection. 4116-4125 - Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, Deng-Ping Fan
:
Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection. 4126-4135 - Avishek Siris, Jianbo Jiao
, Gary K. L. Tam, Xianghua Xie, Rynson W. H. Lau:
Scene Context-Aware Salient Object Detection. 4136-4146 - Ni Zhang, Junwei Han, Nian Liu
, Ling Shao:
Summarize and Search: Learning Consensus-aware Dynamic Convolution for Co-Saliency Detection. 4147-4156 - Xiaotian Qiao
, Gerhard P. Hancke, Rynson W. H. Lau:
Light Source Guided Single-Image Flare Removal from Unpaired Data. 4157-4165 - Bin Tan
, Nan Xue, Song Bai, Tianfu Wu
, Gui-Song Xia
:
PlaneTR: Structure-Guided Transformers for 3D Plane Recovery. 4166-4175 - Wei-Ting Chen, Hao-Yu Fang, Cheng-Lin Hsieh, Cheng-Che Tsai, I-Hsiang Chen, Jian-Jiun Ding, Sy-Yen Kuo:
ALL Snow Removed: Single Image Desnowing Algorithm Using Hierarchical Dual-tree Complex Wavelet Representation and Contradict Channel Loss. 4176-4185 - Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge J. Belongie
, Ser-Nam Lim:
Exploring Visual Engagement Signals for Representation Learning. 4186-4197 - Zhiyu Pan, Zhiguo Cao, Kewei Wang, Hao Lu, Weicai Zhong:
TransView: Inside, Outside, and Across the Cropping View Boundaries. 4198-4207 - Bin Fan, Yuchao Dai:
Inverting a Rolling Shutter Camera: Bring Rolling Shutter Images to High Framerate Global Shutter Video. 4208-4217 - Qiaosi Yi, Juncheng Li, Qinyan Dai, Faming Fang, Guixu Zhang, Tieyong Zeng:
Structure-Preserving Deraining with Residue Channel Prior Guidance. 4218-4227 - Ke Yu, Zexian Li, Yue Peng, Chen Change Loy, Jinwei Gu:
ReconfigISP: Reconfigurable Camera Image Processing Pipeline. 4228-4237 - S. Mohammad Mostafavi I.
, Kuk-Jin Yoon, Jonghyun Choi
:
Event-Intensity Stereo: Estimating Depth by the Best of Both Worlds. 4238-4247 - Sagnik Das, Kunwar Yashraj Singh, Jon Wu, Erhan Bas, Vijay Mahadevan, Rahul Bhotika, Dimitris Samaras:
End-to-end Piece-wise Unwarping of Document Images. 4248-4257 - Yulun Zhang
, Donglai Wei, Can Qin, Huan Wang, Hanspeter Pfister
, Yun Fu:
Context Reasoning Attention Network for Image Super-Resolution. 4258-4267 - Salma Abdel Magid, Yulun Zhang
, Donglai Wei, Won-Dong Jang, Zudi Lin, Yun Fu, Hanspeter Pfister
:
Dynamic High-Pass Filtering and Multi-Spectral Attention for Image Super-Resolution. 4268-4277 - Xiaobin Hu, Wenqi Ren, Kaicheng Yu, Kaihao Zhang, Xiaochun Cao, Wei Liu, Bjoern H. Menze
:
Pyramid Architecture Search for Real-Time Image Deblurring. 4278-4287 - Wenbin Xie, Dehua Song, Chang Xu
, Chunjing Xu, Hui Zhang, Yunhe Wang:
Learning Frequency-aware Dynamic Network for Efficient Super-Resolution. 4288-4297 - Wei Wang, Haochen Zhang
, Zehuan Yuan, Changhu Wang:
Unsupervised Real-World Super-Resolution: A Domain Adaptation Perspective. 4298-4307 - Chong Mou, Jian Zhang, Zhuoyuan Wu:
Dynamic Attentive Graph Learning for Image Restoration. 4308-4317 - Jing Zhang, Deng-Ping Fan
, Yuchao Dai, Xin Yu
, Yiran Zhong, Nick Barnes, Ling Shao:
RGB-D Saliency Detection via Cascaded Mutual Information Minimization. 4318-4327 - Zhilu Zhang, Haolin Wang, Ming Liu, Ruohao Wang, Jiawei Zhang, Wangmeng Zuo:
Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision. 4328-4338 - Yixin Chen, Pengguang Chen, Shu Liu, Liwei Wang, Jiaya Jia
:
Deep Structured Instance Graph for Distilling Object Detectors. 4339-4348 - Jhih-Ciang Wu
, Ding-Jie Chen, Chiou-Shann Fuh, Tyng-Luh Liu:
Learning Unsupervised Metaformer for Anomaly Detection. 4349-4358 - Dongdong Chen, Julián Tachella, Mike E. Davies:
Equivariant Imaging: Learning Beyond the Range Space. 4359-4368 - Kang Liao, Chunyu Lin, Lixin Liao, Yao Zhao, Weiyao Lin:
Multi-Level Curriculum for Training A Distortion-Aware Barrel Distortion Rectification Model. 4369-4378 - Attila Lengyel, Sourav Garg
, Michael Milford
, Jan C. van Gemert:
Zero-Shot Day-Night Domain Adaptation with a Physics Prior. 4379-4389 - Yuhang Li, Feng Zhu, Ruihao Gong
, Mingzhu Shen, Xin Dong, Fengwei Yu, Shaoqing Lu, Shi Gu:
MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing. 4390-4399 - Lin Zhang, Yong Luo, Yan Bai, Bo Du, Ling-Yu Duan:
Federated Learning for Non-IID Data via Unified Feature Learning and Optimization Objective Alignment. 4400-4408 - Peng Yi, Zhongyuan Wang, Kui Jiang, Junjun Jiang, Tao Lu, Xin Tian, Jiayi Ma:
Omniscient Video Super-Resolution. 4409-4418 - Chuanjun Zheng, Daming Shi, Wentian Shi:
Adaptive Unfolding Total Variation Network for Low-Light Image Enhancement. 4419-4428 - Zhuoran Zheng, Wenqi Ren, Xiaochun Cao, Tao Wang, Xiuyi Jia:
Ultra-High-Definition Image HDR Reconstruction via Collaborative Bilateral Learning. 4429-4438 - Hanul Kim, Su-Min Choi, Chang-Su Kim, Yeong Jun Koh:
Representative Color Transform for Image Enhancement. 4439-4448 - Peike Li, Xin Yu
, Yi Yang:
Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar. 4449-4459 - Siqi Li, Yutong Feng, Yipeng Li, Yu Jiang, Changqing Zou, Yue Gao:
Event Stream Super-Resolution via Spatiotemporal Constraint Learning. 4460-4469 - Yuan Tian
, Guo Lu, Xiongkuo Min
, Zhaohui Che, Guangtao Zhai, Guodong Guo, Zhiyong Gao:
Self-Conditioned Probabilistic Learning of Video Rescaling. 4470-4479 - Xiangyu Chen, Zhengwen Zhang, Jimmy S. Ren, Lynhoo Tian, Yu Qiao, Chao Dong:
A New Journey from SDRTV to HDRTV. 4480-4489 - Xiaohan Ding, Tianxiang Hao, Jianchao Tan, Ji Liu, Jungong Han, Yuchen Guo, Guiguang Ding:
ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting. 4490-4500 - Mehrdad Khani Shirkoohi, Vibhaalakshmi Sivaraman, Mohammad Alizadeh:
Efficient Video Compression via Content-Adaptive Super-Resolution. 4501-4510 - Wei Shang, Dongwei Ren
, Dongqing Zou, Jimmy S. Ren, Ping Luo, Wangmeng Zuo:
Bringing Events into Video Deblurring with Non-consecutively Blurry Frames. 4511-4520 - Bin Fan, Yuchao Dai, Mingyi He
:
SUNet: Symmetric Undistortion Network for Rolling Shutter Correction. 4521-4530 - Jérôme Revaud, Martin Humenberger:
Robust Automatic Monocular Vehicle Speed Estimation for Traffic Surveillance. 4531-4541 - Scott Workman, Hunter Blanton:
Augmenting Depth Estimation with Geospatial Context. 4542-4551 - Mehrdad Khani Shirkoohi, Pouya Hamadanian, Arash Nasr-Esfahany, Mohammad Alizadeh:
Real-Time Video Inference on Edge Devices via Adaptive Model Streaming. 4552-4562 - Shitong Luo, Wei Hu:
Score-Based Point Cloud Denoising. 4563-4572 - Yi Zhang
, Hongwei Qin, Xiaogang Wang, Hongsheng Li
:
Rethinking Noise Synthesis and Modeling in Raw Denoising. 4573-4581 - Erik Jenner, Enrique Fita Sanmartín, Fred A. Hamprecht:
Extensions of Karger's Algorithm: Why They Fail in Theory and How They Are Useful in Practice. 4582-4591 - Zhanliang Wang, Junyu Dong, Xinguo Liu, Xueying Zeng:
Low-Rank Tensor Completion by Approximating the Tensor Average Rank. 4592-4600 - Huanyu Wang, Songyuan Li, Shihao Su, Zequn Qin, Xi Li:
RDI-Net: Relational Dynamic Inference Networks. 4601-4610 - Jiaming Liu, Ming Lu, Kaixin Chen, Xiaoqi Li, Shizun Wang, Zhaoqing Wang, Enhua Wu, Yurong Chen
, Chuang Zhang, Ming Wu:
Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation. 4611-4620 - Sung-Jin Cho, Seo-Won Ji, Jun-Pyo Hong, Seung-Won Jung
, Sung-Jea Ko:
Rethinking Coarse-to-Fine Approach in Single Image Deblurring. 4621-4630 - Yao Li, Xueyang Fu
, Zheng-Jun Zha:
Cross-Patch Graph Convolutional Network for Image Denoising. 4631-4640 - Tao Wang
, Li Yuan, Yunpeng Chen
, Jiashi Feng, Shuicheng Yan:
PnP-DETR: Towards Efficient Visual Analysis with Transformers. 4641-4650 - Isha Garg, Sayeed Shafayet Chowdhury, Kaushik Roy:
DCT-SNN: Using DCT to Distribute Spatial Information over Time for Low-Latency Spiking Neural Networks. 4651-4660 - Tao Zhou, Huazhu Fu
, Geng Chen
, Yi Zhou, Deng-Ping Fan
, Ling Shao:
Specificity-preserving RGB-D Saliency Detection. 4661-4671 - Ziyu Wan, Jingbo Zhang
, Dongdong Chen, Jing Liao
:
High-Fidelity Pluralistic Image Completion with Transformers. 4672-4681 - Lei Zhu
, Ke Xu, Zhanghan Ke, Rynson W. H. Lau:
Mitigating Intensity Bias in Shadow Detection via Feature Decomposition and Reweighting. 4682-4691 - Nian Liu
, Wangbo Zhao, Dingwen Zhang, Junwei Han, Ling Shao:
Light Field Saliency Detection with Dual Local Graph Learning and Reciprocative Guidance. 4692-4701 - Nian Liu
, Ni Zhang, Kaiyuan Wan, Ling Shao, Junwei Han:
Visual Saliency Transformer. 4702-4712 - Junpeng Jing
, Xin Deng, Mai Xu, Jianyi Wang, Zhenyu Guan:
HiNet: Deep Image Hiding by Invertible Network. 4713-4722 - Zipei Chen, Chengjiang Long, Ling Zhang, Chunxia Xiao:
CANet: A Context-Aware Network for Shadow Removal. 4723-4732 - Yang Liu, Ziyu Yue, Jinshan Pan, Zhixun Su
:
Unpaired Learning for Deep Image Deraining with Rain Direction Regularizer. 4733-4741 - Zirui Liu, Haifeng Jin, Ting-Hsiang Wang, Kaixiong Zhou, Xia Hu:
DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization. 4742-4750 - Xiangyun Zhao, Xu Zou, Ying Wu:
Morphable Detector for Object Detection on Demand. 4751-4760 - Xi Yang, Wangmeng Xiang, Hui Zeng, Lei Zhang
:
Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme. 4761-4770 - Kai Zhang
, Jingyun Liang, Luc Van Gool, Radu Timofte
:
Designing a Practical Degradation Model for Deep Blind Image Super-Resolution. 4771-4780 - Longguang Wang, Yingqian Wang
, Zaiping Lin, Jungang Yang, Wei An, Yulan Guo:
Learning A Single Network for Scale-Arbitrary Super-Resolution. 4781-4790 - Jinshan Pan, Haoran Bai, Jiangxin Dong, Jiawei Zhang, Jinhui Tang:
Deep Blind Video Super-resolution. 4791-4800 - Zheng Zhan, Yifan Gong, Pu Zhao
, Geng Yuan, Wei Niu
, Yushu Wu, Tianyun Zhang, Malith Jayaweera, David R. Kaeli, Bin Ren, Xue Lin, Yanzhi Wang:
Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search. 4801-4811 - Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang:
SSH: A Self-Supervised Framework for Image Harmonization. 4812-4821 - Yufei Xu, Jing Zhang, Dacheng Tao:
Out-of-boundary View Synthesis Towards Full-Frame Video Stabilization. 4822-4831 - Jay Shenoy
, James Fong, Jeffrey Tan, Austin Roorda, Ren Ng:
R-SLAM: Optimizing Eye Tracking from Rolling Shutter Video of the Retina. 4832-4841 - Seokju Lee, François Rameau, Fei Pan, In So Kweon:
Attentive and Contrastive Learning for Joint Depth and Motion Field Estimation. 4842-4851 - Vivien Sainte Fare Garnot, Loïc Landrieu:
Panoptic Segmentation of Satellite Image Time Series with Convolutional Temporal Attention Networks. 4852-4861 - Jin Han, Yixin Yang, Chu Zhou
, Chao Xu, Boxin Shi:
EvIntSR-Net: Event Guided Multiple Latent Frames Reconstruction and Super-resolution. 4862-4871 - Zhuoyuan Wu, Jian Zhang, Chong Mou:
Dense Deep Unfolding Network with 3D-CNN Prior for Snapshot Compressive Imaging. 4872-4881 - Tiantian Wang, Sifei Liu, Yapeng Tian, Kai Li, Ming-Hsuan Yang:
Video Matting via Consistency-Regularized Graph Neural Networks. 4882-4891 - Weiming Zhuang, Xin Gan, Yonggang Wen, Shuai Zhang, Shuai Yi:
Collaborative Unsupervised Visual Representation Learning from Decentralized Data. 4892-4901 - Ge-Peng Ji, Keren Fu, Zhe Wu, Deng-Ping Fan
, Jianbing Shen, Ling Shao:
Full-Duplex Strategy for Video Object Segmentation. 4902-4913 - Yuchao Gu, Shang-Hua Gao, Xu-Sheng Cao, Peng Du, Shao-Ping Lu, Ming-Ming Cheng
:
iNAS: Integral NAS for Device-Aware Salient Object Detection. 4914-4924 - Pei Wang, Nuno Vasconcelos
:
A Machine Teaching Framework for Scalable Recognition. 4925-4934 - Ewa Magdalena Nowara, Daniel McDuff, Ashok Veeraraghavan:
The Benefit of Distraction: Denoising Camera-Based Physiological Measurements using Inverse Attention. 4935-4944 - Haoran Zhou, Yidan Feng, Mingsheng Fang, Mingqiang Wei, Jing Qin, Tong Lu:
Adaptive Graph Convolution for Point Cloud Analysis. 4945-4954 - Yu Tian, Guansong Pang
, Yuanhong Chen, Rajvinder Singh, Johan W. Verjans
, Gustavo Carneiro
:
Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning. 4955-4966 - Jie Xiao, Man Zhou, Xueyang Fu
, Aiping Liu, Zheng-Jun Zha:
Improving De-raining Generalization via Neural Reorganization. 4967-4976 - Jiaxi Jiang
, Kai Zhang
, Radu Timofte
:
Towards Flexible Blind JPEG Artifacts Removal. 4977-4986 - Simron Thapa, Nianyi Li, Jinwei Ye:
Learning to Remove Refractive Distortions from Underwater Images. 4987-4996 - Zheng Dong, Ke Xu
, Yin Yang, Hujun Bao, Weiwei Xu, Rynson W. H. Lau:
Location-aware Single Image Reflection Removal. 4997-5006 - Yeying Jin
, Aashish Sharma, Robby T. Tan:
DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided Network. 5007-5016 - Yuqi Ding
, Yu Ji, Mingyuan Zhou, Sing Bing Kang, Jinwei Ye:
Polarimetric Helmholtz Stereopsis. 5017-5026 - Ying Chen, Feng Mao, Jie Song, Xinchao Wang
, Huiqiong Wang, Mingli Song:
Self-born Wiring for Neural Trees. 5027-5036 - Yichen Zhu
, Yi Wang:
Student Customized Knowledge Distillation: Bridging the Gap Between Student and Teacher. 5037-5046 - Yajing Kong, Liu Liu, Jun Wang
, Dacheng Tao:
Adaptive Curriculum Learning. 5047-5056 - Linning Xu
, Yuanbo Xiangli, Anyi Rao
, Nanxuan Zhao, Bo Dai, Ziwei Liu, Dahua Lin:
BlockPlanner: City Block Generation with Vectorized Graph Representation. 5057-5066 - Yeonsik Jo, Se Young Chun, Jonghyun Choi
:
Rethinking Deep Image Prior for Denoising. 5067-5076 - Hang Xu, Ning Kang, Gengwei Zhang
, Chuanlong Xie, Xiaodan Liang, Zhenguo Li:
NASOA: Towards Faster Task-oriented Online Fine-tuning with a Zoo of Models. 5077-5086 - Jae-Han Lee, Chul Lee, Chang-Su Kim:
Learning Multiple Pixelwise Tasks Based on Loss Scale Balancing. 5087-5096 - Zhuo Su, Wenzhe Liu, Zitong Yu, Dewen Hu, Qing Liao, Qi Tian, Matti Pietikäinen, Li Liu:
Pixel Difference Networks for Efficient Edge Detection. 5097-5107 - Robin Chan
, Matthias Rottmann, Hanno Gottschalk:
Entropy Maximization and Meta Classification for Out-of-Distribution Detection in Semantic Segmentation. 5108-5117 - Nergis Tomen
, Jan C. van Gemert:
Spectral Leakage and Rethinking the Kernel Size in CNNs. 5118-5127 - Junjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, Feng Yang:
MUSIQ: Multi-scale Image Quality Transformer. 5128-5137 - Thomas Verelst
, Tinne Tuytelaars
:
BlockCopy: High-Resolution Video Processing with Block-Sparse Feature Propagation and Online Policies. 5138-5147 - Yonggan Fu, Yang Zhang, Yue Wang, Zhihan Lu, Vivek Boominathan
, Ashok Veeraraghavan, Yingyan Lin:
SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-powered Intelligent PhlatCam. 5148-5157 - Pengfei Chen, Leida Li, Jinjian Wu, Weisheng Dong, Guangming Shi:
Unsupervised Curriculum Domain Adaptation for No-Reference Video Quality Assessment. 5158-5167 - Adrian Bulat, Georgios Tzimiropoulos:
Bit-Mixer: Mixed-precision networks with runtime bit-width selection. 5168-5177 - Zihan Xu, Mingbao Lin, Jianzhuang Liu, Jie Chen, Ling Shao, Yue Gao, Yonghong Tian, Rongrong Ji
:
ReCU: Reviving the Dead Weights in Binary Neural Networks. 5178-5188 - Souvik Kundu, Massoud Pedram, Peter A. Beerel:
HIRE-SNN: Harnessing the Inherent Robustness of Energy-Efficient Deep Spiking Neural Networks by Training with Crafted Input Noise. 5189-5198 - Peng Chen, Bohan Zhuang, Chunhua Shen:
FATNN: Fast and Accurate Ternary Neural Networks*. 5199-5208 - Jiaqi Gu, Hanqing Zhu, Chenghao Feng, Mingjie Liu, Zixuan Jiang, Ray T. Chen, David Z. Pan:
Towards Memory-Efficient Neural Networks via Multi-Level in situ Generation. 5209-5218 - Yi Guo, Huan Yuan, Jianchao Tan, Zhangyang Wang, Sen Yang, Ji Liu:
GDP: Stabilized Neural Network Pruning via Gates with Differentiable Polarization. 5219-5230 - Sung-En Chang, Yanyu Li, Mengshu Sun
, Weiwen Jiang, Sijia Liu, Yanzhi Wang, Xue Lin:
RMSMP: A Novel Deep Neural Network Quantization Framework with Row-wise Mixed Schemes and Multiple Precisions. 5231-5240 - Tiantian Han, Dong Li, Ji Liu, Lu Tian, Yi Shan:
Improving Low-Precision Network Quantization via Bin Regularization. 5241-5250 - Dohyung Kim, Junghyup Lee
, Bumsub Ham:
Distance-aware Quantization. 5251-5260 - Fangxin Liu, Wenbo Zhao, Zhezhi He, Yanzhi Wang, Zongwu Wang, Changzhi Dai, Xiaoyao Liang, Li Jiang:
Improving Neural Network Efficiency via Post-training Quantization with Adaptive Floating-Point. 5261-5270 - Ziwei Wang
, Han Xiao, Jiwen Lu, Jie Zhou:
Generalizable Mixed-Precision Quantization via Attribution Rank Preservation. 5271-5280 - Yongcheng Jing, Yiding Yang, Xinchao Wang
, Mingli Song, Dacheng Tao:
Meta-Aggregator: Learning to Aggregate for 1-bit Graph Neural Networks. 5281-5290 - Changyong Shu, Yifan Liu
, Jianfei Gao, Zheng Yan, Chunhua Shen:
Channel-wise Knowledge Distillation for Dense Prediction*. 5291-5300 - Yooshin Cho, Hanbyel Cho, Youngsoo Kim, Junmo Kim:
Improving Generalization of Batch Whitening by Convolutional Unit Optimization. 5301-5309 - Fanrong Li, Gang Li, Xiangyu He, Jian Cheng:
Dynamic Dual Gating Neural Networks. 5310-5319 - Mingzhu Shen, Feng Liang, Ruihao Gong
, Yuhang Li, Chuming Li, Chen Lin, Fengwei Yu, Junjie Yan, Wanli Ouyang
:
Once Quantization-Aware Training: High Performance Extremely Low-bit Architecture Search. 5320-5329 - Weihan Chen, Peisong Wang, Jian Cheng:
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization. 5330-5339 - Yikai Wang, Yi Yang, Fuchun Sun, Anbang Yao:
Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks. 5340-5349 - Jung Hyun Lee
, Jihun Yun, Sung Ju Hwang, Eunho Yang:
Cluster-Promoting Quantization with Bit-Drop for Minimizing Network Quantization Loss. 5350-5359 - Qi Sun
, Chen Bai, Tinghuan Chen
, Hao Geng, Xinyun Zhang, Yang Bai, Bei Yu:
Fast and Efficient DNN Deployment via Deep Gaussian Transfer Learning. 5360-5370 - Lvmin Zhang, Jinyue Jiang, Yi Ji, Chunping Liu:
SmartShadow: Artistic Shadow Drawing Tool for Line Drawings. 5371-5380 - Zhirui Dai, Yuepeng Jiang
, Yi Li, Bo Liu, Antoni B. Chan
, Nuno Vasconcelos
:
BEV-Net: Assessing Social Distancing Compliance by Joint People Localization and Geometric Reasoning. 5381-5391 - Boying Wang, Libo Zhang, Longyin Wen, Xianglong Liu, Yanjun Wu:
Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark. 5392-5401 - Zechen Bai, Yuta Nakashima, Noa Garcia:
Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation. 5402-5412 - Ayush Chopra, Rishabh Jain, Mayur Hemani, Balaji Krishnamurthy:
ZFlow: Gated Appearance Flow-based Virtual Try-on with 3D Priors. 5413-5422 - Je Hyeong Hong
, Seong Jong Yoo, Muhammad Zeeshan Arshad, Young Min Kim, Jinwook Kim:
Structure-from-Sherds: Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections. 5423-5431 - Gabriel Moreira
, Manuel Marques
, João Paulo Costeira
:
Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs. 5432-5440 - Thiemo Alldieck
, Hongyi Xu, Cristian Sminchisescu:
imGHUM: Implicit Generative Models of 3D Human Shape and Articulated Pose. 5441-5450 - Hugo Bertiche, Meysam Madadi, Emilio Tylson, Sergio Escalera
:
DeePSD: Automatic Deep Skinning And Pose Space Deformation For 3D Garment Animation. 5451-5460 - Kota Yamaguchi
:
CanvasVAE: Learning to Generate Vector Graphic Documents. 5461-5469 - Taewon Min, Chonghyuk Song, Eunseok Kim, Inwook Shim:
Distinctiveness oriented Positional Equilibrium for Point Cloud Registration. 5470-5478 - Peng Xiang, Xin Wen, Yu-Shen Liu
, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Zhizhong Han:
SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution with Skip-Transformer. 5479-5489 - Le Hui, Jia Yuan, Mingmei Cheng, Jin Xie, Xiaoya Zhang, Jian Yang:
Superpoint Network for Point Cloud Oversegmentation. 5490-5499 - Roi Ronen, Yoav Y. Schechner, Eshkol Eytan:
4D Cloud Scattering Tomography. 5500-5509 - Bingli Wu, Jie Ma, Gaojie Chen, Pei An:
Feature Interactive Representation for Point Cloud Registration. 5510-5519 - Federica Arrigoni
, Andrea Fusiello, Elisa Ricci
, Tomás Pajdla:
Viewing Graph Solvability via Cycle Consistency. 5520-5529 - Peidong Liu, Xingxing Zuo
, Viktor Larsson, Marc Pollefeys
:
MBA-VO: Motion Blur Aware Visual Odometry. 5530-5539 - Yuxiang Zhang, Zhe Li
, Liang An
, Mengcheng Li, Tao Yu, Yebin Liu:
Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras. 5540-5549 - Viktor Larsson, Marc Pollefeys
, Magnus Oskarsson
:
Orthographic-Perspective Epipolar Geometry. 5550-5558 - Yaqing Ding
, Daniel Barath, Zuzana Kukelova:
Minimal Solutions for Panoramic Stitching Given Gravity Prior. 5559-5568 - Michael Oechsle, Songyou Peng, Andreas Geiger:
UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction. 5569-5579 - Haitian Zeng
, Yuchao Dai, Xin Yu
, Xiaohan Wang, Yi Yang:
PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-rigid Structure-from-Motion. 5580-5589 - Yi Wei, Shaohui Liu, Yongming Rao, Wang Zhao, Jiwen Lu
, Jie Zhou:
NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo. 5590-5599 - Eduard Ramon, Gil Triginer, Janna Escur, Albert Pumarola, Jaime García, Xavier Giró-i-Nieto, Francesc Moreno-Noguer:
H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction. 5600-5609 - Haitao Yang, Zaiwei Zhang, Siming Yan, Haibin Huang, Chongyang Ma, Yi Zheng, Chandrajit Bajaj, Qixing Huang:
Scene Synthesis via Uncertainty-Driven Attribute Synchronization. 5610-5620 - Nikolai Poliarnyi:
Out-of-Core Surface Reconstruction via Global TGV Minimization. 5621-5630 - Benjamin Ummenhofer, Vladlen Koltun:
Adaptive Surface Reconstruction with Multiscale Convolutional Kernels. 5631-5640 - Haoang Li, Kai Chen
, Pyojin Kim, Kuk-Jin Yoon, Zhe Liu, Kyungdon Joo, Yun-Hui Liu:
Learning Icosahedral Spherical Probability Map Based on Bingham Mixture Model for Vanishing Point Estimation. 5641-5650 - Yael Sde-Chen, Yoav Y. Schechner, Vadim Holodovsky, Eshkol Eytan:
3DeepCT: Learning Volumetric Scattering Tomography of Clouds. 5651-5662 - Donghoon Lee, Onur C. Hamsici, Steven Feng, Prachee Sharma, Thorsten Gernoth:
DeepPRO: Deep Partial Point Cloud Registration of Objects. 5663-5672 - Ji Hou, Saining Xie, Benjamin Graham, Angela Dai, Matthias Nießner:
Pri3D: Can 3D Priors Help 2D Representation Learning? 5673-5682 - Mohammad Amin Shabani, Weilian Song, Makoto Odamaki, Hirochika Fujiki, Yasutaka Furukawa:
Extreme Structure from Motion for Indoor Panoramas without Visual Overlaps. 5683-5691 - Chen Gao, Ayush Saraf, Johannes Kopf, Jia-Bin Huang:
Dynamic View Synthesis from Dynamic Monocular Video. 5692-5701 - Dan Wang
, Xinrui Cui, Xun Chen, Zhengxia Zou, Tianyang Shi, Septimiu E. Salcudean, Z. Jane Wang, Rabab Ward:
Multi-view 3D Reconstruction with Transformers. 5702-5711 - Xinjun Ma, Yue Gong, Qirui Wang, Jingwei Huang, Lei Chen, Fan Yu:
EPP-MVSNet: Epipolar-assembling based Depth Prediction for Multi-view Stereo. 5712-5720 - Chen-Hsuan Lin, Wei-Chiu Ma, Antonio Torralba, Simon Lucey
:
BARF: Bundle-Adjusting Neural Radiance Fields. 5721-5731 - Alex Yu, Ruilong Li, Matthew Tancik, Hao Li, Ren Ng, Angjoo Kanazawa:
PlenOctrees for Real-time Rendering of Neural Radiance Fields. 5732-5741 - Atsuhiro Noguchi, Xiao Sun, Stephen Lin, Tatsuya Harada:
Neural Articulated Radiance Field. 5742-5752 - Steven Liu, Xiuming Zhang, Zhoutong Zhang, Richard Zhang, Jun-Yan Zhu, Bryan Russell:
Editing Conditional Radiance Fields. 5753-5763 - Yudong Guo, Keyu Chen, Sen Liang, Yong-Jin Liu, Hujun Bao, Juyong Zhang:
AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis. 5764-5774 - Xiangyu Xu
, Enrique Dunn
:
GTT-Net: Learned Generalized Trajectory Triangulation. 5775-5784 - Xingkui Wei, Zhengqing Chen, Yanwei Fu
, Zhaopeng Cui, Yinda Zhang:
Deep Hybrid Self-Prior for Full 3D Mesh Generation. 5785-5794 - Qixing Huang, Xiangru Huang, Bo Sun, Zaiwei Zhang, Junfeng Jiang, Chandrajit Bajaj:
ARAPReg: An As-Rigid-As Possible Regularization Loss for Learning Deformable Shape Generators. 5795-5805 - Linqi Zhou, Yilun Du, Jiajun Wu:
3D Shape Generation and Completion through Point-Voxel Diffusion. 5806-5815 - Dominic Roberts, Ara Danielyan, Hang Chu, Mani Golparvar Fard, David A. Forsyth:
LSD-StructureNet: Modeling Levels of Structural Detail in 3D Part Hierarchies. 5816-5825 - Yoonwoo Jeong, Seokjun Ahn, Christopher B. Choy, Animashree Anandkumar, Minsu Cho, Jaesik Park:
Self-Calibrating Neural Radiance Fields. 5826-5834 - Jonathan T. Barron, Ben Mildenhall, Matthew Tancik, Peter Hedman, Ricardo Martin-Brualla, Pratul P. Srinivasan:
Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields. 5835-5844 - Keunhong Park, Utkarsh Sinha, Jonathan T. Barron, Sofien Bouaziz, Dan B. Goldman, Steven M. Seitz, Ricardo Martin-Brualla:
Nerfies: Deformable Neural Radiance Fields. 5845-5854 - Peter Hedman, Pratul P. Srinivasan, Ben Mildenhall, Jonathan T. Barron, Paul E. Debevec:
Baking Neural Radiance Fields for Real-Time View Synthesis. 5855-5864 - Ajay Jain, Matthew Tancik, Pieter Abbeel:
Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis. 5865-5874 - Xinyi Li, Haibin Ling:
PoGO-Net: Pose Graph Optimization with Graph Neural Networks. 5875-5885 - José Pedro Iglesias, Carl Olsson:
Radial Distortion Invariant Factorization for Structure from Motion. 5886-5895 - Duo Chen
, Zixin Tang, Zhenyu Xu, Yunan Zheng, Yiguang Liu:
Gaussian Fusion: Accurate 3D Reconstruction via Geometry-Guided Displacement Interpolation. 5896-5905 - Heng Yang, Chris Doran, Jean-Jacques E. Slotine:
Dynamical Pose Estimation. 5906-5915 - Snehal Bhayani, Torsten Sattler, Daniel Barath, Patrik Beliansky, Janne Heikkilä, Zuzana Kukelova:
Calibrated and Partially Calibrated Semi-Generalized Homographies. 5916-5925 - Mo Shan, Qiaojun Feng
, You-Yi Jau, Nikolay Atanasov:
ELLIPSDF: Joint Object Pose and Shape Optimization with a Bi-level Ellipsoid and Signed Distance Function Description. 5926-5935 - Kaizhang Kang, Cihui Xie, Ruisheng Zhu, Xiaohe Ma, Ping Tan, Hongzhi Wu, Kun Zhou:
Learning Efficient Photometric Feature Transform for Multi-view Stereo. 5936-5945 - Wen-Cheng Chen, Min-Chun Hu, Chu-Song Chen:
STR-GQN: Scene Representation and Rendering for Unknown Cameras Based on Spatial Transformation Routing. 5946-5955 - Dror Moran, Hodaya Koslowsky, Yoni Kasten, Haggai Maron, Meirav Galun, Ronen Basri:
Deep Permutation Equivariant Structure from Motion. 5956-5966 - Philipp Lindenberger, Paul-Edouard Sarlin, Viktor Larsson, Marc Pollefeys
:
Pixel-Perfect Structure-from-Motion with Featuremetric Refinement. 5967-5977 - Kejie Li, Daniel DeTone, Steven Chen, Minh Vo, Ian Reid, Hamid Rezatofighi, Chris Sweeney, Julian Straub, Richard A. Newcombe:
ODAM: Object Detection, Association, and Mapping using Posed RGB Video. 5978-5988 - Brevin Tilmon, Sanjeev J. Koppal:
SaccadeCam: Adaptive Visual Attention for Monocular Depth Sensing. 5989-5998 - Yifan Zhu, Jiaxiong Qiu, Bo Ren:
Transfusion: A Novel SLAM Method Focused on Transparent Objects. 5999-6008 - Wenzheng Song, Masanori Suganuma, Xing Liu, Noriyuki Shimobayashi, Daisuke Maruta, Takayuki Okatani:
Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes. 6009-6018 - Mohammad Mahdi Johari, Camilla Carta, François Fleuret:
DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation. 6019-6028 - Liangchen Song, Jialian Wu, Ming Yang
, Qian Zhang, Yuan Li, Junsong Yuan:
Stacked Homography Transformations for Multi-View Pedestrian Detection. 6029-6037 - Mihai Dusmanu, Ondrej Miksik, Johannes L. Schönberger, Marc Pollefeys
:
Cross-Descriptor Visual Localization and Mapping. 6038-6047 - Banglei Guan
, Ji Zhao
, Daniel Barath, Friedrich Fraundorfer:
Minimal Cases for Computing the Generalized Relative Pose using Affine Correspondences. 6048-6057 - Hongbin Xu
, Zhipeng Zhou, Yali Wang, Wenxiong Kang, Baigui Sun, Hao Li, Yu Qiao
:
Digging into Uncertainty in Self-supervised Multi-view Stereo. 6058-6067 - Forrester Cole, Kyle Genova, Avneesh Sud, Daniel Vlasic, Zhoutong Zhang:
Differentiable Surface Rendering via Non-Differentiable Sampling. 6068-6077 - Le Hui, Hang Yang, Mingmei Cheng, Jin Xie, Jian Yang:
Pyramid Point Cloud Transformer for Large-Scale Place Recognition. 6078-6087 - Sérgio Agostinho
, Aljosa Osep, Alessio Del Bue
, Laura Leal-Taixé:
(Just) A Spoonful of Refinements Helps the Registration Error Go Down. 6088-6097 - Runsong Zhu, Yuan Liu, Zhen Dong, Yuan Wang, Tengping Jiang
, Wenping Wang, Bisheng Yang:
AdaFit: Rethinking Learning-based Normal Estimation on Point Clouds. 6098-6107 - Haobo Jiang, Yaqi Shen, Jin Xie, Jun Li, Jianjun Qian, Jian Yang:
Sampling Network Guided Cross-Entropy Method for Unsupervised Point Cloud Registration. 6108-6117 - Zhi Deng, Yuxin Yao
, Bailin Deng, Juyong Zhang:
A Robust Loss for Point Cloud Registration. 6118-6127 - Jian Gao
, Jin Liu, Shunping Ji:
Rational Polynomial Camera Model Warping for Deep Learning Based Satellite Multi-View Stereo Matching. 6128-6137 - Jae Yong Lee, Joseph DeGol, Chuhang Zou, Derek Hoiem:
PatchMatch-RL: Deep MVS with Pixelwise Depth, Normal, and Visibility. 6138-6147 - Wang Zhao, Shaohui Liu, Yi Wei, Hengkai Guo, Yong-Jin Liu:
A Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view Stereo. 6148-6157 - Taekyung Kim
, Jaehoon Choi, Seokeon Choi, Dongki Jung, Changick Kim:
Just a Few Points are All You Need for Multi-view Stereo: A Novel Semi-supervised Learning Method for Multi-view Stereo. 6158-6166 - Zizhuang Wei, Qingtian Zhu, Chen Min, Yisong Chen, Guoping Wang:
AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network. 6167-6176 - Zhaoshuo Li, Xingtong Liu
, Nathan Drenkow, Andy S. Ding, Francis X. Creighton, Russell H. Taylor, Mathias Unberath:
Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers. 6177-6186 - Wei Jiang, Eduard Trulls, Jan Hosang, Andrea Tagliasacchi, Kwang Moo Yi:
COTR: Correspondence Transformer for Matching Across Images. 6187-6197 - Eric Brachmann, Martin Humenberger, Carsten Rother, Torsten Sattler:
On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation. 6198-6208 - Edgar Sucar, Shikun Liu, Joseph Ortiz, Andrew J. Davison:
iMAP: Implicit Mapping and Positioning in Real-Time. 6209-6218