


Hongsheng Li 0001
Person information

- affiliation: Chinese University of Hong Kong, Department of Electrical Engineering, CUHK-SenseTime Joint Laboratory, Hong Kong
- affiliation (former): Lehigh University, Department of Computer Science and Engineering, PA, USA
Other persons with the same name
- Hongsheng Li — disambiguation page
2020 – today
- 2023
- [j42]Shaoshuai Shi, Li Jiang, Jiajun Deng, Zhe Wang, Chaoxu Guo, Jianping Shi, Xiaogang Wang, Hongsheng Li:
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection. Int. J. Comput. Vis. 131(2): 531-551 (2023) - [j41]Jiageng Mao, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li:
3D Object Detection for Autonomous Driving: A Comprehensive Survey. Int. J. Comput. Vis. 131(8): 1909-1963 (2023) - [j40]Peipei Zhao, Qiguang Miao, Hongsheng Li, Ruyi Liu, Yi-Ning Quan, Jianfeng Song:
Refined probability distribution module for fine-grained visual categorization. Neurocomputing 518: 533-544 (2023) - [j39]Jihan Yang, Shaoshuai Shi, Zhe Wang, Hongsheng Li, Xiaojuan Qi:
ST3D++: Denoised Self-Training for Unsupervised Domain Adaptation on 3D Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6354-6371 (2023) - [j38]Jianbo Liu, Junjun He, Yuanjie Zheng, Shuai Yi, Xiaogang Wang, Hongsheng Li:
A Holistically-Guided Decoder for Deep Representation Learning With Applications to Semantic Segmentation and Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 11390-11406 (2023) - [j37]Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unifying Convolution and Self-Attention for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12581-12600 (2023) - [c140]Jihao Liu, Xin Huang, Jinliang Zheng, Yu Liu, Hongsheng Li:
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers. CVPR 2023: 6252-6261 - [c139]Hao Shao, Letian Wang, Ruobing Chen, Steven L. Waslander, Hongsheng Li, Yu Liu:
ReasonNet: End-to-End Driving with Temporal and Global Reasoning. CVPR 2023: 13723-13733 - [c138]Junjie Ni, Yijin Li, Zhaoyang Huang, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang:
PATS: Patch Area Transportation with Subdivision for Local Feature Matching. CVPR 2023: 17776-17786 - [i183]Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation. CoRR abs/2303.01237 (2023) - [i182]Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li:
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation. CoRR abs/2303.01503 (2023) - [i181]Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao:
Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners. CoRR abs/2303.02151 (2023) - [i180]Peng Gao, Renrui Zhang, Rongyao Fang, Ziyi Lin, Hongyang Li, Hongsheng Li, Qiao Yu:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. CoRR abs/2303.05475 (2023) - [i179]Junjie Ni, Yijin Li, Zhaoyang Huang, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang:
PATS: Patch Area Transportation with Subdivision for Local Feature Matching. CoRR abs/2303.07700 (2023) - [i178]Yijin Li, Zhaoyang Huang, Shuo Chen, Xiaoyu Shi, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang:
BlinkFlow: A Dataset to Push the Limits of Event-based Optical Flow Estimation. CoRR abs/2303.07716 (2023) - [i177]Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi:
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis. CoRR abs/2303.08134 (2023) - [i176]Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation. CoRR abs/2303.08340 (2023) - [i175]Jihao Liu, Tai Wang, Boxiao Liu, Qihang Zhang, Yu Liu, Hongsheng Li:
Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding. CoRR abs/2303.11325 (2023) - [i174]Xiaoshi Wu, Feng Zhu, Rui Zhao, Hongsheng Li:
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching. CoRR abs/2303.13076 (2023) - [i173]Xiaoshi Wu, Keqiang Sun, Feng Zhu, Rui Zhao, Hongsheng Li:
Better Aligning Text-to-Image Models with Human Preference. CoRR abs/2303.14420 (2023) - [i172]Renrui Zhang, Jiaming Han, Aojun Zhou, Xiangfei Hu, Shilin Yan, Pan Lu, Hongsheng Li, Peng Gao, Yu Qiao:
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention. CoRR abs/2303.16199 (2023) - [i171]Zhuofan Zong, Dongzhi Jiang, Guanglu Song, Zeyue Xue, Jingyong Su, Hongsheng Li, Yu Liu:
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction. CoRR abs/2304.00967 (2023) - [i170]Jingqiu Zhou, Linjiang Huang, Liang Wang, Si Liu, Hongsheng Li:
Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels. CoRR abs/2304.07978 (2023) - [i169]Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao:
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model. CoRR abs/2304.15010 (2023) - [i168]Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Peng Gao, Hongsheng Li:
Personalize Segment Anything Model with One Shot. CoRR abs/2305.03048 (2023) - [i167]Hao Shao, Letian Wang, Ruobing Chen, Steven L. Waslander, Hongsheng Li, Yu Liu:
ReasonNet: End-to-End Driving with Temporal and Global Reasoning. CoRR abs/2305.10507 (2023) - [i166]Fu-Yun Wang, Wenshuo Chen, Guanglu Song, Han-Jia Ye, Yu Liu, Hongsheng Li:
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising. CoRR abs/2305.18264 (2023) - [i165]Xiaoliang Ju, Zhaoyang Huang, Yijin Li, Guofeng Zhang, Yu Qiao, Hongsheng Li:
DiffRoom: Diffusion-based High-Quality 3D Room Reconstruction and Generation with Occupancy Prior. CoRR abs/2306.00519 (2023) - [i164]Zeqiang Lai, Yuchen Duan, Jifeng Dai, Ziheng Li, Ying Fu, Hongsheng Li, Yu Qiao, Wenhai Wang:
Denoising Diffusion Semantic Segmentation with Mask Prior Modeling. CoRR abs/2306.01721 (2023) - [i163]Changyao Tian, Chenxin Tao, Jifeng Dai, Hao Li, Ziheng Li, Lewei Lu, Xiaogang Wang, Hongsheng Li, Gao Huang, Xizhou Zhu:
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process. CoRR abs/2306.05423 (2023) - [i162]Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
JourneyDB: A Benchmark for Generative Image Understanding. CoRR abs/2307.00716 (2023) - [i161]Fan Lu, Yan Xu, Guang Chen, Hongsheng Li, Kwan-Yee Lin, Changjun Jiang:
Urban Radiance Field Representation with Deformable Neural Mesh Primitives. CoRR abs/2307.10776 (2023) - [i160]Yiyuan Zhang, Kaixiong Gong, Kaipeng Zhang, Hongsheng Li, Yu Qiao, Wanli Ouyang, Xiangyu Yue:
Meta-Transformer: A Unified Framework for Multimodal Learning. CoRR abs/2307.10802 (2023) - [i159]Aojun Zhou, Ke Wang, Zimu Lu, Weikang Shi, Sichun Luo, Zipeng Qin, Shaoqing Lu, Anya Jia, Linqi Song, Mingjie Zhan, Hongsheng Li:
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification. CoRR abs/2308.07921 (2023) - 2022
- [j36]Yiwei Yang, Rui Huang, Guofeng Lv, Zhiqiang Hu, Guoping Shan, Jie Zhang, Xue Bai, Peng Liu, Hongsheng Li, Ming Chen:
Automatic segmentation of the clinical target volume and organs at risk for rectal cancer radiotherapy using structure-contextual representations based on 3D high-resolution network. Biomed. Signal Process. Control. 73: 103362 (2022) - [j35]Dasong Li, Yi Zhang, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
Efficient Burst Raw Denoising with Variance Stabilization and Multi-frequency Denoising Network. Int. J. Comput. Vis. 130(8): 2060-2080 (2022) - [j34]Qian Da, Xiaodi Huang, Zhongyu Li, Yanfei Zuo, Chenbin Zhang, Jingxin Liu, Wen Chen, Jiahui Li, Dou Xu, Zhiqiang Hu, Hongmei Yi, Yan Guo, Zhe Wang, Ling Chen, Li Zhang, Xianying He, Xiaofan Zhang, Ke Mei, Chuang Zhu, Weizeng Lu, Linlin Shen, Jun Shi, Jun Li, Sreehari S, Ganapathy Krishnamurthi, Jiangcheng Yang, Tiancheng Lin, Qingyu Song, Xuechen Liu, Simon Graham, Raja Muhammad Saad Bashir, Canqian Yang, Shaofei Qin, Xinmei Tian, Baocai Yin, Jie Zhao, Dimitris N. Metaxas, Hongsheng Li, Chaofu Wang, Shaoting Zhang:
DigestPath: A benchmark dataset with challenge review for the pathological detection and segmentation of digestive-system. Medical Image Anal. 80: 102485 (2022) - [j33]Yuanjie Zheng, Xiaodan Sui, Yanyun Jiang, Tongtong Che, Shaoting Zhang, Jie Yang, Hongsheng Li:
SymReg-GAN: Symmetric Image Registration With Generative Adversarial Networks. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 5631-5646 (2022) - [j32]Xinge Zhu, Hui Zhou, Tai Wang, Fangzhou Hong, Wei Li, Yuexin Ma, Hongsheng Li, Ruigang Yang, Dahua Lin:
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-Based Perception. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6807-6822 (2022) - [j31]Yan Xu, Junyi Lin, Jianping Shi, Guofeng Zhang, Xiaogang Wang, Hongsheng Li:
Robust Self-Supervised LiDAR Odometry Via Representative Structure Discovery and 3D Inherent Error Modeling. IEEE Robotics Autom. Lett. 7(2): 1651-1658 (2022) - [j30]Linjiang Huang, Liang Wang, Hongsheng Li:
Multi-Modality Self-Distillation for Weakly Supervised Temporal Action Localization. IEEE Trans. Image Process. 31: 1504-1519 (2022) - [j29]Zhaoyang Huang, Xiaokun Pan, Weihong Pan, Weikang Bian, Yan Xu, Ka Chun Cheung, Guofeng Zhang, Hongsheng Li:
NeuralMarker: A Framework for Learning General Marker Correspondence. ACM Trans. Graph. 41(6): 271:1-271:10 (2022) - [c137]Lin Ma, Weiming Li, Hongsheng Li, Qiang Wang, Ji-Yeon Kim:
Task Generalizable Spatial and Texture Aware Image Downsizing Network. BMVC 2022: 315 - [c136]Teli Ma, Shijie Geng, Mengmeng Wang, Sheng Xu, Hongsheng Li, Baochang Zhang, Peng Gao, Yu Qiao:
Unleashing the Potential of Vision-Language Models for Long-Tailed Visual Recognition. BMVC 2022: 481 - [c135]Hao Shao, Letian Wang, Ruobing Chen, Hongsheng Li, Yu Liu:
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer. CoRL 2022: 726-737 - [c134]Hao Li, Tianwen Fu, Jifeng Dai, Hongsheng Li, Gao Huang, Xizhou Zhu:
AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks. CVPR 2022: 999-1008 - [c133]Haiyang Wang, Shaoshuai Shi, Ze Yang, Rongyao Fang, Qi Qian, Hongsheng Li, Bernt Schiele, Liwei Wang:
RBGNet: Ray-based Grouping for 3D Object Detection. CVPR 2022: 1100-1109 - [c132]Yi Zhang, Dasong Li, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
IDR: Self-Supervised Image Denoising via Iterative Data Refinement. CVPR 2022: 2088-2097 - [c131]Linjiang Huang, Liang Wang, Hongsheng Li:
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation. CVPR 2022: 3262-3271 - [c130]Yingjie Cai, Kwan-Yee Lin, Chao Zhang, Qiang Wang, Xiaogang Wang, Hongsheng Li:
Learning a Structured Latent Space for Unsupervised Point Cloud Completion. CVPR 2022: 5533-5543 - [c129]Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CVPR 2022: 8542-8552 - [c128]Yan Xu, Kwan-Yee Lin, Guofeng Zhang, Xiaogang Wang, Hongsheng Li:
RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization. CVPR 2022: 14860-14870 - [c127]Xizhou Zhu, Jinguo Zhu, Hao Li, Xiaoshi Wu, Hongsheng Li, Xiaohua Wang, Jifeng Dai:
Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks. CVPR 2022: 16783-16794 - [c126]Jihao Liu, Xin Huang, Guanglu Song, Hongsheng Li, Yu Liu:
UniNet: Unified Architecture Search with Convolution, Transformer, and MLP. ECCV (21) 2022: 33-49 - [c125]Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martínez:
EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers. ECCV (11) 2022: 294-311 - [c124]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. ECCV (35) 2022: 388-404 - [c123]Jihao Liu, Boxiao Liu, Hang Zhou, Hongsheng Li, Yu Liu:
TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers. ECCV (26) 2022: 455-471 - [c122]Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification. ECCV (35) 2022: 493-510 - [c121]Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer: A Transformer Architecture for Optical Flow. ECCV (17) 2022: 668-685 - [c120]Xuesong Chen, Shaoshuai Shi, Benjin Zhu, Ka Chun Cheung, Hang Xu, Hongsheng Li:
MPPNet: Multi-frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection. ECCV (8) 2022: 680-697 - [c119]Manyuan Zhang, Guanglu Song, Yu Liu, Hongsheng Li:
Towards Robust Face Recognition with Comprehensive Search. ECCV (12) 2022: 720-736 - [c118]Dasong Li, Yi Zhang, Ka Chun Cheung, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
Learning Degradation Representations for Image Deblurring. ECCV (18) 2022: 736-753 - [c117]Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning. ICLR 2022 - [c116]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
MCMAE: Masked Convolution Meets Masked Autoencoders. NeurIPS 2022 - [c115]Junting Pan, Ziyi Lin, Xiatian Zhu, Jing Shao, Hongsheng Li:
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning. NeurIPS 2022 - [c114]Keqiang Sun, Shangzhe Wu, Zhaoyang Huang, Ning Zhang, Quan Wang, Hongsheng Li:
Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields. NeurIPS 2022 - [c113]Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li:
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. NeurIPS 2022 - [c112]Jinguo Zhu, Xizhou Zhu, Wenhai Wang, Xiaohua Wang, Hongsheng Li, Xiaogang Wang, Jifeng Dai:
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs. NeurIPS 2022 - [i158]Zipeng Qin, Jianbo Liu, Xiaolin Zhang, Maoqing Tian, Aojun Zhou, Shuai Yi, Hongsheng Li:
Pyramid Fusion Transformer for Semantic Segmentation. CoRR abs/2201.04019 (2022) - [i157]Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning. CoRR abs/2201.04676 (2022) - [i156]Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unifying Convolution and Self-attention for Visual Recognition. CoRR abs/2201.09450 (2022) - [i155]Kexue Fu, Peng Gao, Renrui Zhang, Hongsheng Li, Yu Qiao, Manning Wang:
Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning. CoRR abs/2202.04241 (2022) - [i154]Jihao Liu, Boxiao Liu, Hongsheng Li, Yu Liu:
Meta Knowledge Distillation. CoRR abs/2202.07940 (2022) - [i153]Yan Xu, Junyi Lin, Jianping Shi, Guofeng Zhang, Xiaogang Wang, Hongsheng Li:
Robust Self-Supervised LiDAR Odometry via Representative Structure Discovery and 3D Inherent Error Modeling. CoRR abs/2202.13353 (2022) - [i152]Linjiang Huang, Liang Wang, Hongsheng Li:
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation. CoRR abs/2203.02925 (2022) - [i151]Fangzhou Hong, Hui Zhou, Xinge Zhu, Hongsheng Li, Ziwei Liu:
LiDAR-based 4D Panoptic Segmentation via Dynamic Shifting Network. CoRR abs/2203.07186 (2022) - [i150]Yan Xu, Kwan-Yee Lin, Guofeng Zhang, Xiaogang Wang, Hongsheng Li:
RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization. CoRR abs/2203.12870 (2022) - [i149]Renrui Zhang, Han Qiu, Tai Wang, Xuanzhuo Xu, Ziyu Guo, Yu Qiao, Peng Gao, Hongsheng Li:
MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection. CoRR abs/2203.13310 (2022) - [i148]Yingjie Cai, Kwan-Yee Lin, Chao Zhang, Qiang Wang, Xiaogang Wang, Hongsheng Li:
Learning a Structured Latent Space for Unsupervised Point Cloud Completion. CoRR abs/2203.15580 (2022) - [i147]Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer: A Transformer Architecture for Optical Flow. CoRR abs/2203.16194 (2022) - [i146]Haiyang Wang, Shaoshuai Shi, Ze Yang, Rongyao Fang, Qi Qian, Hongsheng Li, Bernt Schiele, Liwei Wang:
RBGNet: Ray-based Grouping for 3D Object Detection. CoRR abs/2204.02251 (2022) - [i145]Siming Fan, Jingtan Piao, Chen Qian, Kwan-Yee Lin, Hongsheng Li:
Simulating Fluids in Real-World Still Images. CoRR abs/2204.11335 (2022) - [i144]Wei Cheng, Su Xu, Jingtan Piao, Chen Qian, Wayne Wu, Kwan-Yee Lin, Hongsheng Li:
Generalizable Neural Performer: Learning Robust Radiance Fields for Human Novel View Synthesis. CoRR abs/2204.11798 (2022) - [i143]Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martínez:
EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers. CoRR abs/2205.03436 (2022) - [i142]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
ConvMAE: Masked Convolution Meets Masked Autoencoders. CoRR abs/2205.03892 (2022) - [i141]Dasong Li, Yi Zhang, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
Efficient Burst Raw Denoising with Variance Stabilization and Multi-frequency Denoising Network. CoRR abs/2205.04721 (2022) - [i140]Xuesong Chen, Shaoshuai Shi, Benjin Zhu, Ka Chun Cheung, Hang Xu, Hongsheng Li:
MPPNet: Multi-Frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection. CoRR abs/2205.05979 (2022) - [i139]Jihao Liu, Xin Huang, Yu Liu, Hongsheng Li:
MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning. CoRR abs/2205.13137 (2022) - [i138]Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li:
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. CoRR abs/2205.14401 (2022) - [i137]Jinguo Zhu, Xizhou Zhu, Wenhai Wang, Xiaohua Wang, Hongsheng Li, Xiaogang Wang, Jifeng Dai:
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs. CoRR abs/2206.04674 (2022) - [i136]Keqiang Sun, Shangzhe Wu, Zhaoyang Huang, Ning Zhang, Quan Wang, Hongsheng Li:
Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields. CoRR abs/2206.08361 (2022) - [i135]Jiageng Mao, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li:
3D Object Detection for Autonomous Driving: A Review and New Outlooks. CoRR abs/2206.09474 (2022) - [i134]Dasong Li, Xiaoyu Shi, Yi Zhang, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
No Attention is Needed: Grouped Spatial-temporal Shift for Simple and Efficient Video Restorers. CoRR abs/2206.10810 (2022) - [i133]Junting Pan, Ziyi Lin, Xiatian Zhu, Jing Shao, Hongsheng Li:
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning for Action Recognition. CoRR abs/2206.13559 (2022) - [i132]Jihao Liu, Xin Huang, Guanglu Song, Yu Liu, Hongsheng Li:
UniNet: Unified Architecture Search with Convolution, Transformer, and MLP. CoRR abs/2207.05420 (2022) - [i131]Jihao Liu, Boxiao Liu, Hang Zhou, Hongsheng Li, Yu Liu:
TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers. CoRR abs/2207.08409 (2022) - [i130]Renrui Zhang, Zhang Wei, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification. CoRR abs/2207.09519 (2022) - [i129]Hao Shao, Letian Wang, Ruobing Chen, Hongsheng Li, Yu Liu:
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer. CoRR abs/2207.14024 (2022) - [i128]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. CoRR abs/2208.03550 (2022) - [i127]Dasong Li, Yi Zhang, Ka Chun Cheung, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
Learning Degradation Representations for Image Deblurring. CoRR abs/2208.05244 (2022) - [i126]Manyuan Zhang, Guanglu Song, Yu Liu, Hongsheng Li:
Towards Robust Face Recognition with Comprehensive Search. CoRR abs/2208.13600 (2022) - [i125]Zhe Wang, Hongsheng Li, Qinwei Zhang, Jing Yuan, Xiaogang Wang:
Magnetic Resonance Fingerprinting with compressed sensing and distance metric learning. CoRR abs/2209.08734 (2022) - [i124]Zhaoyang Huang, Xiaokun Pan, Weihong Pan, Weikang Bian, Yan Xu, Ka Chun Cheung, Guofeng Zhang, Hongsheng Li:
NeuralMarker: A Framework for Learning General Marker Correspondence. CoRR abs/2209.08896 (2022) - [i123]Renrui Zhang, Hanqiu Deng, Bohao Li, Wei Zhang, Hao Dong, Hongsheng Li, Peng Gao, Yu Qiao:
Collaboration of Pre-trained Models Makes Better Few-shot Learner. CoRR abs/2209.12255 (2022) - [i122]Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao:
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions. CoRR abs/2211.05778 (2022) - [i121]Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai:
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks. CoRR abs/2211.09808 (2022) - [i120]Linjiang Huang, Kaixin Lu, Guanglu Song, Liang Wang, Si Liu, Yu Liu, Hongsheng Li:
Teach-DETR: Better Training DETR with Teachers. CoRR abs/2211.11953 (2022) - [i119]Keqiang Sun, Shangzhe Wu, Ning Zhang, Zhaoyang Huang, Quan Wang, Hongsheng Li:
CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields. CoRR abs/2211.13251 (2022) - [i118]Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li:
Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders. CoRR abs/2212.06785 (2022) - 2021
- [j28]