Lu Yuan
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
2020 – today
- 2024
- [j40]Wenyang Zhou, Lu Yuan, Taijiang Mu:
Multi3D: 3D-aware multimodal image synthesis. Comput. Vis. Media 10(6): 1205-1217 (2024)
- [j39]Jiapeng Yang, Lei Shi, Tielin Lu, Lu Yuan, Nanchang Cheng, Xiaohui Yang, Jia Luo, Mingying Xu:
A Positive Sample Enhancement Algorithm with Fuzzy Nearest Neighbor Hybridization for Imbalance Data. Int. J. Fuzzy Syst. 26(8): 2707-2725 (2024)
- [j38]Yan-Bing Huang, Li Lin, Xin-Yu Li, Bo-Zhu Chen, Lu Yuan, Hui Zheng:
An indirect treatment comparison meta-analysis of digital versus face-to-face cognitive behavior therapy for headache. npj Digit. Medicine 7(1) (2024)
- [j37]Fuhui Zhou, Yihao Li, Ming Xu, Lu Yuan, Qihui Wu, Rose Qingyang Hu, Naofal Al-Dhahir:
Cognitive Semantic Communication Systems Driven by Knowledge Graph: Principle, Implementation, and Performance Evaluation. IEEE Trans. Commun. 72(1): 193-208 (2024)
- [j36]Chunyu Liu, Wei Wu, Siyu Wu, Lu Yuan, Rui Ding, Fuhui Zhou, Qihui Wu:
Social-Enhanced Explainable Recommendation With Knowledge Graph. IEEE Trans. Knowl. Data Eng. 36(2): 840-853 (2024)
- [j35]Hezhen Hu, Xiaoyi Dong, Jianmin Bao, Dongdong Chen, Lu Yuan, Dong Chen, Houqiang Li:
PersonMAE: Person Re-Identification Pre-Training With Masked AutoEncoders. IEEE Trans. Multim. 26: 10029-10040 (2024)
- [j34]Wen-Yang Zhou, Lu Yuan, Shu-Yu Chen, Lin Gao, Shi-Min Hu:
LC-NeRF: Local Controllable Face Generation in Neural Radiance Field. IEEE Trans. Vis. Comput. Graph. 30(8): 5437-5448 (2024)
- [c114]James Hong, Lu Yuan, Michaël Gharbi, Matthew Fisher, Kayvon Fatahalian:
Learning Subject-Aware Cropping by Outpainting Professional Photos. AAAI 2024: 2175-2183
- [c113]Bin Xiao, Haiping Wu, Weijian Xu, Xiyang Dai, Houdong Hu, Yumao Lu, Michael Zeng, Ce Liu, Lu Yuan:
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks. CVPR 2024: 4818-4829
- [c112]Junke Wang, Dongdong Chen, Chong Luo, Bo He, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang:
OmniViD: A Generative Framework for Universal Video Understanding. CVPR 2024: 18209-18220
- [c111]Xiaopei Hu, Guixin Zhao, Lu Yuan, Xiangjun Dong, Aimei Dong:
Multi-scale Convolutional Attention Fuzzy Broad Network for Few-Shot Hyperspectral Image Classification. ICANN (2) 2024: 46-60
- [c110]Lu Yuan, Jiyan Sun, Shangyuan Zhuang, Yinlong Liu, Liru Geng, Jing Zou, Peizhe Xin, Weiqing Huang, Wei Ma:
Manticore: An Unsupervised Intrusion Detection System Based on Contrastive Learning in 5G Networks. ICASSP 2024: 4705-4709
- [c109]Zhaorui Guo, Jiyan Sun, Jiadong Fu, Lu Yuan, Shangyuan Zhuang, Liru Geng, Yinlong Liu, Wei Ma:
Fast and Accurate Root Cause Analysis Based on Signalling Messages for 5G Networks. ICASSP 2024: 9276-9280
- [c108]Xi Song, Lu Yuan, Zhibo Qu, Fuhui Zhou, Qihui Wu, Tony Q. S. Quek, Rose Qingyang Hu:
Knowledge Graph Driven UAV Cognitive Semantic Communication Systems for Efficient Object Detection. ICC 2024: 1685-1690
- [c107]Lu Yuan, Fuhui Zhou, Qihui Wu, Derrick Wing Kwan Ng:
Channel Prediction-Enhanced Intelligent Resource Allocation for Dynamic Spectrum-Sharing Networks. ICC 2024: 2767-2772
- [c106]Yike Li, Lu Yuan, Fuhui Zhou, Qihui Wu, Naofal Al-Dhahir, Kai-Kit Wong:
KGAMC: A Novel Knowledge Graph Driven Automatic Modulation Classification Scheme. ICC 2024: 4857-4862
- [c105]Lu Yuan, Jiyan Sun, Shangyuan Zhuang, Yinlong Liu, Liru Geng, Wei Ma:
CoSen-IDS: A Novel Cost-Sensitive Intrusion Detection System on Imbalanced Data in 5G Networks. ICIC (8) 2024: 470-481
- [c104]Xu Ma, Xiyang Dai, Jianwei Yang, Bin Xiao, Yinpeng Chen, Yun Fu, Lu Yuan:
Efficient Modulation for Vision Networks. ICLR 2024
- [c103]Lei Shi, Jiapeng Yang, Pengtao Lv, Lu Yuan, Feifei Kou, Jia Luo, Mingying Xu:
Self-derived Knowledge Graph Contrastive Learning for Recommendation. ACM Multimedia 2024: 7571-7580
- [c102]Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Xuemei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data. NAACL-HLT (Findings) 2024: 1615-1627
- [c101]Vishnu Sarukkai, Lu Yuan, Mia Tang, Maneesh Agrawala, Kayvon Fatahalian:
Block and Detail: Scaffolding Sketch-to-Image Generation. UIST 2024: 33:1-33:13
- [i123]Vishnu Sarukkai, Lu Yuan, Mia Tang, Maneesh Agrawala, Kayvon Fatahalian:
Block and Detail: Scaffolding Sketch-to-Image Generation. CoRR abs/2402.18116 (2024)
- [i122]Lingting Zhu, Noel Codella, Dongdong Chen, Zhenchao Jin, Lu Yuan, Lequan Yu:
Generative Enhancement for 3D Medical Images. CoRR abs/2403.12852 (2024)
- [i121]Junke Wang, Dongdong Chen, Chong Luo, Bo He, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang:
OmniVid: A Generative Framework for Universal Video Understanding. CoRR abs/2403.17935 (2024)
- [i120]Xu Ma, Xiyang Dai, Jianwei Yang, Bin Xiao, Yinpeng Chen, Yun Fu, Lu Yuan:
Efficient Modulation for Vision Networks. CoRR abs/2403.19963 (2024)
- [i119]Yuanze Lin, Yunsheng Li, Dongdong Chen, Weijian Xu, Ronald Clark, Philip Torr, Lu Yuan:
Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge. CoRR abs/2407.04681 (2024)
- [i118]Wan-Cyuan Fan, Yen-Chun Chen, Mengchen Liu, Lu Yuan, Leonid Sigal:
On Pre-training of Multimodal Language Models Customized for Chart Understanding. CoRR abs/2407.14506 (2024)
- [i117]Xuelu Feng, Yunsheng Li, Dongdong Chen, Chunming Qiao, Junsong Yuan, Lu Yuan, Gang Hua:
Pluralistic Salient Object Detection. CoRR abs/2409.02368 (2024)
- 2023
- [j33]Zitong Zhang, Lu Yuan, Fuhui Zhou, Qihui Wu:
Data-and-Knowledge Dual-Driven Radio Frequency Fingerprint Identification. IEEE Internet Things J. 10(13): 11944-11945 (2023)
- [j32]Yuwei Fan, Lei Shi, Lu Yuan:
Topic modeling methods for short texts: A survey. J. Intell. Fuzzy Syst. 45(2): 1971-1990 (2023)
- [j31]Lu Yuan, Hangshun Jiang, Hao Shen, Lei Shi, Nanchang Cheng:
Sustainable Development of Information Dissemination: A Review of Current Fake News Detection Research and Practice. Syst. 11(9): 458 (2023)
- [c100]Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu, Baining Guo:
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers. AAAI 2023: 552-560
- [c99]Wan-Cyuan Fan, Yen-Chun Chen, Dongdong Chen, Yu Cheng, Lu Yuan, Yu-Chiang Frank Wang:
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis. AAAI 2023: 579-587
- [c98]Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code: An Integrative and Composable Multimodal Learning Framework. AAAI 2023: 10880-10890
- [c97]Zhengyu Chen, Dawei Huang, Mingran Wang, Bowen Yang, Jinuk Luke Shin, Changran Hu, Bo Li, Raghu Prabhakar, Gao Deng, Yongning Sheng, Sihua Fu, Lu Yuan, Tian Zhao, Yun Du, Chen Liu, Jun Yang, Viren Shah, Venkat Srinivasan, Sumti Jairath:
AI SoC Design Challenges in the Foundation Model Era. CICC 2023: 1-8
- [c96]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang:
Look Before You Match: Instance Understanding Matters in Video Object Segmentation. CVPR 2023: 2268-2278
- [c95]Shuquan Ye, Yujia Xie, Dongdong Chen, Yichong Xu, Lu Yuan, Chenguang Zhu, Jing Liao:
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles. CVPR 2023: 2634-2645
- [c94]Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang:
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning. CVPR 2023: 6312-6322
- [c93]Xiaoyi Dong, Jianmin Bao, Yinglin Zheng, Ting Zhang, Dongdong Chen, Hao Yang, Ming Zeng, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu:
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining. CVPR 2023: 10995-11005
- [c92]Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding. CVPR 2023: 11402-11411
- [c91]Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao:
Generalized Decoding for Pixel, Image, and Language. CVPR 2023: 15116-15127
- [c90]Cheng-Fu Yang, Yen-Chun Chen, Jianwei Yang, Xiyang Dai, Lu Yuan, Yu-Chiang Frank Wang, Kai-Wei Chang:
LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following. EMNLP 2023: 1203-1217
- [c89]Lu Yuan, Yuan Meng, Jiyan Sun, Shangyuan Zhuang, Yinlong Liu, Liru Geng, Weiqing Huang:
ATS: A Fully Automatic Troubleshooting System with Efficient Anomaly Detection and Localization. ICCS (5) 2023: 476-491
- [c88]Qidong Huang, Xiaoyi Dong, Dongdong Chen, Yinpeng Chen, Lu Yuan, Gang Hua, Weiming Zhang, Nenghai Yu:
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting. ICCV 2023: 1600-1610
- [c87]Kan Wu, Houwen Peng, Zhenghong Zhou, Bin Xiao, Mengchen Liu, Lu Yuan, Hong Xuan, Michael Valenzuela, Xi Stephen Chen, Xinggang Wang, Hongyang Chao, Han Hu:
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance. ICCV 2023: 21913-21923
- [c86]Ziyu Jiang, Yinpeng Chen, Mengchen Liu, Dongdong Chen, Xiyang Dai, Lu Yuan, Zicheng Liu, Zhangyang Wang:
Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations. ICLR 2023
- [c85]Hanqing Zhao, Dianmo Sheng, Jianmin Bao, Dongdong Chen, Dong Chen, Fang Wen, Lu Yuan, Ce Liu, Wenbo Zhou, Qi Chu, Weiming Zhang, Nenghai Yu:
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion. ICML 2023: 42098-42109
- [c84]Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection. NeurIPS 2023
- [c83]Shihao Zhao, Dongdong Chen, Yen-Chun Chen, Jianmin Bao, Shaozhe Hao, Lu Yuan, Kwan-Yee K. Wong:
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models. NeurIPS 2023
- [i116]Wen-Yang Zhou, Lu Yuan, Shuyu Chen, Lin Gao, Shimin Hu:
LC-NeRF: Local Controllable Face Generation in Neural Randiance Field. CoRR abs/2302.09486 (2023)
- [i115]Ziyu Jiang, Yinpeng Chen, Mengchen Liu, Dongdong Chen, Xiyang Dai, Lu Yuan, Zicheng Liu, Zhangyang Wang:
Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations. CoRR abs/2302.14138 (2023)
- [i114]Fuhui Zhou, Yihao Li, Ming Xu, Lu Yuan, Qihui Wu, Rose Qingyang Hu, Naofal Al-Dhahir:
Cognitive Semantic Communication Systems Driven by Knowledge Graph: Principle, Implementation, and Performance Evaluation. CoRR abs/2303.08546 (2023)
- [i113]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Xiyang Dai, Lu Yuan, Yu-Gang Jiang:
OmniTracker: Unifying Object Tracking by Tracking-with-Detection. CoRR abs/2303.12079 (2023)
- [i112]Junke Wang, Dongdong Chen, Chong Luo, Xiyang Dai, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang:
ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System. CoRR abs/2304.14407 (2023)
- [i111]Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Mei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data. CoRR abs/2305.12311 (2023)
- [i110]Munan Ning, Yujia Xie, Dongdong Chen, Zeyin Song, Lu Yuan, Yonghong Tian, Qixiang Ye, Li Yuan:
Album Storytelling with Iterative Story-aware Captioning and Large Language Models. CoRR abs/2305.12943 (2023)
- [i109]Yuwei Fang, Mahmoud Khademi, Chenguang Zhu, Ziyi Yang, Reid Pryzant, Yichong Xu, Yao Qian, Takuya Yoshioka, Lu Yuan, Michael Zeng, Xuedong Huang:
i-Code Studio: A Configurable and Composable Framework for Integrative AI. CoRR abs/2305.13738 (2023)
- [i108]Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin:
Image is First-order Norm+Linear Autoregressive. CoRR abs/2305.16319 (2023)
- [i107]Shihao Zhao, Dongdong Chen, Yen-Chun Chen, Jianmin Bao, Shaozhe Hao, Lu Yuan, Kwan-Yee K. Wong:
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models. CoRR abs/2305.16322 (2023)
- [i106]Zixin Zhu, Xuelu Feng, Dongdong Chen, Jianmin Bao, Le Wang, Yinpeng Chen, Lu Yuan, Gang Hua:
Designing a Better Asymmetric VQGAN for StableDiffusion. CoRR abs/2306.04632 (2023)
- [i105]Qinhong Yang, Dongdong Chen, Zhentao Tan, Qiankun Liu, Qi Chu, Jianmin Bao, Lu Yuan, Gang Hua, Nenghai Yu:
HQ-50K: A Large-scale, High-quality Dataset for Image Restoration. CoRR abs/2306.05390 (2023)
- [i104]Qidong Huang, Xiaoyi Dong, Dongdong Chen, Yinpeng Chen, Lu Yuan, Gang Hua, Weiming Zhang, Nenghai Yu:
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting. CoRR abs/2308.10315 (2023)
- [i103]Kan Wu, Houwen Peng, Zhenghong Zhou, Bin Xiao, Mengchen Liu, Lu Yuan, Hong Xuan, Michael Valenzuela, Xi Chen, Xinggang Wang, Hongyang Chao, Han Hu:
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance. CoRR abs/2309.12314 (2023)
- [i102]Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection. CoRR abs/2310.12152 (2023)
- [i101]Cheng-Fu Yang, Yen-Chun Chen, Jianwei Yang, Xiyang Dai, Lu Yuan, Yu-Chiang Frank Wang, Kai-Wei Chang:
LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following. CoRR abs/2310.12344 (2023)
- [i100]Yinpeng Chen, Dongdong Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin:
On the Hidden Waves of Image. CoRR abs/2310.12976 (2023)
- [i99]Hezhen Hu, Xiaoyi Dong, Jianmin Bao, Dongdong Chen, Lu Yuan, Dong Chen, Houqiang Li:
PersonMAE: Person Re-Identification Pre-Training with Masked AutoEncoders. CoRR abs/2311.04496 (2023)
- [i98]Bin Xiao, Haiping Wu, Weijian Xu, Xiyang Dai, Houdong Hu, Yumao Lu, Michael Zeng, Ce Liu, Lu Yuan:
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks. CoRR abs/2311.06242 (2023)
- [i97]Chongyan Chen, Mengchen Liu, Noel Codella, Yunsheng Li, Lu Yuan, Danna Gurari:
Fully Authentic Visual Question Answering Dataset from Online Communities. CoRR abs/2311.15562 (2023)
- [i96]Munan Ning, Bin Zhu, Yujia Xie, Bin Lin, Jiaxi Cui, Lu Yuan, Dongdong Chen, Li Yuan:
Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models. CoRR abs/2311.16103 (2023)
- [i95]James Hong, Lu Yuan, Michaël Gharbi, Matthew Fisher, Kayvon Fatahalian:
Learning Subject-Aware Cropping by Outpainting Professional Photos. CoRR abs/2312.12080 (2023)
- [i94]Chin-Hsuan Wu, Yen-Chun Chen, Bolivar Solarte, Lu Yuan, Min Sun:
iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views. CoRR abs/2312.17250 (2023)
- 2022
- [j30]Qiankun Liu, Dongdong Chen, Qi Chu, Lu Yuan, Bin Liu, Lei Zhang, Nenghai Yu:
Online multi-object tracking with unsupervised re-identification learning and occlusion estimation. Neurocomputing 483: 333-347 (2022)
- [j29]Lu Yuan, Hao Zhang, Ming Xu, Fuhui Zhou, Qihui Wu:
A Multiscale CNN Framework for Wireless Technique Classification in Internet of Things. IEEE Internet Things J. 9(12): 10366-10367 (2022)
- [j28]Zhenbing Liu, Lu Yuan, Long Sun:
Frequency separation-based multi-scale cascading residual block network for image super resolution. Multim. Tools Appl. 81(5): 6827-6848 (2022)
- [j27]Zhentao Tan, Dongdong Chen, Qi Chu, Menglei Chai, Jing Liao, Mingming He, Lu Yuan, Gang Hua, Nenghai Yu:
Efficient Semantic Image Synthesis via Class-Adaptive Normalization. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 4852-4866 (2022)
- [j26]Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Weiming Zhang, Lu Yuan, Gang Hua, Nenghai Yu:
E2Style: Improve the Efficiency and Effectiveness of StyleGAN Inversion. IEEE Trans. Image Process. 31: 3267-3280 (2022)
- [c82]Dengpan Fu, Dongdong Chen, Hao Yang, Jianmin Bao, Lu Yuan, Lei Zhang, Houqiang Li, Fang Wen, Dong Chen:
Large-Scale Pre-training for Person Re-identification with Noisy Labels. CVPR 2022: 1-11
- [c81]Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Xiaoyi Dong, Lu Yuan, Zicheng Liu:
Mobile-Former: Bridging MobileNet and Transformer. CVPR 2022: 5260-5269
- [c80]Shuyang Gu, Dong Chen, Jianmin Bao, Fang Wen, Bo Zhang, Dongdong Chen, Lu Yuan, Baining Guo:
Vector Quantized Diffusion Model for Text-to-Image Synthesis. CVPR 2022: 10686-10696
- [c79]Liunian Harold Li, Pengchuan Zhang, Haotian Zhang, Jianwei Yang, Chunyuan Li, Yiwu Zhong, Lijuan Wang, Lu Yuan, Lei Zhang, Jenq-Neng Hwang, Kai-Wei Chang, Jianfeng Gao:
Grounded Language-Image Pre-training. CVPR 2022: 10955-10965
- [c78]Qiankun Liu, Zhentao Tan, Dongdong Chen, Qi Chu, Xiyang Dai, Yinpeng Chen, Mengchen Liu, Lu Yuan, Nenghai Yu:
Reduce Information Loss in Transformers for Pluralistic Image Inpainting. CVPR 2022: 11337-11347
- [c77]Xiaoyi Dong, Jianmin Bao, Dongdong Chen, Weiming Zhang, Nenghai Yu, Lu Yuan, Dong Chen, Baining Guo:
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows. CVPR 2022: 12114-12124
- [c76]Jinnian Zhang, Houwen Peng, Kan Wu, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan:
MiniViT: Compressing Vision Transformers with Weight Multiplexing. CVPR 2022: 12135-12144
- [c75]Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, Lu Yuan:
BEVT: BERT Pretraining of Video Transformers. CVPR 2022: 14713-14723
- [c74]Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao:
RegionCLIP: Region-based Language-Image Pretraining. CVPR 2022: 16772-16782
- [c73]Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Zhentao Tan, Lu Yuan, Weiming Zhang, Nenghai Yu:
HairCLIP: Design Your Hair by Text and Reference Image. CVPR 2022: 18051-18060
- [c72]Zi-Yi Dou, Yichong Xu, Zhe Gan, Jianfeng Wang, Shuohang Wang, Lijuan Wang, Chenguang Zhu, Pengchuan Zhang, Lu Yuan, Nanyun Peng, Zicheng Liu, Michael Zeng:
An Empirical Study of Training End-to-End Vision-and-Language Transformers. CVPR 2022: 18145-18155
- [c71]Yinglin Zheng, Hao Yang, Ting Zhang, Jianmin Bao, Dongdong Chen, Yangyu Huang, Lu Yuan, Dong Chen, Ming Zeng, Fang Wen:
General Facial Representation Learning in a Visual-Linguistic Manner. CVPR 2022: 18676-18688
- [c70]Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Bin Xiao, Ce Liu, Lu Yuan, Jianfeng Gao:
Unified Contrastive Learning in Image-Text-Label Space. CVPR 2022: 19141-19151
- [c69]Kan Wu, Jinnian Zhang, Houwen Peng, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan:
TinyViT: Fast Pretraining Distillation for Small Vision Transformers. ECCV (21) 2022: 68-85
- [c68]Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, Shih-Fu Chang, Lu Yuan:
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training. ECCV (27) 2022: 69-87
- [c67]Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan:
DaViT: Dual Attention Vision Transformers. ECCV (24) 2022: 74-92
- [c66]Ziyu Jiang, Tianlong Chen, Xuxi Chen, Yu Cheng, Luowei Zhou, Lu Yuan, Ahmed Awadallah, Zhangyang Wang:
DnA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment. ECCV (20) 2022: 239-256
- [c65]Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu:
Bootstrapped Masked Autoencoders for Vision BERT Pretraining. ECCV (30) 2022: 247-264
- [c64]Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Ying Jin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos:
Should All Proposals Be Treated Equally in Object Detection? ECCV (25) 2022: 556-572
- [c63]Chunyuan Li, Jianwei Yang, Pengchuan Zhang, Mei Gao, Bin Xiao, Xiyang Dai, Lu Yuan, Jianfeng Gao:
Efficient Self-supervised Vision Transformers for Representation Learning. ICLR 2022
- [c62]Linsheng Hu, Yihao Li, Hao Zhang, Lu Yuan, Fuhui Zhou, Qihui Wu:
Robust Semantic Communication Driven by Knowledge Graph. IOTSMS 2022: 1-5
- [c61]Yuanze Lin, Yujia Xie, Dongdong Chen, Yichong Xu, Chenguang Zhu, Lu Yuan:
REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering. NeurIPS 2022
- [c60]Sheng Shen, Chunyuan Li, Xiaowei Hu, Yujia Xie, Jianwei Yang, Pengchuan Zhang, Zhe Gan, Lijuan Wang, Lu Yuan, Ce Liu, Kurt Keutzer, Trevor Darrell, Anna Rohrbach, Jianfeng Gao:
K-LITE: Learning Transferable Visual Models with External Knowledge. NeurIPS 2022
- [c59]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao, Yujia Xie, Ce Liu, Yu-Gang Jiang, Lu Yuan:
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks. NeurIPS 2022
- [c58]Yujia Xie, Luowei Zhou, Xiyang Dai, Lu Yuan, Nguyen Bach, Ce Liu, Michael Zeng:
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning. NeurIPS 2022
- [c57]Haotian Zhang, Pengchuan Zhang, Xiaowei Hu, Yen-Chun Chen, Liunian Harold Li, Xiyang Dai, Lijuan Wang, Lu Yuan, Jenq-Neng Hwang, Jianfeng Gao:
GLIPv2: Unifying Localization and Vision-Language Understanding. NeurIPS 2022
- [i93]Qiankun Liu, Dongdong Chen, Qi Chu, Lu Yuan, Bin Liu, Lei Zhang, Nenghai Yu:
Online Multi-Object Tracking with Unsupervised Re-Identification Learning and Occlusion Estimation. CoRR abs/2201.01297 (2022)
- [i92]Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Jianwei Yang, Xiyang Dai, Bin Xiao, Haoxuan You, Shih-Fu Chang, Lu Yuan:
CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks. CoRR abs/2201.05729 (