


Остановите войну!
for scientists:


default search action
Lu Yuan
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information

Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j32]Zitong Zhang, Lu Yuan
, Fuhui Zhou
, Qihui Wu
:
Data-and-Knowledge Dual-Driven Radio Frequency Fingerprint Identification. IEEE Internet Things J. 10(13): 11944-11945 (2023) - [j31]Yuwei Fan, Lei Shi, Lu Yuan:
Topic modeling methods for short texts: A survey. J. Intell. Fuzzy Syst. 45(2): 1971-1990 (2023) - [c95]Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu, Baining Guo:
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers. AAAI 2023: 552-560 - [c94]Wan-Cyuan Fan, Yen-Chun Chen, Dongdong Chen, Yu Cheng, Lu Yuan, Yu-Chiang Frank Wang:
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis. AAAI 2023: 579-587 - [c93]Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code: An Integrative and Composable Multimodal Learning Framework. AAAI 2023: 10880-10890 - [c92]Zhengyu Chen, Dawei Huang, Mingran Wang, Bowen Yang, Jinuk Luke Shin, Changran Hu, Bo Li, Raghu Prabhakar, Gao Deng, Yongning Sheng, Sihua Fu, Lu Yuan, Tian Zhao, Yun Du, Chen Liu, Jun Yang, Viren Shah, Venkat Srinivasan, Sumti Jairath:
AI SoC Design Challenges in the Foundation Model Era. CICC 2023: 1-8 - [c91]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang:
Look Before You Match: Instance Understanding Matters in Video Object Segmentation. CVPR 2023: 2268-2278 - [c90]Shuquan Ye, Yujia Xie, Dongdong Chen, Yichong Xu, Lu Yuan, Chenguang Zhu, Jing Liao:
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles. CVPR 2023: 2634-2645 - [c89]Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang:
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning. CVPR 2023: 6312-6322 - [c88]Xiaoyi Dong, Jianmin Bao, Yinglin Zheng, Ting Zhang, Dongdong Chen, Hao Yang, Ming Zeng, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu:
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining. CVPR 2023: 10995-11005 - [c87]Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding. CVPR 2023: 11402-11411 - [c86]Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao:
Generalized Decoding for Pixel, Image, and Language. CVPR 2023: 15116-15127 - [c85]Lu Yuan, Yuan Meng, Jiyan Sun, Shangyuan Zhuang, Yinlong Liu, Liru Geng, Weiqing Huang:
ATS: A Fully Automatic Troubleshooting System with Efficient Anomaly Detection and Localization. ICCS (5) 2023: 476-491 - [c84]Ziyu Jiang, Yinpeng Chen, Mengchen Liu, Dongdong Chen, Xiyang Dai, Lu Yuan, Zicheng Liu, Zhangyang Wang:
Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations. ICLR 2023 - [c83]Hanqing Zhao, Dianmo Sheng, Jianmin Bao, Dongdong Chen, Dong Chen, Fang Wen, Lu Yuan, Ce Liu, Wenbo Zhou, Qi Chu, Weiming Zhang, Nenghai Yu:
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion. ICML 2023: 42098-42109 - [i114]Wen-Yang Zhou, Lu Yuan, Shuyu Chen, Lin Gao, Shimin Hu:
LC-NeRF: Local Controllable Face Generation in Neural Randiance Field. CoRR abs/2302.09486 (2023) - [i113]Ziyu Jiang, Yinpeng Chen, Mengchen Liu, Dongdong Chen, Xiyang Dai, Lu Yuan, Zicheng Liu, Zhangyang Wang:
Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations. CoRR abs/2302.14138 (2023) - [i112]Fuhui Zhou, Yihao Li, Ming Xu, Lu Yuan, Qihui Wu, Rose Qingyang Hu, Naofal Al-Dhahir:
Cognitive Semantic Communication Systems Driven by Knowledge Graph: Principle, Implementation, and Performance Evaluation. CoRR abs/2303.08546 (2023) - [i111]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Xiyang Dai, Lu Yuan, Yu-Gang Jiang:
OmniTracker: Unifying Object Tracking by Tracking-with-Detection. CoRR abs/2303.12079 (2023) - [i110]Junke Wang, Dongdong Chen, Chong Luo, Xiyang Dai, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang:
ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System. CoRR abs/2304.14407 (2023) - [i109]Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Mei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data. CoRR abs/2305.12311 (2023) - [i108]Munan Ning, Yujia Xie, Dongdong Chen, Zeyin Song, Lu Yuan, Yonghong Tian, Qixiang Ye, Li Yuan:
Album Storytelling with Iterative Story-aware Captioning and Large Language Models. CoRR abs/2305.12943 (2023) - [i107]Yuwei Fang, Mahmoud Khademi, Chenguang Zhu, Ziyi Yang, Reid Pryzant, Yichong Xu, Yao Qian, Takuya Yoshioka, Lu Yuan, Michael Zeng, Xuedong Huang:
i-Code Studio: A Configurable and Composable Framework for Integrative AI. CoRR abs/2305.13738 (2023) - [i106]Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin:
Image is First-order Norm+Linear Autoregressive. CoRR abs/2305.16319 (2023) - [i105]Shihao Zhao, Dongdong Chen, Yen-Chun Chen, Jianmin Bao, Shaozhe Hao, Lu Yuan, Kwan-Yee K. Wong:
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models. CoRR abs/2305.16322 (2023) - [i104]Zixin Zhu, Xuelu Feng, Dongdong Chen, Jianmin Bao, Le Wang, Yinpeng Chen, Lu Yuan, Gang Hua:
Designing a Better Asymmetric VQGAN for StableDiffusion. CoRR abs/2306.04632 (2023) - [i103]Qinhong Yang, Dongdong Chen, Zhentao Tan, Qiankun Liu, Qi Chu, Jianmin Bao, Lu Yuan, Gang Hua, Nenghai Yu:
HQ-50K: A Large-scale, High-quality Dataset for Image Restoration. CoRR abs/2306.05390 (2023) - [i102]Qidong Huang, Xiaoyi Dong, Dongdong Chen, Yinpeng Chen, Lu Yuan, Gang Hua, Weiming Zhang, Nenghai Yu:
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting. CoRR abs/2308.10315 (2023) - [i101]Kan Wu, Houwen Peng, Zhenghong Zhou, Bin Xiao, Mengchen Liu, Lu Yuan, Hong Xuan, Michael Valenzuela, Xi Chen, Xinggang Wang, Hongyang Chao, Han Hu:
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance. CoRR abs/2309.12314 (2023) - [i100]Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection. CoRR abs/2310.12152 (2023) - [i99]Cheng-Fu Yang, Yen-Chun Chen, Jianwei Yang, Xiyang Dai, Lu Yuan, Yu-Chiang Frank Wang, Kai-Wei Chang:
LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following. CoRR abs/2310.12344 (2023) - [i98]Yinpeng Chen, Dongdong Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin:
On the Hidden Waves of Image. CoRR abs/2310.12976 (2023) - [i97]Hezhen Hu, Xiaoyi Dong, Jianmin Bao, Dongdong Chen, Lu Yuan, Dong Chen, Houqiang Li:
PersonMAE: Person Re-Identification Pre-Training with Masked AutoEncoders. CoRR abs/2311.04496 (2023) - [i96]Bin Xiao, Haiping Wu, Weijian Xu, Xiyang Dai, Houdong Hu, Yumao Lu, Michael Zeng, Ce Liu, Lu Yuan:
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks. CoRR abs/2311.06242 (2023) - [i95]Chongyan Chen, Mengchen Liu, Noel Codella, Yunsheng Li, Lu Yuan, Danna Gurari:
Fully Authentic Visual Question Answering Dataset from Online Communities. CoRR abs/2311.15562 (2023) - [i94]Munan Ning, Bin Zhu, Yujia Xie, Bin Lin, Jiaxi Cui, Lu Yuan, Dongdong Chen, Li Yuan:
Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models. CoRR abs/2311.16103 (2023) - 2022
- [j30]Qiankun Liu, Dongdong Chen, Qi Chu, Lu Yuan, Bin Liu, Lei Zhang, Nenghai Yu:
Online multi-object tracking with unsupervised re-identification learning and occlusion estimation. Neurocomputing 483: 333-347 (2022) - [j29]Lu Yuan
, Hao Zhang
, Ming Xu
, Fuhui Zhou
, Qihui Wu
:
A Multiscale CNN Framework for Wireless Technique Classification in Internet of Things. IEEE Internet Things J. 9(12): 10366-10367 (2022) - [j28]Zhenbing Liu
, Lu Yuan, Long Sun:
Frequency separation-based multi-scale cascading residual block network for image super resolution. Multim. Tools Appl. 81(5): 6827-6848 (2022) - [j27]Zhentao Tan
, Dongdong Chen
, Qi Chu
, Menglei Chai
, Jing Liao
, Mingming He
, Lu Yuan
, Gang Hua
, Nenghai Yu
:
Efficient Semantic Image Synthesis via Class-Adaptive Normalization. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 4852-4866 (2022) - [j26]Tianyi Wei
, Dongdong Chen
, Wenbo Zhou, Jing Liao
, Weiming Zhang
, Lu Yuan
, Gang Hua
, Nenghai Yu
:
E2Style: Improve the Efficiency and Effectiveness of StyleGAN Inversion. IEEE Trans. Image Process. 31: 3267-3280 (2022) - [c82]Dengpan Fu, Dongdong Chen, Hao Yang, Jianmin Bao, Lu Yuan, Lei Zhang, Houqiang Li, Fang Wen, Dong Chen:
Large-Scale Pre-training for Person Re-identification with Noisy Labels. CVPR 2022: 1-11 - [c81]Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Xiaoyi Dong, Lu Yuan, Zicheng Liu:
Mobile-Former: Bridging MobileNet and Transformer. CVPR 2022: 5260-5269 - [c80]Shuyang Gu, Dong Chen, Jianmin Bao, Fang Wen, Bo Zhang, Dongdong Chen, Lu Yuan, Baining Guo:
Vector Quantized Diffusion Model for Text-to-Image Synthesis. CVPR 2022: 10686-10696 - [c79]Liunian Harold Li, Pengchuan Zhang, Haotian Zhang, Jianwei Yang, Chunyuan Li, Yiwu Zhong, Lijuan Wang, Lu Yuan, Lei Zhang, Jenq-Neng Hwang, Kai-Wei Chang, Jianfeng Gao:
Grounded Language-Image Pre-training. CVPR 2022: 10955-10965 - [c78]Qiankun Liu, Zhentao Tan, Dongdong Chen, Qi Chu, Xiyang Dai, Yinpeng Chen, Mengchen Liu, Lu Yuan, Nenghai Yu:
Reduce Information Loss in Transformers for Pluralistic Image Inpainting. CVPR 2022: 11337-11347 - [c77]Xiaoyi Dong, Jianmin Bao, Dongdong Chen, Weiming Zhang, Nenghai Yu, Lu Yuan, Dong Chen, Baining Guo:
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows. CVPR 2022: 12114-12124 - [c76]Jinnian Zhang, Houwen Peng, Kan Wu, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan:
MiniViT: Compressing Vision Transformers with Weight Multiplexing. CVPR 2022: 12135-12144 - [c75]Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, Lu Yuan:
BEVT: BERT Pretraining of Video Transformers. CVPR 2022: 14713-14723 - [c74]Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao:
RegionCLIP: Region-based Language-Image Pretraining. CVPR 2022: 16772-16782 - [c73]Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao
, Zhentao Tan, Lu Yuan, Weiming Zhang, Nenghai Yu:
HairCLIP: Design Your Hair by Text and Reference Image. CVPR 2022: 18051-18060 - [c72]Zi-Yi Dou, Yichong Xu, Zhe Gan, Jianfeng Wang, Shuohang Wang, Lijuan Wang, Chenguang Zhu, Pengchuan Zhang, Lu Yuan, Nanyun Peng, Zicheng Liu, Michael Zeng:
An Empirical Study of Training End-to-End Vision-and-Language Transformers. CVPR 2022: 18145-18155 - [c71]Yinglin Zheng, Hao Yang, Ting Zhang, Jianmin Bao, Dongdong Chen, Yangyu Huang, Lu Yuan, Dong Chen, Ming Zeng, Fang Wen:
General Facial Representation Learning in a Visual-Linguistic Manner. CVPR 2022: 18676-18688 - [c70]Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Bin Xiao, Ce Liu, Lu Yuan, Jianfeng Gao:
Unified Contrastive Learning in Image-Text-Label Space. CVPR 2022: 19141-19151 - [c69]Kan Wu, Jinnian Zhang, Houwen Peng, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan:
TinyViT: Fast Pretraining Distillation for Small Vision Transformers. ECCV (21) 2022: 68-85 - [c68]Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, Shih-Fu Chang, Lu Yuan:
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training. ECCV (27) 2022: 69-87 - [c67]Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan:
DaViT: Dual Attention Vision Transformers. ECCV (24) 2022: 74-92 - [c66]Ziyu Jiang, Tianlong Chen, Xuxi Chen, Yu Cheng, Luowei Zhou, Lu Yuan, Ahmed Awadallah, Zhangyang Wang:
DnA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment. ECCV (20) 2022: 239-256 - [c65]Xiaoyi Dong
, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu:
Bootstrapped Masked Autoencoders for Vision BERT Pretraining. ECCV (30) 2022: 247-264 - [c64]Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Ying Jin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos:
Should All Proposals Be Treated Equally in Object Detection? ECCV (25) 2022: 556-572 - [c63]Chunyuan Li, Jianwei Yang, Pengchuan Zhang, Mei Gao, Bin Xiao, Xiyang Dai, Lu Yuan, Jianfeng Gao:
Efficient Self-supervised Vision Transformers for Representation Learning. ICLR 2022 - [c62]Linsheng Hu, Yihao Li, Hao Zhang, Lu Yuan, Fuhui Zhou, Qihui Wu:
Robust Semantic Communication Driven by Knowledge Graph. IOTSMS 2022: 1-5 - [c61]Yuanze Lin, Yujia Xie, Dongdong Chen, Yichong Xu, Chenguang Zhu, Lu Yuan:
REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering. NeurIPS 2022 - [c60]Sheng Shen, Chunyuan Li, Xiaowei Hu, Yujia Xie, Jianwei Yang, Pengchuan Zhang, Zhe Gan, Lijuan Wang, Lu Yuan, Ce Liu, Kurt Keutzer, Trevor Darrell, Anna Rohrbach, Jianfeng Gao:
K-LITE: Learning Transferable Visual Models with External Knowledge. NeurIPS 2022 - [c59]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao, Yujia Xie, Ce Liu, Yu-Gang Jiang, Lu Yuan:
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks. NeurIPS 2022 - [c58]Yujia Xie, Luowei Zhou, Xiyang Dai, Lu Yuan, Nguyen Bach, Ce Liu, Michael Zeng:
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning. NeurIPS 2022 - [c57]Haotian Zhang, Pengchuan Zhang, Xiaowei Hu, Yen-Chun Chen, Liunian Harold Li, Xiyang Dai, Lijuan Wang, Lu Yuan, Jenq-Neng Hwang, Jianfeng Gao:
GLIPv2: Unifying Localization and Vision-Language Understanding. NeurIPS 2022 - [i93]Qiankun Liu, Dongdong Chen, Qi Chu, Lu Yuan, Bin Liu, Lei Zhang, Nenghai Yu:
Online Multi-Object Tracking with Unsupervised Re-Identification Learning and Occlusion Estimation. CoRR abs/2201.01297 (2022) - [i92]Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Jianwei Yang, Xiyang Dai, Bin Xiao, Haoxuan You, Shih-Fu Chang, Lu Yuan:
CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks. CoRR abs/2201.05729 (2022) - [i91]Dengpan Fu, Dongdong Chen, Hao Yang, Jianmin Bao, Lu Yuan, Lei Zhang, Houqiang Li, Fang Wen, Dong Chen:
Large-Scale Pre-training for Person Re-identification with Noisy Labels. CoRR abs/2203.16533 (2022) - [i90]Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Bin Xiao, Ce Liu, Lu Yuan, Jianfeng Gao:
Unified Contrastive Learning in Image-Text-Label Space. CoRR abs/2204.03610 (2022) - [i89]Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan:
DaViT: Dual Attention Vision Transformers. CoRR abs/2204.03645 (2022) - [i88]Jinnian Zhang, Houwen Peng, Kan Wu, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan:
MiniViT: Compressing Vision Transformers with Weight Multiplexing. CoRR abs/2204.07154 (2022) - [i87]Sheng Shen, Chunyuan Li, Xiaowei Hu, Yujia Xie, Jianwei Yang, Pengchuan Zhang, Anna Rohrbach, Zhe Gan, Lijuan Wang, Lu Yuan, Ce Liu, Kurt Keutzer, Trevor Darrell, Jianfeng Gao:
K-LITE: Learning Transferable Visual Models with External Knowledge. CoRR abs/2204.09222 (2022) - [i86]Lemeng Wu, Mengchen Liu, Yinpeng Chen, Dongdong Chen, Xiyang Dai, Lu Yuan:
Residual Mixture of Experts. CoRR abs/2204.09636 (2022) - [i85]Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Xiyang Dai, Bin Xiao, Jianwei Yang, Haoxuan You, Kai-Wei Chang, Shih-Fu Chang, Lu Yuan:
Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks. CoRR abs/2204.10496 (2022) - [i84]Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code: An Integrative and Composable Multimodal Learning Framework. CoRR abs/2205.01818 (2022) - [i83]Qiankun Liu, Zhentao Tan, Dongdong Chen, Qi Chu, Xiyang Dai, Yinpeng Chen, Mengchen Liu, Lu Yuan, Nenghai Yu:
Reduce Information Loss in Transformers for Pluralistic Image Inpainting. CoRR abs/2205.05076 (2022) - [i82]Yuanze Lin, Yujia Xie, Dongdong Chen, Yichong Xu, Chenguang Zhu, Lu Yuan:
REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering. CoRR abs/2206.01201 (2022) - [i81]Yujia Xie, Luowei Zhou, Xiyang Dai, Lu Yuan, Nguyen Bach, Ce Liu, Michael Zeng:
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning. CoRR abs/2206.01843 (2022) - [i80]Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding. CoRR abs/2206.03484 (2022) - [i79]Haotian Zhang, Pengchuan Zhang, Xiaowei Hu, Yen-Chun Chen, Liunian Harold Li, Xiyang Dai, Lijuan Wang, Lu Yuan, Jenq-Neng Hwang, Jianfeng Gao:
GLIPv2: Unifying Localization and Vision-Language Understanding. CoRR abs/2206.05836 (2022) - [i78]Weilun Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Dong Chen, Lu Yuan, Houqiang Li:
Semantic Image Synthesis via Diffusion Models. CoRR abs/2207.00050 (2022) - [i77]Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Jing Yin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos:
Should All Proposals be Treated Equally in Object Detection? CoRR abs/2207.03520 (2022) - [i76]Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu:
Bootstrapped Masked Autoencoders for Vision BERT Pretraining. CoRR abs/2207.07116 (2022) - [i75]Kan Wu, Jinnian Zhang
, Houwen Peng, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan:
TinyViT: Fast Pretraining Distillation for Small Vision Transformers. CoRR abs/2207.10666 (2022) - [i74]Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, Shih-Fu Chang, Lu Yuan:
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training. CoRR abs/2207.12661 (2022) - [i73]Rui Wang, Zuxuan Wu, Dongdong Chen, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Luowei Zhou, Lu Yuan, Yu-Gang Jiang:
Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling. CoRR abs/2208.12257 (2022) - [i72]Xiaoyi Dong, Yinglin Zheng, Jianmin Bao, Ting Zhang, Dongdong Chen, Hao Yang, Ming Zeng, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu:
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining. CoRR abs/2208.12262 (2022) - [i71]Wan-Cyuan Fan, Yen-Chun Chen, Dongdong Chen, Yu Cheng, Lu Yuan, Yu-Chiang Frank Wang:
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis. CoRR abs/2208.13753 (2022) - [i70]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao, Yujia Xie, Ce Liu, Yu-Gang Jiang, Lu Yuan:
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks. CoRR abs/2209.07526 (2022) - [i69]Weilun Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Dong Chen, Lu Yuan, Houqiang Li:
SinDiffusion: Learning a Diffusion Model from a Single Natural Image. CoRR abs/2211.12445 (2022) - [i68]Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin:
Self-Supervised Learning based on Heat Equation. CoRR abs/2211.13228 (2022) - [i67]Shuquan Ye, Yujia Xie, Dongdong Chen, Yichong Xu, Lu Yuan, Chenguang Zhu, Jing Liao:
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles. CoRR abs/2211.16504 (2022) - [i66]Hanqing Zhao, Dianmo Sheng, Jianmin Bao, Dongdong Chen, Dong Chen, Fang Wen, Lu Yuan, Ce Liu, Wenbo Zhou, Qi Chu, Weiming Zhang, Nenghai Yu:
X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion. CoRR abs/2212.03863 (2022) - [i65]Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang:
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning. CoRR abs/2212.04500 (2022) - [i64]Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Shuyang Gu, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu:
CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet. CoRR abs/2212.06138 (2022) - [i63]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang:
Look Before You Match: Instance Understanding Matters in Video Object Segmentation. CoRR abs/2212.06826 (2022) - [i62]Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao:
Generalized Decoding for Pixel, Image, and Language. CoRR abs/2212.11270 (2022) - 2021
- [j25]Zhenbing Liu, Kaijie Wang, Zimin Wang, Haoxiang Lu, Lu Yuan:
PatchNet: a tiny low-light image enhancement net. J. Electronic Imaging 30(3) (2021) - [j24]Chenlin Huang
, Wei Chen, Lu Yuan, Yan Ding, Songlei Jian
, Yusong Tan, Hua Chen, Dan Chen:
Toward security as a service: A trusted cloud service architecture with policy customization. J. Parallel Distributed Comput. 149: 76-88 (2021) - [j23]Qingnan Fan
, Dongdong Chen
, Lu Yuan
, Gang Hua, Nenghai Yu
, Baoquan Chen
:
A General Decoupled Learning Framework for Parameterized Image Operators. IEEE Trans. Pattern Anal. Mach. Intell. 43(1): 33-47 (2021) - [j22]Dongdong Chen
, Lu Yuan
, Jing Liao
, Nenghai Yu
, Gang Hua
:
Explicit Filterbank Learning for Neural Image Style Transfer and Image Processing. IEEE Trans. Pattern Anal. Mach. Intell. 43(7): 2373-2387 (2021) - [j21]