default search action
Zicheng Liu 0001
Person information
- affiliation: Microsoft Research, Redmond, WA, USA
- affiliation (PhD 1996): Princeton University, NJ, USA
- affiliation: Chinese Academy of Sciences, Institute of Applied Mathematics, Beijing, China
Other persons with the same name
- Zicheng Liu — disambiguation page
- Zicheng Liu 0002 — UiT The Arctic University of Norway, Department of Physics and Technology, Tromsø, Norway (and 4 more)
- Zicheng Liu 0003 — Wuhan University, Institute of Artificial Intelligence, State Key Lab of LIESMARS, Wuhan, China
- Zicheng Liu 0004 — Macau University of Science and Technology, Institute of Systems Engineering, Collaborative Laboratory for Intelligent Science and Systems, Taipa, Macau (and 1 more)
- Zicheng Liu 0005 — Nanjing University, State Key Lab for Novel Software Technology, Nanjing, China
- Zicheng Liu 0006 — Westlake University & Institute of Advanced Technology, AI Lab, Hangzhou, China
- Zicheng Liu 0007 — Huazhong University of Science and Technology, China-EU Institute for Clean and Renewable Energy, Wuhan, China (and 1 more)
- Zicheng Liu 0008 — Beihang University, School of Software, Beijing, China
- Zicheng Liu 0009 — Beijing Institute of Technology Chongqing Center for Microelectronics and Microsystems, China
- Zicheng Liu 0010 — Kunming University of Science and Technology, Faculty of Land Resource Engineering, China
- Zicheng Liu 0011 — Chinese University of Hong Kong, Department of Statistics, Hong Kong
- Zicheng Liu 0012 — Nankai University, College of Artificial Intelligence, Tianjin, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [c152]Minheng Ni, Chenfei Wu, Xiaodong Wang, Shengming Yin, Lijuan Wang, Zicheng Liu, Nan Duan:
ORES: Open-Vocabulary Responsible Visual Synthesis. AAAI 2024: 21473-21481 - [c151]Tan Wang, Linjie Li, Kevin Lin, Yuanhao Zhai, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang:
Disco: Disentangled Control for Realistic Human Dance Generation. CVPR 2024: 9326-9336 - [c150]Zichen Miao, Jiang Wang, Ze Wang, Zhengyuan Yang, Lijuan Wang, Qiang Qiu, Zicheng Liu:
Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning. CVPR 2024: 10844-10853 - [c149]Xiaoke Huang, Jianfeng Wang, Yansong Tang, Zheng Zhang, Han Hu, Jiwen Lu, Lijuan Wang, Zicheng Liu:
Segment and Caption Anything. CVPR 2024: 13405-13417 - [c148]Chaoyi Zhang, Kevin Lin, Zhengyuan Yang, Jianfeng Wang, Linjie Li, Chung-Ching Lin, Zicheng Liu, Lijuan Wang:
MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning. CVPR 2024: 13647-13657 - [c147]Yuanhao Zhai, Kevin Lin, Linjie Li, Chung-Ching Lin, Jianfeng Wang, Zhengyuan Yang, David S. Doermann, Junsong Yuan, Zicheng Liu, Lijuan Wang:
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation. ECCV (15) 2024: 134-152 - [c146]Zhengyuan Yang, Jianfeng Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Lijuan Wang:
Idea2Img: Iterative Self-refinement with GPT-4V for Automatic Image Design and Generation. ECCV (38) 2024: 167-184 - [c145]Jialian Wu, Jianfeng Wang, Zhengyuan Yang, Zhe Gan, Zicheng Liu, Junsong Yuan, Lijuan Wang:
GRiT: A Generative Region-to-Text Transformer for Object Understanding. ECCV (80) 2024: 207-224 - [c144]Xiang Li, Yinpeng Chen, Chung-Ching Lin, Hao Chen, Kai Hu, Rita Singh, Bhiksha Raj, Lijuan Wang, Zicheng Liu:
Completing Visual Objects via Bridging Generation and Segmentation. ICML 2024 - [c143]Zecheng Tang, Chenfei Wu, Zekai Zhang, Minheng Ni, Shengming Yin, Yu Liu, Zhengyuan Yang, Lijuan Wang, Zicheng Liu, Juntao Li, Nan Duan:
StrokeNUWA - Tokenizing Strokes for Vector Graphic Synthesis. ICML 2024 - [c142]Weihao Yu, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Xinchao Wang, Lijuan Wang:
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities. ICML 2024 - [c141]Jie An, Zhengyuan Yang, Jianfeng Wang, Linjie Li, Zicheng Liu, Lijuan Wang, Jiebo Luo:
Bring Metric Functions into Diffusion Models. IJCAI 2024: 578-586 - [c140]Jie An, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Lijuan Wang, Jiebo Luo:
OpenLEAF: A Novel Benchmark for Open-Domain Interleaved Image-Text Generation. ACM Multimedia 2024: 11137-11145 - [c139]Kevin Lin, Chung-Ching Lin, Lin Liang, Zicheng Liu, Lijuan Wang:
MPT: Mesh Pre-Training with Transformers for Human Pose and Mesh Reconstruction. WACV 2024: 3403-3413 - [i101]Jie An, Zhengyuan Yang, Jianfeng Wang, Linjie Li, Zicheng Liu, Lijuan Wang, Jiebo Luo:
Bring Metric Functions into Diffusion Models. CoRR abs/2401.02414 (2024) - [i100]Zecheng Tang, Chenfei Wu, Zekai Zhang, Mingheng Ni, Shengming Yin, Yu Liu, Zhengyuan Yang, Lijuan Wang, Zicheng Liu, Juntao Li, Nan Duan:
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis. CoRR abs/2401.17093 (2024) - [i99]Jiazhao Zhang, Ying Hung, Chung-Ching Lin, Zicheng Liu:
A Unified Gaussian Process for Branching and Nested Hyperparameter Optimization. CoRR abs/2402.04885 (2024) - [i98]Yuanhao Zhai, Kevin Lin, Linjie Li, Chung-Ching Lin, Jianfeng Wang, Zhengyuan Yang, David S. Doermann, Junsong Yuan, Zicheng Liu, Lijuan Wang:
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation. CoRR abs/2407.10937 (2024) - [i97]Weihao Yu, Zhengyuan Yang, Linfeng Ren, Linjie Li, Jianfeng Wang, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Lijuan Wang, Xinchao Wang:
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities. CoRR abs/2408.00765 (2024) - [i96]Minheng Ni, Chenfei Wu, Huaying Yuan, Zhengyuan Yang, Ming Gong, Lijuan Wang, Zicheng Liu, Wangmeng Zuo, Nan Duan:
AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition. CoRR abs/2408.11564 (2024) - [i95]Zichen Miao, Zhengyuan Yang, Kevin Lin, Ze Wang, Zicheng Liu, Lijuan Wang, Qiang Qiu:
Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization. CoRR abs/2410.03190 (2024) - [i94]Taewook Kim, Ze Wang, Zhengyuan Yang, Jiang Wang, Lijuan Wang, Zicheng Liu, Qiang Qiu:
Conditional Text-to-Image Generation with Reference Guidance. CoRR abs/2411.16713 (2024) - [i93]Hao Chen, Ze Wang, Xiang Li, Ximeng Sun, Fangyi Chen, Jiang Liu, Jindong Wang, Bhiksha Raj, Zicheng Liu, Emad Barsoum:
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer. CoRR abs/2412.10958 (2024) - 2023
- [j55]Qiang Zhai, Xin Li, Fan Yang, Zhicheng Jiao, Ping Luo, Hong Cheng, Zicheng Liu:
MGL: Mutual Graph Learning for Camouflaged Object Detection. IEEE Trans. Image Process. 32: 1897-1910 (2023) - [j54]Qiang Zhai, Fan Yang, Xin Li, Guo-Sen Xie, Hong Cheng, Zicheng Liu:
Co-Communication Graph Convolutional Network for Multi-View Crowd Counting. IEEE Trans. Multim. 25: 5813-5825 (2023) - [c138]Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, Minheng Ni, Zhengyuan Yang, Linjie Li, Shuguang Liu, Fan Yang, Jianlong Fu, Ming Gong, Lijuan Wang, Zicheng Liu, Houqiang Li, Nan Duan:
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation. ACL (1) 2023: 1309-1320 - [c137]Lin Huang, Chung-Ching Lin, Kevin Lin, Lin Liang, Lijuan Wang, Junsong Yuan, Zicheng Liu:
Neural Voting Field for Camera-Space 3D Hand Pose Estimation. CVPR 2023: 8969-8978 - [c136]Chung-Ching Lin, Jiang Wang, Kun Luo, Kevin Lin, Linjie Li, Lijuan Wang, Zicheng Liu:
Adaptive Human Matting for Dynamic Videos. CVPR 2023: 10229-10238 - [c135]Shiqi Lin, Zhizheng Zhang, Zhipeng Huang, Yan Lu, Cuiling Lan, Peng Chu, Quanzeng You, Jiang Wang, Zicheng Liu, Amey Parulkar, Viraj Navkal, Zhibo Chen:
Deep Frequency Filtering for Domain Generalization. CVPR 2023: 11797-11807 - [c134]Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Linjie Li, Kevin Lin, Chenfei Wu, Nan Duan, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang:
ReCo: Region-Controlled Text-to-Image Generation. CVPR 2023: 14246-14255 - [c133]Ze Wang, Jiang Wang, Zicheng Liu, Qiang Qiu:
Binary Latent Diffusion. CVPR 2023: 22576-22585 - [c132]Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu:
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling. CVPR 2023: 22898-22909 - [c131]Linjie Li, Zhe Gan, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Ce Liu, Lijuan Wang:
LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling. CVPR 2023: 23119-23129 - [c130]Tan Wang, Kevin Lin, Linjie Li, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang:
Equivariant Similarity for Vision-Language Foundation Models. ICCV 2023: 11964-11974 - [c129]Ying Jin, Yinpeng Chen, Jianfeng Wang, Lijuan Wang, Jenq-Neng Hwang, Zicheng Liu:
Zero-Shot Human-Object Interaction (HOI) Classification by Bridging Generative and Contrastive Image-Language Models. ICIP 2023: 1970-1974 - [c128]Ziyu Jiang, Yinpeng Chen, Mengchen Liu, Dongdong Chen, Xiyang Dai, Lu Yuan, Zicheng Liu, Zhangyang Wang:
Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations. ICLR 2023 - [c127]Ze Wang, Jiang Wang, Zicheng Liu, Qiang Qiu:
Energy-Inspired Self-Supervised Pretraining for Vision Models. ICLR 2023 - [c126]Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan:
Learning 3D Photography Videos via Self-supervised Diffusion on Single Images. IJCAI 2023: 1506-1514 - [c125]Xiang Li, Chung-Ching Lin, Yinpeng Chen, Zicheng Liu, Jinglu Wang, Rita Singh, Bhiksha Raj:
PaintSeg: Painting Pixels for Training-free Segmentation. NeurIPS 2023 - [c124]Xiaotian Han, Quanzeng You, Chunyu Wang, Zhizheng Zhang, Peng Chu, Houdong Hu, Jiang Wang, Zicheng Liu:
MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking Benchmark. WACV 2023: 4849-4858 - [c123]Peng Chu, Jiang Wang, Quanzeng You, Haibin Ling, Zicheng Liu:
TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking. WACV 2023: 4859-4869 - [i92]Ze Wang, Jiang Wang, Zicheng Liu, Qiang Qiu:
Energy-Inspired Self-Supervised Pretraining for Vision Models. CoRR abs/2302.01384 (2023) - [i91]Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan:
Learning 3D Photography Videos via Self-supervised Diffusion on Single Images. CoRR abs/2302.10781 (2023) - [i90]Ziyu Jiang, Yinpeng Chen, Mengchen Liu, Dongdong Chen, Xiyang Dai, Lu Yuan, Zicheng Liu, Zhangyang Wang:
Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations. CoRR abs/2302.14138 (2023) - [i89]Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Ehsan Azarnasab, Faisal Ahmed, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang:
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action. CoRR abs/2303.11381 (2023) - [i88]Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, Minheng Ni, Zhengyuan Yang, Linjie Li, Shuguang Liu, Fan Yang, Jianlong Fu, Gong Ming, Lijuan Wang, Zicheng Liu, Houqiang Li, Nan Duan:
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation. CoRR abs/2303.12346 (2023) - [i87]Tan Wang, Kevin Lin, Linjie Li, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang:
Equivariant Similarity for Vision-Language Foundation Models. CoRR abs/2303.14465 (2023) - [i86]Ze Wang, Jiang Wang, Zicheng Liu, Qiang Qiu:
Binary Latent Diffusion. CoRR abs/2304.04820 (2023) - [i85]Chung-Ching Lin, Jiang Wang, Kun Luo, Kevin Lin, Linjie Li, Lijuan Wang, Zicheng Liu:
Adaptive Human Matting for Dynamic Videos. CoRR abs/2304.06018 (2023) - [i84]Lin Huang, Chung-Ching Lin, Kevin Lin, Lin Liang, Lijuan Wang, Junsong Yuan, Zicheng Liu:
Neural Voting Field for Camera-Space 3D Hand Pose Estimation. CoRR abs/2305.04328 (2023) - [i83]Yinan Feng, Yinpeng Chen, Peng Jin, Shihang Feng, Zicheng Liu, Youzuo Lin:
Simplifying Full Waveform Inversion via Domain-Independent Self-Supervised Learning. CoRR abs/2305.13314 (2023) - [i82]Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin:
Image is First-order Norm+Linear Autoregressive. CoRR abs/2305.16319 (2023) - [i81]Xiang Li, Chung-Ching Lin, Yinpeng Chen, Zicheng Liu, Jinglu Wang, Bhiksha Raj:
PaintSeg: Training-free Segmentation via Painting. CoRR abs/2305.19406 (2023) - [i80]Andre Abrantes, Jiang Wang, Peng Chu, Quanzeng You, Zicheng Liu:
RefineVIS: Video Instance Segmentation with Temporal Attention Refinement. CoRR abs/2306.04774 (2023) - [i79]Tan Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang:
DisCo: Disentangled Control for Referring Human Dance Generation in Real World. CoRR abs/2307.00040 (2023) - [i78]Xin Yuan, Linjie Li, Jianfeng Wang, Zhengyuan Yang, Kevin Lin, Zicheng Liu, Lijuan Wang:
Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models. CoRR abs/2307.14648 (2023) - [i77]Peng Jin, Yinan Feng, Shihang Feng, Hanchen Wang, Yinpeng Chen, Benjamin Consolvo, Zicheng Liu, Youzuo Lin:
Does Full Waveform Inversion Benefit from Big Data? CoRR abs/2307.15388 (2023) - [i76]Weihao Yu, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Xinchao Wang, Lijuan Wang:
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities. CoRR abs/2308.02490 (2023) - [i75]Minheng Ni, Chenfei Wu, Xiaodong Wang, Shengming Yin, Lijuan Wang, Zicheng Liu, Nan Duan:
ORES: Open-vocabulary Responsible Visual Synthesis. CoRR abs/2308.13785 (2023) - [i74]Zhengyuan Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Chung-Ching Lin, Zicheng Liu, Lijuan Wang:
The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision). CoRR abs/2309.17421 (2023) - [i73]Xiang Li, Yinpeng Chen, Chung-Ching Lin, Rita Singh, Bhiksha Raj, Zicheng Liu:
Completing Visual Objects via Bridging Generation and Segmentation. CoRR abs/2310.00808 (2023) - [i72]Jie An, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Lijuan Wang, Jiebo Luo:
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation. CoRR abs/2310.07749 (2023) - [i71]Zhengyuan Yang, Jianfeng Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Lijuan Wang:
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation. CoRR abs/2310.08541 (2023) - [i70]Yinpeng Chen, Dongdong Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin:
On the Hidden Waves of Image. CoRR abs/2310.12976 (2023) - [i69]Kevin Lin, Faisal Ahmed, Linjie Li, Chung-Ching Lin, Ehsan Azarnasab, Zhengyuan Yang, Jianfeng Wang, Lin Liang, Zicheng Liu, Yumao Lu, Ce Liu, Lijuan Wang:
MM-VID: Advancing Video Understanding with GPT-4V(ision). CoRR abs/2310.19773 (2023) - [i68]An Yan, Zhengyuan Yang, Wanrong Zhu, Kevin Lin, Linjie Li, Jianfeng Wang, Jianwei Yang, Yiwu Zhong, Julian J. McAuley, Jianfeng Gao, Zicheng Liu, Lijuan Wang:
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation. CoRR abs/2311.07562 (2023) - [i67]Chaoyi Zhang, Kevin Lin, Zhengyuan Yang, Jianfeng Wang, Linjie Li, Chung-Ching Lin, Zicheng Liu, Lijuan Wang:
MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning. CoRR abs/2311.17435 (2023) - [i66]Xiaoke Huang, Jianfeng Wang, Yansong Tang, Zheng Zhang, Han Hu, Jiwen Lu, Lijuan Wang, Zicheng Liu:
Segment and Caption Anything. CoRR abs/2312.00869 (2023) - 2022
- [j53]Zhe Gan, Linjie Li, Chunyuan Li, Lijuan Wang, Zicheng Liu, Jianfeng Gao:
Vision-Language Pre-Training: Basics, Recent Advances, and Future Trends. Found. Trends Comput. Graph. Vis. 14(3-4): 163-352 (2022) - [j52]Jianfeng Wang, Zhengyuan Yang, Xiaowei Hu, Linjie Li, Kevin Lin, Zhe Gan, Zicheng Liu, Ce Liu, Lijuan Wang:
GIT: A Generative Image-to-text Transformer for Vision and Language. Trans. Mach. Learn. Res. 2022 (2022) - [j51]Xin Li, Fan Yang, Ao Luo, Zhicheng Jiao, Hong Cheng, Zicheng Liu:
EFRNet: Efficient Feature Reconstructing Network for Real-Time Scene Parsing. IEEE Trans. Multim. 24: 2852-2865 (2022) - [c122]Zhe Gan, Yen-Chun Chen, Linjie Li, Tianlong Chen, Yu Cheng, Shuohang Wang, Jingjing Liu, Lijuan Wang, Zicheng Liu:
Playing Lottery Tickets with Vision and Language. AAAI 2022: 652-660 - [c121]Sheng Liu, Kevin Lin, Lijuan Wang, Junsong Yuan, Zicheng Liu:
OVIS: Open-Vocabulary Visual Instance Search via Visual-Semantic Aligned Representation Learning. AAAI 2022: 1773-1781 - [c120]Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Yumao Lu, Zicheng Liu, Lijuan Wang:
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA. AAAI 2022: 3081-3089 - [c119]Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Xiaoyi Dong, Lu Yuan, Zicheng Liu:
Mobile-Former: Bridging MobileNet and Transformer. CVPR 2022: 5260-5269 - [c118]Zhipeng Huang, Zhizheng Zhang, Cuiling Lan, Wenjun Zeng, Peng Chu, Quanzeng You, Jiang Wang, Zicheng Liu, Zheng-Jun Zha:
Lifelong Unsupervised Domain Adaptive Person Re-identification with Coordinated Anti-forgetting and Adaptation. CVPR 2022: 14268-14277 - [c117]Kevin Lin, Linjie Li, Chung-Ching Lin, Faisal Ahmed, Zhe Gan, Zicheng Liu, Yumao Lu, Lijuan Wang:
SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning. CVPR 2022: 17928-17937 - [c116]Xiaowei Hu, Zhe Gan, Jianfeng Wang, Zhengyuan Yang, Zicheng Liu, Yumao Lu, Lijuan Wang:
Scaling Up Vision-Language Pretraining for Image Captioning. CVPR 2022: 17959-17968 - [c115]Zhiyuan Fang, Jianfeng Wang, Xiaowei Hu, Lin Liang, Zhe Gan, Lijuan Wang, Yezhou Yang, Zicheng Liu:
Injecting Semantic Concepts into End-to-End Image Captioning. CVPR 2022: 17988-17998 - [c114]Zi-Yi Dou, Yichong Xu, Zhe Gan, Jianfeng Wang, Shuohang Wang, Lijuan Wang, Chenguang Zhu, Pengchuan Zhang, Lu Yuan, Nanyun Peng, Zicheng Liu, Michael Zeng:
An Empirical Study of Training End-to-End Vision-and-Language Transformers. CVPR 2022: 18145-18155 - [c113]Chung-Ching Lin, Kevin Lin, Lijuan Wang, Zicheng Liu, Linjie Li:
Crossmodal Representation Learning for Zero-shot Action Recognition. CVPR 2022: 19946-19956 - [c112]Yutong Lin, Chen Li, Yue Cao, Zheng Zhang, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Han Hu:
A Simple Approach and Benchmark for 21, 000-Category Object Detection. ECCV (11) 2022: 1-18 - [c111]Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Faisal Ahmed, Zicheng Liu, Yumao Lu, Lijuan Wang:
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling. ECCV (36) 2022: 521-539 - [c110]Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Ying Jin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos:
Should All Proposals Be Treated Equally in Object Detection? ECCV (25) 2022: 556-572 - [c109]Peng Jin, Xitong Zhang, Yinpeng Chen, Sharon Xiaolei Huang, Zicheng Liu, Youzuo Lin:
Unsupervised Learning of Full-Waveform Inversion: Connecting CNN and Partial Differential Equation in a Loop. ICLR 2022 - [c108]Yinan Feng, Yinpeng Chen, Shihang Feng, Peng Jin, Zicheng Liu, Youzuo Lin:
An Intriguing Property of Geophysics Inversion. ICML 2022: 6434-6446 - [c107]Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, Jianfeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann LeCun, Nanyun Peng, Jianfeng Gao, Lijuan Wang:
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone. NeurIPS 2022 - [c106]Chunyuan Li, Haotian Liu, Liunian Harold Li, Pengchuan Zhang, Jyoti Aneja, Jianwei Yang, Ping Jin, Houdong Hu, Zicheng Liu, Yong Jae Lee, Jianfeng Gao:
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models. NeurIPS 2022 - [c105]Jian Liang, Chenfei Wu, Xiaowei Hu, Zhe Gan, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan:
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis. NeurIPS 2022 - [i65]Peixi Xiong, Quanzeng You, Pei Yu, Zicheng Liu, Ying Wu:
SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question Answering. CoRR abs/2201.10654 (2022) - [i64]Shihang Feng, Peng Jin, Yinpeng Chen, Xitong Zhang, Zicheng Liu, Youzuo Lin:
Exploring Multi-physics with Extremely Weak Supervision. CoRR abs/2202.01770 (2022) - [i63]Ying Jin, Yinpeng Chen, Lijuan Wang, Jianfeng Wang, Pei Yu, Lin Liang, Jenq-Neng Hwang, Zicheng Liu:
The Overlooked Classifier in Human-Object Interaction Recognition. CoRR abs/2203.05676 (2022) - [i62]Shiqi Lin, Zhizheng Zhang, Zhipeng Huang, Yan Lu, Cuiling Lan, Peng Chu, Quanzeng You, Jiang Wang, Zicheng Liu, Amey Parulkar, Viraj Navkal, Zhibo Chen:
Deep Frequency Filtering for Domain Generalization. CoRR abs/2203.12198 (2022) - [i61]Xiangjun Gao, Jiaolong Yang, Jongyoo Kim, Sida Peng, Zicheng Liu, Xin Tong:
MPS-NeRF: Generalizable 3D Human Rendering from Multiview Images. CoRR abs/2203.16875 (2022) - [i60]Chunyuan Li, Haotian Liu, Liunian Harold Li, Pengchuan Zhang, Jyoti Aneja, Jianwei Yang, Ping Jin, Yong Jae Lee, Houdong Hu, Zicheng Liu, Jianfeng Gao:
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models. CoRR abs/2204.08790 (2022) - [i59]Yinan Feng, Yinpeng Chen, Shihang Feng, Peng Jin, Zicheng Liu, Youzuo Lin:
An Intriguing Property of Geophysics Inversion. CoRR abs/2204.13731 (2022) - [i58]Chung-Ching Lin, Kevin Lin, Linjie Li, Lijuan Wang, Zicheng Liu:
Cross-modal Representation Learning for Zero-shot Action Recognition. CoRR abs/2205.01657 (2022) - [i57]Jianfeng Wang, Zhengyuan Yang, Xiaowei Hu, Linjie Li, Kevin Lin, Zhe Gan, Zicheng Liu, Ce Liu, Lijuan Wang:
GIT: A Generative Image-to-text Transformer for Vision and Language. CoRR abs/2205.14100 (2022) - [i56]Quanzeng You, Jiang Wang, Peng Chu, Andre Abrantes, Zicheng Liu:
Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention. CoRR abs/2206.07011 (2022) - [i55]Linjie Li, Zhe Gan, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Ce Liu, Lijuan Wang:
LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling. CoRR abs/2206.07160 (2022) - [i54]Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, Jianfeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann LeCun, Nanyun Peng, Jianfeng Gao, Lijuan Wang:
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone. CoRR abs/2206.07643 (2022) - [i53]Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Jing Yin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos:
Should All Proposals be Treated Equally in Object Detection? CoRR abs/2207.03520 (2022) - [i52]Chenfei Wu, Jian Liang, Xiaowei Hu, Zhe Gan, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan:
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis. CoRR abs/2207.09814 (2022) - [i51]Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu:
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling. CoRR abs/2209.01540 (2022) - [i50]Zhe Gan, Linjie Li, Chunyuan Li, Lijuan Wang, Zicheng Liu, Jianfeng Gao:
Vision-Language Pre-training: Basics, Recent Advances, and Future Trends. CoRR abs/2210.09263 (2022) - [i49]Zixin Zhu, Yixuan Wei, Jianfeng Wang, Zhe Gan, Zheng Zhang, Le Wang, Gang Hua, Lijuan Wang, Zicheng Liu, Han Hu:
Exploring Discrete Diffusion Models for Image Captioning. CoRR abs/2211.11694 (2022) - [i48]