default search action
Kai Chen 0026
陈恺
Person information
- unicode name: 陈恺
- affiliation: Shanghai AI Laboratory, Guangzhou, China
- affiliation: SenseTime Research, Hong Kong
- affiliation (PhD 2019): Chinese University of Hong Kong, MMLab, Hong Kong
Other persons with the same name
- Kai Chen — disambiguation page
- Kai Chen 0001 — University of Science and Technology of China, Department of Electronics Science and Technology, Hefei, China (and 1 more)
- Kai Chen 0002 — University of California at Berkeley, EECS Department, CA, USA
- Kai Chen 0003 — Google, Mountain View, CA, USA (and 1 more)
- Kai Chen 0004 — Xi'an Jiaotong University, School of Electronic and Information Engineering
- Kai Chen 0005 — Hong Kong University of Science and Technology (and 1 more)
- Kai Chen 0006 — Shanghai Jiatong University, Institute of Image Communication and Network Engineering, China
- Kai Chen 0007 — Cisco Systems (and 1 more)
- Kai Chen 0008 — University of Science and Technology of China, Department of Modern Physics
- Kai Chen 0009 — University of Science and Technology of China, Department of Computer Science
- Kai Chen 0010 — Google (and 2 more)
- Kai Chen 0011 — University of Fribourg, Switzerland
- Kai Chen 0012 — Chinese Academy of Sciences, Institute of Information Engineering, SKLOIS, Beijing, China
- Kai Chen 0013 — Qualcomm, Wireless R&D Department, Beijing, China (and 2 more)
- Kai Chen 0014 — China University of Geosciences, School of Geophysics and Information Technology, Beijing, China
- Kai Chen 0015 — University of Southern California, Dept. of Industrial and Systems Engineering
- Kai Chen 0016 — Huazhong University of Science and Technology, School of Computer Science and Technology, Services Computing Technology and System Lab / Cluster and Grid Computing Lab, China
- Kai Chen 0017 — Zhejiang University
- Kai Chen 0018 — University of Electronic Science and Technology of China, School of Automation Engineering, Chengdu, China
- Kai Chen 0019 — Xiamen University, China
- Kai Chen 0020 — National University of Defense Technology, National Laboratory for Parallel and Distributed Processing, Changsha, China
- Kai Chen 0021 — Wuhan University of Technology, Laboratory of Intelligent Manufacture and Control, China
- Kai Chen 0022 — University of Arizona, Department of Electrical and Computer Engineering, Tucson, AZ, USA
- Kai Chen 0023 — Huazhong University of Science and Technology, School of Automation, National Key Laboratory of Science and Technology on Multi-spectral Information Processing, Wuhan, China
- Kai Chen 0024 — Wuhan University, School of Remote Sensing and Information Engineering, Wuhan, China
- Kai Chen 0025 — BUPT, School of Information and Communication Engineering, Beijing, China
- Kai Chen 0027 — Fudan University, School of Computer Science, Shanghai, China
- Kai Chen 0028 — Chinese University of Hong Kong, Department of Computer Science and Engineering, Hong Kong
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j3]Yuan Liu, Songyang Zhang, Jiacheng Chen, Kai Chen, Dahua Lin:
PixMIM: Rethinking Pixel Reconstruction in Masked Image Modeling. Trans. Mach. Learn. Res. 2024 (2024) - [c45]Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen:
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark. ACL (Findings) 2024: 6884-6915 - [c44]Ziwei Ji, Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen:
ANAH: Analytical Annotation of Hallucinations in Large Language Models. ACL (1) 2024: 8135-8158 - [c43]Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao:
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models. ACL (Findings) 2024: 9354-9366 - [c42]Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, Miao Zheng, Jingming Zhuo, Songyang Zhang, Dahua Lin, Kai Chen, Feng Zhao:
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step. ACL (1) 2024: 9510-9529 - [c41]Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang:
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation. CVPR 2024: 1491-1500 - [c40]Junshu Tang, Yanhong Zeng, Ke Fan, Xuheng Wang, Bo Dai, Kai Chen, Lizhuang Ma:
Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text. CVPR 2024: 6243-6253 - [c39]Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy:
Towards Language-Driven Video Inpainting via Multimodal Large Language Models. CVPR 2024: 12501-12511 - [c38]Tai Wang, Xiaohan Mao, Chenming Zhu, Runsen Xu, Ruiyuan Lyu, Peisen Li, Xiao Chen, Wenwei Zhang, Kai Chen, Tianfan Xue, Xihui Liu, Cewu Lu, Dahua Lin, Jiangmiao Pang:
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI. CVPR 2024: 19757-19767 - [c37]Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy:
OMG-Seg: Is One Model Good Enough for all Segmentation? CVPR 2024: 27948-27959 - [c36]Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, Xuming He:
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models. CVPR 2024: 28076-28086 - [c35]Qinyuan Cheng, Tianxiang Sun, Xiangyang Liu, Wenwei Zhang, Zhangyue Yin, Shimin Li, Linyang Li, Zhengfu He, Kai Chen, Xipeng Qiu:
Can AI Assistants Know What They Don't Know? ICML 2024 - [c34]Haodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, Songyang Zhang, Dahua Lin, Kai Chen:
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues. NAACL-HLT (Findings) 2024: 3184-3200 - [c33]Chonghua Wang, Haodong Duan, Songyang Zhang, Dahua Lin, Kai Chen:
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks. NAACL-HLT 2024: 3712-3724 - [i109]Haobo Yuan, Xiangtai Li, Chong Zhou, Yining Li, Kai Chen, Chen Change Loy:
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively. CoRR abs/2401.02955 (2024) - [i108]Huanjun Kong, Songyang Zhang, Kai Chen:
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance. CoRR abs/2401.08772 (2024) - [i107]Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy:
Towards Language-Driven Video Inpainting via Multimodal Large Language Models. CoRR abs/2401.10226 (2024) - [i106]Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang:
RAP-SAM: Towards Real-Time All-Purpose Segment Anything. CoRR abs/2401.10228 (2024) - [i105]Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy:
OMG-Seg: Is One Model Good Enough For All Segmentation? CoRR abs/2401.10229 (2024) - [i104]Qinyuan Cheng, Tianxiang Sun, Xiangyang Liu, Wenwei Zhang, Zhangyue Yin, Shimin Li, Linyang Li, Zhengfu He, Kai Chen, Xipeng Qiu:
Can AI Assistants Know What They Don't Know? CoRR abs/2401.13275 (2024) - [i103]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model. CoRR abs/2401.16420 (2024) - [i102]Huaiyuan Ying, Shuo Zhang, Linyang Li, Zhejian Zhou, Yunfan Shao, Zhaoye Fei, Yichuan Ma, Jiawei Hong, Kuikun Liu, Ziyi Wang, Yudong Wang, Zijian Wu, Shuaibin Li, Fengzhe Zhou, Hongwei Liu, Songyang Zhang, Wenwei Zhang, Hang Yan, Xipeng Qiu, Jiayu Wang, Kai Chen, Dahua Lin:
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning. CoRR abs/2402.06332 (2024) - [i101]Tian Lan, Wenwei Zhang, Chen Xu, Heyan Huang, Dahua Lin, Kai Chen, Xianling Mao:
CriticBench: Evaluating Large Language Models as Critic. CoRR abs/2402.13764 (2024) - [i100]Bowen Li, Wenhan Wu, Ziwei Tang, Lin Shi, John Yang, Jinyang Li, Shunyu Yao, Chen Qian, Binyuan Hui, Qicheng Zhang, Zhiyin Yu, He Du, Ping Yang, Dahua Lin, Chao Peng, Kai Chen:
DevBench: A Comprehensive Benchmark for Software Development. CoRR abs/2403.08604 (2024) - [i99]Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao:
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models. CoRR abs/2403.12881 (2024) - [i98]Junshu Tang, Yanhong Zeng, Ke Fan, Xuheng Wang, Bo Dai, Kai Chen, Lizhuang Ma:
Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text. CoRR abs/2403.16897 (2024) - [i97]Lingdong Kong, Xiang Xu, Jun Cen, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu:
Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding. CoRR abs/2403.17010 (2024) - [i96]Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, Fukai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang, Jiaqi Wang, Jiayu Wang, Rui Wang, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng, Fan Wu, Yingtong Xiong, et al.:
InternLM2 Technical Report. CoRR abs/2403.17297 (2024) - [i95]Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, Xuming He:
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models. CoRR abs/2404.00906 (2024) - [i94]Chonghua Wang, Haodong Duan, Songyang Zhang, Dahua Lin, Kai Chen:
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks. CoRR abs/2404.06480 (2024) - [i93]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024) - [i92]Jiahao Wang, Wenqi Shao, Mengzhao Chen, Chengyue Wu, Yong Liu, Kaipeng Zhang, Songyang Zhang, Kai Chen, Ping Luo:
Adapting LLaMA Decoder to Vision Transformer. CoRR abs/2404.06773 (2024) - [i91]Lingdong Kong, Xiang Xu, Jiawei Ren, Wenwei Zhang, Liang Pan, Kai Chen, Wei Tsang Ooi, Ziwei Liu:
Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving. CoRR abs/2405.05258 (2024) - [i90]Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingwei Wang, Yinpeng Dong, Bo Yang, Shengyin Jiang, Zeliang Ma, Dengyi Ji, Haiwen Li, Xingliang Huang, Yu Tian, Genghua Kou, Fan Jia, Yingfei Liu, Tiancai Wang, Ying Li, Xiaoshuai Hao, Yifan Yang, Hui Zhang, Mengchuan Wei, Yi Zhou, Haimei Zhao, Jing Zhang, Jinke Li, Xiao He, Xiaoqiang Cheng, Bingyang Zhang, Lirong Zhao, Dianlei Ding, Fangsheng Liu, Yixiang Yan, Hongming Wang, Nanfei Ye, Lun Luo, Yubo Tian, Yiwei Zuo, Zhe Cao, Yi Ren, Yunfan Li, Wenjie Liu, Xun Wu, Yifan Mao, Ming Li, Jian Liu, Jiayang Liu, Zihan Qin, Cunxi Chu, Jialei Xu, Wenbo Zhao, Junjun Jiang, Xianming Liu, Ziyan Wang, Chiwei Li, Shilong Li, Chendong Yuan, Songyue Yang, Wentao Liu, Peng Chen, Bin Zhou, Yubo Wang, Chi Zhang, Jianhang Sun, Hai Chen, Xiao Yang, Lizhong Wang, Dongyi Fu, Yongchun Lin, Huitong Yang, Haoang Li, Yadan Luo, Xianjing Cheng, Yong Xu:
The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition. CoRR abs/2405.08816 (2024) - [i89]Kai Hu, Weichen Yu, Tianjun Yao, Xiang Li, Wenhe Liu, Lijun Yu, Yining Li, Kai Chen, Zhiqiang Shen, Matt Fredrikson:
Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization. CoRR abs/2405.09113 (2024) - [i88]Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen:
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark. CoRR abs/2405.12209 (2024) - [i87]Jiahao Sun, Chunmei Qing, Xiang Xu, Lingdong Kong, Youquan Liu, Li Li, Chenming Zhu, Jingwei Zhang, Zeqi Xiao, Runnan Chen, Tai Wang, Wenwei Zhang, Kai Chen:
An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models. CoRR abs/2405.14870 (2024) - [i86]Shaoyuan Xie, Lingdong Kong, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu:
Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving. CoRR abs/2405.17426 (2024) - [i85]Zifan Song, Yudong Wang, Wenwei Zhang, Kuikun Liu, Chengqi Lyu, Demin Song, Qipeng Guo, Hang Yan, Dahua Lin, Kai Chen, Cairong Zhao:
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data. CoRR abs/2405.19265 (2024) - [i84]Ziwei Ji, Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen:
ANAH: Analytical Annotation of Hallucinations in Large Language Models. CoRR abs/2405.20315 (2024) - [i83]Huaiyuan Ying, Zijian Wu, Yihan Geng, Jiayu Wang, Dahua Lin, Kai Chen:
Lean Workbook: A large-scale Lean problem set formalized from natural language math problems. CoRR abs/2406.03847 (2024) - [i82]Xin Jin, Chunle Guo, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun, Xiangyu Kong, Xiaoxia Xing, Jinlong Wu, Yuanyang Xue, Hyunhee Park, Sejun Song, Changho Kim, Jingfan Tan, Wenhan Luo, Zikun Liu, Mingde Qiao, Junjun Jiang, Kui Jiang, Yao Xiao, Chuyang Sun, Jinhui Hu, Weijian Ruan, Yubo Dong, Kai Chen, Hyejeong Jo, Jiahao Qin, Bingjie Han, Pinle Qin, Rui Chai, Pengyuan Wang:
MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results. CoRR abs/2406.07006 (2024) - [i81]Baiang Li, Sizhuo Ma, Yanhong Zeng, Xiaogang Xu, Youqing Fang, Zhao Zhang, Jian Wang, Kai Chen:
Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior. CoRR abs/2406.09389 (2024) - [i80]Xinyu Fang, Kangrui Mao, Haodong Duan, Xiangyu Zhao, Yining Li, Dahua Lin, Kai Chen:
MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding. CoRR abs/2406.14515 (2024) - [i79]Yuxuan Qiao, Haodong Duan, Xinyu Fang, Junming Yang, Lin Chen, Songyang Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs. CoRR abs/2406.14544 (2024) - [i78]Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Maosong Cao, Fengzhe Zhou, Yining Li, Wenwei Zhang, Dahua Lin, Kai Chen, Jidong Ge:
InternLM-Law: An Open Source Chinese Legal Large Language Model. CoRR abs/2406.14887 (2024) - [i77]Jianzong Wu, Xiangtai Li, Yanhong Zeng, Jiangning Zhang, Qianyu Zhou, Yining Li, Yunhai Tong, Kai Chen:
MotionBooth: Motion-Aware Customized Text-to-Video Generation. CoRR abs/2406.17758 (2024) - [i76]Xiangyu Zhao, Xiangtai Li, Haodong Duan, Haian Huang, Yining Li, Kai Chen, Hua Yang:
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning. CoRR abs/2406.17770 (2024) - [i75]Yanan Sun, Yanchen Liu, Yinhao Tang, Wenjie Pei, Kai Chen:
AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation. CoRR abs/2406.18958 (2024) - [i74]Yicheng Chen, Xiangtai Li, Yining Li, Yanhong Zeng, Jianzong Wu, Xiangyu Zhao, Kai Chen:
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language. CoRR abs/2406.20085 (2024) - [i73]Junyao Gao, Yanchen Liu, Yanan Sun, Yinhao Tang, Yanhong Zeng, Kai Chen, Cairong Zhao:
StyleShot: A Snapshot on Any Style. CoRR abs/2407.01414 (2024) - [i72]Chenming Zhu, Tai Wang, Wenwei Zhang, Kai Chen, Xihui Liu:
ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities. CoRR abs/2407.01525 (2024) - [i71]Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output. CoRR abs/2407.03320 (2024) - [i70]Yuzhe Gu, Ziwei Ji, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen:
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models. CoRR abs/2407.04693 (2024) - [i69]Xiang Xu, Lingdong Kong, Hui Shuai, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Qingshan Liu:
4D Contrastive Superflows are Dense 3D Representation Learners. CoRR abs/2407.06190 (2024) - [i68]Zhening Xing, Gereon Fox, Yanhong Zeng, Xingang Pan, Mohamed Elgharib, Christian Theobalt, Kai Chen:
Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models. CoRR abs/2407.08701 (2024) - [i67]Songyang Zhang, Chuyu Zhang, Yingfan Hu, Haowen Shen, Kuikun Liu, Zerun Ma, Fengzhe Zhou, Wenwei Zhang, Xuming He, Dahua Lin, Kai Chen:
CIBench: Evaluating Your LLMs with a Code Interpreter Plugin. CoRR abs/2407.10499 (2024) - [i66]Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models. CoRR abs/2407.11691 (2024) - [i65]Zijian Wu, Jiayu Wang, Dahua Lin, Kai Chen:
LEAN-GitHub: Compiling GitHub LEAN repositories for a versatile LEAN prover. CoRR abs/2407.17227 (2024) - [i64]Zhenzhi Wang, Yixuan Li, Yanhong Zeng, Youqing Fang, Yuwei Guo, Wenran Liu, Jing Tan, Kai Chen, Tianfan Xue, Bo Dai, Dahua Lin:
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation. CoRR abs/2407.17438 (2024) - [i63]Zehui Chen, Kuikun Liu, Qiuchen Wang, Jiangning Liu, Wenwei Zhang, Kai Chen, Feng Zhao:
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher. CoRR abs/2407.20183 (2024) - [i62]Zhi Chen, Qiguang Chen, Libo Qin, Qipeng Guo, Haijun Lv, Yicheng Zou, Wanxiang Che, Hang Yan, Kai Chen, Dahua Lin:
What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices. CoRR abs/2409.01893 (2024) - 2023
- [c32]Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr:
Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation. AAAI 2023: 3222-3230 - [c31]Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Chengqi Lyu, Wenwei Zhang, Ping Luo, Kai Chen:
Dense Distinct Query for End-to-End Object Detection. CVPR 2023: 7329-7338 - [c30]Jiahao Wang, Songyang Zhang, Yong Liu, Taiqiang Wu, Yujiu Yang, Xihui Liu, Kai Chen, Ping Luo, Dahua Lin:
RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer. CVPR 2023: 14443-14452 - [c29]Yuan Liu, Songyang Zhang, Jiacheng Chen, Zhaohui Yu, Kai Chen, Dahua Lin:
Improving Pixel-based MIM by Reducing Wasted Modeling Capability. ICCV 2023: 5338-5349 - [c28]Lingdong Kong, Youquan Liu, Xin Li, Runnan Chen, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu:
Robo3D: Towards Robust and Reliable 3D Perception against Corruptions. ICCV 2023: 19937-19949 - [c27]Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen:
TG-VQA: Ternary Game of Video Question Answering. IJCAI 2023: 1044-1052 - [c26]Youquan Liu, Lingdong Kong, Jun Cen, Runnan Chen, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu:
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models. NeurIPS 2023 - [i61]Yuan Liu, Songyang Zhang, Jiacheng Chen, Kai Chen, Dahua Lin:
PixMIM: Rethinking Pixel Reconstruction in Masked Image Modeling. CoRR abs/2303.02416 (2023) - [i60]Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr:
Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation. CoRR abs/2303.06345 (2023) - [i59]Tao Jiang, Peng Lu, Li Zhang, Ningsheng Ma, Rui Han, Chengqi Lyu, Yining Li, Kai Chen:
RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose. CoRR abs/2303.07399 (2023) - [i58]Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Chengqi Lyu, Wenwei Zhang, Ping Luo, Kai Chen:
Dense Distinct Query for End-to-End Object Detection. CoRR abs/2303.12776 (2023) - [i57]Lingdong Kong, Youquan Liu, Xin Li, Runnan Chen, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu:
Robo3D: Towards Robust and Reliable 3D Perception against Corruptions. CoRR abs/2303.17597 (2023) - [i56]Jiahao Wang, Songyang Zhang, Yong Liu, Taiqiang Wu, Yujiu Yang, Xihui Liu, Kai Chen, Ping Luo, Dahua Lin:
RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer. CoRR abs/2304.05659 (2023) - [i55]Shaoyuan Xie, Lingdong Kong, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu:
RoboBEV: Towards Robust Bird's Eye View Perception under Corruptions. CoRR abs/2304.06719 (2023) - [i54]Xiangtai Li, Henghui Ding, Wenwei Zhang, Haobo Yuan, Jiangmiao Pang, Guangliang Cheng, Kai Chen, Ziwei Liu, Chen Change Loy:
Transformer-Based Visual Segmentation: A Survey. CoRR abs/2304.09854 (2023) - [i53]Tao Gong, Chengqi Lyu, Shilong Zhang, Yudong Wang, Miao Zheng, Qian Zhao, Kuikun Liu, Wenwei Zhang, Ping Luo, Kai Chen:
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans. CoRR abs/2305.04790 (2023) - [i52]Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen:
TG-VQA: Ternary Game of Video Question Answering. CoRR abs/2305.10049 (2023) - [i51]Youquan Liu, Lingdong Kong, Jun Cen, Runnan Chen, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu:
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models. CoRR abs/2306.09347 (2023) - [i50]Shilong Zhang, Peize Sun, Shoufa Chen, Min Xiao, Wenqi Shao, Wenwei Zhang, Kai Chen, Ping Luo:
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest. CoRR abs/2307.03601 (2023) - [i49]Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin:
MMBench: Is Your Multi-modal Model an All-around Player? CoRR abs/2307.06281 (2023) - [i48]Yuan Liu, Songyang Zhang, Jiacheng Chen, Zhaohui Yu, Kai Chen, Dahua Lin:
Improving Pixel-based MIM by Reducing Wasted Modeling Capability. CoRR abs/2308.00261 (2023) - [i47]Wangbo Zhao, Kepan Nan, Songyang Zhang, Kai Chen, Dahua Lin, Yang You:
Learning Referring Video Object Segmentation from Weak Annotation. CoRR abs/2308.02162 (2023) - [i46]Chenming Zhu, Wenwei Zhang, Tai Wang, Xihui Liu, Kai Chen:
Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection. CoRR abs/2309.09456 (2023) - [i45]Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Shuangrui Ding, Songyang Zhang, Haodong Duan, Wenwei Zhang, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition. CoRR abs/2309.15112 (2023) - [i44]Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Songyang Zhang, Kai Chen, Zongwen Shen, Jidong Ge:
LawBench: Benchmarking Legal Knowledge of Large Language Models. CoRR abs/2309.16289 (2023) - [i43]Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yining Li, Guangliang Cheng, Yunhai Tong, Kai Chen, Chen Change Loy:
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection. CoRR abs/2310.01393 (2023) - [i42]Qinyuan Cheng, Tianxiang Sun, Wenwei Zhang, Siyin Wang, Xiangyang Liu, Mozhi Zhang, Junliang He, Mianqiu Huang, Zhangyue Yin, Kai Chen, Xipeng Qiu:
Evaluating Hallucinations in Chinese Large Language Models. CoRR abs/2310.03368 (2023) - [i41]Haodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, Songyang Zhang, Dahua Lin, Kai Chen:
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues. CoRR abs/2310.13650 (2023) - [i40]Junhao Zhuang, Yanhong Zeng, Wenran Liu, Chun Yuan, Kai Chen:
A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting. CoRR abs/2312.03594 (2023) - [i39]Zeming Chen, Wenwei Zhang, Xinjiang Wang, Kai Chen, Zhi Wang:
Mixed Pseudo Labels for Semi-Supervised Object Detection. CoRR abs/2312.07006 (2023) - [i38]Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang:
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation. CoRR abs/2312.07526 (2023) - [i37]Yiming Zhang, Zhening Xing, Yanhong Zeng, Youqing Fang, Kai Chen:
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models. CoRR abs/2312.13964 (2023) - [i36]Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, Miao Zheng, Jingming Zhuo, Songyang Zhang, Dahua Lin, Kai Chen, Feng Zhao:
T-Eval: Evaluating the Tool Utilization Capability Step by Step. CoRR abs/2312.14033 (2023) - [i35]Tai Wang, Xiaohan Mao, Chenming Zhu, Runsen Xu, Ruiyuan Lyu, Peisen Li, Xiao Chen, Wenwei Zhang, Kai Chen, Tianfan Xue, Xihui Liu, Cewu Lu, Dahua Lin, Jiangmiao Pang:
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI. CoRR abs/2312.16170 (2023) - 2022
- [j2]Jiaqi Wang, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin:
CARAFE++: Unified Content-Aware ReAssembly of FEatures. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 4674-4687 (2022) - [c25]Xin Jin, Chunle Guo, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun, Xiangyu Kong, Xiaoxia Xing, Jinlong Wu, Yuanyang Xue, Hyunhee Park, Sejun Song, Changho Kim, Jingfan Tan, Wenhan Luo, Zikun Liu, Mingde Qiao, Junjun Jiang, Kui Jiang, Yao Xiao, Chuyang Sun, Jinhui Hu, Weijian Ruan, Yubo Dong, Kai Chen, Hyejeong Jo, Jiahao Qin, Bingjie Han, Pinle Qin, Rui Chai, Pengyuan Wang:
MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results. CVPR Workshops 2022: 1153-1161 - [c24]Haodong Duan, Yue Zhao, Kai Chen, Dahua Lin, Bo Dai:
Revisiting Skeleton-based Action Recognition. CVPR 2022: 2959-2968 - [c23]