


default search action
Haodong Duan
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[j1]Mo Li, Songyang Zhang, Taolin Zhang, Haodong Duan, Yunxin Liu, Kai Chen:
NeedleBench: Evaluating LLM Retrieval and Reasoning Across Varying Information Densities. Trans. Mach. Learn. Res. 2025 (2025)
[c31]Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Duan, Wenwei Zhang, Kai Chen, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model. ACL (Findings) 2025: 6547-6563
[c30]Zicheng Zhang, Xiangyu Zhao, Xinyu Fang, Chunyi Li, Xiaohong Liu, Xiongkuo Min, Haodong Duan, Kai Chen, Guangtao Zhai:
Redundancy Principles for MLLMs Benchmarks. ACL (1) 2025: 12492-12504
[c29]Xiangyu Zhao, Shengyuan Ding, Zicheng Zhang, Haian Huang, Maosongcao Maosongcao, Jiaqi Wang, Weiyun Wang, Xinyu Fang, Wenhai Wang, Guangtao Zhai, Hua Yang, Haodong Duan, Kai Chen:
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference. ACL (1) 2025: 18490-18515
[c28]Yubo Ma, Jinsong Li, Yuhang Zang, Xiaobao Wu, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Jiaqi Wang, Yixin Cao, Aixin Sun:
Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings. ACL (Findings) 2025: 19568-19580
[c27]Maosongcao Maosongcao, Taolin Zhang, Mo Li, Chuyu Zhang, Yunxin Liu, Conghui He, Haodong Duan, Songyang Zhang, Kai Chen:
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement. ACL (1) 2025: 22392-22412
[c26]Chunyi Li, Yuan Tian, Xiaoyue Ling, Zicheng Zhang, Haodong Duan, Haoning Wu, Ziheng Jia, Xiaohong Liu, Xiongkuo Min, Guo Lu, Weisi Lin, Guangtao Zhai:
Image Quality Assessment: From Human to Machine Preference. CVPR 2025: 7570-7581
[c25]Junbo Niu, Yifei Li, Ziyang Miao, Chunjiang Ge, Yuanhang Zhou, Qihao He, Xiaoyi Dong, Haodong Duan, Shuangrui Ding, Rui Qian, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang:
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? CVPR 2025: 18902-18913
[c24]Ziyu Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Conghui He, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models. ICLR 2025
[c23]Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jiaqi Wang, Xipeng Qiu, Dahua Lin:
VideoRoPE: What Makes for Good Video Rotary Position Embedding? ICML 2025
[i63]Beichen Zhang, Yuhong Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Haodong Duan, Yuhang Cao, Dahua Lin, Jiaqi Wang:
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning. CoRR abs/2501.03226 (2025)
[i62]Yifei Li, Junbo Niu, Ziyang Miao, Chunjiang Ge, Yuanhang Zhou, Qihao He, Xiaoyi Dong, Haodong Duan, Shuangrui Ding, Rui Qian, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang
:
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? CoRR abs/2501.05510 (2025)
[i61]Maosong Cao, Taolin Zhang, Mo Li
, Chuyu Zhang, Yunxin Liu, Haodong Duan, Songyang Zhang, Kai Chen:
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement. CoRR abs/2501.12273 (2025)
[i60]Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Duan, Wenwei Zhang, Kai Chen, Dahua Lin, Jiaqi Wang
:
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model. CoRR abs/2501.12368 (2025)
[i59]Zicheng Zhang, Xiangyu Zhao, Xinyu Fang, Chunyi Li, Xiaohong Liu
, Xiongkuo Min, Haodong Duan, Kai Chen, Guangtao Zhai:
Redundancy Principles for MLLMs Benchmarks. CoRR abs/2501.13953 (2025)
[i58]Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jiaqi Wang
, Xipeng Qiu, Dahua Lin:
VideoRoPE: What Makes for Good Video Rotary Position Embedding? CoRR abs/2502.05173 (2025)
[i57]Xiangyu Zhao, Shengyuan Ding, Zicheng Zhang, Haian Huang, Maosong Cao, Weiyun Wang, Jiaqi Wang
, Xinyu Fang, Wenhai Wang, Guangtao Zhai, Haodong Duan, Hua Yang, Kai Chen:
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference. CoRR abs/2502.18411 (2025)
[i56]Ziyu Liu, Zeyi Sun, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang
:
Visual-RFT: Visual Reinforcement Fine-Tuning. CoRR abs/2503.01785 (2025)
[i55]Chunyi Li, Yuan Tian, Xiaoyue Ling, Zicheng Zhang, Haodong Duan, Haoning Wu, Ziheng Jia, Xiaohong Liu, Xiongkuo Min, Guo Lu, Weisi Lin, Guangtao Zhai:
Image Quality Assessment: From Human to Machine Preference. CoRR abs/2503.10078 (2025)
[i54]Chunyi Li, Xiaozhe Li, Zicheng Zhang, Yuan Tian, Ziheng Jia, Xiaohong Liu, Xiongkuo Min, Jia Wang, Haodong Duan, Kai Chen, Guangtao Zhai:
Information Density Principle for MLLM Benchmarks. CoRR abs/2503.10079 (2025)
[i53]Weiyun Wang, Zhangwei Gao, Lianjie Chen, Zhe Chen, Jinguo Zhu, Xiangyu Zhao, Yangzhou Liu, Yue Cao, Shenglong Ye, Xizhou Zhu, Lewei Lu, Haodong Duan, Yu Qiao, Jifeng Dai, Wenhai Wang:
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning. CoRR abs/2503.10291 (2025)
[i52]Xinyu Fang, Zhijian Chen, Kai Lan, Lixin Ma, Shengyuan Ding, Yingji Liang, Xiangyu Zhao, Farong Wen, Zicheng Zhang, Guofeng Zhang, Haodong Duan, Kai Chen, Dahua Lin:
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM. CoRR abs/2503.14478 (2025)
[i51]Kexian Tang, Junyao Gao, Yanhong Zeng, Haodong Duan, Yanan Sun, Zhening Xing, Wenran Liu, Kaifeng Lyu, Kai Chen:
LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning? CoRR abs/2503.19990 (2025)
[i50]Xiangyu Zhao, Peiyuan Zhang, Kexian Tang, Hao Li, Zicheng Zhang, Guangtao Zhai, Junchi Yan, Hua Yang, Xue Yang, Haodong Duan:
Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing. CoRR abs/2504.02826 (2025)
[i49]Shengyuan Ding, Shenxi Wu, Xiangyu Zhao, Yuhang Zang, Haodong Duan, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Dahua Lin, Jiaqi Wang
:
MM-IFEngine: Towards Multimodal Instruction Following. CoRR abs/2504.07957 (2025)
[i48]Siqi Li, Yufan Shen, Xiangnan Chen, Jiayi Chen, Hengwei Ju, Haodong Duan, Song Mao, Hongbin Zhou, Bo Zhang, Bin Fu, Pinlong Cai, Licheng Wen, Botian Shi, Yong Liu, Xinyu Cai, Yu Qiao:
GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling. CoRR abs/2505.00063 (2025)
[i47]Ziyu Liu, Yuhang Zang, Yushan Zou, Zijian Liang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang:
Visual Agentic Reinforcement Fine-Tuning. CoRR abs/2505.14246 (2025)
[i46]Sihan Yang, Runsen Xu, Yiman Xie, Sizhe Yang, Mo Li
, Jingli Lin, Chenming Zhu, Xiaochen Chen, Haodong Duan, Xiangyu Yue, Dahua Lin, Tai Wang, Jiangmiao Pang:
MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence. CoRR abs/2505.23764 (2025)
[i45]Junying Wang, Wenzhe Li, Yalun Wu, Yingji Liang, Yijin Guo, Chunyi Li, Haodong Duan, Zicheng Zhang, Guangtao Zhai:
Affordance Benchmark for MLLMs. CoRR abs/2506.00893 (2025)
[i44]Xiaorong Zhu, Ziheng Jia, Jiarui Wang, Xiangyu Zhao, Haodong Duan, Xiongkuo Min, Jia Wang, Zicheng Zhang, Guangtao Zhai:
GOBench: Benchmarking Geometric Optics Generation and Understanding of MLLMs. CoRR abs/2506.00991 (2025)
[i43]Yubo Ma, Jinsong Li, Yuhang Zang, Xiaobao Wu, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Jiaqi Wang
, Yixin Cao, Aixin Sun:
Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings. CoRR abs/2506.04997 (2025)
[i42]Xiaozhe Li, Jixuan Chen, Xinyu Fang, Shengyuan Ding, Haodong Duan, Qingwen Liu, Kai Chen:
OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems. CoRR abs/2506.10764 (2025)
[i41]Xuehui Wang, Zhenyu Wu, JingJing Xie, Zichen Ding, Bowen Yang, Zehao Li, Zhaoyang Liu, Qingyun Li, Xuan Dong, Zhe Chen, Weiyun Wang, Xiangyu Zhao, Jixuan Chen, Haodong Duan, Tianbao Xie, Chenyu Yang, Shiqian Su, Yue Yu, Yuan Huang, Yiqian Liu, Xiao Zhang, Yanting Zhang, Xiangyu Yue, Weijie Su, Xizhou Zhu, Wei Shen, Jifeng Dai, Wenhai Wang:
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents. CoRR abs/2507.19478 (2025)
[i40]Lei Bai, Zhongrui Cai, Yuhang Cao, Maosong Cao, Weihan Cao, Chiyu Chen, Haojiong Chen, Kai Chen, Pengcheng Chen, Ying Chen, Yongkang Chen, Yu Cheng, Pei Chu, Tao Chu, Erfei Cui, Ganqu Cui, Long Cui, Ziyun Cui, Nianchen Deng, Ning Ding, Nanqing Dong, Peijie Dong, Shihan Dou, Sinan Du, Haodong Duan, Caihua Fan, Ben Gao, Changjiang Gao, Jianfei Gao, Songyang Gao, Yang Gao, Zhangwei Gao, Jiaye Ge, Qiming Ge, Lixin Gu, Yuzhe Gu, Aijia Guo, Qipeng Guo, Xu Guo, Conghui He, Junjun He, Yili Hong, Siyuan Hou, Caiyu Hu, Hanglei Hu, Jucheng Hu, Ming Hu, Zhouqi Hua, Haian Huang, Junhao Huang, Xu Huang, Zixian Huang, Zhe Jiang, Lingkai Kong, Linyang Li, Peiji Li, Pengze Li, Shuaibin Li, Tianbin Li, Wei Li, Yuqiang Li, Dahua Lin, Junyao Lin, Tianyi Lin, Zhishan Lin, Hongwei Liu, Jiangning Liu, Jiyao Liu, Junnan Liu, Kai Liu, Kaiwen Liu, Kuikun Liu, Shichun Liu, Shudong Liu, Wei Liu, Xinyao Liu, Yuhong Liu, Zhan Liu, Yinquan Lu, Haijun Lv, Hongxia Lv, Huijie Lv, Qitan Lv, Ying Lv, Chengqi Lyu, Chenglong Ma, Jianpeng Ma, Ren Ma, Runmin Ma, Runyuan Ma, Xinzhu Ma, Yichuan Ma, Zihan Ma, Sixuan Mi, Junzhi Ning, Wenchang Ning, Xinle Pang, Jiahui Peng, Runyu Peng
, Yu Qiao:
Intern-S1: A Scientific Multimodal Foundation Model. CoRR abs/2508.15763 (2025)
[i39]Weiyun Wang, Zhangwei Gao, Lixin Gu, Hengjun Pu, Long Cui, Xingguang Wei, Zhaoyang Liu, Linglin Jing, Shenglong Ye, Jie Shao, Zhaokai Wang, Zhe Chen, Hongjie Zhang, Ganlin Yang, Haomin Wang
, Qi Wei, Jinhui Yin, Wenhao Li, Erfei Cui, Guanzhou Chen, Zichen Ding, Changyao Tian, Zhenyu Wu, JingJing Xie, Zehao Li, Bowen Yang, Yuchen Duan, Xuehui Wang, Zhi Hou, Haoran Hao, Tianyi Zhang
, Songze Li, Xiangyu Zhao, Haodong Duan, Nianchen Deng, Bin Fu, Yinan He, Yi Wang, Conghui He, Botian Shi, Junjun He, Yingtong Xiong, Han Lv, Lijun Wu, Wenqi Shao, Kaipeng Zhang, Huipeng Deng, Biqing Qi, Jiaye Ge, Qipeng Guo, Wenwei Zhang, Songyang Zhang, Maosong Cao, Junyao Lin, Kexian Tang, Jianfei Gao, Haian Huang, Yuzhe Gu, Chengqi Lyu, Huanze Tang, Rui Wang, Haijun Lv, Wanli Ouyang, Limin Wang, Min Dou, Xizhou Zhu, Tong Lu, Dahua Lin, Jifeng Dai, Weijie Su, Bowen Zhou, Kai Chen, Yu Qiao, Wenhai Wang, Gen Luo:
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency. CoRR abs/2508.18265 (2025)
[i38]Ming Hu, Chenglong Ma, Wei Li, Wanghan Xu, Jiamin Wu, Jucheng Hu, Tianbin Li, Guohang Zhuang, Jiaqi Liu, Yingzhou Lu, Ying Chen, Chaoyang Zhang, Cheng Tan, Jie Ying, Guocheng Wu, Shujian Gao, Pengcheng Chen, Jiashi Lin, Haitao Wu, Lulu Chen, Fengxiang Wang, Yuanyuan Zhang, Xiangyu Zhao, Feilong Tang, Encheng Su, Junzhi Ning, Xinyao Liu, Ye Du, Changkai Ji, Cheng Tang, Huihui Xu, Ziyang Chen, Ziyan Huang, Jiyao Liu, Pengfei Jiang, Yizhou Wang, Chen Tang, Jianyu Wu, Yuchen Ren, Siyuan Yan, Zhonghua Wang, Zhongxing Xu, Shiyan Su, Shangquan Sun, Runkai Zhao, Zhisheng Zhang, Yu Liu, Fudi Wang, Yuanfeng Ji, Yanzhou Su, Hongming Shan, Chun-Mei Feng, Jiahao Xu, Jiangtao Yan, Wenhao Tang, Diping Song, Lihao Liu, Yanyan Huang, Lequan Yu, Bin Fu, Shujun Wang, Xiaomeng Li, Xiaowei Hu, Yun Gu, Ben Fei, Zhongying Deng, Benyou Wang, Yuewen Cao, Minjie Shen, Haodong Duan, Jie Xu, Yirong Chen, Fang Yan, Hongxia Hao, Jielan Li, Jiajun Du, Yanbo Wang, Imran Razzak, Chi Zhang, Lijun Wu, Conghui He, Zhaohui Lu, Jinhai Huang, Yihao Liu, Fenghua Ling, Yuqiang Li, Aoran Wang, Qihao Zheng, Nanqing Dong, Tianfan Fu, Dongzhan Zhou, Yan Lu, Wenlong Zhang, Jin Ye, Jianfei Cai, Wanli Ouyang, Yu Qiao, Zongyuan Ge, Shixiang Tang, Junjun He:
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers. CoRR abs/2508.21148 (2025)
[i37]Ziyu Liu, Yuhang Zang, Shengyuan Ding, Yuhang Cao, Xiaoyi Dong, Haodong Duan, Dahua Lin, Jiaqi Wang:
SPARK: Synergistic Policy And Reward Co-Evolving Framework. CoRR abs/2509.22624 (2025)
[i36]Xiangyu Zhao, Junming Lin, Tianhao Liang, Yifan Zhou, Wenhao Chai, Yuzhe Gu, Weiyun Wang, Kai Chen, Gen Luo, Wenwei Zhang, Junchi Yan, Hua Yang, Haodong Duan, Xue Yang:
MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization. CoRR abs/2510.08540 (2025)
[i35]Xiaozhe Li, Xinyu Fang, Shengyuan Ding, Linyang Li, Haodong Duan, Qingwen Liu, Kai Chen:
NP-Engine: Empowering Optimization Reasoning in Large Language Models with Verifiable Synthetic NP Problems. CoRR abs/2510.16476 (2025)
[i34]Yuhong Liu, Beichen Zhang, Yuhang Zang, Yuhang Cao, Long Xing, Xiaoyi Dong, Haodong Duan, Dahua Lin, Jiaqi Wang:
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning. CoRR abs/2510.27606 (2025)- 2024
[c22]Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen:
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark. ACL (Findings) 2024: 6884-6915
[c21]Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang
, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin:
MMBench: Is Your Multi-modal Model an All-Around Player? ECCV (6) 2024: 216-233
[c20]Jingming Zhuo, Songyang Zhang, Xinyu Fang, Haodong Duan, Dahua Lin, Kai Chen:
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs. EMNLP (Findings) 2024: 1950-1976
[c19]Haodong Duan
, Junming Yang
, Yuxuan Qiao
, Xinyu Fang
, Lin Chen
, Yuan Liu
, Xiaoyi Dong
, Yuhang Zang
, Pan Zhang
, Jiaqi Wang
, Dahua Lin
, Kai Chen
:
VLMEvalKit: An Open-Source ToolKit for Evaluating Large Multi-Modality Models. ACM Multimedia 2024: 11198-11201
[c18]Haodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, Songyang Zhang, Dahua Lin, Kai Chen:
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues. NAACL-HLT (Findings) 2024: 3184-3200
[c17]Chonghua Wang, Haodong Duan, Songyang Zhang, Dahua Lin, Kai Chen:
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks. NAACL-HLT 2024: 3712-3724
[c16]Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Lin Bin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang:
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions. NeurIPS 2024
[c15]Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang, Yu Qiao, Dahua Lin, Feng Zhao:
Are We on the Right Way for Evaluating Large Vision-Language Models? NeurIPS 2024
[c14]Pengcheng Chen, Jin Ye, Guoan Wang, Yanjun Li, Zhongying Deng, Wei Li, Tianbin Li, Haodong Duan, Ziyan Huang, Yanzhou Su, Benyou Wang, Shaoting Zhang, Bin Fu, Jianfei Cai, Bohan Zhuang, Eric J. Seibel, Junjun He, Yu Qiao:
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI. NeurIPS 2024
[c13]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. NeurIPS 2024
[c12]Xinyu Fang, Kangrui Mao, Haodong Duan, Xiangyu Zhao, Yining Li, Dahua Lin, Kai Chen:
MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding. NeurIPS 2024
[c11]Yuxuan Qiao, Haodong Duan, Xinyu Fang, Junming Yang, Lin Chen, Songyang Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs. NeurIPS 2024
[i33]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang
, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model. CoRR abs/2401.16420 (2024)
[i32]Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao
, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, Fukai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang
, Jiaqi Wang, Jiayu Wang, Rui Wang
, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng
, Fan Wu, Yingtong Xiong, Xiaomeng Zhao
, et al.:
InternLM2 Technical Report. CoRR abs/2403.17297 (2024)
[i31]Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang
, Yu Qiao, Dahua Lin, Feng Zhao:
Are We on the Right Way for Evaluating Large Vision-Language Models? CoRR abs/2403.20330 (2024)
[i30]Chonghua Wang, Haodong Duan, Songyang Zhang
, Dahua Lin, Kai Chen:
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks. CoRR abs/2404.06480 (2024)
[i29]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang
, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang
:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024)
[i28]Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang
, Dahua Lin, Kai Chen:
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark. CoRR abs/2405.12209 (2024)
[i27]Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Bin Lin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang
:
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions. CoRR abs/2406.04325 (2024)
[i26]Xinyu Fang, Kangrui Mao, Haodong Duan, Xiangyu Zhao, Yining Li, Dahua Lin, Kai Chen:
MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding. CoRR abs/2406.14515 (2024)
[i25]Yuxuan Qiao, Haodong Duan, Xinyu Fang, Junming Yang, Lin Chen, Songyang Zhang
, Jiaqi Wang
, Dahua Lin, Kai Chen:
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs. CoRR abs/2406.14544 (2024)
[i24]Xiangyu Zhao, Xiangtai Li, Haodong Duan, Haian Huang, Yining Li, Kai Chen, Hua Yang:
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning. CoRR abs/2406.17770 (2024)
[i23]Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang
, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output. CoRR abs/2407.03320 (2024)
[i22]Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models. CoRR abs/2407.11691 (2024)
[i21]Pengcheng Chen, Jin Ye, Guoan Wang, Yanjun Li, Zhongying Deng, Wei Li, Tianbin Li, Haodong Duan, Ziyan Huang, Yanzhou Su, Benyou Wang, Shaoting Zhang, Bin Fu, Jianfei Cai, Bohan Zhuang, Eric J. Seibel, Junjun He, Yu Qiao:
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI. CoRR abs/2408.03361 (2024)
[i20]Jingming Zhuo, Songyang Zhang, Xinyu Fang, Haodong Duan, Dahua Lin, Kai Chen:
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs. CoRR abs/2410.12405 (2024)
[i19]Maosong Cao, Alexander Lam, Haodong Duan, Hongwei Liu, Songyang Zhang, Kai Chen:
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution. CoRR abs/2410.16256 (2024)
[i18]Ziyu Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Conghui He, Yuanjun Xiong, Dahua Lin, Jiaqi Wang
:
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models. CoRR abs/2410.17637 (2024)
[i17]Chaoyou Fu, Yifan Zhang, Shukang Yin, Bo Li, Xinyu Fang, Sirui Zhao, Haodong Duan, Xing Sun
, Ziwei Liu, Liang Wang, Caifeng Shan, Ran He:
MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs. CoRR abs/2411.15296 (2024)
[i16]Pan Zhang, Xiaoyi Dong, Yuhang Cao, Yuhang Zang, Rui Qian, Xilin Wei, Lin Chen, Yifei Li, Junbo Niu, Shuangrui Ding, Qipeng Guo, Haodong Duan, Xin Chen, Han Lv, Zheng Nie, Min Zhang, Bin Wang, Wenwei Zhang, Xinyue Zhang, Jiaye Ge, Wei Li, Jingwen Li, Zhongying Tu, Conghui He, Xingcheng Zhang, Kai Chen, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions. CoRR abs/2412.09596 (2024)- 2023
[c10]Yujie Zhou, Haodong Duan, Anyi Rao
, Bing Su, Jiaqi Wang
:
Self-Supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences. AAAI 2023: 3825-3833
[c9]Haodong Duan, Mingze Xu, Bing Shuai, Davide Modolo, Zhuowen Tu, Joseph Tighe, Alessandro Bergamo:
SkeleTR: Towards Skeleton-based Action Recognition in the Wild. ICCV 2023: 13588-13598
[c8]Keqiang Sun, Junting Pan, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Limin Wang, Hongsheng Li:
JourneyDB: A Benchmark for Generative Image Understanding. NeurIPS 2023
[i15]Yujie Zhou, Haodong Duan, Anyi Rao, Bing Su, Jiaqi Wang
:
Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences. CoRR abs/2302.09018 (2023)
[i14]Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Limin Wang, Hongsheng Li:
JourneyDB: A Benchmark for Generative Image Understanding. CoRR abs/2307.00716 (2023)
[i13]Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang
, Wangbo Zhao, Yike Yuan, Jiaqi Wang
, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin:
MMBench: Is Your Multi-modal Model an All-around Player? CoRR abs/2307.06281 (2023)
[i12]Haodong Duan, Mingze Xu, Bing Shuai, Davide Modolo, Zhuowen Tu, Joseph Tighe, Alessandro Bergamo:
SkeleTR: Towrads Skeleton-based Action Recognition in the Wild. CoRR abs/2309.11445 (2023)
[i11]Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Shuangrui Ding, Songyang Zhang
, Haodong Duan, Wenwei Zhang, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition. CoRR abs/2309.15112 (2023)
[i10]Haodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, Songyang Zhang, Dahua Lin, Kai Chen:
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues. CoRR abs/2310.13650 (2023)- 2022
[c7]Haodong Duan, Yue Zhao, Kai Chen, Dahua Lin, Bo Dai:
Revisiting Skeleton-based Action Recognition. CVPR 2022: 2959-2968
[c6]Haodong Duan, Nanxuan Zhao, Kai Chen, Dahua Lin:
TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition. CVPR 2022: 2990-3000
[c5]Jintao Lin, Haodong Duan, Kai Chen, Dahua Lin, Limin Wang:
OCSampler: Compressing Videos to One Clip with Single-step Sampling. CVPR 2022: 13884-13893
[c4]Haodong Duan, Yue Zhao, Kai Chen, Yuanjun Xiong, Dahua Lin:
Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks. ECCV Workshops (4) 2022: 557-575
[c3]Haodong Duan, Jiaqi Wang
, Kai Chen, Dahua Lin:
PYSKL: Towards Good Practices for Skeleton Action Recognition. ACM Multimedia 2022: 7351-7354
[i9]Jintao Lin, Haodong Duan, Kai Chen, Dahua Lin, Limin Wang:
OCSampler: Compressing Videos to One Clip with Single-step Sampling. CoRR abs/2201.04388 (2022)
[i8]Haodong Duan, Nanxuan Zhao, Kai Chen, Dahua Lin:
TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition. CoRR abs/2205.02028 (2022)
[i7]Haodong Duan, Jiaqi Wang
, Kai Chen, Dahua Lin:
PYSKL: Towards Good Practices for Skeleton Action Recognition. CoRR abs/2205.09443 (2022)
[i6]Haodong Duan, Yue Zhao, Kai Chen, Yuanjun Xiong, Dahua Lin:
Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks. CoRR abs/2209.09393 (2022)
[i5]Haodong Duan, Jiaqi Wang, Kai Chen, Dahua Lin:
DG-STGCN: Dynamic Spatial-Temporal Modeling for Skeleton-based Action Recognition. CoRR abs/2210.05895 (2022)- 2021
[i4]Haodong Duan
, Yue Zhao, Kai Chen, Dian Shao, Dahua Lin, Bo Dai:
Revisiting Skeleton-based Action Recognition. CoRR abs/2104.13586 (2021)- 2020
[c2]Haodong Duan, Yue Zhao, Yuanjun Xiong, Wentao Liu, Dahua Lin:
Omni-Sourced Webly-Supervised Learning for Video Recognition. ECCV (15) 2020: 670-688
[i3]Haodong Duan
, Yue Zhao, Yuanjun Xiong, Wentao Liu, Dahua Lin:
Omni-sourced Webly-supervised Learning for Video Recognition. CoRR abs/2003.13042 (2020)
2010 – 2019
- 2019
[c1]Haodong Duan, Kwan-Yee Lin, Sheng Jin
, Wentao Liu
, Chen Qian, Wanli Ouyang
:
TRB: A Novel Triplet Representation for Understanding 2D Human Body. ICCV 2019: 9478-9487
[i2]Haodong Duan
, Kwan-Yee Lin, Sheng Jin
, Wentao Liu, Chen Qian, Wanli Ouyang:
TRB: A Novel Triplet Representation for Understanding 2D Human Body. CoRR abs/1910.11535 (2019)- 2017
[i1]Bingzhe Wu, Haodong Duan, Zhichao Liu, Guangyu Sun:
SRPGAN: Perceptual Generative Adversarial Network for Single Image Super Resolution. CoRR abs/1712.05927 (2017)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-12-19 01:32 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







