


default search action
Songyang Zhang 0001
Person information
- affiliation: ShanghaiTech University, Shanghai, China
Other persons with the same name
- Songyang Zhang — disambiguation page
- Songyang Zhang 0002
— University of Louisiana at Lafayette, Lafayette, LA, USA
- Songyang Zhang 0003
— Northeastern University, Liaoning, China
- Songyang Zhang 0004
— University of Rochester, Rochester, NY, USA
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [c38]Baichuan Zhou, Haote Yang
, Dairong Chen, Junyan Ye, Tianyi Bai, Jinhua Yu, Songyang Zhang, Dahua Lin, Conghui He, Weijia Li:
UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios. AAAI 2025: 10707-10715 - [c37]Haote Yang, Xingjian Wei, Jiang Wu, Noémi Ligeti-Nagy, Jiaxing Sun, Yinfan Wang, Zijian Gyozo Yang, Junyuan Gao, Jingchao Wang, Bowen Jiang, Shasha Wang, Nanjun Yu, Zihao Zhang, Shixin Hong, Hongwei Liu, Wei Li, Songyang Zhang, Dahua Lin, Lijun Wu, Gábor Prószéky, Conghui He:
OpenHuEval: Evaluating Large Language Model on Hungarian Specifics. ACL (Findings) 2025: 7464-7520 - [c36]Maosongcao Maosongcao, Taolin Zhang, Mo Li, Chuyu Zhang, Yunxin Liu, Conghui He, Haodong Duan, Songyang Zhang, Kai Chen:
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement. ACL (1) 2025: 22392-22412 - [c35]Qiming Ge, Shuhao Xing, Songyang Gao, Yunhua Zhou, Yicheng Zou, Songyang Zhang, Zhi Chen, Hang Yan, Qi Zhang, Qipeng Guo, Kai Chen:
Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law. ACL (1) 2025: 23746-23761 - [c34]Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Jidong Ge, Vincent Ng:
InternLM-Law: An Open-Sourced Chinese Legal Large Language Model. COLING 2025: 9376-9392 - [i65]Maosong Cao, Taolin Zhang, Mo Li
, Chuyu Zhang, Yunxin Liu, Haodong Duan, Songyang Zhang, Kai Chen:
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement. CoRR abs/2501.12273 (2025) - [i64]Jiahao Wang, Ning Kang, Lewei Yao, Mengzhao Chen, Chengyue Wu, Songyang Zhang, Shuchen Xue, Yong Liu, Taiqiang Wu, Xihui Liu, Kaipeng Zhang, Shifeng Zhang, Wenqi Shao, Zhenguo Li, Ping Luo:
LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation. CoRR abs/2501.12976 (2025) - [i63]Chengqi Lyu, Songyang Gao, Yuzhe Gu, Wenwei Zhang, Jianfei Gao, Kuikun Liu, Ziyi Wang, Shuaibin Li, Qian Zhao, Haian Huang, Weihan Cao, Jiangning Liu, Hongwei Liu, Junnan Liu, Songyang Zhang, Dahua Lin, Kai Chen:
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning. CoRR abs/2502.06781 (2025) - [i62]Junyuan Gao, Jiahe Song, Jiang Wu, Runchuan Zhu, Guanlin Shen, Shasha Wang, Xingjian Wei, Haote Yang, Songyang Zhang, Weijia Li, Bin Wang, Dahua Lin, Lijun Wu, Conghui He:
PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language Model. CoRR abs/2503.18484 (2025) - [i61]Haote Yang
, Xingjian Wei, Jiang Wu, Noémi Ligeti-Nagy, Jiaxing Sun, Yinfan Wang, Zijian Gyozo Yang, Junyuan Gao, Jingchao Wang, Bowen Jiang, Shasha Wang, Nanjun Yu, Zihao Zhang, Shixin Hong, Hongwei Liu, Wei Li, Songyang Zhang, Dahua Lin, Lijun Wu, Gábor Prószéky, Conghui He:
OpenHuEval: Evaluating Large Language Model on Hungarian Specifics. CoRR abs/2503.21500 (2025) - [i60]Bowen Jiang, Runchuan Zhu, Jiang Wu, Zinco Jiang, Yifan He, Junyuan Gao, Jia Yu, Rui Min, Yinfan Wang, Haote Yang, Songyang Zhang, Dahua Lin, Lijun Wu, Conghui He:
Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering. CoRR abs/2505.16591 (2025) - [i59]Junnan Liu, Hongwei Liu, Linchen Xiao, Shudong Liu, Taolin Zhang, Zihan Ma, Songyang Zhang, Kai Chen:
Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective. CoRR abs/2505.19815 (2025) - [i58]Qiming Ge, Shuhao Xing, Songyang Gao, Yunhua Zhou, Yicheng Zou, Songyang Zhang, Zhi Chen, Hang Yan, Qi Zhang, Qipeng Guo, Kai Chen:
Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law. CoRR abs/2506.13216 (2025) - [i57]Taolin Zhang, Zihan Ma, Maosong Cao, Junnan Liu, Songyang Zhang, Kai Chen:
Coding Triangle: How Does Large Language Model Understand Code? CoRR abs/2507.06138 (2025) - [i56]Zihan Ma, Taolin Zhang, Maosong Cao, Junnan Liu, Wenwei Zhang, Minnan Luo, Songyang Zhang, Kai Chen:
Rethinking Verification for LLM Code Generation: From Generation to Testing. CoRR abs/2507.06920 (2025) - [i55]Taolin Zhang, Maosong Cao, Alexander Lam, Songyang Zhang, Kai Chen:
CompassJudger-2: Towards Generalist Judge Model via Verifiable Rewards. CoRR abs/2507.09104 (2025) - [i54]Mingqi Wu, Zhihao Zhang, Qiaole Dong, Zhiheng Xi, Jun Zhao, Senjie Jin, Xiaoran Fan, Yuhao Zhou, Yanwei Fu, Qin Liu, Songyang Zhang, Qi Zhang:
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination. CoRR abs/2507.10532 (2025) - [i53]Shudong Liu, Hongwei Liu, Junnan Liu, Linchen Xiao, Songyang Gao, Chengqi Lyu, Yuzhe Gu, Wenwei Zhang, Derek F. Wong, Songyang Zhang, Kai Chen:
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward. CoRR abs/2508.03686 (2025) - [i52]Yufeng Zhao, Junnan Liu, Hongwei Liu, Dongsheng Zhu, Yuan Shen, Songyang Zhang, Kai Chen:
Dissecting Tool-Integrated Reasoning: An Empirical Study and Analysis. CoRR abs/2508.15754 (2025) - [i51]Weiyun Wang, Zhangwei Gao, Lixin Gu, Hengjun Pu, Long Cui, Xingguang Wei, Zhaoyang Liu, Linglin Jing, Shenglong Ye, Jie Shao, Zhaokai Wang, Zhe Chen, Hongjie Zhang, Ganlin Yang, Haomin Wang, Qi Wei, Jinhui Yin, Wenhao Li, Erfei Cui, Guanzhou Chen, Zichen Ding, Changyao Tian, Zhenyu Wu, JingJing Xie, Zehao Li, Bowen Yang, Yuchen Duan, Xuehui Wang, Zhi Hou, Haoran Hao, Tianyi Zhang, Songze Li, Xiangyu Zhao, Haodong Duan, Nianchen Deng, Bin Fu, Yinan He, Yi Wang, Conghui He, Botian Shi, Junjun He, Yingtong Xiong, Han Lv, Lijun Wu, Wenqi Shao, Kaipeng Zhang, Huipeng Deng, Biqing Qi, Jiaye Ge, Qipeng Guo, Wenwei Zhang, Songyang Zhang, Maosong Cao, Junyao Lin, Kexian Tang, Jianfei Gao, Haian Huang, Yuzhe Gu, Chengqi Lyu, Huanze Tang, Rui Wang, Haijun Lv, Wanli Ouyang, Limin Wang, Min Dou, Xizhou Zhu, Tong Lu, Dahua Lin, Jifeng Dai, Weijie Su, Bowen Zhou, Kai Chen, Yu Qiao, Wenhai Wang, Gen Luo:
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency. CoRR abs/2508.18265 (2025) - 2024
- [j2]Rongjie Li
, Songyang Zhang
, Xuming He
:
SGTR+: End-to-End Scene Graph Generation With Transformer. IEEE Trans. Pattern Anal. Mach. Intell. 46(4): 2191-2205 (2024) - [j1]Yuan Liu, Songyang Zhang, Jiacheng Chen, Kai Chen, Dahua Lin:
PixMIM: Rethinking Pixel Reconstruction in Masked Image Modeling. Trans. Mach. Learn. Res. 2024 (2024) - [c33]Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen:
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark. ACL (Findings) 2024: 6884-6915 - [c32]Xi Chen, Songyang Zhang, Qibing Bai, Kai Chen, Satoshi Nakamura:
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models. ACL (Findings) 2024: 6976-6987 - [c31]Zehui Chen, Weihua Du
, Wenwei Zhang, Kuikun Liu, Jiangning Liu, Miao Zheng, Jingming Zhuo, Songyang Zhang, Dahua Lin, Kai Chen, Feng Zhao:
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step. ACL (1) 2024: 9510-9529 - [c30]Jiaxing Sun, Weiquan Huang, Jiang Wu, Chenya Gu, Wei Li, Songyang Zhang, Hang Yan, Conghui He:
Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations. ACL (1) 2024: 11205-11228 - [c29]Rongjie Li, Songyang Zhang
, Dahua Lin, Kai Chen, Xuming He:
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models. CVPR 2024: 28076-28086 - [c28]Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin:
MMBench: Is Your Multi-modal Model an All-Around Player? ECCV (6) 2024: 216-233 - [c27]Jingming Zhuo, Songyang Zhang, Xinyu Fang, Haodong Duan, Dahua Lin, Kai Chen:
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs. EMNLP (Findings) 2024: 1950-1976 - [c26]Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Alan Huang, Songyang Zhang, Kai Chen, Zhixin Yin, Zongwen Shen, Jidong Ge, Vincent Ng:
LawBench: Benchmarking Legal Knowledge of Large Language Models. EMNLP 2024: 7933-7962 - [c25]Haodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, Songyang Zhang, Dahua Lin, Kai Chen:
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues. NAACL-HLT (Findings) 2024: 3184-3200 - [c24]Chonghua Wang, Haodong Duan, Songyang Zhang, Dahua Lin, Kai Chen:
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks. NAACL-HLT 2024: 3712-3724 - [c23]Yixu Wang, Yan Teng, Kexin Huang, Chengqi Lyu, Songyang Zhang, Wenwei Zhang, Xingjun Ma, Yu-Gang Jiang, Yu Qiao, Yingchun Wang:
Fake Alignment: Are LLMs Really Aligned Well? NAACL-HLT 2024: 4696-4712 - [c22]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. NeurIPS 2024 - [c21]Yuxuan Qiao, Haodong Duan, Xinyu Fang, Junming Yang, Lin Chen, Songyang Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs. NeurIPS 2024 - [c20]Jize Wang, Zerun Ma, Yining Li, Songyang Zhang, Cailian Chen, Kai Chen, Xinyi Le:
GTA: A Benchmark for General Tool Agents. NeurIPS 2024 - [i50]Huanjun Kong, Songyang Zhang
, Kai Chen:
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance. CoRR abs/2401.08772 (2024) - [i49]Rongjie Li, Songyang Zhang
, Xuming He:
SGTR+: End-to-end Scene Graph Generation with Transformer. CoRR abs/2401.12835 (2024) - [i48]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang
, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model. CoRR abs/2401.16420 (2024) - [i47]Huaiyuan Ying, Shuo Zhang
, Linyang Li, Zhejian Zhou, Yunfan Shao, Zhaoye Fei, Yichuan Ma, Jiawei Hong, Kuikun Liu, Ziyi Wang, Yudong Wang, Zijian Wu, Shuaibin Li, Fengzhe Zhou, Hongwei Liu, Songyang Zhang
, Wenwei Zhang, Hang Yan, Xipeng Qiu, Jiayu Wang, Kai Chen, Dahua Lin:
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning. CoRR abs/2402.06332 (2024) - [i46]Jiaxing Sun, Weiquan Huang, Jiang Wu, Chenya Gu, Wei Li, Songyang Zhang
, Hang Yan, Conghui He:
Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations. CoRR abs/2403.14112 (2024) - [i45]Rongjie Li, Songyang Zhang
, Dahua Lin, Kai Chen, Xuming He:
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models. CoRR abs/2404.00906 (2024) - [i44]Chonghua Wang, Haodong Duan, Songyang Zhang
, Dahua Lin, Kai Chen:
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks. CoRR abs/2404.06480 (2024) - [i43]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang
, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024) - [i42]Jiahao Wang, Wenqi Shao, Mengzhao Chen, Chengyue Wu, Yong Liu, Kaipeng Zhang, Songyang Zhang
, Kai Chen, Ping Luo:
Adapting LLaMA Decoder to Vision Transformer. CoRR abs/2404.06773 (2024) - [i41]Wei Li, Ren Ma, Jiang Wu, Chenya Gu, Jiahui Peng, Jinyang Len, Songyang Zhang
, Hang Yan, Dahua Lin, Conghui He:
FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models. CoRR abs/2404.18359 (2024) - [i40]Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang
, Dahua Lin, Kai Chen:
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark. CoRR abs/2405.12209 (2024) - [i39]Yuxuan Qiao, Haodong Duan, Xinyu Fang, Junming Yang, Lin Chen, Songyang Zhang
, Jiaqi Wang, Dahua Lin, Kai Chen:
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs. CoRR abs/2406.14544 (2024) - [i38]Zhiwei Fei, Songyang Zhang
, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Maosong Cao, Fengzhe Zhou, Yining Li, Wenwei Zhang, Dahua Lin, Kai Chen, Jidong Ge:
InternLM-Law: An Open Source Chinese Legal Large Language Model. CoRR abs/2406.14887 (2024) - [i37]Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang
, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output. CoRR abs/2407.03320 (2024) - [i36]Jize Wang, Zerun Ma, Yining Li, Songyang Zhang
, Cailian Chen, Kai Chen, Xinyi Le:
GTA: A Benchmark for General Tool Agents. CoRR abs/2407.08713 (2024) - [i35]Songyang Zhang
, Chuyu Zhang, Yingfan Hu, Haowen Shen, Kuikun Liu, Zerun Ma, Fengzhe Zhou, Wenwei Zhang, Xuming He, Dahua Lin, Kai Chen:
CIBench: Evaluating Your LLMs with a Code Interpreter Plugin. CoRR abs/2407.10499 (2024) - [i34]Mo Li, Songyang Zhang
, Yunxin Liu, Kai Chen:
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? CoRR abs/2407.11963 (2024) - [i33]Xi Chen, Songyang Zhang
, Qibing Bai, Kai Chen, Satoshi Nakamura:
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models. CoRR abs/2407.15415 (2024) - [i32]Baichuan Zhou, Haote Yang, Dairong Chen, Junyan Ye, Tianyi Bai, Jinhua Yu, Songyang Zhang
, Dahua Lin, Conghui He, Weijia Li:
UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios. CoRR abs/2408.17267 (2024) - [i31]Haoran Que, Feiyu Duan, Liqun He, Yutao Mou, Wangchunshu Zhou, Jiaheng Liu, Wenge Rong, Zekun Moore Wang, Jian Yang, Ge Zhang, Junran Peng, Zhaoxiang Zhang, Songyang Zhang
, Kai Chen:
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models. CoRR abs/2409.16191 (2024) - [i30]Jingming Zhuo, Songyang Zhang, Xinyu Fang, Haodong Duan, Dahua Lin, Kai Chen:
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs. CoRR abs/2410.12405 (2024) - [i29]Maosong Cao, Alexander Lam, Haodong Duan, Hongwei Liu, Songyang Zhang, Kai Chen:
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution. CoRR abs/2410.16256 (2024) - [i28]Junnan Liu, Hongwei Liu, Linchen Xiao, Ziyi Wang, Kuikun Liu, Songyang Gao, Wenwei Zhang, Songyang Zhang, Kai Chen:
Are Your LLMs Capable of Stable Reasoning? CoRR abs/2412.13147 (2024) - 2023
- [c19]Jiahao Wang, Songyang Zhang
, Yong Liu, Taiqiang Wu, Yujiu Yang
, Xihui Liu, Kai Chen, Ping Luo, Dahua Lin:
RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer. CVPR 2023: 14443-14452 - [c18]Yuan Liu, Songyang Zhang
, Jiacheng Chen, Zhaohui Yu, Kai Chen, Dahua Lin:
Improving Pixel-based MIM by Reducing Wasted Modeling Capability. ICCV 2023: 5338-5349 - [c17]Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen:
TG-VQA: Ternary Game of Video Question Answering. IJCAI 2023: 1044-1052 - [i27]Lin Song, Songyang Zhang
, Songtao Liu, Zeming Li, Xuming He, Hongbin Sun, Jian Sun, Nanning Zheng:
Dynamic Grained Encoder for Vision Transformers. CoRR abs/2301.03831 (2023) - [i26]Zhichao Liu, Leshan Wang, Desen Zhou, Jian Wang, Songyang Zhang
, Yang Bai, Errui Ding, Rui Fan:
Temporal Segment Transformer for Action Segmentation. CoRR abs/2302.13074 (2023) - [i25]Yuan Liu, Songyang Zhang
, Jiacheng Chen, Kai Chen, Dahua Lin:
PixMIM: Rethinking Pixel Reconstruction in Masked Image Modeling. CoRR abs/2303.02416 (2023) - [i24]Jiahao Wang, Songyang Zhang
, Yong Liu, Taiqiang Wu, Yujiu Yang, Xihui Liu, Kai Chen, Ping Luo, Dahua Lin:
RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer. CoRR abs/2304.05659 (2023) - [i23]Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen:
TG-VQA: Ternary Game of Video Question Answering. CoRR abs/2305.10049 (2023) - [i22]Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang
, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin:
MMBench: Is Your Multi-modal Model an All-around Player? CoRR abs/2307.06281 (2023) - [i21]Yuan Liu, Songyang Zhang
, Jiacheng Chen, Zhaohui Yu, Kai Chen, Dahua Lin:
Improving Pixel-based MIM by Reducing Wasted Modeling Capability. CoRR abs/2308.00261 (2023) - [i20]Wangbo Zhao, Kepan Nan, Songyang Zhang
, Kai Chen, Dahua Lin, Yang You:
Learning Referring Video Object Segmentation from Weak Annotation. CoRR abs/2308.02162 (2023) - [i19]Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Shuangrui Ding, Songyang Zhang
, Haodong Duan, Wenwei Zhang, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition. CoRR abs/2309.15112 (2023) - [i18]Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Songyang Zhang
, Kai Chen, Zongwen Shen, Jidong Ge:
LawBench: Benchmarking Legal Knowledge of Large Language Models. CoRR abs/2309.16289 (2023) - [i17]Haodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, Songyang Zhang, Dahua Lin, Kai Chen:
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues. CoRR abs/2310.13650 (2023) - [i16]Yixu Wang, Yan Teng
, Kexin Huang, Chengqi Lyu, Songyang Zhang, Wenwei Zhang, Xingjun Ma, Yu-Gang Jiang, Yu Qiao, Yingchun Wang:
Fake Alignment: Are LLMs Really Aligned Well? CoRR abs/2311.05915 (2023) - [i15]Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, Miao Zheng, Jingming Zhuo, Songyang Zhang, Dahua Lin, Kai Chen, Feng Zhao:
T-Eval: Evaluating the Tool Utilization Capability Step by Step. CoRR abs/2312.14033 (2023) - 2022
- [c16]Rongjie Li, Songyang Zhang
, Xuming He:
SGTR: End-to-end Scene Graph Generation with Transformer. CVPR 2022: 19464-19474 - [c15]Shuaiyi Huang, Luyu Yang, Bo He, Songyang Zhang
, Xuming He, Abhinav Shrivastava:
Learning Semantic Correspondence with Sparse Annotations. ECCV (14) 2022: 267-284 - [c14]Yang Bai
, Desen Zhou, Songyang Zhang
, Jian Wang, Errui Ding, Yu Guan, Yang Long, Jingdong Wang:
Action Quality Assessment with Temporal Parsing Transformer. ECCV (4) 2022: 422-438 - [c13]Qiuyue Wang, Songyang Zhang
, Xuming He:
Robust Temporally-Coherent Strategy for Few-shot Video Instance Segmentation. ICIP 2022: 251-255 - [i14]Shipeng Yan, Songyang Zhang, Xuming He:
Budget-aware Few-shot Learning via Graph Convolutional Network. CoRR abs/2201.02304 (2022) - [i13]Yang Bai, Desen Zhou, Songyang Zhang
, Jian Wang, Errui Ding, Yu Guan, Yang Long, Jingdong Wang:
Action Quality Assessment with Temporal Parsing Transformer. CoRR abs/2207.09270 (2022) - [i12]Shuaiyi Huang, Luyu Yang, Bo He, Songyang Zhang
, Xuming He, Abhinav Shrivastava:
Learning Semantic Correspondence with Sparse Annotations. CoRR abs/2208.06974 (2022) - 2021
- [c12]Songyang Zhang
, Zeming Li, Shipeng Yan, Xuming He, Jian Sun:
Distribution Alignment: A Unified Framework for Long-Tail Visual Recognition. CVPR 2021: 2361-2370 - [c11]Rongjie Li, Songyang Zhang
, Bo Wan, Xuming He:
Bipartite Graph Network With Adaptive Message Passing for Unbiased Scene Graph Generation. CVPR 2021: 11109-11119 - [c10]Songyang Zhang, Jiale Zhou, Xuming He:
Learning Implicit Temporal Alignment for Few-shot Video Classification. IJCAI 2021: 1309-1315 - [c9]Shipeng Yan, Jiale Zhou, Jiangwei Xie, Songyang Zhang
, Xuming He:
An EM Framework for Online Incremental Learning of Semantic Segmentation. ACM Multimedia 2021: 3052-3060 - [c8]Lin Song, Songyang Zhang, Songtao Liu, Zeming Li, Xuming He, Hongbin Sun, Jian Sun, Nanning Zheng:
Dynamic Grained Encoder for Vision Transformers. NeurIPS 2021: 5770-5783 - [i11]Songyang Zhang, Zeming Li, Shipeng Yan, Xuming He, Jian Sun:
Distribution Alignment: A Unified Framework for Long-tail Visual Recognition. CoRR abs/2103.16370 (2021) - [i10]Rongjie Li, Songyang Zhang, Bo Wan, Xuming He:
Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation. CoRR abs/2104.00308 (2021) - [i9]Songyang Zhang, Jiale Zhou, Xuming He:
Learning Implicit Temporal Alignment for Few-shot Video Classification. CoRR abs/2105.04823 (2021) - [i8]Shipeng Yan, Jiale Zhou, Jiangwei Xie, Songyang Zhang, Xuming He:
An EM Framework for Online Incremental Learning of Semantic Segmentation. CoRR abs/2108.03613 (2021) - [i7]Songyang Zhang, Lin Song, Songtao Liu, Zheng Ge, Zeming Li, Xuming He, Jian Sun:
Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception Challenge. CoRR abs/2108.04230 (2021) - [i6]Rongjie Li, Songyang Zhang, Xuming He:
SGTR: End-to-end Scene Graph Generation with Transformer. CoRR abs/2112.12970 (2021) - 2020
- [c7]Yongfei Liu, Xiangyi Zhang, Songyang Zhang
, Xuming He:
Part-Aware Prototype Network for Few-Shot Semantic Segmentation. ECCV (9) 2020: 142-158 - [c6]Xi Chen, Songyang Zhang
, Dandan Song, Peng Ouyang, Shouyi Yin:
Transformer with Bidirectional Decoder for Speech Recognition. INTERSPEECH 2020: 1773-1777 - [i5]Yongfei Liu, Xiangyi Zhang, Songyang Zhang, Xuming He:
Part-aware Prototype Network for Few-shot Semantic Segmentation. CoRR abs/2007.06309 (2020) - [i4]Xi Chen, Songyang Zhang, Dandan Song, Peng Ouyang, Shouyi Yin:
Transformer with Bidirectional Decoder for Speech Recognition. CoRR abs/2008.04481 (2020)
2010 – 2019
- 2019
- [c5]Shipeng Yan, Songyang Zhang, Xuming He:
A Dual Attention Network with Semantic Embedding for Few-Shot Learning. AAAI 2019: 9079-9086 - [c4]Shipeng Yan, Songyang Zhang, Xuming He:
A Dual Attention Network with Semantic Embedding for Few-shot Learning. CVPR Workshops 2019 - [c3]Shuaiyi Huang, Qiuyue Wang, Songyang Zhang
, Shipeng Yan, Xuming He:
Dynamic Context Correspondence Network for Semantic Alignment. ICCV 2019: 2010-2019 - [c2]Songyang Zhang, Xuming He, Shipeng Yan:
LatentGNN: Learning Efficient Non-local Relations for Visual Recognition. ICML 2019: 7374-7383 - [i3]Songyang Zhang, Shipeng Yan, Xuming He:
LatentGNN: Learning Efficient Non-local Relations for Visual Recognition. CoRR abs/1905.11634 (2019) - [i2]Shuaiyi Huang, Qiuyue Wang, Songyang Zhang, Shipeng Yan, Xuming He:
Dynamic Context Correspondence Network for Semantic Alignment. CoRR abs/1909.03444 (2019) - 2017
- [c1]Yufan Liu
, Songyang Zhang
, Mai Xu, Xuming He:
Predicting Salient Face in Multiple-Face Videos. CVPR 2017: 3224-3232 - [i1]Yuhang Song, Mai Xu, Songyang Zhang, Liangyu Huo:
Generalization Tower Network: A Novel Deep Neural Network Architecture for Multi-Task Learning. CoRR abs/1710.10036 (2017)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-09-27 20:08 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint