default search action

combined dblp search
author search
venue search
publication search

ask others

Songyang Gao

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/DouLZGLXZJYZGZ025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/DouLZGLXZJYZGZ025
Shihan Dou, Yan Liu, Enyu Zhou, Songyang Gao, Tianlong Li, Limao Xiong, Xin Zhao, Haoxiang Jia, Junjie Ye, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning. AAAI 2025: 23805-23813
[c18]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/LiuLXWLGZZC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LiuLXWLGZZC25
Junnan Liu, Hongwei Liu, Linchen Xiao, Ziyi Wang, Kuikun Liu, Songyang Gao, Wenwei Zhang, Songyang Zhang, Kai Chen:
Are Your LLMs Capable of Stable Reasoning? ACL (Findings) 2025: 17594-17632
[c17]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/GeXGZZZCYZG025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/GeXGZZZCYZG025
Qiming Ge, Shuhao Xing, Songyang Gao, Yunhua Zhou, Yicheng Zou, Songyang Zhang, Zhi Chen, Hang Yan, Qi Zhang, Qipeng Guo, Kai Chen:
Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law. ACL (1) 2025: 23746-23761
[c16]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/XiDCHGWGYLHGCZZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/XiDCHGWGYLHGCZZ25
Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo, Junzhe Wang, Xin Guo, Dingwen Yang, Chenyang Liao, Wei He, Songyang Gao, Lu Chen, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
AgentGym: Evaluating and Training Large Language Model-based Agents across Diverse Environments. ACL (1) 2025: 27914-27961
[c15]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/YeLGHWLFDJ0G025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/YeLGHWLFDJ0G025
Junjie Ye, Guanyu Li, Songyang Gao, Caishuang Huang, Yilong Wu, Sixian Li, Xiaoran Fan, Shihan Dou, Tao Ji, Qi Zhang, Tao Gui, Xuanjing Huang:
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios. COLING 2025: 156-187
2024
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/DouZLGSXZWXFPZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/DouZLGSXZWXFPZZ24
Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Wei Shen, Limao Xiong, Yuhao Zhou, Xiao Wang, Zhiheng Xi, Xiaoran Fan, Shiliang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin. ACL (1) 2024: 1932-1945
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/YeLLHGWZG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/YeLLHGWZG024
Junjie Ye, Sixian Li, Guanyu Li, Caishuang Huang, Songyang Gao, Yilong Wu, Qi Zhang, Tao Gui, Xuanjing Huang:
ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages. ACL (1) 2024: 2181-2211
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ShiWGGYGZ0ZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ShiWGGYGZ0ZL24
Chenyu Shi, Xiao Wang, Qiming Ge, Songyang Gao, Xianjun Yang, Tao Gui, Qi Zhang, Xuanjing Huang, Xun Zhao, Dahua Lin:
Navigating the OverKill in Large Language Models. ACL (1) 2024: 4602-4614
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/YeWGHLLFZG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/YeWGHLLFZG024
Junjie Ye, Yilong Wu, Songyang Gao, Caishuang Huang, Sixian Li, Guanyu Li, Xiaoran Fan, Qi Zhang, Tao Gui, Xuanjing Huang:
RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning. EMNLP 2024: 313-333
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/XiaGGX0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/XiaGGX0024
Han Xia, Songyang Gao, Qiming Ge, Zhiheng Xi, Qi Zhang, Xuanjing Huang:
Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data. EMNLP (Findings) 2024: 8178-8188
[c9]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/GaoGSDY0ZZC00L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/GaoGSDY0ZZC00L24
Songyang Gao, Qiming Ge, Wei Shen, Shihan Dou, Junjie Ye, Xiao Wang, Rui Zheng, Yicheng Zou, Zhi Chen, Hang Yan, Qi Zhang, Dahua Lin:
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback. ICML 2024
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/nlpcc/DouGGZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nlpcc/DouGGZ24
Shihan Dou, Songyang Gao, Tao Gui, Qi Zhang:
CausalAPM: Generalizable Literal Disentanglement for NLU Debiasing. NLPCC (1) 2024: 284-297
2023
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/WangZZZGWZGCG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/WangZZZGWZGCG23
Xiao Wang, Weikang Zhou, Qi Zhang, Jie Zhou, Songyang Gao, Junzhe Wang, Menghan Zhang, Xiang Gao, Yunwen Chen, Tao Gui:
Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language Model. ACL (Findings) 2023: 555-568
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/GaoDLWZWMS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/GaoDLWZWMS23
Songyang Gao, Shihan Dou, Yan Liu, Xiao Wang, Qi Zhang, Zhongyu Wei, Jin Ma, Ying Shan:
DSRM: Boost Textual Adversarial Training with Distribution Shift Risk Minimization. ACL (1) 2023: 12177-12189
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/GaoDZHMS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/GaoDZHMS23
Songyang Gao, Shihan Dou, Qi Zhang, Xuanjing Huang, Jin Ma, Ying Shan:
On the Universal Adversarial Perturbations for Efficient Data-free Adversarial Detection. ACL (Findings) 2023: 13573-13581
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZhouZXGFFYGZH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhouZXGFFYGZH23
Enyu Zhou, Rui Zheng, Zhiheng Xi, Songyang Gao, Xiaoran Fan, Zichu Fei, Jingting Ye, Tao Gui, Qi Zhang, Xuanjing Huang:
RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms. EMNLP (Findings) 2023: 10262-10274
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/XiJZZGLGZH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/XiJZZGLGZH23
Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Jia Liu, Tao Gui, Qi Zhang, Xuanjing Huang:
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement. EMNLP (Findings) 2023: 11383-11406
2022
[c2]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/DouZWGSZWH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/DouZWGSZWH22
Shihan Dou, Rui Zheng, Ting Wu, Songyang Gao, Junjie Shan, Qi Zhang, Yueming Wu, Xuanjing Huang:
Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective. COLING 2022: 2278-2287
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/GaoDZH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/GaoDZH22
Songyang Gao, Shihan Dou, Qi Zhang, Xuanjing Huang:
Kernel-Whitening: Overcome Dataset Bias with Isotropic Sentence Embedding. EMNLP 2022: 4112-4122

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2025
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-06781
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-06781
Chengqi Lyu, Songyang Gao, Yuzhe Gu, Wenwei Zhang, Jianfei Gao, Kuikun Liu, Ziyi Wang, Shuaibin Li, Qian Zhao, Haian Huang, Weihan Cao, Jiangning Liu, Hongwei Liu, Junnan Liu, Songyang Zhang, Dahua Lin, Kai Chen:
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning. CoRR abs/2502.06781 (2025)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-22655
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-22655
Xiaomin Yu, Pengxiang Ding, Wenjie Zhang, Siteng Huang, Songyang Gao, Chengwei Qin, Kejian Wu, Zhaoxin Fan, Ziyue Qiao, Donglin Wang:
Unicorn: Text-Only Data Synthesis for Vision Language Model Training. CoRR abs/2503.22655 (2025)
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-12328
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-12328
Jialun Zhong, Wei Shen, Yanzeng Li, Songyang Gao, Hua Lu, Yicheng Chen, Yang Zhang, Wei Zhou, Jinjie Gu, Lei Zou:
A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future. CoRR abs/2504.12328 (2025)
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-13216
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-13216
Qiming Ge, Shuhao Xing, Songyang Gao, Yunhua Zhou, Yicheng Zou, Songyang Zhang, Zhi Chen, Hang Yan, Qi Zhang, Qipeng Guo, Kai Chen:
Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law. CoRR abs/2506.13216 (2025)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-05197
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-05197
Shihan Dou, Shichun Liu, Yuming Yang, Yicheng Zou, Yunhua Zhou, Shuhao Xing, Chenhao Huang, Qiming Ge, Demin Song, Haijun Lv, Songyang Gao, Chengqi Lv, Enyu Zhou, Honglin Guo, Zhiheng Xi, Wenwei Zhang, Qipeng Guo, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Tao Gui, Kai Chen:
Pre-Trained Policy Discriminators are General Reward Models. CoRR abs/2507.05197 (2025)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-13332
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-13332
Zhouqi Hua, Wenwei Zhang, Chengqi Lyu, Yuzhe Gu, Songyang Gao, Kuikun Liu, Kai Chen:
The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner. CoRR abs/2507.13332 (2025)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-16814
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-16814
Junhao Shen, Haiteng Zhao, Yuzhe Gu, Songyang Gao, Kuikun Liu, Haian Huang, Jianfei Gao, Dahua Lin, Wenwei Zhang, Kai Chen:
Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning. CoRR abs/2507.16814 (2025)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-03686
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-03686
Shudong Liu, Hongwei Liu, Junnan Liu, Linchen Xiao, Songyang Gao, Chengqi Lyu, Yuzhe Gu, Wenwei Zhang, Derek F. Wong, Songyang Zhang, Kai Chen:
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward. CoRR abs/2508.03686 (2025)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-15763
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-15763
Lei Bai, Zhongrui Cai, Yuhang Cao, Maosong Cao, Weihan Cao, Chiyu Chen, Haojiong Chen, Kai Chen, Pengcheng Chen, Ying Chen, Yongkang Chen, Yu Cheng, Pei Chu, Tao Chu, Erfei Cui, Ganqu Cui, Long Cui, Ziyun Cui, Nianchen Deng, Ning Ding, Nanqing Dong, Peijie Dong, Shihan Dou, Sinan Du, Haodong Duan, Caihua Fan, Ben Gao, Changjiang Gao, Jianfei Gao, Songyang Gao, Yang Gao, Zhangwei Gao, Jiaye Ge, Qiming Ge, Lixin Gu, Yuzhe Gu, Aijia Guo, Qipeng Guo, Xu Guo, Conghui He, Junjun He, Yili Hong, Siyuan Hou, Caiyu Hu, Hanglei Hu, Jucheng Hu, Ming Hu, Zhouqi Hua, Haian Huang, Junhao Huang, Xu Huang, Zixian Huang, Zhe Jiang, Lingkai Kong, Linyang Li, Peiji Li, Pengze Li, Shuaibin Li, Tianbin Li, Wei Li, Yuqiang Li, Dahua Lin, Junyao Lin, Tianyi Lin, Zhishan Lin, Hongwei Liu, Jiangning Liu, Jiyao Liu, Junnan Liu, Kai Liu, Kaiwen Liu, Kuikun Liu, Shichun Liu, Shudong Liu, Wei Liu, Xinyao Liu, Yuhong Liu, Zhan Liu, Yinquan Lu, Haijun Lv, Hongxia Lv, Huijie Lv, Qitan Lv, Ying Lv, Chengqi Lyu, Chenglong Ma, Jianpeng Ma, Ren Ma, Runmin Ma, Runyuan Ma, Xinzhu Ma, Yichuan Ma, Zihan Ma, Sixuan Mi, Junzhi Ning, Wenchang Ning, Xinle Pang, Jiahui Peng, Runyu Peng, Yu Qiao:
Intern-S1: A Scientific Multimodal Foundation Model. CoRR abs/2508.15763 (2025)
2024
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-00741
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-00741
Junjie Ye, Guanyu Li, Songyang Gao, Caishuang Huang, Yilong Wu, Sixian Li, Xiaoran Fan, Shihan Dou, Qi Zhang, Tao Gui, Xuanjing Huang:
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios. CoRR abs/2401.00741 (2024)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-06080
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-06080
Binghai Wang, Rui Zheng, Lu Chen, Yan Liu, Shihan Dou, Caishuang Huang, Wei Shen, Senjie Jin, Enyu Zhou, Chenyu Shi, Songyang Gao, Nuo Xu, Yuhao Zhou, Xiaoran Fan, Zhiheng Xi, Jun Zhao, Xiao Wang, Tao Ji, Hang Yan, Lixing Shen, Zhan Chen, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
Secrets of RLHF in Large Language Models Part II: Reward Modeling. CoRR abs/2401.06080 (2024)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-08326
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-08326
Junjie Ye, Yilong Wu, Songyang Gao, Caishuang Huang, Sixian Li, Guanyu Li, Xiaoran Fan, Qi Zhang, Tao Gui, Xuanjing Huang:
RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning. CoRR abs/2401.08326 (2024)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-11458
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-11458
Songyang Gao, Qiming Ge, Wei Shen, Shihan Dou, Junjie Ye, Xiao Wang, Rui Zheng, Yicheng Zou, Zhi Chen, Hang Yan, Qi Zhang, Dahua Lin:
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback. CoRR abs/2401.11458 (2024)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-17633
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-17633
Chenyu Shi, Xiao Wang, Qiming Ge, Songyang Gao, Xianjun Yang, Tao Gui, Qi Zhang, Xuanjing Huang, Xun Zhao, Dahua Lin:
Navigating the OverKill in Large Language Models. CoRR abs/2401.17633 (2024)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-10753
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-10753
Junjie Ye, Sixian Li, Guanyu Li, Caishuang Huang, Songyang Gao, Yilong Wu, Qi Zhang, Tao Gui, Xuanjing Huang:
ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages. CoRR abs/2402.10753 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-12171
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-12171
Weikang Zhou, Xiao Wang, Limao Xiong, Han Xia, Yingshuang Gu, Mingxu Chai, Fukang Zhu, Caishuang Huang, Shihan Dou, Zhiheng Xi, Rui Zheng, Songyang Gao, Yicheng Zou, Hang Yan, Yifan Le, Ruohui Wang, Lijun Li, Jing Shao, Tao Gui, Qi Zhang, Xuanjing Huang:
EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models. CoRR abs/2403.12171 (2024)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-01204
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-01204
Chen Yang, Junzhuo Li, Xinyao Niu, Xinrun Du, Songyang Gao, Haoran Zhang, Zhaoliang Chen, Xingwei Qu, Ruibin Yuan, Yizhi Li, Jiaheng Liu, Stephen W. Huang, Shawn Yue, Wenhu Chen, Jie Fu, Ge Zhang:
The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis. CoRR abs/2404.01204 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-04167
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-04167
Xinrun Du, Zhouliang Yu, Songyang Gao, Ding Pan, Yuyang Cheng, Ziyang Ma, Ruibin Yuan, Xingwei Qu, Jiaheng Liu, Tianyu Zheng, Xinchen Luo, Guorui Zhou, Binhang Yuan, Wenhu Chen, Jie Fu, Ge Zhang:
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model. CoRR abs/2404.04167 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-04151
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-04151
Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo, Junzhe Wang, Dingwen Yang, Chenyang Liao, Xin Guo, Wei He, Songyang Gao, Lu Chen, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments. CoRR abs/2406.04151 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-14874
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-14874
Han Xia, Songyang Gao, Qiming Ge, Zhiheng Xi, Qi Zhang, Xuanjing Huang:
Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data. CoRR abs/2408.14874 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-13147
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-13147
Junnan Liu, Hongwei Liu, Linchen Xiao, Ziyi Wang, Kuikun Liu, Songyang Gao, Wenwei Zhang, Songyang Zhang, Kai Chen:
Are Your LLMs Capable of Stable Reasoning? CoRR abs/2412.13147 (2024)
2023
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-02865
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-02865
Songyang Gao, Shihan Dou, Junjie Shan, Qi Zhang, Xuanjing Huang:
CausalAPM: Generalizable Literal Disentanglement for NLU Debiasing. CoRR abs/2305.02865 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12816
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12816
Xiao Wang, Weikang Zhou, Qi Zhang, Jie Zhou, Songyang Gao, Junzhe Wang, Menghan Zhang, Xiang Gao, Yunwen Chen, Tao Gui:
Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language Model. CoRR abs/2305.12816 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-14497
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-14497
Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Tao Gui, Qi Zhang, Xuanjing Huang:
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement. CoRR abs/2305.14497 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-15164
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-15164
Songyang Gao, Shihan Dou, Yan Liu, Xiao Wang, Qi Zhang, Zhongyu Wei, Jin Ma, Ying Shan:
DSRM: Boost Textual Adversarial Training with Distribution Shift Risk Minimization. CoRR abs/2306.15164 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-15705
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-15705
Songyang Gao, Shihan Dou, Qi Zhang, Xuanjing Huang, Jin Ma, Ying Shan:
On the Universal Adversarial Perturbations for Efficient Data-free Adversarial Detection. CoRR abs/2306.15705 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-04964
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-04964
Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang:
Secrets of RLHF in Large Language Models Part I: PPO. CoRR abs/2307.04964 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-07765
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07765
Sizhou Chen, Songyang Gao, Sen Fang:
Echotune: A Modular Extractor Leveraging the Variable-Length Nature of Speech in ASR Tasks. CoRR abs/2309.07765 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-06762
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-06762
Xiao Wang, Yuansen Zhang, Tianze Chen, Songyang Gao, Senjie Jin, Xianjun Yang, Zhiheng Xi, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xuanjing Huang:
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models. CoRR abs/2310.06762 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-11227
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-11227
Enyu Zhou, Rui Zheng, Zhiheng Xi, Songyang Gao, Xiaoran Fan, Zichu Fei, Jingting Ye, Tao Gui, Qi Zhang, Xuanjing Huang:
RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms. CoRR abs/2310.11227 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-09979
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09979
Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Jun Zhao, Wei Shen, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Xiaoran Fan, Shiliang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment. CoRR abs/2312.09979 (2023)
2022
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-08048
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-08048
Shihan Dou, Rui Zheng, Ting Wu, Songyang Gao, Qi Zhang, Yueming Wu, Xuanjing Huang:
Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective. CoRR abs/2202.08048 (2022)
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-07547
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-07547
Songyang Gao, Shihan Dou, Qi Zhang, Xuanjing Huang:
Kernel-Whitening: Overcome Dataset Bias with Isotropic Sentence Embedding. CoRR abs/2210.07547 (2022)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.