Xiao Wang 0001

Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

◀ ▶ joint publications with Zhiheng Xi

> Home > Persons > Xiao Wang 0001

Publications

2025
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/chinaf/XiCGHDHZWJZZFWXZWJZLYDW25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/chinaf/XiCGHDHZWJZZFWXZWJZLYDW25
Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Qi Zhang, Tao Gui:
The rise and potential of large language model based agents: a survey. Sci. China Inf. Sci. 68(2) (2025)
2024
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/DouZLGSXZWXFPZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/DouZLGSXZWXFPZZ24
Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Wei Shen, Limao Xiong, Yuhao Zhou, Xiao Wang, Zhiheng Xi, Xiaoran Fan, Shiliang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin. ACL (1) 2024: 1932-1945
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/Dou0JZXSHWFXZJZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Dou0JZXSHWFXZJZ24
Shihan Dou, Yan Liu, Haoxiang Jia, Enyu Zhou, Limao Xiong, Junjie Shan, Caishuang Huang, Xiao Wang, Xiaoran Fan, Zhiheng Xi, Yuhao Zhou, Tao Ji, Rui Zheng, Qi Zhang, Tao Gui, Xuanjing Huang:
StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback. ACL (1) 2024: 4571-4585
[c26]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/ZhangWXXGZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/ZhangWXXGZ024
Yuansen Zhang, Xiao Wang, Zhiheng Xi, Han Xia, Tao Gui, Qi Zhang, Xuanjing Huang:
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions. LREC/COLING 2024: 14186-14203
[c24]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/XiCHJZHDLGWGSFZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/XiCHJZHDLGWGSFZ24
Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang:
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning. ICML 2024
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-06080
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-06080
Binghai Wang, Rui Zheng, Lu Chen, Yan Liu, Shihan Dou, Caishuang Huang, Wei Shen, Senjie Jin, Enyu Zhou, Chenyu Shi, Songyang Gao, Nuo Xu, Yuhao Zhou, Xiaoran Fan, Zhiheng Xi, Jun Zhao, Xiao Wang, Tao Ji, Hang Yan, Lixing Shen, Zhan Chen, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
Secrets of RLHF in Large Language Models Part II: Reward Modeling. CoRR abs/2401.06080 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-01391
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-01391
Shihan Dou, Yan Liu, Haoxiang Jia, Limao Xiong, Enyu Zhou, Wei Shen, Junjie Shan, Caishuang Huang, Xiao Wang, Xiaoran Fan, Zhiheng Xi, Yuhao Zhou, Tao Ji, Rui Zheng, Qi Zhang, Xuanjing Huang, Tao Gui:
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback. CoRR abs/2402.01391 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-05808
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-05808
Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang:
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning. CoRR abs/2402.05808 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-16431
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-16431
Yuansen Zhang, Xiao Wang, Zhiheng Xi, Han Xia, Tao Gui, Qi Zhang, Xuanjing Huang:
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions. CoRR abs/2402.16431 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-12171
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-12171
Weikang Zhou, Xiao Wang, Limao Xiong, Han Xia, Yingshuang Gu, Mingxu Chai, Fukang Zhu, Caishuang Huang, Shihan Dou, Zhiheng Xi, Rui Zheng, Songyang Gao, Yicheng Zou, Hang Yan, Yifan Le, Ruohui Wang, Lijun Li, Jing Shao, Tao Gui, Qi Zhang, Xuanjing Huang:
EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models. CoRR abs/2403.12171 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-16579
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-16579
Zhiheng Xi, Dingwen Yang, Jixuan Huang, Jiafu Tang, Guanyu Li, Yiwen Ding, Wei He, Boyang Hong, Shihan Dou, Wenyu Zhan, Xiao Wang, Rui Zheng, Tao Ji, Xiaowei Shi, Yitao Zhai, Rongxiang Weng, Jingang Wang, Xunliang Cai, Tao Gui, Zuxuan Wu, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Yu-Gang Jiang:
Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision. CoRR abs/2411.16579 (2024)
2023
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-07864
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07864
Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Tao Gui:
The Rise and Potential of Large Language Model Based Agents: A Survey. CoRR abs/2309.07864 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-06762
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-06762
Xiao Wang, Yuansen Zhang, Tianze Chen, Songyang Gao, Senjie Jin, Xianjun Yang, Zhiheng Xi, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xuanjing Huang:
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models. CoRR abs/2310.06762 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-09979
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09979
Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Jun Zhao, Wei Shen, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Xiaoran Fan, Shiliang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment. CoRR abs/2312.09979 (2023)

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.