default search action

combined dblp search
author search
venue search
publication search

ask others

Juntao Dai

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2026
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/csur/JiQCZZHLWDHVZZDPXONTFMW26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csur/JiQCZZHLWDHVZZDPXONTFMW26
Jiaming Ji, Tianyi Qiu, Boyuan Chen, Jiayi Zhou, Borong Zhang, Donghai Hong, Hantao Lou, Kaile Wang, Yawen Duan, Zhonghao He, Lukas Vierling, Zhaowei Zhang, Fanzhi Zeng, Juntao Dai, Xuehai Pan, Hua Xu, Aidan O'Gara, Kwan Ng, Brian Tse, Jie Fu, Stephen Mcaleer, Yanfeng Wang, Mingchuan Yang, Yunhuai Liu, Yizhou Wang, Song-Chun Zhu, Yike Guo, Yaodong Yang, Wen Gao:
AI Alignment: A Contemporary Survey. ACM Comput. Surv. 58(5): 132:1-132:38 (2026)
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhouWCYZJDCH26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhouWCYZJDCH26
Yujin Zhou, Pengcheng Wen, Jiale Chen, Boqin Yin, Han Zhu, Jiaming Ji, Juntao Dai, Chi-Min Chan, Sirui Han:
What, Whether and How? Unveiling Process Reward Models for Thinking with Images Reasoning. AAAI 2026: 29071-29079
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2602-08346
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2602-08346
Yujin Zhou, Pengcheng Wen, Jiale Chen, Boqin Yin, Han Zhu, Jiaming Ji, Juntao Dai, Chi-Min Chan, Sirui Han:
What, Whether and How? Unveiling Process Reward Models for Thinking with Images Reasoning. CoRR abs/2602.08346 (2026)
2025
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/asc/SunPDL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/asc/SunPDL25
Bei Sun, Zhixuan Peng, Juntao Dai, Yonggang Li:
A control-oriented operation mode recognizing method using fuzzy evaluation and attention LSTM networks. Appl. Soft Comput. 180: 113326 (2025)
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/emnlp/CaoLDYZZSLHG25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/CaoLDYZZSLHG25
Chuxue Cao, Mengze Li, Juntao Dai, Jinluan Yang, Zijian Zhao, Shengyu Zhang, Weijie Shi, Chengzhong Liu, Sirui Han, Yike Guo:
Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving. EMNLP 2025: 12429-12449
[c7]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/KouYLPLLDCHG25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/KouYLPLLDCHG25
Zhizhuo Kou, Holam Yu, Junyu Luo, Jingshu Peng, Xujia Li, Chengzhong Liu, Juntao Dai, Lei Chen, Sirui Han, Yike Guo:
Automate Strategy Finding with LLM in Quant Investment. EMNLP (Findings) 2025: 18517-18533
[c6]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/DaiC0Z025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DaiC0Z025
Juntao Dai, Taiye Chen, Yaodong Yang, Qian Zheng, Gang Pan:
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization. ICLR 2025
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-12918
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-12918
Pengcheng Wen, Jiaming Ji, Chi-Min Chan, Juntao Dai, Donghai Hong, Yaodong Yang, Sirui Han, Yike Guo:
ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs. CoRR abs/2503.12918 (2025)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-17682
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-17682
Jiaming Ji, Xinyu Chen, Rui Pan, Han Zhu, Conghui Zhang, Jiahao Li, Donghai Hong, Boyuan Chen, Jiayi Zhou, Kaile Wang, Juntao Dai, Chi-Min Chan, Sirui Han, Yike Guo, Yaodong Yang:
Safe RLHF-V: Safe Reinforcement Learning from Human Feedback in Multimodal Large Language Models. CoRR abs/2503.17682 (2025)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-18130
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-18130
Juntao Dai, Taiye Chen, Yaodong Yang, Qian Zheng, Gang Pan:
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization. CoRR abs/2503.18130 (2025)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-02177
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-02177
Chuxue Cao, Zhenghao Zhu, Junqi Zhu, Guoying Lu, Siyu Peng, Juntao Dai, Weijie Shi, Sirui Han, Yike Guo:
Measuring Hong Kong Massive Multi-Task Language Understanding. CoRR abs/2505.02177 (2025)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-18807
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-18807
Jiaming Ji, Wenqi Chen, Kaile Wang, Donghai Hong, Sitong Fang, Boyuan Chen, Jiayi Zhou, Juntao Dai, Sirui Han, Yike Guo, Yaodong Yang:
Mitigating Deceptive Alignment via Self-Monitoring. CoRR abs/2505.18807 (2025)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-20214
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-20214
Jiaming Ji, Sitong Fang, Wenjing Cao, Jiahao Li, Xuyao Wang, Juntao Dai, Chi-Min Chan, Sirui Han, Yike Guo, Yaodong Yang:
The Mirage of Multimodality: Where Truth is Tested and Honesty Unravels. CoRR abs/2505.20214 (2025)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-23950
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-23950
Boyuan Chen, Donghai Hong, Jiaming Ji, Jiacheng Zheng, Bowen Dong, Jiayi Zhou, Kaile Wang, Juntao Dai, Xuyao Wang, Wenqi Chen, Qirui Zheng, Wenxin Li, Sirui Han, Yike Guo, Yaodong Yang:
InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback. CoRR abs/2505.23950 (2025)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-06636
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-06636
Chuxue Cao, Han Zhu, Jiaming Ji, Qichao Sun, Zhenghao Zhu, Yinyu Wu, Juntao Dai, Yaodong Yang, Sirui Han, Yike Guo:
SafeLawBench: Towards Safe Alignment of Large Language Models. CoRR abs/2506.06636 (2025)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-13245
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-13245
Guoxi Zhang, Jiawei Chen, Tianzhuo Yang, Jiaming Ji, Yaodong Yang, Juntao Dai:
A Game-Theoretic Negotiation Framework for Cross-Cultural Consensus in LLMs. CoRR abs/2506.13245 (2025)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-17104
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-17104
Chuxue Cao, Mengze Li, Juntao Dai, Jinluan Yang, Zijian Zhao, Shengyu Zhang, Weijie Shi, Chengzhong Liu, Sirui Han, Yike Guo:
Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving. CoRR abs/2506.17104 (2025)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-20702
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-20702
Yoshua Bengio, Tegan Maharaj, Luke Ong, Stuart Russell, Dawn Song, Max Tegmark, Lan Xue, Ya-Qin Zhang, Stephen Casper, Wan Sie Lee, Sören Mindermann, Vanessa Wilfred, Vidhisha Balachandran, Fazl Barez, Michael Belinsky, Imane Bello, Malo Bourgon, Mark Brakel, Siméon Campos, Duncan Cass-Beggs, Jiahao Chen, Rumman Chowdhury, Kuan Chua Seah, Jeff Clune, Juntao Dai, Agnès Delaborde, Nouha Dziri, Francisco Eiras, Joshua Engels, Jinyu Fan, Adam Gleave, Noah Goodman, Fynn Heide, Johannes Heidecke, Dan Hendrycks, Cyrus Hodes, Bryan Low Kian Hsiang, Minlie Huang, Sami Jawhar, Wang Jingyu, Adam Tauman Kalai, Meindert Kamphuis, Mohan S. Kankanhalli, Subhash Kantamneni, Mathias Bonde Kirk, Thomas Kwa, Jeffrey Ladish, Kwok-Yan Lam, Wan Lee Sie, Taewhi Lee, Xiaojian Li, Jiajun Liu, Chaochao Lu, Yifan Mai, Richard Mallah, Julian Michael, Nick Moës, Simon Möller, Kihyuk Nam, Kwan Yee Ng, Mark Nitzberg, Besmira Nushi, Seán Ó hÉigeartaigh, Alejandro Ortega, Pierre Peigné, James Petrie, Benjamin Prud'homme, Reihaneh Rabbany, Nayat Sánchez-Pi, Sarah Schwettmann, Buck Shlegeris, Saad Siddiqui, Aradhana Sinha, Martín Soto, Cheston Tan, Dong Ting, William-Chandra Tjhi, Robert Trager, Brian Tse, Anthony Tung K. H., John Willes, Denise Wong, Wei Xu, Rongwu Xu, Yi Zeng, HongJiang Zhang, Djordje Zikelic:
The Singapore Consensus on Global AI Safety Research Priorities. CoRR abs/2506.20702 (2025)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-12133
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-12133
Han Zhu, Juntao Dai, Jiaming Ji, Haoran Li, Chengkun Cai, Pengcheng Wen, Chi-Min Chan, Boyuan Chen, Yaodong Yang, Sirui Han, Yike Guo:
SafeMT: Multi-turn Safety for Multimodal Language Models. CoRR abs/2510.12133 (2025)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-24816
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-24816
Yakun Cui, Fushuo Huo, Weijie Shi, Juntao Dai, Hang Du, Zhenghao Zhu, Sirui Han, Yike Guo:
Perception, Understanding and Reasoning, A Multimodal Benchmark for Video Fake News Detection. CoRR abs/2510.24816 (2025)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-24820
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-24820
Ruiyang Zhang, Jiahao Luo, Xiaoru Feng, Qiufan Pang, Yaodong Yang, Juntao Dai:
SafeEditor: Unified MLLM for Efficient Post-hoc T2I Safety Editing. CoRR abs/2510.24820 (2025)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-22619
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-22619
Boyuan Chen, Sitong Fang, Jiaming Ji, Yanxu Zhu, Pengcheng Wen, Jinzhou Wu, Yingshui Tan, Boren Zheng, Mengying Yuan, Wenqi Chen, Donghai Hong, Alex Qiu, Xin Chen, Jiayi Zhou, Kaile Wang, Juntao Dai, Borong Zhang, Tianzhuo Yang, Saad Siddiqui, Isabella Duan, Yawen Duan, Brian Tse, Jen-Tse Huang, Kun Wang, Baihui Zheng, Jiaheng Liu, Jian Yang, Yiming Li, Wenting Chen, Dongrui Liu, Lukas Vierling, Zhiheng Xi, Haobo Fu, Wenxuan Wang, Jitao Sang, Zhengyan Shi, Chi-Min Chan, Eugenie Shi, Simin Li, Juncheng Li, Jian Yang, Wei Ji, Dong Li, Jinglin Yang, Jun Song, Yinpeng Dong, Jie Fu, Bo Zheng, Min Yang, Yike Guo, Philip Torr, Robert Trager, Yi Zeng, Zhongyuan Wang, Yaodong Yang, Tiejun Huang, Ya-Qin Zhang, Hongjiang Zhang, Andrew Yao:
AI Deception: Risks, Dynamics, and Controls. CoRR abs/2511.22619 (2025)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-04864
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-04864
Dadi Guo, Qingyu Liu, Dongrui Liu, Qihan Ren, Shuai Shao, Tianyi Qiu, Haoran Li, Yi R. Fung, Zhongjie Ba, Juntao Dai, Jiaming Ji, Zhikai Chen, Jialing Tao, Yaodong Yang, Jing Shao, Xia Hu:
Are Your Agents Upward Deceivers? CoRR abs/2512.04864 (2025)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-22539
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-22539
Borong Zhang, Jiahao Li, Jiachen Shen, Yishuai Cai, Yuhao Zhang, Yuanpei Chen, Juntao Dai, Jiaming Ji, Yaodong Yang:
VLA-Arena: An Open-Source Framework for Benchmarking Vision-Language-Action Models. CoRR abs/2512.22539 (2025)
2024
[j1]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/JiZZDPS0GL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/JiZZDPS0GL024
Jiaming Ji, Jiayi Zhou, Borong Zhang, Juntao Dai, Xuehai Pan, Ruiyang Sun, Weidong Huang, Yiran Geng, Mickel Liu, Yaodong Yang:
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research. J. Mach. Learn. Res. 25: 285:1-285:6 (2024)
[c5]
- view
- export record
  dblp key:
  - conf/icml/Dai0Z024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Dai0Z024
Juntao Dai, Yaodong Yang, Qian Zheng, Gang Pan:
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation. ICML 2024: 9872-9903
[c4]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/DaiCWYCJ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DaiCWYCJ024
Juntao Dai, Tianle Chen, Xuyao Wang, Ziran Yang, Taiye Chen, Jiaming Ji, Yaodong Yang:
SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset. NeurIPS 2024
[c3]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Ji0LHZPQD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Ji0LHZPQD024
Jiaming Ji, Boyuan Chen, Hantao Lou, Donghai Hong, Borong Zhang, Xuehai Pan, Tianyi Qiu, Juntao Dai, Yaodong Yang:
Aligner: Efficient Alignment by Learning to Correct. NeurIPS 2024
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-02416
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-02416
Jiaming Ji, Boyuan Chen, Hantao Lou, Donghai Hong, Borong Zhang, Xuehai Pan, Juntao Dai, Yaodong Yang:
Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction. CoRR abs/2402.02416 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-00162
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-00162
Jiayi Zhou, Jiaming Ji, Juntao Dai, Yaodong Yang:
Sequence to Sequence Reward Modeling: Improving RLHF by Language Feedback. CoRR abs/2409.00162 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-11138
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-11138
Juntao Dai, Yaodong Yang, Qian Zheng, Gang Pan:
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation. CoRR abs/2412.11138 (2024)
2023
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/DaiJYZ023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/DaiJYZ023
Juntao Dai, Jiaming Ji, Long Yang, Qian Zheng, Gang Pan:
Augmented Proximal Policy Optimization for Safe Reinforcement Learning. AAAI 2023: 7288-7295
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-09304
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-09304
Jiaming Ji, Jiayi Zhou, Borong Zhang, Juntao Dai, Xuehai Pan, Ruiyang Sun, Weidong Huang, Yiran Geng, Mickel Liu, Yaodong Yang:
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research. CoRR abs/2305.09304 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-04657
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-04657
Jiaming Ji, Mickel Liu, Juntao Dai, Xuehai Pan, Chi Zhang, Ce Bian, Boyuan Zhang, Ruiyang Sun, Yizhou Wang, Yaodong Yang:
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset. CoRR abs/2307.04657 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-10305
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10305
Aiyuan Yang, Bin Xiao, Bingning Wang, Borong Zhang, Ce Bian, Chao Yin, Chenxu Lv, Da Pan, Dian Wang, Dong Yan, Fan Yang, Fei Deng, Feng Wang, Feng Liu, Guangwei Ai, Guosheng Dong, Haizhou Zhao, Hang Xu, Haoze Sun, Hongda Zhang, Hui Liu, Jiaming Ji, Jian Xie, Juntao Dai, Kun Fang, Lei Su, Liang Song, Lifeng Liu, Liyun Ru, Luyao Ma, Mang Wang, Mickel Liu, MingAn Lin, Nuolan Nie, Peidong Guo, Ruiyang Sun, Tao Zhang, Tianpeng Li, Tianyu Li, Wei Cheng, Weipeng Chen, Xiangrong Zeng, Xiaochuan Wang, Xiaoxi Chen, Xin Men, Xin Yu, Xuehai Pan, Yanjun Shen, Yiding Wang, Yiyu Li, Youxin Jiang, Yuchen Gao, Yupeng Zhang, Zenan Zhou, Zhiying Wu:
Baichuan 2: Open Large-scale Language Models. CoRR abs/2309.10305 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-12567
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-12567
Jiaming Ji, Borong Zhang, Jiayi Zhou, Xuehai Pan, Weidong Huang, Ruiyang Sun, Yiran Geng, Yifan Zhong, Juntao Dai, Yaodong Yang:
Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark. CoRR abs/2310.12567 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-19852
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-19852
Jiaming Ji, Tianyi Qiu, Boyuan Chen, Borong Zhang, Hantao Lou, Kaile Wang, Yawen Duan, Zhonghao He, Jiayi Zhou, Zhaowei Zhang, Fanzhi Zeng, Kwan Yee Ng, Juntao Dai, Xuehai Pan, Aidan O'Gara, Yingshan Lei, Hua Xu, Brian Tse, Jie Fu, Stephen McAleer, Yaodong Yang, Yizhou Wang, Song-Chun Zhu, Yike Guo, Wen Gao:
AI Alignment: A Comprehensive Survey. CoRR abs/2310.19852 (2023)
2022
[c1]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YangJDZZL0022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangJDZZL0022
Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan:
Constrained Update Projection Approach to Safe Policy Optimization. NeurIPS 2022
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-07565
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-07565
Long Yang, Jiaming Ji, Juntao Dai, Yu Zhang, Pengfei Li, Gang Pan:
CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning. CoRR abs/2202.07565 (2022)
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-07089
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-07089
Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan:
Constrained Update Projection Approach to Safe Policy Optimization. CoRR abs/2209.07089 (2022)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.