


default search action
Jiacai Liu
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[j1]Jungan Zhan, Rong Fan, Minghao Liu, Jiacai Liu
, Wenjian Zhao:
Equal Division Contribution Values of Trapezoidal Fuzzy Numbers and Their Application to Profit Allocation in Cold Chain Logistics for Agricultural Products. Symmetry 17(2): 210 (2025)
[c2]Wenye Li, Jiacai Liu, Ke Wei:
ϕ-Update: A Class of Policy Update Methods with Policy Convergence Guarantee. ICLR 2025
[i12]Jujie He, Jiacai Liu, Chris Yuhao Liu, Rui Yan, Chaojie Wang, Peng Cheng, Xiaoyu Zhang, Fuxiang Zhang, Jiacheng Xu, Wei Shen, Siyuan Li, Liang Zeng, Tianwen Wei, Cheng Cheng, Bo An, Yang Liu, Yahui Zhou:
Skywork Open Reasoner 1 Technical Report. CoRR abs/2505.22312 (2025)
[i11]Chris Yuhao Liu, Liang Zeng, Yuzhen Xiao, Jujie He, Jiacai Liu, Chaojie Wang, Rui Yan, Wei Shen, Fuxiang Zhang, Jiacheng Xu, Yang Liu, Yahui Zhou:
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy. CoRR abs/2507.01352 (2025)
[i10]Jiawei Wang, Jiacai Liu, Yuqian Fu, Yingru Li, Xintao Wang, Yuan Lin, Yu Yue, Lin Zhang, Yang Wang, Ke Wang:
Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents. CoRR abs/2509.09265 (2025)
[i9]Jiacai Liu, Wenye Li, Ke Wei:
On the Convergence of Policy Mirror Descent with Temporal Difference Evaluation. CoRR abs/2509.18822 (2025)
[i8]Yingru Li, Jiacai Liu, Jiawei Xu, Yuxuan Tong, Ziniu Li, Baoxiang Wang:
Trust Region Masking for Long-Horizon LLM Reinforcement Learning. CoRR abs/2512.23075 (2025)
[i7]Yingru Li, Jiawei Xu, Jiacai Liu, Yuxuan Tong, Ziniu Li, Tianle Cai, Ge Zhang, Qian Liu, Baoxiang Wang:
Taming the Tail: Stable LLM Reinforcement Learning via Dynamic Vocabulary Pruning. CoRR abs/2512.23087 (2025)
[i6]Yingru Li, Ziniu Li, Jiacai Liu:
A Note on Hybrid Online Reinforcement and Imitation Learning for LLMs: Formulations and Algorithms. CoRR abs/2512.23097 (2025)
[i5]Wenye Li, Hongxu Chen, Jiacai Liu, Ke Wei:
Policy Mirror Descent with Temporal Difference Learning: Sample Complexity under Online Markov Data. CoRR abs/2512.24056 (2025)- 2024
[i4]Jiacai Liu, Wenye Li, Ke Wei:
Elementary Analysis of Policy Gradient Methods. CoRR abs/2404.03372 (2024)
[i3]Chris Yuhao Liu, Liang Zeng, Jiacai Liu, Rui Yan, Jujie He, Chaojie Wang, Shuicheng Yan, Yang Liu, Yahui Zhou:
Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs. CoRR abs/2410.18451 (2024)
[i2]Jiacai Liu, Chaojie Wang, Chris Yuhao Liu, Liang Zeng, Rui Yan, Yiwen Sun, Yang Liu, Yahui Zhou:
Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization. CoRR abs/2412.18279 (2024)- 2023
[i1]Jiacai Liu, Jinchi Chen, Ke Wei:
On the Linear Convergence of Policy Gradient under Hadamard Parameterization. CoRR abs/2305.19575 (2023)
2010 – 2019
- 2013
[c1]Zhaoqiang Huang, Jiacai Liu, Xuefeng Sun:
Detection of hydrothermally alteration rocks in the east Gandise, Tibet (China) using aster imagery. IGARSS 2013: 2860-2863
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-01-28 03:51 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







