Yufei Zhang 0001

Person information

affiliation: Imperial College London, Department of Mathematics, Westminster, UK
affiliation: University of Oxford, Mathematical Institute, UK
affiliation (2021-2023): London School of Economics and Political Science, Department of Statistics, UK

2020 – today

2024
[j11]
  persistent URL:
  - https://dblp.org/rec/journals/siamco/SzpruchTZ24
Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang:
Optimal Scheduling of Entropy Regularizer for Continuous-Time Linear-Quadratic Reinforcement Learning. SIAM J. Control. Optim. 62(1): 135-166 (2024)
[j10]
  persistent URL:
  - https://dblp.org/rec/journals/siamco/GiegrichRZ24
Michael Giegrich, Christoph Reisinger, Yufei Zhang:
Convergence of Policy Gradient Methods for Finite-Horizon Exploratory Linear-Quadratic Control Problems. SIAM J. Control. Optim. 62(2): 1060-1092 (2024)
[j9]
  persistent URL:
  - https://dblp.org/rec/journals/siamsc/ReisingerSZ24
Christoph Reisinger, Wolfgang Stockinger, Yufei Zhang:
A Fast Iterative PDE-Based Algorithm for Feedback Controls of Nonsmooth Mean-Field Control Problems. SIAM J. Sci. Comput. 46(4): 2737- (2024)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-01198
Bekzhan Kerimkulov, David Siska, Lukasz Szpruch, Yufei Zhang:
Mirror Descent for Stochastic Control Problems with Measure-valued Controls. CoRR abs/2401.01198 (2024)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-03624
Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang:
ε-Policy Gradient for Online Pricing. CoRR abs/2405.03624 (2024)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-20250
Deven Sethi, David Siska, Yufei Zhang:
Entropy annealing for policy mirror descent in continuous time and space. CoRR abs/2405.20250 (2024)
2023
[j8]
  persistent URL:
  - https://dblp.org/rec/journals/siamco/GuoHZ23
Xin Guo, Anran Hu, Yufei Zhang:
Reinforcement Learning for Linear-Convex Models with Jumps via Stability Analysis of Feedback Controls. SIAM J. Control. Optim. 61(2): 755-787 (2023)
[j7]
  persistent URL:
  - https://dblp.org/rec/journals/siamco/ReisingerSZ23
Christoph Reisinger, Wolfgang Stockinger, Yufei Zhang:
Linear Convergence of a Policy Gradient Method for Some Finite Horizon Continuous Time Control Problems. SIAM J. Control. Optim. 61(6): 3526-3558 (2023)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-14258
Melker Hoglund, Emilio Ferrucci, Camilo Hernández, Aitor Muguruza Gonzalez, Cristopher Salvi, Leandro Sánchez-Betancourt, Yufei Zhang:
A Neural RDE approach for continuous-time non-Markovian stochastic control problems. CoRR abs/2306.14258 (2023)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-06935
Tanut Treetanthiploet, Yufei Zhang, Lukasz Szpruch, Isaac Bowers-Barnard, Henrietta Ridley, James Hickey, Chris Pearce:
Insurance pricing on price comparison websites via reinforcement learning. CoRR abs/2308.06935 (2023)
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-02259
Xin Guo, Yufei Zhang:
Towards An Analytical Framework for Potential Games. CoRR abs/2310.02259 (2023)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-02951
Bekzhan Kerimkulov, James-Michael Leahy, David Siska, Lukasz Szpruch, Yufei Zhang:
A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces. CoRR abs/2310.02951 (2023)
2022
[j6]
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/BaseiGHZ22
Matteo Basei, Xin Guo, Anran Hu, Yufei Zhang:
Logarithmic Regret for Episodic Continuous-Time Linear-Quadratic Reinforcement Learning over a Finite-Time Horizon. J. Mach. Learn. Res. 23: 178:1-178:34 (2022)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-11758
Christoph Reisinger, Wolfgang Stockinger, Yufei Zhang:
Linear convergence of a policy gradient method for finite horizon continuous time stochastic control problems. CoRR abs/2203.11758 (2022)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-04466
Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang:
Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning. CoRR abs/2208.04466 (2022)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00617
Michael Giegrich, Christoph Reisinger, Yufei Zhang:
Convergence of policy gradient methods for finite-horizon stochastic linear-quadratic control problems. CoRR abs/2211.00617 (2022)
2021
[j5]
  persistent URL:
  - https://dblp.org/rec/journals/cma/ReisingerZ21
Christoph Reisinger, Yufei Zhang:
A penalty scheme and policy iteration for nonlocal HJB variational inequalities with monotone nonlinearities. Comput. Math. Appl. 93: 199-213 (2021)
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/focm/ItoRZ21
Kazufumi Ito, Christoph Reisinger, Yufei Zhang:
A Neural Network-Based Policy Iteration Algorithm with Global H²-Superlinear Convergence for Stochastic Games on Domains. Found. Comput. Math. 21(2): 331-374 (2021)
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/siamco/ReisingerZ21
Christoph Reisinger, Yufei Zhang:
Regularity and Stability of Feedback Relaxed Controls. SIAM J. Control. Optim. 59(5): 3118-3151 (2021)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-09311
Xin Guo, Anran Hu, Yufei Zhang:
Reinforcement learning for linear-convex models with jumps via stability analysis of feedback controls. CoRR abs/2104.09311 (2021)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-10264
Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang:
Exploration-exploitation trade-off for continuous-time episodic reinforcement learning with linear-convex models. CoRR abs/2112.10264 (2021)
2020
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/siamco/ReisingerZ20
Christoph Reisinger, Yufei Zhang:
Error Estimates of Penalty Schemes for Quasi-Variational Inequalities Arising from Impulse Control Problems. SIAM J. Control. Optim. 58(1): 243-276 (2020)
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenZRS20
Xinshi Chen, Yufei Zhang, Christoph Reisinger, Le Song:
Understanding Deep Architecture with Reasoning Layer. NeurIPS 2020
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-03148
Christoph Reisinger, Yufei Zhang:
Regularity and stability of feedback relaxed controls. CoRR abs/2001.03148 (2020)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-13401
Xinshi Chen, Yufei Zhang, Christoph Reisinger, Le Song:
Understanding Deep Architectures with Reasoning Layer. CoRR abs/2006.13401 (2020)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-15316
Matteo Basei, Xin Guo, Anran Hu, Yufei Zhang:
Logarithmic regret for episodic continuous-time linear-quadratic reinforcement learning over a finite-time horizon. CoRR abs/2006.15316 (2020)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-07731
Christoph Reisinger, Wolfgang Stockinger, Yufei Zhang:
A posteriori error estimates for fully coupled McKean-Vlasov forward-backward SDEs. CoRR abs/2007.07731 (2020)
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-08175
Christoph Reisinger, Wolfgang Stockinger, Yufei Zhang:
Regularity and time discretization of extended mean field control problems: a McKean-Vlasov FBSDE approach. CoRR abs/2009.08175 (2020)

2010 – 2019

2019
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/siamnum/ReisingerZ19
Christoph Reisinger, Yufei Zhang:
A Penalty Scheme for Monotone Systems with Interconnected Obstacles: Convergence and Error Estimates. SIAM J. Numer. Anal. 57(4): 1625-1648 (2019)
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-06652
Christoph Reisinger, Yufei Zhang:
Rectified deep neural networks overcome the curse of dimensionality for nonsmooth value functions in zero-sum games of nonlinear stiff systems. CoRR abs/1903.06652 (2019)
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-02304
Kazufumi Ito, Christoph Reisinger, Yufei Zhang:
A neural network based policy iteration algorithm with global H²-superlinear convergence for stochastic games on domains. CoRR abs/1906.02304 (2019)
