"Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning."

Andrea Zanette, Martin J. Wainwright, Emma Brunskill (2021)