"√n-Regret for Learning in Markov Decision Processes with Function ..."

Kefan Dong et al. (2019)
a service of Schloss Dagstuhl - Leibniz Center for Informatics