"Bayesian Policy Gradient Algorithms."

Mohammad Ghavamzadeh, Yaakov Engel (2006)