"Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error ..."

Yuan Xie et al. (2018)
a service of Schloss Dagstuhl - Leibniz Center for Informatics