"Information theoretic reward shaping for curiosity driven learning in POMDPs."

Nassim Mafi, Farnaz Abtahi, Ian R. Fasel (2011)
a service of Schloss Dagstuhl - Leibniz Center for Informatics