"Reinforcement learning from simultaneous human and MDP reward."

W. Bradley Knox, Peter Stone (2012)

Details and statistics

DOI:

access: closed

type: Conference or Workshop Paper

metadata version: 2015-03-19

a service of  Schloss Dagstuhl - Leibniz Center for Informatics