"Reducing policy degradation in neuro-dynamic programming."

Thomas Gabel, Martin A. Riedmiller (2006)

Details and statistics

DOI:

access: open

type: Conference or Workshop Paper

metadata version: 2022-08-02

a service of Schloss Dagstuhl - Leibniz Center for Informatics