Hado van Hasselt

Hado Philip van Hasselt

Person information

affiliation: Google DeepMind, London, UK
affiliation (PhD 2011): Utrecht University, The Netherlands

2020 – today

2025
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/nature/OhFKCHZSHS25
Junhyuk Oh, Gregory Farquhar, Iurii Kemaev, Dan A. Calian, Matteo Hessel, Luisa M. Zintgraf, Satinder Singh, Hado van Hasselt, David Silver:
Discovering state-of-the-art reinforcement learning algorithms. Nat. 648(8092): 312-319 (2025)
[c53]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/SchmittSH25
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt:
General Uncertainty Estimation with Delta Variances. AAAI 2025: 20318-20328
[c52]
  persistent URL:
  - https://dblp.org/rec/conf/icml/KemaevCZFH25
Iurii Kemaev, Dan A. Calian, Luisa M. Zintgraf, Gregory Farquhar, Hado van Hasselt:
Scalable Meta-Learning via Mixed-Mode Differentiation. ICML 2025
[c51]
  persistent URL:
  - https://dblp.org/rec/conf/icml/PfauDBATH25
David Pfau, Ian Davies, Diana L. Borsa, João Guilherme Madeira Araújo, Brendan D. Tracey, Hado van Hasselt:
Wasserstein Policy Optimization. ICML 2025
[i54]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-14698
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt:
General Uncertainty Estimation with Delta Variances. CoRR abs/2502.14698 (2025)
[i53]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-00663
David Pfau, Ian Davies, Diana Borsa, João G. M. Araújo, Brendan D. Tracey, Hado van Hasselt:
Wasserstein Policy Optimization. CoRR abs/2505.00663 (2025)
[i52]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-00793
Iurii Kemaev, Dan A. Calian, Luisa M. Zintgraf, Gregory Farquhar, Hado van Hasselt:
Scalable Meta-Learning via Mixed-Mode Differentiation. CoRR abs/2505.00793 (2025)
[i51]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-17895
Dan A. Calian, Gregory Farquhar, Iurii Kemaev, Luisa M. Zintgraf, Matteo Hessel, Jeremy Shar, Junhyuk Oh, András György, Tom Schaul, Jeffrey Dean, Hado van Hasselt, David Silver:
DataRater: Meta-Learned Dataset Curation. CoRR abs/2505.17895 (2025)
2024
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/PignatelliFGMHT24
Eduardo Pignatelli, Johan Ferret, Matthieu Geist, Thomas Mesnard, Hado van Hasselt, Laura Toni:
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning. Trans. Mach. Learn. Res. 2024 (2024)
[c50]
  persistent URL:
  - https://dblp.org/rec/conf/collas/LyleZKHPMD24
Clare Lyle, Zeyu Zheng, Khimya Khetarpal, Hado van Hasselt, Razvan Pascanu, James Martens, Will Dabney:
Disentangling the Causes of Plasticity Loss in Neural Networks. CoLLAs 2024: 750-783
[c49]
  persistent URL:
  - https://dblp.org/rec/conf/nips/LyleZKMHPD24
Clare Lyle, Zeyu Zheng, Khimya Khetarpal, James Martens, Hado Philip van Hasselt, Razvan Pascanu, Will Dabney:
Normalization and effective learning rates in reinforcement learning. NeurIPS 2024
[i50]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-18762
Clare Lyle, Zeyu Zheng, Khimya Khetarpal, Hado van Hasselt, Razvan Pascanu, James Martens, Will Dabney:
Disentangling the Causes of Plasticity Loss in Neural Networks. CoRR abs/2402.18762 (2024)
[i49]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-01800
Clare Lyle, Zeyu Zheng, Khimya Khetarpal, James Martens, Hado van Hasselt, Razvan Pascanu, Will Dabney:
Normalization and effective learning rates in reinforcement learning. CoRR abs/2407.01800 (2024)
2023
[c48]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/SchmittSH23
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt:
Exploration via Epistemic Value Estimation. AAAI 2023: 9742-9751
[c47]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Kapturowski0JRH23
Steven Kapturowski, Victor Campos, Ray Jiang, Nemanja Rakicevic, Hado van Hasselt, Charles Blundell, Adrià Puigdomènech Badia:
Human-level Atari 200x faster. ICLR 2023
[c46]
  persistent URL:
  - https://dblp.org/rec/conf/nips/Abel0RPHS23
David Abel, André Barreto, Benjamin Van Roy, Doina Precup, Hado Philip van Hasselt, Satinder Singh:
A Definition of Continual Reinforcement Learning. NeurIPS 2023
[c45]
  persistent URL:
  - https://dblp.org/rec/conf/nips/FlennerhagZOH0S23
Sebastian Flennerhag, Tom Zahavy, Brendan O'Donoghue, Hado Philip van Hasselt, András György, Satinder Singh:
Optimistic Meta-Gradients. NeurIPS 2023
[i48]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-03236
Sebastian Flennerhag, Tom Zahavy, Brendan O'Donoghue, Hado van Hasselt, András György, Satinder Singh:
Optimistic Meta-Gradients. CoRR abs/2301.03236 (2023)
[i47]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-04250
Chentian Jiang, Nan Rosemary Ke, Hado van Hasselt:
Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration. CoRR abs/2302.04250 (2023)
[i46]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-04012
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt:
Exploration via Epistemic Value Estimation. CoRR abs/2303.04012 (2023)
[i45]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-11044
David Abel, André Barreto, Hado van Hasselt, Benjamin Van Roy, Doina Precup, Satinder Singh:
On the Convergence of Bounded Agents. CoRR abs/2307.11044 (2023)
[i44]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-11046
David Abel, André Barreto, Benjamin Van Roy, Doina Precup, Hado van Hasselt, Satinder Singh:
A Definition of Continual Reinforcement Learning. CoRR abs/2307.11046 (2023)
[i43]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-01072
Eduardo Pignatelli, Johan Ferret, Matthieu Geist, Thomas Mesnard, Hado van Hasselt, Laura Toni:
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning. CoRR abs/2312.01072 (2023)
2022
[c44]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/JiangZC0H22
Ray Jiang, Shangtong Zhang, Veronica Chelu, Adam White, Hado van Hasselt:
Learning Expected Emphatic Traces for Deep RL. AAAI 2022: 7015-7023
[c43]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/KirschFHFOC22
Louis Kirsch, Sebastian Flennerhag, Hado van Hasselt, Abram L. Friesen, Junhyuk Oh, Yutian Chen:
Introducing Symmetries to Black Box Meta Reinforcement Learning. AAAI 2022: 7202-7210
[c42]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/SchmittSH22
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt:
Chaining Value Functions for Off-Policy Learning. AAAI 2022: 8187-8195
[c41]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/FlennerhagSZHS022
Sebastian Flennerhag, Yannick Schroecker, Tom Zahavy, Hado van Hasselt, David Silver, Satinder Singh:
Bootstrapped Meta-Learning. ICLR 2022
[c40]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SilverGDHH22
David Silver, Anirudh Goyal, Ivo Danihelka, Matteo Hessel, Hado van Hasselt:
Learning by Directional Gradient Descent. ICLR 2022
[i42]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-06468
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt:
Chaining Value Functions for Off-Policy Learning. CoRR abs/2201.06468 (2022)
[i41]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-09699
Veronica Chelu, Diana Borsa, Doina Precup, Hado van Hasselt:
Selective Credit Assignment. CoRR abs/2202.09699 (2022)
[i40]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-07550
Steven Kapturowski, Víctor Campos, Ray Jiang, Nemanja Rakicevic, Hado van Hasselt, Charles Blundell, Adrià Puigdomènech Badia:
Human-level Atari 200x faster. CoRR abs/2209.07550 (2022)
2021
[c39]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HasseltMHSBB21
Hado van Hasselt, Sephora Madjiheurem, Matteo Hessel, David Silver, André Barreto, Diana Borsa:
Expected Eligibility Traces. AAAI 2021: 9997-10005
[c38]
  persistent URL:
  - https://dblp.org/rec/conf/atal/GarneloCLTOGHB21
Marta Garnelo, Wojciech Marian Czarnecki, Siqi Liu, Dhruva Tirumala, Junhyuk Oh, Gauthier Gidel, Hado van Hasselt, David Balduzzi:
Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity. AAMAS 2021: 1501-1503
[c37]
  persistent URL:
  - https://dblp.org/rec/conf/icml/HesselDVGSSWSH21
Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado van Hasselt:
Muesli: Combining Improvements in Policy Optimization. ICML 2021: 4214-4226
[c36]
  persistent URL:
  - https://dblp.org/rec/conf/icml/JiangZXWHBH21
Ray Jiang, Tom Zahavy, Zhongwen Xu, Adam White, Matteo Hessel, Charles Blundell, Hado van Hasselt:
Emphatic Algorithms for Deep Reinforcement Learning. ICML 2021: 5023-5033
[c35]
  persistent URL:
  - https://dblp.org/rec/conf/nips/FarquharBMFHHS21
Gregory Farquhar, Kate Baumli, Zita Marinho, Angelos Filos, Matteo Hessel, Hado Philip van Hasselt, David Silver:
Self-Consistent Models and Values. NeurIPS 2021: 1111-1125
[c34]
  persistent URL:
  - https://dblp.org/rec/conf/nips/VeeriahZHXOKHSS21
Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh:
Discovery of Options via Meta-Learned Subgoals. NeurIPS 2021: 29861-29873
[i39]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-06741
Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh:
Discovery of Options via Meta-Learned Subgoals. CoRR abs/2102.06741 (2021)
[i38]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-12425
David Raposo, Samuel Ritter, Adam Santoro, Greg Wayne, Theophane Weber, Matt M. Botvinick, Hado van Hasselt, H. Francis Song:
Synthetic Returns for Long-Term Credit Assignment. CoRR abs/2102.12425 (2021)
[i37]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-06159
Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado van Hasselt:
Muesli: Combining Improvements in Policy Optimization. CoRR abs/2104.06159 (2021)
[i36]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-06272
Matteo Hessel, Manuel Kroiss, Aidan Clark, Iurii Kemaev, John Quan, Thomas Keck, Fabio Viola, Hado van Hasselt:
Podracer architectures for scalable Reinforcement Learning. CoRR abs/2104.06272 (2021)
[i35]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-11779
Ray Jiang, Tom Zahavy, Zhongwen Xu, Adam White, Matteo Hessel, Charles Blundell, Hado van Hasselt:
Emphatic Algorithms for Deep Reinforcement Learning. CoRR abs/2106.11779 (2021)
[i34]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-05405
Ray Jiang, Shangtong Zhang, Veronica Chelu, Adam White, Hado van Hasselt:
Learning Expected Emphatic Traces for Deep RL. CoRR abs/2107.05405 (2021)
[i33]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-04504
Sebastian Flennerhag, Yannick Schroecker, Tom Zahavy, Hado van Hasselt, David Silver, Satinder Singh:
Bootstrapped Meta-Learning. CoRR abs/2109.04504 (2021)
[i32]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-10781
Louis Kirsch, Sebastian Flennerhag, Hado van Hasselt, Abram L. Friesen, Junhyuk Oh, Yutian Chen:
Introducing Symmetries to Black Box Meta Reinforcement Learning. CoRR abs/2109.10781 (2021)
[i31]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04041
Marta Garnelo, Wojciech Marian Czarnecki, Siqi Liu, Dhruva Tirumala, Junhyuk Oh, Gauthier Gidel, Hado van Hasselt, David Balduzzi:
Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity. CoRR abs/2110.04041 (2021)
[i30]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-12840
Gregory Farquhar, Kate Baumli, Zita Marinho, Angelos Filos, Matteo Hessel, Hado van Hasselt, David Silver:
Self-Consistent Models and Values. CoRR abs/2110.12840 (2021)
2020
[c33]
  persistent URL:
  - https://dblp.org/rec/conf/aistats/RowlandHHBSMD20
Mark Rowland, Anna Harutyunyan, Hado van Hasselt, Diana Borsa, Tom Schaul, Rémi Munos, Will Dabney:
Conditional Importance Sampling for Off-Policy Learning. AISTATS 2020: 45-55
[c32]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/OsbandDHASSMLSS20
Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepesvári, Satinder Singh, Benjamin Van Roy, Richard S. Sutton, David Silver, Hado van Hasselt:
Behaviour Suite for Reinforcement Learning. ICLR 2020
[c31]
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZhengOHXKHSS20
Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh:
What Can Learned Intrinsic Rewards Capture? ICML 2020: 11436-11446
[c30]
  persistent URL:
  - https://dblp.org/rec/conf/nips/CheluPH20
Veronica Chelu, Doina Precup, Hado van Hasselt:
Forethought and Hindsight in Credit Assignment. NeurIPS 2020
[c29]
  persistent URL:
  - https://dblp.org/rec/conf/nips/OhHCXHSS20
Junhyuk Oh, Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver:
Discovering Reinforcement Learning Algorithms. NeurIPS 2020
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/nips/XuHHOSS20
Zhongwen Xu, Hado Philip van Hasselt, Matteo Hessel, Junhyuk Oh, Satinder Singh, David Silver:
Meta-Gradient Reinforcement Learning with an Objective Discovered Online. NeurIPS 2020
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZahavyXVHOHSS20
Tom Zahavy, Zhongwen Xu, Vivek Veeriah, Matteo Hessel, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh:
A Self-Tuning Actor-Critic Algorithm. NeurIPS 2020
[i29]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-12928
Tom Zahavy, Zhongwen Xu, Vivek Veeriah, Matteo Hessel, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh:
Self-Tuning Deep Reinforcement Learning. CoRR abs/2002.12928 (2020)
[i28]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-01839
Hado van Hasselt, Sephora Madjiheurem, Matteo Hessel, David Silver, André Barreto, Diana Borsa:
Expected Eligibility Traces. CoRR abs/2007.01839 (2020)
[i27]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-08433
Zhongwen Xu, Hado van Hasselt, Matteo Hessel, Junhyuk Oh, Satinder Singh, David Silver:
Meta-Gradient Reinforcement Learning with an Objective Discovered Online. CoRR abs/2007.08433 (2020)
[i26]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-08794
Junhyuk Oh, Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver:
Discovering Reinforcement Learning Algorithms. CoRR abs/2007.08794 (2020)
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-13685
Veronica Chelu, Doina Precup, Hado van Hasselt:
Forethought and Hindsight in Credit Assignment. CoRR abs/2010.13685 (2020)

2010 – 2019

2019
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HesselSE0SH19
Matteo Hessel, Hubert Soyer, Lasse Espeholt, Wojciech Czarnecki, Simon Schmitt, Hado van Hasselt:
Multi-Task Deep Reinforcement Learning with PopArt. AAAI 2019: 3796-3803
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BorsaBQMHMSS19
Diana Borsa, André Barreto, John Quan, Daniel J. Mankowitz, Hado van Hasselt, Rémi Munos, David Silver, Tom Schaul:
Universal Successor Features Approximators. ICLR (Poster) 2019
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/nips/VeeriahHXRLOHSS19
Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Janarthanan Rajendran, Richard L. Lewis, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh:
Discovery of Useful Questions as Auxiliary Tasks. NeurIPS 2019: 9306-9317
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/nips/HarutyunyanDMAP19
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Gheshlaghi Azar, Bilal Piot, Nicolas Heess, Hado van Hasselt, Gregory Wayne, Satinder Singh, Doina Precup, Rémi Munos:
Hindsight Credit Assignment. NeurIPS 2019: 12467-12476
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/nips/HasseltHA19
Hado van Hasselt, Matteo Hessel, John Aslanides:
When to use parametric models in reinforcement learning? NeurIPS 2019: 14322-14333
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-03030
Pedro A. Ortega, Jane X. Wang, Mark Rowland, Tim Genewein, Zeb Kurth-Nelson, Razvan Pascanu, Nicolas Heess, Joel Veness, Alexander Pritzel, Pablo Sprechmann, Siddhant M. Jayakumar, Tom McGrath, Kevin J. Miller, Mohammad Gheshlaghi Azar, Ian Osband, Neil C. Rabinowitz, András György, Silvia Chiappa, Simon Osindero, Yee Whye Teh, Hado van Hasselt, Nando de Freitas, Matthew M. Botvinick, Shane Legg:
Meta-learning of Sequential Strategies. CoRR abs/1905.03030 (2019)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-05243
Hado van Hasselt, Matteo Hessel, John Aslanides:
When to use parametric models in reinforcement learning? CoRR abs/1906.05243 (2019)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-02908
Matteo Hessel, Hado van Hasselt, Joseph Modayil, David Silver:
On Inductive Biases in Deep Reinforcement Learning. CoRR abs/1907.02908 (2019)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-03687
Hado van Hasselt, John Quan, Matteo Hessel, Zhongwen Xu, Diana Borsa, André Barreto:
General non-linear Bellman equations. CoRR abs/1907.03687 (2019)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-03568
Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepesvári, Satinder Singh, Benjamin Van Roy, Richard S. Sutton, David Silver, Hado van Hasselt:
Behaviour Suite for Reinforcement Learning. CoRR abs/1908.03568 (2019)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-04607
Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Richard L. Lewis, Janarthanan Rajendran, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh:
Discovery of Useful Questions as Auxiliary Tasks. CoRR abs/1909.04607 (2019)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-07479
Mark Rowland, Anna Harutyunyan, Hado van Hasselt, Diana Borsa, Tom Schaul, Rémi Munos, Will Dabney:
Conditional Importance Sampling for Off-Policy Learning. CoRR abs/1910.07479 (2019)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-02503
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Gheshlaghi Azar, Bilal Piot, Nicolas Heess, Hado van Hasselt, Greg Wayne, Satinder Singh, Doina Precup, Rémi Munos:
Hindsight Credit Assignment. CoRR abs/1912.02503 (2019)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-05500
Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh:
What Can Learned Intrinsic Rewards Capture? CoRR abs/1912.05500 (2019)
2018
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HesselMHSODHPAS18
Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Gheshlaghi Azar, David Silver:
Rainbow: Combining Improvements in Deep Reinforcement Learning. AAAI 2018: 3215-3222
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HorganQBBHHS18
Dan Horgan, John Quan, David Budden, Gabriel Barth-Maron, Matteo Hessel, Hado van Hasselt, David Silver:
Distributed Prioritized Experience Replay. ICLR (Poster) 2018
[c19]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/BargiacchiVRNH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BargiacchiVRNH18
Eugenio Bargiacchi, Timothy Verstraeten, Diederik M. Roijers, Ann Nowé, Hado van Hasselt:
Learning to Coordinate with Coordination Graphs in Repeated Single-Stage Multi-Agent Decision Problems. ICML 2018: 491-499
[c18]
- view
- export record
  dblp key:
  - conf/nips/XuHS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/XuHS18
Zhongwen Xu, Hado van Hasselt, David Silver:
Meta-Gradient Reinforcement Learning. NeurIPS 2018: 2402-2413
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-08294
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-08294
Daniel J. Mankowitz, Augustin Zídek, André Barreto, Dan Horgan, Matteo Hessel, John Quan, Junhyuk Oh, Hado van Hasselt, David Silver, Tom Schaul:
Unicorn: Continual Learning with a Universal, Off-policy Agent. CoRR abs/1802.08294 (2018)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-00933
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-00933
Dan Horgan, John Quan, David Budden, Gabriel Barth-Maron, Matteo Hessel, Hado van Hasselt, David Silver:
Distributed Prioritized Experience Replay. CoRR abs/1803.00933 (2018)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1805-09801
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-09801
Zhongwen Xu, Hado van Hasselt, David Silver:
Meta-Gradient Reinforcement Learning. CoRR abs/1805.09801 (2018)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1805-11593
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-11593
Tobias Pohlen, Bilal Piot, Todd Hester, Mohammad Gheshlaghi Azar, Dan Horgan, David Budden, Gabriel Barth-Maron, Hado van Hasselt, John Quan, Mel Vecerík, Matteo Hessel, Rémi Munos, Olivier Pietquin:
Observe and Look Further: Achieving Consistent Performance on Atari. CoRR abs/1805.11593 (2018)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1809-04474
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-04474
Matteo Hessel, Hubert Soyer, Lasse Espeholt, Wojciech Czarnecki, Simon Schmitt, Hado van Hasselt:
Multi-task Deep Reinforcement Learning with PopArt. CoRR abs/1809.04474 (2018)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-07004
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-07004
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc G. Bellemare, Doina Precup:
The Barbados 2018 List of Open Issues in Continual Learning. CoRR abs/1811.07004 (2018)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1812-02648
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-02648
Hado van Hasselt, Yotam Doron, Florian Strub, Matteo Hessel, Nicolas Sonnerat, Joseph Modayil:
Deep Reinforcement Learning and the Deadly Triad. CoRR abs/1812.02648 (2018)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1812-07626
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-07626
Diana Borsa, André Barreto, John Quan, Daniel J. Mankowitz, Rémi Munos, Hado van Hasselt, David Silver, Tom Schaul:
Universal Successor Features Approximators. CoRR abs/1812.07626 (2018)
2017
[c17]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/SilverHHSGHDRRB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SilverHHSGHDRRB17
David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David P. Reichert, Neil C. Rabinowitz, André Barreto, Thomas Degris:
The Predictron: End-To-End Learning and Planning. ICML 2017: 3191-3199
[c16]
- view
- export record
  dblp key:
  - conf/nips/XuMHBSS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/XuMHBSS17
Zhongwen Xu, Joseph Modayil, Hado van Hasselt, André Barreto, David Silver, Tom Schaul:
Natural Value Approximators: Learning when to Trust Past Estimates. NIPS 2017: 2120-2128
[c15]
- view
- export record
  dblp key:
  - conf/nips/BarretoDMHSSH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BarretoDMHSSH17
André Barreto, Will Dabney, Rémi Munos, Jonathan J. Hunt, Tom Schaul, David Silver, Hado van Hasselt:
Successor Features for Transfer in Reinforcement Learning. NIPS 2017: 4055-4065
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1708-04782
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1708-04782
Oriol Vinyals, Timo Ewalds, Sergey Bartunov, Petko Georgiev, Alexander Sasha Vezhnevets, Michelle Yeo, Alireza Makhzani, Heinrich Küttler, John P. Agapiou, Julian Schrittwieser, John Quan, Stephen Gaffney, Stig Petersen, Karen Simonyan, Tom Schaul, Hado van Hasselt, David Silver, Timothy P. Lillicrap, Kevin Calderone, Paul Keet, Anthony Brunasso, David Lawrence, Anders Ekermo, Jacob Repp, Rodney Tsing:
StarCraft II: A New Challenge for Reinforcement Learning. CoRR abs/1708.04782 (2017)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1710-02298
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-02298
Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Daniel Horgan, Bilal Piot, Mohammad Gheshlaghi Azar, David Silver:
Rainbow: Combining Improvements in Deep Reinforcement Learning. CoRR abs/1710.02298 (2017)
2016
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HasseltGS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HasseltGS16
Hado van Hasselt, Arthur Guez, David Silver:
Deep Reinforcement Learning with Double Q-Learning. AAAI 2016: 2094-2100
[c13]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WangSHHLF16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WangSHHLF16
Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas:
Dueling Network Architectures for Deep Reinforcement Learning. ICML 2016: 1995-2003
[c12]
- view
- export record
  dblp key:
  - conf/nips/HasseltGHMS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HasseltGHMS16
Hado van Hasselt, Arthur Guez, Matteo Hessel, Volodymyr Mnih, David Silver:
Learning values across many orders of magnitude. NIPS 2016: 4287-4295
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/HasseltGHS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HasseltGHS16
Hado van Hasselt, Arthur Guez, Matteo Hessel, David Silver:
Learning functions across many orders of magnitudes. CoRR abs/1602.07714 (2016)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SilverHHSGHDRRB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SilverHHSGHDRRB16
David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David P. Reichert, Neil C. Rabinowitz, André Barreto, Thomas Degris:
The Predictron: End-To-End Learning and Planning. CoRR abs/1612.08810 (2016)
2015
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/HasseltS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HasseltS15
Hado van Hasselt, Richard S. Sutton:
Learning to Predict Independent of Span. CoRR abs/1508.04582 (2015)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/HasseltGS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HasseltGS15
Hado van Hasselt, Arthur Guez, David Silver:
Deep Reinforcement Learning with Double Q-learning. CoRR abs/1509.06461 (2015)
2014
[c11]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/SuttonMPH14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SuttonMPH14
Richard S. Sutton, Ashique Rupam Mahmood, Doina Precup, Hado van Hasselt:
A new Q(λ) with interim forward view and Monte Carlo equivalence. ICML 2014: 568-576
[c10]
- view
- export record
  dblp key:
  - conf/nips/MahmoodHS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MahmoodHS14
Ashique Rupam Mahmood, Hado van Hasselt, Richard S. Sutton:
Weighted importance sampling for off-policy learning with linear function approximation. NIPS 2014: 3014-3022
[c9]
- view
- export record
  dblp key:
  - conf/uai/HasseltMS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/HasseltMS14
Hado van Hasselt, Ashique Rupam Mahmood, Richard S. Sutton:
Off-policy TD(λ) with a true online equivalence. UAI 2014: 330-339
2013
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/cipls/HasseltP13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cipls/HasseltP13
Hado van Hasselt, Han La Poutré:
Stacking under uncertainty: We know how to predict, but how should we act? CIPLS 2013: 25-32
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1302-7175
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1302-7175
Hado van Hasselt:
Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average. CoRR abs/1302.7175 (2013)
2012
[p1]
- view
  authority control:
- export record
  dblp key:
  - books/sp/12/Hasselt12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/sp/12/Hasselt12
Hado van Hasselt:
Reinforcement Learning in Continuous State and Action Spaces. Reinforcement Learning 2012: 207-251
2011
[b1]
- view
  - electronic edition @ uu.nl
  - details & citations
  authority control:
- export record
  dblp key:
  - phd/basesearch/vanHasselt11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/basesearch/vanHasselt11
Hado Philip van Hasselt:
Insights in reinforcement learning: formal analysis and empirical evaluation of temporal-difference learning algorithms. Utrecht University, Netherlands, 2011
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/jmlr/SeijenWHW11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/SeijenWHW11
Harm van Seijen, Shimon Whiteson, Hado van Hasselt, Marco A. Wiering:
Exploiting Best-Match Equations for Efficient Reinforcement Learning. J. Mach. Learn. Res. 12: 2045-2094 (2011)
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/adprl/WieringHPS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/adprl/WieringHPS11
Marco A. Wiering, Hado van Hasselt, Auke-Dirk Pietersma, Lambert Schomaker:
Reinforcement learning algorithms for solving classification problems. ADPRL 2011: 91-96
2010
[c6]
- view
- export record
  dblp key:
  - conf/nips/Hasselt10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Hasselt10
Hado van Hasselt:
Double Q-learning. NIPS 2010: 2613-2621

2000 – 2009

2009
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/adprl/WieringH09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/adprl/WieringH09
Marco A. Wiering, Hado van Hasselt:
The QV family compared to other reinforcement learning algorithms. ADPRL 2009: 101-108
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/adprl/SeijenHWW09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/adprl/SeijenHWW09
Harm van Seijen, Hado van Hasselt, Shimon Whiteson, Marco A. Wiering:
A theoretical and empirical analysis of Expected Sarsa. ADPRL 2009: 177-184
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/ags/WestraHDD09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ags/WestraHDD09
Joost Westra, Hado van Hasselt, Frank Dignum, Virginia Dignum:
Adaptive Serious Games Using Agent Organizations. AGS 2009: 206-220
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/HasseltW09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/HasseltW09
Hado van Hasselt, Marco A. Wiering:
Using continuous action spaces to solve discrete problems. IJCNN 2009: 1149-1156
2008
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tsmc/WieringH08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tsmc/WieringH08
Marco A. Wiering, Hado van Hasselt:
Ensemble Algorithms in Reinforcement Learning. IEEE Trans. Syst. Man Cybern. Part B 38(4): 930-936 (2008)
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/cig/WestraHDD08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cig/WestraHDD08
Joost Westra, Hado van Hasselt, Virginia Dignum, Frank Dignum:
On-line adapting games using agent organizations. CIG 2008: 243-250
