Thore Graepel

Joint publications with Joel Z. Leibo

Publications

2022
[i45]
persistent URL: https://dblp.org/rec/journals/corr/abs-2201-01816
Kavya Kopparapu, Edgar A. Duéñez-Guzmán, Jayd Matyas, Alexander Sasha Vezhnevets, John P. Agapiou, Kevin R. McKee, Richard Everett, Janusz Marecki, Joel Z. Leibo, Thore Graepel:
Hidden Agenda: a Social Deduction Game with Diverse Learned Equilibria. CoRR abs/2201.01816 (2022)
2021
[c86]
persistent URL: https://dblp.org/rec/conf/icml/LeiboDVASKMBMG21
Joel Z. Leibo, Edgar A. Duéñez-Guzmán, Alexander Vezhnevets, John P. Agapiou, Peter Sunehag, Raphael Koster, Jayd Matyas, Charlie Beattie, Igor Mordatch, Thore Graepel:
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot. ICML 2021: 6187-6199
[i41]
persistent URL: https://dblp.org/rec/journals/corr/abs-2103-04982
Kevin R. McKee, Edward Hughes, Tina O. Zhu, Martin J. Chadwick, Raphael Koster, Antonio García Castañeda, Charlie Beattie, Thore Graepel, Matthew M. Botvinick, Joel Z. Leibo:
Deep reinforcement learning models the emergent dynamics of human cooperation. CoRR abs/2103.04982 (2021)
[i38]
persistent URL: https://dblp.org/rec/journals/corr/abs-2107-06857
Joel Z. Leibo, Edgar A. Duéñez-Guzmán, Alexander Sasha Vezhnevets, John P. Agapiou, Peter Sunehag, Raphael Koster, Jayd Matyas, Charles Beattie, Igor Mordatch, Thore Graepel:
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot. CoRR abs/2107.06857 (2021)
2020
[j16]
persistent URL: https://dblp.org/rec/journals/aamas/TuylsPLHELSG20
Karl Tuyls, Julien Pérolat, Marc Lanctot, Edward Hughes, Richard Everett, Joel Z. Leibo, Csaba Szepesvári, Thore Graepel:
Bounds and dynamics for empirical game theoretic analysis. Auton. Agents Multi Agent Syst. 34(1): 7 (2020)
[j15]
persistent URL: https://dblp.org/rec/journals/ai/BachrachEHLLLJC20
Yoram Bachrach, Richard Everett, Edward Hughes, Angeliki Lazaridou, Joel Z. Leibo, Marc Lanctot, Michael Johanson, Wojciech M. Czarnecki, Thore Graepel:
Negotiating team formation using deep reinforcement learning. Artif. Intell. 288: 103356 (2020)
[c82]
persistent URL: https://dblp.org/rec/conf/iclr/BalduzziCAGHLPG20
David Balduzzi, Wojciech M. Czarnecki, Tom Anthony, Ian Gemp, Edward Hughes, Joel Z. Leibo, Georgios Piliouras, Thore Graepel:
Smooth markets: A basic mechanism for organizing gradient-based learners. ICLR 2020
[i36]
persistent URL: https://dblp.org/rec/journals/corr/abs-2001-04678
David Balduzzi, Wojciech M. Czarnecki, Thomas W. Anthony, Ian M. Gemp, Edward Hughes, Joel Z. Leibo, Georgios Piliouras, Thore Graepel:
Smooth markets: A basic mechanism for organizing gradient-based learners. CoRR abs/2001.04678 (2020)
[i33]
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-09054
Raphael Köster, Kevin R. McKee, Richard Everett, Laura Weidinger, William S. Isaac, Edward Hughes, Edgar A. Duéñez-Guzmán, Thore Graepel, Matthew M. Botvinick, Joel Z. Leibo:
Model-free conventions in multi-agent reinforcement learning with heterogeneous preferences. CoRR abs/2010.09054 (2020)
[i32]
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-10380
Yoram Bachrach, Richard Everett, Edward Hughes, Angeliki Lazaridou, Joel Z. Leibo, Marc Lanctot, Michael Johanson, Wojciech M. Czarnecki, Thore Graepel:
Negotiating Team Formation Using Deep Reinforcement Learning. CoRR abs/2010.10380 (2020)
[i30]
persistent URL: https://dblp.org/rec/journals/corr/abs-2012-08630
Allan Dafoe, Edward Hughes, Yoram Bachrach, Tantum Collins, Kevin R. McKee, Joel Z. Leibo, Kate Larson, Thore Graepel:
Open Problems in Cooperative AI. CoRR abs/2012.08630 (2020)
2019
[c78]
persistent URL: https://dblp.org/rec/conf/atal/LeiboPHWMDSDG19
Joel Z. Leibo, Julien Pérolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar A. Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel:
Malthusian Reinforcement Learning. AAMAS 2019: 1099-1107
[c73]
persistent URL: https://dblp.org/rec/conf/isalalife/SunehagLLMHL0EG19
Peter Sunehag, Guy Lever, Siqi Liu, Josh Merel, Nicolas Heess, Joel Z. Leibo, Edward Hughes, Tom Eccles, Thore Graepel:
Reinforcement Learning Agents acquire Flocking and Symbiotic Behaviour in Simulated Ecosystems. ALIFE 2019: 103-110
[i27]
persistent URL: https://dblp.org/rec/journals/corr/abs-1903-00742
Joel Z. Leibo, Edward Hughes, Marc Lanctot, Thore Graepel:
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research. CoRR abs/1903.00742 (2019)
2018
[c71]
persistent URL: https://dblp.org/rec/conf/atal/TuylsPLLG18
Karl Tuyls, Julien Pérolat, Marc Lanctot, Joel Z. Leibo, Thore Graepel:
A Generalised Method for Empirical Game Theoretic Analysis. AAMAS 2018: 77-85
[c70]
persistent URL: https://dblp.org/rec/conf/atal/SunehagLGCZJLSL18
Peter Sunehag, Guy Lever, Audrunas Gruslys, Wojciech Marian Czarnecki, Vinícius Flores Zambaldi, Max Jaderberg, Marc Lanctot, Nicolas Sonnerat, Joel Z. Leibo, Karl Tuyls, Thore Graepel:
Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward. AAMAS 2018: 2085-2087
[c67]
persistent URL: https://dblp.org/rec/conf/nips/HughesLPTDCDZMK18
Edward Hughes, Joel Z. Leibo, Matthew Phillips, Karl Tuyls, Edgar A. Duéñez-Guzmán, Antonio García Castañeda, Iain Dunning, Tina Zhu, Kevin R. McKee, Raphael Koster, Heather Roff, Thore Graepel:
Inequity aversion improves cooperation in intertemporal social dilemmas. NeurIPS 2018: 3330-3340
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-1803-06376
Karl Tuyls, Julien Pérolat, Marc Lanctot, Joel Z. Leibo, Thore Graepel:
A Generalised Method for Empirical Game Theoretic Analysis. CoRR abs/1803.06376 (2018)
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-1803-08884
Edward Hughes, Joel Z. Leibo, Matthew G. Philips, Karl Tuyls, Edgar A. Duéñez-Guzmán, Antonio García Castañeda, Iain Dunning, Tina Zhu, Kevin R. McKee, Raphael Koster, Heather Roff, Thore Graepel:
Inequity aversion resolves intertemporal social dilemmas. CoRR abs/1803.08884 (2018)
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-1807-01281
Max Jaderberg, Wojciech M. Czarnecki, Iain Dunning, Luke Marris, Guy Lever, Antonio García Castañeda, Charles Beattie, Neil C. Rabinowitz, Ari S. Morcos, Avraham Ruderman, Nicolas Sonnerat, Tim Green, Louise Deason, Joel Z. Leibo, David Silver, Demis Hassabis, Koray Kavukcuoglu, Thore Graepel:
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning. CoRR abs/1807.01281 (2018)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-1812-07019
Joel Z. Leibo, Julien Pérolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar A. Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel:
Malthusian Reinforcement Learning. CoRR abs/1812.07019 (2018)
2017
[c66]
persistent URL: https://dblp.org/rec/conf/atal/LeiboZLMG17
Joel Z. Leibo, Vinícius Flores Zambaldi, Marc Lanctot, Janusz Marecki, Thore Graepel:
Multi-agent Reinforcement Learning in Sequential Social Dilemmas. AAMAS 2017: 464-473
[c65]
persistent URL: https://dblp.org/rec/conf/nips/PerolatLZBTG17
Julien Pérolat, Joel Z. Leibo, Vinícius Flores Zambaldi, Charles Beattie, Karl Tuyls, Thore Graepel:
A multi-agent reinforcement learning model of common-pool resource appropriation. NIPS 2017: 3643-3652
[i13]
persistent URL: https://dblp.org/rec/journals/corr/LeiboZLMG17
Joel Z. Leibo, Vinícius Flores Zambaldi, Marc Lanctot, Janusz Marecki, Thore Graepel:
Multi-agent Reinforcement Learning in Sequential Social Dilemmas. CoRR abs/1702.03037 (2017)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/SunehagLGCZJLSL17
Peter Sunehag, Guy Lever, Audrunas Gruslys, Wojciech Marian Czarnecki, Vinícius Flores Zambaldi, Max Jaderberg, Marc Lanctot, Nicolas Sonnerat, Joel Z. Leibo, Karl Tuyls, Thore Graepel:
Value-Decomposition Networks For Cooperative Multi-Agent Learning. CoRR abs/1706.05296 (2017)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/PerolatLZBTG17
Julien Pérolat, Joel Z. Leibo, Vinícius Flores Zambaldi, Charles Beattie, Karl Tuyls, Thore Graepel:
A multi-agent reinforcement learning model of common-pool resource appropriation. CoRR abs/1707.06600 (2017)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-1711-05074
Karl Tuyls, Julien Pérolat, Marc Lanctot, Georg Ostrovski, Rahul Savani, Joel Z. Leibo, Toby Ord, Thore Graepel, Shane Legg:
Symmetric Decomposition of Asymmetric Games. CoRR abs/1711.05074 (2017)
