Остановите войну!
for scientists:
default search action
Thore Graepel
- > Home > Persons > Thore Graepel
Publications
- 2022
- [i45]Kavya Kopparapu, Edgar A. Duéñez-Guzmán, Jayd Matyas, Alexander Sasha Vezhnevets, John P. Agapiou, Kevin R. McKee, Richard Everett, Janusz Marecki, Joel Z. Leibo, Thore Graepel:
Hidden Agenda: a Social Deduction Game with Diverse Learned Equilibria. CoRR abs/2201.01816 (2022) - 2021
- [c86]Joel Z. Leibo, Edgar A. Duéñez-Guzmán, Alexander Vezhnevets, John P. Agapiou, Peter Sunehag, Raphael Koster, Jayd Matyas, Charlie Beattie, Igor Mordatch, Thore Graepel:
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot. ICML 2021: 6187-6199 - [i41]Kevin R. McKee, Edward Hughes, Tina O. Zhu, Martin J. Chadwick, Raphael Koster, Antonio García Castañeda, Charlie Beattie, Thore Graepel, Matthew M. Botvinick, Joel Z. Leibo:
Deep reinforcement learning models the emergent dynamics of human cooperation. CoRR abs/2103.04982 (2021) - [i38]Joel Z. Leibo, Edgar A. Duéñez-Guzmán, Alexander Sasha Vezhnevets, John P. Agapiou, Peter Sunehag, Raphael Koster, Jayd Matyas, Charles Beattie, Igor Mordatch, Thore Graepel:
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot. CoRR abs/2107.06857 (2021) - 2020
- [j16]Karl Tuyls, Julien Pérolat, Marc Lanctot, Edward Hughes, Richard Everett, Joel Z. Leibo, Csaba Szepesvári, Thore Graepel:
Bounds and dynamics for empirical game theoretic analysis. Auton. Agents Multi Agent Syst. 34(1): 7 (2020) - [j15]Yoram Bachrach, Richard Everett, Edward Hughes, Angeliki Lazaridou, Joel Z. Leibo, Marc Lanctot, Michael Johanson, Wojciech M. Czarnecki, Thore Graepel:
Negotiating team formation using deep reinforcement learning. Artif. Intell. 288: 103356 (2020) - [c82]David Balduzzi, Wojciech M. Czarnecki, Tom Anthony, Ian Gemp, Edward Hughes, Joel Z. Leibo, Georgios Piliouras, Thore Graepel:
Smooth markets: A basic mechanism for organizing gradient-based learners. ICLR 2020 - [i36]David Balduzzi, Wojciech M. Czarnecki, Thomas W. Anthony, Ian M. Gemp, Edward Hughes, Joel Z. Leibo, Georgios Piliouras, Thore Graepel:
Smooth markets: A basic mechanism for organizing gradient-based learners. CoRR abs/2001.04678 (2020) - [i33]Raphael Köster, Kevin R. McKee, Richard Everett, Laura Weidinger, William S. Isaac, Edward Hughes, Edgar A. Duéñez-Guzmán, Thore Graepel, Matthew M. Botvinick, Joel Z. Leibo:
Model-free conventions in multi-agent reinforcement learning with heterogeneous preferences. CoRR abs/2010.09054 (2020) - [i32]Yoram Bachrach, Richard Everett, Edward Hughes, Angeliki Lazaridou, Joel Z. Leibo, Marc Lanctot, Michael Johanson, Wojciech M. Czarnecki, Thore Graepel:
Negotiating Team Formation Using Deep Reinforcement Learning. CoRR abs/2010.10380 (2020) - [i30]Allan Dafoe, Edward Hughes, Yoram Bachrach, Tantum Collins, Kevin R. McKee, Joel Z. Leibo, Kate Larson, Thore Graepel:
Open Problems in Cooperative AI. CoRR abs/2012.08630 (2020) - 2019
- [c78]Joel Z. Leibo, Julien Pérolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar A. Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel:
Malthusian Reinforcement Learning. AAMAS 2019: 1099-1107 - [c73]Peter Sunehag, Guy Lever, Siqi Liu, Josh Merel, Nicolas Heess, Joel Z. Leibo, Edward Hughes, Tom Eccles, Thore Graepel:
Reinforcement Learning Agents acquire Flocking and Symbiotic Behaviour in Simulated Ecosystems. ALIFE 2019: 103-110 - [i27]Joel Z. Leibo, Edward Hughes, Marc Lanctot, Thore Graepel:
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research. CoRR abs/1903.00742 (2019) - 2018
- [c71]Karl Tuyls, Julien Pérolat, Marc Lanctot, Joel Z. Leibo, Thore Graepel:
A Generalised Method for Empirical Game Theoretic Analysis. AAMAS 2018: 77-85 - [c70]Peter Sunehag, Guy Lever, Audrunas Gruslys, Wojciech Marian Czarnecki, Vinícius Flores Zambaldi, Max Jaderberg, Marc Lanctot, Nicolas Sonnerat, Joel Z. Leibo, Karl Tuyls, Thore Graepel:
Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward. AAMAS 2018: 2085-2087 - [c67]Edward Hughes, Joel Z. Leibo, Matthew Phillips, Karl Tuyls, Edgar A. Duéñez-Guzmán, Antonio García Castañeda, Iain Dunning, Tina Zhu, Kevin R. McKee, Raphael Koster, Heather Roff, Thore Graepel:
Inequity aversion improves cooperation in intertemporal social dilemmas. NeurIPS 2018: 3330-3340 - [i20]Karl Tuyls, Julien Pérolat, Marc Lanctot, Joel Z. Leibo, Thore Graepel:
A Generalised Method for Empirical Game Theoretic Analysis. CoRR abs/1803.06376 (2018) - [i19]Edward Hughes, Joel Z. Leibo, Matthew G. Philips, Karl Tuyls, Edgar A. Duéñez-Guzmán, Antonio García Castañeda, Iain Dunning, Tina Zhu, Kevin R. McKee, Raphael Koster, Heather Roff, Thore Graepel:
Inequity aversion resolves intertemporal social dilemmas. CoRR abs/1803.08884 (2018) - [i16]Max Jaderberg, Wojciech M. Czarnecki, Iain Dunning, Luke Marris, Guy Lever, Antonio García Castañeda, Charles Beattie, Neil C. Rabinowitz, Ari S. Morcos, Avraham Ruderman, Nicolas Sonnerat, Tim Green, Louise Deason, Joel Z. Leibo, David Silver, Demis Hassabis, Koray Kavukcuoglu, Thore Graepel:
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning. CoRR abs/1807.01281 (2018) - [i14]Joel Z. Leibo, Julien Pérolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar A. Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel:
Malthusian Reinforcement Learning. CoRR abs/1812.07019 (2018) - 2017
- [c66]Joel Z. Leibo, Vinícius Flores Zambaldi, Marc Lanctot, Janusz Marecki, Thore Graepel:
Multi-agent Reinforcement Learning in Sequential Social Dilemmas. AAMAS 2017: 464-473 - [c65]Julien Pérolat, Joel Z. Leibo, Vinícius Flores Zambaldi, Charles Beattie, Karl Tuyls, Thore Graepel:
A multi-agent reinforcement learning model of common-pool resource appropriation. NIPS 2017: 3643-3652 - [i13]Joel Z. Leibo, Vinícius Flores Zambaldi, Marc Lanctot, Janusz Marecki, Thore Graepel:
Multi-agent Reinforcement Learning in Sequential Social Dilemmas. CoRR abs/1702.03037 (2017) - [i12]Peter Sunehag, Guy Lever, Audrunas Gruslys, Wojciech Marian Czarnecki, Vinícius Flores Zambaldi, Max Jaderberg, Marc Lanctot, Nicolas Sonnerat, Joel Z. Leibo, Karl Tuyls, Thore Graepel:
Value-Decomposition Networks For Cooperative Multi-Agent Learning. CoRR abs/1706.05296 (2017) - [i11]Julien Pérolat, Joel Z. Leibo, Vinícius Flores Zambaldi, Charles Beattie, Karl Tuyls, Thore Graepel:
A multi-agent reinforcement learning model of common-pool resource appropriation. CoRR abs/1707.06600 (2017) - [i9]Karl Tuyls, Julien Pérolat, Marc Lanctot, Georg Ostrovski, Rahul Savani, Joel Z. Leibo, Toby Ord, Thore Graepel, Shane Legg:
Symmetric Decomposition of Asymmetric Games. CoRR abs/1711.05074 (2017)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2023-09-27 22:40 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint