default search action

combined dblp search
author search
venue search
publication search

ask others

Jakob N. Foerster

Jakob Nicolaus Foerster

> Home > Persons

Person information

affiliation: University of Oxford, UK

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c94]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/GalliciFEPMFM25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GalliciFEPMFM25
Matteo Gallici, Mattie Fellows, Benjamin Ellis, Bartomeu Pou, Ivan Masmitja, Jakob Nicolaus Foerster, Mario Martin:
Simplifying Deep Temporal Difference Learning. ICLR 2025
[c93]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/GesslerDCELF25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GesslerDCELF25
Tobias Gessler, Tin Dizdarevic, Ani Calinescu, Benjamin Ellis, Andrei Lupu, Jakob Nicolaus Foerster:
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination. ICLR 2025
[c92]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/MatthewsB0F25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/MatthewsB0F25
Michael T. Matthews, Michael Beukman, Chris Lu, Jakob Nicolaus Foerster:
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks. ICLR 2025
[c91]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/MuglichFPF25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/MuglichFPF25
Darius Muglich, Johannes Forkel, Elise van der Pol, Jakob Nicolaus Foerster:
Expected Return Symmetries. ICLR 2025
[i126]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-00757
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-00757
J. Rosser, Jakob Nicolaus Foerster:
AgentBreeder: Mitigating the AI Safety Impact of Multi-Agent Scaffolds. CoRR abs/2502.00757 (2025)
[i125]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-01711
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-01711
Darius Muglich, Johannes Forkel, Elise van der Pol, Jakob N. Foerster:
Expected Return Symmetries. CoRR abs/2502.01711 (2025)
[i124]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-09172
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-09172
Peer Nagy, Sascha Frey, Kang Li, Bidipta Sarkar, Svitlana Vyetrenko, Stefan Zohren, Ani Calinescu, Jakob N. Foerster:
LOB-Bench: Benchmarking Generative AI for Finance - an Application to Limit Order Book Data. CoRR abs/2502.09172 (2025)
[i123]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-14143
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-14143
Lewis Hammond, Alan Chan, Jesse Clifton, Jason Hoelscher-Obermaier, Akbir Khan, Euan McLean, Chandler Smith, Wolfram Barfuss, Jakob N. Foerster, Tomas Gavenciak, The Anh Han, Edward Hughes, Vojtech Kovarík, Jan Kulveit, Joel Z. Leibo, Caspar Oesterheld, Christian Schröder de Witt, Nisarg Shah, Michael P. Wellman, Paolo Bova, Theodor Cimpeanu, Carson Ezell, Quentin Feuillade-Montixi, Matija Franklin, Esben Kran, Igor Krawczuk, Max Lamparth, Niklas Lauffer, Alexander Meinke, Sumeet Motwani, Anka Reuel, Vincent Conitzer, Michael Dennis, Iason Gabriel, Adam Gleave, Gillian K. Hadfield, Nika Haghtalab, Atoosa Kasirzadeh, Sébastien Krier, Kate Larson, Joel Lehman, David C. Parkes, Georgios Piliouras, Iyad Rahwan:
Multi-Agent Risks from Advanced AI. CoRR abs/2502.14143 (2025)
[i122]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-14499
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-14499
Deepak Nathani, Lovish Madaan, Nicholas Roberts, Nikolay Bashlykov, Ajay Menon, Vincent Moens, Amar Budhiraja, Despoina Magka, Vladislav Vorotilov, Gaurav Chaurasia, Dieuwke Hupkes, Ricardo Silveira Cabral, Tatiana Shavrina, Jakob N. Foerster, Yoram Bachrach, William Yang Wang, Roberta Raileanu:
MLGym: A New Framework and Benchmark for Advancing AI Research Agents. CoRR abs/2502.14499 (2025)
[i121]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-17821
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-17821
Tobias Gessler, Tin Dizdarevic, Ani Calinescu, Benjamin Ellis, Andrei Lupu, Jakob Nicolaus Foerster:
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination. CoRR abs/2503.17821 (2025)
[i120]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-08066
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-08066
Yutaro Yamada, Robert Tjarko Lange, Cong Lu, Shengran Hu, Chris Lu, Jakob N. Foerster, Jeff Clune, David Ha:
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search. CoRR abs/2504.08066 (2025)
[i119]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-11453
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-11453
Matthew Thomas Jackson, Uljad Berdica, Jarek Liesen, Shimon Whiteson, Jakob Nicolaus Foerster:
A Clean Slate for Offline Reinforcement Learning. CoRR abs/2504.11453 (2025)
2024
[j8]
- view
  - electronic edition @ umass.edu (open access)
  - details & citations
- export record
  dblp key:
  - conf/rlc/WilliOFDC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rlc/WilliOFDC24
Timon Willi, Johan Samir Obando-Ceron, Jakob Nicolaus Foerster, Gintare Karolina Dziugaite, Pablo Samuel Castro:
Mixture of Experts in a Mixture of RL settings. RLJ 3: 1072-1105 (2024)
[j7]
- view
  - electronic edition @ umass.edu (open access)
  - details & citations
- export record
  dblp key:
  - conf/rlc/JacksonMLEWF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rlc/JacksonMLEWF24
Matthew Thomas Jackson, Michael T. Matthews, Cong Lu, Benjamin Ellis, Shimon Whiteson, Jakob Nicolaus Foerster:
Policy-Guided Diffusion. RLJ 4: 1855-1872 (2024)
[j6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ploscb/JamesTFS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ploscb/JamesTFS24
Jessica James, Sebastian Towers, Jakob N. Foerster, Harrison Steel:
Optimisation strategies for directed evolution without sequencing. PLoS Comput. Biol. 20(12): 1012695 (2024)
[c90]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/FranzmeyerSA0HF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/FranzmeyerSA0HF24
Tim Franzmeyer, Aleksandar Shtedritski, Samuel Albanie, Philip Torr, João F. Henriques, Jakob N. Foerster:
HelloFresh: LLM Evalutions on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits. ACL (Findings) 2024: 12702-12716
[c89]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/Fung00WWF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/Fung00WWF24
Kitty Fung, Qizhen Zhang, Chris Lu, Jia Wan, Timon Willi, Jakob N. Foerster:
Analysing the Sample Complexity of Opponent Shaping. AAMAS 2024: 623-631
[c88]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/KhanWKT0GRF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/KhanWKT0GRF24
Akbir Khan, Timon Willi, Newton Kwan, Andrea Tacchetti, Chris Lu, Edward Grefenstette, Tim Rocktäschel, Jakob N. Foerster:
Scaling Opponent Shaping to High Dimensional Games. AAMAS 2024: 1001-1010
[c87]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/NasvytisSFFW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/NasvytisSFFW24
Linas Nasvytis, Kai Sandbrink, Jakob N. Foerster, Tim Franzmeyer, Christian Schröder de Witt:
Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection. AAMAS 2024: 1445-1453
[c86]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/RutherfordEGCLI24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/RutherfordEGCLI24
Alexander Rutherford, Benjamin Ellis, Matteo Gallici, Jonathan Cook, Andrei Lupu, Garðar Ingvarsson, Timon Willi, Akbir Khan, Christian Schröder de Witt, Alexandra Souly, Saptarashmi Bandyopadhyay, Mikayel Samvelyan, Minqi Jiang, Robert T. Lange, Shimon Whiteson, Bruno Lacerda, Nick Hawes, Tim Rocktäschel, Chris Lu, Jakob N. Foerster:
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX. AAMAS 2024: 2444-2446
[c85]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/FranzmeyerE0FH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/FranzmeyerE0FH24
Tim Franzmeyer, Edith Elkind, Philip Torr, Jakob Nicolaus Foerster, João F. Henriques:
Select to Perfect: Imitating desired behavior from large multi-agent data. ICLR 2024
[c84]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/FranzmeyerMHF0B24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/FranzmeyerMHF0B24
Tim Franzmeyer, Stephen Marcus McAleer, João F. Henriques, Jakob Nicolaus Foerster, Philip Torr, Adel Bibi, Christian Schröder de Witt:
Illusory Attacks: Information-theoretic detectability matters in adversarial attacks. ICLR 2024
[c83]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/Jackson0KLWF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Jackson0KLWF24
Matthew Thomas Jackson, Chris Lu, Louis Kirsch, Robert Tjarko Lange, Shimon Whiteson, Jakob Nicolaus Foerster:
Discovering Temporally-Aware Reinforcement Learning Algorithms. ICLR 2024
[c82]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LoSFN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LoSFN24
Yat Long Lo, Biswa Sengupta, Jakob Nicolaus Foerster, Michael Noukhovitch:
Learning Multi-Agent Communication with Contrastive Learning. ICLR 2024
[c81]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/Lupu0LLF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Lupu0LLF24
Andrei Lupu, Chris Lu, Jarek Liesen, Robert Tjarko Lange, Jakob Nicolaus Foerster:
Behaviour Distillation. ICLR 2024
[c80]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/BeukmanCMFJDF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BeukmanCMFJDF24
Michael Beukman, Samuel Coward, Michael T. Matthews, Mattie Fellows, Minqi Jiang, Michael D. Dennis, Jakob Nicolaus Foerster:
Refining Minimax Regret for Unsupervised Environment Design. ICML 2024
[c79]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/EirasPVWPEMBCSB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/EirasPVWPEMBCSB24
Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schröder de Witt, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Botos Csaba, Fabro Steibel, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Marvin Imperial, Juan A. Nolazco-Flores, Lori Landay, Matthew Thomas Jackson, Paul Röttger, Philip H. S. Torr, Trevor Darrell, Yong Suk Lee, Jakob N. Foerster:
Position: Near to Mid-term Risks and Opportunities of Open-Source Generative AI. ICML 2024
[c78]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/Jesson0GBFFG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Jesson0GBFFG24
Andrew Jesson, Chris Lu, Gunshi Gupta, Nicolas Beltran-Velez, Angelos Filos, Jakob Nicolaus Foerster, Yarin Gal:
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages. ICML 2024
[c77]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/MatthewsBESJCF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MatthewsBESJCF24
Michael T. Matthews, Michael Beukman, Benjamin Ellis, Mikayel Samvelyan, Matthew Thomas Jackson, Samuel Coward, Jakob Nicolaus Foerster:
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning. ICML 2024
[c76]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/Obando-CeronSWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Obando-CeronSWL24
Johan Samir Obando-Ceron, Ghada Sokar, Timon Willi, Clare Lyle, Jesse Farebrother, Jakob Nicolaus Foerster, Gintare Karolina Dziugaite, Doina Precup, Pablo Samuel Castro:
Mixtures of Experts Unlock Parameter Scaling for Deep RL. ICML 2024
[c75]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/SaporaS0TF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SaporaS0TF24
Silvia Sapora, Gokul Swamy, Chris Lu, Yee Whye Teh, Jakob Nicolaus Foerster:
EvIL: Evolution Strategies for Generalisable Imitation Learning. ICML 2024
[c74]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ZhangZF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZhangZF24
Ziyang Zhang, Qizhen Zhang, Jakob Nicolaus Foerster:
PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition. ICML 2024
[c73]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0001HFCFSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0001HFCFSL24
Chris Lu, Samuel Holt, Claudio Fanconi, Alex J. Chan, Jakob N. Foerster, Mihaela van der Schaar, Robert T. Lange:
Discovering Preference Optimization Algorithms with and for Large Language Models. NeurIPS 2024
[c72]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/000400LF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/000400LF24
Jonathan Cook, Chris Lu, Edward Hughes, Joel Z. Leibo, Jakob N. Foerster:
Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning. NeurIPS 2024
[c71]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/EllisJLGFWF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/EllisJLGFWF24
Benjamin Ellis, Matthew Thomas Jackson, Andrei Lupu, Alexander David Goldie, Mattie Fellows, Shimon Whiteson, Jakob N. Foerster:
Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam Timesteps. NeurIPS 2024
[c70]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Goldie0JWF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Goldie0JWF24
Alexander David Goldie, Chris Lu, Matthew Thomas Jackson, Shimon Whiteson, Jakob N. Foerster:
Can Learned Optimization Make Reinforcement Learning Less Difficult? NeurIPS 2024
[c69]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Morad0KLFP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Morad0KLFP24
Steven D. Morad, Chris Lu, Ryan Kortvelesy, Stephan Liwicki, Jakob N. Foerster, Amanda Prorok:
Recurrent Reinforcement Learning with Memoroids. NeurIPS 2024
[c68]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/RutherfordBWLHF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/RutherfordBWLHF24
Alexander Rutherford, Michael Beukman, Timon Willi, Bruno Lacerda, Nick Hawes, Jakob N. Foerster:
No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery. NeurIPS 2024
[c67]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/RutherfordEG0LI24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/RutherfordEG0LI24
Alexander Rutherford, Benjamin Ellis, Matteo Gallici, Jonathan Cook, Andrei Lupu, Garðar Ingvarsson, Timon Willi, Ravi Hammond, Akbir Khan, Christian Schröder de Witt, Alexandra Souly, Saptarashmi Bandyopadhyay, Mikayel Samvelyan, Minqi Jiang, Robert T. Lange, Shimon Whiteson, Bruno Lacerda, Nick Hawes, Tim Rocktäschel, Chris Lu, Jakob N. Foerster:
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX. NeurIPS 2024
[c66]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/TrivediKCHDCAMV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/TrivediKCHDCAMV24
Rakshit Trivedi, Akbir Khan, Jesse Clifton, Lewis Hammond, Edgar A. Duéñez-Guzmán, Dipam Chakraborty, John P. Agapiou, Jayd Matyas, Alexander Sasha Vezhnevets, Barna Pásztor, Yunke Ao, Omar G. Younis, Jiawei Huang, Benjamin Swain, Haoyuan Qin, Mian Deng, Ziwei Deng, Utku Erdoganaras, Yue Zhao, Marko Tesic, Natasha Jaques, Jakob N. Foerster, Vincent Conitzer, José Hernández-Orallo, Dylan Hadfield-Menell, Joel Z. Leibo:
Melting Pot Contest: Charting the Future of Generalized Cooperative Intelligence. NeurIPS 2024
[c65]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZhangGG0CVFBRUL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhangGG0CVFBRUL24
Qizhen (Irene) Zhang, Nikolas Gritsch, Dwaraknath Gnaneshwar, Simon Guo, David Cairuz, Bharat Venkitesh, Jakob N. Foerster, Phil Blunsom, Sebastian Ruder, Ahmet Üstün, Acyr Locatelli:
BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts. NeurIPS 2024
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/robosoft/BerdicaJVFM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/robosoft/BerdicaJVFM24
Uljad Berdica, Matthew Thomas Jackson, Niccolò Enrico Veronese, Jakob N. Foerster, Perla Maiolino:
Reinforcement Learning Controllers for Soft Robots Using Learned Environments. RoboSoft 2024: 933-939
[c63]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/uai/SokotaSWCFK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/SokotaSWCFK24
Samuel Sokota, Dylan Sam, Christian Schröder de Witt, Spencer Compton, Jakob N. Foerster, J. Zico Kolter:
Computing Low-Entropy Couplings for Large-Support Distributions. UAI 2024: 3279-3298
[i118]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-01088
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-01088
Jake Levi, Chris Lu, Timon Willi, Christian Schröder de Witt, Jakob N. Foerster:
The Danger Of Arrogance: Welfare Equilibra As A Solution To Stackelberg Self-Play In Non-Coincidental Games. CoRR abs/2402.01088 (2024)
[i117]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-05782
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-05782
Kitty Fung, Qizhen Zhang, Chris Lu, Jia Wan, Timon Willi, Jakob N. Foerster:
Analysing the Sample Complexity of Opponent Shaping. CoRR abs/2402.05782 (2024)
[i116]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-05828
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-05828
Matthew Thomas Jackson, Chris Lu, Louis Kirsch, Robert T. Lange, Shimon Whiteson, Jakob Nicolaus Foerster:
Discovering Temporally-Aware Reinforcement Learning Algorithms. CoRR abs/2402.05828 (2024)
[i115]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-08609
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-08609
Johan S. Obando-Ceron, Ghada Sokar, Timon Willi, Clare Lyle, Jesse Farebrother, Jakob N. Foerster, Gintare Karolina Dziugaite, Doina Precup, Pablo Samuel Castro:
Mixtures of Experts Unlock Parameter Scaling for Deep RL. CoRR abs/2402.08609 (2024)
[i114]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-09900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-09900
Steven D. Morad, Chris Lu, Ryan Kortvelesy, Stephan Liwicki, Jakob N. Foerster, Amanda Prorok:
Revisiting Recurrent Reinforcement Learning with Memory Monoids. CoRR abs/2402.09900 (2024)
[i113]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-12284
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-12284
Michael Beukman, Samuel Coward, Michael T. Matthews, Mattie Fellows, Minqi Jiang, Michael Dennis, Jakob N. Foerster:
Refining Minimax Regret for Unsupervised Environment Design. CoRR abs/2402.12284 (2024)
[i112]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-16801
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-16801
Michael T. Matthews, Michael Beukman, Benjamin Ellis, Mikayel Samvelyan, Matthew Thomas Jackson, Samuel Coward, Jakob N. Foerster:
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning. CoRR abs/2402.16801 (2024)
[i111]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-16822
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-16822
Mikayel Samvelyan, Sharath Chandra Raparthy, Andrei Lupu, Eric Hambro, Aram H. Markosyan, Manish Bhatt, Yuning Mao, Minqi Jiang, Jack Parker-Holder, Jakob N. Foerster, Tim Rocktäschel, Roberta Raileanu:
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts. CoRR abs/2402.16822 (2024)
[i110]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-13091
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-13091
Samuel Coward, Michael Beukman, Jakob N. Foerster:
JaxUED: A simple and useable UED library in Jax. CoRR abs/2403.13091 (2024)
[i109]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-06356
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-06356
Matthew Thomas Jackson, Michael T. Matthews, Cong Lu, Benjamin Ellis, Shimon Whiteson, Jakob N. Foerster:
Policy-Guided Diffusion. CoRR abs/2404.06356 (2024)
[i108]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-07099
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-07099
Linas Nasvytis, Kai Sandbrink, Jakob N. Foerster, Tim Franzmeyer, Christian Schröder de Witt:
Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection. CoRR abs/2404.07099 (2024)
[i107]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-09932
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-09932
Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, José Hernández-Orallo, Lewis Hammond, Eric J. Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Yoshua Bengio, Danqi Chen, Samuel Albanie, Tegan Maharaj, Jakob N. Foerster, Florian Tramèr, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger:
Foundational Challenges in Assuring Alignment and Safety of Large Language Models. CoRR abs/2404.09932 (2024)
[i106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-17047
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-17047
Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schröder de Witt, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Botos Csaba, Fabro Steibel, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Marvin Imperial, Juan A. Nolazco-Flores, Lori Landay, Matthew Thomas Jackson, Paul Röttger, Philip H. S. Torr, Trevor Darrell, Yong Suk Lee, Jakob N. Foerster:
Near to Mid-term Risks and Opportunities of Open Source Generative AI. CoRR abs/2404.17047 (2024)
[i105]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-03735
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-03735
Tim Franzmeyer, Edith Elkind, Philip Torr, Jakob N. Foerster, João F. Henriques:
Select to Perfect: Imitating desired behavior from large multi-agent data. CoRR abs/2405.03735 (2024)
[i104]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-07932
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-07932
Ziyang Zhang, Qizhen Zhang, Jakob N. Foerster:
PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition. CoRR abs/2405.07932 (2024)
[i103]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-08597
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-08597
Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schröder de Witt, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Aaron Purewal, Botos Csaba, Fabro Steibel, Fazel Keshtkar, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Marvin Imperial, Juan Arturo Nolazco, Lori Landay, Matthew Thomas Jackson, Philip H. S. Torr, Trevor Darrell, Yong Suk Lee, Jakob N. Foerster:
Risks and Opportunities of Open-Source Generative AI. CoRR abs/2405.08597 (2024)
[i102]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-19540
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-19540
Samuel Sokota, Dylan Sam, Christian Schröder de Witt, Spencer Compton, Jakob N. Foerster, J. Zico Kolter:
Computing Low-Entropy Couplings for Large-Support Distributions. CoRR abs/2405.19540 (2024)
[i101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-00392
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-00392
Jonathan Cook, Chris Lu, Edward Hughes, Joel Z. Leibo, Jakob N. Foerster:
Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning. CoRR abs/2406.00392 (2024)
[i100]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-03428
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-03428
Tim Franzmeyer, Aleksandar Shtedritski, Samuel Albanie, Philip Torr, João F. Henriques, Jakob N. Foerster:
HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits. CoRR abs/2406.03428 (2024)
[i99]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-08414
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-08414
Chris Lu, Samuel Holt, Claudio Fanconi, Alex J. Chan, Jakob N. Foerster, Mihaela van der Schaar, Robert Tjarko Lange:
Discovering Preference Optimization Algorithms with and for Large Language Models. CoRR abs/2406.08414 (2024)
[i98]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-11905
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-11905
Silvia Sapora, Gokul Swamy, Chris Lu, Yee Whye Teh, Jakob Nicolaus Foerster:
EvIL: Evolution Strategies for Generalisable Imitation Learning. CoRR abs/2406.11905 (2024)
[i97]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-12589
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-12589
Jarek Liesen, Chris Lu, Andrei Lupu, Jakob N. Foerster, Henning Sprekeler, Robert T. Lange:
Discovering Minimal Reinforcement Learning Environments. CoRR abs/2406.12589 (2024)
[i96]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-15042
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-15042
Andrei Lupu, Chris Lu, Jarek Liesen, Robert Tjarko Lange, Jakob N. Foerster:
Behaviour Distillation. CoRR abs/2406.15042 (2024)
[i95]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-18420
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-18420
Timon Willi, Johan S. Obando-Ceron, Jakob N. Foerster, Karolina Dziugaite, Pablo Samuel Castro:
Mixture of Experts in a Mixture of RL settings. CoRR abs/2406.18420 (2024)
[i94]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-04811
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-04811
Matteo Gallici, Mattie Fellows, Benjamin Ellis, Bartomeu Pou, Ivan Masmitja, Jakob Nicolaus Foerster, Mario Martin:
Simplifying Deep Temporal Difference Learning. CoRR abs/2407.04811 (2024)
[i93]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-07082
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-07082
Alexander David Goldie, Chris Lu, Matthew Thomas Jackson, Shimon Whiteson, Jakob Nicolaus Foerster:
Can Learned Optimization Make Reinforcement Learning Less Difficult? CoRR abs/2407.07082 (2024)
[i92]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-06292
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-06292
Chris Lu, Cong Lu, Robert Tjarko Lange, Jakob N. Foerster, Jeff Clune, David Ha:
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery. CoRR abs/2408.06292 (2024)
[i91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-08274
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-08274
Qizhen Zhang, Nikolas Gritsch, Dwaraknath Gnaneshwar, Simon Guo, David Cairuz, Bharat Venkitesh, Jakob N. Foerster, Phil Blunsom, Sebastian Ruder, Ahmet Üstün, Acyr Locatelli:
BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts. CoRR abs/2408.08274 (2024)
[i90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-15099
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-15099
Alexander Rutherford, Michael Beukman, Timon Willi, Bruno Lacerda, Nick Hawes, Jakob N. Foerster:
No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery. CoRR abs/2408.15099 (2024)
[i89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-00853
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-00853
Chris Lu, Michael Beukman, Michael T. Matthews, Jakob N. Foerster:
JaxLife: An Open-Ended Agentic Simulator. CoRR abs/2409.00853 (2024)
[i88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-10588
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-10588
Sebastian Towers, Aleksandra Kalisz, Philippe A. Robert, Alicia Higueruelo, Francesca V. Vianello, Ming-Han Chloe Tsai, Harrison Steel, Jakob N. Foerster:
Opponent Shaping for Antibody Development. CoRR abs/2409.10588 (2024)
[i87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-03608
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-03608
Jonathan Cook, Tim Rocktäschel, Jakob N. Foerster, Dennis Aumiller, Alex Wang:
TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation. CoRR abs/2410.03608 (2024)
[i86]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-18519
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-18519
Uljad Berdica, Matthew Thomas Jackson, Niccolò Enrico Veronese, Jakob N. Foerster, Perla Maiolino:
Reinforcement Learning Controllers for Soft Robots using Learned Environments. CoRR abs/2410.18519 (2024)
[i85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-21159
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-21159
Lize Alberts, Benjamin Ellis, Andrei Lupu, Jakob N. Foerster:
CURATe: Benchmarking Personalised Alignment of Conversational AI Assistants. CoRR abs/2410.21159 (2024)
[i84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-23208
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-23208
Michael T. Matthews, Michael Beukman, Chris Lu, Jakob N. Foerster:
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks. CoRR abs/2410.23208 (2024)
[i83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-00666
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-00666
Charlie B. Tan, Edan Toledo, Benjamin Ellis, Jakob N. Foerster, Ferenc Huszár:
Beyond the Boundaries of Proximal Policy Optimization. CoRR abs/2411.00666 (2024)
[i82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-04976
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-04976
Usman Anwar, Ashish Pandian, Jia Wan, David Krueger, Jakob N. Foerster:
Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games. CoRR abs/2411.04976 (2024)
[i81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-06568
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-06568
Carlo Alfano, Silvia Sapora, Jakob Nicolaus Foerster, Patrick Rebeschini, Yee Whye Teh:
Learning Loss Landscapes in Preference Optimization. CoRR abs/2411.06568 (2024)
[i80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-13543
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-13543
Davide Paglieri, Bartlomiej Cupial, Samuel Coward, Ulyana Piterbarg, Maciej Wolczyk, Akbir Khan, Eduardo Pignatelli, Lukasz Kucinski, Lerrel Pinto, Rob Fergus, Jakob Nicolaus Foerster, Jack Parker-Holder, Tim Rocktäschel:
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games. CoRR abs/2411.13543 (2024)
[i79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-09810
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-09810
Branton DeMoss, Silvia Sapora, Jakob N. Foerster, Nick Hawes, Ingmar Posner:
The Complexity Dynamics of Grokking. CoRR abs/2412.09810 (2024)
[i78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-17113
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-17113
Benjamin Ellis, Matthew Thomas Jackson, Andrei Lupu, Alexander David Goldie, Mattie Fellows, Shimon Whiteson, Jakob N. Foerster:
Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam Timesteps. CoRR abs/2412.17113 (2024)
2023
[j5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/LiaoDFS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/LiaoDFS23
Isaac Liao, Rumen Dangovski, Jakob Nicolaus Foerster, Marin Soljacic:
Learning to Optimize Quasi-Newton Methods. Trans. Mach. Learn. Res. 2023 (2023)
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/icaif/NagyFSLCZF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icaif/NagyFSLCZF23
Peer Nagy, Sascha Frey, Silvia Sapora, Kang Li, Anisoara Calinescu, Stefan Zohren, Jakob N. Foerster:
Generative AI for End-to-End Limit Order Book Modelling: A Token-Level Autoregressive Generative Model of Message Flow Using a Deep State Space Network. ICAIF 2023: 91-99
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/icaif/FreyLNS0ZFC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icaif/FreyLNS0ZFC23
Sascha Yves Frey, Kang Li, Peer Nagy, Silvia Sapora, Christopher Lu, Stefan Zohren, Jakob N. Foerster, Anisoara Calinescu:
JAX-LOB: A GPU-Accelerated limit order book simulator to unlock large scale reinforcement learning for trading. ICAIF 2023: 583-591
[c60]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/CuiLSH0F23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/CuiLSH0F23
Brandon Cui, Andrei Lupu, Samuel Sokota, Hengyuan Hu, David J. Wu, Jakob Nicolaus Foerster:
Adversarial Diversity in Hanabi. ICLR 2023
[c59]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LoWSFW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LoWSFW23
Yat Long Lo, Christian Schröder de Witt, Samuel Sokota, Jakob Nicolaus Foerster, Shimon Whiteson:
Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning. ICLR 2023
[c58]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/SamvelyanK0JPFR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SamvelyanK0JPFR23
Mikayel Samvelyan, Akbir Khan, Michael Dennis, Minqi Jiang, Jack Parker-Holder, Jakob Nicolaus Foerster, Roberta Raileanu, Tim Rocktäschel:
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning. ICLR 2023
[c57]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WittSKFS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WittSKFS23
Christian Schröder de Witt, Samuel Sokota, J. Zico Kolter, Jakob Nicolaus Foerster, Martin Strohmeier:
Perfectly Secure Steganography Using Minimum Entropy Coupling. ICLR 2023
[c56]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/0001WLF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/0001WLF23
Chris Lu, Timon Willi, Alistair Letcher, Jakob Nicolaus Foerster:
Adversarial Cheap Talk. ICML 2023: 22917-22941
[c55]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/MaLSKF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MaLSKF23
Mingwei Ma, Jizhou Liu, Samuel Sokota, Max Kleiman-Weiner, Jakob Nicolaus Foerster:
Learning Intuitive Policies Using Action Features. ICML 2023: 23358-23372
[c54]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0001SGPF0B23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0001SGPF0B23
Chris Lu, Yannick Schroecker, Albert Gu, Emilio Parisotto, Jakob N. Foerster, Satinder Singh, Feryal M. P. Behbahani:
Structured State Space Models for In-Context Reinforcement Learning. NeurIPS 2023
[c53]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/EllisCMSSMFW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/EllisCMSSMFW23
Benjamin Ellis, Jonathan Cook, Skander Moalla, Mikayel Samvelyan, Mingfei Sun, Anuj Mahajan, Jakob N. Foerster, Shimon Whiteson:
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning. NeurIPS 2023
[c52]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/JacksonJPV0FWF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/JacksonJPV0FWF23
Matthew Thomas Jackson, Minqi Jiang, Jack Parker-Holder, Risto Vuorio, Chris Lu, Gregory Farquhar, Shimon Whiteson, Jakob N. Foerster:
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design. NeurIPS 2023
[c51]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/OesterheldTGCF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/OesterheldTGCF23
Caspar Oesterheld, Johannes Treutlein, Roger B. Grosse, Vincent Conitzer, Jakob N. Foerster:
Similarity-based cooperative equilibrium. NeurIPS 2023
[i77]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-03376
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-03376
Mikayel Samvelyan, Akbir Khan, Michael Dennis, Minqi Jiang, Jack Parker-Holder, Jakob N. Foerster, Roberta Raileanu, Tim Rocktäschel:
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning. CoRR abs/2303.03376 (2023)
[i76]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-03982
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-03982
Chris Lu, Yannick Schroecker, Albert Gu, Emilio Parisotto, Jakob N. Foerster, Satinder Singh, Feryal M. P. Behbahani:
Structured State Space Models for In-Context Reinforcement Learning. CoRR abs/2303.03982 (2023)
[i75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-09478
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-09478
Chris Lu, Sebastian Towers, Jakob N. Foerster:
Arbitrary Order Meta-Learning with Simple Population-Based Evolution. CoRR abs/2303.09478 (2023)
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-10733
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-10733
Yat Long Lo, Christian Schröder de Witt, Samuel Sokota, Jakob Nicolaus Foerster, Shimon Whiteson:
Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning. CoRR abs/2303.10733 (2023)
[i73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-01460
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-01460
Andrew Jesson, Chris Lu, Gunshi Gupta, Angelos Filos, Jakob Nicolaus Foerster, Yarin Gal:
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages. CoRR abs/2306.01460 (2023)
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-01403
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-01403
Yat Long Lo, Biswa Sengupta, Jakob N. Foerster, Michael Noukhovitch:
Learning to Communicate using Contrastive Learning. CoRR abs/2307.01403 (2023)
[i71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-08051
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-08051
Elena Gal, Shaun Singh, Aldo Pacchiano, Ben Walker, Terry J. Lyons, Jakob N. Foerster:
Unbiased Decisions Reduce Regret: Adversarial Domain Adaptation for the Bank Loan Problem. CoRR abs/2308.08051 (2023)
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-13289
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-13289
Sascha Frey, Kang Li, Peer Nagy, Silvia Sapora, Chris Lu, Stefan Zohren, Jakob N. Foerster, Anisoara Calinescu:
JAX-LOB: A GPU-Accelerated limit order book simulator to unlock large scale reinforcement learning for trading. CoRR abs/2308.13289 (2023)
[i69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-00638
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-00638
Peer Nagy, Sascha Frey, Silvia Sapora, Kang Li, Anisoara Calinescu, Stefan Zohren, Jakob N. Foerster:
Generative AI for End-to-End Limit Order Book Modelling: A Token-Level Autoregressive Generative Model of Message Flow Using a Deep State Space Network. CoRR abs/2309.00638 (2023)
[i68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-02782
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-02782
Matthew Thomas Jackson, Minqi Jiang, Jack Parker-Holder, Risto Vuorio, Chris Lu, Gregory Farquhar, Shimon Whiteson, Jakob Nicolaus Foerster:
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design. CoRR abs/2310.02782 (2023)
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-10090
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-10090
Alexander Rutherford, Benjamin Ellis, Matteo Gallici, Jonathan Cook, Andrei Lupu, Garðar Ingvarsson, Timon Willi, Akbir Khan, Christian Schröder de Witt, Alexandra Souly, Saptarashmi Bandyopadhyay, Mikayel Samvelyan, Minqi Jiang, Robert Tjarko Lange, Shimon Whiteson, Bruno Lacerda, Nick Hawes, Tim Rocktäschel, Chris Lu, Jakob Nicolaus Foerster:
JaxMARL: Multi-Agent RL Environments in JAX. CoRR abs/2311.10090 (2023)
[i66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-12568
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-12568
Akbir Khan, Timon Willi, Newton Kwan, Andrea Tacchetti, Chris Lu, Edward Grefenstette, Tim Rocktäschel, Jakob N. Foerster:
Scaling Opponent Shaping to High Dimensional Games. CoRR abs/2312.12568 (2023)
2022
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/LorraineVPKMF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/LorraineVPKMF22
Jonathan Lorraine, Paul Vicol, Jack Parker-Holder, Tal Kachman, Luke Metz, Jakob N. Foerster:
Lyapunov Exponents for Diversity in Differentiable Games. AAMAS 2022: 842-852
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/0002LGF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/0002LGF22
Qizhen Zhang, Christopher Lu, Animesh Garg, Jakob N. Foerster:
Centralized Model and Exploration Policy for Multi-Agent RL. AAMAS 2022: 1500-1508
[c48]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/SokotaHWKFB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SokotaHWKFB22
Samuel Sokota, Hengyuan Hu, David J. Wu, J. Zico Kolter, Jakob Nicolaus Foerster, Noam Brown:
A Fine-Tuning Approach to Belief State Modeling. ICLR 2022
[c47]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KubaWF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KubaWF22
Jakub Grudzien Kuba, Christian A. Schröder de Witt, Jakob N. Foerster:
Mirror Learning: A Unifying Framework of Policy Optimisation. ICML 2022: 7825-7844
[c46]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LuWWF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LuWWF22
Christopher Lu, Timon Willi, Christian A. Schröder de Witt, Jakob N. Foerster:
Model-Free Opponent Shaping. ICML 2022: 14398-14411
[c45]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/MuglichZWWF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MuglichZWWF22
Darius Muglich, Luisa M. Zintgraf, Christian A. Schröder de Witt, Shimon Whiteson, Jakob N. Foerster:
Generalized Beliefs for Cooperative AI. ICML 2022: 16062-16082
[c44]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/Parker-HolderJ022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Parker-HolderJ022
Jack Parker-Holder, Minqi Jiang, Michael Dennis, Mikayel Samvelyan, Jakob N. Foerster, Edward Grefenstette, Tim Rocktäschel:
Evolving Curricula with Regret-Based Environment Design. ICML 2022: 17473-17498
[c43]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/SokotaWIZTSKWF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SokotaWIZTSKWF22
Samuel Sokota, Christian A. Schröder de Witt, Maximilian Igl, Luisa M. Zintgraf, Philip H. S. Torr, Martin Strohmeier, J. Zico Kolter, Shimon Whiteson, Jakob N. Foerster:
Communicating via Markov Decision Processes. ICML 2022: 20314-20328
[c42]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WilliLTF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WilliLTF22
Timon Willi, Alistair Letcher, Johannes Treutlein, Jakob N. Foerster:
COLA: Consistent Learning with Opponent-Learning Awareness. ICML 2022: 23804-23831
[c41]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0001KLMWF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0001KLMWF22
Chris Lu, Jakub Grudzien Kuba, Alistair Letcher, Luke Metz, Christian Schröder de Witt, Jakob N. Foerster:
Discovered Policy Optimisation. NeurIPS 2022
[c40]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/CuiHLSF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/CuiHLSF22
Brandon Cui, Hengyuan Hu, Andrei Lupu, Samuel Sokota, Jakob N. Foerster:
Off-Team Learning. NeurIPS 2022
[c39]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/HuS0BLCF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HuS0BLCF22
Hengyuan Hu, Samuel Sokota, David J. Wu, Anton Bakhtin, Andrei Lupu, Brandon Cui, Jakob N. Foerster:
Self-Explaining Deviations for Coordination. NeurIPS 2022
[c38]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Jiang0PLKGRF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Jiang0PLKGRF22
Minqi Jiang, Michael Dennis, Jack Parker-Holder, Andrei Lupu, Heinrich Küttler, Edward Grefenstette, Tim Rocktäschel, Jakob N. Foerster:
Grounding Aleatoric Uncertainty for Unsupervised Environment Design. NeurIPS 2022
[c37]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/KimRLFESTH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KimRLFESTH22
Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob N. Foerster, Michael Everett, Chuangchuang Sun, Gerald Tesauro, Jonathan P. How:
Influencing Long-Term Behavior in Multiagent Reinforcement Learning. NeurIPS 2022
[c36]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/MuglichWPWF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MuglichWPWF22
Darius Muglich, Christian Schröder de Witt, Elise van der Pol, Shimon Whiteson, Jakob N. Foerster:
Equivariant Networks for Zero-Shot Coordination. NeurIPS 2022
[c35]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Zhao0GF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Zhao0GF22
Stephen Zhao, Chris Lu, Roger B. Grosse, Jakob N. Foerster:
Proximal Learning With Opponent-Learning Awareness. NeurIPS 2022
[i65]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-02373
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-02373
Jakub Grudzien Kuba, Christian Schröder de Witt, Jakob N. Foerster:
Mirror Learning: A Unifying Framework of Policy Optimisation. CoRR abs/2201.02373 (2022)
[i64]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-12658
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-12658
Mingwei Ma, Jizhou Liu, Samuel Sokota, Max Kleiman-Weiner, Jakob N. Foerster:
Learning to Coordinate with Humans using Action Features. CoRR abs/2201.12658 (2022)
[i63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-01302
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-01302
Jack Parker-Holder, Minqi Jiang, Michael Dennis, Mikayel Samvelyan, Jakob N. Foerster, Edward Grefenstette, Tim Rocktäschel:
Evolving Curricula with Regret-Based Environment Design. CoRR abs/2203.01302 (2022)
[i62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-03535
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-03535
Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob N. Foerster, Michael Everett, Chuangchuang Sun, Gerald Tesauro, Jonathan P. How:
Influencing Long-Term Behavior in Multiagent Reinforcement Learning. CoRR abs/2203.03535 (2022)
[i61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-04098
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-04098
Timon Willi, Johannes Treutlein, Alistair Letcher, Jakob N. Foerster:
COLA: Consistent Learning with Opponent-Learning Awareness. CoRR abs/2203.04098 (2022)
[i60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-01447
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-01447
Christopher Lu, Timon Willi, Christian Schröder de Witt, Jakob N. Foerster:
Model-Free Opponent Shaping. CoRR abs/2205.01447 (2022)
[i59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-12765
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-12765
Darius Muglich, Luisa M. Zintgraf, Christian Schröder de Witt, Shimon Whiteson, Jakob N. Foerster:
Generalized Beliefs for Cooperative AI. CoRR abs/2206.12765 (2022)
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-05219
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-05219
Minqi Jiang, Michael Dennis, Jack Parker-Holder, Andrei Lupu, Heinrich Küttler, Edward Grefenstette, Tim Rocktäschel, Jakob N. Foerster:
Grounding Aleatoric Uncertainty in Unsupervised Environment Design. CoRR abs/2207.05219 (2022)
[i57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-07166
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-07166
Brandon Cui, Hengyuan Hu, Luis Pineda, Jakob N. Foerster:
K-level Reasoning for Zero-Shot Coordination in Hanabi. CoRR abs/2207.07166 (2022)
[i56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-10170
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-10170
Tim Franzmeyer, João F. Henriques, Jakob N. Foerster, Philip H. S. Torr, Adel Bibi, Christian Schröder de Witt:
Illusionary Attacks on Sequential Decision Makers and Countermeasures. CoRR abs/2207.10170 (2022)
[i55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-12322
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-12322
Hengyuan Hu, Samuel Sokota, David J. Wu, Anton Bakhtin, Andrei Lupu, Brandon Cui, Jakob N. Foerster:
Self-Explaining Deviations for Coordination. CoRR abs/2207.12322 (2022)
[i54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-11303
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-11303
Risto Vuorio, Jacob Beck, Shimon Whiteson, Jakob N. Foerster, Gregory Farquhar:
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients. CoRR abs/2209.11303 (2022)
[i53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-05125
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-05125
Hengyuan Hu, David J. Wu, Adam Lerer, Jakob N. Foerster, Noam Brown:
Human-AI Coordination via Human-Regularized Search and Learning. CoRR abs/2210.05125 (2022)
[i52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-05639
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-05639
Chris Lu, Jakub Grudzien Kuba, Alistair Letcher, Luke Metz, Christian Schröder de Witt, Jakob N. Foerster:
Discovered Policy Optimisation. CoRR abs/2210.05639 (2022)
[i51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-06171
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-06171
Isaac Liao, Rumen R. Dangovski, Jakob N. Foerster, Marin Soljacic:
Learning to Optimize Quasi-Newton Methods. CoRR abs/2210.06171 (2022)
[i50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-10125
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-10125
Stephen Zhao, Chris Lu, Roger Baker Grosse, Jakob Nicolaus Foerster:
Proximal Learning With Opponent-Learning Awareness. CoRR abs/2210.10125 (2022)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-12124
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-12124
Darius Muglich, Christian Schröder de Witt, Elise van der Pol, Shimon Whiteson, Jakob N. Foerster:
Equivariant Networks for Zero-Shot Coordination. CoRR abs/2210.12124 (2022)
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-14889
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-14889
Christian Schröder de Witt, Samuel Sokota, J. Zico Kolter, Jakob N. Foerster, Martin Strohmeier:
Perfectly Secure Steganography Using Minimum Entropy Coupling. CoRR abs/2210.14889 (2022)
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-16175
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-16175
Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob N. Foerster, Gerald Tesauro, Jonathan P. How:
Game-Theoretical Perspectives on Active Equilibria: A Preferred Solution Concept over Nash Equilibria. CoRR abs/2210.16175 (2022)
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-11030
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-11030
Chris Lu, Timon Willi, Alistair Letcher, Jakob N. Foerster:
Adversarial Cheap Talk. CoRR abs/2211.11030 (2022)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-14468
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-14468
Caspar Oesterheld, Johannes Treutlein, Roger B. Grosse, Vincent Conitzer, Jakob N. Foerster:
Similarity-based Cooperation. CoRR abs/2211.14468 (2022)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-07489
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-07489
Benjamin Ellis, Skander Moalla, Mikayel Samvelyan, Mingfei Sun, Anuj Mahajan, Jakob N. Foerster, Shimon Whiteson:
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2212.07489 (2022)
2021
[j4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/mlst/BeloborodovUFWL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mlst/BeloborodovUFWL21
Dmitrii Beloborodov, Alexander E. Ulanov, Jakob N. Foerster, Shimon Whiteson, A. I. Lvovsky:
Reinforcement learning enhanced quantum-inspired algorithm for combinatorial optimization. Mach. Learn. Sci. Technol. 2(2): 25009 (2021)
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/LupuHF21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/LupuHF21
Andrei Lupu, Hengyuan Hu, Jakob N. Foerster:
Trajectory Diversity for Zero-Shot Coordination. AAMAS 2021: 1593-1595
[c33]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HuLCPBF21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HuLCPBF21
Hengyuan Hu, Adam Lerer, Brandon Cui, Luis Pineda, Noam Brown, Jakob N. Foerster:
Off-Belief Learning. ICML 2021: 4369-4379
[c32]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LupuCHF21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LupuCHF21
Andrei Lupu, Brandon Cui, Hengyuan Hu, Jakob N. Foerster:
Trajectory Diversity for Zero-Shot Coordination. ICML 2021: 7204-7213
[c31]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/Treutlein0OF21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Treutlein0OF21
Johannes Treutlein, Michael Dennis, Caspar Oesterheld, Jakob N. Foerster:
A New Formalism, Method and Open Issues for Zero-Shot Coordination. ICML 2021: 10413-10423
[c30]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/JiangDPFGR21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/JiangDPFGR21
Minqi Jiang, Michael Dennis, Jack Parker-Holder, Jakob N. Foerster, Edward Grefenstette, Tim Rocktäschel:
Replay-Guided Adversarial Environment Design. NeurIPS 2021: 1884-1897
[c29]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/PacchianoSCBF21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/PacchianoSCBF21
Aldo Pacchiano, Shaun Singh, Edward Chou, Alexander C. Berg, Jakob N. Foerster:
Neural Pseudo-Label Optimism for the Bank Loan Problem. NeurIPS 2021: 6580-6593
[c28]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/CuiHPF21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/CuiHPF21
Brandon Cui, Hengyuan Hu, Luis Pineda, Jakob N. Foerster:
K-level Reasoning for Zero-Shot Coordination in Hanabi. NeurIPS 2021: 8215-8228
[i43]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-04000
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-04000
Hengyuan Hu, Adam Lerer, Brandon Cui, Luis Pineda, David J. Wu, Noam Brown, Jakob N. Foerster:
Off-Belief Learning. CoRR abs/2103.04000 (2021)
[i42]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-08067
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-08067
Kalesha Bullard, Douwe Kiela, Joelle Pineau, Jakob N. Foerster:
Quasi-Equivalence Discovery for Zero-Shot Emergent Communication. CoRR abs/2103.08067 (2021)
[i41]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-06613
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-06613
Johannes Treutlein, Michael Dennis, Caspar Oesterheld, Jakob N. Foerster:
A New Formalism, Method and Open Issues for Zero-Shot Coordination. CoRR abs/2106.06613 (2021)
[i40]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-09086
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-09086
Hengyuan Hu, Adam Lerer, Noam Brown, Jakob N. Foerster:
Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings. CoRR abs/2106.09086 (2021)
[i39]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-06434
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-06434
Qizhen Zhang, Christopher Lu, Animesh Garg, Jakob N. Foerster:
Centralized Model and Exploration Policy for Multi-Agent RL. CoRR abs/2107.06434 (2021)
[i38]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-08295
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-08295
Samuel Sokota, Christian Schröder de Witt, Maximilian Igl, Luisa M. Zintgraf, Philip H. S. Torr, Shimon Whiteson, Jakob N. Foerster:
Implicit Communication as Minimum Entropy Coupling. CoRR abs/2107.08295 (2021)
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-12460
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-12460
Danielle Rothermel, Margaret Li, Tim Rocktäschel, Jakob N. Foerster:
Don't Sweep your Learning Rate under the Rug: A Closer Look at Cross-modal Transfer of Pretrained Transformers. CoRR abs/2107.12460 (2021)
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-02439
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-02439
Minqi Jiang, Michael Dennis, Jack Parker-Holder, Jakob N. Foerster, Edward Grefenstette, Tim Rocktäschel:
Replay-Guided Adversarial Environment Design. CoRR abs/2110.02439 (2021)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-02185
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-02185
Aldo Pacchiano, Shaun Singh, Edward Chou, Alexander C. Berg, Jakob N. Foerster:
Neural Pseudo-Label Optimism for the Bank Loan Problem. CoRR abs/2112.02185 (2021)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-14570
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-14570
Jonathan Lorraine, Paul Vicol, Jack Parker-Holder, Tal Kachman, Luke Metz, Jakob N. Foerster:
Lyapunov Exponents for Diversity in Differentiable Games. CoRR abs/2112.14570 (2021)
2020
[j3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ai/BardFCBLSPDMHDM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ai/BardFCBLSPDMHDM20
Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling:
The Hanabi challenge: A new frontier for AI research. Artif. Intell. 280: 103216 (2020)
[j2]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/RashidSWFFW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/RashidSWFFW20
Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. J. Mach. Learn. Res. 21: 178:1-178:51 (2020)
[c27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/BarrettCFL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/BarrettCFL20
Thomas D. Barrett, William R. Clements, Jakob N. Foerster, A. I. Lvovsky:
Exploratory Combinatorial Optimization with Reinforcement Learning. AAAI 2020: 3243-3250
[c26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LererHFB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LererHFB20
Adam Lerer, Hengyuan Hu, Jakob N. Foerster, Noam Brown:
Improving Policies via Search in Cooperative Partially Observable Games. AAAI 2020: 7187-7194
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/Resnick0FDC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/Resnick0FDC20
Cinjon Resnick, Abhinav Gupta, Jakob N. Foerster, Andrew M. Dai, Kyunghyun Cho:
Capacity, Bandwidth, and Compositionality in Emergent Language Learning. AAMAS 2020: 1125-1133
[c24]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/HuF20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HuF20
Hengyuan Hu, Jakob N. Foerster:
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning. ICLR 2020
[c23]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/Lowe0FKP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Lowe0FKP20
Ryan Lowe, Abhinav Gupta, Jakob N. Foerster, Douwe Kiela, Joelle Pineau:
On the interaction between supervision and self-play in emergent communication. ICLR 2020
[c22]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HuLPF20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HuLPF20
Hengyuan Hu, Adam Lerer, Alex Peysakhovich, Jakob N. Foerster:
"Other-Play" for Zero-Shot Coordination. ICML 2020: 4399-4410
[c21]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Parker-HolderMR20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Parker-HolderMR20
Jack Parker-Holder, Luke Metz, Cinjon Resnick, Hengyuan Hu, Adam Lerer, Alistair Letcher, Alexander Peysakhovich, Aldo Pacchiano, Jakob N. Foerster:
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian. NeurIPS 2020
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/rep4nlp/GuptaRFDC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rep4nlp/GuptaRFDC20
Abhinav Gupta, Cinjon Resnick, Jakob N. Foerster, Andrew M. Dai, Kyunghyun Cho:
Compositionality and Capacity in Emergent Languages. RepL4NLP@ACL 2020: 34-38
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-01093
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-01093
Ryan Lowe, Abhinav Gupta, Jakob N. Foerster, Douwe Kiela, Joelle Pineau:
On the interaction between supervision and self-play in emergent communication. CoRR abs/2002.01093 (2020)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-04676
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-04676
Dmitrii Beloborodov, Alexander E. Ulanov, Jakob N. Foerster, Shimon Whiteson, A. I. Lvovsky:
Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization. CoRR abs/2002.04676 (2020)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-02979
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-02979
Hengyuan Hu, Adam Lerer, Alex Peysakhovich, Jakob N. Foerster:
"Other-Play" for Zero-Shot Coordination. CoRR abs/2003.02979 (2020)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-08839
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-08839
Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. CoRR abs/2003.08839 (2020)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-11023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-11023
Oana-Maria Camburu, Eleonora Giunchiglia, Jakob N. Foerster, Thomas Lukasiewicz, Phil Blunsom:
The Struggles of Feature-Based Explanations: Shapley Values vs. Minimal Sufficient Subsets. CoRR abs/2009.11023 (2020)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-15896
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-15896
Kalesha Bullard, Franziska Meier, Douwe Kiela, Joelle Pineau, Jakob N. Foerster:
Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations. CoRR abs/2010.15896 (2020)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-06505
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-06505
Jack Parker-Holder, Luke Metz, Cinjon Resnick, Hengyuan Hu, Adam Lerer, Alistair Letcher, Alex Peysakhovich, Aldo Pacchiano, Jakob N. Foerster:
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian. CoRR abs/2011.06505 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j1]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/LetcherBRMFTG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/LetcherBRMFTG19
Alistair Letcher, David Balduzzi, Sébastien Racanière, James Martens, Jakob N. Foerster, Karl Tuyls, Thore Graepel:
Differentiable Game Mechanics. J. Mach. Learn. Res. 20: 84:1-84:40 (2019)
[c19]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/atal/LoweFBPD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/LoweFBPD19
Ryan Lowe, Jakob N. Foerster, Y-Lan Boureau, Joelle Pineau, Yann N. Dauphin:
On the Pitfalls of Measuring Emergent Communication. AAMAS 2019: 693-701
[c18]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/atal/SamvelyanRWFNRH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/SamvelyanRWFNRH19
Mikayel Samvelyan, Tabish Rashid, Christian Schröder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob N. Foerster, Shimon Whiteson:
The StarCraft Multi-Agent Challenge. AAMAS 2019: 2186-2188
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/emnlp/GuptaLFKP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/GuptaLFKP19
Abhinav Gupta, Ryan Lowe, Jakob N. Foerster, Douwe Kiela, Joelle Pineau:
Seeded self-play for language learning. LANTERN@EMNLP-IJCNLP 2019: 62-66
[c16]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LetcherFBRW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LetcherFBRW19
Alistair Letcher, Jakob N. Foerster, David Balduzzi, Tim Rocktäschel, Shimon Whiteson:
Stable Opponent Shaping in Differentiable Games. ICLR (Poster) 2019
[c15]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/FoersterSHBDWBB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/FoersterSHBDWBB19
Jakob N. Foerster, H. Francis Song, Edward Hughes, Neil Burch, Iain Dunning, Shimon Whiteson, Matthew M. Botvinick, Michael Bowling:
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning. ICML 2019: 1942-1951
[c14]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/MaoFRAFW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MaoFRAFW19
Jingkai Mao, Jakob N. Foerster, Tim Rocktäschel, Maruan Al-Shedivat, Gregory Farquhar, Shimon Whiteson:
A Baseline for Any Order Gradient Estimation in Stochastic Computation Graphs. ICML 2019: 4343-4351
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/LuketinaNFFAGWR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/LuketinaNFFAGWR19
Jelena Luketina, Nantas Nardelli, Gregory Farquhar, Jakob N. Foerster, Jacob Andreas, Edward Grefenstette, Shimon Whiteson, Tim Rocktäschel:
A Survey of Reinforcement Learning Informed by Natural Language. IJCAI 2019: 6309-6317
[c12]
- view
- export record
  dblp key:
  - conf/nips/FarquharWF19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/FarquharWF19
Gregory Farquhar, Shimon Whiteson, Jakob N. Foerster:
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning. NeurIPS 2019: 8149-8160
[c11]
- view
- export record
  dblp key:
  - conf/nips/WittFFTBW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WittFFTBW19
Christian Schröder de Witt, Jakob N. Foerster, Gregory Farquhar, Philip H. S. Torr, Wendelin Boehmer, Shimon Whiteson:
Multi-Agent Common Knowledge Reinforcement Learning. NeurIPS 2019: 9924-9935
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-00506
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-00506
Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling:
The Hanabi Challenge: A New Frontier for AI Research. CoRR abs/1902.00506 (2019)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-04043
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-04043
Mikayel Samvelyan, Tabish Rashid, Christian Schröder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob N. Foerster, Shimon Whiteson:
The StarCraft Multi-Agent Challenge. CoRR abs/1902.04043 (2019)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-05168
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-05168
Ryan Lowe, Jakob N. Foerster, Y-Lan Boureau, Joelle Pineau, Yann N. Dauphin:
On the Pitfalls of Measuring Emergent Communication. CoRR abs/1903.05168 (2019)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-04926
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-04926
Alistair Letcher, David Balduzzi, Sébastien Racanière, James Martens, Jakob N. Foerster, Karl Tuyls, Thore Graepel:
Differentiable Game Mechanics. CoRR abs/1905.04926 (2019)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-03926
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-03926
Jelena Luketina, Nantas Nardelli, Gregory Farquhar, Jakob N. Foerster, Jacob Andreas, Edward Grefenstette, Shimon Whiteson, Tim Rocktäschel:
A Survey of Reinforcement Learning Informed by Natural Language. CoRR abs/1906.03926 (2019)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-04063
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-04063
Thomas D. Barrett, William R. Clements, Jakob N. Foerster, A. I. Lvovsky:
Exploratory Combinatorial Optimization with Reinforcement Learning. CoRR abs/1909.04063 (2019)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-10549
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-10549
Gregory Farquhar, Shimon Whiteson, Jakob N. Foerster:
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning. CoRR abs/1909.10549 (2019)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-02065
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-02065
Oana-Maria Camburu, Eleonora Giunchiglia, Jakob N. Foerster, Thomas Lukasiewicz, Phil Blunsom:
Can I Trust the Explainer? Verifying Post-hoc Explanatory Methods. CoRR abs/1910.02065 (2019)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-10537
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-10537
Reda Bahi Slaoui, William R. Clements, Jakob N. Foerster, Sébastien Toth:
Robust Domain Randomization for Reinforcement Learning. CoRR abs/1910.10537 (2019)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-11424
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-11424
Cinjon Resnick, Abhinav Gupta, Jakob N. Foerster, Andrew M. Dai, Kyunghyun Cho:
Capacity, Bandwidth, and Compositionality in Emergent Language Learning. CoRR abs/1910.11424 (2019)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-02288
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-02288
Hengyuan Hu, Jakob N. Foerster:
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning. CoRR abs/1912.02288 (2019)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-02318
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-02318
Adam Lerer, Hengyuan Hu, Jakob N. Foerster, Noam Brown:
Improving Policies via Search in Cooperative Partially Observable Games. CoRR abs/1912.02318 (2019)
2018
[b1]
- view
- export record
  dblp key:
  - phd/ethos/Foerster18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/ethos/Foerster18
Jakob N. Foerster:
Deep multi-agent reinforcement learning. University of Oxford, UK, 2018
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/FoersterFANW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/FoersterFANW18
Jakob N. Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, Shimon Whiteson:
Counterfactual Multi-Agent Policy Gradients. AAAI 2018: 2974-2982
[c9]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/aiide/ResnickEHBFTCB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aiide/ResnickEHBFTCB18
Cinjon Resnick, Wes Eldridge, David Ha, Denny Britz, Jakob N. Foerster, Julian Togelius, Kyunghyun Cho, Joan Bruna:
Pommerman: A Multi-Agent Playground. AIIDE Workshops 2018
[c8]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/atal/FoersterCAWAM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/FoersterCAWAM18
Jakob N. Foerster, Richard Y. Chen, Maruan Al-Shedivat, Shimon Whiteson, Pieter Abbeel, Igor Mordatch:
Learning with Opponent-Learning Awareness. AAMAS 2018: 122-130
[c7]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/FoersterFARXW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/FoersterFARXW18
Jakob N. Foerster, Gregory Farquhar, Maruan Al-Shedivat, Tim Rocktäschel, Eric P. Xing, Shimon Whiteson:
DiCE: The Infinitely Differentiable Monte-Carlo Estimator. ICLR (Workshop) 2018
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/BalduzziRMFTG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BalduzziRMFTG18
David Balduzzi, Sébastien Racanière, James Martens, Jakob N. Foerster, Karl Tuyls, Thore Graepel:
The Mechanics of n-Player Differentiable Games. ICML 2018: 363-372
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/FoersterFARXW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/FoersterFARXW18
Jakob N. Foerster, Gregory Farquhar, Maruan Al-Shedivat, Tim Rocktäschel, Eric P. Xing, Shimon Whiteson:
DiCE: The Infinitely Differentiable Monte Carlo Estimator. ICML 2018: 1524-1533
[c4]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/RashidSWFFW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/RashidSWFFW18
Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. ICML 2018: 4292-4301
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-05098
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-05098
Jakob N. Foerster, Gregory Farquhar, Maruan Al-Shedivat, Tim Rocktäschel, Eric P. Xing, Shimon Whiteson:
DiCE: The Infinitely Differentiable Monte-Carlo Estimator. CoRR abs/1802.05098 (2018)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-05642
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-05642
David Balduzzi, Sébastien Racanière, James Martens, Jakob N. Foerster, Karl Tuyls, Thore Graepel:
The Mechanics of n-Player Differentiable Games. CoRR abs/1802.05642 (2018)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-11485
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-11485
Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. CoRR abs/1803.11485 (2018)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1809-07124
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-07124
Cinjon Resnick, Wes Eldridge, David Ha, Denny Britz, Jakob N. Foerster, Julian Togelius, Kyunghyun Cho, Joan Bruna:
Pommerman: A Multi-Agent Playground. CoRR abs/1809.07124 (2018)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1810-11702
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-11702
Jakob N. Foerster, Christian A. Schröder de Witt, Gregory Farquhar, Philip H. S. Torr, Wendelin Boehmer, Shimon Whiteson:
Multi-Agent Common Knowledge Reinforcement Learning. CoRR abs/1810.11702 (2018)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-01458
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-01458
Jakob N. Foerster, H. Francis Song, Edward Hughes, Neil Burch, Iain Dunning, Shimon Whiteson, Matthew M. Botvinick, Michael Bowling:
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning. CoRR abs/1811.01458 (2018)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-08469
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-08469
Alistair Letcher, Jakob N. Foerster, David Balduzzi, Tim Rocktäschel, Shimon Whiteson:
Stable Opponent Shaping in Differentiable Games. CoRR abs/1811.08469 (2018)
2017
[c3]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/FoersterGSCS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/FoersterGSCS17
Jakob N. Foerster, Justin Gilmer, Jascha Sohl-Dickstein, Jan Chorowski, David Sussillo:
Input Switched Affine Networks: An RNN Architecture Designed for Interpretability. ICML 2017: 1136-1145
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/FoersterNFATKW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/FoersterNFATKW17
Jakob N. Foerster, Nantas Nardelli, Gregory Farquhar, Triantafyllos Afouras, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson:
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning. ICML 2017: 1146-1155
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/FoersterNFTKW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/FoersterNFTKW17
Jakob N. Foerster, Nantas Nardelli, Gregory Farquhar, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson:
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning. CoRR abs/1702.08887 (2017)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/FoersterFANW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/FoersterFANW17
Jakob N. Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, Shimon Whiteson:
Counterfactual Multi-Agent Policy Gradients. CoRR abs/1705.08926 (2017)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1708-06233
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1708-06233
Christoph Aymanns, Jakob N. Foerster, Co-Pierre Georg:
Fake News in Social Networks. CoRR abs/1708.06233 (2017)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1709-04326
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1709-04326
Jakob N. Foerster, Richard Y. Chen, Maruan Al-Shedivat, Shimon Whiteson, Pieter Abbeel, Igor Mordatch:
Learning with Opponent-Learning Awareness. CoRR abs/1709.04326 (2017)
2016
[c1]
- view
- export record
  dblp key:
  - conf/nips/FoersterAFW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/FoersterAFW16
Jakob N. Foerster, Yannis M. Assael, Nando de Freitas, Shimon Whiteson:
Learning to Communicate with Deep Multi-Agent Reinforcement Learning. NIPS 2016: 2137-2145
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/FoersterAFW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/FoersterAFW16
Jakob N. Foerster, Yannis M. Assael, Nando de Freitas, Shimon Whiteson:
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks. CoRR abs/1602.02672 (2016)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/FoersterAFW16a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/FoersterAFW16a
Jakob N. Foerster, Yannis M. Assael, Nando de Freitas, Shimon Whiteson:
Learning to Communicate with Deep Multi-Agent Reinforcement Learning. CoRR abs/1605.06676 (2016)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/FoersterGCSS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/FoersterGCSS16
Jakob N. Foerster, Justin Gilmer, Jan Chorowski, Jascha Sohl-Dickstein, David Sussillo:
Intelligible Language Modeling with Input Switched Affine Networks. CoRR abs/1611.09434 (2016)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.