


Tom Bewley
2020 – today
- 2025
[c10]Junqi Jiang, Tom Bewley, Saumitra Mishra, Freddy Lécué, Manuela Veloso:
Interpreting Language Reward Models via Contrastive Explanations. ICLR 2025
[c9]Anna Hedström, Salim I. Amoukou, Tom Bewley, Saumitra Mishra, Manuela Veloso:
To Steer or Not to Steer? Mechanistic Error Reduction with Abstention for Language Models. ICML 2025
[i16]Scott R. Jeen, Tom Bewley, Jonathan M. Cullen:
Zero-Shot Reinforcement Learning Under Partial Observability. CoRR abs/2506.15446 (2025)
[i15]Junqi Jiang, Tom Bewley, Salim I. Amoukou, Francesco Leofante, Antonio Rago, Saumitra Mishra, Francesca Toni:
Representation Consistency for Accurate and Coherent LLM Answer Aggregation. CoRR abs/2506.21590 (2025)
[i14]Alexander H. Liu, Andy Ehrenberg, Andy Lo, Clément Denoix, Corentin Barreau, Guillaume Lample, Jean-Malo Delignon, Khyathi Raghavi Chandu, Patrick von Platen, Pavankumar Reddy Muddireddy, Sanchit Gandhi, Soham Ghosh, Srijan Mishra, Thomas Foubert, Abhinav Rastogi, Adam Yang, Albert Q. Jiang, Alexandre Sablayrolles, Amélie Héliou, Amélie Martin, Anmol Agarwal, Antoine Roux, Arthur Darcet, Arthur Mensch, Baptiste Bout, Baptiste Rozière, Baudouin De Monicault, Chris Bamford, Christian Wallenwein, Christophe Renaudin, Clémence Lanfranchi, Darius Dabert, Devendra Singh Chaplot, Devon Mizelle, Diego de Las Casas, Elliot Chane-Sane, Emilien Fugier, Emma Bou Hanna, Gabrielle Berrada, Gauthier Delerce, Gauthier Guinet, Georgii Novikov, Guillaume Martin, Himanshu Jaju, Jan Ludziejewski, Jason Rute, Jean-Hadrien Chabran, Jessica Chudnovsky, Joachim Studnia, Joep Barmentlo, Jonas Amar, Josselin Somerville Roberts, Julien Denize, Karan Saxena, Karmesh Yadav, Kartik Khandelwal, Kush Jain, Lélio Renard Lavaud, Léonard Blier, Lingxiao Zhao, Louis Martin, Lucile Saulnier, Luyu Gao, Marie Pellat, Mathilde Guillaumin, Mathis Felardos, Matthieu Dinot, Maxime Darrin, Maximilian Augustin, Mickaël Seznec, Neha Gupta, Nikhil Raghuraman, Olivier Duchenne, Patricia Wang, Patryk Saffer, Paul Jacob, Paul Wambergue, Paula Kurylowicz, Philomène Chagniot, Pierre Stock, Pravesh Agrawal, Rémi Delacourt, Romain Sauvestre, Roman Soletskyi, Sagar Vaze, Sandeep Subramanian, Saurabh Garg, Shashwat Dalal, Siddharth Gandhi, Sumukh Aithal, Szymon Antoniak, Teven Le Scao, Thibault Schueller, Thibaut Lavril, Thomas Robert, Thomas Wang, Timothée Lacroix, Tom Bewley, Valeriia Nemychnikova, Victor Paltz:
Voxtral. CoRR abs/2507.13264 (2025)
[i13]Anna Hedström, Salim I. Amoukou, Tom Bewley, Saumitra Mishra, Manuela Veloso:
To Steer or Not to Steer? Mechanistic Error Reduction with Abstention for Language Models. CoRR abs/2510.13290 (2025)
- 2024
[c8]Tom Bewley, Salim I. Amoukou, Saumitra Mishra, Daniele Magazzeni, Manuela Veloso:
Counterfactual Metarules for Local and Global Recourse. ICML 2024
[c7]Salim I. Amoukou, Tom Bewley, Saumitra Mishra, Freddy Lécué, Daniele Magazzeni, Manuela Veloso:
Sequential Harmful Shift Detection Without Labels. NeurIPS 2024
[c6]Scott R. Jeen, Tom Bewley, Jonathan M. Cullen:
Zero-Shot Reinforcement Learning from Low Quality Data. NeurIPS 2024
[i12]Tom Bewley, Salim I. Amoukou, Saumitra Mishra, Daniele Magazzeni, Manuela Veloso:
Counterfactual Metarules for Local and Global Recourse. CoRR abs/2405.18875 (2024)
[i11]Junqi Jiang, Tom Bewley, Saumitra Mishra, Freddy Lécué, Manuela Veloso:
Interpreting Language Reward Models via Contrastive Explanations. CoRR abs/2411.16502 (2024)
[i10]Salim I. Amoukou, Tom Bewley, Saumitra Mishra, Freddy Lécué, Daniele Magazzeni, Manuela Veloso:
Sequential Harmful Shift Detection Without Labels. CoRR abs/2412.12910 (2024)
- 2023
[i9]Tom Bewley, Jonathan Lawry, Arthur Richards:
Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback. CoRR abs/2305.16924 (2023)
[i8]Scott R. Jeen, Tom Bewley, Jonathan M. Cullen:
Conservative World Models. CoRR abs/2309.15178 (2023)
- 2022
[c5]Tom Bewley, Freddy Lécué:
Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions. AAMAS 2022: 118-126
[c4]Joseph Early, Tom Bewley, Christine Evers, Sarvapali D. Ramchurn:
Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning. NeurIPS 2022
[i7]Tom Bewley, Jonathan Lawry, Arthur Richards:
Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction. CoRR abs/2201.07749 (2022)
[i6]Joseph Early, Tom Bewley, Christine Evers, Sarvapali D. Ramchurn:
Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning. CoRR abs/2205.15367 (2022)
[i5]Tom Bewley, Jonathan Lawry, Arthur Richards, Rachel Craddock, Ian Henderson:
Reward Learning with Trees: Methods and Evaluation. CoRR abs/2210.01007 (2022)
- 2021
[c3]Tom Bewley, Jonathan Lawry:
TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments. AAAI 2021: 11415-11422
[i4]Tom Bewley, Freddy Lécué:
Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions. CoRR abs/2112.11230 (2021)
- 2020
[c2]Tom Bewley, Jonathan Lawry, Arthur Richards:
Modelling Agent Policies with Interpretable Imitation Learning. TAILOR 2020: 180-186
[i3]Tom Bewley, Jonathan Lawry, Arthur Richards:
Modelling Agent Policies with Interpretable Imitation Learning. CoRR abs/2006.11309 (2020)
[i2]Tom Bewley:
Am I Building a White Box Agent or Interpreting a Black Box Agent? CoRR abs/2007.01187 (2020)
[i1]Tom Bewley, Jonathan Lawry:
TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments. CoRR abs/2009.04743 (2020)
2010 – 2019
- 2019
[c1]Tom Bewley, Minas V. Liarokapis:
On The Combination of Gamification and Crowd Computation in Industrial Automation and Robotics Applications. ICRA 2019: 1955-1961
last updated on 2025-12-09 00:36 CET by the dblp team
all metadata released as open data under CC0 1.0 license







