default search action

combined dblp search
author search
venue search
publication search

ask others

Colin Raffel

Colin A. Raffel

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2026
[i85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2601-22146
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2601-22146
Ajay Patel, Colin Raffel, Chris Callison-Burch:
FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale. CoRR abs/2601.22146 (2026)
2025
[j15]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/YadavCRB25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/YadavCRB25
Prateek Yadav, Leshem Choshen, Colin Raffel, Mohit Bansal:
ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization. Trans. Mach. Learn. Res. 2025 (2025)
[j14]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/YadavRMCL0BCS25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/YadavRMCL0BCS25
Prateek Yadav, Colin Raffel, Mohammed Muqeeth, Lucas Caccia, Haokun Liu, Tianlong Chen, Mohit Bansal, Leshem Choshen, Alessandro Sordoni:
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning. Trans. Mach. Learn. Res. 2025 (2025)
[c69]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LiuKR25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LiuKR25
Fengyuan Liu, Nikhil Kandpal, Colin Raffel:
AttriBoT: A Bag of Tricks for Efficiently Approximating Leave-One-Out Context Attribution. ICLR 2025
[c68]
- view
- export record
  dblp key:
  - conf/icml/AltintasKRR25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/AltintasKRR25
Gül Sena Altintas, Devin Kwok, Colin Raffel, David Rolnick:
The Butterfly Effect: Neural Network Training Trajectories Are Highly Sensitive to Initial Conditions. ICML 2025
[c67]
- view
- export record
  dblp key:
  - conf/icml/KandpalR25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KandpalR25
Nikhil Kandpal, Colin Raffel:
Position: The Most Expensive Part of an LLM *should* be its Training Data. ICML (Position Papers) 2025
[c66]
- view
- export record
  dblp key:
  - conf/icml/LiDTR25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LiDTR25
Yu Xin Li, Felix Dangel, Derek Tam, Colin Raffel:
Fishers for Free? Approximating the Fisher Information Matrix by Recycling the Squared Gradient Accumulator. ICML 2025
[i84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-02737
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-02737
Loubna Ben Allal, Anton Lozhkov, Elie Bakouch, Gabriel Martín Blázquez, Guilherme Penedo, Lewis Tunstall, Andrés Marafioti, Hynek Kydlícek, Agustín Piqueres Lajarín, Vaibhav Srivastav, Joshua Lochner, Caleb Fahlgren, Xuan-Son Nguyen, Clémentine Fourrier, Ben Burtenshaw, Hugo Larcher, Haojun Zhao, Cyril Zakka, Mathieu Morlon, Colin Raffel, Leandro von Werra, Thomas Wolf:
SmolLM2: When Smol Goes Big - Data-Centric Training of a Small Language Model. CoRR abs/2502.02737 (2025)
[i83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-12427
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-12427
Nikhil Kandpal, Colin Raffel:
Position: The Most Expensive Part of an LLM should be its Training Data. CoRR abs/2504.12427 (2025)
[i82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-18513
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-18513
Weiwei Sun, Haokun Liu, Nikhil Kandpal, Colin Raffel, Yiming Yang:
Enhancing Training Data Attribution with Representational Optimization. CoRR abs/2505.18513 (2025)
[i81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-05209
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-05209
Nikhil Kandpal, Brian Lester, Colin Raffel, Sebastian Majstorovic, Stella Biderman, Baber Abbasi, Luca Soldaini, Enrico Shippole, A. Feder Cooper, Aviya Skowron, John Kirchenbauer, Shayne Longpre, Lintang Sutawika, Alon Albalak, Zhenlin Xu, Guilherme Penedo, Loubna Ben Allal, Elie Bakouch, John David Pressman, Honglu Fan, Dashiell Stander, Guangyu Song, Aaron Gokaslan, Tom Goldstein, Brian R. Bartoldson, Bhavya Kailkhura, Tyler Murray:
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text. CoRR abs/2506.05209 (2025)
[i80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-13234
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-13234
Devin Kwok, Gül Sena Altintas, Colin Raffel, David Rolnick:
The Butterfly Effect: Neural Network Training Trajectories Are Highly Sensitive to Initial Conditions. CoRR abs/2506.13234 (2025)
[i79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-20920
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-20920
Guilherme Penedo, Hynek Kydlícek, Vinko Sabolcec, Bettina Messmer, Negar Foroutan, Amir Hossein Kargaran, Colin Raffel, Martin Jaggi, Leandro von Werra, Thomas Wolf:
FineWeb2: One Pipeline to Scale Them All - Adapting Pre-Training Data Processing to Every Language. CoRR abs/2506.20920 (2025)
[i78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-18807
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-18807
YuXin Li, Felix Dangel, Derek Tam, Colin Raffel:
Fishers for Free? Approximating the Fisher Information Matrix by Recycling the Squared Gradient Accumulator. CoRR abs/2507.18807 (2025)
[i77]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-20757
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-20757
Gül Sena Altintas, Malikeh Ehghaghi, Brian Lester, Fengyuan Liu, Wanru Zhao, Marco Ciccone, Colin Raffel:
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior. CoRR abs/2512.20757 (2025)
[i76]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-24991
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-24991
Gyung Hyun Je, Colin Raffel:
Efficiently Estimating Data Efficiency for Language Model Fine-tuning. CoRR abs/2512.24991 (2025)
2024
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/cacm/MaasAIJMR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cacm/MaasAIJMR24
Martin Maas, David G. Andersen, Michael Isard, Mohammad Mahdi Javanmard, Kathryn S. McKinley, Colin Raffel:
Combining Machine Learning and Lifetime-Based Resource Management for Memory Allocation and Beyond. Commun. ACM 67(4): 87-96 (2024)
[j12]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/AlbalakEXLL0MHP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/AlbalakEXLL0MHP24
Alon Albalak, Yanai Elazar, Sang Michael Xie, Shayne Longpre, Nathan Lambert, Xinyi Wang, Niklas Muennighoff, Bairu Hou, Liangming Pan, Haewon Jeong, Colin Raffel, Shiyu Chang, Tatsunori Hashimoto, William Yang Wang:
A Survey on Data Selection for Language Models. Trans. Mach. Learn. Res. 2024 (2024)
[j11]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/MuqeethLR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/MuqeethLR24
Mohammed Muqeeth, Haokun Liu, Colin Raffel:
Soft Merging of Experts with Adaptive Routing. Trans. Mach. Learn. Res. 2024 (2024)
[j10]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/TamBR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/TamBR24
Derek Tam, Mohit Bansal, Colin Raffel:
Merging by Matching Models in Task Parameter Subspaces. Trans. Mach. Learn. Res. 2024 (2024)
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/PatelRC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/PatelRC24
Ajay Patel, Colin Raffel, Chris Callison-Burch:
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows. ACL (1) 2024: 3781-3799
[c64]
- view
- export record
  dblp key:
  - conf/icml/MuqeethLLR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MuqeethLLR24
Mohammed Muqeeth, Haokun Liu, Yufan Liu, Colin Raffel:
Learning to Route Among Specialized Experts for Zero-Shot Generalization. ICML 2024: 36829-36846
[c63]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/PenedoKALMRW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/PenedoKALMRW024
Guilherme Penedo, Hynek Kydlícek, Loubna Ben Allal, Anton Lozhkov, Margaret Mitchell, Colin A. Raffel, Leandro von Werra, Thomas Wolf:
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale. NeurIPS 2024
[i75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-05859
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-05859
Mohammed Muqeeth, Haokun Liu, Yufan Liu, Colin Raffel:
Learning to Route Among Specialized Experts for Zero-Shot Generalization. CoRR abs/2402.05859 (2024)
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-10379
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-10379
Ajay Patel, Colin Raffel, Chris Callison-Burch:
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows. CoRR abs/2402.10379 (2024)
[i73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-16827
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-16827
Alon Albalak, Yanai Elazar, Sang Michael Xie, Shayne Longpre, Nathan Lambert, Xinyi Wang, Niklas Muennighoff, Bairu Hou, Liangming Pan, Haewon Jeong, Colin Raffel, Shiyu Chang, Tatsunori Hashimoto, William Yang Wang:
A Survey on Data Selection for Language Models. CoRR abs/2402.16827 (2024)
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-05567
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-05567
Bowen Pan, Yikang Shen, Haokun Liu, Mayank Mishra, Gaoyuan Zhang, Aude Oliva, Colin Raffel, Rameswar Panda:
Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models. CoRR abs/2404.05567 (2024)
[i71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-17557
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-17557
Guilherme Penedo, Hynek Kydlícek, Loubna Ben Allal, Anton Lozhkov, Margaret Mitchell, Colin Raffel, Leandro von Werra, Thomas Wolf:
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale. CoRR abs/2406.17557 (2024)
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-07057
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-07057
Prateek Yadav, Colin Raffel, Mohammed Muqeeth, Lucas Caccia, Haokun Liu, Tianlong Chen, Mohit Bansal, Leshem Choshen, Alessandro Sordoni:
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning. CoRR abs/2408.07057 (2024)
[i69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-18314
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-18314
Derek Tam, Yash Kant, Brian Lester, Igor Gilitschenski, Colin Raffel:
Realistic Evaluation of Model Merging for Compositional Generalization. CoRR abs/2409.18314 (2024)
[i68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-15102
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-15102
Fengyuan Liu, Nikhil Kandpal, Colin Raffel:
AttriBoT: A Bag of Tricks for Efficiently Approximating Leave-One-Out Context Attribution. CoRR abs/2411.15102 (2024)
2023
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/cacm/Raffel23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cacm/Raffel23
Colin Raffel:
Building Machine Learning Models Like Open Source Software. Commun. ACM 66(2): 38-40 (2023)
[j8]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/RobertsCMLBANLG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/RobertsCMLBANLG23
Adam Roberts, Hyung Won Chung, Gaurav Mishra, Anselm Levskaya, James Bradbury, Daniel Andor, Sharan Narang, Brian Lester, Colin Gaffney, Afroz Mohiuddin, Curtis Hawthorne, Aitor Lewkowycz, Alex Salcianu, Marc van Zee, Jacob Austin, Sebastian Goodman, Livio Baldini Soares, Haitang Hu, Sasha Tsvyashchenko, Aakanksha Chowdhery, Jasmijn Bastings, Jannis Bulian, Xavier Garcia, Jianmo Ni, Andrew Chen, Kathleen Kenealy, Kehang Han, Michelle Casbon, Jonathan H. Clark, Stephan Lee, Dan Garrette, James Lee-Thorp, Colin Raffel, Noam Shazeer, Marvin Ritter, Maarten Bosma, Alexandre Passos, Jeremy Maitin-Shepard, Noah Fiedel, Mark Omernick, Brennan Saeta, Ryan Sepassi, Alexander Spiridonov, Joshua Newlan, Andrea Gesmundo:
Scaling Up Models and Data with t5x and seqio. J. Mach. Learn. Res. 24: 377:1-377:8 (2023)
[j7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/tacl/ChenTRBY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tacl/ChenTRBY23
Jiaao Chen, Derek Tam, Colin Raffel, Mohit Bansal, Diyi Yang:
An Empirical Survey of Data Augmentation for Limited Data Learning in NLP. Trans. Assoc. Comput. Linguistics 11: 191-211 (2023)
[j6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/tacl/TrevisoLJACCHHH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tacl/TrevisoLJACCHHH23
Marcos V. Treviso, Ji-Ung Lee, Tianchu Ji, Betty van Aken, Qingqing Cao, Manuel R. Ciosici, Michael Hassid, Kenneth Heafield, Sara Hooker, Colin Raffel, Pedro Henrique Martins, André F. T. Martins, Jessica Zosa Forde, Peter A. Milder, Edwin Simpson, Noam Slonim, Jesse Dodge, Emma Strubell, Niranjan Balasubramanian, Leon Derczynski, Iryna Gurevych, Roy Schwartz:
Efficient Methods for Natural Language Processing: A Survey. Trans. Assoc. Comput. Linguistics 11: 826-860 (2023)
[j5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/SrivastavaRRSAF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/SrivastavaRRSAF23
Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan J. Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, François Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse H. Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, José Hernández-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory W. Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, María José Ramírez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael I. Ivanitskiy, Michael Starritt, Michael Strube, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T., Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Milkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima (Shammie) Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu:
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Trans. Mach. Learn. Res. 2023 (2023)
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/BorzunovBDRBCSR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/BorzunovBDRBCSR23
Alexander Borzunov, Dmitry Baranchuk, Tim Dettmers, Maksim Riabinin, Younes Belkada, Artem Chumachenko, Pavel Samygin, Colin Raffel:
Petals: Collaborative Inference and Fine-tuning of Large Models. ACL (demo) 2023: 558-568
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/Don-YehiyaVRSC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Don-YehiyaVRSC23
Shachar Don-Yehiya, Elad Venezian, Colin Raffel, Noam Slonim, Leshem Choshen:
ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning. ACL (1) 2023: 788-806
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/TamMZKBR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/TamMZKBR23
Derek Tam, Anisha Mascarenhas, Shiyue Zhang, Sarah Kwan, Mohit Bansal, Colin Raffel:
Evaluating the Factual Consistency of Large Language Models Through News Summarization. ACL (Findings) 2023: 5220-5255
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/MuennighoffWSRB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/MuennighoffWSRB23
Niklas Muennighoff, Thomas Wang, Lintang Sutawika, Adam Roberts, Stella Biderman, Teven Le Scao, M. Saiful Bari, Sheng Shen, Zheng Xin Yong, Hailey Schoelkopf, Xiangru Tang, Dragomir Radev, Alham Fikri Aji, Khalid Almubarak, Samuel Albanie, Zaid Alyafeai, Albert Webson, Edward Raff, Colin Raffel:
Crosslingual Generalization through Multitask Finetuning. ACL (1) 2023: 15991-16111
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/GuetaVRSKC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/GuetaVRSKC23
Almog Gueta, Elad Venezian, Colin Raffel, Noam Slonim, Yoav Katz, Leshem Choshen:
Knowledge is a Region in Weight Space for Fine-tuned Language Models. EMNLP (Findings) 2023: 1350-1370
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/DengR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/DengR23
Haikang Deng, Colin Raffel:
Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model. EMNLP 2023: 11781-11791
[c56]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/PatelLRCRC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/PatelLRCRC23
Ajay Patel, Bryan Li, Mohammad Sadegh Rasooli, Noah Constant, Colin Raffel, Chris Callison-Burch:
Bidirectional Language Models Are Also Few-shot Learners. ICLR 2023
[c55]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KandpalDRWR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KandpalDRWR23
Nikhil Kandpal, Haikang Deng, Adam Roberts, Eric Wallace, Colin Raffel:
Large Language Models Struggle to Learn Long-Tail Knowledge. ICML 2023: 15696-15707
[c54]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KandpalLMMEBHLR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KandpalLMMEBHLR23
Nikhil Kandpal, Brian Lester, Mohammed Muqeeth, Anisha Mascarenhas, Monty Evans, Vishal Baskaran, Tenghao Huang, Haokun Liu, Colin Raffel:
Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models. ICML 2023: 15708-15719
[c53]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/AlbalakRW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/AlbalakRW23
Alon Albalak, Colin A. Raffel, William Yang Wang:
Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data. NeurIPS 2023
[c52]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/BorzunovRCBDBSR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BorzunovRCBDBSR23
Alexander Borzunov, Max Ryabinin, Artem Chumachenko, Dmitry Baranchuk, Tim Dettmers, Younes Belkada, Pavel Samygin, Colin A. Raffel:
Distributed Inference and Fine-tuning of Large Language Models Over The Internet. NeurIPS 2023
[c51]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/MuennighoffRBST23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MuennighoffRBST23
Niklas Muennighoff, Alexander M. Rush, Boaz Barak, Teven Le Scao, Nouamane Tazi, Aleksandra Piktus, Sampo Pyysalo, Thomas Wolf, Colin A. Raffel:
Scaling Data-Constrained Language Models. NeurIPS 2023
[c50]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YadavTCRB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YadavTCRB23
Prateek Yadav, Derek Tam, Leshem Choshen, Colin A. Raffel, Mohit Bansal:
TIES-Merging: Resolving Interference When Merging Models. NeurIPS 2023
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-00674
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-00674
Alon Albalak, Colin Raffel, William Yang Wang:
Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data. CoRR abs/2302.00674 (2023)
[i66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-04863
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-04863
Almog Gueta, Elad Venezian, Colin Raffel, Noam Slonim, Yoav Katz, Leshem Choshen:
Knowledge is a Region in Weight Space for Fine-tuned Language Models. CoRR abs/2302.04863 (2023)
[i65]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-16264
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-16264
Niklas Muennighoff, Alexander M. Rush, Boaz Barak, Teven Le Scao, Aleksandra Piktus, Nouamane Tazi, Sampo Pyysalo, Thomas Wolf, Colin Raffel:
Scaling Data-Constrained Language Models. CoRR abs/2305.16264 (2023)
[i64]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-01708
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-01708
Prateek Yadav, Derek Tam, Leshem Choshen, Colin Raffel, Mohit Bansal:
Resolving Interference When Merging Models. CoRR abs/2306.01708 (2023)
[i63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-03745
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-03745
Mohammed Muqeeth, Haokun Liu, Colin Raffel:
Soft Merging of Experts with Adaptive Routing. CoRR abs/2306.03745 (2023)
[i62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-04529
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-04529
Nikhil Kandpal, Brian Lester, Mohammed Muqeeth, Anisha Mascarenhas, Monty Evans, Vishal Baskaran, Tenghao Huang, Haokun Liu, Colin Raffel:
Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models. CoRR abs/2306.04529 (2023)
[i61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-04649
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-04649
Michael Matena, Colin Raffel:
NPEFF: Non-Negative Per-Example Fisher Factorization. CoRR abs/2310.04649 (2023)
[i60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-09520
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-09520
Haikang Deng, Colin Raffel:
Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model. CoRR abs/2310.09520 (2023)
[i59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-13171
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-13171
Prateek Yadav, Leshem Choshen, Colin Raffel, Mohit Bansal:
ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization. CoRR abs/2311.13171 (2023)
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-02406
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-02406
Alon Albalak, Liangming Pan, Colin Raffel, William Yang Wang:
Efficient Online Data Mixing For Language Model Pre-Training. CoRR abs/2312.02406 (2023)
[i57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-04339
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-04339
Derek Tam, Mohit Bansal, Colin Raffel:
Merging by Matching Models in Task Subspaces. CoRR abs/2312.04339 (2023)
[i56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-08361
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-08361
Alexander Borzunov, Max Ryabinin, Artem Chumachenko, Dmitry Baranchuk, Tim Dettmers, Younes Belkada, Pavel Samygin, Colin Raffel:
Distributed Inference and Fine-tuning of Large Language Models Over The Internet. CoRR abs/2312.08361 (2023)
2022
[j4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/tacl/XueBCANKRR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tacl/XueBCANKRR22
Linting Xue, Aditya Barua, Noah Constant, Rami Al-Rfou, Sharan Narang, Mihir Kale, Adam Roberts, Colin Raffel:
ByT5: Towards a Token-Free Future with Pre-trained Byte-to-Byte Models. Trans. Assoc. Comput. Linguistics 10: 291-306 (2022)
[j3]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/WeiTBRZBYBZMCHVLDF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/WeiTBRZBYBZMCHVLDF22
Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus:
Emergent Abilities of Large Language Models. Trans. Mach. Learn. Res. 2022 (2022)
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/YangPR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/YangPR22
Diyi Yang, Ankur P. Parikh, Colin Raffel:
Learning with Limited Text Data. ACL (tutorial) 2022: 28-31
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/BachSYWRNSKBFAD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/BachSYWRNSKBFAD22
Stephen H. Bach, Victor Sanh, Zheng Xin Yong, Albert Webson, Colin Raffel, Nihal V. Nayak, Abheesht Sharma, Taewoon Kim, M. Saiful Bari, Thibault Févry, Zaid Alyafeai, Manan Dey, Andrea Santilli, Zhiqing Sun, Srulik Ben-David, Canwen Xu, Gunjan Chhablani, Han Wang, Jason Alan Fries, Maged Saeed AlShaibani, Shanya Sharma, Urmish Thakker, Khalid Almubarak, Xiangru Tang, Dragomir R. Radev, Mike Tian-Jian Jiang, Alexander M. Rush:
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts. ACL (demo) 2022: 93-104
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ScaoWHBBBEMPPRS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ScaoWHBBBEMPPRS22
Teven Le Scao, Thomas Wang, Daniel Hesslow, Lucile Saulnier, Stas Bekman, M. Saiful Bari, Stella Biderman, Hady Elsahar, Niklas Muennighoff, Jason Phang, Ofir Press, Colin Raffel, Victor Sanh, Sheng Shen, Lintang Sutawika, Jaesung Tae, Zheng Xin Yong, Julien Launay, Iz Beltagy:
What Language Model to Train if You Have One Million GPU Hours? EMNLP (Findings) 2022: 765-782
[c46]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/SanhWRBSACSRDBX22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SanhWRBSACSRDBX22
Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Arun Raja, Manan Dey, M Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal V. Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-Jian Jiang, Han Wang, Matteo Manica, Sheng Shen, Zheng Xin Yong, Harshit Pandey, Rachel Bawden, Thomas Wang, Trishala Neeraj, Jos Rozen, Abheesht Sharma, Andrea Santilli, Thibault Févry, Jason Alan Fries, Ryan Teehan, Teven Le Scao, Stella Biderman, Leo Gao, Thomas Wolf, Alexander M. Rush:
Multitask Prompted Training Enables Zero-Shot Task Generalization. ICLR 2022
[c45]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KandpalWR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KandpalWR22
Nikhil Kandpal, Eric Wallace, Colin Raffel:
Deduplicating Training Data Mitigates Privacy Risks in Language Models. ICML 2022: 10697-10707
[c44]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WangRHSCBLR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WangRHSCBLR22
Thomas Wang, Adam Roberts, Daniel Hesslow, Teven Le Scao, Hyung Won Chung, Iz Beltagy, Julien Launay, Colin Raffel:
What Language Model Architecture and Pretraining Objective Works Best for Zero-Shot Generalization? ICML 2022: 22964-22984
[c43]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiuTMMHBR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuTMMHBR22
Haokun Liu, Derek Tam, Mohammed Muqeeth, Jay Mohta, Tenghao Huang, Mohit Bansal, Colin Raffel:
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning. NeurIPS 2022
[c42]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/MatenaR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MatenaR22
Michael Matena, Colin Raffel:
Merging Models with Fisher-Weighted Averaging. NeurIPS 2022
[c41]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/MatenaR22a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MatenaR22a
Michael Matena, Colin Raffel:
A Combinatorial Perspective on the Optimization of Shallow ReLU Networks. NeurIPS 2022
[c40]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/XuNR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/XuNR22
Zhenlin Xu, Marc Niethammer, Colin Raffel:
Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergent Language. NeurIPS 2022
[e1]
- view
- export record
  dblp key:
  - conf/tl4nlp/2022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tl4nlp/2022
Alon Albalak, Chunting Zhou, Colin Raffel, Deepak Ramachandran, Sebastian Ruder, Xuezhe Ma:
Transfer Learning for Natural Language Processing Workshop, 03 December 2022, New Orleans, Louisiana, USA. Proceedings of Machine Learning Research 203, PMLR 2022 [contents]
[i55]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-01279
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-01279
Stephen H. Bach, Victor Sanh, Zheng Xin Yong, Albert Webson, Colin Raffel, Nihal V. Nayak, Abheesht Sharma, Taewoon Kim, M. Saiful Bari, Thibault Févry, Zaid Alyafeai, Manan Dey, Andrea Santilli, Zhiqing Sun, Srulik Ben-David, Canwen Xu, Gunjan Chhablani, Han Wang, Jason Alan Fries, Maged Saeed AlShaibani, Shanya Sharma, Urmish Thakker, Khalid Almubarak, Xiangru Tang, Mike Tian-Jian Jiang, Alexander M. Rush:
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts. CoRR abs/2202.01279 (2022)
[i54]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-06539
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-06539
Nikhil Kandpal, Eric Wallace, Colin Raffel:
Deduplicating Training Data Mitigates Privacy Risks in Language Models. CoRR abs/2202.06539 (2022)
[i53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-17189
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-17189
Adam Roberts, Hyung Won Chung, Anselm Levskaya, Gaurav Mishra, James Bradbury, Daniel Andor, Sharan Narang, Brian Lester, Colin Gaffney, Afroz Mohiuddin, Curtis Hawthorne, Aitor Lewkowycz, Alex Salcianu, Marc van Zee, Jacob Austin, Sebastian Goodman, Livio Baldini Soares, Haitang Hu, Sasha Tsvyashchenko, Aakanksha Chowdhery, Jasmijn Bastings, Jannis Bulian, Xavier Garcia, Jianmo Ni, Andrew Chen, Kathleen Kenealy, Jonathan H. Clark, Stephan Lee, Dan Garrette, James Lee-Thorp, Colin Raffel, Noam Shazeer, Marvin Ritter, Maarten Bosma, Alexandre Passos, Jeremy Maitin-Shepard, Noah Fiedel, Mark Omernick, Brennan Saeta, Ryan Sepassi, Alexander Spiridonov, Joshua Newlan, Andrea Gesmundo:
Scaling Up Models and Data with t5x and seqio. CoRR abs/2203.17189 (2022)
[i52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-05832
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-05832
Thomas Wang, Adam Roberts, Daniel Hesslow, Teven Le Scao, Hyung Won Chung, Iz Beltagy, Julien Launay, Colin Raffel:
What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization? CoRR abs/2204.05832 (2022)
[i51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-05638
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-05638
Haokun Liu, Derek Tam, Mohammed Muqeeth, Jay Mohta, Tenghao Huang, Mohit Bansal, Colin Raffel:
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning. CoRR abs/2205.05638 (2022)
[i50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-04615
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-04615
Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan J. Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, François Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse H. Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, José Hernández-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory W. Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, María José Ramírez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael I. Ivanitskiy, Michael Starritt, Michael Strube, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T., Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Milkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima (Shammie) Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu:
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. CoRR abs/2206.04615 (2022)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-07682
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-07682
Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus:
Emergent Abilities of Large Language Models. CoRR abs/2206.07682 (2022)
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-00099
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-00099
Marcos V. Treviso, Tianchu Ji, Ji-Ung Lee, Betty van Aken, Qingqing Cao, Manuel R. Ciosici, Michael Hassid, Kenneth Heafield, Sara Hooker, Pedro Henrique Martins, André F. T. Martins, Peter A. Milder, Colin Raffel, Edwin Simpson, Noam Slonim, Niranjan Balasubramanian, Leon Derczynski, Roy Schwartz:
Efficient Methods for Natural Language Processing: A Survey. CoRR abs/2209.00099 (2022)
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-01188
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-01188
Alexander Borzunov, Dmitry Baranchuk, Tim Dettmers, Max Ryabinin, Younes Belkada, Artem Chumachenko, Pavel Samygin, Colin Raffel:
Petals: Collaborative Inference and Fine-tuning of Large Models. CoRR abs/2209.01188 (2022)
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-14500
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-14500
Ajay Patel, Bryan Li, Mohammad Sadegh Rasooli, Noah Constant, Colin Raffel, Chris Callison-Burch:
Bidirectional Language Models Are Also Few-shot Learners. CoRR abs/2209.14500 (2022)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-00176
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-00176
Michael Matena, Colin Raffel:
A Combinatorial Perspective on the Optimization of Shallow ReLU Networks. CoRR abs/2210.00176 (2022)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-00482
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-00482
Zhenlin Xu, Marc Niethammer, Colin Raffel:
Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergent Language. CoRR abs/2210.00482 (2022)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15424
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15424
Teven Le Scao, Thomas Wang, Daniel Hesslow, Lucile Saulnier, Stas Bekman, M. Saiful Bari, Stella Biderman, Hady Elsahar, Niklas Muennighoff, Jason Phang, Ofir Press, Colin Raffel, Victor Sanh, Sheng Shen, Lintang Sutawika, Jaesung Tae, Zheng Xin Yong, Julien Launay, Iz Beltagy:
What Language Model to Train if You Have One Million GPU Hours? CoRR abs/2210.15424 (2022)
[i42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-01786
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01786
Niklas Muennighoff, Thomas Wang, Lintang Sutawika, Adam Roberts, Stella Biderman, Teven Le Scao, M. Saiful Bari, Sheng Shen, Zheng Xin Yong, Hailey Schoelkopf, Xiangru Tang, Dragomir Radev, Alham Fikri Aji, Khalid Almubarak, Samuel Albanie, Zaid Alyafeai, Albert Webson, Edward Raff, Colin Raffel:
Crosslingual Generalization through Multitask Finetuning. CoRR abs/2211.01786 (2022)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-08411
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-08411
Nikhil Kandpal, Haikang Deng, Adam Roberts, Eric Wallace, Colin Raffel:
Large Language Models Struggle to Learn Long-Tail Knowledge. CoRR abs/2211.08411 (2022)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-08412
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-08412
Derek Tam, Anisha Mascarenhas, Shiyue Zhang, Sarah Kwan, Mohit Bansal, Colin Raffel:
Evaluating the Factual Consistency of Large Language Models Through Summarization. CoRR abs/2211.08412 (2022)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-01378
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-01378
Shachar Don-Yehiya, Elad Venezian, Colin Raffel, Noam Slonim, Yoav Katz, Leshem Choshen:
ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning. CoRR abs/2212.01378 (2022)
2021
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/TamMBSR21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/TamMBSR21
Derek Tam, Rakesh R. Menon, Mohit Bansal, Shashank Srivastava, Colin Raffel:
Improving and Simplifying Pattern Exploiting Training. EMNLP (1) 2021: 4980-4991
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/NarangCTFFMMFSL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/NarangCTFFMMFSL21
Sharan Narang, Hyung Won Chung, Yi Tay, Liam Fedus, Thibault Févry, Michael Matena, Karishma Malkan, Noah Fiedel, Noam Shazeer, Zhenzhong Lan, Yanqi Zhou, Wei Li, Nan Ding, Jake Marcus, Adam Roberts, Colin Raffel:
Do Transformer Modifications Transfer Across Implementations and Applications? EMNLP (1) 2021: 5758-5773
[c37]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/XuLYRN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/XuLYRN21
Zhenlin Xu, Deyi Liu, Junlin Yang, Colin Raffel, Marc Niethammer:
Robust and Generalizable Visual Representation Learning via Random Convolutions. ICLR 2021
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/kdd/BaiLRK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/kdd/BaiLRK21
Ching-Yuan Bai, Hsuan-Tien Lin, Colin Raffel, Wendy Chi-wen Kan:
On Training Sample Memorization: Lessons from Benchmarking Generative Modeling with a Large-scale Competition. KDD 2021: 2534-2542
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/naacl/XueCRKASBR21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/XueCRKASBR21
Linting Xue, Noah Constant, Adam Roberts, Mihir Kale, Rami Al-Rfou, Aditya Siddhant, Aditya Barua, Colin Raffel:
mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer. NAACL-HLT 2021: 483-498
[c34]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/SungNR21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SungNR21
Yi-Lin Sung, Varun Nair, Colin Raffel:
Training Neural Networks with Fixed Sparse Masks. NeurIPS 2021: 24193-24205
[c33]
- view
  - electronic edition @ usenix.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/uss/CarliniTWJHLRBS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uss/CarliniTWJHLRBS21
Nicholas Carlini, Florian Tramèr, Eric Wallace, Matthew Jagielski, Ariel Herbert-Voss, Katherine Lee, Adam Roberts, Tom B. Brown, Dawn Song, Úlfar Erlingsson, Alina Oprea, Colin Raffel:
Extracting Training Data from Large Language Models. USENIX Security Symposium 2021: 2633-2650
[i38]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2101-00133
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-00133
Sewon Min, Jordan L. Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini, Nicola De Cao, Edouard Grave, Ikuya Yamada, Sonse Shimaoka, Masatoshi Suzuki, Shumpei Miyawaki, Shun Sato, Ryo Takahashi, Jun Suzuki, Martin Fajcik, Martin Docekal, Karel Ondrej, Pavel Smrz, Hao Cheng, Yelong Shen, Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao, Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Sejr Schlichtkrull, Sonal Gupta, Yashar Mehdad, Wen-tau Yih:
NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned. CoRR abs/2101.00133 (2021)
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-11972
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-11972
Sharan Narang, Hyung Won Chung, Yi Tay, William Fedus, Thibault Févry, Michael Matena, Karishma Malkan, Noah Fiedel, Noam Shazeer, Zhenzhong Lan, Yanqi Zhou, Wei Li, Nan Ding, Jake Marcus, Adam Roberts, Colin Raffel:
Do Transformer Modifications Transfer Across Implementations and Applications? CoRR abs/2102.11972 (2021)
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-11955
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-11955
Derek Tam, Rakesh R. Menon, Mohit Bansal, Shashank Srivastava, Colin Raffel:
Improving and Simplifying Pattern Exploiting Training. CoRR abs/2103.11955 (2021)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-13626
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-13626
Linting Xue, Aditya Barua, Noah Constant, Rami Al-Rfou, Sharan Narang, Mihir Kale, Adam Roberts, Colin Raffel:
ByT5: Towards a token-free future with pre-trained byte-to-byte models. CoRR abs/2105.13626 (2021)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-03062
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-03062
Ching-Yuan Bai, Hsuan-Tien Lin, Colin Raffel, Wendy Chih-wen Kan:
On Training Sample Memorization: Lessons from Benchmarking Generative Modeling with a Large-scale Competition. CoRR abs/2106.03062 (2021)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-07499
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-07499
Jiaao Chen, Derek Tam, Colin Raffel, Mohit Bansal, Diyi Yang:
An Empirical Survey of Data Augmentation for Limited Data Learning in NLP. CoRR abs/2106.07499 (2021)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-08207
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-08207
Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Teven Le Scao, Arun Raja, Manan Dey, M. Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal V. Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-Jian Jiang, Han Wang, Matteo Manica, Sheng Shen, Zheng Xin Yong, Harshit Pandey, Rachel Bawden, Thomas Wang, Trishala Neeraj, Jos Rozen, Abheesht Sharma, Andrea Santilli, Thibault Févry, Jason Alan Fries, Ryan Teehan, Stella Biderman, Leo Gao, Tali Bers, Thomas Wolf, Alexander M. Rush:
Multitask Prompted Training Enables Zero-Shot Task Generalization. CoRR abs/2110.08207 (2021)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-09832
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-09832
Michael Matena, Colin Raffel:
Merging Models with Fisher-Weighted Averaging. CoRR abs/2111.09832 (2021)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-09839
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-09839
Yi-Lin Sung, Varun Nair, Colin Raffel:
Training Neural Networks with Fixed Sparse Masks. CoRR abs/2111.09839 (2021)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-10508
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-10508
Sabrina J. Mielke, Zaid Alyafeai, Elizabeth Salesky, Colin Raffel, Manan Dey, Matthias Gallé, Arun Raja, Chenglei Si, Wilson Y. Lee, Benoît Sagot, Samson Tan:
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP. CoRR abs/2112.10508 (2021)
2020
[j2]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/RaffelSRLNMZLL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/RaffelSRLNMZLL20
Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J. Liu:
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. J. Mach. Learn. Res. 21: 140:1-140:67 (2020)
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/asplos/MaasAIJMR20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asplos/MaasAIJMR20
Martin Maas, David G. Andersen, Michael Isard, Mohammad Mahdi Javanmard, Kathryn S. McKinley, Colin Raffel:
Learning-based Memory Allocation for C++ Server Workloads. ASPLOS 2020: 541-556
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/RobertsRS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/RobertsRS20
Adam Roberts, Colin Raffel, Noam Shazeer:
How Much Knowledge Can You Pack Into the Parameters of a Language Model? EMNLP (1) 2020: 5418-5426
[c30]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/BerthelotCCKSZR20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BerthelotCCKSZR20
David Berthelot, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Kihyuk Sohn, Han Zhang, Colin Raffel:
ReMixMatch: Semi-Supervised Learning with Distribution Matching and Augmentation Anchoring. ICLR 2020
[c29]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/QinFSRCH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/QinFSRCH20
Yao Qin, Nicholas Frosst, Sara Sabour, Colin Raffel, Garrison W. Cottrell, Geoffrey E. Hinton:
Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions. ICLR 2020
[c28]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/MinBACC0GHLPRRK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MinBACC0GHLPRRK20
Sewon Min, Jordan L. Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini, Nicola De Cao, Edouard Grave, Ikuya Yamada, Sonse Shimaoka, Masatoshi Suzuki, Shumpei Miyawaki, Shun Sato, Ryo Takahashi, Jun Suzuki, Martin Fajcik, Martin Docekal, Karel Ondrej, Pavel Smrz, Hao Cheng, Yelong Shen, Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao, Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Sejr Schlichtkrull, Sonal Gupta, Yashar Mehdad, Wen-tau Yih:
NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned. NeurIPS (Competition and Demos) 2020: 86-111
[c27]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/SinhaZGRO20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SinhaZGRO20
Samarth Sinha, Zhengli Zhao, Anirudh Goyal, Colin Raffel, Augustus Odena:
Top-k Training of GANs: Improving GAN Performance by Throwing Away Bad Samples. NeurIPS 2020
[c26]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/SohnBCZZRCKL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SohnBCZZRCKL20
Kihyuk Sohn, David Berthelot, Nicholas Carlini, Zizhao Zhang, Han Zhang, Colin Raffel, Ekin Dogus Cubuk, Alexey Kurakin, Chun-Liang Li:
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence. NeurIPS 2020
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-03653
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-03653
Ishaan Gulrajani, Colin Raffel, Luke Metz:
Towards GAN Benchmarks Which Require Generalization. CoRR abs/2001.03653 (2020)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-07685
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-07685
Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, Colin Raffel:
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence. CoRR abs/2001.07685 (2020)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-06224
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-06224
Samarth Sinha, Anirudh Goyal, Colin Raffel, Augustus Odena:
Top-K Training of GANs: Improving Generators by Making Critics Less Critical. CoRR abs/2002.06224 (2020)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-07405
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-07405
Yao Qin, Nicholas Frosst, Colin Raffel, Garrison W. Cottrell, Geoffrey E. Hinton:
Deflecting Adversarial Attacks. CoRR abs/2002.07405 (2020)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-08910
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-08910
Adam Roberts, Colin Raffel, Noam Shazeer:
How Much Knowledge Can You Pack Into the Parameters of a Language Model? CoRR abs/2002.08910 (2020)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-14546
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-14546
Sharan Narang, Colin Raffel, Katherine Lee, Adam Roberts, Noah Fiedel, Karishma Malkan:
WT5?! Training Text-to-Text Models to Explain their Predictions. CoRR abs/2004.14546 (2020)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11934
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11934
Linting Xue, Noah Constant, Adam Roberts, Mihir Kale, Rami Al-Rfou, Aditya Siddhant, Aditya Barua, Colin Raffel:
mT5: A massively multilingual pre-trained text-to-text transformer. CoRR abs/2010.11934 (2020)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-07805
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-07805
Nicholas Carlini, Florian Tramèr, Eric Wallace, Matthew Jagielski, Ariel Herbert-Voss, Katherine Lee, Adam Roberts, Tom B. Brown, Dawn Song, Úlfar Erlingsson, Alina Oprea, Colin Raffel:
Extracting Training Data from Large Language Models. CoRR abs/2012.07805 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ArivazhaganCMCY19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ArivazhaganCMCY19
Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, Chung-Cheng Chiu, Semih Yavuz, Ruoming Pang, Wei Li, Colin Raffel:
Monotonic Infinite Lookback Attention for Simultaneous Machine Translation. ACL (1) 2019: 1313-1323
[c24]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/BerthelotRRG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BerthelotRRG19
David Berthelot, Colin Raffel, Aurko Roy, Ian J. Goodfellow:
Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer. ICLR (Poster) 2019
[c23]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/GulrajaniRM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GulrajaniRM19
Ishaan Gulrajani, Colin Raffel, Luke Metz:
Towards GAN Benchmarks Which Require Generalization. ICLR (Poster) 2019
[c22]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/QinCCGR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/QinCCGR19
Yao Qin, Nicholas Carlini, Garrison W. Cottrell, Ian J. Goodfellow, Colin Raffel:
Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition. ICML 2019: 5231-5240
[c21]
- view
- export record
  dblp key:
  - conf/nips/BerthelotCGPOR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BerthelotCGPOR19
David Berthelot, Nicholas Carlini, Ian J. Goodfellow, Nicolas Papernot, Avital Oliver, Colin Raffel:
MixMatch: A Holistic Approach to Semi-Supervised Learning. NeurIPS 2019: 5050-5060
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-08295
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-08295
Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia Xu Chen, Ye Jia, Anjuli Kannan, Tara N. Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George F. Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel Bacchiani, Thomas B. Jablin, Robert Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon:
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling. CoRR abs/1902.08295 (2019)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-10346
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-10346
Yao Qin, Nicholas Carlini, Ian J. Goodfellow, Garrison W. Cottrell, Colin Raffel:
Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition. CoRR abs/1903.10346 (2019)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-02249
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-02249
David Berthelot, Nicholas Carlini, Ian J. Goodfellow, Nicolas Papernot, Avital Oliver, Colin Raffel:
MixMatch: A Holistic Approach to Semi-Supervised Learning. CoRR abs/1905.02249 (2019)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-05218
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-05218
Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, Chung-Cheng Chiu, Semih Yavuz, Ruoming Pang, Wei Li, Colin Raffel:
Monotonic Infinite Lookback Attention for Simultaneous Machine Translation. CoRR abs/1906.05218 (2019)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-02957
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-02957
Yao Qin, Nicholas Frosst, Sara Sabour, Colin Raffel, Garrison W. Cottrell, Geoffrey E. Hinton:
Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions. CoRR abs/1907.02957 (2019)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-10683
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-10683
Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J. Liu:
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. CoRR abs/1910.10683 (2019)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-09785
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-09785
David Berthelot, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Kihyuk Sohn, Han Zhang, Colin Raffel:
ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring. CoRR abs/1911.09785 (2019)
2018
[j1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/jossw/PriceCEMORY18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jossw/PriceCEMORY18
Danny C. Price, Sebastien Celles, Pieter T. Eendebak, Michael M. McKerns, Eben M. Olson, Colin Raffel, Bairen Yi:
Hickle: A HDF5-based python pickle replacement. J. Open Source Softw. 3(32): 1115 (2018)
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LawsonCTRSJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LawsonCTRSJ18
Dieterich Lawson, Chung-Cheng Chiu, George Tucker, Colin Raffel, Kevin Swersky, Navdeep Jaitly:
Learning Hard Alignments with Variational Inference. ICASSP 2018: 5799-5803
[c19]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/BuckmanRRG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BuckmanRRG18
Jacob Buckman, Aurko Roy, Colin Raffel, Ian J. Goodfellow:
Thermometer Encoding: One Hot Way To Resist Adversarial Examples. ICLR (Poster) 2018
[c18]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ChiuR18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChiuR18
Chung-Cheng Chiu, Colin Raffel:
Monotonic Chunkwise Attention. ICLR (Poster) 2018
[c17]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/OliverORCG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/OliverORCG18
Avital Oliver, Augustus Odena, Colin Raffel, Ekin D. Cubuk, Ian J. Goodfellow:
Realistic Evaluation of Semi-Supervised Learning Algorithms. ICLR (Workshop) 2018
[c16]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/OdenaBOBORG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/OdenaBOBORG18
Augustus Odena, Jacob Buckman, Catherine Olsson, Tom B. Brown, Christopher Olah, Colin Raffel, Ian J. Goodfellow:
Is Generator Conditioning Causally Related to GAN Performance? ICML 2018: 3846-3855
[c15]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/RobertsERHE18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/RobertsERHE18
Adam Roberts, Jesse H. Engel, Colin Raffel, Curtis Hawthorne, Douglas Eck:
A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music. ICML 2018: 4361-4370
[c14]
- view
  - electronic edition @ ircam.fr (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/HawthorneESRSRE18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/HawthorneESRSRE18
Curtis Hawthorne, Erich Elsen, Jialin Song, Adam Roberts, Ian Simon, Colin Raffel, Jesse H. Engel, Sageev Oore, Douglas Eck:
Onsets and Frames: Dual-Objective Piano Transcription. ISMIR 2018: 50-57
[c13]
- view
- export record
  dblp key:
  - conf/nips/OliverORCG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/OliverORCG18
Avital Oliver, Augustus Odena, Colin Raffel, Ekin Dogus Cubuk, Ian J. Goodfellow:
Realistic Evaluation of Deep Semi-Supervised Learning Algorithms. NeurIPS 2018: 3239-3250
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-08768
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-08768
Augustus Odena, Jacob Buckman, Catherine Olsson, Tom B. Brown, Christopher Olah, Colin Raffel, Ian J. Goodfellow:
Is Generator Conditioning Causally Related to GAN Performance? CoRR abs/1802.08768 (2018)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-05428
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-05428
Adam Roberts, Jesse H. Engel, Colin Raffel, Curtis Hawthorne, Douglas Eck:
A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music. CoRR abs/1803.05428 (2018)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-09170
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-09170
Avital Oliver, Augustus Odena, Colin Raffel, Ekin D. Cubuk, Ian J. Goodfellow:
Realistic Evaluation of Deep Semi-Supervised Learning Algorithms. CoRR abs/1804.09170 (2018)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1806-00195
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-00195
Ian Simon, Adam Roberts, Colin Raffel, Jesse H. Engel, Curtis Hawthorne, Douglas Eck:
Learning a Latent Space of Multitrack Measures. CoRR abs/1806.00195 (2018)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-07543
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-07543
David Berthelot, Colin Raffel, Aurko Roy, Ian J. Goodfellow:
Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer. CoRR abs/1807.07543 (2018)
2017
[c12]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/GilmerRSRS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GilmerRSRS17
Justin Gilmer, Colin Raffel, Samuel S. Schoenholz, Maithra Raghu, Jascha Sohl-Dickstein:
Explaining the Learning Dynamics of Direct Feedback Alignment. ICLR (Workshop) 2017
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/RaffelL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/RaffelL17
Colin Raffel, Dieterich Lawson:
Training a Subsampling Mechanism in Expectation. ICLR (Workshop) 2017
[c10]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/RaffelLLWE17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/RaffelLLWE17
Colin Raffel, Minh-Thang Luong, Peter J. Liu, Ron J. Weiss, Douglas Eck:
Online and Linear-Time Attention by Enforcing Monotonic Alignments. ICML 2017: 2837-2846
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/RaffelL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/RaffelL17
Colin Raffel, Dieterich Lawson:
Training a Subsampling Mechanism in Expectation. CoRR abs/1702.06914 (2017)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/RaffelLLWE17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/RaffelLLWE17
Colin Raffel, Minh-Thang Luong, Peter J. Liu, Ron J. Weiss, Douglas Eck:
Online and Linear-Time Attention by Enforcing Monotonic Alignments. CoRR abs/1704.00784 (2017)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/LawsonTCRSJ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/LawsonTCRSJ17
Dieterich Lawson, George Tucker, Chung-Cheng Chiu, Colin Raffel, Kevin Swersky, Navdeep Jaitly:
Learning Hard Alignments with Variational Inference. CoRR abs/1705.05524 (2017)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1710-11153
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-11153
Curtis Hawthorne, Erich Elsen, Jialin Song, Adam Roberts, Ian Simon, Colin Raffel, Jesse H. Engel, Sageev Oore, Douglas Eck:
Onsets and Frames: Dual-Objective Piano Transcription. CoRR abs/1710.11153 (2017)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1712-05382
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-05382
Chung-Cheng Chiu, Colin Raffel:
Monotonic Chunkwise Attention. CoRR abs/1712.05382 (2017)
2016
[b1]
- view
  authority control:
- export record
  dblp key:
  - phd/us/Raffel16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/us/Raffel16
Colin Raffel:
Learning-Based Methods for Comparing Sequences, with Applications to Audio-to-MIDI Alignment and Matching. Columbia University, USA, 2016
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/YakovenkoCRF16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/YakovenkoCRF16
Nikolai Yakovenko, Liangliang Cao, Colin Raffel, James Fan:
Poker-CNN: A Pattern Learning Strategy for Making Draws and Bets in Poker Games Using Convolutional Networks. AAAI 2016: 360-368
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RaffelE16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RaffelE16
Colin Raffel, Daniel P. W. Ellis:
Optimizing DTW-based audio-to-MIDI alignment and matching. ICASSP 2016: 81-85
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RaffelE16a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RaffelE16a
Colin Raffel, Daniel P. W. Ellis:
Pruning subsequence search with attention-based embedding. ICASSP 2016: 554-558
[c6]
- view
  - electronic edition @ nyu.edu (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/RaffelE16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/RaffelE16
Colin Raffel, Daniel P. W. Ellis:
Extracting Ground-Truth Information from MIDI Files: A MIDIfesto. ISMIR 2016: 796-802
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/Al-RfouAAa16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/Al-RfouAAa16
Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermüller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul F. Christiano, Tim Cooijmans, Marc-Alexandre Côté, Myriam Côté, Aaron C. Courville, Yann N. Dauphin, Olivier Delalleau, Julien Demouth, Guillaume Desjardins, Sander Dieleman, Laurent Dinh, Melanie Ducoffe, Vincent Dumoulin, Samira Ebrahimi Kahou, Dumitru Erhan, Ziye Fan, Orhan Firat, Mathieu Germain, Xavier Glorot, Ian J. Goodfellow, Matthew Graham, Çaglar Gülçehre, Philippe Hamel, Iban Harlouchet, Jean-Philippe Heng, Balázs Hidasi, Sina Honari, Arjun Jain, Sébastien Jean, Kai Jia, Mikhail Korobov, Vivek Kulkarni, Alex Lamb, Pascal Lamblin, Eric Larsen, César Laurent, Sean Lee, Simon Lefrançois, Simon Lemieux, Nicholas Léonard, Zhouhan Lin, Jesse A. Livezey, Cory Lorenz, Jeremiah Lowin, Qianli Ma, Pierre-Antoine Manzagol, Olivier Mastropietro, Robert McGibbon, Roland Memisevic, Bart van Merriënboer, Vincent Michalski, Mehdi Mirza, Alberto Orlandi, Christopher Joseph Pal, Razvan Pascanu, Mohammad Pezeshki, Colin Raffel, Daniel Renshaw, Matthew Rocklin, Adriana Romero, Markus Roth, Peter Sadowski, John Salvatier, François Savard, Jan Schlüter, John Schulman, Gabriel Schwartz, Iulian Vlad Serban, Dmitriy Serdyuk, Samira Shabanian, Étienne Simon, Sigurd Spieckermann, S. Ramana Subramanyam, Jakub Sygnowski, Jérémie Tanguay, Gijs van Tulder, Joseph P. Turian, Sebastian Urban, Pascal Vincent, Francesco Visin, Harm de Vries, David Warde-Farley, Dustin J. Webb, Matthew Willson, Kelvin Xu, Lijun Xue, Li Yao, Saizheng Zhang, Ying Zhang:
Theano: A Python framework for fast computation of mathematical expressions. CoRR abs/1605.02688 (2016)
2015
[c5]
- view
  - electronic edition @ uma.es (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/RaffelE15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/RaffelE15
Colin Raffel, Daniel P. W. Ellis:
Large-Scale Content-Based Matching of MIDI and Audio Files. ISMIR 2015: 234-240
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/scipy/McFeeRLEMBN15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/scipy/McFeeRLEMBN15
Brian McFee, Colin Raffel, Dawen Liang, Daniel P. W. Ellis, Matt McVicar, Eric Battenberg, Oriol Nieto:
librosa: Audio and Music Signal Analysis in Python. SciPy 2015: 18-24
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/YakovenkoCRF15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/YakovenkoCRF15
Nikolai Yakovenko, Liangliang Cao, Colin Raffel, James Fan:
Poker-CNN: A Pattern Learning Strategy for Making Draws and Bets in Poker Games. CoRR abs/1509.06731 (2015)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/RaffelE15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/RaffelE15
Colin Raffel, Daniel P. W. Ellis:
Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems. CoRR abs/1512.08756 (2015)
2014
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RaffelE14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RaffelE14
Colin Raffel, Daniel P. W. Ellis:
Estimating timing and channel distortion across related signals. ICASSP 2014: 654-658
[c2]
- view
- export record
  dblp key:
  - conf/ismir/RaffelMHSNLE14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/RaffelMHSNLE14
Colin Raffel, Brian McFee, Eric J. Humphrey, Justin Salamon, Oriol Nieto, Dawen Liang, Daniel P. W. Ellis:
MIR_EVAL: A Transparent Implementation of Common MIR Metrics. ISMIR 2014: 367-372
2010
[c1]
- view
  - electronic edition via handle.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icmc/RaffelKDBJ10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmc/RaffelKDBJ10
Colin Raffel, Nick Kruge, Diane Douglas, Edgar Berdahl, Wendy Ju:
The Lattice Harp: A New Hybrid Instrument And Controller. ICMC 2010

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.