Behnam Neyshabur

2020 – today

2024
[j5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/SinghCAAPGLH0XP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/SinghCAAPGLH0XP24
Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron T. Parisi, Abhishek Kumar, Alexander A. Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Fathy Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel:
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models. Trans. Mach. Learn. Res. 2024 (2024)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2408-00118
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-00118
Morgane Rivière, Shreya Pathak, Pier Giuseppe Sessa, Cassidy Hardin, Surya Bhupatiraju, Léonard Hussenot, Thomas Mesnard, Bobak Shahriari, Alexandre Ramé, Johan Ferret, Peter Liu, Pouya Tafti, Abe Friesen, Michelle Casbon, Sabela Ramos, Ravin Kumar, Charline Le Lan, Sammy Jerome, Anton Tsitsulin, Nino Vieillard, Piotr Stanczyk, Sertan Girgin, Nikola Momchev, Matt Hoffman, Shantanu Thakoor, Jean-Bastien Grill, Behnam Neyshabur, Olivier Bachem, Alanna Walton, Aliaksei Severyn, Alicia Parrish, Aliya Ahmad, Allen Hutchison, Alvin Abdagic, Amanda Carl, Amy Shen, Andy Brock, Andy Coenen, Anthony Laforge, Antonia Paterson, Ben Bastian, Bilal Piot, Bo Wu, Brandon Royal, Charlie Chen, Chintu Kumar, Chris Perry, Chris Welty, Christopher A. Choquette-Choo, Danila Sinopalnikov, David Weinberger, Dimple Vijaykumar, Dominika Rogozinska, Dustin Herbison, Elisa Bandy, Emma Wang, Eric Noland, Erica Moreira, Evan Senter, Evgenii Eltyshev, Francesco Visin, Gabriel Rasskin, Gary Wei, Glenn Cameron, Gus Martins, Hadi Hashemi, Hanna Klimczak-Plucinska, Harleen Batra, Harsh Dhand, Ivan Nardini, Jacinda Mein, Jack Zhou, James Svensson, Jeff Stanway, Jetha Chan, Jin Peng Zhou, Joana Carrasqueira, Joana Iljazi, Jocelyn Becker, Joe Fernandez, Joost van Amersfoort, Josh Gordon, Josh Lipschultz, Josh Newlan, Ju-yeong Ji, Kareem Mohamed, Kartikeya Badola, Kat Black, Katie Millican, Keelin McDonell, Kelvin Nguyen, Kiranbir Sodhia, Kish Greene, Lars Lowe Sjösund, Lauren Usui, Laurent Sifre, Lena Heuermann, Leticia Lago, Lilly McNealus:
Gemma 2: Improving Open Language Models at a Practical Size. CoRR abs/2408.00118 (2024)
2023
[j4]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/SrivastavaRRSAF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/SrivastavaRRSAF23
Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan J. Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, François Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse H. Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, José Hernández-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory W. Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, María José Ramírez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael I. Ivanitskiy, Michael Starritt, Michael Strube, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T., Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Milkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima (Shammie) Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu:
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Trans. Mach. Learn. Res. 2023 (2023)
[c41]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/JordanSSEN23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/JordanSSEN23
Keller Jordan, Hanie Sedghi, Olga Saukh, Rahim Entezari, Behnam Neyshabur:
REPAIR: REnormalizing Permuted Activations for Interpolation Repair. ICLR 2023
[c40]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/Mehta0CN23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Mehta0CN23
Harsh Mehta, Ankit Gupta, Ashok Cutkosky, Behnam Neyshabur:
Long Range Language Modeling via Gated State Spaces. ICLR 2023
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2312-06585
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-06585
Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin F. Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel:
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models. CoRR abs/2312.06585 (2023)
2022
[j3]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/AndreassenBNR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/AndreassenBNR22
Anders Johan Andreassen, Yasaman Bahri, Behnam Neyshabur, Rebecca Roelofs:
The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning. Trans. Mach. Learn. Res. 2022 (2022)
[c39]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/Abnar0NS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Abnar0NS22
Samira Abnar, Mostafa Dehghani, Behnam Neyshabur, Hanie Sedghi:
Exploring the Limits of Large Scale Pre-training. ICLR 2022
[c38]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/EntezariSSN22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/EntezariSSN22
Rahim Entezari, Hanie Sedghi, Olga Saukh, Behnam Neyshabur:
The Role of Permutation Invariance in Linear Mode Connectivity of Neural Networks. ICLR 2022
[c37]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/GargBLNS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GargBLNS22
Saurabh Garg, Sivaraman Balakrishnan, Zachary Chase Lipton, Behnam Neyshabur, Hanie Sedghi:
Leveraging unlabeled data to predict out-of-distribution performance. ICLR 2022
[c36]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/GilmerGGKNCDNF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GilmerGGKNCDNF22
Justin Gilmer, Behrooz Ghorbani, Ankush Garg, Sneha Kudugunta, Behnam Neyshabur, David Cardoze, George Edward Dahl, Zachary Nado, Orhan Firat:
A Loss Curvature Perspective on Training Instabilities of Deep Learning Models. ICLR 2022
[c35]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/BansalGGZCNF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BansalGGZCNF22
Yamini Bansal, Behrooz Ghorbani, Ankush Garg, Biao Zhang, Colin Cherry, Behnam Neyshabur, Orhan Firat:
Data Scaling Laws in NMT: The Effect of Noise and Architecture. ICML 2022: 1466-1482
[c34]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/AlabdulmohsinNZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/AlabdulmohsinNZ22
Ibrahim M. Alabdulmohsin, Behnam Neyshabur, Xiaohua Zhai:
Revisiting Neural Scaling Laws in Language and Vision. NeurIPS 2022
[c33]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/AnilWALMRSGDN22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/AnilWALMRSGDN22
Cem Anil, Yuhuai Wu, Anders Andreassen, Aitor Lewkowycz, Vedant Misra, Vinay V. Ramasesh, Ambrose Slone, Guy Gur-Ari, Ethan Dyer, Behnam Neyshabur:
Exploring Length Generalization in Large Language Models. NeurIPS 2022
[c32]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/HutchinsSWDN22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HutchinsSWDN22
DeLesley Hutchins, Imanol Schlag, Yuhuai Wu, Ethan Dyer, Behnam Neyshabur:
Block-Recurrent Transformers. NeurIPS 2022
[c31]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LewkowyczADDMRS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LewkowyczADDMRS22
Aitor Lewkowycz, Anders Andreassen, David Dohan, Ethan Dyer, Henryk Michalewski, Vinay V. Ramasesh, Ambrose Slone, Cem Anil, Imanol Schlag, Theo Gutman-Solo, Yuhuai Wu, Behnam Neyshabur, Guy Gur-Ari, Vedant Misra:
Solving Quantitative Reasoning Problems with Language Models. NeurIPS 2022
[i47]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-04234
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-04234
Saurabh Garg, Sivaraman Balakrishnan, Zachary C. Lipton, Behnam Neyshabur, Hanie Sedghi:
Leveraging Unlabeled Data to Predict Out-of-Distribution Performance. CoRR abs/2201.04234 (2022)
[i46]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-01994
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-01994
Yamini Bansal, Behrooz Ghorbani, Ankush Garg, Biao Zhang, Maxim Krikun, Colin Cherry, Behnam Neyshabur, Orhan Firat:
Data Scaling Laws in NMT: The Effect of Noise and Architecture. CoRR abs/2202.01994 (2022)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2203-07852
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-07852
DeLesley Hutchins, Imanol Schlag, Yuhuai Wu, Ethan Dyer, Behnam Neyshabur:
Block-Recurrent Transformers. CoRR abs/2203.07852 (2022)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2206-04615
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-04615
Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan J. Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, François Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse H. Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, José Hernández-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory W. Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, María José Ramírez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael I. Ivanitskiy, Michael Starritt, Michael Strube, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T., Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Milkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima (Shammie) Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu:
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. CoRR abs/2206.04615 (2022)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2206-10915
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-10915
Lukas Timpl, Rahim Entezari, Hanie Sedghi, Behnam Neyshabur, Olga Saukh:
Understanding the effect of sparsity on neural networks robustness. CoRR abs/2206.10915 (2022)
[i42]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2206-13947
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-13947
Harsh Mehta, Ankit Gupta, Ashok Cutkosky, Behnam Neyshabur:
Long Range Language Modeling via Gated State Spaces. CoRR abs/2206.13947 (2022)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2206-14858
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-14858
Aitor Lewkowycz, Anders Andreassen, David Dohan, Ethan Dyer, Henryk Michalewski, Vinay V. Ramasesh, Ambrose Slone, Cem Anil, Imanol Schlag, Theo Gutman-Solo, Yuhuai Wu, Behnam Neyshabur, Guy Gur-Ari, Vedant Misra:
Solving Quantitative Reasoning Problems with Language Models. CoRR abs/2206.14858 (2022)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2207-04901
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-04901
Cem Anil, Yuhuai Wu, Anders Andreassen, Aitor Lewkowycz, Vedant Misra, Vinay V. Ramasesh, Ambrose Slone, Guy Gur-Ari, Ethan Dyer, Behnam Neyshabur:
Exploring Length Generalization in Large Language Models. CoRR abs/2207.04901 (2022)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2209-06640
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-06640
Ibrahim Alabdulmohsin, Behnam Neyshabur, Xiaohua Zhai:
Revisiting Neural Scaling Laws in Language and Vision. CoRR abs/2209.06640 (2022)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2211-08403
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-08403
Keller Jordan, Hanie Sedghi, Olga Saukh, Rahim Entezari, Behnam Neyshabur:
REPAIR: REnormalizing Permuted Activations for Interpolation Repair. CoRR abs/2211.08403 (2022)
[i37]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2211-09066
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-09066
Hattie Zhou, Azade Nova, Hugo Larochelle, Aaron C. Courville, Behnam Neyshabur, Hanie Sedghi:
Teaching Algorithmic Reasoning via In-context Learning. CoRR abs/2211.09066 (2022)
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2211-10193
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-10193
Amr Khalifa, Michael C. Mozer, Hanie Sedghi, Behnam Neyshabur, Ibrahim Alabdulmohsin:
Layer-Stack Temperature Scaling. CoRR abs/2211.10193 (2022)
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2211-11052
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-11052
Tolga Ergen, Behnam Neyshabur, Harsh Mehta:
Convexifying Transformers: Improving optimization and understanding of transformer networks. CoRR abs/2211.11052 (2022)
2021
[c30]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ForetKMN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ForetKMN21
Pierre Foret, Ariel Kleiner, Hossein Mobahi, Behnam Neyshabur:
Sharpness-aware Minimization for Efficiently Improving Generalization. ICLR 2021
[c29]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/GolubevaGN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GolubevaGN21
Anna Golubeva, Guy Gur-Ari, Behnam Neyshabur:
Are wider nets better given the same number of parameters? ICLR 2021
[c28]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/MehtaCN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/MehtaCN21
Harsh Mehta, Ashok Cutkosky, Behnam Neyshabur:
Extreme Memorization via Scale of Initialization. ICLR 2021
[c27]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/NagarajanAN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/NagarajanAN21
Vaishnavh Nagarajan, Anders Andreassen, Behnam Neyshabur:
Understanding the failure modes of out-of-distribution generalization. ICLR 2021
[c26]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/NakkiranNS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/NakkiranNS21
Preetum Nakkiran, Behnam Neyshabur, Hanie Sedghi:
The Deep Bootstrap Framework: Good Online Learners are Good Offline Generalizers. ICLR 2021
[c25]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WuDN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WuDN21
Xiaoxia Wu, Ethan Dyer, Behnam Neyshabur:
When Do Curricula Work? ICLR 2021
[c24]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/BaldockMN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BaldockMN21
Robert J. N. Baldock, Hartmut Maennel, Behnam Neyshabur:
Deep Learning Through the Lens of Example Difficulty. NeurIPS 2021: 10876-10889
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-09647
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-09647
Robert J. N. Baldock, Hartmut Maennel, Behnam Neyshabur:
Deep Learning Through the Lens of Example Difficulty. CoRR abs/2106.09647 (2021)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-15831
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-15831
Anders Andreassen, Yasaman Bahri, Behnam Neyshabur, Rebecca Roelofs:
The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning. CoRR abs/2106.15831 (2021)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-02095
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-02095
Samira Abnar, Mostafa Dehghani, Behnam Neyshabur, Hanie Sedghi:
Exploring the Limits of Large Scale Pre-training. CoRR abs/2110.02095 (2021)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-04369
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04369
Justin Gilmer, Behrooz Ghorbani, Ankush Garg, Sneha Kudugunta, Behnam Neyshabur, David Cardoze, George E. Dahl, Zachary Nado, Orhan Firat:
A Loss Curvature Perspective on Training Instability in Deep Learning. CoRR abs/2110.04369 (2021)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-06296
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06296
Rahim Entezari, Hanie Sedghi, Olga Saukh, Behnam Neyshabur:
The Role of Permutation Invariance in Linear Mode Connectivity of Neural Networks. CoRR abs/2110.06296 (2021)
2020
[c23]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ChatterjiNS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChatterjiNS20
Niladri S. Chatterji, Behnam Neyshabur, Hanie Sedghi:
The intriguing role of module criticality in the generalization of deep networks. ICLR 2020
[c22]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/JiangNMKB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/JiangNMKB20
Yiding Jiang, Behnam Neyshabur, Hossein Mobahi, Dilip Krishnan, Samy Bengio:
Fantastic Generalization Measures and Where to Find Them. ICLR 2020
[c21]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/SongJTDN20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SongJTDN20
Xingyou Song, Yiding Jiang, Stephen Tu, Yilun Du, Behnam Neyshabur:
Observational Overfitting in Reinforcement Learning. ICLR 2020
[c20]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/JiangNSAKSL0DGG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/JiangNSAKSL0DGG20
Yiding Jiang, Parth Natekar, Manik Sharma, Sumukh K. Aithal, Dhruva Kashyap, Natarajan Subramanyam, Carlos Lassance, Daniel M. Roy, Gintare Karolina Dziugaite, Suriya Gunasekar, Isabelle Guyon, Pierre Foret, Scott Yak, Hossein Mobahi, Behnam Neyshabur, Samy Bengio:
Methods and Analysis of The First Competition in Predicting Generalization of Deep Learning. NeurIPS (Competition and Demos) 2020: 170-190
[c19]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Neyshabur20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Neyshabur20
Behnam Neyshabur:
Towards Learning Convolutions from Scratch. NeurIPS 2020
[c18]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/NeyshaburSZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/NeyshaburSZ20
Behnam Neyshabur, Hanie Sedghi, Chiyuan Zhang:
What is being transferred in transfer learning? NeurIPS 2020
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-13657
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-13657
Behnam Neyshabur:
Towards Learning Convolutions from Scratch. CoRR abs/2007.13657 (2020)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-11687
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-11687
Behnam Neyshabur, Hanie Sedghi, Chiyuan Zhang:
What is being transferred in transfer learning? CoRR abs/2008.11687 (2020)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-13363
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-13363
Harsh Mehta, Ashok Cutkosky, Behnam Neyshabur:
Extreme Memorization via Scale of Initialization. CoRR abs/2008.13363 (2020)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-01412
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-01412
Pierre Foret, Ariel Kleiner, Hossein Mobahi, Behnam Neyshabur:
Sharpness-Aware Minimization for Efficiently Improving Generalization. CoRR abs/2010.01412 (2020)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-08127
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-08127
Preetum Nakkiran, Behnam Neyshabur, Hanie Sedghi:
The Deep Bootstrap: Good Online Learners are Good Offline Generalizers. CoRR abs/2010.08127 (2020)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-14495
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-14495
Anna Golubeva, Behnam Neyshabur, Guy Gur-Ari:
Are wider nets better given the same number of parameters? CoRR abs/2010.14495 (2020)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-15775
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-15775
Vaishnavh Nagarajan, Anders Andreassen, Behnam Neyshabur:
Understanding the Failure Modes of Out-of-Distribution Generalization. CoRR abs/2010.15775 (2020)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-03107
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-03107
Xiaoxia Wu, Ethan Dyer, Behnam Neyshabur:
When Do Curricula Work? CoRR abs/2012.03107 (2020)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-07976
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-07976
Yiding Jiang, Pierre Foret, Scott Yak, Daniel M. Roy, Hossein Mobahi, Gintare Karolina Dziugaite, Samy Bengio, Suriya Gunasekar, Isabelle Guyon, Behnam Neyshabur:
NeurIPS 2020 Competition: Predicting Generalization in Deep Learning. CoRR abs/2012.07976 (2020)

2010 – 2019

2019
[c17]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/NeyshaburLBLS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/NeyshaburLBLS19
Behnam Neyshabur, Zhiyuan Li, Srinadh Bhojanapalli, Yann LeCun, Nathan Srebro:
The role of over-parametrization in generalization of neural networks. ICLR (Poster) 2019
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-00528
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-00528
Niladri S. Chatterji, Behnam Neyshabur, Hanie Sedghi:
The intriguing role of module criticality in the generalization of deep networks. CoRR abs/1912.00528 (2019)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-02178
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-02178
Yiding Jiang, Behnam Neyshabur, Hossein Mobahi, Dilip Krishnan, Samy Bengio:
Fantastic Generalization Measures and Where to Find Them. CoRR abs/1912.02178 (2019)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-02975
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-02975
Xingyou Song, Yiding Jiang, Stephen Tu, Yilun Du, Behnam Neyshabur:
Observational Overfitting in Reinforcement Learning. CoRR abs/1912.02975 (2019)
2018
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/bioinformatics/HashemifarNKX18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/bioinformatics/HashemifarNKX18
Somaye Hashemifar, Behnam Neyshabur, Aly A. Khan, Jinbo Xu:
Predicting protein-protein interactions through sequence-based deep learning. Bioinform. 34(17): i802-i810 (2018)
[c16]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/NeyshaburBS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/NeyshaburBS18
Behnam Neyshabur, Srinadh Bhojanapalli, Nathan Srebro:
A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks. ICLR (Poster) 2018
[c15]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/Arora0NZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Arora0NZ18
Sanjeev Arora, Rong Ge, Behnam Neyshabur, Yi Zhang:
Stronger Generalization Bounds for Deep Nets via a Compression Approach. ICML 2018: 254-263
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/ita/GunasekarWBNS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ita/GunasekarWBNS18
Suriya Gunasekar, Blake E. Woodworth, Srinadh Bhojanapalli, Behnam Neyshabur, Nathan Srebro:
Implicit Regularization in Matrix Factorization. ITA 2018: 1-10
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-05296
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-05296
Sanjeev Arora, Rong Ge, Behnam Neyshabur, Yi Zhang:
Stronger generalization bounds for deep nets via a compression approach. CoRR abs/1802.05296 (2018)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1805-12076
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-12076
Behnam Neyshabur, Zhiyuan Li, Srinadh Bhojanapalli, Yann LeCun, Nathan Srebro:
Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks. CoRR abs/1805.12076 (2018)
2017
[c13]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/colt/AgarwalLNS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/AgarwalLNS17
Alekh Agarwal, Haipeng Luo, Behnam Neyshabur, Robert E. Schapire:
Corralling a Band of Bandit Algorithms. COLT 2017: 12-38
[c12]
- view
- export record
  dblp key:
  - conf/nips/NeyshaburBMS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/NeyshaburBMS17
Behnam Neyshabur, Srinadh Bhojanapalli, David McAllester, Nati Srebro:
Exploring Generalization in Deep Learning. NIPS 2017: 5947-5956
[c11]
- view
- export record
  dblp key:
  - conf/nips/GunasekarWBNS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/GunasekarWBNS17
Suriya Gunasekar, Blake E. Woodworth, Srinadh Bhojanapalli, Behnam Neyshabur, Nati Srebro:
Implicit Regularization in Matrix Factorization. NIPS 2017: 6151-6159
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburTSS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburTSS17
Behnam Neyshabur, Ryota Tomioka, Ruslan Salakhutdinov, Nathan Srebro:
Geometry of Optimization and Implicit Regularization in Deep Learning. CoRR abs/1705.03071 (2017)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburBC17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburBC17
Behnam Neyshabur, Srinadh Bhojanapalli, Ayan Chakrabarti:
Stabilizing GAN Training with Multiple Random Projections. CoRR abs/1705.07831 (2017)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/GunasekarWBNS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/GunasekarWBNS17
Suriya Gunasekar, Blake E. Woodworth, Srinadh Bhojanapalli, Behnam Neyshabur, Nathan Srebro:
Implicit Regularization in Matrix Factorization. CoRR abs/1705.09280 (2017)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburBMS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburBMS17
Behnam Neyshabur, Srinadh Bhojanapalli, David McAllester, Nathan Srebro:
Exploring Generalization in Deep Learning. CoRR abs/1706.08947 (2017)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburBMS17aa
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburBMS17aa
Behnam Neyshabur, Srinadh Bhojanapalli, David McAllester, Nathan Srebro:
A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks. CoRR abs/1707.09564 (2017)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1709-01953
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1709-01953
Behnam Neyshabur:
Implicit Regularization in Deep Learning. CoRR abs/1709.01953 (2017)
2016
[c10]
- view
- export record
  dblp key:
  - conf/nips/NeyshaburWSS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/NeyshaburWSS16
Behnam Neyshabur, Yuhuai Wu, Ruslan Salakhutdinov, Nati Srebro:
Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations. NIPS 2016: 3477-3485
[c9]
- view
- export record
  dblp key:
  - conf/nips/BhojanapalliNS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BhojanapalliNS16
Srinadh Bhojanapalli, Behnam Neyshabur, Nati Srebro:
Global Optimality of Local Search for Low Rank Matrix Recovery. NIPS 2016: 3873-3881
[c8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburTSS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburTSS15
Behnam Neyshabur, Ryota Tomioka, Ruslan Salakhutdinov, Nathan Srebro:
Data-Dependent Path Normalization in Neural Networks. ICLR (Poster) 2016
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburWSS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburWSS16
Behnam Neyshabur, Yuhuai Wu, Ruslan Salakhutdinov, Nathan Srebro:
Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations. CoRR abs/1605.07154 (2016)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/BhojanapalliNS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/BhojanapalliNS16
Srinadh Bhojanapalli, Behnam Neyshabur, Nathan Srebro:
Global Optimality of Local Search for Low Rank Matrix Recovery. CoRR abs/1605.07221 (2016)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/AgarwalLNS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AgarwalLNS16
Alekh Agarwal, Haipeng Luo, Behnam Neyshabur, Robert E. Schapire:
Corralling a Band of Bandit Algorithms. CoRR abs/1612.06246 (2016)
2015
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/bibm/HashemifarNX15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bibm/HashemifarNX15
Somaye Hashemifar, Behnam Neyshabur, Jinbo Xu:
Joint inference of tissue-specific networks with a scale free topology. BIBM 2015: 290-294
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/colt/NeyshaburTS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/NeyshaburTS15
Behnam Neyshabur, Ryota Tomioka, Nathan Srebro:
Norm-Based Capacity Control in Neural Networks. COLT 2015: 1376-1401
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/NeyshaburS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/NeyshaburS15
Behnam Neyshabur, Nathan Srebro:
On Symmetric and Asymmetric LSHs for Inner Product Search. ICML 2015: 1926-1934
[c4]
- view
- export record
  dblp key:
  - conf/nips/NeyshaburSS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/NeyshaburSS15
Behnam Neyshabur, Ruslan Salakhutdinov, Nathan Srebro:
Path-SGD: Path-Normalized Optimization in Deep Neural Networks. NIPS 2015: 2422-2430
[c3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburTS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburTS14
Behnam Neyshabur, Ryota Tomioka, Nathan Srebro:
In Search of the Real Inductive Bias: On the Role of Implicit Regularization in Deep Learning. ICLR (Workshop) 2015
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburTS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburTS15
Behnam Neyshabur, Ryota Tomioka, Nathan Srebro:
Norm-Based Capacity Control in Neural Networks. CoRR abs/1503.00036 (2015)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburSS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburSS15
Behnam Neyshabur, Ruslan Salakhutdinov, Nathan Srebro:
Path-SGD: Path-Normalized Optimization in Deep Neural Networks. CoRR abs/1506.02617 (2015)
2014
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/alt/NeyshaburMS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/alt/NeyshaburMS14
Behnam Neyshabur, Yury Makarychev, Nathan Srebro:
Clustering, Hamming Embedding, Generalized LSH and the Max Norm. ALT 2014: 306-320
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburMS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburMS14
Behnam Neyshabur, Yury Makarychev, Nathan Srebro:
Clustering, Hamming Embedding, Generalized LSH and the Max Norm. CoRR abs/1405.3167 (2014)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburS14
Behnam Neyshabur, Nathan Srebro:
On Symmetric and Asymmetric LSHs for Inner Product Search. CoRR abs/1410.5518 (2014)
2013
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/bioinformatics/NeyshaburKHA13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/bioinformatics/NeyshaburKHA13
Behnam Neyshabur, Ahmadreza Khadem, Somaye Hashemifar, Seyed Shahriar Arab:
NETAL: a new graph-based method for global alignment of protein-protein interaction networks. Bioinform. 29(13): 1654-1662 (2013)
[c1]
- view
- export record
  dblp key:
  - conf/nips/NeyshaburSSMY13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/NeyshaburSSMY13
Behnam Neyshabur, Nati Srebro, Ruslan Salakhutdinov, Yury Makarychev, Payman Yadollahpour:
The Power of Asymmetry in Binary Hashing. NIPS 2013: 2823-2831
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburP13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburP13
Behnam Neyshabur, Rina Panigrahy:
Sparse Matrix Factorization. CoRR abs/1311.3315 (2013)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/NeyshaburYMSS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NeyshaburYMSS13
Behnam Neyshabur, Payman Yadollahpour, Yury Makarychev, Ruslan Salakhutdinov, Nathan Srebro:
The Power of Asymmetry in Binary Hashing. CoRR abs/1311.7662 (2013)
