default search action
Denny Zhou
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j3]Hyung Won Chung, Le Hou, Shayne Longpre, Barret Zoph, Yi Tay, William Fedus, Yunxuan Li, Xuezhi Wang, Mostafa Dehghani, Siddhartha Brahma, Albert Webson, Shixiang Shane Gu, Zhuyun Dai, Mirac Suzgun, Xinyun Chen, Aakanksha Chowdhery, Alex Castro-Ros, Marie Pellat, Kevin Robinson, Dasha Valter, Sharan Narang, Gaurav Mishra, Adams Yu, Vincent Y. Zhao, Yanping Huang, Andrew M. Dai, Hongkun Yu, Slav Petrov, Ed H. Chi, Jeff Dean, Jacob Devlin, Adam Roberts, Denny Zhou, Quoc V. Le, Jason Wei:
Scaling Instruction-Finetuned Language Models. J. Mach. Learn. Res. 25: 70:1-70:53 (2024) - [c47]Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry W. Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc V. Le, Thang Luong:
FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation. ACL (Findings) 2024: 13697-13720 - [c46]Zhiyuan Liu, Hong Liu, Denny Zhou, Tengyu Ma:
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems. ICLR 2024 - [c45]Jie Huang, Xinyun Chen, Swaroop Mishra, Huaixiu Steven Zheng, Adams Wei Yu, Xinying Song, Denny Zhou:
Large Language Models Cannot Self-Correct Reasoning Yet. ICLR 2024 - [c44]Tianle Cai, Xuezhi Wang, Tengyu Ma, Xinyun Chen, Denny Zhou:
Large Language Models as Tool Makers. ICLR 2024 - [c43]Xinyun Chen, Maxwell Lin, Nathanael Schärli, Denny Zhou:
Teaching Large Language Models to Self-Debug. ICLR 2024 - [c42]Sheng Shen, Le Hou, Yanqi Zhou, Nan Du, Shayne Longpre, Jason Wei, Hyung Won Chung, Barret Zoph, William Fedus, Xinyun Chen, Tu Vu, Yuexin Wu, Wuyang Chen, Albert Webson, Yunxuan Li, Vincent Y. Zhao, Hongkun Yu, Kurt Keutzer, Trevor Darrell, Denny Zhou:
Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models. ICLR 2024 - [c41]Chengrun Yang, Xuezhi Wang, Yifeng Lu, Hanxiao Liu, Quoc V. Le, Denny Zhou, Xinyun Chen:
Large Language Models as Optimizers. ICLR 2024 - [c40]Michihiro Yasunaga, Xinyun Chen, Yujia Li, Panupong Pasupat, Jure Leskovec, Percy Liang, Ed H. Chi, Denny Zhou:
Large Language Models as Analogical Reasoners. ICLR 2024 - [c39]Huaixiu Steven Zheng, Swaroop Mishra, Xinyun Chen, Heng-Tze Cheng, Ed H. Chi, Quoc V. Le, Denny Zhou:
Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models. ICLR 2024 - [c38]Xinyun Chen, Ryan A. Chi, Xuezhi Wang, Denny Zhou:
Premise Order Matters in Reasoning with Large Language Models. ICML 2024 - [c37]Shayne Longpre, Gregory Yauney, Emily Reif, Katherine Lee, Adam Roberts, Barret Zoph, Denny Zhou, Jason Wei, Kevin Robinson, David Mimno, Daphne Ippolito:
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity. NAACL-HLT 2024: 3245-3276 - [i61]Pei Zhou, Jay Pujara, Xiang Ren, Xinyun Chen, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou, Swaroop Mishra, Huaixiu Steven Zheng:
Self-Discover: Large Language Models Self-Compose Reasoning Structures. CoRR abs/2402.03620 (2024) - [i60]Xinyun Chen, Ryan A. Chi, Xuezhi Wang, Denny Zhou:
Premise Order Matters in Reasoning with Large Language Models. CoRR abs/2402.08939 (2024) - [i59]Yongchao Zhou, Uri Alon, Xinyun Chen, Xuezhi Wang, Rishabh Agarwal, Denny Zhou:
Transformers Can Achieve Length Generalization But Not Robustly. CoRR abs/2402.09371 (2024) - [i58]Xuezhi Wang, Denny Zhou:
Chain-of-Thought Reasoning Without Prompting. CoRR abs/2402.10200 (2024) - [i57]Zhiyuan Li, Hong Liu, Denny Zhou, Tengyu Ma:
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems. CoRR abs/2402.12875 (2024) - [i56]Ruibo Liu, Jerry Wei, Fangyu Liu, Chenglei Si, Yanzhe Zhang, Jinmeng Rao, Steven Zheng, Daiyi Peng, Diyi Yang, Denny Zhou, Andrew M. Dai:
Best Practices and Lessons Learned on Synthetic Data for Language Models. CoRR abs/2404.07503 (2024) - [i55]Kaixuan Huang, Yuanhao Qu, Henry Cousins, William A. Johnson, Di Yin, Mihir Shah, Denny Zhou, Russ B. Altman, Mengdi Wang, Le Cong:
CRISPR-GPT: An LLM Agent for Automated Design of Gene-Editing Experiments. CoRR abs/2404.18021 (2024) - [i54]Huaixiu Steven Zheng, Swaroop Mishra, Hugh Zhang, Xinyun Chen, Minmin Chen, Azade Nova, Le Hou, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou:
NATURAL PLAN: Benchmarking LLMs on Natural Language Planning. CoRR abs/2406.04520 (2024) - [i53]Yuan Xue, Denny Zhou, Nan Du, Andrew M. Dai, Zhen Xu, Kun Zhang, Claire Cui:
Deep State-Space Generative Model For Correlated Time-to-Event Predictions. CoRR abs/2407.19371 (2024) - 2023
- [j2]Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-Ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-Hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel:
PaLM: Scaling Language Modeling with Pathways. J. Mach. Learn. Res. 24: 240:1-240:113 (2023) - [c36]Mirac Suzgun, Nathan Scales, Nathanael Schärli, Sebastian Gehrmann, Yi Tay, Hyung Won Chung, Aakanksha Chowdhery, Quoc V. Le, Ed H. Chi, Denny Zhou, Jason Wei:
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them. ACL (Findings) 2023: 13003-13051 - [c35]Jerry W. Wei, Le Hou, Andrew K. Lampinen, Xiangning Chen, Da Huang, Yi Tay, Xinyun Chen, Yifeng Lu, Denny Zhou, Tengyu Ma, Quoc V. Le:
Symbol tuning improves in-context learning in language models. EMNLP 2023: 968-979 - [c34]Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani:
Transcending Scaling Laws with 0.1% Extra Compute. EMNLP 2023: 1471-1486 - [c33]Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc V. Le, Ed H. Chi, Sharan Narang, Aakanksha Chowdhery, Denny Zhou:
Self-Consistency Improves Chain of Thought Reasoning in Language Models. ICLR 2023 - [c32]Ekin Akyürek, Dale Schuurmans, Jacob Andreas, Tengyu Ma, Denny Zhou:
What learning algorithm is in-context learning? Investigations with linear models. ICLR 2023 - [c31]Andrew Drozdov, Nathanael Schärli, Ekin Akyürek, Nathan Scales, Xinying Song, Xinyun Chen, Olivier Bousquet, Denny Zhou:
Compositional Semantic Parsing with Large Language Models. ICLR 2023 - [c30]Ruibo Liu, Jason Wei, Shixiang Shane Gu, Te-Yen Wu, Soroush Vosoughi, Claire Cui, Denny Zhou, Andrew M. Dai:
Mind's Eye: Grounded Language Model Reasoning through Simulation. ICLR 2023 - [c29]Freda Shi, Mirac Suzgun, Markus Freitag, Xuezhi Wang, Suraj Srivats, Soroush Vosoughi, Hyung Won Chung, Yi Tay, Sebastian Ruder, Denny Zhou, Dipanjan Das, Jason Wei:
Language models are multilingual chain-of-thought reasoners. ICLR 2023 - [c28]Zhiqing Sun, Xuezhi Wang, Yi Tay, Yiming Yang, Denny Zhou:
Recitation-Augmented Language Models. ICLR 2023 - [c27]Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler:
UL2: Unifying Language Learning Paradigms. ICLR 2023 - [c26]Tianjun Zhang, Xuezhi Wang, Denny Zhou, Dale Schuurmans, Joseph E. Gonzalez:
TEMPERA: Test-Time Prompt Editing via Reinforcement Learning. ICLR 2023 - [c25]Denny Zhou, Nathanael Schärli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc V. Le, Ed H. Chi:
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models. ICLR 2023 - [c24]Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc V. Le, Barret Zoph, Jason Wei, Adam Roberts:
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning. ICML 2023: 22631-22648 - [c23]Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang:
Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization. ICML 2023: 28389-28421 - [c22]Freda Shi, Xinyun Chen, Kanishka Misra, Nathan Scales, David Dohan, Ed H. Chi, Nathanael Schärli, Denny Zhou:
Large Language Models Can Be Easily Distracted by Irrelevant Context. ICML 2023: 31210-31227 - [i52]Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc V. Le, Barret Zoph, Jason Wei, Adam Roberts:
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning. CoRR abs/2301.13688 (2023) - [i51]Freda Shi, Xinyun Chen, Kanishka Misra, Nathan Scales, David Dohan, Ed H. Chi, Nathanael Schärli, Denny Zhou:
Large Language Models Can Be Easily Distracted by Irrelevant Context. CoRR abs/2302.00093 (2023) - [i50]Jerry W. Wei, Jason Wei, Yi Tay, Dustin Tran, Albert Webson, Yifeng Lu, Xinyun Chen, Hanxiao Liu, Da Huang, Denny Zhou, Tengyu Ma:
Larger language models do in-context learning differently. CoRR abs/2303.03846 (2023) - [i49]Xinyun Chen, Maxwell Lin, Nathanael Schärli, Denny Zhou:
Teaching Large Language Models to Self-Debug. CoRR abs/2304.05128 (2023) - [i48]Jerry W. Wei, Le Hou, Andrew K. Lampinen, Xiangning Chen, Da Huang, Yi Tay, Xinyun Chen, Yifeng Lu, Denny Zhou, Tengyu Ma, Quoc V. Le:
Symbol tuning improves in-context learning in language models. CoRR abs/2305.08298 (2023) - [i47]Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang:
Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization. CoRR abs/2305.11965 (2023) - [i46]Shayne Longpre, Gregory Yauney, Emily Reif, Katherine Lee, Adam Roberts, Barret Zoph, Denny Zhou, Jason Wei, Kevin Robinson, David Mimno, Daphne Ippolito:
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity. CoRR abs/2305.13169 (2023) - [i45]Sheng Shen, Le Hou, Yanqi Zhou, Nan Du, Shayne Longpre, Jason Wei, Hyung Won Chung, Barret Zoph, William Fedus, Xinyun Chen, Tu Vu, Yuexin Wu, Wuyang Chen, Albert Webson, Yunxuan Li, Vincent Y. Zhao, Hongkun Yu, Kurt Keutzer, Trevor Darrell, Denny Zhou:
Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts. CoRR abs/2305.14705 (2023) - [i44]Ruibo Liu, Ruixin Yang, Chenyan Jia, Ge Zhang, Denny Zhou, Andrew M. Dai, Diyi Yang, Soroush Vosoughi:
Training Socially Aligned Language Models in Simulated Human Society. CoRR abs/2305.16960 (2023) - [i43]Tianle Cai, Xuezhi Wang, Tengyu Ma, Xinyun Chen, Denny Zhou:
Large Language Models as Tool Makers. CoRR abs/2305.17126 (2023) - [i42]Jerry W. Wei, Da Huang, Yifeng Lu, Denny Zhou, Quoc V. Le:
Simple synthetic data reduces sycophancy in large language models. CoRR abs/2308.03958 (2023) - [i41]Chengrun Yang, Xuezhi Wang, Yifeng Lu, Hanxiao Liu, Quoc V. Le, Denny Zhou, Xinyun Chen:
Large Language Models as Optimizers. CoRR abs/2309.03409 (2023) - [i40]Michihiro Yasunaga, Xinyun Chen, Yujia Li, Panupong Pasupat, Jure Leskovec, Percy Liang, Ed H. Chi, Denny Zhou:
Large Language Models as Analogical Reasoners. CoRR abs/2310.01714 (2023) - [i39]Jie Huang, Xinyun Chen, Swaroop Mishra, Huaixiu Steven Zheng, Adams Wei Yu, Xinying Song, Denny Zhou:
Large Language Models Cannot Self-Correct Reasoning Yet. CoRR abs/2310.01798 (2023) - [i38]Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry W. Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc V. Le, Thang Luong:
FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation. CoRR abs/2310.03214 (2023) - [i37]Huaixiu Steven Zheng, Swaroop Mishra, Xinyun Chen, Heng-Tze Cheng, Ed H. Chi, Quoc V. Le, Denny Zhou:
Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models. CoRR abs/2310.06117 (2023) - [i36]Zhaocheng Zhu, Yuan Xue, Xinyun Chen, Denny Zhou, Jian Tang, Dale Schuurmans, Hanjun Dai:
Large Language Models can Learn Rules. CoRR abs/2310.07064 (2023) - [i35]Jeffrey Zhou, Tianjian Lu, Swaroop Mishra, Siddhartha Brahma, Sujoy Basu, Yi Luan, Denny Zhou, Le Hou:
Instruction-Following Evaluation for Large Language Models. CoRR abs/2311.07911 (2023) - [i34]Xinyun Chen, Renat Aksitov, Uri Alon, Jie Ren, Kefan Xiao, Pengcheng Yin, Sushant Prakash, Charles Sutton, Xuezhi Wang, Denny Zhou:
Universal Self-Consistency for Large Language Model Generation. CoRR abs/2311.17311 (2023) - 2022
- [j1]Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus:
Emergent Abilities of Large Language Models. Trans. Mach. Learn. Res. 2022 (2022) - [c21]Le Hou, Richard Yuanzhe Pang, Tianyi Zhou, Yuexin Wu, Xinying Song, Xiaodan Song, Denny Zhou:
Token Dropping for Efficient BERT Pretraining. ACL (1) 2022: 3774-3784 - [c20]Yingwei Li, Adams Wei Yu, Tianjian Meng, Benjamin Caine, Jiquan Ngiam, Daiyi Peng, Junyang Shen, Yifeng Lu, Denny Zhou, Quoc V. Le, Alan L. Yuille, Mingxing Tan:
DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection. CVPR 2022: 17161-17170 - [c19]Wuyang Chen, Xianzhi Du, Fan Yang, Lucas Beyer, Xiaohua Zhai, Tsung-Yi Lin, Huizhong Chen, Jing Li, Xiaodan Song, Zhangyang Wang, Denny Zhou:
A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation. ECCV (10) 2022: 711-727 - [c18]Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wang, Denny Zhou:
Auto-scaling Vision Transformers without Training. ICLR 2022 - [c17]Zhuoning Yuan, Yuexin Wu, Zi-Hao Qiu, Xianzhi Du, Lijun Zhang, Denny Zhou, Tianbao Yang:
Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance. ICML 2022: 25760-25782 - [c16]Hongyu Ren, Hanjun Dai, Bo Dai, Xinyun Chen, Denny Zhou, Jure Leskovec, Dale Schuurmans:
SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs. KDD 2022: 1472-1482 - [c15]Ziyu Jiang, Xuxi Chen, Xueqin Huang, Xianzhi Du, Denny Zhou, Zhangyang Wang:
Back Razor: Memory-Efficient Transfer Learning by Self-Sparsified Backpropagation. NeurIPS 2022 - [c14]Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed H. Chi, Quoc V. Le, Denny Zhou:
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. NeurIPS 2022 - [i33]Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Ed H. Chi, Quoc Le, Denny Zhou:
Chain of Thought Prompting Elicits Reasoning in Large Language Models. CoRR abs/2201.11903 (2022) - [i32]Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wang, Denny Zhou:
Auto-scaling Vision Transformers without Training. CoRR abs/2202.11921 (2022) - [i31]Zhuoning Yuan, Yuexin Wu, Zi-Hao Qiu, Xianzhi Du, Lijun Zhang, Denny Zhou, Tianbao Yang:
Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance. CoRR abs/2202.12387 (2022) - [i30]Yingwei Li, Adams Wei Yu, Tianjian Meng, Benjamin Caine, Jiquan Ngiam, Daiyi Peng, Junyang Shen, Bo Wu, Yifeng Lu, Denny Zhou, Quoc V. Le, Alan L. Yuille, Mingxing Tan:
DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection. CoRR abs/2203.08195 (2022) - [i29]Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc V. Le, Ed H. Chi, Denny Zhou:
Self-Consistency Improves Chain of Thought Reasoning in Language Models. CoRR abs/2203.11171 (2022) - [i28]Le Hou, Richard Yuanzhe Pang, Tianyi Zhou, Yuexin Wu, Xinying Song, Xiaodan Song, Denny Zhou:
Token Dropping for Efficient BERT Pretraining. CoRR abs/2203.13240 (2022) - [i27]Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-Ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-Hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel:
PaLM: Scaling Language Modeling with Pathways. CoRR abs/2204.02311 (2022) - [i26]Denny Zhou, Nathanael Schärli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Olivier Bousquet, Quoc Le, Ed H. Chi:
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models. CoRR abs/2205.10625 (2022) - [i25]Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus:
Emergent Abilities of Large Language Models. CoRR abs/2206.07682 (2022) - [i24]Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc V. Le, Ed H. Chi, Denny Zhou:
Rationale-Augmented Ensembles in Language Models. CoRR abs/2207.00747 (2022) - [i23]Andrew Drozdov, Nathanael Schärli, Ekin Akyürek, Nathan Scales, Xinying Song, Xinyun Chen, Olivier Bousquet, Denny Zhou:
Compositional Semantic Parsing with Large Language Models. CoRR abs/2209.15003 (2022) - [i22]Zhiqing Sun, Xuezhi Wang, Yi Tay, Yiming Yang, Denny Zhou:
Recitation-Augmented Language Models. CoRR abs/2210.01296 (2022) - [i21]Freda Shi, Mirac Suzgun, Markus Freitag, Xuezhi Wang, Suraj Srivats, Soroush Vosoughi, Hyung Won Chung, Yi Tay, Sebastian Ruder, Denny Zhou, Dipanjan Das, Jason Wei:
Language Models are Multilingual Chain-of-Thought Reasoners. CoRR abs/2210.03057 (2022) - [i20]Ruibo Liu, Jason Wei, Shixiang Shane Gu, Te-Yen Wu, Soroush Vosoughi, Claire Cui, Denny Zhou, Andrew M. Dai:
Mind's Eye: Grounded Language Model Reasoning through Simulation. CoRR abs/2210.05359 (2022) - [i19]Mirac Suzgun, Nathan Scales, Nathanael Schärli, Sebastian Gehrmann, Yi Tay, Hyung Won Chung, Aakanksha Chowdhery, Quoc V. Le, Ed H. Chi, Denny Zhou, Jason Wei:
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them. CoRR abs/2210.09261 (2022) - [i18]Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani:
Transcending Scaling Laws with 0.1% Extra Compute. CoRR abs/2210.11399 (2022) - [i17]Hyung Won Chung, Le Hou, Shayne Longpre, Barret Zoph, Yi Tay, William Fedus, Eric Li, Xuezhi Wang, Mostafa Dehghani, Siddhartha Brahma, Albert Webson, Shixiang Shane Gu, Zhuyun Dai, Mirac Suzgun, Xinyun Chen, Aakanksha Chowdhery, Sharan Narang, Gaurav Mishra, Adams Yu, Vincent Y. Zhao, Yanping Huang, Andrew M. Dai, Hongkun Yu, Slav Petrov, Ed H. Chi, Jeff Dean, Jacob Devlin, Adam Roberts, Denny Zhou, Quoc V. Le, Jason Wei:
Scaling Instruction-Finetuned Language Models. CoRR abs/2210.11416 (2022) - [i16]Tianjun Zhang, Xuezhi Wang, Denny Zhou, Dale Schuurmans, Joseph E. Gonzalez:
TEMPERA: Test-Time Prompting via Reinforcement Learning. CoRR abs/2211.11890 (2022) - [i15]Ekin Akyürek, Dale Schuurmans, Jacob Andreas, Tengyu Ma, Denny Zhou:
What learning algorithm is in-context learning? Investigations with linear models. CoRR abs/2211.15661 (2022) - 2021
- [c13]Sanqiang Zhao, Raghav Gupta, Yang Song, Denny Zhou:
Extremely Small BERT Models from Mixed-Vocabulary Training. EACL 2021: 2753-2759 - [c12]Xinying Song, Alex Salcianu, Yang Song, Dave Dopson, Denny Zhou:
Fast WordPiece Tokenization. EMNLP (1) 2021: 2089-2103 - [c11]Xinyun Chen, Petros Maniatis, Rishabh Singh, Charles Sutton, Hanjun Dai, Max Lin, Denny Zhou:
SpreadsheetCoder: Formula Prediction from Semi-structured Context. ICML 2021: 1661-1672 - [c10]Hongyu Ren, Hanjun Dai, Bo Dai, Xinyun Chen, Michihiro Yasunaga, Haitian Sun, Dale Schuurmans, Jure Leskovec, Denny Zhou:
LEGO: Latent Execution-Guided Reasoning for Multi-Hop Question Answering on Knowledge Graphs. ICML 2021: 8959-8970 - [i14]Xinyun Chen, Petros Maniatis, Rishabh Singh, Charles Sutton, Hanjun Dai, Max Lin, Denny Zhou:
SpreadsheetCoder: Formula Prediction from Semi-structured Context. CoRR abs/2106.15339 (2021) - [i13]Shuo Yang, Le Hou, Xiaodan Song, Qiang Liu, Denny Zhou:
Speeding up Deep Model Training by Sharing Weights and Then Unsharing. CoRR abs/2110.03848 (2021) - [i12]Hongyu Ren, Hanjun Dai, Bo Dai, Xinyun Chen, Denny Zhou, Jure Leskovec, Dale Schuurmans:
SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs. CoRR abs/2110.14890 (2021) - [i11]Wuyang Chen, Xianzhi Du, Fan Yang, Lucas Beyer, Xiaohua Zhai, Tsung-Yi Lin, Huizhong Chen, Jing Li, Xiaodan Song, Zhangyang Wang, Denny Zhou:
A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation. CoRR abs/2112.09747 (2021) - 2020
- [c9]Zhiqing Sun, Hongkun Yu, Xiaodan Song, Renjie Liu, Yiming Yang, Denny Zhou:
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices. ACL 2020: 2158-2170 - [c8]Xinyun Chen, Chen Liang, Adams Wei Yu, Denny Zhou, Dawn Song, Quoc V. Le:
Neural Symbolic Reader: Scalable Integration of Distributed and Symbolic Representations for Reading Comprehension. ICLR 2020 - [c7]Ali Mousavi, Lihong Li, Qiang Liu, Denny Zhou:
Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning. ICLR 2020 - [c6]Mao Ye, Chengyue Gong, Lizhen Nie, Denny Zhou, Adam R. Klivans, Qiang Liu:
Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection. ICML 2020: 10820-10830 - [c5]Denny Zhou, Mao Ye, Chen Chen, Tianjian Meng, Mingxing Tan, Xiaodan Song, Quoc V. Le, Qiang Liu, Dale Schuurmans:
Go Wide, Then Narrow: Efficient Training of Deep Thin Networks. ICML 2020: 11546-11555 - [c4]Yuan Xue, Denny Zhou, Nan Du, Andrew M. Dai, Zhen Xu, Kun Zhang, Claire Cui:
Deep State-Space Generative Model For Correlated Time-to-Event Predictions. KDD 2020: 1552-1562 - [c3]Xinyun Chen, Chen Liang, Adams Wei Yu, Dawn Song, Denny Zhou:
Compositional Generalization via Neural-Symbolic Stack Machines. NeurIPS 2020 - [i10]Mao Ye, Chengyue Gong, Lizhen Nie, Denny Zhou, Adam R. Klivans, Qiang Liu:
Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection. CoRR abs/2003.01794 (2020) - [i9]Ali Mousavi, Lihong Li, Qiang Liu, Denny Zhou:
Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning. CoRR abs/2003.11126 (2020) - [i8]Zhiqing Sun, Hongkun Yu, Xiaodan Song, Renjie Liu, Yiming Yang, Denny Zhou:
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices. CoRR abs/2004.02984 (2020) - [i7]Denny Zhou, Mao Ye, Chen Chen, Tianjian Meng, Mingxing Tan, Xiaodan Song, Quoc V. Le, Qiang Liu, Dale Schuurmans:
Go Wide, Then Narrow: Efficient Training of Deep Thin Networks. CoRR abs/2007.00811 (2020) - [i6]Xinyun Chen, Chen Liang, Adams Wei Yu, Dawn Song, Denny Zhou:
Compositional Generalization via Neural-Symbolic Stack Machines. CoRR abs/2008.06662 (2020) - [i5]Xinying Song, Alex Salcianu, Yang Song, Dave Dopson, Denny Zhou:
Linear-Time WordPiece Tokenization. CoRR abs/2012.15524 (2020)
2010 – 2019
- 2019
- [c2]Honghua Dong, Jiayuan Mao, Tian Lin, Chong Wang, Lihong Li, Denny Zhou:
Neural Logic Machines. ICLR (Poster) 2019 - [i4]