Остановите войну!
for scientists:
default search action
Lukasz Kaiser
- > Home > Persons > Lukasz Kaiser
Publications
- 2024
- [i33]Lukasz Kucinski, Witold Drzewakowski, Mateusz Olko, Piotr Kozakowski, Lukasz Maziarka, Marta Emilia Nowakowska, Lukasz Kaiser, Piotr Milos:
tsGT: Stochastic Time Series Modeling With Transformer. CoRR abs/2403.05713 (2024) - 2022
- [c44]Piotr Kozakowski, Lukasz Kaiser, Henryk Michalewski, Afroz Mohiuddin, Katarzyna Kanska:
Q-Value Weighted Regression: Reinforcement Learning with Limited Data. IJCNN 2022: 1-8 - [c43]Piotr Nawrot, Szymon Tworkowski, Michal Tyrolski, Lukasz Kaiser, Yuhuai Wu, Christian Szegedy, Henryk Michalewski:
Hierarchical Transformers Are More Efficient Language Models. NAACL-HLT (Findings) 2022: 1559-1571 - 2021
- [c42]Krzysztof Marcin Choromanski, Valerii Likhosherstov, David Dohan, Xingyou Song, Andreea Gane, Tamás Sarlós, Peter Hawkins, Jared Quincy Davis, Afroz Mohiuddin, Lukasz Kaiser, David Benjamin Belanger, Lucy J. Colwell, Adrian Weller:
Rethinking Attention with Performers. ICLR 2021 - [c41]Sebastian Jaszczur, Aakanksha Chowdhery, Afroz Mohiuddin, Lukasz Kaiser, Wojciech Gajewski, Henryk Michalewski, Jonni Kanerva:
Sparse is Enough in Scaling Transformers. NeurIPS 2021: 9895-9907 - [i32]Piotr Kozakowski, Lukasz Kaiser, Henryk Michalewski, Afroz Mohiuddin, Katarzyna Kanska:
Q-Value Weighted Regression: Reinforcement Learning with Limited Data. CoRR abs/2102.06782 (2021) - [i31]Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Pondé de Oliveira Pinto, Jared Kaplan, Harrison Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth Barnes, Ariel Herbert-Voss, William Hebgen Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Joshua Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob McGrew, Dario Amodei, Sam McCandlish, Ilya Sutskever, Wojciech Zaremba:
Evaluating Large Language Models Trained on Code. CoRR abs/2107.03374 (2021) - [i30]Piotr Nawrot, Szymon Tworkowski, Michal Tyrolski, Lukasz Kaiser, Yuhuai Wu, Christian Szegedy, Henryk Michalewski:
Hierarchical Transformers Are More Efficient Language Models. CoRR abs/2110.13711 (2021) - [i29]Karl Cobbe, Vineet Kosaraju, Mohammad Bavarian, Mark Chen, Heewoo Jun, Lukasz Kaiser, Matthias Plappert, Jerry Tworek, Jacob Hilton, Reiichiro Nakano, Christopher Hesse, John Schulman:
Training Verifiers to Solve Math Word Problems. CoRR abs/2110.14168 (2021) - [i28]Sebastian Jaszczur, Aakanksha Chowdhery, Afroz Mohiuddin, Lukasz Kaiser, Wojciech Gajewski, Henryk Michalewski, Jonni Kanerva:
Sparse is Enough in Scaling Transformers. CoRR abs/2111.12763 (2021) - 2020
- [c40]Lukasz Kaiser, Mohammad Babaeizadeh, Piotr Milos, Blazej Osinski, Roy H. Campbell, Konrad Czechowski, Dumitru Erhan, Chelsea Finn, Piotr Kozakowski, Sergey Levine, Afroz Mohiuddin, Ryan Sepassi, George Tucker, Henryk Michalewski:
Model Based Reinforcement Learning for Atari. ICLR 2020 - [c39]Nikita Kitaev, Lukasz Kaiser, Anselm Levskaya:
Reformer: The Efficient Transformer. ICLR 2020 - [i27]Nikita Kitaev, Lukasz Kaiser, Anselm Levskaya:
Reformer: The Efficient Transformer. CoRR abs/2001.04451 (2020) - [i26]Krzysztof Choromanski, Valerii Likhosherstov, David Dohan, Xingyou Song, Andreea Gane, Tamás Sarlós, Peter Hawkins, Jared Davis, Afroz Mohiuddin, Lukasz Kaiser, David Belanger, Lucy J. Colwell, Adrian Weller:
Rethinking Attention with Performers. CoRR abs/2009.14794 (2020) - 2019
- [c38]Mostafa Dehghani, Stephan Gouws, Oriol Vinyals, Jakob Uszkoreit, Lukasz Kaiser:
Universal Transformers. ICLR (Poster) 2019 - [c37]Yang Li, Lukasz Kaiser, Samy Bengio, Si Si:
Area Attention. ICML 2019: 3846-3855 - [i25]Lukasz Kaiser, Mohammad Babaeizadeh, Piotr Milos, Blazej Osinski, Roy H. Campbell, Konrad Czechowski, Dumitru Erhan, Chelsea Finn, Piotr Kozakowski, Sergey Levine, Ryan Sepassi, George Tucker, Henryk Michalewski:
Model-Based Reinforcement Learning for Atari. CoRR abs/1903.00374 (2019) - [i24]Urvashi Khandelwal, Kevin Clark, Dan Jurafsky, Lukasz Kaiser:
Sample Efficient Text Summarization Using a Single Pre-Trained Transformer. CoRR abs/1905.08836 (2019) - [i23]Daniel Duckworth, Arvind Neelakantan, Ben Goodrich, Lukasz Kaiser, Samy Bengio:
Parallel Scheduled Sampling. CoRR abs/1906.04331 (2019) - 2018
- [c36]Mia Xu Chen, Orhan Firat, Ankur Bapna, Melvin Johnson, Wolfgang Macherey, George F. Foster, Llion Jones, Mike Schuster, Noam Shazeer, Niki Parmar, Ashish Vaswani, Jakob Uszkoreit, Lukasz Kaiser, Zhifeng Chen, Yonghui Wu, Macduff Hughes:
The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation. ACL (1) 2018: 76-86 - [c35]Ashish Vaswani, Samy Bengio, Eugene Brevdo, François Chollet, Aidan N. Gomez, Stephan Gouws, Llion Jones, Lukasz Kaiser, Nal Kalchbrenner, Niki Parmar, Ryan Sepassi, Noam Shazeer, Jakob Uszkoreit:
Tensor2Tensor for Neural Machine Translation. AMTA (1) 2018: 193-199 - [c34]Aidan N. Gomez, Sicong Huang, Ivan Zhang, Bryan M. Li, Muhammad Osama, Lukasz Kaiser:
Unsupervised Cipher Cracking Using Discrete GANs. ICLR (Poster) 2018 - [c33]Lukasz Kaiser, Aidan N. Gomez, François Chollet:
Depthwise Separable Convolutions for Neural Machine Translation. ICLR (Poster) 2018 - [c32]Peter J. Liu, Mohammad Saleh, Etienne Pot, Ben Goodrich, Ryan Sepassi, Lukasz Kaiser, Noam Shazeer:
Generating Wikipedia by Summarizing Long Sequences. ICLR (Poster) 2018 - [c31]Lukasz Kaiser, Samy Bengio, Aurko Roy, Ashish Vaswani, Niki Parmar, Jakob Uszkoreit, Noam Shazeer:
Fast Decoding in Sequence Models Using Discrete Latent Variables. ICML 2018: 2395-2404 - [c30]Niki Parmar, Ashish Vaswani, Jakob Uszkoreit, Lukasz Kaiser, Noam Shazeer, Alexander Ku, Dustin Tran:
Image Transformer. ICML 2018: 4052-4061 - [i22]Aidan N. Gomez, Sicong Huang, Ivan Zhang, Bryan M. Li, Muhammad Osama, Lukasz Kaiser:
Unsupervised Cipher Cracking Using Discrete GANs. CoRR abs/1801.04883 (2018) - [i21]Lukasz Kaiser, Samy Bengio:
Discrete Autoencoders for Sequence Models. CoRR abs/1801.09797 (2018) - [i20]Peter J. Liu, Mohammad Saleh, Etienne Pot, Ben Goodrich, Ryan Sepassi, Lukasz Kaiser, Noam Shazeer:
Generating Wikipedia by Summarizing Long Sequences. CoRR abs/1801.10198 (2018) - [i19]Niki Parmar, Ashish Vaswani, Jakob Uszkoreit, Lukasz Kaiser, Noam Shazeer, Alexander Ku:
Image Transformer. CoRR abs/1802.05751 (2018) - [i18]Lukasz Kaiser, Aurko Roy, Ashish Vaswani, Niki Parmar, Samy Bengio, Jakob Uszkoreit, Noam Shazeer:
Fast Decoding in Sequence Models using Discrete Latent Variables. CoRR abs/1803.03382 (2018) - [i17]Ashish Vaswani, Samy Bengio, Eugene Brevdo, François Chollet, Aidan N. Gomez, Stephan Gouws, Llion Jones, Lukasz Kaiser, Nal Kalchbrenner, Niki Parmar, Ryan Sepassi, Noam Shazeer, Jakob Uszkoreit:
Tensor2Tensor for Neural Machine Translation. CoRR abs/1803.07416 (2018) - [i16]Mostafa Dehghani, Stephan Gouws, Oriol Vinyals, Jakob Uszkoreit, Lukasz Kaiser:
Universal Transformers. CoRR abs/1807.03819 (2018) - [i15]Yang Li, Lukasz Kaiser, Samy Bengio, Si Si:
Area Attention. CoRR abs/1810.10126 (2018) - 2017
- [c29]Lukasz Kaiser, Ofir Nachum, Aurko Roy, Samy Bengio:
Learning to Remember Rare Events. ICLR (Poster) 2017 - [c28]Gabriel Pereyra, George Tucker, Jan Chorowski, Lukasz Kaiser, Geoffrey E. Hinton:
Regularizing Neural Networks by Penalizing Confident Output Distributions. ICLR (Workshop) 2017 - [c27]Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin:
Attention is All you Need. NIPS 2017: 5998-6008 - [i14]Gabriel Pereyra, George Tucker, Jan Chorowski, Lukasz Kaiser, Geoffrey E. Hinton:
Regularizing Neural Networks by Penalizing Confident Output Distributions. CoRR abs/1701.06548 (2017) - [i13]Lukasz Kaiser, Ofir Nachum, Aurko Roy, Samy Bengio:
Learning to Remember Rare Events. CoRR abs/1703.03129 (2017) - [i12]Lukasz Kaiser, Aidan N. Gomez, François Chollet:
Depthwise Separable Convolutions for Neural Machine Translation. CoRR abs/1706.03059 (2017) - [i11]Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin:
Attention Is All You Need. CoRR abs/1706.03762 (2017) - [i10]Lukasz Kaiser, Aidan N. Gomez, Noam Shazeer, Ashish Vaswani, Niki Parmar, Llion Jones, Jakob Uszkoreit:
One Model To Learn Them All. CoRR abs/1706.05137 (2017) - 2016
- [c26]Lukasz Kaiser, Samy Bengio:
Can Active Memory Replace Attention? NIPS 2016: 3774-3782 - [c25]Lukasz Kaiser, Ilya Sutskever:
Neural GPUs Learn Algorithms. ICLR (Poster) 2016 - [c24]Minh-Thang Luong, Quoc V. Le, Ilya Sutskever, Oriol Vinyals, Lukasz Kaiser:
Multi-task Sequence to Sequence Learning. ICLR (Poster) 2016 - [i9]Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Gregory S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian J. Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Józefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mané, Rajat Monga, Sherry Moore, Derek Gordon Murray, Chris Olah, Mike Schuster, Jonathon Shlens, Benoit Steiner, Ilya Sutskever, Kunal Talwar, Paul A. Tucker, Vincent Vanhoucke, Vijay Vasudevan, Fernanda B. Viégas, Oriol Vinyals, Pete Warden, Martin Wattenberg, Martin Wicke, Yuan Yu, Xiaoqiang Zheng:
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. CoRR abs/1603.04467 (2016) - [i7]Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, Lukasz Kaiser, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa, Keith Stevens, George Kurian, Nishant Patil, Wei Wang, Cliff Young, Jason Smith, Jason Riesa, Alex Rudnick, Oriol Vinyals, Greg Corrado, Macduff Hughes, Jeffrey Dean:
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation. CoRR abs/1609.08144 (2016) - [i6]Lukasz Kaiser, Samy Bengio:
Can Active Memory Replace Attention? CoRR abs/1610.08613 (2016) - 2015
- [c22]Katja Filippova, Enrique Alfonseca, Carlos A. Colmenares, Lukasz Kaiser, Oriol Vinyals:
Sentence Compression by Deletion with LSTMs. EMNLP 2015: 360-368 - [c20]Oriol Vinyals, Lukasz Kaiser, Terry Koo, Slav Petrov, Ilya Sutskever, Geoffrey E. Hinton:
Grammar as a Foreign Language. NIPS 2015: 2773-2781 - [i5]Arvind Neelakantan, Luke Vilnis, Quoc V. Le, Ilya Sutskever, Lukasz Kaiser, Karol Kurach, James Martens:
Adding Gradient Noise Improves Learning for Very Deep Networks. CoRR abs/1511.06807 (2015) - 2014
- [i3]Oriol Vinyals, Lukasz Kaiser, Terry Koo, Slav Petrov, Ilya Sutskever, Geoffrey E. Hinton:
Grammar as a Foreign Language. CoRR abs/1412.7449 (2014)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-04-07 22:29 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint