Kyle Lo
Person information
- affiliation: Allen Institute for Artificial Intelligence, Seattle, Washington, USA
2020 – today
- 2024
- [j11]Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney, Aniket Kittur, Hyeonsu B. Kang, Egor Klevak, Bailey Kuehl, Michael Langan, Matt Latzke, Jaron Lochner, Kelsey MacMillan, Eric Marsh, Tyler Murray, Aakanksha Naik, Ngoc-Uyen Nguyen, Srishti Palani, Soya Park, Caroline Paulic, Napol Rachatasumrit, Smita Rao, Paul Sayre, Zejiang Shen, Pao Siangliulue, Luca Soldaini, Huy Tran, Madeleine van Zuylen, Lucy Lu Wang, Chris Wilhelm, Caroline Wu, Jiangjiang Yang, Angele Zamarron, Marti A. Hearst, Daniel S. Weld:
The Semantic Reader Project. Commun. ACM 67(10): 50-61 (2024) - [c43]Jan Trienes, Sebastian Joseph, Jörg Schlötterer, Christin Seifert, Kyle Lo, Wei Xu, Byron C. Wallace, Junyi Jessy Li:
InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification. ACL (1) 2024: 4263-4294 - [c42]Fangyuan Xu, Kyle Lo, Luca Soldaini, Bailey Kuehl, Eunsol Choi, David Wadden:
KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions. ACL (Findings) 2024: 12969-12990 - [c41]Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Raghavi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Evan Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo:
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. ACL (1) 2024: 15725-15788 - [c40]Dirk Groeneveld, Iz Beltagy, Evan Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. ACL (1) 2024: 15789-15809 - [c39]Tal August, Kyle Lo, Noah A. Smith, Katharina Reinecke:
Know Your Audience: The benefits and pitfalls of generating plain language summaries beyond the "general" audience. CHI 2024: 14:1-14:26 - [c38]Orion Weller, Kyle Lo, David Wadden, Dawn J. Lawrie, Benjamin Van Durme, Arman Cohan, Luca Soldaini:
When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets. EACL (Findings) 2024: 1987-2003 - [c37]Li Lucy, Tal August, Rose E. Wang, Luca Soldaini, Courtney Allison, Kyle Lo:
MathFish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula. EMNLP (Findings) 2024: 5644-5673 - [c36]Benjamin Newman, Yoonjoo Lee, Aakanksha Naik, Pao Siangliulue, Raymond Fok, Juho Kim, Daniel S. Weld, Joseph Chee Chang, Kyle Lo:
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models. EMNLP 2024: 9612-9631 - [c35]Marzena Karpinska, Katherine Thai, Kyle Lo, Tanya Goyal, Mohit Iyyer:
One Thousand and One Pairs: A "novel" challenge for long-context language models. EMNLP 2024: 17048-17085 - [c34]Yapei Chang, Kyle Lo, Tanya Goyal, Mohit Iyyer:
BooookScore: A systematic exploration of book-length summarization in the era of LLMs. ICLR 2024 - [i64]Jan Trienes, Sebastian Joseph, Jörg Schlötterer, Christin Seifert, Kyle Lo, Wei Xu, Byron C. Wallace, Junyi Jessy Li:
InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification. CoRR abs/2401.16475 (2024) - [i63]Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Raghavi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo:
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. CoRR abs/2402.00159 (2024) - [i62]Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. CoRR abs/2402.00838 (2024) - [i61]Fangyuan Xu, Kyle Lo, Luca Soldaini, Bailey Kuehl, Eunsol Choi, David Wadden:
KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions. CoRR abs/2403.03866 (2024) - [i60]Tal August, Kyle Lo, Noah A. Smith, Katharina Reinecke:
Know Your Audience: The benefits and pitfalls of generating plain language summaries beyond the "general" audience. CoRR abs/2403.04979 (2024) - [i59]Orion Weller, Benjamin Chang, Sean MacAvaney, Kyle Lo, Arman Cohan, Benjamin Van Durme, Dawn J. Lawrie, Luca Soldaini:
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions. CoRR abs/2403.15246 (2024) - [i58]Yekyung Kim, Yapei Chang, Marzena Karpinska, Aparna Garimella, Varun Manjunatha, Kyle Lo, Tanya Goyal, Mohit Iyyer:
FABLES: Evaluating faithfulness and content selection in book-length summarization. CoRR abs/2404.01261 (2024) - [i57]David Wadden, Kejian Shi, Jacob Morrison, Aakanksha Naik, Shruti Singh, Nitzan Barzilay, Kyle Lo, Tom Hope, Luca Soldaini, Shannon Zejiang Shen, Doug Downey, Hannaneh Hajishirzi, Arman Cohan:
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature. CoRR abs/2406.07835 (2024) - [i56]Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Yitzhak Gadre, Hritik Bansal, Etash Kumar Guha, Sedrick Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner, Maciej Kilian, Hanlin Zhang, Rulin Shao, Sarah M. Pratt, Sunny Sanyal, Gabriel Ilharco, Giannis Daras, Kalyani Marathe, Aaron Gokaslan, Jieyu Zhang, Khyathi Raghavi Chandu, Thao Nguyen, Igor Vasiljevic, Sham M. Kakade, Shuran Song, Sujay Sanghavi, Fartash Faghri, Sewoong Oh, Luke Zettlemoyer, Kyle Lo, Alaaeldin El-Nouby, Hadi Pouransari, Alexander Toshev, Stephanie Wang, Dirk Groeneveld, Luca Soldaini, Pang Wei Koh, Jenia Jitsev, Thomas Kollar, Alexandros G. Dimakis, Yair Carmon, Achal Dave, Ludwig Schmidt, Vaishaal Shankar:
DataComp-LM: In search of the next generation of training sets for language models. CoRR abs/2406.11794 (2024) - [i55]Marzena Karpinska, Katherine Thai, Kyle Lo, Tanya Goyal, Mohit Iyyer:
One Thousand and One Pairs: A "novel" challenge for long-context language models. CoRR abs/2406.16264 (2024) - [i54]Shayne Longpre, Stella Biderman, Alon Albalak, Hailey Schoelkopf, Daniel McDuff, Sayash Kapoor, Kevin Klyman, Kyle Lo, Gabriel Ilharco, Nay San, Maribeth Rauh, Aviya Skowron, Bertie Vidgen, Laura Weidinger, Arvind Narayanan, Victor Sanh, David Ifeoluwa Adelani, Percy Liang, Rishi Bommasani, Peter Henderson, Sasha Luccioni, Yacine Jernite, Luca Soldaini:
The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources. CoRR abs/2406.16746 (2024) - [i53]Li Lucy, Tal August, Rose E. Wang, Luca Soldaini, Courtney Allison, Kyle Lo:
Evaluating Language Model Math Reasoning via Grounding in Educational Curricula. CoRR abs/2408.04226 (2024) - [i52]Niklas Muennighoff, Luca Soldaini, Dirk Groeneveld, Kyle Lo, Jacob Morrison, Sewon Min, Weijia Shi, Pete Walsh, Oyvind Tafjord, Nathan Lambert, Yuling Gu, Shane Arora, Akshita Bhagia, Dustin Schwenk, David Wadden, Alexander Wettig, Binyuan Hui, Tim Dettmers, Douwe Kiela, Ali Farhadi, Noah A. Smith, Pang Wei Koh, Amanpreet Singh, Hannaneh Hajishirzi:
OLMoE: Open Mixture-of-Experts Language Models. CoRR abs/2409.02060 (2024) - [i51]Hyunji Lee, Luca Soldaini, Arman Cohan, Minjoon Seo, Kyle Lo:
RouterRetriever: Exploring the Benefits of Routing over Multiple Expert Embedding Models. CoRR abs/2409.02685 (2024) - [i50]Matt Deitke, Christopher Clark, Sangho Lee, Rohun Tripathi, Yue Yang, Jae Sung Park, Mohammadreza Salehi, Niklas Muennighoff, Kyle Lo, Luca Soldaini, Jiasen Lu, Taira Anderson, Erin Bransom, Kiana Ehsani, Huong Ngo, Yen-Sung Chen, Ajay Patel, Mark Yatskar, Chris Callison-Burch, Andrew Head, Rose Hendrix, Favyen Bastani, Eli VanderBilt, Nathan Lambert, Yvonne Chou, Arnavi Chheda, Jenna Sparks, Sam Skjonsberg, Michael Schmitz, Aaron Sarnat, Byron Bischoff, Pete Walsh, Chris Newell, Piper Wolters, Tanmay Gupta, Kuo-Hao Zeng, Jon Borchardt, Dirk Groeneveld, Jen Dumas, Crystal Nam, Sophie Lebrecht, Caitlin Wittlif, Carissa Schoenick, Oscar Michel, Ranjay Krishna, Luca Weihs, Noah A. Smith, Hannaneh Hajishirzi, Ross B. Girshick, Ali Farhadi, Aniruddha Kembhavi:
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models. CoRR abs/2409.17146 (2024) - [i49]Benjamin Newman, Yoonjoo Lee, Aakanksha Naik, Pao Siangliulue, Raymond Fok, Juho Kim, Daniel S. Weld, Joseph Chee Chang, Kyle Lo:
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models. CoRR abs/2410.22360 (2024)
- 2023
- [j10]Benjamin Charles Germain Lee, Doug Downey, Kyle Lo, Daniel S. Weld:
LIMEADE: From AI Explanations to Advice Taking. ACM Trans. Interact. Intell. Syst. 13(4): 24:1-24:29 (2023) - [j9]Tal August, Lucy Lu Wang, Jonathan Bragg, Marti A. Hearst, Andrew Head, Kyle Lo:
Paper Plain: Making Medical Research Papers Approachable to Healthcare Consumers with Natural Language Processing. ACM Trans. Comput. Hum. Interact. 30(5): 74:1-74:38 (2023) - [c33]Catherine Chen, Zejiang Shen, Dan Klein, Gabriel Stanovsky, Doug Downey, Kyle Lo:
Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents. ACL (Findings) 2023: 13345-13360 - [c32]Joseph Chee Chang, Amy X. Zhang, Jonathan Bragg, Andrew Head, Kyle Lo, Doug Downey, Daniel S. Weld:
CiteSee: Augmenting Citations in Scientific Papers with Persistent and Personalized Historical Context. CHI 2023: 737:1-737:15 - [c31]Kalpesh Krishna, Erin Bransom, Bailey Kuehl, Mohit Iyyer, Pradeep Dasigi, Arman Cohan, Kyle Lo:
LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization. EACL 2023: 1642-1661 - [c30]Kyle Lo, Zejiang Shen, Benjamin Newman, Joseph Chee Chang, Russell Authur, Erin Bransom, Stefan Candra, Yoganand Chandrasekhar, Regan Huff, Bailey Kuehl, Amanpreet Singh, Chris Wilhelm, Angele Zamarron, Marti A. Hearst, Daniel S. Weld, Doug Downey, Luca Soldaini:
PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents. EMNLP (Demos) 2023: 495-507 - [c29]Benjamin Newman, Luca Soldaini, Raymond Fok, Arman Cohan, Kyle Lo:
A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents. EMNLP 2023: 3194-3212 - [c28]Kevin Lin, Kyle Lo, Joseph Gonzalez, Dan Klein:
Decomposing Complex Queries for Tip-of-the-tongue Retrieval. EMNLP (Findings) 2023: 5521-5533 - [c27]John M. Giorgi, Luca Soldaini, Bo Wang, Gary D. Bader, Kyle Lo, Lucy Lu Wang, Arman Cohan:
Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval. EMNLP (Findings) 2023: 8177-8199 - [c26]Raymond Fok, Hita Kambhamettu, Luca Soldaini, Jonathan Bragg, Kyle Lo, Marti A. Hearst, Andrew Head, Daniel S. Weld:
Scim: Intelligent Skimming Support for Scientific Papers. IUI 2023: 476-490 - [i48]Rodney Kinney, Chloe Anastasiades, Russell Authur, Iz Beltagy, Jonathan Bragg, Alexandra Buraczynski, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Arman Cohan, Miles Crawford, Doug Downey, Jason Dunkelberger, Oren Etzioni, Rob Evans, Sergey Feldman, Joseph Gorney, David Graham, Fangzhou Hu, Regan Huff, Daniel King, Sebastian Kohlmeier, Bailey Kuehl, Michael Langan, Daniel Lin, Haokun Liu, Kyle Lo, Jaron Lochner, Kelsey MacMillan, Tyler Murray, Chris Newell, Smita Rao, Shaurya Rohatgi, Paul Sayre, Zejiang Shen, Amanpreet Singh, Luca Soldaini, Shivashankar Subramanian, Amber Tanaka, Alex D. Wade, Linda Wagner, Lucy Lu Wang, Chris Wilhelm, Caroline Wu, Jiangjiang Yang, Angele Zamarron, Madeleine van Zuylen, Daniel S. Weld:
The Semantic Scholar Open Data Platform. CoRR abs/2301.10140 (2023) - [i47]Kalpesh Krishna, Erin Bransom, Bailey Kuehl, Mohit Iyyer, Pradeep Dasigi, Arman Cohan, Kyle Lo:
LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization. CoRR abs/2301.13298 (2023) - [i46]Joseph Chee Chang, Amy X. Zhang, Jonathan Bragg, Andrew Head, Kyle Lo, Doug Downey, Daniel S. Weld:
CiteSee: Augmenting Citations in Scientific Papers with Persistent and Personalized Historical Context. CoRR abs/2302.07302 (2023) - [i45]Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral, Teven Le Scao, Leandro von Werra, Chenghao Mou, Eduardo González Ponferrada, Huu Nguyen, Jörg Frohberg, Mario Sasko, Quentin Lhoest, Angelina McMillan-Major, Gérard Dupont, Stella Biderman, Anna Rogers, Loubna Ben Allal, Francesco De Toni, Giada Pistilli, Olivier Nguyen, Somaieh Nikpoor, Maraim Masoud, Pierre Colombo, Javier de la Rosa, Paulo Villegas, Tristan Thrush, Shayne Longpre, Sebastian Nagel, Leon Weber, Manuel Muñoz, Jian Zhu, Daniel van Strien, Zaid Alyafeai, Khalid Almubarak, Minh Chien Vu, Itziar Gonzalez-Dios, Aitor Soroa, Kyle Lo, Manan Dey, Pedro Ortiz Suarez, Aaron Gokaslan, Shamik Bose, David Ifeoluwa Adelani, Long Phan, Hieu Tran, Ian Yu, Suhas Pai, Jenny Chim, Violette Lepercq, Suzana Ilic, Margaret Mitchell, Sasha Luccioni, Yacine Jernite:
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset. CoRR abs/2303.03915 (2023) - [i44]Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney, Aniket Kittur, Hyeonsu B. Kang, Egor Klevak, Bailey Kuehl, Michael Langan, Matt Latzke, Jaron Lochner, Kelsey MacMillan, Eric Marsh, Tyler Murray, Aakanksha Naik, Ngoc-Uyen Nguyen, Srishti Palani, Soya Park, Caroline Paulic, Napol Rachatasumrit, Smita Rao, Paul Sayre, Zejiang Shen, Pao Siangliulue, Luca Soldaini, Huy Tran, Madeleine van Zuylen, Lucy Lu Wang, Chris Wilhelm, Caroline Wu, Jiangjiang Yang, Angele Zamarron, Marti A. Hearst, Daniel S. Weld:
The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces. CoRR abs/2303.14334 (2023) - [i43]Zejiang Shen, Tal August, Pao Siangliulue, Kyle Lo, Jonathan Bragg, Jeff Hammerbacher, Doug Downey, Joseph Chee Chang, David A. Sontag:
Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks. CoRR abs/2304.02623 (2023) - [i42]Anna Martin-Boyle, Andrew Head, Kyle Lo, Risham Sidhu, Marti A. Hearst, Dongyeop Kang:
Complex Mathematical Symbol Definition Structures: A Dataset and Model for Coordination Resolution in Definition Extraction. CoRR abs/2305.14660 (2023) - [i41]Benjamin Newman, Luca Soldaini, Raymond Fok, Arman Cohan, Kyle Lo:
A Controllable QA-based Framework for Decontextualization. CoRR abs/2305.14772 (2023) - [i40]Kevin Lin, Kyle Lo, Joseph E. Gonzalez, Dan Klein:
Decomposing Complex Queries for Tip-of-the-tongue Retrieval. CoRR abs/2305.15053 (2023) - [i39]Catherine Chen, Zejiang Shen, Dan Klein, Gabriel Stanovsky, Doug Downey, Kyle Lo:
Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents. CoRR abs/2306.01058 (2023) - [i38]Hao Peng, Qingqing Cao, Jesse Dodge, Matthew E. Peters, Jared Fernandez, Tom Sherborne, Kyle Lo, Sam Skjonsberg, Emma Strubell, Darrell Plessas, Iz Beltagy, Evan Pete Walsh, Noah A. Smith, Hannaneh Hajishirzi:
Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation. CoRR abs/2307.09701 (2023) - [i37]Orion Weller, Kyle Lo, David Wadden, Dawn J. Lawrie, Benjamin Van Durme, Arman Cohan, Luca Soldaini:
When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets. CoRR abs/2309.08541 (2023) - [i36]Yapei Chang, Kyle Lo, Tanya Goyal, Mohit Iyyer:
BooookScore: A systematic exploration of book-length summarization in the era of LLMs. CoRR abs/2310.00785 (2023) - [i35]Hancheng Cao, Jesse Dodge, Kyle Lo, Daniel A. McFarland, Lucy Lu Wang:
The Rise of Open Science: Tracking the Evolution and Perceived Value of Data and Methods Link-Sharing Practices. CoRR abs/2310.03193 (2023) - [i34]Hyunji Lee, Luca Soldaini, Arman Cohan, Minjoon Seo, Kyle Lo:
Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders. CoRR abs/2311.09765 (2023) - [i33]Ian Magnusson, Akshita Bhagia, Valentin Hofmann, Luca Soldaini, Ananya Harsh Jha, Oyvind Tafjord, Dustin Schwenk, Evan Pete Walsh, Yanai Elazar, Kyle Lo, Dirk Groeneveld, Iz Beltagy, Hannaneh Hajishirzi, Noah A. Smith, Kyle Richardson, Jesse Dodge:
Paloma: A Benchmark for Evaluating Language Model Fit. CoRR abs/2312.10523 (2023)
- 2022
- [j8]Michael J. Cafarella, Michael R. Anderson, Iz Beltagy, Arie Cattan, Sarah E. Chasins, Ido Dagan, Doug Downey, Oren Etzioni, Sergey Feldman, Tian Gao, Tom Hope, Kexin Huang, Sophie Johnson, Daniel King, Kyle Lo, Yuze Lou, Matthew D. Shapiro, Dinghao Shen, Shivashankar Subramanian, Lucy Lu Wang, Yuning Wang, Yitong Wang, Daniel S. Weld, Jenny M. Vo-Phamhi, Anna Zeng, Jiayun Zou:
Infrastructure for Rapid Open Knowledge Network Development. AI Mag. 43(1): 59-68 (2022) - [j7]Zejiang Shen, Kyle Lo, Lucy Lu Wang, Bailey Kuehl, Daniel S. Weld, Doug Downey:
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups. Trans. Assoc. Comput. Linguistics 10: 376-392 (2022) - [c25]Dustin Wright, David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Isabelle Augenstein, Lucy Lu Wang:
Generating Scientific Claims for Zero-Shot Scientific Fact Checking. ACL (1) 2022: 2448-2460 - [c24]Marissa Radensky, Doug Downey, Kyle Lo, Zoran Popovic, Daniel S. Weld:
Exploring the Role of Local and Global Explanations in Recommender Systems. CHI Extended Abstracts 2022: 290:1-290:7 - [c23]Arman Cohan, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Drahomira Herrmannova, Petr Knoth, Kyle Lo, Philipp Mayr, Michal Shmueli-Scheuer, Anita de Waard, Lucy Lu Wang:
Overview of the Third Workshop on Scholarly Document Processing. SDP@COLING 2022: 1-6 - [c22]Sonia K. Murthy, Kyle Lo, Daniel King, Chandra Bhagavatula, Bailey Kuehl, Sophie Johnson, Jonathan Borchardt, Daniel S. Weld, Tom Hope, Doug Downey:
ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts. EMNLP (Demos) 2022: 200-213 - [c21]David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Iz Beltagy, Lucy Lu Wang, Hannaneh Hajishirzi:
SciFact-Open: Towards open-domain scientific claim verification. EMNLP (Findings) 2022: 4719-4734 - [c20]Yacine Jernite, Huu Nguyen, Stella Biderman, Anna Rogers, Maraim Masoud, Valentin Danchev, Samson Tan, Alexandra Sasha Luccioni, Nishant Subramani, Isaac Johnson, Gérard Dupont, Jesse Dodge, Kyle Lo, Zeerak Talat, Dragomir R. Radev, Aaron Gokaslan, Somaieh Nikpoor, Peter Henderson, Rishi Bommasani, Margaret Mitchell:
Data Governance in the Age of Large-Scale Data-Driven Language Technology. FAccT 2022: 2206-2222 - [c19]David Wadden, Kyle Lo, Lucy Lu Wang, Arman Cohan, Iz Beltagy, Hannaneh Hajishirzi:
MultiVerS: Improving scientific claim verification with weak supervision and full-document context. NAACL-HLT (Findings) 2022: 61-76 - [c18]Anne Lauscher, Brandon Ko, Bailey Kuehl, Sophie Johnson, Arman Cohan, David Jurgens, Kyle Lo:
MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting. NAACL-HLT 2022: 1875-1889 - [c17]Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral, Teven Le Scao, Leandro von Werra, Chenghao Mou, Eduardo González Ponferrada, Huu Nguyen, Jörg Frohberg, Mario Sasko, Quentin Lhoest, Angelina McMillan-Major, Gérard Dupont, Stella Biderman, Anna Rogers, Loubna Ben Allal, Francesco De Toni, Giada Pistilli, Olivier Nguyen, Somaieh Nikpoor, Maraim Masoud, Pierre Colombo, Javier de la Rosa, Paulo Villegas, Tristan Thrush, Shayne Longpre, Sebastian Nagel, Leon Weber, Manuel Muñoz, Jian Zhu, Daniel van Strien, Zaid Alyafeai, Khalid Almubarak, Minh Chien Vu, Itziar Gonzalez-Dios, Aitor Soroa, Kyle Lo, Manan Dey, Pedro Ortiz Suarez, Aaron Gokaslan, Shamik Bose, David Ifeoluwa Adelani, Long Phan, Hieu Tran, Ian Yu, Suhas Pai, Jenny Chim, Violette Lepercq, Suzana Ilic, Margaret Mitchell, Alexandra Sasha Luccioni, Yacine Jernite:
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset. NeurIPS 2022 - [c16]Zejiang Shen, Kyle Lo, Lauren Yu, Nathan Dahlberg, Margo Schlanger, Doug Downey:
Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities. NeurIPS 2022 - [e1]Arman Cohan, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Drahomira Herrmannova, Petr Knoth, Kyle Lo, Philipp Mayr, Michal Shmueli-Scheuer, Anita de Waard, Lucy Lu Wang:
Proceedings of the Third Workshop on Scholarly Document Processing, SDP@COLING 2022, Gyeongju, Republic of Korea, October 12 - 17, 2022. Association for Computational Linguistics 2022 [contents] - [i32]Tal August, Lucy Lu Wang, Jonathan Bragg, Marti A. Hearst, Andrew Head, Kyle Lo:
Paper Plain: Making Medical Research Papers Approachable to Healthcare Consumers with Natural Language Processing. CoRR abs/2203.00130 (2022) - [i31]Dustin Wright, David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Isabelle Augenstein, Lucy Lu Wang:
Generating Scientific Claims for Zero-Shot Scientific Fact Checking. CoRR abs/2203.12990 (2022) - [i30]Raymond Fok, Andrew Head, Jonathan Bragg, Kyle Lo, Marti A. Hearst, Daniel S. Weld:
Scim: Intelligent Faceted Highlights for Interactive, Multi-Pass Skimming of Scientific Papers. CoRR abs/2205.04561 (2022) - [i29]Sonia K. Murthy, Kyle Lo, Daniel King, Chandra Bhagavatula, Bailey Kuehl, Sophie Johnson, Jonathan Borchardt, Daniel S. Weld, Tom Hope, Doug Downey:
ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts. CoRR abs/2205.06982 (2022) - [i28]Yacine Jernite, Huu Nguyen, Stella Biderman, Anna Rogers, Maraim Masoud, Valentin Danchev, Samson Tan, Alexandra Sasha Luccioni, Nishant Subramani, Gérard Dupont, Jesse Dodge, Kyle Lo, Zeerak Talat, Isaac Johnson, Dragomir R. Radev, Somaieh Nikpoor, Jörg Frohberg, Aaron Gokaslan, Peter Henderson, Rishi Bommasani, Margaret Mitchell:
Data Governance in the Age of Large-Scale Data-Driven Language Technology. CoRR abs/2206.03216 (2022) - [i27]Zejiang Shen, Kyle Lo, Lauren Yu, Nathan Dahlberg, Margo Schlanger, Doug Downey:
Multi-LexSum: Real-World Summaries of Civil Rights Lawsuits at Multiple Granularities. CoRR abs/2206.10883 (2022) - [i26]David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Iz Beltagy, Lucy Lu Wang, Hannaneh Hajishirzi:
SciFact-Open: Towards open-domain scientific claim verification. CoRR abs/2210.13777 (2022) - [i25]John M. Giorgi, Luca Soldaini, Bo Wang, Gary D. Bader, Kyle Lo, Lucy Lu Wang, Arman Cohan:
Exploring the Challenges of Open Domain Multi-Document Summarization. CoRR abs/2212.10526 (2022)
- 2021
- [j6]Lucy Lu Wang, Kyle Lo:
Text mining approaches for dealing with the rapidly expanding literature on COVID-19. Briefings Bioinform. 22(2): 781-799 (2021) - [j5]Farshad Firouzi, Bahareh J. Farahani, Mahmoud Daneshmand, Kathy Grise, Jaeseung Song, Roberto Saracco, Lucy Lu Wang, Kyle Lo, Plamen Angelov, Eduardo A. Soares, Po-Shen Loh, Zeynab Talebpour, Reza Moradi, Mohsen Goodarzi, Haleh Ashraf, Mohammad Talebpour, Alireza Talebpour, Luca Romeo, Rupam Das, Hadi Heidari, Dana K. Pasquale, James Moody, Chris Woods, Erich S. Huang, Payam M. Barnaghi, Majid Sarrafzadeh, Ron C. Li, Kristen L. Beck, Olexandr Isayev, NakMyoung Sung, Alan Luo:
Harnessing the Power of Smart and Connected Health to Tackle COVID-19: IoT, AI, Robotics, and Blockchain for a Better World. IEEE Internet Things J. 8(16): 12826-12846 (2021) - [j4]Kirk Roberts, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, Kyle Lo, Ian Soboroff, Ellen M. Voorhees, Lucy Lu Wang, William R. Hersh:
Searching for scientific evidence in a pandemic: An overview of TREC-COVID. J. Biomed. Informatics 121: 103865 (2021) - [c15]Kelvin Luu, Xinyi Wu, Rik Koncel-Kedziorski, Kyle Lo, Isabel Cachola, Noah A. Smith:
Explaining Relationships Between Scientific Documents. ACL/IJCNLP (1) 2021: 2130-2144 - [c14]Andrew Head, Kyle Lo, Dongyeop Kang, Raymond Fok, Sam Skjonsberg, Daniel S. Weld, Marti A. Hearst:
Augmenting Scientific Papers with Just-in-Time, Position-Sensitive Definitions of Terms and Symbols. CHI 2021: 413:1-413:18 - [c13]Saadia Gabriel, Antoine Bosselut, Jeff Da, Ari Holtzman, Jan Buys, Kyle Lo, Asli Celikyilmaz, Yejin Choi:
Discourse Understanding and Factual Consistency in Abstractive Summarization. EACL 2021: 435-447 - [c12]Pradeep Dasigi, Kyle Lo, Iz Beltagy, Arman Cohan, Noah A. Smith, Matt Gardner:
A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers. NAACL-HLT 2021: 4599-4610 - [c11]Jonathan Bragg, Arman Cohan, Kyle Lo, Iz Beltagy:
FLEX: Unifying Evaluation for Few-Shot NLP. NeurIPS 2021: 15787-15800 - [i24]Kirk Roberts, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, Kyle Lo, Ian Soboroff, Ellen M. Voorhees, Lucy Lu Wang, William R. Hersh:
Searching for Scientific Evidence in a Pandemic: An Overview of TREC-COVID. CoRR abs/2104.09632 (2021) - [i23]Pradeep Dasigi, Kyle Lo, Iz Beltagy, Arman Cohan, Noah A. Smith, Matt Gardner:
A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers. CoRR abs/2105.03011 (2021) - [i22]Zejiang Shen, Kyle Lo, Lucy Lu Wang, Bailey Kuehl, Daniel S. Weld, Doug Downey:
Incorporating Visual Layout Structures for Scientific Text Classification. CoRR abs/2106.00676 (2021) - [i21]Anne Lauscher, Brandon Ko, Bailey Kuehl, Sophie Johnson, David Jurgens, Arman Cohan, Kyle Lo:
MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting. CoRR abs/2107.00414 (2021) - [i20]Jonathan Bragg, Arman Cohan, Kyle Lo, Iz Beltagy:
FLEX: Unifying Evaluation for Few-Shot NLP. CoRR abs/2107.07170 (2021) - [i19]David Wadden, Kyle Lo:
Overview and Insights from the SciVer Shared Task on Scientific Claim Verification. CoRR abs/2107.08188 (2021) - [i18]Marissa Radensky, Doug Downey, Kyle Lo, Zoran Popovic, Daniel S. Weld:
Exploring The Role of Local and Global Explanations in Recommender Systems. CoRR abs/2109.13301 (2021) - [i17]David Wadden, Kyle Lo, Lucy Lu Wang, Arman Cohan, Iz Beltagy, Hannaneh Hajishirzi:
LongChecker: Improving scientific claim verification by modeling full-abstract context. CoRR abs/2112.01640 (2021)
- 2020
- [j3]Anshul Kanakia, Kuansan Wang, Yuxiao Dong, Boya Xie, Kyle Lo, Zhihong Shen, Lucy Lu Wang, Chiyuan Huang, Darrin Eide, Sebastian Kohlmeier, Chieh-Han Wu:
Mitigating Biases in CORD-19 for Analyzing COVID-19 Literature. Frontiers Res. Metrics Anal. 5: 596624 (2020) - [j2]Kirk Roberts, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, Kyle Lo, Ian Soboroff, Ellen M. Voorhees, Lucy Lu Wang, William R. Hersh:
TREC-COVID: rationale and structure of an information retrieval shared task for COVID-19. J. Am. Medical Informatics Assoc. 27(9): 1431-1436 (2020) - [j1]Ellen M. Voorhees, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, William R. Hersh, Kyle Lo, Kirk Roberts, Ian Soboroff, Lucy Lu Wang:
TREC-COVID: constructing a pandemic information retrieval test collection. SIGIR Forum 54(1): 1:1-1:12 (2020) - [c10]