default search action
Hinrich Schütze
Person information
- affiliation: Ludwig Maximilian University of Munich, Center for Information and Language Processing, Germany
- affiliation (former): University of Stuttgart, Institute for Natural Language Processing, Germany
- affiliation (former): Stanford University, CA, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j35]Valentin Hofmann, Goran Glavas, Nikola Ljubesic, Janet B. Pierrehumbert, Hinrich Schütze:
Geographic Adaptation of Pretrained Language Models. Trans. Assoc. Comput. Linguistics 12: 411-431 (2024) - [c288]Amir Hossein Kargaran, François Yvon, Hinrich Schütze:
MaskLID: Code-Switching Language Identification through Iterative Masking. ACL (Short Papers) 2024: 459-469 - [c287]Yihong Liu, Chunlan Ma, Haotian Ye, Hinrich Schütze:
TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models. ACL (1) 2024: 2476-2499 - [c286]Shuzhou Yuan, Ercong Nie, Michael Färber, Helmut Schmid, Hinrich Schütze:
GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural Network. ACL (Findings) 2024: 3987-4001 - [c285]Paul Röttger, Valentin Hofmann, Valentina Pyatkin, Musashi Hinck, Hannah Kirk, Hinrich Schütze, Dirk Hovy:
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models. ACL (1) 2024: 15295-15311 - [c284]Shijia Zhou, Leonie Weissweiler, Taiqi He, Hinrich Schütze, David R. Mortensen, Lori S. Levin:
Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons. LREC/COLING 2024: 3804-3811 - [c283]Amir Hossein Kargaran, François Yvon, Hinrich Schütze:
GlotScript: A Resource and Tool for Low Resource Writing System Identification. LREC/COLING 2024: 7774-7784 - [c282]Verena Blaschke, Barbara Kovacic, Siyao Peng, Hinrich Schütze, Barbara Plank:
MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank. LREC/COLING 2024: 10921-10938 - [c281]Abdullatif Köksal, Silvia Severini, Hinrich Schütze:
SilverAlign: MT-Based Silver Data Algorithm for Evaluating Word Alignment. LREC/COLING 2024: 14812-14825 - [c280]Leonie Weissweiler, Nina Böbel, Kirian Guiller, Santiago Herrera, Wesley Samuel Scivetti, Arthur Lorenzi, Nurit Melnik, Archna Bhatia, Hinrich Schütze, Lori S. Levin, Amir Zeldes, Joakim Nivre, William Croft, Nathan Schneider:
UCxn: Typologically-Informed Annotation of Constructions Atop Universal Dependencies. LREC/COLING 2024: 16919-16932 - [c279]David R. Mortensen, Valentina Izrailevitch, Yunze Xiao, Hinrich Schütze, Leonie Weissweiler:
Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs. LREC/COLING 2024: 17359-17364 - [c278]Peiqin Lin, Chengzhi Hu, Zheyu Zhang, André F. T. Martins, Hinrich Schütze:
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models. EACL (Findings) 2024: 276-310 - [c277]Lütfi Kerem Senel, Benedikt Ebing, Konul Baghirova, Hinrich Schütze, Goran Glavas:
Kardeş-NLU: Transfer to Low-Resource Languages with Big Brother's Help - A Benchmark and Evaluation for Turkic Languages. EACL (1) 2024: 1672-1688 - [c276]Bolei Ma, Ercong Nie, Shuzhou Yuan, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schütze:
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks. EACL (1) 2024: 2685-2702 - [c275]Yongkang Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schütze:
ChatZero: Zero-Shot Cross-Lingual Dialogue Generation via Pseudo-Target Language. ECAI 2024: 3867-3874 - [c274]Raoyuan Zhao, Abdullatif Köksal, Yihong Liu, Leonie Weissweiler, Anna Korhonen, Hinrich Schütze:
SynthEval: Hybrid Behavioral Testing of NLP Models with Synthetic Evaluation. EMNLP (Findings) 2024: 7017-7034 - [c273]Arda Yüksel, Abdullatif Köksal, Lütfi Kerem Senel, Anna Korhonen, Hinrich Schütze:
TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish. EMNLP (Findings) 2024: 7035-7055 - [c272]Abdullatif Köksal, Timo Schick, Anna Korhonen, Hinrich Schütze:
LongForm: Effective Instruction Tuning with Reverse Instructions. EMNLP (Findings) 2024: 7056-7078 - [c271]Mingyang Wang, Lukas Lange, Heike Adel, Jannik Strötgen, Hinrich Schütze:
Better Call SAUL: Fluent and Consistent Language Model Editing with Generation Regularization. EMNLP (Findings) 2024: 7990-8000 - [c270]Orgest Xhelili, Yihong Liu, Hinrich Schütze:
Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment. EMNLP (Findings) 2024: 11283-11296 - [c269]Ali Modarressi, Abdullatif Köksal, Hinrich Schütze:
Consistent Document-level Relation Extraction via Counterfactuals. EMNLP (Findings) 2024: 11501-11507 - [c268]Yongkang Liu, Yiqun Zhang, Qian Li, Tong Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schütze:
HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy. EMNLP 2024: 18266-18287 - [c267]Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze:
Rehearsal-Free Modular and Compositional Continual Learning for Language Models. NAACL (Short Papers) 2024: 469-480 - [c266]Yihong Liu, Peiqin Lin, Mingyang Wang, Hinrich Schütze:
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining. NAACL-HLT (Findings) 2024: 1067-1097 - [c265]Yongkang Liu, Ercong Nie, Shi Feng, Zheng Hua, Zifeng Ding, Daling Wang, Yifei Zhang, Hinrich Schütze:
A Unified Data Augmentation Framework for Low-Resource Multi-domain Dialogue Generation. ECML/PKDD (2) 2024: 162-177 - [e3]Cristina Piazza, Patricia Capsi-Morales, Luis Figueredo, Manuel Keppler, Hinrich Schütze:
Human-Friendly Robotics 2023 - HFR: 16th International Workshop on Human-Friendly Robotics, Munich, Germany, 20-21 September 2023. Springer Proceedings in Advanced Robotics 29, Springer 2024, ISBN 978-3-031-54999-1 [contents] - [i230]Haotian Ye, Yihong Liu, Chunlan Ma, Hinrich Schütze:
MoSECroT: Model Stitching with Static Word Embeddings for Crosslingual Zero-shot Transfer. CoRR abs/2401.04821 (2024) - [i229]Yihong Liu, Chunlan Ma, Haotian Ye, Hinrich Schütze:
TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models. CoRR abs/2401.06620 (2024) - [i228]Peiqin Lin, Shaoxiong Ji, Jörg Tiedemann, André F. T. Martins, Hinrich Schütze:
MaLA-500: Massive Language Adaptation of Large Language Models. CoRR abs/2401.13303 (2024) - [i227]Yongkang Liu, Yiqun Zhang, Qian Li, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schütze:
HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy. CoRR abs/2401.15207 (2024) - [i226]Bolei Ma, Ercong Nie, Shuzhou Yuan, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schütze:
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks. CoRR abs/2401.16589 (2024) - [i225]Shuzhou Yuan, Ercong Nie, Michael Färber, Helmut Schmid, Hinrich Schütze:
GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural Network. CoRR abs/2402.11709 (2024) - [i224]Verena Blaschke, Christoph Purschke, Hinrich Schütze, Barbara Plank:
What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects. CoRR abs/2402.11968 (2024) - [i223]Paul Röttger, Valentin Hofmann, Valentina Pyatkin, Musashi Hinck, Hannah Rose Kirk, Hinrich Schütze, Dirk Hovy:
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models. CoRR abs/2402.16786 (2024) - [i222]Ercong Nie, Shuzhou Yuan, Bolei Ma, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schütze:
Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge in English-Centric Large Language Models. CoRR abs/2402.18397 (2024) - [i221]Leonie Weissweiler, Abdullatif Köksal, Hinrich Schütze:
Hybrid Human-LLM Corpus Construction and LLM Evaluation for Rare Linguistic Phenomena. CoRR abs/2403.06965 (2024) - [i220]Verena Blaschke, Barbara Kovacic, Siyao Peng, Hinrich Schütze, Barbara Plank:
MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank. CoRR abs/2403.10293 (2024) - [i219]Leonie Weissweiler, Nina Böbel, Kirian Guiller, Santiago Herrera, Wesley Scivetti, Arthur Lorenzi, Nurit Melnik, Archna Bhatia, Hinrich Schütze, Lori S. Levin, Amir Zeldes, Joakim Nivre, William Croft, Nathan Schneider:
UCxn: Typologically Informed Annotation of Constructions Atop Universal Dependencies. CoRR abs/2403.17748 (2024) - [i218]Shijia Zhou, Leonie Weissweiler, Taiqi He, Hinrich Schütze, David R. Mortensen, Lori S. Levin:
Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons. CoRR abs/2403.17760 (2024) - [i217]David R. Mortensen, Valentina Izrailevitch, Yunze Xiao, Hinrich Schütze, Leonie Weissweiler:
Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs. CoRR abs/2403.17856 (2024) - [i216]Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze:
Rehearsal-Free Modular and Compositional Continual Learning for Language Models. CoRR abs/2404.00790 (2024) - [i215]Ryan Cotterell, Thomas Müller, Alexander Fraser, Hinrich Schütze:
Labeled Morphological Segmentation with Semi-Markov Models. CoRR abs/2404.08997 (2024) - [i214]Ali Modarressi, Abdullatif Köksal, Ayyoob Imani, Mohsen Fayyaz, Hinrich Schütze:
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory. CoRR abs/2404.11672 (2024) - [i213]Peiqin Lin, André F. T. Martins, Hinrich Schütze:
XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples. CoRR abs/2405.05116 (2024) - [i212]Yihong Liu, Chunlan Ma, Haotian Ye, Hinrich Schütze:
TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data. CoRR abs/2405.09913 (2024) - [i211]Thomas Müller, Ryan Cotterell, Alexander Fraser, Hinrich Schütze:
Joint Lemmatization and Morphological Tagging with LEMMING. CoRR abs/2405.18308 (2024) - [i210]Amir Hossein Kargaran, François Yvon, Hinrich Schütze:
MaskLID: Code-Switching Language Identification through Iterative Masking. CoRR abs/2406.06263 (2024) - [i209]Yongkang Liu, Ercong Nie, Shi Feng, Zheng Hua, Zifeng Ding, Daling Wang, Yifei Zhang, Hinrich Schütze:
A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation. CoRR abs/2406.09881 (2024) - [i208]Lea Hirlimann, Shengqiang Zhang, Hinrich Schütze, Philipp Wicke:
Robustness Testing of Multi-Modal Models in Varied Home Environments for Assistive Robots. CoRR abs/2406.12443 (2024) - [i207]Ercong Nie, Bo Shao, Zifeng Ding, Mingyang Wang, Helmut Schmid, Hinrich Schütze:
BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning. CoRR abs/2406.17764 (2024) - [i206]Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze:
Learn it or Leave it: Module Composition and Pruning for Continual Learning. CoRR abs/2406.18708 (2024) - [i205]Orgest Xhelili, Yihong Liu, Hinrich Schütze:
Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment. CoRR abs/2406.19759 (2024) - [i204]Peiqin Lin, André F. T. Martins, Hinrich Schütze:
A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models. CoRR abs/2407.00436 (2024) - [i203]Chunlan Ma, Yihong Liu, Haotian Ye, Hinrich Schütze:
Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts. CoRR abs/2407.02320 (2024) - [i202]Ali Modarressi, Abdullatif Köksal, Hinrich Schütze:
Consistent Document-Level Relation Extraction via Counterfactuals. CoRR abs/2407.06699 (2024) - [i201]Arda Yüksel, Abdullatif Köksal, Lütfi Kerem Senel, Anna Korhonen, Hinrich Schütze:
TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish. CoRR abs/2407.12402 (2024) - [i200]Subhabrata Dutta, Timo Kaufmann, Goran Glavas, Ivan Habernal, Kristian Kersting, Frauke Kreuter, Mira Mezini, Iryna Gurevych, Eyke Hüllermeier, Hinrich Schütze:
Problem Solving Through Human-AI Preference-Based Cooperation. CoRR abs/2408.07461 (2024) - [i199]Yongkang Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schütze:
ChatZero:Zero-shot Cross-Lingual Dialogue Generation via Pseudo-Target Language. CoRR abs/2408.08724 (2024) - [i198]Raoyuan Zhao, Abdullatif Köksal, Yihong Liu, Leonie Weissweiler, Anna Korhonen, Hinrich Schütze:
SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists. CoRR abs/2408.17437 (2024) - [i197]Ingo Ziegler, Abdullatif Köksal, Desmond Elliott, Hinrich Schütze:
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation. CoRR abs/2409.02098 (2024) - [i196]Abdullatif Köksal, Marion Thaler, Ayyoob Imani, Ahmet Üstün, Anna Korhonen, Hinrich Schütze:
MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions. CoRR abs/2409.12958 (2024) - [i195]Yihong Liu, Mingyang Wang, Amir Hossein Kargaran, Ayyoob Imani, Orgest Xhelili, Haotian Ye, Chunlan Ma, François Yvon, Hinrich Schütze:
How Transliterations Improve Crosslingual Alignment. CoRR abs/2409.17326 (2024) - [i194]Yihong Liu, Haotian Ye, Chunlan Ma, Mingyang Wang, Hinrich Schütze:
LangSAMP: Language-Script Aware Multilingual Pretraining. CoRR abs/2409.18199 (2024) - [i193]Mingyang Wang, Lukas Lange, Heike Adel, Jannik Strötgen, Hinrich Schütze:
Better Call SAUL: Fluent and Consistent Language Model Editing with Generation Regularization. CoRR abs/2410.02433 (2024) - [i192]Amir Hossein Kargaran, Ali Modarressi, Nafiseh Nikeghbal, Jana Diesner, François Yvon, Hinrich Schütze:
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment. CoRR abs/2410.05873 (2024) - [i191]Amir Hossein Kargaran, François Yvon, Hinrich Schütze:
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages. CoRR abs/2410.23825 (2024) - 2023
- [j34]Leonie Weissweiler, Valentin Hofmann, Abdullatif Köksal, Hinrich Schütze:
Explaining pretrained language models' understanding of linguistic structures using construction grammar. Frontiers Artif. Intell. 6 (2023) - [j33]Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan J. Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, François Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse H. Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, José Hernández-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, María José Ramírez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael I. Ivanitskiy, Michael Starritt, Michael Strube, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T., Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Milkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima (Shammie) Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu:
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Trans. Mach. Learn. Res. 2023 (2023) - [c264]Ayyoob Imani, Peiqin Lin, Amir Hossein Kargaran, Silvia Severini, Masoud Jalili Sabet, Nora Kassner, Chunlan Ma, Helmut Schmid, André F. T. Martins, François Yvon, Hinrich Schütze:
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages. ACL (1) 2023: 1082-1117 - [c263]Xinpeng Wang, Leonie Weissweiler, Hinrich Schütze, Barbara Plank:
How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives. ACL (2) 2023: 1843-1852 - [c262]Yongkang Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schütze:
PVGRU: Generating Diverse and Relevant Dialogue Responses via Pseudo-Variational Mechanism. ACL (1) 2023: 3295-3310 - [c261]Zhen Han, Ruotong Liao, Jindong Gu, Yao Zhang, Zifeng Ding, Yujia Gu, Heinz Koeppl, Hinrich Schütze, Volker Tresp:
ECOLA: Enhancing Temporal Knowledge Embeddings with Contextualized Language Representations. ACL (Findings) 2023: 5433-5447 - [c260]Ercong Nie, Sheng Liang, Helmut Schmid, Hinrich Schütze:
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages. ACL (Findings) 2023: 8320-8340 - [c259]Yihong Liu, Haotian Ye, Leonie Weissweiler, Philipp Wicke, Renhao Pei, Robert Zangenfeind, Hinrich Schütze:
A Crosslingual Investigation of Conceptualization in 1335 Languages. ACL (1) 2023: 12969-13000 - [c258]Abdullatif Köksal, Timo Schick, Hinrich Schütze:
MEAL: Stable and Active Learning for Few-Shot Prompting. EMNLP (Findings) 2023: 506-517 - [c257]Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze:
GradSim: Gradient-Based Language Grouping for Effective Multilingual Training. EMNLP 2023: 4631-4646 - [c256]Amir Hossein Kargaran, Ayyoob Imani, François Yvon, Hinrich Schütze:
GlotLID: Language Identification for Low-Resource Languages. EMNLP (Findings) 2023: 6155-6218 - [c255]Leonie Weissweiler, Valentin Hofmann, Anjali Kantharuban, Anna Cai, Ritam Dutt, Amey Hengle, Anubha Kabra, Atharva Kulkarni, Abhishek Vijayakumar, Haofei Yu, Hinrich Schütze, Kemal Oflazer, David R. Mortensen:
Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model. EMNLP 2023: 6508-6524 - [c254]Yihong Liu, Haotian Ye, Leonie Weissweiler, Renhao Pei, Hinrich Schütze:
Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs. EMNLP (Findings) 2023: 8376-8401 - [c253]Abdullatif Köksal, Omer Faruk Yalcin, Ahmet Akbiyik, M. Tahir Kilavuz, Anna Korhonen, Hinrich Schütze:
Language-Agnostic Bias Detection in Language Models with Bias Probing. EMNLP (Findings) 2023: 12735-12747 - [c252]Nora Kassner, Oyvind Tafjord, Ashish Sabharwal, Kyle Richardson, Hinrich Schütze, Peter Clark:
Language Models with Rationality. EMNLP 2023: 14190-14201 - [c251]Ercong Nie, Helmut Schmid, Hinrich Schütze:
Unleashing the Multilingual Encoder Potential: Boosting Zero-Shot Performance via Probability Calibration. EMNLP (Findings) 2023: 15774-15782 - [c250]Yihong Liu, Alexandra Chronopoulou, Hinrich Schütze, Alexander Fraser:
On the Copying Problem of Unsupervised NMT: A Training Schedule with a Language Discriminator Loss. IWSLT@ACL 2023: 491-502 - [c249]Bolei Ma, Ercong Nie, Helmut Schmid, Hinrich Schütze:
Is Prompt-Based Finetuning Always Better than Vanilla Finetuning? Insights from Cross-Lingual Language Understanding. KONVENS 2023: 1-16 - [c248]Nafiseh Nikeghbal, Amir Hossein Kargaran, Abbas Heydarnoori, Hinrich Schütze:
GIRT-Data: Sampling GitHub Issue Report Templates. MSR 2023: 104-108 - [c247]Verena Blaschke, Hinrich Schütze, Barbara Plank:
A Survey of Corpora for Germanic Low-Resource Languages and Dialects. NoDaLiDa 2023: 392-414 - [c246]Ercong Nie, Helmut Schmid, Hinrich Schütze:
Cross-Lingual Constituency Parsing for Middle High German: A Delexicalized Approach. ALP@RANLP 2023: 68-79 - [c245]Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze:
NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis. SemEval@ACL 2023: 488-497 - [c244]Yanchen Liu, Timo Schick, Hinrich Schütze:
Semantic-Oriented Unlabeled Priming for Large-Scale Language Models. SustaiNLP 2023: 32-38 - [c243]Verena Blaschke, Hinrich Schütze, Barbara Plank:
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages. VarDial@EACL 2023: 40-54 - [i190]Leonie Weissweiler, Taiqi He, Naoki Otani, David R. Mortensen, Lori S. Levin, Hinrich Schütze:
Construction Grammar Provides Unique Insight into Neural Language Models. CoRR abs/2302.02178 (2023) - [i189]Amir Hossein Kargaran, Nafiseh Nikeghbal, Abbas Heydarnoori, Hinrich Schütze:
MenuCraft: Interactive Menu System Design with Large Language Models. CoRR abs/2303.04496 (2023) - [i188]Nafiseh Nikeghbal, Amir Hossein Kargaran, Abbas Heydarnoori, Hinrich Schütze:
GIRT-Data: Sampling GitHub Issue Report Templates. CoRR abs/2303.09236 (2023) - [i187]Antonis Maronikolakis, Abdullatif Köksal, Hinrich Schütze:
Sociocultural knowledge is needed for selection of shots in hate speech detection tasks. CoRR abs/2304.01890 (2023) - [i186]Abdullatif Köksal, Timo Schick, Anna Korhonen, Hinrich Schütze:
LongForm: Optimizing Instruction Tuning for Long Text Generation with Corpus Extraction. CoRR abs/2304.08460 (2023) - [i185]Verena Blaschke, Hinrich Schütze, Barbara Plank:
A Survey of Corpora for Germanic Low-Resource Languages and Dialects. CoRR abs/2304.09805 (2023) - [i184]Verena Blaschke, Hinrich Schütze, Barbara Plank:
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages. CoRR abs/2304.10158 (2023) - [i183]Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze:
NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis. CoRR abs/2305.00090 (2023) - [i182]Yihong Liu, Haotian Ye, Leonie Weissweiler, Philipp Wicke, Renhao Pei, Robert Zangenfeind, Hinrich Schütze:
A Crosslingual Investigation of Conceptualization in 1335 Languages. CoRR abs/2305.08475 (2023) - [i181]Chunlan Ma, Ayyoob Imani, Haotian Ye, Ehsaneddin Asgari, Hinrich Schütze:
Taxi1500: A Multilingual Dataset for Text Classification in 1500 Languages. CoRR abs/2305.08487 (2023) - [i180]Ayyoob Imani, Peiqin Lin, Amir Hossein Kargaran, Silvia Severini, Masoud Jalili Sabet, Nora Kassner, Chunlan Ma, Helmut Schmid, André F. T. Martins, François Yvon, Hinrich Schütze:
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages. CoRR abs/2305.12182 (2023) - [i179]