


default search action
NAACL-HLT 2025: Albuquerque, New Mexico, USA - Volume 2: Short Papers
- Luis Chiruzzo, Alan Ritter, Lu Wang:
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2025 - Volume 2: Short Papers, Albuquerque, New Mexico, April 29 - May 4, 2025. Association for Computational Linguistics 2025, ISBN 979-8-89176-190-2 - Yinqi Zhang, Xintian Han, Haolong Li, Kedi Chen, Shaohui Lin:
Complete Chess Games Enable LLM Become A Chess Master. 1-7 - Dipankar Srirag
, Aditya Joshi, Jacob Eisenstein:
Predicting the Target Word of Game-playing Conversations using a Low-Rank Dialect Adapter for Decoder Models. 8-17 - Shani Goren, Oren Kalinsky, Tomer Stav, Yuri Rapoport, Yaron Fairstein, Ram Yazdi, Nachshon Cohen, Alexander Libov, Guy Kushilevitz:
ChaI-TeA: A Benchmark for Evaluating Autocompletion of Interactions with LLM-based Chatbots. 18-32 - Rao Ma, Mengjie Qian, Yassir Fathullah, Siyuan Tang, Mark J. F. Gales, Kate M. Knill:
Cross-Lingual Transfer Learning for Speech Translation. 33-43 - Nishant Balepur, Feng Gu, Abhilasha Ravichander, Shi Feng, Jordan Lee Boyd-Graber, Rachel Rudinger:
Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer? 44-64 - Feng Gu, Wichayaporn Wongkamjan, Jordan Lee Boyd-Graber, Jonathan K. Kummerfeld, Denis Peskoff, Jonathan May:
Personalized Help for Optimizing Low-Skilled Users' Strategy. 65-74 - Yash Jain, Vishal Chowdhary:
Local Prompt Optimization. 75-81 - Jiwoo Hong, Noah Lee, Rodrigo Martínez-Castaño, César Rodríguez, James Thorne:
Cross-lingual Transfer of Reward Models in Multilingual Alignment. 82-94 - Gleb Kuzmin, Neemesh Yadav, Ivan V. Smirnov, Timothy Baldwin, Artem Shelmanov:
Inference-Time Selective Debiasing to Enhance Fairness in Text Classification Models. 95-107 - Anna Arias-Duart, Pablo Agustin Martin-Torres, Daniel Hinjos, Pablo Bernabeu-Perez, Lucia Urcelay-Ganzabal, Marta Gonzalez-Mallo, Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Sergio Álvarez-Napagao, Dario Garcia-Gasulla:
Automatic Evaluation of Healthcare LLMs Beyond Question-Answering. 108-130 - Yiming Lu, Yebowen Hu, Hassan Foroosh, Wei Jin, Fei Liu:
STRUX: An LLM for Decision-Making with Structured Explanations. 131-141 - Toan Ngoc Nguyen, Nam Le Hai, Nguyen Doan Hieu, Dai An Nguyen, Linh Ngo Van, Thien Huu Nguyen, Sang Dinh:
Improving Vietnamese-English Cross-Lingual Retrieval for Legal and General Domains. 142-153 - Hope McGovern, Hale Sirin, Tom Lippincott:
Computational Discovery of Chiasmus in Ancient Religious Text. 154-160 - Hope McGovern, Hale Sirin, Tom Lippincott:
Characterizing the Effects of Translation on Intertextuality using Multilingual Embedding Spaces. 161-167 - Cheng Yang, Chufan Shi, Siheng Li, Bo Shui, Yujiu Yang, Wai Lam:
LLM2: Let Large Language Models Harness System 2 Reasoning. 168-177 - Yanhong Li, David Yunis, David McAllester, Jiawei Zhou:
Context-Efficient Retrieval with Factual Decomposition. 178-194 - Laura Biester:
Sports and Women's Sports: Gender Bias in Text Generation with Olympic Data. 195-205 - Elizabeth Nielsen, Isaac Caswell, Jiaming Luo, Colin Cherry:
Alligators All Around: Mitigating Lexical Confusion in Low-resource Machine Translation. 206-221 - Jaeseong Lee, Seung-won Hwang, Hojin Lee, Yunju Bak, Changmin Lee:
PROM: Pivoted and Regulated Optimization for Multilingual Instruction Learning. 222-228 - Kaiqiao Han, Tianqing Fang, Zhaowei Wang, Yangqiu Song, Mark Steedman:
Concept-Reversed Winograd Schema Challenge: Evaluating and Improving Robust Reasoning in Large Language Models via Abstraction. 229-243 - Ruiyi Zhang, David Sullivan, Kyle Jackson, Pengtao Xie, Mei Chen:
Defense against Prompt Injection Attacks via Mixture of Encodings. 244-252 - Akshit Achara, Anshuman Chhabra:
Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers. 253-264 - Aashiq Muhamed, Mona T. Diab, Virginia Smith:
CoRAG: Collaborative Retrieval-Augmented Generation. 265-276 - Ivory Yang, Weicheng Ma, Chunhui Zhang, Soroush Vosoughi:
Is It Navajo? Accurate Language Detection for Endangered Athabaskan Languages. 277-284 - Kyle Gorman, Yuval Pinter:
Don't Touch My Diacritics. 285-291 - Chunhui Zhang, Yiren Jian, Zhongyu Ouyang, Soroush Vosoughi:
Pretrained Image-Text Models are Secretly Video Captioners. 292-305 - Sicheng Yu, Yuanchen Xu, Cunxiao Du, Yanying Zhou, Minghui Qiu, Qianru Sun, Hao Zhang, Jiawei Wu:
Reverse Modeling in Large Language Models. 306-320 - Oleg Vasilyev, Randy Sawaya, John Bohannon:
Preserving Multilingual Quality While Tuning Query Encoder on English Only. 321-341 - Zixin Tang, Chieh-Yang Huang, Tsung-Chi Li, Ho Yin Sam Ng, Hen-Hsen Huang, Ting-Hao Kenneth Huang:
Using Contextually Aligned Online Reviews to Measure LLMs' Performance Disparities Across Language Varieties. 342-355 - Yuji Byun, Jaeho Lee:
Towards Federated Low-Rank Adaptation of Language Models with Rank Heterogeneity. 356-362 - Zenghao Duan, Wenbin Duan, Zhiyi Yin, Yinghan Shen, Shaoling Jing, Jie Zhang, Huawei Shen, Xueqi Cheng:
Related Knowledge Perturbation Matters: Rethinking Multiple Pieces of Knowledge Editing in Same-Subject. 363-373 - Kazuki Yano, Takumi Ito, Jun Suzuki:
STEP: Staged Parameter-Efficient Pre-training for Large Language Models. 374-384 - Amit A. Levy, Mor Geva:
Language Models Encode Numbers Using Digit Representations in Base 10. 385-395 - You Wu, Haoyi Wu, Kewei Tu:
A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference. 396-403 - Abhishek Gupta, Amruta Parulekar, Sameep Chattopadhyay, Preethi Jyothi:
AMPS: ASR with Multimodal Paraphrase Supervision. 404-413 - Chunlan Ma, Ayyoob Imani, Haotian Ye, Renhao Pei
, Ehsaneddin Asgari, Hinrich Schütze:
Taxi1500: A Dataset for Multilingual Text Classification in 1500 Languages. 414-439 - Usman Naseem, Shuvam Shiwakoti, Siddhant Bikram Shah, Surendrabikram Thapa, Qi Zhang:
GameTox: A Comprehensive Dataset and Analysis for Enhanced Toxicity Detection in Online Gaming Communities. 440-447 - Forrest Sheng Bao, Miaoran Li, Renyi Qu, Ge Luo, Erana Wan, Yujia Tang, Weisi Fan, Manveer Singh Tamber, Suleman Kazi, Vivek Sourabh, Mike Qi, Ruixuan Tu, Chenyu Xu, Matthew Gonzales, Ofer Mendelevitch, Amin Ahmad:
FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMs. 448-461 - Xi Chen, Mao Mao, Shuo Li, Haotian Shangguan:
Debate-Feedback: A Multi-Agent Framework for Efficient Legal Judgment Prediction. 462-470 - Shangyi Geng, Wenting Zhao, Alexander M. Rush:
Great Memory, Shallow Reasoning: Limits of kNN-LMs. 471-482 - Tatsuya Hiraoka, Kentaro Inui:
Repetition Neurons: How Do Language Models Produce Repetitions? 483-495 - Yu-Ang Lee, Ching-Yun Ko, Tejaswini Pedapati, I-Hsin Chung, Mi-Yen Yeh, Pin-Yu Chen:
STAR: Spectral Truncation and Rescale for Model Merging. 496-505 - Hieu Trung Nguyen, Bao Nguyen, Binh Nguyen, Viet Anh Nguyen:
Task-driven Layerwise Additive Activation Intervention. 506-513 - Adithya Pratapa, Teruko Mitamura:
Scaling Multi-Document Event Summarization: Evaluating Compression vs. Full-Text Approaches. 514-528 - Sangmin Woo, Kang Zhou, Yun Zhou, Shuai Wang, Sheng Guan, Haibo Ding, Lin Lee Cheong:
Black-Box Visual Prompt Engineering for Mitigating Object Hallucination in Large Vision Language Models. 529-538 - Yutian Zhao, Huimin Wang, Yefeng Zheng, Xian Wu:
A Layered Debating Multi-Agent System for Similar Disease Diagnosis. 539-549 - Ahmed Oumar El-Shangiti, Tatsuya Hiraoka, Hilal AlQuabeh, Benjamin Heinzerling, Kentaro Inui:
The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces. 550-561 - Steve Bakos, David Guzmán, Riddhi More, Kelly Chutong Li, Félix Gaschi, En-Shiun Annie Lee:
AlignFreeze: Navigating the Impact of Realignment on the Layers of Multilingual Models Across Diverse Languages. 562-586 - Junhao Chen, Zhiyuan Ding, Yan Liu, Xiangzhu Zeng, Ling Wang:
FLIQA-AD: a Fusion Model with Large Language Model for Better Diagnose and MMSE Prediction of Alzheimer's Disease. 587-594 - Quan Guo, Xin Liang:
Transform Retrieval for Textual Entailment in RAG. 595-599 - Hyunji Lee, Danni Liu
, Supriti Sinhamahapatra, Jan Niehues:
How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations. 600-610 - Shu Wang, Lei Ji, Renxi Wang, Wenxiao Zhao, Haokun Liu, Yifan Hou, Ying Nian Wu:
Explore the Reasoning Capability of LLMs in the Chess Testbed. 611-622 - Aman Tiwari, Shiva Krishna Reddy Malay, Vikas Yadav, Masoud Hashemi, Sathwik Tejaswi Madhusudhan:
Auto-Cypher: Improving LLMs on Cypher generation via LLM-supervised generation-verification framework. 623-640 - Seo Yeon Park:
Leveraging Moment Injection for Enhanced Semi-supervised Natural Language Inference with Large Language Models. 641-648 - Taisei Enomoto, Hwichan Kim, Zhousi Chen, Mamoru Komachi:
A Fair Comparison without Translationese: English vs. Target-language Instructions for Multilingual LLMs. 649-670 - Sanghee Park, Geewook Kim:
Evaluating Multimodal Generative AI with Korean Educational Standards. 671-688 - Rao Fu, Ziyang Luo, Hongzhan Lin, Zhen Ye, Jing Ma:
ScratchEval: Are GPT-4o Smarter than My Child? Evaluating Large Multimodal Models with Visual Programming Challenges. 689-699 - Hao Kang, Tevin Wang, Chenyan Xiong:
Interpret and Control Dense Retrieval with Sparse Latent Features. 700-709 - Hyeonchu Park, Byungjun Kim, Bugeun Kim:
DART: An AIGT Detector using AMR of Rephrased Text. 710-721 - Nicolas Floquet, Joseph Le Roux, Nadi Tomeh, Thierry Charnois:
Scaling Graph-Based Dependency Parsing with Arc Vectorization and Attention-Based Refinement. 722-734 - Ang Lv, Ruobing Xie, Xingwu Sun, Zhanhui Kang, Rui Yan:
Language Models "Grok" to Copy. 735-741 - Gaspard Michel, Elena V. Epure, Romain Hennequin, Christophe Cerisara:
Evaluating LLMs for Quotation Attribution in Literary Texts: A Case Study of LLaMa3. 742-755 - Katharina Hämmerl, Tomasz Limisiewicz, Jindrich Libovický, Alexander Fraser:
Beyond Literal Token Overlap: Token Alignability for Multilinguality. 756-767 - Kawshik Manikantan, Makarand Tapaswi, Vineet Gandhi, Shubham Toshniwal:
IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark for LLMs. 768-777 - Karl El Hajal, Ajinkya Kulkarni, Enno Hermann, Mathew Magimai-Doss:
kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech. 778-786 - Youngwon Lee, Seung-won Hwang, Daniel F. Campos, Filip Gralinski, Zhewei Yao, Yuxiong He:
CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation. 787-796 - Margarita Bugueño, Hazem Abou Hamdan, Gerard de Melo:
GraphLSS: Integrating Lexical, Structural, and Semantic Features for Long Document Extractive Summarization. 797-804 - Juraj Vladika, Ivana Hacajová, Florian Matthes:
Step-by-Step Fact Verification System for Medical Claims with Explainable Reasoning. 805-816 - Shenran Wang, Changbing Yang, Mike Parkhill, Chad Quinn, Christopher Hammerly, Jian Zhu:
Developing multilingual speech synthesis system for Ojibwe, Mi'kmaq, and Maliseet. 817-826 - Kun Qian, Maximillian Chen, Siyan Li, Arpit Sharma, Zhou Yu:
Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined Prompts. 827-844 - Huaman Sun, Jiaxin Pei, Minje Choi, David Jurgens:
Sociodemographic Prompting is Not Yet an Effective Approach for Simulating Subjective Judgments with LLMs. 845-854 - Zhaoqing Wu, Dan Goldwasser, Maria Leonor Pacheco, Leora Morgenstern:
Identifying Power Relations in Conversations using Multi-Agent Social Reasoning. 855-865 - Aylin Gunal, Bowen Yi, John Piette, Rada Mihalcea, Verónica Pérez-Rosas:
Examining Spanish Counseling with MIDAS: a Motivational Interviewing Dataset in Spanish. 866-872 - Isabel O. Gallegos, Ryan Aponte, Ryan A. Rossi, Joe Barrow, Md. Mehrab Tanjim, Tong Yu, Hanieh Deilamsalehy, Ruiyi Zhang, Sungchul Kim, Franck Dernoncourt, Nedim Lipka, Deonna M. Owens, Jiuxiang Gu:
Self-Debiasing Large Language Models: Zero-Shot Recognition and Reduction of Stereotypes. 873-888 - Jiali Cheng, Hadi Amiri:
EqualizeIR: Mitigating Linguistic Biases in Retrieval Models. 889-898 - Ramaneswaran Selvakumar, Sonal Kumar, Hemant Kumar Giri, Nishit Anand, Ashish Seth, Sreyan Ghosh, Dinesh Manocha:
Do Audio-Language Models Understand Linguistic Variations? 899-913 - Sourabh Dattatray Deoghare, Diptesh Kanojia, Pushpak Bhattacharyya:
Giving the Old a Fresh Spin: Quality Estimation-Assisted Constrained Decoding for Automatic Post-Editing. 914-925 - Ming Li, Han Chen, Chenguang Wang, Dang Nguyen, Dianqi Li, Tianyi Zhou:
RuleR: Improving LLM Controllability by Rule-based Data Recycling. 926-943 - Sandeep Kumar, Samarth Garg, Sagnik Sengupta, Tirthankar Ghosal, Asif Ekbal:
MixRevDetect: Towards Detecting AI-Generated Content in Hybrid Peer Reviews. 944-953 - Maitreya Prafulla Chitale, Uday Bindal, Rajakrishnan Rajkumar, Rahul Mishra:
DiscoGraMS: Enhancing Movie Screen-Play Summarization using Movie Character-Aware Discourse Graph. 954-965 - Vasudha Varadarajan, Syeda Mahwish, Xiaoran Liu, Julia Buffolino, Christian C. Luhmann, Ryan L. Boyd, H. Andrew Schwartz:
Capturing Human Cognitive Styles with Language: Towards an Experimental Evaluation Paradigm. 966-979

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.