default search action
NAACL-HLT 2024: Mexico City, Mexico
- Kevin Duh, Helena Gómez-Adorno, Steven Bethard:
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), NAACL 2024, Mexico City, Mexico, June 16-21, 2024. Association for Computational Linguistics 2014, ISBN 979-8-89176-114-8 - Frontmatter.
- Hongyi Liu, Qingyun Wang, Payam Karisani, Heng Ji:
Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences. 1-21 - Hongyi Yuan, Zheng Yuan, Chuanqi Tan, Fei Huang, Songfang Huang:
Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation. 22-39 - Nikhil Mehta, Dan Goldwasser:
An Interactive Framework for Profiling News Media Sources. 40-58 - Yinghao Li, Haorui Wang, Chao Zhang:
Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study. 59-81 - Taeyang Yun, Hyunkuk Lim, Jeonghwan Lee, Min Song:
TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation. 82-95 - Seanie Lee, Jianpeng Cheng, Joris Driesen, Alexandru Coca, Anders Johannsen:
Effective and Efficient Conversation Retrieval for Dialogue State Tracking with Implicit Text Summaries. 96-111 - Maitrey Mehta, Valentina Pyatkin, Vivek Srikumar:
Promptly Predicting Structures: The Return of Inference. 112-130 - Yutong Shao, Ndapa Nakashole:
On Linearizing Structured Data in Encoder-Decoder Language Models: Insights from Text-to-SQL. 131-156 - Thang Le, Anh Tuan Luu:
Extractive Summarization with Text Generator. 157-174 - Michele Resta, Davide Bacciu:
Self-generated Replay Memories for Continual Neural Machine Translation. 175-191 - Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran:
Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models. 192-210 - Shreya Havaldar, Salvatore Giorgi, Sunny Rai, Thomas Talhelm, Sharath Chandra Guntuku, Lyle H. Ungar:
Building Knowledge-Guided Lexica to Model Cultural Variation. 211-226 - Shangqian Gao, Ting Hua, Yen-Chang Hsu, Yilin Shen, Hongxia Jin:
Adaptive Rank Selections for Low-Rank Approximation of Language Models. 227-241 - Pengzhi Gao, Ruiqing Zhang, Zhongjun He, Hua Wu, Haifeng Wang:
An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation. 242-256 - Zhenhailong Wang, Shaoguang Mao, Wenshan Wu, Tao Ge, Furu Wei, Heng Ji:
Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration. 257-279 - Ziyang Wang, Sanwoo Lee, Hsiu-Yuan Huang, Yunfang Wu:
FPT: Feature Prompt Tuning for Few-shot Readability Assessment. 280-295 - Junlong Li, Jinyuan Wang, Zhuosheng Zhang, Hai Zhao:
Self-Prompting Large Language Models for Zero-Shot Open-Domain QA. 296-310 - Kai Sun, Yifan Ethan Xu, Hanwen Zha, Yue Liu, Xin Luna Dong:
Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs? 311-325 - Wenting Zhao, Ye Liu, Yao Wan, Yibo Wang, Qingyang Wu, Zhongfen Deng, Jiangshu Du, Shuaiqi Liu, Yunlong Xu, Philip S. Yu:
kNN-ICL: Compositional Task-Oriented Parsing Generalization with Nearest Neighbor In-Context Learning. 326-337 - Jon Saad-Falcon, Omar Khattab, Christopher Potts, Matei Zaharia:
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems. 338-354 - Fan Zhang, Xian-Sheng Hua, Chong Chen, Xiao Luo:
DEMO: A Statistical Perspective for Efficient Image-Text Matching. 355-369 - Bin Wang, Zhengyuan Liu, Xin Huang, Fangkai Jiao, Yang Ding, AiTi Aw, Nancy Chen:
SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning. 370-390 - Seongyun Lee, Sue Hyun Park, Yongrae Jo, Minjoon Seo:
Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision. 391-404 - Samuel Cahyawijaya, Holy Lovenia, Pascale Fung:
LLMs Are Few-Shot In-Context Low-Resource Language Learners. 405-433 - Yuekun Yao, Alexander Koller:
Simple and effective data augmentation for compositional generalization. 434-449 - Tianyang Liu, Fei Wang, Muhao Chen:
Rethinking Tabular Data Understanding with Large Language Models. 450-482 - Qin Liu, Fei Wang, Chaowei Xiao, Muhao Chen:
From Shortcuts to Triggers: Backdoor Defense with Denoised PoE. 483-496 - Rahul Kumar, Amar Raja Dibbu, Shrutendra Harsola, Vignesh Subrahmaniam, Ashutosh Modi:
BookSQL: A Large Scale Text-to-SQL Dataset for Accounting Domain. 497-516 - Shamik Roy, Sailik Sengupta, Daniele Bonadiman, Saab Mansour, Arshit Gupta:
FLAP: Flow-Adhering Planning with Constrained Decoding in LLMs. 517-539 - Yuxi Feng, Laks V. S. Lakshmanan:
DuRE: Dual Contrastive Self Training for Semi-Supervised Relation Extraction. 540-555 - Zhen Yu, Zhenhua Chen, Kun He:
Query-Efficient Textual Adversarial Example Generation for Black-Box Attacks. 556-569 - Kung-Hsiang Huang, Philippe Laban, Alexander R. Fabbri, Prafulla Kumar Choubey, Shafiq Joty, Caiming Xiong, Chien-Sheng Wu:
Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles. 570-593 - Haoyi Qiu, Kung-Hsiang Huang, Jingnong Qu, Nanyun Peng:
AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation. 594-608 - Lang Cao, Zifeng Wang, Cao Xiao, Jimeng Sun:
PILOT: Legal Case Outcome Prediction with Case Law. 609-621 - Zequan Liu, Jiawen Lyn, Wei Zhu, Xing Tian, Yvette Graham:
ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models. 622-641 - Heng-Jui Chang, James R. Glass:
R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces. 642-662 - Yifan Wang, Yafei Liu, Chufan Shi, Haoling Li, Chen Chen, Haonan Lu, Yujiu Yang:
InsCL: A Data-efficient Continual Learning Paradigm for Fine-tuning Large Language Models with Instructions. 663-677 - Saiteja Utpala, Alex Gu, Pin-Yu Chen:
Language Agnostic Code Embeddings. 678-691 - Teli Ma, Rong Li, Junwei Liang:
An Examination of the Compositionality of Large Generative Vision-Language Models. 692-705 - Victoria Graf, Qin Liu, Muhao Chen:
Two Heads are Better than One: Nested PoE for Robust Defense Against Multi-Backdoors. 706-718 - Jonathan Rusert:
VertAttack: Taking Advantage of Text Classifiers' Horizontal Vision. 719-732 - Cong-Duy Nguyen, Thong Nguyen, Xiaobao Wu, Anh Tuan Luu:
KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning. 733-749 - Jian Zhu, Changbing Yang, Farhan Samir, Jahurul Islam:
The taste of IPA: Towards open-vocabulary keyword spotting and forced alignment in any language. 750-772 - Yunqi Zhang, Songda Li, Chunyuan Deng, Luyi Wang, Hui Zhao:
Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language Tasks. 773-791 - Xianming Li, Jing Li:
BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings. 792-804 - Weixuan Wang, Barry Haddow, Alexandra Birch, Wei Peng:
Assessing Factual Reliability of Large Language Model Knowledge. 805-819 - Zhenpeng Su, Xing Wu, Wei Zhou, Guangyuan Ma, Songlin Hu:
Dial-MAE: ConTextual Masked Auto-Encoder for Retrieval-based Dialogue Systems. 820-830 - Cheng Qian, Chenyan Xiong, Zhenghao Liu, Zhiyuan Liu:
Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model. 831-854 - Letian Wang, Xianggen Liu, Jiancheng Lv:
Create! Don't Repeat: A Paradigm Shift in Multi-Label Augmentation through Label Creative Generation. 855-869 - Ali Safaya, Deniz Yuret:
Neurocache: Efficient Vector Retrieval for Long-range Language Modeling. 870-883 - Haoran Yang, Yumeng Zhang, Jiaqi Xu, Hongyuan Lu, Pheng-Ann Heng, Wai Lam:
Unveiling the Generalization Power of Fine-Tuned Large Language Models. 884-899 - Ruixin Hong, Hongming Zhang, Xinyu Pang, Dong Yu, Changshui Zhang:
A Closer Look at the Self-Verification Abilities of Large Language Models in Logical Reasoning. 900-925 - Fangkai Jiao, Zhiyang Teng, Bosheng Ding, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty:
Exploring Self-supervised Logic-enhanced Training for Large Language Models. 926-941 - Debrup Das, Debopriyo Banerjee, Somak Aditya, Ashish Kulkarni:
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning. 942-966 - Dawei Zhu, Wenhao Wu, Yifan Song, Fangwei Zhu, Ziqiang Cao, Sujian Li:
CoUDA: Coherence Evaluation via Unified Data Augmentation. 967-978 - Vipul Raheja, Dimitris Alikaniotis, Vivek Kulkarni, Bashar Alhafni, Dhruv Kumar:
mEdIT: Multilingual Text Editing via Instruction Tuning. 979-1001 - Yunchao Zhang, Zonglin Di, Kaiwen Zhou, Cihang Xie, Xin Wang:
Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning. 1002-1016 - Gilad Deutch, Nadav Magar, Tomer Bar Natan, Guy Dar:
In-context Learning and Gradient Descent Revisited. 1017-1028 - Olufunke Oluyemi Sarumi, Béla Neuendorf, Joan Plepi, Lucie Flek, Jörg Schlötterer, Charles Welch:
Corpus Considerations for Annotator Modeling and Scaling. 1029-1040 - Che Jiang, Biqing Qi, Xiangyu Hong, Dayuan Fu, Yang Cheng, Fandong Meng, Mo Yu, Bowen Zhou, Jie Zhou:
On Large Language Models' Hallucination with Regard to Known Facts. 1041-1053 - Li Lucy, Su Lin Blodgett, Milad Shokouhi, Hanna M. Wallach, Alexandra Olteanu:
"One-Size-Fits-All"? Examining Expectations around What Constitute "Fair" or "Good" NLG System Behaviors. 1054-1089 - Jian Guan, Jesse Dodge, David Wadden, Minlie Huang, Hao Peng:
Language Models Hallucinate, but May Excel at Fact Verification. 1090-1111 - Bowen Ding, Qingkai Min, Shengkun Ma, Yingjie Li, Linyi Yang, Yue Zhang:
A Rationale-centric Counterfactual Data Augmentation Method for Cross-Document Event Coreference Resolution. 1112-1140 - Mengxin Zheng, Jiaqi Xue, Xun Chen, Yanshan Wang, Qian Lou, Lei Jiang:
TrojFSP: Trojan Insertion in Few-shot Prompt Tuning. 1141-1151 - Yi Luo, Zhenghao Lin, Yuhao Zhang, Jiashuo Sun, Chen Lin, Chengjin Xu, Xiangdong Su, Yelong Shen, Jian Guo, Yeyun Gong:
Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models. 1152-1197 - Juan Diego Rodriguez, Katrin Erk, Greg Durrett:
X-PARADE: Cross-Lingual Textual Entailment and Information Divergence across Paragraphs. 1198-1222 - Rajiv Movva, Sidhika Balachandar, Kenny Peng, Gabriel Agostini, Nikhil Garg, Emma Pierson:
Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers. 1223-1243 - Zhehao Zhang, Yan Gao, Jian-Guang Lou:
E⁵: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, Exhibit and Extrapolate. 1244-1258 - Fangyu Lei, Qian Liu, Yiming Huang, Shizhu He, Jun Zhao, Kang Liu:
S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Model. 1259-1286 - Fuxiao Liu, Xiaoyang Wang, Wenlin Yao, Jianshu Chen, Kaiqiang Song, Sangwoo Cho, Yaser Yacoob, Dong Yu:
MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning. 1287-1310 - Chengxu Zhuang, Evelina Fedorenko, Jacob Andreas:
Visual Grounding Helps Learn Word Meanings in Low-Data Regimes. 1311-1329 - Hendra Setiawan:
Accurate Knowledge Distillation via n-best Reranking. 1330-1345 - Zhaorun Chen, Zhuokai Zhao, Zhihong Zhu, Ruiqi Zhang, Xiang Li, Bhiksha Raj, Huaxiu Yao:
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition. 1346-1362 - Tal Schuster, Ádám D. Lelkes, Haitian Sun, Jai Gupta, Jonathan Berant, William W. Cohen, Donald Metzler:
SEMQA: Semi-Extractive Multi-Source Question Answering. 1363-1381 - Hao Lang, Fei Huang, Yongbin Li:
Fine-Tuning Language Models with Reward Learning on Policy. 1382-1392 - Robert Pugh, Francis M. Tyers:
A Universal Dependencies Treebank for Highland Puebla Nahuatl. 1393-1403 - Haryo Akbarianto Wibowo, Erland Hilman Fuadi, Made Nindyatama Nityasya, Radityo Eko Prasojo, Alham Fikri Aji:
COPAL-ID: Indonesian Language Reasoning with Local Culture and Nuances. 1404-1422 - Xiusi Chen, Hongzhi Wen, Sreyashi Nag, Chen Luo, Qingyu Yin, Ruirui Li, Zheng Li, Wei Wang:
IterAlign: Iterative Constitutional Alignment of Large Language Models. 1423-1433 - Chia-Hsuan Lee, Hao Cheng, Mari Ostendorf:
OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking. 1434-1445 - Marco Valentino, Jordan Meadows, Lan Zhang, André Freitas:
Multi-Operational Mathematical Derivations in Latent Space. 1446-1458 - Chenglei Si, Navita Goyal, Tongshuang Wu, Chen Zhao, Shi Feng, Hal Daumé III, Jordan L. Boyd-Graber:
Large Language Models Help Humans Verify Truthfulness - Except When They Are Convincingly Wrong. 1459-1474 - Brendon Boldt, David R. Mortensen:
XferBench: a Data-Driven Benchmark for Emergent Language. 1475-1489 - Se-eun Yoon, Zhankui He, Jessica Maria Echterhoff, Julian J. McAuley:
Evaluating Large Language Models as Generative User Simulators for Conversational Recommendation. 1490-1504 - Jordan Meadows, Marco Valentino, Damien Teney, André Freitas:
A Symbolic Framework for Evaluating Mathematical Reasoning and Generalisation with Transformers. 1505-1523 - David Chanin, Anthony Hunter, Oana-Maria Camburu:
Identifying Linear Relational Concepts in Large Language Models. 1524-1535 - Venelin Kovatchev, Matthew Lease:
Benchmark Transparency: Measuring the Impact of Data on Evaluation. 1536-1551 - Jillian Fisher, Ximing Lu, Jaehun Jung, Liwei Jiang, Zaïd Harchaoui, Yejin Choi:
JAMDEC: Unsupervised Authorship Obfuscation using Constrained Decoding over Small Language Models. 1552-1581 - Zhenyu He, Zexuan Zhong, Tianle Cai, Jason D. Lee, Di He:
REST: Retrieval-Based Speculative Decoding. 1582-1595 - Sihao Chen, Hongming Zhang, Tong Chen, Ben Zhou, Wenhao Yu, Dian Yu, Baolin Peng, Hongwei Wang, Dan Roth, Dong Yu:
Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations. 1596-1609 - Mobashir Sadat, Cornelia Caragea:
MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference. 1610-1629 - Bohan Zhang, Yixin Wang, Paramveer Dhillon:
Causal Inference for Human-Language Model Collaboration. 1630-1647 - Zezhong Wang, Fangkai Yang, Lu Wang, Pu Zhao, Hongru Wang, Liang Chen, Qingwei Lin, Kam-Fai Wong:
SELF-GUARD: Empower the LLM to Safeguard Itself. 1648-1668 - Jinpeng Li, Hang Yu, Xiangfeng Luo, Qian Liu:
COSIGN: Contextual Facts Guided Generation for Knowledge Graph Completion. 1669-1682 - Zhewei Sun, Qian Hu, Rahul Gupta, Richard S. Zemel, Yang Xu:
Toward Informal Language Processing: Knowledge of Slang in Large Language Models. 1683-1701 - Vivek Verma, Eve Fleisig, Nicholas Tomlin, Dan Klein:
Ghostbuster: Detecting Text Ghostwritten by Large Language Models. 1702-1717 - Jiahao Zhang, Haiyang Zhang, Dongmei Zhang, Yong Liu, Shen Huang:
End-to-End Beam Retrieval for Multi-Hop Question Answering. 1718-1731 - Binghao Tang, Boda Lin, Haolong Yan, Si Li:
Leveraging Generative Large Language Models with Visual Instruction and Demonstration Retrieval for Multimodal Sarcasm Detection. 1732-1742 - Xiaojun Kuang, C. L. Philip Chen, Shuzhen Li, Tong Zhang:
Multi-Scale Prompt Memory-Augmented Model for Black-Box Scenarios. 1743-1757 - Chenming Tang, Fanyi Qu, Yunfang Wu:
Ungrammatical-syntax-based In-context Example Selection for Grammatical Error Correction. 1758-1770 - Akari Asai, Sneha Kudugunta, Xinyan Yu, Terra Blevins, Hila Gonen, Machel Reid, Yulia Tsvetkov, Sebastian Ruder, Hannaneh Hajishirzi:
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer. 1771-1800 - Yanhe Fu, Yanan Cao, Qingyue Wang, Yi Liu:
TISE: A Tripartite In-context Selection Method for Event Argument Extraction. 1801-1818 - Zhaofeng Wu, Linlu Qiu, Alexis Ross, Ekin Akyürek, Boyuan Chen, Bailin Wang, Najoung Kim, Jacob Andreas, Yoon Kim:
Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks. 1819-1862 - Yucheng Wang, Bowen Yu, Yilin Liu, Shudong Lu:
TRUE-UIE: Two Universal Relations Unify Information Extraction Tasks. 1863-1876 - Zifeng Ding, Heling Cai, Jingpei Wu, Yunpu Ma, Ruotong Liao, Bo Xiong, Volker Tresp:
zrLLM: Zero-Shot Relational Learning on Temporal Knowledge Graphs with Large Language Models. 1877-1895 - Jielin Qiu, Mengdi Xu, William Han, Seungwhan Moon, Ding Zhao:
Embodied Executable Policy Learning with Language-based Scene Summarization. 1896-1913 - Yuqing Wang, Yun Zhao:
Metacognitive Prompting Improves Understanding in Large Language Models. 1914-1926 - Suyu Ge, Chunting Zhou, Rui Hou, Madian Khabsa, Yi-Chia Wang, Qifan Wang, Jiawei Han, Yuning Mao:
MART: Improving LLM Safety with Multi-round Automatic Red-Teaming. 1927-1937 - Young-Jun Lee, Byungsoo Ko, Han-Gyu Kim, Jonghwan Hyeon, Ho-Jin Choi:
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset. 1938-1963 - Keming Lu, Hongyi Yuan, Runji Lin, Junyang Lin, Zheng Yuan, Chang Zhou, Jingren Zhou:
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models. 1964-1974 - Jiarui Liu, Wenkai Li, Zhijing Jin, Mona T. Diab:
Automatic Generation of Model and Data Cards: A Step Towards Responsible AI. 1975-1997 - Chen Liu, Jonas Pfeiffer, Ivan Vulic, Iryna Gurevych:
FUN with Fisher: Improving Generalization of Adapter-Based Cross-lingual Transfer with Scheduled Unfreezing. 1998-2015 - Chen Liu, Fajri Koto, Timothy Baldwin, Iryna Gurevych:
Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings. 2016-2039 - Shir Lissak, Nitay Calderon, Geva Shenkman, Yaakov Ophir, Eyal Fruchter, Anat Brunstein Klomek, Roi Reichart:
The Colorful Future of LLMs: Evaluating and Improving LLMs as Emotional Supporters for Queer Youth. 2040-2079 - Jianli Zhao, Changhao Xu, Bin. Jiang:
IPED: An Implicit Perspective for Relational Triple Extraction based on Diffusion Model. 2080-2092 - Vishvak Murahari, Ameet Deshpande, Peter Clark, Tanmay Rajpurohit, Ashish Sabharwal, Karthik Narasimhan, Ashwin Kalyan:
QualEval: Qualitative Evaluation for Model Improvement. 2093-2111 - Kehuan Yan, Peichao Lai, Yilei Wang:
Quantum-inspired Language Model with Lindblad Master Equation and Interference Measurement for Sentiment Analysis. 2112-2121 - Dongsheng Zhu, Daniel Tang, Weidong Han, Jinghui Lu, Yukun Zhao, Guoliang Xing, Junfeng Wang, Dawei Yin:
VisLingInstruct: Elevating Zero-Shot Learning in Multi-Modal Language Models with Autonomous Instruction Optimization. 2122-2135