default search action
ACL 2024: Bangkok, Thailand
- Lun-Wei Ku, Andre Martins, Vivek Srikumar:
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2024, Bangkok, Thailand, August 11-16, 2024. Association for Computational Linguistics 2024, ISBN 979-8-89176-094-3 - Frontmatter.
- Zhengxin Zhang, Dan Zhao, Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Qing Li, Yong Jiang, Zhihao Jia:
Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models. 1-17 - Hanlei Zhang, Hua Xu, Fei Long, Xin Wang, Kai Gao:
Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances. 18-35 - Yafu Li, Qintong Li, Leyang Cui, Wei Bi, Zhilin Wang, Longyue Wang, Linyi Yang, Shuming Shi, Yue Zhang:
MAGE: Machine-generated Text Detection in the Wild. 36-53 - Haoran Li, Dadi Guo, Donghao Li, Wei Fan, Qi Hu, Xin Liu, Chunkit Chan, Duanyi Yao, Yuan Yao, Yangqiu Song:
PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models. 54-73 - Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, EngSiong Chng:
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators. 74-90 - Yanzhi Xu, Yueying Hua, Shichen Li, Zhongqing Wang:
Exploring Chain-of-Thought for Multi-modal Metaphor Detection. 91-101 - Dayou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu:
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation. 102-116 - Kai Chen, Ye Wang, Yitong Li, Aiping Li, Han Yu, Xin Song:
A Unified Temporal Knowledge Graph Reasoning Model Towards Interpolation and Extrapolation. 117-132 - Shicheng Xu, Liang Pang, Mo Yu, Fandong Meng, Huawei Shen, Xueqi Cheng, Jie Zhou:
Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation. 133-145 - Yong Hu, Fandong Meng, Jie Zhou:
CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers. 146-159 - Charu James, Mayank Nagda, Nooshin Haji Ghassemi, Marius Kloft, Sophie Fellenz:
Evaluating Dynamic Topic Models. 160-176 - Guanting Dong, Hongyi Yuan, Keming Lu, Chengpeng Li, Mingfeng Xue, Dayiheng Liu, Wei Wang, Zheng Yuan, Chang Zhou, Jingren Zhou:
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition. 177-198 - Shanshan Xu, T. Y. S. S. Santosh, Oana Ichim, Barbara Plank, Matthias Grabmair:
Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification. 199-216 - Dhairya Dalal, Marco Valentino, André Freitas, Paul Buitelaar:
Inference to the Best Explanation in Large Language Models. 217-235 - Eduard Poesina, Cornelia Caragea, Radu Tudor Ionescu:
A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus. 236-253 - Xiusi Chen, Jyun-Yu Jiang, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Wei Wang:
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering. 254-266 - Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Hassan Foroosh, Dong Yu, Fei Liu:
SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs. 267-278 - Qingyun Wang, Doug Downey, Heng Ji, Tom Hope:
SciMON: Scientific Inspiration Machines Optimized for Novelty. 279-299 - Yiren Jian, Tingkai Liu, Yunzhe Tao, Chunhui Zhang, Soroush Vosoughi, Hongxia Yang:
Expedited Training of Visual Conditioned Language Generation via Redundancy Reduction. 300-314 - Abhishek Kumar, Robert Morabito, Sanzhar Umbet, Jad Kabbara, Ali Emami:
Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models. 315-334 - Weixuan Wang, Barry Haddow, Alexandra Birch:
Retrieval-Augmented Multilingual Knowledge Editing. 335-354 - Brendan Park, Madeline Janecek, Naser Ezzati-Jivan, Yifeng Li, Ali Emami:
Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge. 355-374 - Abhishek Kumar, Sarfaroz Yunusov, Ali Emami:
Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models. 375-392 - Alexandria Leto, Elliot Pickens, Coen D. Needell, David Rothschild, Maria Leonor Pacheco:
Framing in the Presence of Supporting Data: A Case Study in U.S. Economic News. 393-415 - Xiyao Wang, Yuhang Zhou, Xiaoyu Liu, Hongjin Lu, Yuancheng Xu, Feihong He, Jaehong Yoon, Taixi Lu, Fuxiao Liu, Gedas Bertasius, Mohit Bansal, Huaxiu Yao, Furong Huang:
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences. 416-442 - Chufan Gao, Xuan Wang, Jimeng Sun:
TTM-RE: Memory-Augmented Document-Level Relation Extraction. 443-458 - Letian Peng, Yuwei Zhang, Zilong Wang, Jayanth Srinivasa, Gaowen Liu, Zihan Wang, Jingbo Shang:
Answer is All You Need: Instruction-following Text Embedding via Answering the Question. 459-477 - Yuhang Zhou, Paiheng Xu, Xiaoyu Liu, Bang An, Wei Ai, Furong Huang:
Explore Spurious Correlations at the Concept Level in Language Models for Text Classification. 478-492 - Qi Cheng, Michael Boratko, Pranay Kumar Yelugam, Tim O'Gorman, Nalini Singh, Andrew McCallum, Xiang Li:
Every Answer Matters: Evaluating Commonsense with Probabilistic Measures. 493-506 - Yueqi Xie, Minghong Fang, Renjie Pi, Neil Gong:
GradSafe: Detecting Jailbreak Prompts for LLMs via Safety-Critical Gradient Analysis. 507-518 - Gyeongeun Lee, Christina Wong, Meghan Guo, Natalie Parde:
Pouring Your Heart Out: Investigating the Role of Figurative Language in Online Expressions of Empathy. 519-529 - Luran Wang, Mark J. F. Gales, Vatsal Raina:
An Information-Theoretic Approach to Analyze NLP Classification Tasks. 530-551 - Yuwei Zhang, Siffi Singh, Sailik Sengupta, Igor Shalyminov, Hang Su, Hwanjun Song, Saab Mansour:
Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders. 552-567 - Taiqi He, Kwanghee Choi, Lindia Tjuatja, Nathaniel Robinson, Jiatong Shi, Shinji Watanabe, Graham Neubig, David R. Mortensen, Lori S. Levin:
Wav2Gloss: Generating Interlinear Glossed Text from Speech. 568-582 - Yibo Hu, Erick Skorupa Parolin, Latifur Khan, Patrick T. Brandt, Javier Osorio, Vito D'Orazio:
Leveraging Codebook Knowledge with NLI and ChatGPT for Zero-Shot Political Relation Classification. 583-603 - Ziyao Xu, Houfeng Wang:
SPOR: A Comprehensive and Practical Evaluation Method for Compositional Generalization in Data-to-Text Generation. 604-621 - Haochen Shi, Zhiyuan Sun, Xingdi Yuan, Marc-Alexandre Côté, Bang Liu:
OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following. 622-636 - Ying Shen, Zhiyang Xu, Qifan Wang, Yu Cheng, Wenpeng Yin, Lifu Huang:
Multimodal Instruction Tuning with Conditional Mixture of LoRA. 637-648 - Yiqing Xie, Sheng Zhang, Hao Cheng, Pengfei Liu, Zelalem Gero, Cliff Wong, Tristan Naumann, Hoifung Poon, Carolyn P. Rosé:
DocLens: Multi-aspect Fine-grained Medical Text Evaluation. 649-679 - Congying Xia, Chen Xing, Jiangshu Du, Xinyi Yang, Yihao Feng, Ran Xu, Wenpeng Yin, Caiming Xiong:
FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability. 680-699 - Young Hyun Yoo, Jii Cha, Changhyeon Kim, Taeuk Kim:
Hyper-CL: Conditioning Sentence Representations with Hypernetworks. 700-711 - Seong Hoon Lim, Taejun Yun, Jinhyeon Kim, Jihun Choi, Taeuk Kim:
Analysis of Multi-Source Language Training in Cross-Lingual Transfer. 712-725 - Sreyan Ghosh, Utkarsh Tyagi, Sonal Kumar, Chandra Kiran Reddy Evuru, Ramaneswaran S., S. Sakshi, Dinesh Manocha:
ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions. 726-748 - Lucas Bandarkar, Davis Liang, Benjamin Muller, Mikel Artetxe, Satya Narayan Shukla, Donald Husa, Naman Goyal, Abhinandan Krishnan, Luke Zettlemoyer, Madian Khabsa:
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants. 749-775 - Chenyang An, Zhibo Chen, Qihao Ye, Emily First, Letian Peng, Jiayun Zhang, Zihan Wang, Sorin Lerner, Jingbo Shang:
Learn from Failure: Fine-tuning LLMs with Trial-and-Error Data for Intuitionistic Propositional Logic Proving. 776-790 - Saehyung Lee, Sangwon Yu, Junsung Park, Jihun Yi, Sungroh Yoon:
Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach. 791-809 - Inna W. Lin, Ashish Sharma, Christopher Michael Rytting, Adam S. Miner, Jina Suh, Tim Althoff:
IMBUE: Improving Interpersonal Effectiveness through Simulation and Just-in-time Feedback with Human-Language Model Interaction. 810-840 - Huawei Lin, Jikai Long, Zhaozhuo Xu, Weijie Zhao:
Token-wise Influential Training Data Retrieval for Large Language Models. 841-860 - Maxwell A. Weinzierl, Sanda M. Harabagiu:
Tree-of-Counterfactual Prompting for Zero-Shot Stance Detection. 861-880 - Jing Yu Koh, Robert Lo, Lawrence Jang, Vikram Duvvur, Ming Chong Lim, Po-Yu Huang, Graham Neubig, Shuyan Zhou, Russ Salakhutdinov, Daniel Fried:
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks. 881-905 - Hwanjun Song, Hang Su, Igor Shalyminov, Jason Cai, Saab Mansour:
FineSurE: Fine-grained Summarization Evaluation using LLMs. 906-922 - Daechul Ahn, Yura Choi, Youngjae Yu, Dongyeop Kang, Jonghyun Choi:
Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback. 923-940 - Jingtao Zhan, Qingyao Ai, Yiqun Liu, Yingwei Pan, Ting Yao, Jiaxin Mao, Shaoping Ma, Tao Mei:
Prompt Refinement with Image Pivot for Text-to-Image Generation. 941-954 - Masato Mita, Soichiro Murakami, Akihiko Kato, Peinan Zhang:
Striking Gold in Advertising: Standardization and Exploration of Ad Text Generation. 955-972 - Zhaowei Wang, Wei Fan, Qing Zong, Hongming Zhang, Sehyun Choi, Tianqing Fang, Xin Liu, Yangqiu Song, Ginny Y. Wong, Simon See:
AbsInstruct: Eliciting Abstraction Ability from LLMs through Explanation Tuning with Plausibility Estimation. 973-994 - Runlong Zhou, Simon S. Du, Beibin Li:
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs. 995-1015 - Cheng Yang, Puli Chen, Qingbao Huang:
Can ChatGPT's Performance be Improved on Verb Metaphor Detection Tasks? Bootstrapping and Combining Tacit Knowledge. 1016-1027 - Zhaorui Yang, Tianyu Pang, Haozhe Feng, Han Wang, Wei Chen, Minfeng Zhu, Qian Liu:
Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning. 1028-1043 - Kun Zhu, Xiaocheng Feng, Xiyuan Du, Yuxuan Gu, Weijiang Yu, Haotian Wang, Qianglong Chen, Zheng Chu, Jingchang Chen, Bing Qin:
An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation. 1044-1069 - Zhengping Jiang, Yining Lu, Hanjie Chen, Daniel Khashabi, Benjamin Van Durme, Anqi Liu:
RORA: Robust Free-Text Rationale Evaluation. 1070-1087 - Cheng Qian, Bingxiang He, Zhong Zhuang, Jia Deng, Yujia Qin, Xin Cong, Zhong Zhang, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun:
Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents. 1088-1113 - Zeyuan Wang, Qiang Zhang, Keyan Ding, Ming Qin, Xiang Zhuang, Xiaotong Li, Huajun Chen:
InstructProtein: Aligning Human and Protein Language via Knowledge Instruction. 1114-1136 - Aparna Elangovan, Ling Liu, Lei Xu, Sravan Babu Bodapati, Dan Roth:
ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models. 1137-1160 - Jingxuan Tu, Keer Xu, Liulu Yue, Bingyang Ye, Kyeongmin Rim, James Pustejovsky:
Linguistically Conditioned Semantic Textual Similarity. 1161-1172 - Zheng Chu, Jingchang Chen, Qianglong Chen, Weijiang Yu, Tao He, Haotian Wang, Weihua Peng, Ming Liu, Bing Qin, Ting Liu:
Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future. 1173-1203 - Zheng Chu, Jingchang Chen, Qianglong Chen, Weijiang Yu, Haotian Wang, Ming Liu, Bing Qin:
TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models. 1204-1228 - Zheng Chu, Jingchang Chen, Qianglong Chen, Haotian Wang, Kun Zhu, Xiyuan Du, Weijiang Yu, Ming Liu, Bing Qin:
BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering. 1229-1248 - Siyu Yuan, Jiangjie Chen, Changzhi Sun, Jiaqing Liang, Yanghua Xiao, Deqing Yang:
ANALOGYKB: Unlocking Analogical Reasoning of Language Models with A Million-scale Knowledge Base. 1249-1265 - Yujie Feng, Xu Chu, Yongxin Xu, Guangyuan Shi, Bo Liu, Xiao-Ming Wu:
TaSL: Continual Dialog State Tracking via Task Skill Localization and Consolidation. 1266-1279 - Damai Dai, Chengqi Deng, Chenggang Zhao, R. X. Xu, Huazuo Gao, Deli Chen, Jiashi Li, Wangding Zeng, Xingkai Yu, Y. Wu, Zhenda Xie, Y. K. Li, Panpan Huang, Fuli Luo, Chong Ruan, Zhifang Sui, Wenfeng Liang:
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models. 1280-1297 - Hongjin Qian, Zheng Liu, Kelong Mao, Yujia Zhou, Zhicheng Dou:
Grounding Language Model with Chunking-Free In-Context Retrieval. 1298-1311 - Jiaxin Bai, Yicheng Wang, Tianshi Zheng, Yue Guo, Xin Liu, Yangqiu Song:
Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation. 1312-1329 - Shizhe Diao, Pengcheng Wang, Yong Lin, Rui Pan, Xiang Liu, Tong Zhang:
Active Prompting with Chain-of-Thought for Large Language Models. 1330-1350 - Xiangyu Zhao, Bo Liu, Qijiong Liu, Guangyuan Shi, Xiao-Ming Wu:
EasyGen: Easing Multimodal Generation with BiDiffuser and LLMs. 1351-1370 - Haochen Li, Xin Zhou, Zhiqi Shen:
Rewriting the Code: A Simple Method for Large Language Model Augmented Code Search. 1371-1389 - Naomi Baes, Nick Haslam, Ekaterina Vylomova:
A Multidimensional Framework for Evaluating Lexical Semantic Change with Social Science Applications. 1390-1415 - Jianheng Huang, Leyang Cui, Ante Wang, Chengyi Yang, Xinting Liao, Linfeng Song, Junfeng Yao, Jinsong Su:
Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal. 1416-1428 - Baizhou Huang, Shuai Lu, Xiaojun Wan, Nan Duan:
Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency. 1429-1450 - Weitao Li, Junkai Li, Weizhi Ma, Yang Liu:
Citation-Enhanced Generation for LLM-based Chatbots. 1451-1466 - Haoyang Wen, Eduard H. Hovy, Alexander Hauptmann:
Transitive Consistency Constrained Learning for Entity-to-Entity Stance Detection. 1467-1480 - Jiahao Li, Quan Wang, Licheng Zhang, Guoqing Jin, Zhendong Mao:
Feature-Adaptive and Data-Scalable In-Context Learning. 1481-1494 - Yizhe Zhang, Jiarui Lu, Navdeep Jaitly:
Probing the Multi-turn Planning Capabilities of LLMs via 20 Question Games. 1495-1516 - Shangqing Tu, Yuliang Sun, Yushi Bai, Jifan Yu, Lei Hou, Juanzi Li:
WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models. 1517-1542 - Yida Zhao, Chao Lou, Kewei Tu:
Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models. 1543-1556 - Zhengrui Ma, Qingkai Fang, Shaolei Zhang, Shoutao Guo, Yang Feng, Min Zhang:
A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation. 1557-1575 - Zhenhua Liu, Tong Zhu, Chuanyuan Tan, Bing Liu, Haonan Lu, Wenliang Chen:
Probing Language Models for Pre-training Data Detection. 1576-1587 - Zhihan Zhang, Yixin Cao, Chenchen Ye, Yunshan Ma, Lizi Liao, Tat-Seng Chua:
Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding. 1588-1606 - Senyu Han, Lu Chen, Li-Min Lin, Zhengshan Xu, Kai Yu:
IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation. 1607-1619 - Jiangxing Wang, Jiachen Li, Xiao Han, Deheng Ye, Zongqing Lu:
Language Model Adaption for Reinforcement Learning with Natural Language Action Space. 1620-1634 - Hiromasa Sakurai, Yusuke Miyao:
Evaluating Intention Detection Capability of Large Language Models in Persuasive Dialogues. 1635-1657 - Huiqiang Jiang, Qianhui Wu, Xufang Luo, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu:
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression. 1658-1677 - Chuhao Jin, Kening Ren, Lingzhen Kong, Xiting Wang, Ruihua Song, Huan Chen:
Persuading across Diverse Domains: a Dataset and Persuasion Large Language Model. 1678-1706 - Mengxi Xiao, Qianqian Xie, Ziyan Kuang, Zhicheng Liu, Kailai Yang, Min Peng, Weiguang Han, Jimin Huang:
HealMe: Harnessing Cognitive Reframing in Large Language Models for Psychotherapy. 1707-1725 - Zirun Guo, Tao Jin, Zhou Zhao:
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition. 1726-1736 - Bi-Cheng Yan, Jiun-Ting Li, Yi-Cheng Wang, Hsin-Wei Wang, Tien-Hong Lo, Yung-Chang Hsu, Wei-Cheng Chao, Berlin Chen:
An Effective Pronunciation Assessment Approach Leveraging Hierarchical Transformers and Pre-training Strategies. 1737-1747 - Wei Li, Houfeng Wang:
Detection-Correction Structure via General Language Model for Grammatical Error Correction. 1748-1763 - Yongxin Zhu, Dan Su, Liqiang He, Linli Xu, Dong Yu:
Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer. 1764-1775 - Lichen Zhang, Shuai Lu, Nan Duan:
Selene: Pioneering Automated Proof in Software Verification. 1776-1789 - Junlong Li, Fan Zhou, Shichao Sun, Yikai Zhang, Hai Zhao, Pengfei Liu:
Dissecting Human and LLM Preferences. 1790-1811 - Tao Sun, Linzheng Chai, Jian Yang, Yuwei Yin, Hongcheng Guo, Jiaheng Liu, Bing Wang, Liqun Yang, Zhoujun Li:
UniCoder: Scaling Code Large Language Model via Universal Code. 1812-1824 - Xianming Li, Jing Li:
AoE: Angle-optimized Embeddings for Semantic Textual Similarity. 1825-1839 - Xintao Wang, Yunze Xiao, Jen-tse Huang, Siyu Yuan, Rui Xu, Haoran Guo, Quan Tu, Yaying Fei, Ziang Leng, Wei Wang, Jiangjie Chen, Cheng Li, Yanghua Xiao:
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews. 1840-1873 - Shengchao Liu, Xiaoming Liu, Yichen Wang, Zehua Cheng, Chengzhengxu Li, Zhaohan Zhang, Yu Lan, Chao Shen:
Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better. 1874-1889 - Jingwei Ni, Minjing Shi, Dominik Stammbach, Mrinmaya Sachan, Elliott Ash, Markus Leippold:
AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators. 1890-1912 - Tobias Schimanski, Jingwei Ni, Mathias Kraus, Elliott Ash, Markus Leippold:
Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering. 1913-1931 - Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Wei Shen, Limao Xiong, Yuhao Zhou, Xiao Wang, Zhiheng Xi, Xiaoran Fan, Shiliang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin. 1932-1945 - Xiaoying Zhang, Baolin Peng, Ye Tian, Jingyan Zhou, Lifeng Jin, Linfeng Song, Haitao Mi, Helen Meng:
Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation. 1946-1965 - Zheng Wang, Shu Xian Teo, Jieer Ouyang, Yongjun Xu, Wei Shi:
M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions. 1966-1978 - Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou:
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension. 1979-1998 - Tom Kocmi, Vilém Zouhar, Christian Federmann, Matt Post:
Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies. 1999-2014