


default search action
EMNLP 2023: Singapore
- Houda Bouamor, Juan Pino, Kalika Bali:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023. Association for Computational Linguistics 2023, ISBN 979-8-89176-060-8 - Frontmatter.
- Zhebin Zhang, Xinyu Zhang, Yuanhang Ren, Saijiang Shi, Meng Han, Yongkang Wu, Ruofei Lai, Zhao Cao:
IAG: Induction-Augmented Generation Framework for Answering Reasoning Questions. 1-14 - Yuji Yamamoto, Takuya Matsuzaki:
Absolute Position Embedding Learns Sinusoid-like Waves for Attention Based on Relative Position. 15-28 - Jipeng Qiang, Kang Liu, Ying Li, Yun Li, Yi Zhu, Yun-Hao Yuan, Xiaocheng Hu, Xiaoye Ouyang:
Chinese Lexical Substitution: Dataset and Method. 29-42 - Chenkai Sun, Jinning Li, Yi Ren Fung, Hou Pong Chan, Tarek F. Abdelzaher, ChengXiang Zhai, Heng Ji:
Decoding the Silent Majority: Inducing Belief Augmented Social Graph with Large Language Model for Response Forecasting. 43-57 - Yuxuan Yao, Han Wu, Qiling Xu
, Linqi Song
:
Fine-grained Conversational Decoding via Isotropic and Proximal Search. 58-70 - Nicolas Stefanovitch, Jakub Piskorski:
Holistic Inter-Annotator Agreement and Corpus Coherence Estimation in a Large-scale Multilingual Annotation Campaign. 71-86 - Nadav Borenstein, Phillip Rust
, Desmond Elliott
, Isabelle Augenstein
:
PHD: Pixel-Based Language Modeling of Historical Documents. 87-107 - Yiwei Wang
, Yujun Cai, Muhao Chen, Yuxuan Liang, Bryan Hooi:
Primacy Effect of ChatGPT. 108-115 - Akira Kawabata, Saku Sugawara:
Evaluating the Rationale Understanding of Critical Reasoning in Logical Reading Comprehension. 116-143 - Benjamin Muller, John Wieting, Jonathan H. Clark, Tom Kwiatkowski, Sebastian Ruder, Livio Soares, Roee Aharoni, Jonathan Herzig, Xinyi Wang:
Evaluating and Modeling Attribution for Cross-Lingual Question Answering. 144-157 - Akintunde Oladipo, Mofetoluwa Adeyemi, Orevaoghene Ahia, Abraham Toluwase Owodunni, Odunayo Ogundepo, David Ifeoluwa Adelani, Jimmy Lin:
Better Quality Pre-training Data and T5 Models for African Languages. 158-168 - Shawn Tan, Yikang Shen, Zhenfang Chen, Aaron C. Courville, Chuang Gan:
Sparse Universal Transformer. 169-179 - Huao Li, Yu Quan Chong, Simon Stepputtis, Joseph Campbell, Dana Hughes, Charles Lewis, Katia P. Sycara:
Theory of Mind for Multi-Agent Collaboration via Large Language Models. 180-192 - Robert Litschko, Max Müller-Eberstein
, Rob van der Goot, Leon Weber-Genzel, Barbara Plank:
Establishing Trustworthiness: Rethinking Tasks and Model Evaluation. 193-203 - Vaishnavi Himakunthala, Andy Ouyang, Daniel Rose, Ryan He, Alex Mei, Yujie Lu, Chinmay Sonar, Michael Saxon, William Yang Wang:
Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought. 204-219 - Md Tawkat Islam Khondaker, Abdul Waheed, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed:
GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP. 220-247 - Pan Li, Ping Li, Kai Zhang:
Dual-Channel Span for Aspect Sentiment Triplet Extraction. 248-261 - Zhi Li, Yin Zhang:
Cultural Concept Adaptation on Multimodal Reasoning. 262-276 - Farhan Samir, Miikka Silfverberg:
Understanding Compositional Data Augmentation in Typologically Diverse Morphological Inflection. 277-291 - Yifan Li, Yifan Du, Kun Zhou, Jinpeng Wang, Wayne Xin Zhao, Ji-Rong Wen:
Evaluating Object Hallucination in Large Vision-Language Models. 292-305 - Pengfei Cao, Yupu Hao, Yubo Chen
, Kang Liu, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Jun Zhao:
Event Ontology Completion with Hierarchical Structure Evolution Networks. 306-320 - Feihu Jin, Jiajun Zhang, Chengqing Zong:
Parameter-efficient Tuning for Large Language Model without Calculating Its Gradients. 321-330 - Yuanyuan Lei, Ruihong Huang:
Discourse Structures Guided Fine-grained Propaganda Identification. 331-342 - Benjamin Minixhofer, Jonas Pfeiffer, Ivan Vulic:
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models. 343-359 - Ting Wang, Weidong Chen, Yuanhe Tian, Yan Song, Zhendong Mao:
Improving Image Captioning via Predicting Structured Concepts. 360-370 - Alexander Jones, Isaac Caswell, Orhan Firat, Ishank Saxena:
GATITOS: Using a New Multilingual Lexicon for Low-resource Machine Translation. 371-405 - Ge Gao, Hung-Ting Chen, Yoav Artzi, Eunsol Choi:
Continually Improving Extractive QA via Human Feedback. 406-423 - Zhuo Chen, Chengyue Jiang, Kewei Tu:
Using Interpretation Methods for Model Enhancement. 424-438 - Wenqi Zhang, Yongliang Shen, Qingpeng Nong, Zeqi Tan, Yanna Ma, Weiming Lu:
An Expression Tree Decoding Strategy for Mathematical Equation Generation. 439-456 - Yahan Yang, Elior Sulem
, Insup Lee, Dan Roth:
Bootstrapping Small & High Performance Language Models with Unmasking-Removal Training Policy. 457-464 - Hokeun Yoon, JinYeong Bak:
Diversity Enhanced Narrative Question Generation for Storybooks. 465-482 - Chengyu Dong, Zihan Wang, Jingbo Shang:
Debiasing Made State-of-the-art: Revisiting the Simple Seed-based Weak Supervision for Text Classification. 483-493 - Hang Chen
, Xinyu Yang, Jing Luo, Wenjing Zhu:
How to Enhance Causal Discrimination of Utterances: A Case on Affective Reasoning. 494-512 - Qingyi Si, Yuanxin Liu, Zheng Lin, Peng Fu, Yanan Cao, Weiping Wang:
Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering. 513-529 - Jeremy R. Cole, Michael J. Q. Zhang, Daniel Gillick, Julian Eisenschlos, Bhuwan Dhingra, Jacob Eisenstein:
Selectively Answering Ambiguous Questions. 530-543 - Dong-Ho Lee, Kian Ahrabian, Woojeong Jin, Fred Morstatter, Jay Pujara:
Temporal Knowledge Graph Forecasting Without Knowledge Using In-Context Learning. 544-557 - Eunjeong Hwang, Veronika Thost, Vered Shwartz, Tengfei Ma:
Knowledge Graph Compression Enhances Diverse Commonsense Generation. 558-572 - Yiyuan Li, Rakesh R. Menon, Sayan Ghosh, Shashank Srivastava:
Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models. 573-591 - Shih-Yang Liu, Zechun Liu, Xijie Huang, Pingcheng Dong, Kwang-Ting Cheng:
LLM-FP4: 4-Bit Floating-Point Quantized Transformers. 592-605 - Chen Tang, Shun Wang, Tomas Goldsack, Chenghua Lin:
Improving Biomedical Abstractive Summarisation with Knowledge Aggregation from Citation Papers. 606-618 - Xi Ye, Greg Durrett:
Explanation Selection Using Unlabeled Data for Chain-of-Thought Prompting. 619-637 - David Dale, Elena Voita, Janice Lam, Prangthip Hansanti, Christophe Ropers, Elahe Kalbassi, Cynthia Gao, Loïc Barrault, Marta R. Costa-jussà:
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation. 638-653 - Dan He, Minh-Quang Pham, Thanh-Le Ha, Marco Turchi:
Gradient-based Gradual Pruning for Language-Specific Multilingual Neural Machine Translation. 654-670 - Chenxi Whitehouse, Monojit Choudhury, Alham Fikri Aji:
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance. 671-686 - Chenxu Wang, Ping Jian, Mu Huang:
Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition. 687-699 - Jiwan Chung, Youngjae Yu:
VLIS: Unimodal Language Models Guide Multimodal Language Generation. 700-721 - Siddharth Suresh, Kushin Mukherjee, Xizheng Yu, Wei-Chun Huang, Lisa Padua, Timothy T. Rogers:
Conceptual structure coheres in human cognition but not in large language models. 722-738 - Yujie Feng, Zexin Lu
, Bo Liu, Liming Zhan, Xiao-Ming Wu:
Towards LLM-driven Dialogue State Tracking. 739-755 - Haoyu Zhang
, Yu Wang, Guanghao Yin, Kejun Liu, Yuanyuan Liu, Tianshu Yu:
Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis. 756-767 - Georgios Pantazopoulos, Malvina Nikandrou, Amit Parekh, Bhathiya Hemanthage, Arash Eshghi, Ioannis Konstas, Verena Rieser, Oliver Lemon
, Alessandro Suglia
:
Multitask Multimodal Prompted Training for Interactive Embodied Task Completion. 768-789 - Alisa Liu, Zhaofeng Wu, Julian Michael, Alane Suhr, Peter West, Alexander Koller, Swabha Swayamdipta, Noah A. Smith, Yejin Choi:
We're Afraid Language Models Aren't Modeling Ambiguity. 790-807 - Tianyu Liu, Afra Amini, Mrinmaya Sachan, Ryan Cotterell:
Linear-Time Modeling of Linguistic Structure: An Order-Theoretic Perspective. 808-830 - Guangsheng Bao, Zebin Ou, Yue Zhang:
GEMINI: Controlling The Sentence-Level Summary Style in Abstractive Text Summarization. 831-842 - Wei-Lin Chen, Cheng-Kuang Wu, Hsin-Hsi Chen, Chung-Chi Chen:
Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation. 843-851 - Jihyung Moon, Dong-Ho Lee, Hyundong Cho, Woojeong Jin, Chan Young Park, Minwoo Kim, Jonathan May
, Jay Pujara, Sungjoon Park:
Analyzing Norm Violations in Live-Stream Chat. 852-868 - Harman Singh, Pengchuan Zhang, Qifan Wang, Mengjiao Wang, Wenhan Xiong, Jingfei Du, Yu Chen:
Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality. 869-893 - Seungju Han, Junhyeok Kim, Jack Hessel, Liwei Jiang, Jiwan Chung, Yejin Son, Yejin Choi, Youngjae Yu:
Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commonsense Norms. 894-914 - Tianhang Zhang, Lin Qiu, Qipeng Guo, Cheng Deng, Yue Zhang, Zheng Zhang, Chenghu Zhou, Xinbing Wang, Luoyi Fu:
Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus. 915-932 - Shangbin Feng, Vidhisha Balachandran, Yuyang Bai, Yulia Tsvetkov:
FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge. 933-952 - Xuanli He, Qiongkai Xu
, Jun Wang, Benjamin I. P. Rubinstein, Trevor Cohn:
Mitigating Backdoor Poisoning Attacks through the Lens of Spurious Correlation. 953-967 - Jerry W. Wei, Le Hou, Andrew K. Lampinen, Xiangning Chen, Da Huang, Yi Tay, Xinyun Chen, Yifeng Lu, Denny Zhou, Tengyu Ma, Quoc V. Le:
Symbol tuning improves in-context learning in language models. 968-979 - Jon Gauthier, Roger Levy:
The neural dynamics of word recognition and integration. 980-995 - Gangwoo Kim, Sungdong Kim, Byeongguk Jeon, Joonsuk Park, Jaewoo Kang:
Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models. 996-1009 - Olivia Huang, Eve Fleisig, Dan Klein:
Incorporating Worker Perspectives into MTurk Annotation Practices for NLP. 1010-1028 - Yue Guo, Chenxi Hu, Yi Yang:
Predict the Future from the Past? On the Temporal Data Distribution Shift in Financial Sentiment Classifications. 1029-1038 - Nan Xu, Chunting Zhou, Asli Celikyilmaz
, Xuezhe Ma:
Look-back Decoding for Open-Ended Text Generation. 1039-1050 - Jiaxin Huang, Shixiang Gu, Le Hou, Yuexin Wu, Xuezhi Wang, Hongkun Yu, Jiawei Han:
Large Language Models Can Self-Improve. 1051-1068 - Yue Wang, Hung Le, Akhilesh Gotmare, Nghi D. Q. Bui, Junnan Li, Steven C. H. Hoi:
CodeT5+: Open Code Large Language Models for Code Understanding and Generation. 1069-1088 - Alban Petit, Caio F. Corro, François Yvon:
Structural generalization in COGS: Supertagging is (almost) all you need. 1089-1101 - Qizhi Pei, Wei Zhang, Jinhua Zhu, Kehan Wu, Kaiyuan Gao, Lijun Wu, Yingce Xia, Rui Yan:
BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations. 1102-1123 - Andrea W. Wen-Yi, David Mimno:
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings. 1124-1131 - Jian Wang
, Yi Cheng, Dongding Lin, Chak Tou Leong, Wenjie Li:
Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation. 1132-1143 - Pengyu Wang, Linyang Li, Ke Ren, Botian Jiang, Dong Zhang, Xipeng Qiu:
SeqXGPT: Sentence-Level AI-Generated Text Detection. 1144-1156 - Yilun Zhao, Zhenting Qi, Linyong Nan, Boyu Mi, Yixin Liu, Weijin Zou, Simeng Han, Ruizhe Chen, Xiangru Tang, Yumo Xu, Dragomir Radev, Arman Cohan:
QTSumm: Query-Focused Summarization over Tabular Data. 1157-1172 - Jiaxin Ge, Sanjay Subramanian, Trevor Darrell, Boyi Li:
From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation. 1173-1185 - Ronald Cardenas, Bingsheng Yao
, Dakuo Wang, Yufang Hou:
'Don't Get Too Technical with Me': A Discourse Structure-Based Framework for Automatic Science Journalism. 1186-1202 - Cheng-Fu Yang, Yen-Chun Chen, Jianwei Yang, Xiyang Dai, Lu Yuan, Yu-Chiang Frank Wang, Kai-Wei Chang:
LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following. 1203-1217 - Wenhong Zhu, Hongkun Hao, Rui Wang:
Penalty Decoding: Well Suppress the Self-Reinforcement Effect in Open-Ended Text Generation. 1218-1228 - Jianwei Li, Qi Lei, Wei Cheng, Dongkuan Xu:
Towards Robust Pruning: An Adaptive Knowledge-Retention Pruning Strategy for Language Models. 1229-1247 - Dave Makhervaks, Plia Gillis, Kira Radinsky:
Clinical Contradiction Detection. 1248-1263 - Jiacheng Liu, Wenya Wang
, Dianzhuo Wang
, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi:
Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements. 1264-1287 - Victoria Lin, Louis-Philippe Morency, Eli Ben-Michael:
Text-Transport: Toward Learning Causal Effects of Natural Language. 1288-1304 - Ronak Pradeep, Kai Hui, Jai Gupta, Ádám D. Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran:
How Does Generative Retrieval Scale to Millions of Passages? 1305-1321 - Jiaxin Wen, Pei Ke, Hao Sun, Zhexin Zhang, Chengfei Li, Jinfeng Bai, Minlie Huang:
Unveiling the Implicit Toxicity in Large Language Models. 1322-1338 - Chengwei Qin, Aston Zhang, Zhuosheng Zhang, Jiaao Chen, Michihiro Yasunaga, Diyi Yang:
Is ChatGPT a General-Purpose Natural Language Processing Task Solver? 1339-1384 - Chenghao Xiao, Yizhi Li, G. Thomas Hudson, Chenghua Lin, Noura Al Moubayed:
Length is a Curse and a Blessing for Document-level Semantics. 1385-1396 - Xunjian Yin, Baizhou Huang, Xiaojun Wan:
ALCUNA: Large Language Models Meet New Knowledge. 1397-1414 - Nicholas Collin Suwono, Justin Chih-Yao Chen, Tun-Min Hung, Ting-Hao (Kenneth) Huang, I-Bin Liao, Yung-Hui Li, Lun-Wei Ku, Shao-Hua Sun:
Location-Aware Visual Question Generation with Lightweight Models. 1415-1432 - Eunjeong Hwang, Vered Shwartz:
MemeCap: A Dataset for Captioning and Interpreting Memes. 1433-1445 - Leshem Choshen, Elad Venezian, Shachar Don-Yehiya, Noam Slonim, Yoav Katz:
Where to start? Analyzing the potential value of intermediate models. 1446-1470 - Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani:
Transcending Scaling Laws with 0.1% Extra Compute. 1471-1486 - Minzhi Li, Taiwei Shi, Caleb Ziems, Min-Yen Kan, Nancy F. Chen, Zhengyuan Liu, Diyi Yang:
CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation. 1487-1505 - Moshe Berchansky, Peter Izsak, Avi Caciularu, Ido Dagan, Moshe Wasserblat:
Optimizing Retrieval-augmented Reader Models via Token Elimination. 1506-1524 - Ruichao Yang
, Wei Gao, Jing Ma, Hongzhan Lin, Zhiwei Yang:
WSDMS: Debunk Fake News via Weakly Supervised Detection of Misinforming Sentences with Contextualized Social Wisdom. 1525-1538 - Moxin Li, Wenjie Wang, Fuli Feng, Yixin Cao, Jizhi Zhang, Tat-Seng Chua:
Robust Prompt Optimization for Large Language Models Against Distribution Shifts. 1539-1554 - Martin Josifoski, Marija Sakota, Maxime Peyrard, Robert West:
Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction. 1555-1574 - Haoran Xu, Weiting Tan, Shuyue Stella Li, Yunmo Chen, Benjamin Van Durme, Philipp Koehn, Kenton Murray:
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules. 1575-1587 - Jared Fernandez, Jacob Kahn, Clara Na
, Yonatan Bisk, Emma Strubell:
The Framework Tax: Disparities Between Inference Efficiency in NLP Research and Deployment. 1588-1600 - Mohammadreza Pourreza, Davood Rafiei:
Evaluating Cross-Domain Text-to-SQL Models and Benchmarks. 1601-1611 - Simone Conia, Min Li, Daniel Lee, Umar Farooq Minhas, Ihab F. Ilyas, Yunyao Li:
Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs. 1612-1634 - Chen Jia, Yue Zhang:
Memory-Based Invariance Learning for Out-of-Domain Text Classification. 1635-1647 - Xiuying Wei, Yunchen Zhang, Yuhang Li, Xiangguo Zhang, Ruihao Gong, Jinyang Guo, Xianglong Liu:
Outlier Suppression+: Accurate quantization of large language models by equivalent and effective shifting and scaling. 1648-1665 - Jiaqi Li, Chuanyi Zhang, Miaozeng Du, Dehai Min, Yongrui Chen, Guilin Qi:
Three Stream Based Multi-level Event Contrastive Learning for Text-Video Event Extraction. 1666-1676 - Qi Gou, Zehua Xia, Bowen Yu, Haiyang Yu, Fei Huang, Yongbin Li, Cam-Tu Nguyen:
Diversify Question Generation with Retrieval-Augmented Style Transfer. 1677-1690 - Barrett Martin Lattimer, Patrick Chen, Xinyuan Zhang, Yi Yang:
Fast and Accurate Factual Inconsistency Detection Over Long Documents. 1691-1703 - Adi Simhi, Shaul Markovitch:
Interpreting Embedding Spaces by Conceptualization. 1704-1719 - Jinheon Baek, Soyeong Jeong, Minki Kang, Jong C. Park, Sung Ju Hwang:
Knowledge-Augmented Language Model Verification. 1720-1736 - Yuxuan Hu, Jing Zhang, Haoyang Li, Cuiping Li, Hong Chen:
A Generation-based Deductive Method for Math Word Problems. 1737-1750 - Zeyuan Yang, Peng Li, Yang Liu:
Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation. 1751-1777 - Ryan Shea, Zhou Yu:
Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning. 1778-1795 - Suyu Ge, Chenyan Xiong, Corby Rosset, Arnold Overwijk, Jiawei Han, Paul Bennett:
Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories. 1796-1812 - Po-Nien Kung, Fan Yin, Di Wu, Kai-Wei Chang, Nanyun Peng:
Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks. 1813-1829 - Maxime Bouthors, Josep Maria Crego, François Yvon:
Towards Example-Based NMT with Multi-Levenshtein Transformers. 1830-1846 - Afra Feyza Akyürek, Eric Pan, Garry Kuwanto, Derry Wijaya:
DUnE: Dataset for Unified Editing. 1847-1861 - Rishav Hada, Agrima Seth, Harshita Diddee, Kalika Bali:
"Fifty Shades of Bias": Normative Ratings of Gender Bias in GPT Generated English Text. 1862-1876 - Peitian Zhang, Zheng Liu, Shitao Xiao, Zhicheng Dou, Jing Yao:
Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval. 1877-1888 - Ján Cegin, Jakub Simko, Peter Brusilovsky:
ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness. 1889-1905 - Xing Wu, Guangyuan Ma, Wanhui Qian, Zijia Lin, Songlin Hu:
Query-as-context Pre-training for Dense Passage Retrieval. 1906-1916 - Andrea Burns, Krishna Srinivasan, Joshua Ainslie, Geoff Brown, Bryan A. Plummer, Kate Saenko, Jianmo Ni, Mandy Guo:
A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding. 1917-1947 - Zhaoyang Wang, Shaohan Huang, Yuxuan Liu, Jiahai Wang, Minghui Song, Zihan Zhang, Haizhen Huang
, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
Democratizing Reasoning Ability: Tailored Learning from Large Language Model. 1948-1966 - Shmuel Amar, Liat Schiff, Ori Ernst, Asi Shefer, Ori Shapira, Ido Dagan:
OpenAsp: A Benchmark for Multi-document Open Aspect-based Summarization. 1967-1991 - Sumit Agarwal, Aditya Srikanth Veerubhotla, Srijan Bansal:
PEFTDebias : Capturing debiasing information using PEFTs. 1992-2000 - Nathan Fradet
, Nicolas Gutowski
, Fabien Chhel, Jean-Pierre Briot:
Byte Pair Encoding for Symbolic Music. 2001-2020 - Alejo Lopez-Avila, Víctor Suárez-Paniagua:
Combining Denoising Autoencoders with Contrastive Learning to fine-tune Transformer Models. 2021-2032 - Megh Thakkar, Tolga Bolukbasi, Sriram Ganapathy, Shikhar Vashishth, Sarath Chandar, Partha Talukdar:
Self-Influence Guided Data Reweighting for Language Model Pre-training. 2033-2045 - Xinpeng Wang, Barbara Plank:
ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation. 2046-2052 - Zorik Gekhman, Jonathan Herzig, Roee Aharoni, Chen Elkind, Idan Szpektor:
TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models. 2053-2070 - Ramon Ruiz-Dolz
, Javier Sanchez:
VivesDebate-Speech: A Corpus of Spoken Argumentation to Leverage Audio Features for Argument Mining. 2071-2077 - Xianlong Luo, Meng Yang, Yihao Wang
:
Tagging-Assisted Generation Model with Encoder and Decoder Supervision for Aspect Sentiment Triplet Extraction. 2078-2093 - Namrata Shivagunde, Vladislav Lialin, Anna Rumshisky:
Larger Probes Tell a Different Story: Extending Psycholinguistic Datasets Via In-Context Learning. 2094-2107 - Momose Oyama, Sho Yokoi, Hidetoshi Shimodaira:
Norm of Word Embedding Encodes Information Gain. 2108-2130 - Zhehao Zhang, Xitao Li, Yan Gao, Jian-Guang Lou:
CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular Data. 2131-2153 - Yash Kumar Atri, Arun Iyer
, Tanmoy Chakraborty, Vikram Goyal:
Promoting Topic Coherence and Inter-Document Consorts in Multi-Document Summarization via Simplicial Complex and Sheaf Graph. 2154-2166 - Arkil Patel, Satwik Bhattamishra, Siva Reddy, Dzmitry Bahdanau:
MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations. 2167-2189 - Eric Zelikman, Wanjing Anya Ma, Jasmine E. Tran, Diyi Yang, Jason D. Yeatman, Nick Haber:
Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency. 2190-2205 - Megha Chakraborty, S. M. Towhidul Islam Tonmoy, S. M. Mehedi Zaman, Shreya Gautam, Tanay Kumar, Krish Sharma, Niyar R. Barman, Chandan Gupta, Vinija Jain, Aman Chadha, Amit P. Sheth
, Amitava Das:
Counter Turing Test (CT2): AI-Generated Text Detection is Not as Easy as You May Think - Introducing AI Detectability Index (ADI). 2206-2239 - Tiago Pimentel, Clara Meister, Ethan Wilcox, Kyle Mahowald, Ryan Cotterell:
Revisiting the Optimality of Word Lengths. 2240-2255 - Yichun Liu, Zizhong Zhu, Xiaowang Zhang, Zhiyong Feng, Daoqi Chen, Yaxin Li:
Document-level Relationship Extraction by Bidirectional Constraints of Beta Rules. 2256-2266 - Zilin Xiao
, Ming Gong, Jie Wu, Xingyao Zhang, Linjun Shou, Daxin Jiang:
Instructed Language Models with Retrievers Are Powerful Entity Linkers. 2267-2282 - Xiang Li, Jinglu Wang, Xiaohao Xu, Muqiao Yang, Fan Yang
, Yizhou Zhao, Rita Singh, Bhiksha Raj:
Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text. 2283-2296 - Ke Wang, Xiutian Zhao, Yanghui Li, Wei Peng:
PROSE: A Pronoun Omission Solution for Chinese-English Spoken Language Translation. 2297-2311 - Aniket Pramanick, Yufang Hou, Saif M. Mohammad, Iryna Gurevych:
A Diachronic Analysis of Paradigm Shifts in NLP Research: When, How, and Why? 2312-2326 - Boxi Cao, Qiaoyu Tang, Hongyu Lin, Xianpei Han, Le Sun:
Does the Correctness of Factual Knowledge Matter for Factual Knowledge-Enhanced Pre-trained Language Models? 2327-2340 - Jasper Jian, Siva Reddy:
Syntactic Substitutability as Unsupervised Dependency Syntax. 2341-2360 - Shuhui Wu, Yongliang Shen, Zeqi Tan, Wenqi Ren, Jietian Guo, Shiliang Pu, Weiming Lu:
MProto: Multi-Prototype Network with Denoised Optimal Transport for Distantly Supervised Named Entity Recognition. 2361-2374 - Siru Ouyang, Shuohang Wang, Yang Liu, Ming Zhong, Yizhu Jiao, Dan Iter, Reid Pryzant, Chenguang Zhu, Heng Ji, Jiawei Han:
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions. 2375-2393 - Gaurav Verma, Ryan A. Rossi, Christopher Tensmeyer, Jiuxiang Gu, Ani Nenkova:
Learning the Visualness of Text Using Large Vision-Language Models. 2394-2408 - Hannah Kirk, Andrew M. Bean, Bertie Vidgen, Paul Röttger, Scott Hale:
The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values. 2409-2430 - Vivek Gupta, Pranshu Kandoi, Mahek Bhavesh Vora, Shuo Zhang, Yujie He, Ridho Reinanda, Vivek Srikumar:
TempTabQA: Temporal Question Answering for Semi-Structured Tables. 2431-2453 - Chunhui Du, Jidong Tian, Haoran Liao, Jindou Chen, Hao He, Yaohui Jin:
Task-Level Thinking Steps Help Large Language Models for Challenging Classification Task. 2454-2470 - Fengji Zhang
, Bei Chen, Yue Zhang, Jacky Keung
, Jin Liu, Daoguang Zan, Yi Mao, Jian-Guang Lou, Weizhu Chen:
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation. 2471-2484 - Nikhil Anand, Joshua Tan, Maria Minakova:
Influence Scores at Scale for Efficient Language Data Sampling. 2485-2510 - Yang Liu, Dan Iter, Yichong Xu, Shuohang Wang, Ruochen Xu, Chenguang Zhu:
G-Eval: NLG Evaluation using Gpt-4 with Better Human Alignment. 2511-2522 - Qiushi Huang, Shuai Fu
, Xubo Liu, Wenwu Wang, Tom Ko, Yu Zhang, Lilian Tang
:
Learning Retrieval Augmentation for Personalized Dialogue Generation. 2523-2540 - Vipula Rawte, Swagata Chakraborty, Agnibh Pathak, Anubhav Sarkar, S. M. Towhidul Islam Tonmoy, Aman Chadha, Amit P. Sheth, Amitava Das:
The Troubling Emergence of Hallucination in Large Language Models - An Extensive Definition, Quantification, and Prescriptive Remediations. 2541-2573 - Livio Soares, Daniel Gillick, Jeremy R. Cole, Tom Kwiatkowski:
NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders. 2574-2589 - Apoorv Khandelwal, Ellie Pavlick, Chen Sun:
Analyzing Modular Approaches for Visual Question Decomposition. 2590-2603 - Zonghai Yao
, Benjamin J. Schloss, Sai P. Selvaraj:
Improving Summarization with Human Edits. 2604-2620 - Elias Stengel-Eskin, Benjamin Van Durme:
Did You Mean...? Confidence-based Trade-offs in Semantic Parsing. 2621-2629 - Chiyu Zhang, Khai Duy Doan, Qisheng Liao, Muhammad Abdul-Mageed:
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages. 2630-2662 - Gustavo Gonçalves, Emma Strubell:
Understanding the Effect of Model Compression on Social Bias in Large Language Models. 2663-2675 - Odhran O'Donoghue, Aleksandar Shtedritski, John Ginger, Ralph Abboud, Ali Essa Ghareeb, Samuel G. Rodriques:
BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology. 2676-2694 - Libo Qin, Qiguang Chen, Fuxuan Wei, Shijue Huang, Wanxiang Che:
Cross-lingual Prompting: Improving Zero-shot Chain-of-Thought Reasoning across Languages. 2695-2709 - Risto Luukkonen, Ville Komulainen, Jouni Luoma, Anni Eskelinen, Jenna Kanerva, Hanna-Mari Kupari
, Filip Ginter, Veronika Laippala, Niklas Muennighoff, Aleksandra Piktus, Thomas Wang, Nouamane Tazi, Teven Le Scao, Thomas Wolf, Osma Suominen, Samuli Sairanen, Mikko Merioksa, Jyrki Heinonen, Aija Vahtola, Samuel Antao, Sampo Pyysalo:
FinGPT: Large Generative Models for a Small Language. 2710-2726 - Yu Yang, Xiaotong Shen:
Boosting Summarization with Normalizing Flows and Aggressive Training. 2727-2751 - Shahbaz Syed, Dominik Schwabe, Khalid Al Khatib, Martin Potthast:
Indicative Summarization of Long Discussions. 2752-2788 - Jaewook Lee, Seongsik Park, Seong-Heum Park, Hongjin Kim, Harksoo Kim:
A Framework for Vision-Language Warm-up Tasks in Multimodal Dialogue Models. 2789-2799 - Yuanhang Yang, Shiyi Qi, Chuanyi Liu, Qifan Wang, Cuiyun Gao, Zenglin Xu:
Once is Enough: A Light-Weight Cross-Attention for Fast Sentence Pair Modeling. 2800-2806 - Tengxiao Liu, Qipeng Guo, Yuqing Yang, Xiangkun Hu, Yue Zhang, Xipeng Qiu, Zheng Zhang:
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts. 2807-2822 - Sha Li, Qiusi Zhan, Kathryn Conger, Martha Palmer, Heng Ji, Jiawei Han:
GLEN: General-Purpose Event Detection for Thousands of Types. 2823-2838 - Xiaochen Wang, Junyu Luo, Jiaqi Wang, Ziyi Yin, Suhan Cui, Yuan Zhong, Yaqing Wang, Fenglong Ma:
Hierarchical Pretraining on Multimodal Electronic Health Records. 2839-2852 - Mateusz Lango
, Ondrej Dusek:
Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation. 2853-2862 - Wenyu Guo, Qingkai Fang, Dong Yu, Yang Feng:
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation. 2863-2874 - Xinwei Wu
, Junzhuo Li, Minghui Xu, Weilong Dong, Shuangzhi Wu, Chao Bian, Deyi Xiong:
DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models. 2875-2886 - Manon Reusens, Philipp Borchert
, Margot Mieskes, Jochen De Weerdt, Bart Baesens:
Investigating Bias in Multilingual Language Models: Cross-Lingual Transfer of Debiasing Techniques. 2887-2896 - Dayoon Ko, Sangho Lee, Gunhee Kim:
Can Language Models Laugh at YouTube Short-form Videos? 2897-2916 - Jiaang Li, Quan Wang, Yi Liu, Licheng Zhang, Zhendong Mao:
Random Entity Quantization for Parameter-Efficient Compositional Knowledge Graph Representation. 2917-2928 - Zhongjian Miao, Wen Zhang, Jinsong Su, Xiang Li, Jian Luan, Yidong Chen, Bin Wang, Min Zhang:
Exploring All-In-One Knowledge Distillation Framework for Neural Machine Translation. 2929-2940 - David Wan, Shiyue Zhang, Mohit Bansal:
HistAlign: Improving Context Dependency in Language Generation by Aligning with History. 2941-2960 - Aitor Ormazabal, Mikel Artetxe, Eneko Agirre:
CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models. 2961-2974 - Harman Singh, Poorva Garg, Mohit Gupta, Kevin Shah, Ashish Goswami, Satyam Modi, Arnab Kumar Mondal, Dinesh Khandelwal, Dinesh Garg, Parag Singla:
Image Manipulation via Multi-Hop Instructions - A New Dataset and Weakly-Supervised Neuro-Symbolic Approach. 2975-3007 - Robin Algayres, Yossi Adi, Tu Anh Nguyen, Jade Copet, Gabriel Synnaeve, Benoît Sagot, Emmanuel Dupoux:
Generative Spoken Language Model based on continuous word-sized audio tokens. 3008-3028 - Ning Ding, Yulin Chen, Bokai Xu, Yujia Qin, Shengding Hu, Zhiyuan Liu, Maosong Sun, Bowen Zhou:
Enhancing Chat Language Models by Scaling High-quality Instructional Conversations. 3029-3051 - Emanuele Bugliarello, Aida Nematzadeh, Lisa Anne Hendricks:
Weakly-Supervised Learning of Visual Relations in Multimodal Pretraining. 3052-3071 - Hannan Cao, Liping Yuan, Yuchen Zhang, Hwee Tou Ng:
Unsupervised Grammatical Error Correction Rivaling Supervised Methods. 3072-3088 - Yuze Lou, Bailey Kuehl, Erin Bransom, Sergey Feldman, Aakanksha Naik, Doug Downey:
S2abEL: A Dataset for Entity Linking from Scientific Tables. 3089-3101 - Minghao Li, Yingxiu Zhao, Bowen Yu, Feifan Song, Hangyu Li, Haiyang Yu, Zhoujun Li, Fei Huang, Yongbin Li:
API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs. 3102-3116 - Daniela Teodorescu
, Tiffany Cheng
, Alona Fyshe, Saif M. Mohammad:
Language and Mental Health: Measures of Emotion Dynamics from Text as Linguistic Biosocial Markers. 3117-3133 - Yuxin Jiang, Chunkit Chan, Mingyang Chen, Wei Wang:
Lion: Adversarial Distillation of Proprietary Large Language Models. 3134-3154 - Jiao Sun, Yufei Tian, Wangchunshu Zhou, Nan Xu, Qian Hu, Rahul Gupta, John Frederick Wieting, Nanyun Peng, Xuezhe Ma:
Evaluating Large Language Models on Controlled Generation Tasks. 3155-3168 - Xiaoyu Guo
, Yuan-Fang Li, Gholamreza Haffari:
DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding. 3169-3180 - Adam Bouyamourn:
Why LLMs Hallucinate, and How to Get (Evidential) Closure: Perceptual, Intensional, and Extensional Learning for Faithful Natural Language Generation. 3181-3193 - Benjamin Newman, Luca Soldaini, Raymond Fok, Arman Cohan, Kyle Lo:
A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents. 3194-3212 - Bingzhi Li, Lucia Donatelli, Alexander Koller, Tal Linzen, Yuekun Yao, Najoung Kim:
SLOG: A Structural Generalization Benchmark for Semantic Parsing. 3213-3232 - Shikhar Murty, Pratyusha Sharma, Jacob Andreas, Christopher D. Manning:
Pushdown Layers: Encoding Recursive Structure in Transformer Language Models. 3233-3247 - Basel Mousi, Nadir Durrani, Fahim Dalvi:
Can LLMs Facilitate Interpretation of Pre-trained Language Models? 3248-3268 - Su Ah Lee, Seokjin Oh, Woohwan Jung:
Enhancing Low-resource Fine-grained Named Entity Recognition by Leveraging Coarse-grained Datasets. 3269-3279 - Zhengxuan Wu, Alex Tamkin, Isabel Papadimitriou:
Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies. 3280-3289 - Yi Bin, Mengqun Han, Wenhao Shi, Lei Wang, Yang Yang, See-Kiong Ng, Heng Tao Shen:
Non-Autoregressive Math Word Problem Solver with Unified Tree Structure. 3290-3301 - Peng Bai, Yue Zhou
, Meizhen Zheng, Wujin Sun, Xiaodong Shi:
Improving Chinese Pop Song and Hokkien Gezi Opera Singing Voice Synthesis by Enhancing Local Modeling. 3302-3312 - Navita Goyal, Eleftheria Briakou, Amanda Liu, Connor Baumler, Claire Bonial, Jeffrey Micher, Clare R. Voss, Marine Carpuat, Hal Daumé III:
What Else Do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems. 3313-3330 - Aditya K. Surikuchi, Sandro Pezzelle
, Raquel Fernández:
GROOViST: A Metric for Grounding Objects in Visual Storytelling. 3331-3339 - Yuji Zhang, Jing Li
, Wenjie Li:
VIBE: Topic-Driven Temporal Adaptation for Twitter Classification. 3340-3354 - Sungryull Sohn, Yiwei Lyu, Anthony Z. Liu, Lajanugen Logeswaran, Dong-Ki Kim, Dongsub Shim, Honglak Lee:
TOD-Flow: Modeling the Structure of Task-Oriented Dialogues. 3355-3371 - Changzai Pan, Feiyue Li, Ke Deng:
TopWORDS-Poetry: Simultaneous Text Segmentation and Word Discovery for Classical Chinese Poetry via Bayesian Inference. 3372-3386 - Yunzhi Yao, Peng Wang, Shengyu Mao, Chuanqi Tan, Fei Huang, Huajun Chen, Ningyu Zhang:
Knowledge Rumination for Pre-trained Language Models. 3387-3404 - Linjuan Wu
, Weiming Lu:
Struct-XLM: A Structure Discovery Multilingual Language Model for Enhancing Cross-lingual Transfer through Reinforcement Learning. 3405-3419 - Yongxin Huang, Kexin Wang, Sourav Dutta, Raj Nath Patel, Goran Glavas, Iryna Gurevych:
AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification. 3420-3434 - Xibo Li, Bowei Zou, Yifan Fan, Yanling Li, Ai Ti Aw, Yu Hong:
Interview Evaluation: A Novel Approach for Automatic Evaluation of Conversational Question Answering Models. 3435-3446 - Rodrigo Wilkens, Alice Pintard, David Alfter, Vincent Folny
, Thomas François:
TCFLE-8: a Corpus of Learner Written Productions for French as a Foreign Language and its Application to Automated Essay Scoring. 3447-3465 - David Heineman, Yao Dou, Mounica Maddela, Wei Xu
:
Dancing Between Success and Failure: Edit-level Simplification Evaluation using SALSA. 3466-3495 - Silvia Casola, Soda Marem Lo, Valerio Basile, Simona Frenda, Alessandra Teresa Cignarella, Viviana Patti, Cristina Bosco:
Confidence-based Ensembling of Perspective-aware Models. 3496-3507 - Xinpeng Wang, Xiaoyuan Yi, Han Jiang, Shanlin Zhou, Zhihua Wei, Xing Xie:
ToViLaG: Your Visual-Language Generative Model is Also An Evildoer. 3508-3533 - Zhen Wan, Fei Cheng
, Zhuoyuan Mao, Qianying Liu, Haiyue Song, Jiwei Li, Sadao Kurohashi:
GPT-RE: In-context Learning for Relation Extraction using Large Language Models. 3534-3547 - Sky CH-Wang, Arkadiy Saakyan, Oliver Li, Zhou Yu, Smaranda Muresan:
Sociocultural Norm Similarities and Differences via Situational Alignment and Explainable Textual Entailment. 3548-3564 - Chuyue Zhou, Wangjie You, Juntao Li, Jing Ye, Kehai Chen, Min Zhang:
INFORM : Information eNtropy based multi-step reasoning FOR large language Models. 3565-3576 - Jiamin Li
, Qiang Su
, Yitao Yang, Yimin Jiang, Cong Wang, Hong Xu:
Adaptive Gating in Mixture-of-Experts based Language Models. 3577-3587 - Maria R. Valentini, Jennifer Weber, Jesus Salcido, Téa Wright, Eliana Colunga, Katharina von der Wense:
On the Automatic Generation and Simplification of Children's Stories. 3588-3598 - Kangda Wei, Dawn J. Lawrie, Benjamin Van Durme, Yunmo Chen, Orion Weller:
When Do Decompositions Help for Machine Reading? 3599-3606 - Aviv Slobodkin, Omer Goldman, Avi Caciularu, Ido Dagan, Shauli Ravfogel:
The Curious Case of Hallucinatory (Un)answerability: Finding Truths in the Hidden States of Over-Confident Large Language Models. 3607-3625 - Alexander Spangher, Nanyun Peng, Emilio Ferrara, Jonathan May
:
Identifying Informational Sources in News Articles. 3626-3639 - Sapan Shah, Sreedhar Reddy, Pushpak Bhattacharyya:
Retrofitting Light-weight Language Models for Emotions using Supervised Contrastive Learning. 3640-3654 - Junhan Yang, Zheng Liu, Chaozhuo Li, Guangzhong Sun, Xing Xie:
Longtriever: a Pre-trained Long Text Encoder for Dense Document Retrieval. 3655-3665 - Yiyang Liu, Jinpeng Li, Enwei Zhu:
Revisiting De-Identification of Electronic Medical Records: Evaluation of Within- and Cross-Hospital Generalization. 3666-3674 - Gurusha Juneja, Subhabrata Dutta, Soumen Chakrabarti, Sunny Manchanda, Tanmoy Chakraborty:
Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning. 3675-3691 - Shaoyang Xu, Junzhuo Li, Deyi Xiong:
Language Representation Projection: Can We Transfer Factual Knowledge across Languages in Multilingual Language Models? 3692-3702 - James A. Michaelov
, Catherine Arnett, Tyler A. Chang, Ben Bergen:
Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models. 3703-3720 - Jinhao Jiang, Kun Zhou, Wayne Xin Zhao, Yaliang Li, Ji-Rong Wen:
ReasoningLM: Enabling Structural Subgraph Reasoning in Pre-trained Language Models for Question Answering over Knowledge Graph. 3721-3735 - Felipe Urrutia, Cristian Buc Calderon, Valentin Barrière:
Deep Natural Language Feature Learning for Interpretable Prediction. 3736-3763 - David Esiobu, Xiaoqing Ellen Tan, Saghar Hosseini, Megan Ung, Yuchen Zhang, Jude Fernandes, Jane Dwivedi-Yu, Eleonora Presani, Adina Williams, Eric Michael Smith:
ROBBIE: Robust Bias Evaluation of Large Generative Language Models. 3764-3814 - Atsumoto Ohashi, Ryuichiro Higashinaka:
Enhancing Task-oriented Dialogue Systems with Generative Post-processing Networks. 3815-3828 - Alexis Chevalier, Alexander Wettig, Anirudh Ajith, Danqi Chen:
Adapting Language Models to Compress Contexts. 3829-3846 - Yichao Zhou, James B. Wendt, Navneet Potti, Jing Xie, Sandeep Tata:
Selective Labeling: How to Radically Lower Data-Labeling Costs for Document Extraction Models. 3847-3860 - Yue Chen, Dingnan Jin, Chen Huang, Jia Liu, Wenqiang Lei:
TRAVEL: Tag-Aware Conversational FAQ Retrieval via Reinforcement Learning. 3861-3872 - Hyundong Cho, Andrea Madotto, Zhaojiang Lin, Khyathi Raghavi Chandu, Satwik Kottur, Jing Xu, Jonathan May
, Chinnadhurai Sankar:
Continual Dialogue State Tracking via Example-Guided Question Answering. 3873-3886 - Shubham Mittal, Megha Sundriyal, Preslav Nakov:
Lost in Translation, Found in Spans: Identifying Claims in Multilingual Social Media. 3887-3902 - Jongin Kim, Byeo Bak, Aditya Agrawal, Jiaxi Wu
, Veronika J. Wirtz, Traci Hong, Derry Wijaya:
COVID-19 Vaccine Misinformation in Middle Income Countries. 3903-3915 - Junlei Zhang, Zhenzhong Lan, Junxian He:
Contrastive Learning of Sentence Embeddings from Scratch. 3916-3932 - Sandra Sandoval, Jieyu Zhao, Marine Carpuat, Hal Daumé III:
A Rose by Any Other Name would not Smell as Sweet: Social Bias in Names Mistranslation. 3933-3945 - Jason Phang, Yao Zhao, Peter J. Liu:
Investigating Efficiently Extending Transformers for Long Input Summarization. 3946-3961 - Zishan Guo, Linhao Yu, Minghui Xu, Renren Jin, Deyi Xiong:
CS2W: A Chinese Spoken-to-Written Style Conversion Dataset with Multiple Conversion Types. 3962-3979 - Alan Ansell, Marinela Parovic, Ivan Vulic, Anna Korhonen, Edoardo M. Ponti:
Unifying Cross-Lingual Transfer across Scenarios of Resource Scarcity. 3980-3995 - Giuseppe Attanasio, Flor Miriam Plaza del Arco
, Debora Nozza, Anne Lauscher
:
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation. 3996-4014 - Weifeng Jiang, Qianren Mao, Chenghua Lin, Jianxin Li, Ting Deng
, Weiyi Yang
, Zheng Wang:
DisCo: Distilled Student Models Co-training for Semi-supervised Text Mining. 4015-4030 - Da Yin, Xiao Liu, Fan Yin, Ming Zhong
, Hritik Bansal, Jiawei Han, Kai-Wei Chang:
Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation. 4031-4047 - Haoyu Wang
, Hongming Zhang, Yueguan Wang, Yuqian Deng, Muhao Chen, Dan Roth:
Are All Steps Equally Important? Benchmarking Essentiality Detection in Event Processes. 4048-4056 - Zui Chen, Jiaqi Han, Chaofan Yang, Yi Zhou:
Language Model is Suitable for Correction of Handwritten Mathematical Expressions Recognition. 4057-4068 - Gretel Liz De la Peña Sarracén, Paolo Rosso, Robert Litschko, Goran Glavas, Simone Paolo Ponzetto
:
Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection. 4069-4085 - Junfeng Jiang, Chengzhang Dong, Sadao Kurohashi, Akiko Aizawa:
SuperDialseg: A Large-scale Dataset for Supervised Dialogue Segmentation. 4086-4101 - Yang Bai, Wenqian Zhao, Shuo Yin, Zixiao Wang, Bei Yu:
ATFormer: A Learned Performance Model with Transfer Learning Across Devices for Deep Learning Tensor Programs. 4102-4116 - Keighley Overbay, Jaewoo Ahn, Fatemeh Pesaran Zadeh, Joonsuk Park, Gunhee Kim:
mRedditSum: A Multimodal Abstractive Summarization Dataset of Reddit Threads with Images. 4117-4132 - Ning Ding, Xingtai Lv, Qiaosen Wang, Yulin Chen, Bowen Zhou, Zhiyuan Liu, Maosong Sun:
Sparse Low-rank Adaptation of Pre-trained Language Models. 4133-4145 - Shachar Don-Yehiya, Leshem Choshen, Omri Abend:
Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney. 4146-4161 - Anastasiia Sedova, Benjamin Roth:
ULF: Unsupervised Labeling Function Correction using Cross-Validation for Weak Supervision. 4162-4176 - Jingyuan Qi, Zhiyang Xu, Ying Shen, Minqian Liu, Di Jin, Qifan Wang, Lifu Huang:
The Art of SOCRATIC QUESTIONING: Recursive Thinking with Large Language Models. 4177-4199 - Songtao Liu, Ziling Luo, Minghua Xu, Lixiao Wei, Ziyao Wei, Han Yu, Wei Xiang, Bang Wang:
Ideology Takes Multiple Looks: A High-Quality Dataset for Multifaceted Ideology Detection. 4200-4213 - Pierre Colombo, Victor Pellegrain, Malik Boudiaf, Myriam Tami, Victor Storchan, Ismail Ben Ayed, Pablo Piantanida:
Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models. 4214-4231 - Kabir Ahuja, Harshita Diddee, Rishav Hada, Millicent Ochieng, Krithika Ramesh, Prachi Jain, Akshay Uttama Nambi, Tanuja Ganu, Sameer Segal, Mohamed Ahmed, Kalika Bali, Sunayana Sitaram:
MEGA: Multilingual Evaluation of Generative AI. 4232-4267 - Xin Yuan, Jie Guo, Weidong Qiu, Zheng Huang, Shujun Li:
Support or Refute: Analyzing the Stance of Evidence to Detect Out-of-Context Mis- and Disinformation. 4268-4280 - Yihang Li, Shuichiro Shimizu, Chenhui Chu, Sadao Kurohashi, Wei Li:
Video-Helpful Multimodal Machine Translation. 4281-4299 - Dohwan Ko, Ji Soo Lee, Woo-Young Kang, Byungseok Roh, Hyunwoo Kim:
Large Language Models are Temporal and Causal Reasoners for Video Question Answering. 4300-4316 - Alsu Sagirova, Mikhail Burtsev:
Uncertainty Guided Global Memory Improves Multi-Hop Question Answering. 4317-4328 - Yuanyuan Liang, Jianing Wang, Hanlun Zhu, Lei Wang, Weining Qian, Yunshi Lan:
Prompting Large Language Models with Chain-of-Thought for Few-Shot Knowledge Base Question Generation. 4329-4343 - Jinchuan Zhang, Yan Zhou, Binyuan Hui, Yaxin Liu, Ziming Li, Songlin Hu:
TrojanSQL: SQL Injection against Natural Language Interface to Database. 4344-4359 - Aly M. Kassem, Omar Mahmoud, Sherif Saad:
Preserving Privacy Through Dememorization: An Unlearning Technique For Mitigating Memorization Risks In Language Models. 4360-4379 - You-Jun Chen, Hsin-Yi Hsieh, Yu Lin, Yingtao Tian, Bert Chan, Yu-Sin Liu, Yi-Hsuan Lin, Richard Tzong-Han Tsai:
MingOfficial: A Ming Official Career Dataset and a Historical Context-Aware Representation Learning Framework. 4380-4401 - Seongho Joo, Hyukhun Koh, Kyomin Jung:
DPP-TTS: Diversifying prosodic features of speech via determinantal point processes. 4402-4417 - Nathan Hu, Eric Mitchell, Christopher D. Manning, Chelsea Finn:
Meta-Learning Online Adaptation of Language Models. 4418-4432 - Chak Tou Leong, Yi Cheng, Jiashuo Wang, Jian Wang, Wenjie Li:
Self-Detoxifying Language Models via Toxification Reversal. 4433-4449 - Felix Faltings, Michel Galley, Kianté Brantley, Baolin Peng, Weixin Cai, Yizhe Zhang, Jianfeng Gao, Bill Dolan:
Interactive Text Generation. 4450-4468 - Md. Sultan:
Knowledge Distillation \approx Label Smoothing: Fact or Fallacy? 4469-4477 - Lisa Beinborn, Yuval Pinter
:
Analyzing Cognitive Plausibility of Subword Tokenization. 4478-4486 - Chenkai Ma, Xinya Du:
POE: Process of Elimination for Multiple Choice Reasoning. 4487-4496 - Ishaan Singh, Navdeep Kaur, Garima Gaur, Mausam:
NeuSTIP: A Neuro-Symbolic Model for Link and Time Prediction in Temporal Knowledge Graphs. 4497-4516 - Gopendra Vikram Singh, Soumitra Ghosh, Atul Verma, Chetna Painkra, Asif Ekbal:
Standardizing Distress Analysis: Emotion-Driven Distress Identification and Cause Extraction (DICE) in Multimodal Online Posts. 4517-4532 - Linyi Yang, Yaoxian Song, Xuan Ren, Chenyang Lyu, Yidong Wang, Jingming Zhuo, Lingqiao Liu, Jindong Wang, Jennifer Foster, Yue Zhang:
Out-of-Distribution Generalization in Natural Language Processing: Past, Present, and Future. 4533-4559 - Hongyi Zheng, Abulhair Saparov:
Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis. 4560-4568 - Noah Lee, Na An, James Thorne:
Can Large Language Models Capture Dissenting Human Voices? 4569-4585 - Ratish Puduppully, Anoop Kunchukuttan, Raj Dabre, Ai Ti Aw, Nancy Chen:
DecoMT: Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models. 4586-4602 - Hao Zhao, Jie Fu, Zhaofeng He:
Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning. 4603-4615 - Ruotian Ma, Xiaolei Wang, Xin Zhou, Qi Zhang, Xuanjing Huang:
Towards Building More Robust NER datasets: An Empirical Study on NER Dataset Bias from a Dataset Difficulty View. 4616-4630 - Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen
, Hinrich Schütze:
GradSim: Gradient-Based Language Grouping for Effective Multilingual Training. 4631-4646 - Hiroaki Yamagiwa, Momose Oyama, Hidetoshi Shimodaira:
Discovering Universal Geometry in Embeddings with ICA. 4647-4675 - Mikael Brunila, Jack LaViolette, Sky CH-Wang, Priyanka Verma, Clara Féré, Grant McKenzie
:
Toward a Critical Toponymy Framework for Named Entity Recognition: A Case Study of Airbnb in New York City. 4676-4695 - Lang Qin
, Yao Zhang, Hongru Liang, Jun Wang, Zhenglu Yang:
Well Begun is Half Done: Generator-agnostic Knowledge Pre-Selection for Knowledge-Grounded Dialogue. 4696-4709 - Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang:
Merging Generated and Retrieved Knowledge for Open-Domain QA. 4710-4728 - Nithish Kannen, Udit Sharma, Sumit Neelam, Dinesh Khandelwal, Shajith Ikbal, Hima Karanam, L. Venkata Subramaniam:
Best of Both Worlds: Towards Improving Temporal Knowledge Base Question Answering via Targeted Fact Extraction. 4729-4744 - Nishant Balepur, Jie Huang, Kevin Chen-Chuan Chang:
Text Fact Transfer. 4745-4764 - Jiaao Chen, Aston Zhang, Mu Li, Alex Smola, Diyi Yang:
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise. 4765-4775 - Gavin Abercrombie, Amanda Cercas Curry, Tanvi Dinkar, Verena Rieser, Zeerak Talat:
Mirages. On Anthropomorphism in Dialogue Systems. 4776-4790 - Kevin Liu, Stephen Casper, Dylan Hadfield-Menell, Jacob Andreas:
Cognitive Dissonance: Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness? 4791-4797 - Seonmin Koo, Chanjun Park, Jinsung Kim, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim:
KEBAP: Korean Error Explainable Benchmark Dataset for ASR and Post-processing. 4798-4815 - Libo Zhao, Kai Fan, Wei Luo, Jing Wu, Shushu Wang, Ziqian Zeng, Zhongqiang Huang:
Adaptive Policy with Wait-k Model for Simultaneous Translation. 4816-4832 - Xinyu Chen, Sheng Xu, Peifeng Li, Qiaoming Zhu:
Cross-Document Event Coreference Resolution on Discourse Structure. 4833-4843 - Yoonna Jang, Suhyune Son, Jeongwoo Lee, Junyoung Son, Yuna Hur, Jungwoo Lim, Hyeonseok Moon, Kisu Yang, Heuiseok Lim:
Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations. 4844-4861 - Ce Zheng, Lei Li, Qingxiu Dong, Yuxuan Fan, Zhiyong Wu, Jingjing Xu, Baobao Chang:
Can We Edit Factual Knowledge by In-Context Learning? 4862-4876 - Siqi Liu, Weixi Feng, Tsu-Jui Fu, Wenhu Chen, William Wang:
EDIS: Entity-Driven Image Search over Multimodal Web Content. 4877-4894 - Joshua Ainslie, James Lee-Thorp, Michiel de Jong, Yury Zemlyanskiy, Federico Lebrón, Sumit Sanghai:
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints. 4895-4901 - Yifan Hou, Jiaoda Li, Yu Fei, Alessandro Stolfo, Wangchunshu Zhou, Guangtao Zeng, Antoine Bosselut
, Mrinmaya Sachan:
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models. 4902-4919 - Yiming Zhang, Sravani Nanduri, Liwei Jiang, Tongshuang Wu, Maarten Sap:
BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases. 4920-4932 - Amita Kamath, Jack Hessel, Kai-Wei Chang:
Text encoders bottleneck compositionality in contrastive vision-language models. 4933-4944 - Sander Schulhoff, Jeremy Pinto, Anaum Khan, Louis-François Bouchard, Chenglei Si, Svetlina Anati, Valen Tagliabue, Anson Liu Kost, Christopher Carnahan, Jordan L. Boyd-Graber:
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition. 4945-4977 - Shangjie Li, Xiangpeng Wei, Shaolin Zhu, Jun Xie, Baosong Yang, Deyi Xiong:
MMNMT: Modularizing Multilingual Neural Machine Translation with Flexibly Assembled MoE and Dense Blocks. 4978-4990 - Te-Lin Wu, Yu Zhou, Nanyun Peng:
Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge. 4991-5006 - Stephen Bothwell
, Justin DeBenedetto, Theresa Crnkovich, Hildegund Müller, David Chiang:
Introducing Rhetorical Parallelism Detection: A New Task with Datasets, Metrics, and Baselines. 5007-5039 - Jennifer Hu, Roger Levy:
Prompting is not a substitute for probability measurements in large language models. 5040-5060 - Josip Jukic, Jan Snajder:
Parameter-Efficient Language Model Tuning with Active Learning in Low-Resource Settings. 5061-5074 - Alon Jacovi, Avi Caciularu, Omer Goldman, Yoav Goldberg:
Stop Uploading Test Data in Plain Text: Practical Strategies for Mitigating Data Contamination by Evaluation Benchmarks. 5075-5084 - Joshua Ainslie, Tao Lei, Michiel de Jong, Santiago Ontañón, Siddhartha Brahma, Yury Zemlyanskiy, David C. Uthus, Mandy Guo, James Lee-Thorp, Yi Tay, Yun-Hsuan Sung, Sumit Sanghai:
CoLT5: Faster Long-Range Transformers with Conditional Computation. 5085-5100 - Praveen Venkateswaran, Evelyn Duesterwald, Vatche Isahagian:
DiSTRICT: Dialogue State Tracking with Retriever Driven In-Context Tuning. 5101-5112 - Winston Wu, Lu Wang, Rada Mihalcea:
Cross-Cultural Analysis of Human Values, Morals, and Biases in Folk Tales. 5113-5125 - Ruiqi Zhong, Charlie Snell, Dan Klein, Jason Eisner:
Non-Programmers Can Label Programs Indirectly via Active Examples: A Case Study with Text-to-SQL. 5126-5152 - Theo Olausson, Alex Gu, Benjamin Lipkin
, Cedegao E. Zhang, Armando Solar-Lezama, Joshua B. Tenenbaum, Roger Levy:
LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers. 5153-5176 - Zhengrui Ma, Shaolei Zhang, Shoutao Guo, Chenze Shao, Min Zhang, Yang Feng:
Non-autoregressive Streaming Transformer for Simultaneous Translation. 5177-5190 - Nam Nguyen, Thang Phan, Duc-Vu Nguyen, Kiet Van Nguyen:
ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing. 5191-5207 - Shiao Meng, Xuming Hu, Aiwei Liu, Shuang Li, Fukun Ma, Yawen Yang, Lijie Wen:
RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction. 5208-5226 - Zekun Li, Wenxuan Zhou, Yao-Yi Chiang
, Muhao Chen:
GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding. 5227-5240 - Danis Alukaev, Semen Kiselev, Ilya Pershin, Bulat Ibragimov, Vladimir Ivanov, Alexey Kornaev, Ivan Titov:
Cross-Modal Conceptualization in Bottleneck Models. 5241-5253 - Zhiqiang Hu, Lei Wang, Yihuai Lan, Wanyu Xu, Ee-Peng Lim
, Lidong Bing, Xing Xu, Soujanya Poria, Roy Ka-Wei Lee:
LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models. 5254-5276 - Florian Ruosch, Cristina Sarasua, Abraham Bernstein:
DREAM: Deployment of Recombination and Ensembles in Argument Mining. 5277-5290 - Debtanu Datta
, Shubham Soni, Rajdeep Mukherjee, Saptarshi Ghosh:
MILDSum: A Novel Benchmark Dataset for Multilingual Summarization of Indian Legal Case Judgments. 5291-5302 - Xinbei Ma, Yeyun Gong, Pengcheng He, Hai Zhao, Nan Duan:
Query Rewriting in Retrieval-Augmented Large Language Models. 5303-5315 - Gaurav Sahu, Olga Vechtomova, Dzmitry Bahdanau, Issam H. Laradji:
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation. 5316-5327 - Aviya Maimon, Reut Tsarfaty:
COHESENTIA: A Novel Benchmark of Incremental versus Holistic Assessment of Coherence in Generated Texts. 5328-5343 - Yating Wu
, Ritika Mangla, Greg Durrett, Junyi Jessy Li:
QUDeval: The Evaluation of Questions Under Discussion Discourse Parsing. 5344-5363 - Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao:
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter. 5364-5375 - Chang-Yu Tai, Ziru Chen, Tianshu Zhang, Xiang Deng, Huan Sun:
Exploring Chain of Thought Style Prompting for Text-to-SQL. 5376-5393 - Alexandra Butoi, Tim Vieira, Ryan Cotterell, David Chiang:
Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages. 5394-5416 - Yufei Tian, Felix Zhang, Nanyun Peng:
Harnessing Black-Box Control to Boost Commonsense in LM's Generation. 5417-5432 - Katherine Tian, Eric Mitchell, Allan Zhou, Archit Sharma, Rafael Rafailov, Huaxiu Yao, Chelsea Finn, Christopher D. Manning:
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback. 5433-5442 - Zhao Yang, Yuanzhe Zhang, Dianbo Sui, Cao Liu, Jun Zhao, Kang Liu:
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process. 5443-5456 - Lovisa Hagström, Denitsa Saynova, Tobias Norlund, Moa Johansson, Richard Johansson:
The Effect of Scaling, Retrieval Augmentation and Form on the Factual Consistency of Language Models. 5457-5476 - Hassan Shahmohammadi, Adhiraj Ghosh, Hendrik P. A. Lensch:
ViPE: Visualise Pretty-much Everything. 5477-5494 - Junpeng Li, Zixia Jia, Zilong Zheng:
Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models. 5495-5505 - Kaitlyn Zhou, Dan Jurafsky, Tatsunori Hashimoto:
Navigating the Grey Area: How Expressions of Uncertainty and Overconfidence Affect Language Models. 5506-5524 - Yating Wu
, William Sheffield, Kyle Mahowald, Junyi Jessy Li:
Elaborative Simplification as Implicit Questions Under Discussion. 5525-5537 - Dhruv Mehra, Lingjue Xie, Ella Hofmann-Coyle, Mayank Kulkarni, Daniel Preotiuc-Pietro:
EntSUMv2: Dataset, Models and Evaluation for More Abstractive Entity-Centric Summarization. 5538-5547 - Amanpreet Singh, Mike D'Arcy, Arman Cohan, Doug Downey, Sergey Feldman:
SciRepEval: A Multi-Format Benchmark for Scientific Document Representations. 5548-5566 - Shehzaad Dhuliawala, Vilém Zouhar, Mennatallah El-Assady, Mrinmaya Sachan:
A Diachronic Perspective on User Trust in AI under Uncertainty. 5567-5580 - Minxuan Lv, Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu:
CT-GAT: Cross-Task Generative Adversarial Attack based on Transferability. 5581-5591 - Hai Yu, Chong Deng, Qinglin Zhang, Jiaqing Liu, Qian Chen, Wen Wang:
Improving Long Document Topic Segmentation Models With Enhanced Coherence Modeling. 5592-5605 - Hyungjoo Chae, Yongho Song, Kai Tzu-iunn Ong, Taeyoon Kwon, Minjin Kim, Youngjae Yu, Dongha Lee, Dongyeop Kang, Jinyoung Yeo:
Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents. 5606-5632 - Mario Giulianelli, Sarenne Wallbridge, Raquel Fernández:
Information Value: Measuring Utterance Predictability as Distance from Plausible Alternatives. 5633-5653 - Xin Miao, Yongqi Li, Tieyun Qian:
Generating Commonsense Counterfactuals for Stable Relation Extraction. 5654-5668 - Ameet Deshpande, Carlos E. Jimenez, Howard Chen, Vishvak Murahari, Victoria Graf, Tanmay Rajpurohit, Ashwin Kalyan, Danqi Chen, Karthik Narasimhan:
C-STS: Conditional Semantic Textual Similarity. 5669-5690 - Seraphina Goldfarb-Tarrant, Björn Ross, Adam Lopez:
Cross-lingual Transfer Can Worsen Bias in Sentiment Analysis. 5691-5704 - Chang Yang, Peng Zhang, Wenbo Qiao, Hui Gao, Jiaming Zhao:
Rumor Detection on Social Media with Crowd Intelligence and ChatGPT-Assisted Networks. 5705-5717 - Yichi Zhang
, Jiayi Pan, Yuchen Zhou, Rui Pan, Joyce Chai:
Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans? 5718-5728 - Freddy Heppell
, Kalina Bontcheva, Carolina Scarton:
Analysing State-Backed Propaganda Websites: a New Dataset and Linguistic Study. 5729-5741 - Tiantian Zhu, Yang Qin, Qingcai Chen, Xin Mu, Changlong Yu, Yang Xiang:
Controllable Contrastive Generation for Multilingual Biomedical Entity Linking. 5742-5753 - Truong Do, Le Khiem, Quang Pham, TrungTin Nguyen, Thanh-Nam Doan, Binh Nguyen, Chenghao Liu, Savitha Ramasamy, Xiaoli Li, Steven C. H. Hoi:
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts. 5754-5765 - Boning Zhang, Yang Yang:
MediaHG: Rethinking Eye-catchy Features in Social Media Headline Generation. 5766-5777 - Silei Xu, Shicheng Liu, Theo Culhane, Elizaveta Pertseva, Meng-Hsi Wu, Sina J. Semnani, Monica S. Lam:
Fine-tuned LLMs Know More, Hallucinate Less with Few-Shot Sequence-to-Sequence Semantic Parsing over Wikidata. 5778-5791 - Dheeraj Mekala, Jason Andrew Wolfe, Subhro Roy:
ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language Models. 5792-5799 - Andrey Bout, Alexander Podolskiy, Sergey I. Nikolenko, Irina Piontkovskaya:
Efficient Grammatical Error Correction Via Multi-Task Training and Optimized Training Schedule. 5800-5816 - Xinyi Chen, Raquel Fernández, Sandro Pezzelle
:
The BLA Benchmark: Investigating Basic Language Abilities of Pre-Trained Multimodal Models. 5817-5830 - Maxime Darrin, Pablo Piantanida, Pierre Colombo:
RainProof: An Umbrella to Shield Text Generator from Out-Of-Distribution Data. 5831-5857 - Ningchen Ma, Dong Wang, Hongyun Bao, Lei He, Suncong Zheng:
KEPL: Knowledge Enhanced Prompt Learning for Chinese Hypernym-Hyponym Extraction. 5858-5867 - Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Chong Deng, Hai Yu, Jiaqing Liu, Yukun Ma, Chong Zhang:
Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings. 5868-5875 - Ji Qi, Chuchun Zhang, Xiaozhi Wang, Kaisheng Zeng, Jifan Yu, Jinxin Liu, Lei Hou, Juanzi Li, Xu Bin:
Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction. 5876-5890 - Lucie-Aimée Kaffee, Arnav Arora
, Isabelle Augenstein
:
Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions. 5891-5909 - Sangmin Bae, Jongwoo Ko, Hwanjun Song, Se-Young Yun:
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding. 5910-5924 - Libo Qin, Wenbo Pan, Qiguang Chen, Lizi Liao
, Zhou Yu, Yue Zhang, Wanxiang Che, Min Li:
End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions. 5925-5941 - Ori Yoran, Tomer Wolfson, Ben Bogin, Uri Katz
, Daniel Deutch, Jonathan Berant:
Answering Questions by Meta-Reasoning over Multiple Chains of Thought. 5942-5966 - Wenda Xu, Danqing Wang, Liangming Pan, Zhenqiao Song, Markus Freitag, William Wang, Lei Li:
INSTRUCTSCORE: Towards Explainable Text Generation Evaluation with Automatic Feedback. 5967-5994 - Dawei Li, Hengyuan Zhang, Yanran Li, Shiping Yang:
Multi-level Contrastive Learning for Script-based Character Understanding. 5995-6013 - Jaehyung Seo, Hyeonseok Moon, Jaewook Lee, Sugyeong Eo, Chanjun Park, Heuiseok Lim:
CHEF in the Language Kitchen: A Generative Data Augmentation Leveraging Korean Morpheme Ingredients. 6014-6029 - Ramon Ruiz-Dolz
, Stella Heras, Ana García-Fornes:
Automatic Debate Evaluation with Argumentation Semantics and Natural Language Argument Graph Networks. 6030-6040 - Evgeniia Razumovskaia, Ivan Vulic, Anna Korhonen:
Transfer-Free Data-Efficient Multilingual Slot Labeling. 6041-6055 - Kailai Yang, Shaoxiong Ji
, Tianlin Zhang, Qianqian Xie, Ziyan Kuang, Sophia Ananiadou:
Towards Interpretable Mental Health Analysis with Large Language Models. 6056-6077 - Youngwon Lee, Jinu Lee, Seung-won Hwang:
Learning to Rank Generation with Pairwise Partial Rewards. 6078-6092 - Yingqiang Gao, Jessica Lam, Nianlong Gu, Richard H. R. Hahnloser
:
GreedyCAS: Unsupervised Scientific Abstract Segmentation with Normalized Mutual Information. 6093-6108 - Ryan Tran, Canwen Xu, Julian J. McAuley:
Spoiler Detection as Semantic Text Matching. 6109-6113 - Aishwarya Padmakumar, Mert Inan, Spandana Gella, Patrick Lange, Dilek Hakkani-Tur:
Multimodal Embodied Plan Prediction Augmented with Synthetic Embodied Dialogue. 6114-6131 - Zirui Shao, Feiyu Gao, Zhongda Qi, Hangdi Xing, Jiajun Bu, Zhi Yu, Qi Zheng, Xiaozhong Liu:
GEM: Gestalt Enhanced Markup Language Model for Web Understanding via Render Tree. 6132-6145 - Kevin Pei, Ishan Jindal, Kevin Chen-Chuan Chang:
Abstractive Open Information Extraction. 6146-6158 - Sreyan Ghosh, Manan Suri, Purva Chiniya, Utkarsh Tyagi, Sonal Kumar, Dinesh Manocha:
CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network. 6159-6173 - Jingheng Ye, Yinghui Li, Qingyu Zhou
, Yangning Li, Shirong Ma, Hai-Tao Zheng, Ying Shen:
CLEME: Debiasing Multi-reference Evaluation for Grammatical Error Correction. 6174-6189 - Jonathan Kamp
, Lisa Beinborn, Antske Fokkens
:
Dynamic Top-k Estimation Consolidates Disagreement between Feature Attribution Methods. 6190-6197 - Yuhao Wu, Karthick Sharma, Chun Seah, Shuhao Zhang:
SentiStream: A Co-Training Framework for Adaptive Online Sentiment Analysis in Evolving Data Streams. 6198-6212 - Liang Zhang, Chulun Zhou, Fandong Meng, Jinsong Su, Yidong Chen, Jie Zhou:
HyperNetwork-based Decoupling to Improve Model Generalization for Few-Shot Relation Extraction. 6213-6223 - Nitesh Kumar, Steven Schockaert:
Solving Hard Analogy Questions with Relation Embedding Chains. 6224-6236 - Jocelyn Shen, Maarten Sap, Pedro Colon-Hernandez, Hae Park, Cynthia Breazeal:
Modeling Empathic Similarity in Personal Narratives. 6237-6252 - Chandan Singh, John X. Morris, Alexander M. Rush, Jianfeng Gao, Yuntian Deng:
Tree Prompting: Efficient Task Adaptation without Fine-Tuning. 6253-6267 - Canwen Xu, Daya Guo, Nan Duan, Julian J. McAuley:
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data. 6268-6278 - Liting Jiang, Di Wu, Bohui Mao, Yanbing Li, Wushour Slamu:
Empathy Intent Drives Empathy Detection. 6279-6290 - Yuanjun Shi, Linzhi Wu, Minglai Shao:
Adaptive End-to-End Metric Learning for Zero-Shot Cross-Domain Slot Filling. 6291-6301 - Joseph Marvin Imperial
, Ekaterina Kochmar:
BasahaCorpus: An Expanded Linguistic Resource for Readability Assessment in Central Philippine Languages. 6302-6309 - Deepanway Ghosal, Preksha Nema, Aravindan Raghuveer:
ReTAG: Reasoning Aware Table to Analytic Text Generation. 6310-6324 - Liang Chen, Yang Deng
, Yatao Bian, Zeyu Qin, Bingzhe Wu, Tat-Seng Chua, Kam-Fai Wong:
Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators. 6325-6341 - Yucheng Li, Bo Dong, Frank Guerin
, Chenghua Lin:
Compressing Context to Enhance Inference Efficiency of Large Language Models. 6342-6353 - Xiaonan Li, Xipeng Qiu:
MoT: Memory-of-Thought Enables ChatGPT to Self-Improve. 6354-6374 - Carlos Gómez-Rodríguez, Diego Roca, David Vilares:
4 and 7-bit Labeling for Projective and Non-Projective Dependency Trees. 6375-6384 - Chenghao Yang, Allyson Ettinger:
Can You Follow Me? Testing Situational Understanding for ChatGPT. 6385-6398 - Kellin Pelrine, Anne Imouza, Camille Thibault, Meilina Reksoprodjo, Caleb Gupta, Joel Christoph, Jean-François Godbout, Reihaneh Rabbany:
Towards Reliable Misinformation Mitigation: Generalization, Uncertainty, and GPT-4. 6399-6429 - Bashar Alhafni, Go Inoue, Christian Khairallah, Nizar Habash:
Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation. 6430-6448 - Junyi Li, Xiaoxue Cheng, Xin Zhao, Jian-Yun Nie, Ji-Rong Wen:
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models. 6449-6464 - Tianyu Gao, Howard Yen, Jiatong Yu, Danqi Chen:
Enabling Large Language Models to Generate Text with Citations. 6465-6488 - Mikel Artetxe, Vedanuj Goswami, Shruti Bhosale, Angela Fan, Luke Zettlemoyer:
Revisiting Machine Translation for Cross-lingual Classification. 6489-6499 - Shuwen Deng, Paul Prasse, David R. Reich, Tobias Scheffer, Lena A. Jäger
:
Pre-Trained Language Models Augmented with Synthetic Scanpaths for Natural Language Understanding. 6500-6507 - Leonie Weissweiler, Valentin Hofmann, Anjali Kantharuban, Anna Cai, Ritam Dutt, Amey Hengle, Anubha Kabra, Atharva Kulkarni, Abhishek Vijayakumar, Haofei Yu, Hinrich Schütze, Kemal Oflazer
, David R. Mortensen:
Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model. 6508-6524 - Quanyu Long, Wenya Wang
, Sinno Jialin Pan:
Adapt in Contexts: Retrieval-Augmented Domain Adaptation via In-Context Learning. 6525-6542 - Davis Brown, Charles Godfrey, Nicholas Konz, Jonathan H. Tu, Henry Kvinge
:
Understanding the Inner-workings of Language Models Through Representation Dissimilarity. 6543-6558 - Peng Lu, Suyuchen Wang, Mehdi Rezagholizadeh, Bang Liu, Ivan Kobyzev:
Efficient Classification of Long Documents via State-Space Models. 6559-6565 - Tianyuan Shi, Liangzhi Li, Zijian Lin, Tao Yang, Xiaojun Quan, Qifan Wang:
Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems. 6566-6580 - Joanne Boisson, Luis Espinosa Anke, José Camacho-Collados:
Construction Artifacts in Metaphor Identification Datasets. 6581-6590 - Deepak Nathani, David Wang, Liangming Pan, William Yang Wang:
MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models. 6591-6616 - Yanzhao Shi, Junzhong Ji, Xiaodan Zhang, Liangqiong Qu, Ying Liu:
Granularity Matters: Pathological Graph-driven Cross-modal Alignment for Brain CT Report Generation. 6617-6630 - Zirui Wu, Nan Hu, Yansong Feng:
Enhancing Structured Evidence Extraction for Fact Verification. 6631-6641 - Di Wu, Wasi Uddin Ahmad, Kai-Wei Chang:
Rethinking Model Selection and Decoding for Keyphrase Generation with Pre-trained Sequence-to-Sequence Models. 6642-6658 - Hannah Bast, Matthias Hertel
, Natalie Prange:
A Fair and In-Depth Evaluation of Existing End-to-End Entity Linking Systems. 6659-6672 - Hongyi Wu, Xinshu Shen, Man Lan, Shaoguang Mao, Xiaopeng Bai, Yuanbin Wu:
A Multi-Task Dataset for Assessing Discourse Coherence in Chinese Essays: Structure, Theme, and Logic Analysis. 6673-6688 - Yi Chen, Liang He
:
SKD-NER: Continual Named Entity Recognition via Span-based Knowledge Distillation with Reinforcement Learning. 6689-6700 - Chengwei Qin, Chen Chen, Shafiq Joty:
Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation. 6701-6714 - Eve Fleisig, Rediet Abebe, Dan Klein:
When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks. 6715-6726 - Arthur Hemmer, Mickaël Coustaty, Nicola Bartolo, Jérôme Brachat, Jean-Marc Ogier:
Lazy-k Decoding: Constrained Decoding for Information Extraction. 6727-6736 - Hailin Chen, Amrita Saha, Steven Chu-Hong Hoi, Shafiq Joty:
Personalized Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation. 6737-6749 - Raghav Jain, Daivik Sojitra, Arkadeep Acharya, Sriparna Saha, Adam Jatowt, Sandipan Dandapat:
Do Language Models Have a Common Sense regarding Time? Revisiting Temporal Commonsense Reasoning in the Era of Large Language Models. 6750-6774 - Shreya Havaldar, Matthew Pressimone, Eric Wong
, Lyle H. Ungar:
Comparing Styles across Languages. 6775-6791 - Jintao Liu, Zequn Zhang, Kaiwen Wei, Zhi Guo, Xian Sun, Li Jin, Xiaoyu Li:
Event Causality Extraction via Implicit Cause-Effect Interactions. 6792-6804 - Nicholas Deas, Jessica Grieser, Shana Kleiner, Desmond Patton, Elsbeth Turcan, Kathleen R. McKeown:
Evaluation of African American Language Bias in Natural Language Generation. 6805-6824 - Songbo Hu
, Han Zhou
, Moy Yuan, Milan Gritta, Guchun Zhang, Ignacio Iacobacci, Anna Korhonen, Ivan Vulic:
A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems. 6825-6851 - V. S. D. S. Mahesh Akavarapu, Arnab Bhattacharya:
Cognate Transformer for Automated Phonological Reconstruction and Cognate Reflex Prediction. 6852-6862 - Ximing Lu, Faeze Brahman, Peter West, Jaehun Jung, Khyathi Raghavi Chandu, Abhilasha Ravichander, Prithviraj Ammanabrolu, Liwei Jiang, Sahana Ramnath
, Nouha Dziri, Jillian Fisher, Bill Y. Lin, Skyler Hallinan, Lianhui Qin, Xiang Ren, Sean Welleck, Yejin Choi:
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning. 6863-6883 - Kang-il Lee, Segwang Kim, Kyomin Jung:
Weakly Supervised Semantic Parsing with Execution-based Spurious Program Filtering. 6884-6894 - Karthikeyan K, Yogarshi Vyas, Jie Ma, Giovanni Paolini, Neha Anna John, Shuai Wang, Yassine Benajiba, Vittorio Castelli, Dan Roth, Miguel Ballesteros:
Taxonomy Expansion for Named Entity Recognition. 6895-6906 - Oliver Eberle, Ilias Chalkidis, Laura Cabello, Stephanie Brandl:
Rather a Nurse than a Physician - Contrastive Explanations under Investigation. 6907-6920 - Ashutosh Dwivedi, Pradhyumna Lavania, Ashutosh Modi:
EtiCor: Corpus for Analyzing LLMs for Etiquettes. 6921-6931 - Chengwen Qi, Bowen Li, Binyuan Hui, Bailin Wang, Jinyang Li, Jinwang Wu, Yuanjun Laili:
An Investigation of LLMs' Inefficacy in Understanding Converse Relations. 6932-6953 - Weishi Wang, Yue Wang, Steven C. H. Hoi, Shafiq Joty:
Towards Low-Resource Automatic Program Repair with Meta-Learning and Pretrained Language Models. 6954-6968 - Vipul Rathore, Rajdeep Dhingra, Parag Singla, Mausam:
ZGUL: Zero-shot Generalization to Unseen Languages using Multi-source Ensembling of Language Adapters. 6969-6987 - Xue Han, Yitong Wang, Qian Hu, Pengwei Hu, Chao Deng, Junlan Feng:
Log-FGAER: Logic-Guided Fine-Grained Address Entity Recognition from Multi-Turn Spoken Dialogue. 6988-6997 - Sarkar Snigdha Sarathi Das, Haoran Zhang, Peng Shi, Wenpeng Yin
, Rui Zhang:
Unified Low-Resource Sequence Labeling by Sample-Aware Dynamic Sparse Finetuning. 6998-7010 - Franz Nowak
, Anej Svete, Li Du, Ryan Cotterell:
On the Representational Capacity of Recurrent Neural Language Models. 7011-7034 - Alessandro Stolfo, Yonatan Belinkov, Mrinmaya Sachan:
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis. 7035-7052 - Adithya Bhaskar, Tushar Tomar, Ashutosh Sathe, Sunita Sarawagi:
Benchmarking and Improving Text-to-SQL Generation under Ambiguity. 7053-7074 - Yu Zhang, Yue Zhang, Leyang Cui, Guohong Fu:
Non-autoregressive Text Editing with Copy-aware Latent Alignments. 7075-7085 - Rricha Jalota, Koel Dutta Chowdhury, Cristina España-Bonet, Josef van Genabith:
Translating away Translationese without Parallel Data. 7086-7100 - Xiao Yu
, Maximillian Chen, Zhou Yu:
Prompt-Based Monte-Carlo Tree Search for Goal-oriented Dialogue Policy Planning. 7101-7125 - Zhenwen Liang, Tianyu Yang, Jipeng Zhang, Xiangliang Zhang:
UniMath: A Foundational and Multimodal Mathematical Reasoner. 7126-7133 - Yixiao Ma, Yueyue Wu, Weihang Su, Qingyao Ai, Yiqun Liu:
CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding. 7134-7143 - William Watson
, Nicole Cho, Tucker Balch, Manuela Veloso:
HiddenTables and PyQTax: A Cooperative Game and Dataset For TableQA to Ensure Scale and Data Privacy Across a Myriad of Taxonomies. 7144-7159 - Yingxiu Zhao, Bowen Yu, Bowen Li, Haiyang Yu, Jinyang Li, Chao Wang, Fei Huang, Yongbin Li, Nevin L. Zhang:
Causal Document-Grounded Dialogue Pre-training. 7160-7174 - Darshan Prabhu, Preethi Jyothi, Sriram Ganapathy, Vinit Unni:
Accented Speech Recognition With Accent-specific Codebooks. 7175-7188 - Gorjan Radevski
, Kiril Gashteovski, Chia-Chien Hung, Carolin Lawrence, Goran Glavas:
Linking Surface Facts to Large-Scale Knowledge Graphs. 7189-7207 - Xin Zhang, Linhai Zhang, Deyu Zhou:
Sentiment Analysis on Streaming User Reviews via Dual-Channel Dynamic Graph Neural Network. 7208-7220 - Wietse de Vries, Martijn Wieling, Malvina Nissim:
DUMB: A Dutch Model Benchmark. 7221-7241 - Zhan Shi, Guoyin Wang, Ke Bai, Jiwei Li, Xiang Li, Qingjun Cui, Belinda Zeng, Trishul Chilimbi, Xiaodan Zhu:
OssCSE: Overcoming Surface Structure Bias in Contrastive Learning for Unsupervised Sentence Embedding. 7242-7254 - Juan Pablo Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Rohit Paturi, Sundararajan Srinivasan, Prashant Mathur, Brian Thompson, Marcello Federico:
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation. 7255-7274 - Xinchen Yu
, Ashley Zhao, Eduardo Blanco, Lingzi Hong
:
A Fine-Grained Taxonomy of Replies to Hate Speech. 7275-7289 - Henry Peng Zou, Cornelia Caragea:
JointMatch: A Unified Approach for Diverse and Collaborative Pseudo-Labeling to Semi-Supervised Text Classification. 7290-7301 - Niloofar Mireshghallah, Nikolai Vogler, Junxian He, Omar Florez, Ahmed El-Kishky, Taylor Berg-Kirkpatrick:
Simple Temporal Adaptation to Changing Label Sets: Hashtag Prediction via Dense KNN. 7302-7311 - Kent K. Chang, Mackenzie Cramer, Sandeep Soni, David Bamman:
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4. 7312-7327 - Marion Di Marco, Katharina Hämmerl, Alexander Fraser:
A Study on Accessing Linguistic Information in Pre-Trained Language Models by Using Prompts. 7328-7336 - Martin Funkquist, Ilia Kuznetsov, Yufang Hou, Iryna Gurevych:
CiteBench: A Benchmark for Scientific Citation Text Generation. 7337-7353 - Zheyuan Zhang, Shane Storks
, Fengyuan Hu, Sungryull Sohn, Moontae Lee, Honglak Lee, Joyce Chai:
From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning. 7354-7379 - Keito Kudo, Haruki Nagasawa, Jun Suzuki, Nobuyuki Shimizu:
A Challenging Multimodal Video Summary: Simultaneously Extracting and Generating Keyframe-Caption Pairs from Video. 7380-7402 - Antonia Karamolegkou
, Jiaang Li, Li Zhou, Anders Søgaard:
Copyright Violations and Large Language Models. 7403-7412 - Jue Hou
, Anisia Katinskaia, Anh-Duc Vu, Roman Yangarber
:
Effects of sub-word segmentation on performance of transformer language models. 7413-7425 - Justin T. Chiu, Wenting Zhao, Derek Chen, Saujas Vaduguru, Alexander M. Rush
, Daniel Fried:
Symbolic Planning and Code Generation for Grounded Dialogue. 7426-7436 - Xingchen Wan, Ruoxi Sun, Hootan Nakhost, Hanjun Dai, Julian Eisenschlos, Sercan Ö. Arik, Tomas Pfister:
Universal Self-Adaptive Prompting. 7437-7462 - Abdisalam Badel, Ting Zhong, Wenxin Tai, Fan Zhou:
Somali Information Retrieval Corpus: Bridging the Gap between Query Translation and Dedicated Language Resources. 7463-7469 - Biru Zhu, Lifan Yuan, Ganqu Cui, Yangyi Chen, Chong Fu, Bingxiang He, Yangdong Deng, Zhiyuan Liu, Maosong Sun, Ming Gu:
Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT. 7470-7483 - Qian Hu, Palash Goyal, Rahul Gupta:
Faithful Model Evaluation for Model-Based Metrics. 7484-7489 - Kai Zhang, Kaisong Song, Yangyang Kang, Xiaozhong Liu:
Content- and Topology-Aware Representation Learning for Scientific Multi-Literature. 7490-7502 - Ethan Wilcox, Clara Meister, Ryan Cotterell, Tiago Pimentel:
Language Model Quality Correlates with Psychometric Predictive Power in Multiple Languages. 7503-7511 - Zhaohui Yan, Songlin Yang, Wei Liu
, Kewei Tu:
Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural Networks. 7512-7526 - Daman Arora, Himanshu Gaurav Singh, Mausam:
Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models. 7527-7543 - Mattia Opper, Victor Prokhorov, Siddharth Narayanaswamy:
StrAE: Autoencoding for Pre-Trained Embeddings using Explicit Structure. 7544-7560 - Ryo Kamoi, Tanya Goyal, Juan Diego Rodriguez, Greg Durrett:
WiCE: Real-World Entailment for Claims in Wikipedia. 7561-7583 - Mohammad Basit, Bashir Alam, Zubaida Fatima, Salman Shaikh
:
Natural Disaster Tweets Classification Using Multimodal Data. 7584-7594 - Luiza Pozzobon, Beyza Ermis, Patrick Lewis, Sara Hooker:
On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research. 7595-7609 - Liviu P. Dinu, Ana Sabina Uban, Alina Maria Cristea, Anca Dinu, Ioan-Bogdan Iordache, Simona Georgescu
, Laurentiu Zoicas:
RoBoCoP: A Comprehensive ROmance BOrrowing COgnate Package and Benchmark for Multilingual Cognate Identification. 7610-7629 - Bin Wang, Zhengyuan Liu, Nancy Chen:
Instructive Dialogue Summarization with Query Aggregations. 7630-7653 - Brian de Silva, Kuan-Wen Huang, Gwang Lee, Karen Hovsepian, Yan Xu, Mingwei Shen:
Semantic matching for text classification with complex class descriptions. 7654-7680 - Jia-Chen Gu, Chao-Hong Tan, Caiyuan Chu, Zhen-Hua Ling, Chongyang Tao, Quan Liu, Cong Liu:
MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation. 7681-7692 - Sunkyung Lee
, Minjin Choi, Jongwuk Lee:
GLEN: Generative Retrieval via Lexical Index Learning. 7693-7704 - Zihan Zhang, Meng Fang, Fanghua Ye, Ling Chen, Mohammad-Reza Namazi-Rad:
Turn-Level Active Learning for Dialogue State Tracking. 7705-7719 - Haoqin Tu, Yitong Li, Fei Mi, Zhongliang Yang:
ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue. 7720-7735 - Yuan Tian, Nan Xu, Wenji Mao, Daniel Zeng:
Modeling Conceptual Attribute Likeness and Domain Inconsistency for Metaphor Detection. 7736-7752 - Ziling Huang, Shin'ichi Satoh:
Referring Image Segmentation via Joint Mask Contextual Embedding Learning and Progressive Alignment Network. 7753-7762 - Boxin Wang, Wei Ping, Peng Xu, Lawrence McAfee, Zihan Liu, Mohammad Shoeybi, Yi Dong, Oleksii Kuchaiev, Bo Li, Chaowei Xiao, Anima Anandkumar, Bryan Catanzaro:
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study. 7763-7786 - Xinyuan Lu, Liangming Pan, Qian Liu, Preslav Nakov, Min-Yen Kan:
SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables. 7787-7813 - Linlin Zhang, Kai Fan, Jiajun Bu, Zhongqiang Huang:
Training Simultaneous Speech Translation with Robust and Random Wait-k-Tokens Strategy. 7814-7831 - Deqing Fu, Ameya Godbole, Robin Jia
:
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative Examples. 7832-7848 - Zhihong Zhu, Xuxin Cheng, Zhiqi Huang, Dongsheng Chen, Yuexian Zou:
Enhancing Code-Switching for Cross-lingual SLU: A Unified View of Semantic and Grammatical Coherence. 7849-7856 - Zedian Xiao, William Held, Yanchen Liu, Diyi Yang:
Task-Agnostic Low-Rank Adapters for Unseen English Dialects. 7857-7870 - Tianshi Che, Ji Liu, Yang Zhou, Jiaxiang Ren, Jiwen Zhou, Victor S. Sheng, Huaiyu Dai, Dejing Dou:
Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization. 7871-7888 - Wenhu Chen, Ming Yin, Max Ku, Pan Lu, Yixin Wan, Xueguang Ma, Jianyu Xu, Xinyi Wang, Tony Xia:
TheoremQA: A Theorem-driven Question Answering Dataset. 7889-7901 - Haoxiang Su, Hongyan Xie, Hao Huang, Shuangyong Song, Ruiyu Fang, Xiaomeng Huang, Sijie Feng:
Scalable-DSC: A Structural Template Prompt Approach to Scalable Dialogue State Correction. 7902-7914 - Xiang Zhang, Senyu Li, Bradley Hauer, Ning Shi, Grzegorz Kondrak:
Don't Trust ChatGPT when your Question is not in English: A Study of Multilingual Abilities and Types of LLMs. 7915-7927 - Ke Wang, Xiutian Zhao, Yanghui Li, Wei Peng:
M³Seg: A Maximum-Minimum Mutual Information Paradigm for Unsupervised Topic Segmentation in ASR Transcripts. 7928-7934 - Tingyu Xie, Qi Li
, Jian Zhang, Yan Zhang, Zuozhu Liu, Hongwei Wang:
Empirical Study of Zero-Shot NER with ChatGPT. 7935-7956 - Reid Pryzant, Dan Iter, Jerry Li, Yin Tat Lee, Chenguang Zhu, Michael Zeng:
Automatic Prompt Optimization with "Gradient Descent" and Beam Search. 7957-7968 - Zhengbao Jiang, Frank F. Xu, Luyu Gao, Zhiqing Sun, Qian Liu, Jane Dwivedi-Yu, Yiming Yang, Jamie Callan, Graham Neubig:
Active Retrieval Augmented Generation. 7969-7992 - Mehar Bhatia, Vered Shwartz:
GD-COMET: A Geo-Diverse Commonsense Inference Model. 7993-8001 - Chenxu Yang, Zheng Lin, Lanrui Wang, Chong Tian, Liang Pang, Jiangnan Li, Qirong Ho, Yanan Cao, Weiping Wang:
Multi-level Adaptive Contrastive Learning for Knowledge Internalization in Dialogue Generation. 8002-8015 - Tomas Goldsack
, Zhihao Zhang, Chen Tang, Carolina Scarton, Chenghua Lin:
Enhancing Biomedical Lay Summarisation with External Knowledge Graphs. 8016-8032 - Wenkai Shi, Wenbin An, Feng Tian, Qinghua Zheng, Qianying Wang, Ping Chen:
A Diffusion Weighted Graph Framework for New Intent Discovery. 8033-8042 - Thi-Nhung Nguyen, Hoang Ngo, Kiem-Hieu Nguyen, Tuan-Dung Cao:
A Self-enhancement Multitask Framework for Unsupervised Aspect Category Detection. 8043-8054 - Chengcheng Han, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li, Ming Gao, Baoyuan Wang:
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models. 8055-8068 - Anej Svete, Ryan Cotterell:
Recurrent Neural Language Models as Probabilistic Finite-state Automata. 8069-8086 - Xuanhong Li
, Peng Li, Po Hu:
Revisiting Source Context in Nearest Neighbor Machine Translation. 8087-8098 - Cennet Oguz, Pascal Denis, Emmanuel Vincent, Simon Ostermann, Josef van Genabith:
Find-2-Find: Multitask Learning for Anaphora Resolution and Object Localization. 8099-8110 - Adithya Pratapa, Kevin Small, Markus Dreyer:
Background Summarization of Event Timelines. 8111-8136 - Aleksandrs Berdicevskis, Gerlof Bouma, Robin Kurtz, Felix Morger, Joey Öhman, Yvonne Adesam, Lars Borin, Dana Dannélls, Markus Forsberg, Tim Isbister, Anna Lindahl, Martin Malmsten, Faton Rekathati, Magnus Sahlgren, Elena Volodina, Love Börjeson, Simon Hengchen, Nina Tahmasebi:
Superlim: A Swedish Language Understanding Evaluation Benchmark. 8137-8153 - Shibo Hao, Yi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu:
Reasoning with Language Model is Planning with World Model. 8154-8173 - Jianling Li, Meishan Zhang, Peiming Guo, Min Zhang, Yue Zhang:
LLM-enhanced Self-training for Cross-domain Constituency Parsing. 8174-8185 - Duzhen Zhang, Wei Cong, Jiahua Dong, Yahan Yu, Xiuyi Chen, Yonggang Zhang, Zhen Fang:
Continual Named Entity Recognition without Catastrophic Forgetting. 8186-8197 - Sanket Vaibhav Mehta, Jai Gupta, Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Jinfeng Rao, Marc Najork
, Emma Strubell, Donald Metzler:
DSI++: Updating Transformer Memory with New Documents. 8198-8213 - Anshita Gupta, Debanjan Mondal, Akshay Krishna Sheshadri, Wenlong Zhao, Xiang Li, Sarah Wiegreffe, Niket Tandon:
Editing Common Sense in Transformers. 8214-8232 - Tianqi Zhong, Quan Wang, Jingxuan Han, Yongdong Zhang, Zhendong Mao:
Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation. 8233-8248 - Hosein Mohebbi
, Grzegorz Chrupala
, Willem H. Zuidema, Afra Alishahi:
Homophone Disambiguation Reveals Patterns of Context Mixing in Speech Transformers. 8249-8260 - Weizhou Shen, Yingqi Gao, Canbin Huang, Fanqi Wan, Xiaojun Quan, Wei Bi:
Retrieval-Generation Alignment for End-to-End Task-Oriented Dialogue System. 8261-8275 - Wenhao Yu, Meng Jiang, Peter Clark, Ashish Sabharwal:
IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions. 8276-8288 - Zihan Zhang, Meng Fang, Ling Chen, Mohammad-Reza Namazi-Rad, Jun Wang:
How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances. 8289-8311 - Wookje Han, Jinsol Park, Kyungjae Lee:
PreWoMe: Exploiting Presuppositions as Working Memory for Long Form Question Answering. 8312-8322 - Verna Dankers, Ivan Titov, Dieuwke Hupkes:
Memorisation Cartography: Mapping out the Memorisation-Generalisation Continuum in Neural Machine Translation. 8323-8343 - Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Hassan Foroosh, Fei Liu:
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4. 8344-8357 - Haoyi Qiu, Zi-Yi Dou, Tianlu Wang, Asli Celikyilmaz, Nanyun Peng:
Gender Biases in Automatic Evaluation Metrics for Image Captioning. 8358-8375 - Rami Aly, Marek Strong, Andreas Vlachos:
QA-NatVer: Question Answering for Natural Logic-based Fact Verification. 8376-8391 - Sarah Wiegreffe, Matthew Finlayson, Oyvind Tafjord, Peter Clark, Ashish Sabharwal:
Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy. 8392-8417 - Jiacheng Ye, Chengzu Li, Lingpeng Kong, Tao Yu:
Generating Data for Symbolic Language with Large Language Models. 8418-8443 - Vageesh Saxena, Benjamin Bashpole, Gijs van Dijck
, Gerasimos Spanakis
:
IDTraffickers: An Authorship Attribution Dataset to link and connect Potential Human-Trafficking Operations on Text Escort Advertisements. 8444-8464 - Laura Cabello, Emanuele Bugliarello
, Stephanie Brandl, Desmond Elliott
:
Evaluating Bias and Fairness in Gender-Neutral Pretrained Vision-and-Language Models. 8465-8483 - Yaxin Fan
, Feng Jiang, Peifeng Li, Fang Kong, Qiaoming Zhu:
Improving Dialogue Discourse Parsing via Reply-to Structures of Addressee Recognition. 8484-8495 - Myeongjun Jang, Thomas Lukasiewicz:
Improving Language Models' Meaning Understanding and Consistency by Learning Conceptual Roles from Dictionary. 8496-8510 - Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Ramaneswaran S., Sakshi Singh, Utkarsh Tyagi, Dinesh Manocha:
DALE: Generative Data Augmentation for Low-Resource Legal NLP. 8511-8565 - Xinge Ma, Jiangming Liu, Jin Wang, Xuejie Zhang:
FedID: Federated Interactive Distillation for Large-Scale Pretraining Language Models. 8566-8577 - Alexander Havrilla, Maksym Zhuravinskyi, Duy Phung, Aman Tiwari, Jonathan Tow, Stella Biderman, Quentin Anthony, Louis Castricato:
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback. 8578-8595 - Iker García-Ferrero, Begoña Altuna, Javier Álvez, Itziar Gonzalez-Dios, German Rigau:
This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models. 8596-8615 - Chunyou Li, Mingtong Liu, Hongxiao Zhang, Yufeng Chen, Jinan Xu, Ming Zhou:
MT2: Towards a Multi-Task Machine Translation Model with Translation-Specific In-Context Learning. 8616-8627 - Susanna Rücker, Alan Akbik:
CleanCoNLL: A Nearly Noise-Free Named Entity Recognition Dataset. 8628-8645 - Jia Peng Lim, Hady W. Lauw:
Disentangling Transformer Language Models as Superposed Topic Models. 8646-8666 - Parag Jain, Mirella Lapata:
Conversational Semantic Parsing using Dynamic Context Graphs. 8667-8679 - Tharindu Madusanka, Iqra Zahid, Hao Li, Ian Pratt-Hartmann, Riza Batista-Navarro:
Not all quantifiers are equal: Probing Transformer-based language models' understanding of generalised quantifiers. 8680-8692 - Feng Zhao, Hongzhi Zou, Cheng Yan
:
Structure-aware Knowledge Graph-to-text Generation with Planning Selection and Similarity Distinction. 8693-8703 - Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing:
SOUL: Towards Sentiment and Opinion Understanding of Language. 8704-8711 - Catalina Goanta, Nikolaos Aletras, Ilias Chalkidis, Sofia Ranchordás, Gerasimos Spanakis:
Regulation and NLP (RegNLP): Taming Large Language Models. 8712-8724 - Zexue He, Yu Wang, An Yan, Yao Liu, Eric Y. Chang, Amilcare Gentili, Julian J. McAuley, Chun-Nan Hsu:
MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation. 8725-8744 - Andreas Baumann, Andreas Stephan, Benjamin Roth:
Seeing through the mess: evolutionary dynamics of lexical polysemy. 8745-8762 - Xuyou Cheng, Michael Sejr Schlichtkrull, Guy Emerson:
Are Embedded Potatoes Still Vegetables? On the Limitations of WordNet Embeddings for Lexical Semantics. 8763-8775 - Andrea Sottana, Bin Liang, Kai Zou, Zheng Yuan:
Evaluation Metrics in the Era of GPT-4: Reliably Evaluating Large Language Models on Sequence to Sequence Tasks. 8776-8788 - Eitan Wagner, Renana Keydar, Omri Abend:
Event-Location Tracking in Narratives: A Case Study on Holocaust Testimonies. 8789-8805 - Yerin Hwang, Yongil Kim, Hyunkyung Bae, Hwanhee Lee, Jeesoo Bang, Kyomin Jung:
Dialogizer: Context-aware Conversational-QA Dataset Generation from Textual Sources. 8806-8828 - Lingyun Feng:
Learning to Predict Task Transferability via Soft Prompt. 8829-8844 - Wang Zhu, Jesse Thomason, Robin Jia
:
Chain-of-Questions Training with Latent Answers for Robust Multistep Question Answering. 8845-8860 - Tong Zhu, Junfei Ren, Zijian Yu, Mengsong Wu, Guoliang Zhang, Xiaoye Qu, Wenliang Chen, Zhefeng Wang, Baoxing Huai, Min Zhang:
Mirror: A Universal Framework for Various Information Extraction Tasks. 8861-8876 - Kunal Handa, Margaret Clapper, Jessica Boyle, Rose E. Wang, Diyi Yang, David S. Yeager, Dorottya Demszky:
"Mistakes Help Us Grow": Facilitating and Evaluating Growth Mindset Supportive Language in Classrooms. 8877-8897 - Qi Cao, Takeshi Kojima, Yutaka Matsuo, Yusuke Iwasawa:
Unnatural Error Correction: GPT-4 Can Almost Perfectly Handle Unnatural Scrambled Text. 8898-8913 - Yifu Qiu, Yftah Ziser, Anna Korhonen, Edoardo Maria Ponti, Shay B. Cohen:
Detecting and Mitigating Hallucinations in Multilingual Summarisation. 8914-8932 - Jordan Kodner
, Salam Khalifa, Sarah Ruth Brogden Payne:
Exploring Linguistic Probes for Morphological Inflection. 8933-8941 - Chao Lou, Kewei Tu:
AMR Parsing with Causal Hierarchical Attention and Pointers. 8942-8955 - Haowei Lin, Yuntian Gu:
FLatS: Principled Out-of-Distribution Detection with Feature-Based Likelihood Ratio Score. 8956-8963 - Haoqi Zheng, Qihuang Zhong, Liang Ding, Zhiliang Tian, Xin Niu, Changjian Wang, Dongsheng Li, Dacheng Tao:
Self-Evolution Learning for Mixup: Enhance Data Augmentation on Few-Shot Text Classification Tasks. 8964-8974 - David Chan, Austin Myers, Sudheendra Vijayanarasimhan, David A. Ross, John F. Canny:
IC3: Image Captioning by Committee Consensus. 8975-9003 - Potsawee Manakul, Adian Liusie, Mark J. F. Gales:
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models. 9004-9017 - Gaurav Maheshwari, Aurélien Bellet, Pascal Denis, Mikaela Keller:
Fair Without Leveling Down: A New Intersectional Fairness Definition. 9018-9032 - Manuel Faysse, Gautier Viaud
, Céline Hudelot, Pierre Colombo:
Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications. 9033-9048 - Sathish Indurthi, Shamil Chollampatt, Ravi Agrawal, Marco Turchi:
CLAD-ST: Contrastive Learning with Adversarial Data for Robust Speech Translation. 9049-9056 - Fei Zhao, Chunhui Li
, Zhen Wu, Yawen Ouyang, Jianbing Zhang, Xinyu Dai:
M2DF: Multi-grained Multi-curriculum Denoising Framework for Multimodal Aspect-based Sentiment Analysis. 9057-9070 - Siyuan Chen, Zhiling Zhang, Mengyue Wu, Kenny Q. Zhu:
Detection of Multiple Mental Disorders from Social Media with Two-Stream Psychiatric Experts. 9071-9084 - Ahmed Alajrami, Katerina Margatina, Nikolaos Aletras:
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance? 9085-9108 - Hsiu-Wen Li, Ying-Jia Lin, Yi-Ting Li, Chun Lin, Hung-Yu Kao:
Improved Unsupervised Chinese Word Segmentation Using Pre-trained Knowledge and Pseudo-labeling Transfer. 9109-9118 - Hanlin Tang, Yifu Sun, Decheng Wu, Kai Liu, Jianchen Zhu, Zhanhui Kang:
EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs. 9119-9128 - Mattia Atzeni, Mikhail Plekhanov, Frédéric A. Dreyer, Nora Kassner, Simone Merello, Louis Martin, Nicola Cancedda:
Polar Ducks and Where to Find Them: Enhancing Entity Linking with Duck Typing and Polar Box Embeddings. 9129-9146 - Qifan Wang, Yuning Mao, Jingang Wang, Hanchao Yu, Shaoliang Nie, Sinong Wang, Fuli Feng, Lifu Huang, Xiaojun Quan, Zenglin Xu, Dongfang Liu:
APrompt: Attention Prompt Tuning for Efficient Adaptation of Pre-trained Language Models. 9147-9160 - Amita Kamath, Jack Hessel, Kai-Wei Chang:
What's "up" with vision-language models? Investigating their struggle with spatial reasoning. 9161-9175 - Xiaoyue Wang, Xin Liu, Lijie Wang, Yaoxiang Wang, Jinsong Su, Hua Wu:
IBADR: an Iterative Bias-Aware Dataset Refinement Framework for Debiasing NLU models. 9176-9186 - Shijia Huang, Jianqiao Zhao, Yanyang Li, Liwei Wang:
Learning Preference Model for LLMs via Automatic Preference Data Generation. 9187-9199 - David Stap, Christof Monz:
Multilingual k-Nearest-Neighbor Machine Translation. 9200-9208 - Filip Miletic, Anne Przewozny-Desriaux, Ludovic Tanguy:
Understanding Computational Models of Semantic Change: New Insights from the Speech Community. 9209-9220 - Trang Nguyen, Naoaki Okazaki:
Causal Reasoning through Two Cognition Layers for Improving Generalization in Visual Question Answering. 9221-9236 - Jinhao Jiang, Kun Zhou, Zican Dong, Keming Ye, Xin Zhao, Ji-Rong Wen:
StructGPT: A General Framework for Large Language Model to Reason over Structured Data. 9237-9251 - Rosamond Elizabeth Thalken, Edward H. Stiglitz, David Mimno, Matthew Wilkens:
Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement. 9252-9265 - Mrigank Raman, Pratyush Maini, J. Zico Kolter, Zachary C. Lipton, Danish Pruthi:
Model-tuning Via Prompts Makes NLP Models Adversarially Robust. 9266-9286 - Daeun Lee, Sejung Son, Hyolim Jeon, Seungbae Kim, Jinyoung Han:
Learning Co-Speech Gesture for Multimodal Aphasia Type Detection. 9287-9303 - Xurui Li
, Yue Qin, Rui Zhu, Tianqianjin Lin, Yongming Fan, Yangyang Kang, Kaisong Song, Fubang Zhao, Changlong Sun, Haixu Tang, Xiaozhong Liu:
STINMatch: Semi-Supervised Semantic-Topological Iteration Network for Financial Risk Detection via News Label Diffusion. 9304-9315 - Vyoma Raman
, Eve Fleisig, Dan Klein:
Centering the Margins: Outlier-Based Identification of Harmed Populations in Toxicity Detection. 9316-9329 - Bill Noble, Nikolai Ilinykh:
Describe Me an Auklet: Generating Grounded Perceptual Category Descriptions. 9330-9347 - Dominik Stammbach, Vilém Zouhar, Alexander Miserlis Hoyle, Mrinmaya Sachan, Elliott Ash:
Revisiting Automated Topic Model Evaluation with Large Language Models. 9348-9357 - Xiutian Zhao, Ke Wang, Wei Peng:
ORCHID: A Chinese Debate Corpus for Target-Independent Stance Detection and Argumentative Dialogue Summarization. 9358-9375 - Nishanth Dikkala, Nikhil Ghosh, Raghu Meka, Rina Panigrahy, Nikhil Vyas, Xin Wang:
On the Benefits of Learning to Route in Mixture-of-Experts Models. 9376-9396 - Elizabeth Clark, Shruti Rijhwani, Sebastian Gehrmann, Joshua Maynez, Roee Aharoni, Vitaly Nikolaev, Thibault Sellam, Aditya Siddhant, Dipanjan Das, Ankur P. Parikh:
SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation. 9397-9413 - Liang Wang, Nan Yang, Furu Wei:
Query2doc: Query Expansion with Large Language Models. 9414-9423 - Yan Xue, Xuefei Cao, Xingli Yang, Yu Wang, Ruibo Wang, Jihong Li:
We Need to Talk About Reproducibility in NLP Model Comparison. 9424-9434 - Fanqi Wan, Xinting Huang, Tao Yang, Xiaojun Quan, Wei Bi, Shuming Shi:
Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration. 9435-9454 - Kazuki Irie, Róbert Csordás, Jürgen Schmidhuber:
Practical Computational Power of Linear Transformers and Their Recurrent and Self-Referential Extensions. 9455-9465 - Bodhisattwa Prasad Majumder, Zexue He, Julian J. McAuley:
InterFair: Debiasing with Natural Language Feedback for Fair Interpretable Predictions. 9466-9471 - Jiashu Pu, Ling Cheng, Lu Fan, Tangjie Lv, Rongsheng Zhang:
Just Adjust One Prompt: Enhancing In-Context Dialogue Scoring via Constructing the Optimal Subgraph of Demonstrations and Prompts. 9472-9496 - Dmitry Nikolaev
, Tanise Ceron, Sebastian Padó
:
Multilingual estimation of political-party positioning: From label aggregation to long-input Transformers. 9497-9511 - Mengze Li, Tianqi Zhao, Jionghao Bai, Baoyi He, Jiaxu Miao, Wei Ji, Zheqi Lv
, Zhou Zhao, Shengyu Zhang, Wenqiao Zhang, Fei Wu:
ART: rule bAsed futuRe-inference deducTion. 9512-9522 - Gabriele Prato, Jerry Huang, Prasanna Parthasarathi, Shagun Sodhani, Sarath Chandar:
EpiK-Eval: Evaluation for Language Models as Epistemic Models. 9523-9557 - Shanshan Xu
, T. Y. S. S. Santosh, Oana Ichim, Isabella Risini, Barbara Plank, Matthias Grabmair:
From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification. 9558-9576 - Yaoyiran Li
, Anna Korhonen, Ivan Vulic:
On Bilingual Lexicon Induction with Large Language Models. 9577-9599 - Parker Seegmiller, Sarah Preum:
Statistical Depth for Ranking and Characterizing Transformer-Based Text Embeddings. 9600-9611 - Kaiyan Zhang, Ning Ding, Biqing Qi, Xuekai Zhu, Xinwei Long, Bowen Zhou:
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model. 9612-9637 - Shivani Kumar, Ramaneswaran S., Md. Shad Akhtar, Tanmoy Chakraborty:
From Multilingual Complexity to Emotional Clarity: Leveraging Commonsense to Unveil Emotions in Code-Mixed Dialogues. 9638-9652 - Eugenio Herrera-Berg, Tomás Vergara Browne, Pablo León-Villagrá, Marc-Lluís Vives, Cristian Buc Calderon:
Large Language Models are biased to overestimate profoundness. 9653-9661 - Philippe Laban, Wojciech Kryscinski, Divyansh Agarwal, Alexander R. Fabbri, Caiming Xiong, Shafiq Joty, Chien-Sheng Wu:
SummEdits: Measuring LLM Ability at Factual Reasoning Through The Lens of Summarization. 9662-9676 - Jun-Hyung Park, Hyuntae Park
, Youjin Kang