


31st COLING 2025: Abu Dhabi, UAE - Workshops
- Proceedings of the 31st International Conference on Computational Linguistics, COLING 2025 - Workshops, Abu Dhabi, UAE, January 19-24, 2025. Association for Computational Linguistics 2025

1st Workshop on NLP for Languages Using Arabic Script
- Mo El-Haj:

Proceedings of the 1st Workshop on NLP for Languages Using Arabic Script. - Ngoc Tan Le, Ali Mijiyawa, Abdoulahat Leye, Fatiha Sadat:

The Best of Both Worlds: Exploring Wolofal in the Context of NLP. 1-6 - Farizeh Aldabbas, Shaina Ashraf, Rafet Sifa, Lucie Flek:

MultiProp Framework: Ensemble Models for Enhanced Cross-Lingual Propaganda Detection in Social Media and News using Data Augmentation, Text Segmentation, and Meta-Learning. 7-22 - Srihari Bandarupalli, Bhavana Akkiraju, Sri Charan Devarakonda, Harinie Sivaramasethu, Vamshiraghusimha Narasinga, Anil Vuppala:

Towards Unified Processing of Perso-Arabic Scripts for ASR. 23-28 - Mounes Zaval, Abdullah Ihsanoglu, Asim Ersoy, Olcay Taner Yildiz:

In-Depth Analysis of Arabic-Origin Words in the Turkish Morpholex. 29-36 - Sadegh Jafari, Farhan Farsi, Navid Ebrahimi, Mohamad Bagher Sajadi, Sauleh Eetemadi:

DadmaTools V2: an Adapter-Based Natural Language Processing Toolkit for the Persian Language. 37-43 - Vahide Tajalli, Mehrnoush Shamsfard, Fateme Kalantari:

Developing an Informal-Formal Persian Corpus: Highlighting the Differences between Two Writing Styles. 44-53 - Masoumeh Mohammadi, Mohammad Ruhul Amin, Shadi Tavakoli:

Boosting Sentiment Analysis in Persian through a GAN-Based Synthetic Data Augmentation Method. 54-63 - Sadegh Jafari, Mohammad Erfan Zare, Amireza Vishte, Mirzae Melike, Zahra Amiri, Sima Mohammadparast, Sauleh Eetemadi:

Psychological Health Chatbot, Detecting and Assisting Patients in their Path to Recovery. 64-77 - Reham Marzouk, Sondos Krouna, Nizar Habash:

A Derivational ChainBank for Modern Standard Arabic. 78-87 - Pankaj Dadure, Ananya Dixit, Kunal Tewatia, Nandini Paliwal, Anshika Malla:

Sentiment Analysis of Arabic Tweets Using Large Language Models. 88-94 - Abdulsalam obaid Alharbi, Abdullah Alsuhaibani, Abdulrahman Abdullah Alalawi, Usman Naseem, Shoaib Jameel, Salil S. Kanhere, Imran Razzak:

Evaluating Large Language Models on Health-Related Claims Across Arabic Dialects. 95-103 - Ayushman Gupta, Aryan Singhal, Thomas Law, Veekshith Rao, Evan Duan, Ryan Luo Li:

Can LLMs Verify Arabic Claims? Evaluating the Arabic Fact-Checking Abilities of Multilingual LLMs. 104-113 - Silvana Yakhni, Ali Chehab:

Can LLMs Translate Cultural Nuance in Dialects? A Case Study on Lebanese Arabic. 114-135 - Haq Nawaz, Manal Elobaid, Ali Al-Laith, Saif Ullah:

Automated Generation of Arabic Verb Conjugations with Multilingual Urdu Translation: An NLP Approach. 136-143 - Asma Ali Al Wazrah, Afrah Altamimi, Hawra Aljasim, Waad Alshammari Thuwaini, Rawan N. Al-Matham, Omar Elnashar, Mohamed Amin, Abdulrahman Alosaimy:

Evaluation of Large Language Models on Arabic Punctuation Prediction. 144-154 - Raghad Al-Rasheed, Abdullah Al Muaddi, Hawra Aljasim, Rawan N. Al-Matham, Muneera Alhoshan, Asma Al Wazrah, Abdulrahman Alosaimy:

Evaluating RAG Pipelines for Arabic Lexical Information Retrieval: A Comparative Study of Embedding and Generation Models. 155-164
18th Workshop on Building and Using Comparable Corpora (BUCC)
- Serge Sharoff, Ayla Rigouts Terryn, Pierre Zweigenbaum, Reinhard Rapp:

Proceedings of the 18th Workshop on Building and Using Comparable Corpora (BUCC). - Abdelhadi Soudi, Corinne Vinopol, Kristof Van Laerhoven:

Bilingual resources for Moroccan Sign Language Generation and Standard Arabic Skills Improvement of Deaf Children. 1-9 - Arofat Akhundjanova:

Harmonizing Annotation of Turkic Postverbial Constructions: A Comparative Study of UD Treebanks. 10-17 - Preslav Nakov:

Towards Truly Open, Language-Specific, Safe, Factual, and Specialized Large Language Models. 18 - Asli Umay Öztürk, Recep Firat Cekinel, Pinar Karagoz:

Make Satire Boring Again: Reducing Stylistic Bias of Satirical Corpus by Utilizing Generative LLMs. 19-35 - Ehsan Lotfi, Nikolay Banar, Walter Daelemans:

BEIR-NL: Zero-shot Information Retrieval Benchmark for the Dutch Language. 36-45 - Chia-Hsuan Chang, Tien-Yuan Huang, Yi-Hang Tsai, Chia-Ming Chang, San-Yih Hwang:

Refining Dimensions for Improving Clustering-based Cross-lingual Topic Models. 46-56 - Adam Meyers, Rodolfo Joel Zevallos Salazar, John E. Ortega, Lisa Wang:

The Role of Handling Attributive Nouns in Improving Chinese-To-English Machine Translation. 57-61 - Aso Mahmudi, Borja Herce, Demian Inostroza Améstica, Andreas Scherbakov, Eduard H. Hovy, Ekaterina Vylomova:

Can a Neural Model Guide Fieldwork? A Case Study on Morphological Data Collection. 62-72 - Kenneth Ward Church:

Comparable Corpora: Opportunities for New Research Directions. 73-82 - Manon Scholivet, Agata Savary, Louis Estève, Marie Candito, Carlos Ramisch:

SELEXINI - a large and diverse automatically parsed corpus of French. 83-98
First Workshop on Challenges in Processing South Asian Languages (CHiPSAL 2025)
- Kengatharaiyer Sarveswaran, Ashwini Vaidya, Bal Krishna Bal, Sana Shams, Surendrabikram Thapa:

Proceedings of the First Workshop on Challenges in Processing South Asian Languages (CHiPSAL 2025). - Kengatharaiyer Sarveswaran, Surendrabikram Thapa, Sana Shams, Ashwini Vaidya, Bal Krishna Bal:

A Brief Overview of the First Workshop on Challenges in Processing South Asian Languages (CHiPSAL). 1-8 - Prajwal Thapa, Jinu Nyachhyon, Mridul Sharma, Bal Krishna Bal:

Development of Pre-Trained Transformer-based Models for the Nepali Language. 9-16 - Munief Hassan Tahir, Sana Shams, Layba Fiaz, Farah Adeeba, Sarmad Hussain:

Benchmarking the Performance of Pre-trained LLMs across Urdu NLP Tasks. 17-34 - Nahida Akter Tanjila, Afrin Sultana Poushi, Sazid Abdullah Farhan, Abu Raihan Mostofa Kamal, Md. Azam Hossain, Md. Hamjajul Ashmafee:

Bengali ChartSumm: A Benchmark Dataset and study on feasibility of Large Language Models on Bengali Chart to Text Summarization. 35-45 - Varad Srivastava:

DweshVaani: An LLM for Detecting Religious Hate Speech in Code-Mixed Hindi-English. 46-60 - Rupak Raj Ghimire, Prakash Poudyal, Bal Krishna Bal:

Improving Accuracy of Low-resource ASR using Rule-Based Character Constituency Loss (RBCCL). 61-70 - Surendrabikram Thapa, Kritesh Rauniyar, Farhan Ahmad Jafri, Surabhi Adhikari, Kengatharaiyer Sarveswaran, Bal Krishna Bal, Hariram Veeramani, Usman Naseem:

Natural Language Understanding of Devanagari Script Languages: Language Identification, Hate Speech and its Target Detection. 71-82 - Uthayasanker Thayasivam, Thulasithan Gnanenthiram, Shamila Jeewantha, Upeksha Jayawickrama:

SiTa - Sinhala and Tamil Speaker Diarization Dataset in the Wild. 83-92 - Priyanka Dasari, Mupparapu Sohan Gupta, Nagaraju Vuppala, Pruthwik Mishra, Parameswari Krishnamurthy:

Sandhi Splitting in Tamil and Telugu: A Sequence-to-Sequence Approach Leveraging Transformer Models. 93-103 - Rahothvarman P., Adith John Rajeev, Kaveri Anuranjana, Radhika Mamidi:

Bridge the GAP: Multi-lingual Models For Ambiguous Pronominal Coreference Resolution in South Asian Languages. 104-114 - Krishan Chavinda, Uthayasanker Thayasivam:

A Dual Contrastive Learning Framework for Enhanced Hate Speech Detection in Low-Resource Languages. 115-123 - Prakash Dhakal, Daya Sagar Baral:

Abstractive Summarization of Low resourced Nepali language using Multilingual Transformers. 124-133 - Aayush Neupane, Aayush Lamichhane, Ankit Paudel, Aman Shakya:

Structured Information Extraction from Nepali Scanned Documents using Layout Transformer and LLMs. 134-143 - Sharad Duwal, Suraj Prasai, Suresh Manandhar:

Domain-adaptative Continual Learning for Low-resource Tasks: Evaluation on Nepali. 144-153 - Antony Alexander James, Parameswari Krishnamurthy:

POS-Aware Neural Approaches for Word Alignment in Dravidian Languages. 154-159 - Rhitabrat Pokharel, Ameeta Agrawal:

neDIOM: Dataset and Analysis of Nepali Idioms. 160-171 - Ayesha Khalid, Farah Adeeba, Najm Ul Sehar, Sarmad Hussain:

Bridging the Bandwidth Gap: A Mixed Band Telephonic Urdu ASR Approach with Domain Adaptation for Banking Applications. 172-184 - Ganesh Dhakal Chhetri, Kiran Chandra Dahal, Prakash Poudyal:

Impacts of Vocoder Selection on Tacotron-based Nepali Text-To-Speech Synthesis. 185-192 - Jubeerathan Thevakumar, Luxshan Thavarasa, Thanikan Sivatheepan, Sajeev Kugarajah, Uthayasanker Thayasivam:

EmoTa: A Tamil Emotional Speech Dataset. 193-201 - Najm Ul Sehar, Ayesha Khalid, Farah Adeeba, Sarmad Hussain:

Benchmarking Whisper for Low-Resource Speech Recognition: An N-Shot Evaluation on Pashto, Punjabi, and Urdu. 202-207 - A. H. M. Rezaul Karim, Özlem Uzuner:

Leveraging Machine-Generated Data for Joint Intent Detection and Slot Filling in Bangla: A Resource-Efficient Approach. 208-216 - Omkar Khade, Shruti Jagdale, Abhishek Phaltankar, Gauri Takalikar, Raviraj Joshi:

Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning. 217-222 - Jebish Purbey, Siddartha Pullakhandam, Kanwal Mehreen, Muhammad Arham, Drishti Sharma, Ashay Srivastava, Ram Mohan Rao Kadiyala:

1-800-SHARED-TASKS@NLU of Devanagari Script Languages 2025: Detection of Language, Hate Speech, and Targets using LLMs. 223-235 - Anik Mahmud Shanto, Mst. Sanjida Jamal Priya, Mohammad Shamsul Arefin:

AniSan@NLU of Devanagari Script Languages 2025: Optimizing Language Identification with Ensemble Learning. 236-241 - Rohith Gowtham Kodali, Durga Prasad Manukonda, Daniel Iglesias:

byteSizedLLM@NLU of Devanagari Script Languages 2025: Hate Speech Detection and Target Identification Using Customized Attention BiLSTM and XLM-RoBERTa Base Embeddings. 242-247 - Durga Prasad Manukonda, Rohith Gowtham Kodali:

byteSizedLLM@NLU of Devanagari Script Languages 2025: Language Identification Using Customized Attention BiLSTM and XLM-RoBERTa base Embeddings. 248-252 - Md. Refaj Hossan, Nazmus Sakib, Md. Alam Miah, Jawad Hossain, Mohammed Moshiul Hoque:

CUET_Big_O@NLU of Devanagari Script Languages 2025: Identifying Script Language and Detecting Hate Speech Using Deep Learning and Transformer Model. 253-259 - Sumaiya Rahman Aodhora, Shawly Ahsan, Mohammed Moshiul Hoque:

CUET_HateShield@NLU of Devanagari Script Languages 2025: Transformer-Based Hate Speech Detection in Devanagari Script Languages. 260-266 - Farjana Alam Tofa, Lorin Tasnim Zeba, Md Osama, Ashim Dey:

CUET_INSights@NLU of Devanagari Script Languages 2025: Leveraging Transformer-based Models for Target Identification in Hate Speech. 267-272 - Michael Ibrahim:

CUFE@NLU of Devanagari Script Languages 2025: Language Identification using fastText. 273-277 - Ashok Yadav, Vrijendra Singh:

Dll5143A@NLU of Devanagari Script Languages 2025: Detection of Hate Speech and Targets Using Hierarchical Attention Network. 278-288 - Shraddha Chauhan, Abhinav Kumar:

DSLNLP@NLU of Devanagari Script Languages 2025: Leveraging BERT-based Architectures for Language Identification, Hate Speech Detection and Target Classification. 289-294 - Siddhant Gupta, Siddh Singhal, Azmine Toushik Wasi:

IITR-CIOL@NLU of Devanagari Script Languages 2025: Multilingual Hate Speech Detection and Target Identification in Devanagari-Scripted Languages. 295-300 - Rushendra Sidibomma, Pransh Patwa, Parth Patwa, Aman Chadha, Vinija Jain, Amitava Das:

LLMsAgainstHate@NLU of Devanagari Script Languages 2025: Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs. 301-307 - Prabhat Ale, Anish Thapaliya, Suman Paudel:

MDSBots@NLU of Devanagari Script Languages 2025: Detection of Language, Hate Speech, and Targets using MURTweet. 308-313 - Pilot Khadka, Ankit Bk, Ashish Acharya, Bikram K. C., Sandesh Shrestha, Rabin Thapa:

Nepali Transformers@NLU of Devanagari Script Languages 2025: Detection of Language, Hate Speech and Targets. 314-319 - Anmol Guragain, Nadika Poudel, Rajesh Piryani, Bishesh Khanal:

NLPineers@ NLU of Devanagari Script Languages 2025: Hate Speech Detection using Ensembling of BERT-based models. 320-326 - Dola Chakraborty, Jawad Hossain, Mohammed Moshiul Hoque:

One_by_zero@ NLU of Devanagari Script Languages 2025: Target Identification for Hate Speech Leveraging Transformer-based Approach. 327-333 - Darwin Acharya, Sundeep Dawadi, Shivram Saud, Sunil Regmi:

Paramananda@NLU of Devanagari Script Languages 2025: Detection of Language, Hate Speech and Targets using FastText and BERT. 334-338 - Shubham Shakya, Saral Sainju, Subham Krishna Shrestha, Prekshya Dawadi, Shreya Khatiwada:

SKPD Emergency @ NLU of Devanagari Script Languages 2025: Devanagari Script Classification using CBOW Embeddings with Attention-Enhanced BiLSTM. 339-343
1st Workshop on Computational Humor (CHum)
- Christian F. Hempelmann, Julia Rayz, Tiansi Dong, Tristan Miller:

Proceedings of the 1st Workshop on Computational Humor (CHum). - Alexander Kilpatrick, Maria Flaksman:

The Exception of Humor: Iconicity, Phonemic Surprisal, Memory Recall, and Emotional Associations. 1-8 - Ashwin Baluja:

Text Is Not All You Need: Multimodal Prompting Helps LLMs Understand Humor. 9-17 - Mathieu Dehouck, Marine Delaborde:

Rule-based Approaches to the Automatic Generation of Puns Based on Given Names in French. 18-22 - Yash Raj Sarrof:

Homophonic Pun Generation in Code Mixed Hindi English. 23-31 - Likhith Asapu, Prashant Kodali, Ashna Dua, Kapil Rajesh Kavitha, Manish Shrivastava:

Bridging Laughter Across Languages: Generation of Hindi-English Code-mixed Puns. 32-57 - Stephen Skalicky, Salvatore Attardo:

Testing Humor Theory Using Word and Sentence Embeddings. 58-62 - Joshua Lee, Wyatt Fong, Alexander Le, Sur Shah, Kevin Han, Kevin Zhu:

Pragmatic Metacognitive Prompting Improves LLM Performance on Sarcasm Detection. 63-70 - Joe Toplyn, Ori Amir:

Can AI Make Us Laugh? Comparing Jokes Generated by Witscript and a Human Expert. 71-78 - Narendra Nath Joshi:

Evaluating Human Perception and Bias in AI-Generated Humor. 79-87 - Piotr Mirowski, Kory W. Mathewson, Boyd Branch:

The Theater Stage as Laboratory: Review of Real-Time Comedy LLM Systems for Live Performance. 88-95 - Vittorio Marone:

The Algorithm is the Message: Computing as a Humor-Generating Mode. 96-100
New Horizons in Computational Linguistics for Religious Texts
- Sane Yagi, Majdi Sawalha, Bayan Abu Shawar, Abdallah AlShdaifat, Norhan Abbas:

Proceedings of the New Horizons in Computational Linguistics for Religious Texts. - A. D. Mahit Nandan, Ishan Godbole, Pranav M. Kapparad, Shrutilipi Bhattacharjee:

Comparative Analysis of Religious Texts: NLP Approaches to the Bible, Quran, and Bhagavad Gita. 1-10 - Kuanlin Liu:

Messages from the Quran and the Bible in Mandarin through Factor Analysis with Syntactic and Semantic Tags. 11-22 - Rashin Rahnamoun, Ramin Rahnamoun:

Semantic Analysis of Jurisprudential Zoroastrian Texts in Pahlavi: A Word Embedding Approach for an Extremely Under-Resourced, Extinct Language. 23-41 - Vera Pavlova:

Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval. 42-52 - Mohammad Mohammad Khair, Majdi Sawalha:

Automated Translation of Islamic Literature Using Large Language Models: Al-Shamela Library Application. 53-58 - Khubaib Amjad Alam, Maryam Khalid, Syed Ahmed Ali, Haroon Mahmood, Qaisar Shafi, Muhammad Haroon, Zulqarnain Haider:

Automated Authentication of Quranic Verses Using BERT (Bidirectional Encoder Representations from Transformers) based Language Models. 59-66 - Majdi Sawalha, Faisal Alshargi, Sane Yagi, Abdallah AlShdaifat, Bassam Hammo:

MASAQ Parser: A Fine-grained MorphoSyntactic Analyzer for the Quran. 67-75 - Shatha Altammami:

Leveraging AI to Bridge Classical Arabic and Modern Standard Arabic for Text Simplification. 76-85 - Pablo Mosteiro, Damián E. Blasi:

Word boundaries and the morphology-syntax trade-off. 86-93
5th Celtic Language Technology Workshop
- Brian Davis, Theodorus Fransen, Elaine Uí Dhonnchadha, Abigail Walsh:

Proceedings of the 5th Celtic Language Technology Workshop. - Adrian Doyle, John P. McCrae:

An Assessment of Word Separation Practices in Old Irish Text Resources and a Universal Method for Tokenising Old Irish Text. 1-11 - William Lamb, Dongge Han, Ondrej Klejch, Beatrice Alex, Peter Bell:

Synthesising a Corpus of Gaelic Traditional Narrative with Cross-Lingual Text Expansion. 12-26 - Monica Ward, Liang Xu, Elaine Uí Dhonnchadha:

A Pragmatic Approach to Using Artificial Intelligence and Virtual Reality in Digital Game-Based Language Learning. 27-34 - Liam Lonergan, Ibon Saratxaga, John Sloan, Oscar Maharg Bravo, Mengjie Qian, Neasa Ní Chiaráin, Christer Gobl, Ailbhe Ní Chasaide:

Fotheidil: an Automatic Transcription System for the Irish Language. 35-45 - Teresa Clifford, Abigail Walsh, Brian Davis, Mícheál J. Ó Meachair:

Gaeilge Bhriste ó Shamhlacha Cliste: How Clever Are LLMs When Translating Irish Text? 46-51
Context and Meaning: Navigating Disagreements in NLP Annotation
- Michael Roth, Dominik Schlechtweg:

Proceedings of Context and Meaning: Navigating Disagreements in NLP Annotation. - Giulia Rizzi, Paolo Rosso, Elisabetta Fersini:

Is a bunch of words enough to detect disagreement in hateful content? 1-11 - Frances Yung, Vera Demberg:

On Crowdsourcing Task Design for Discourse Relation Annotation. 12-19 - Russel D'Souza, Venelin Kovatchev:

Sources of Disagreement in Data for LLM Instruction Tuning. 20-32 - Dominik Schlechtweg, Tejaswi Choppa, Wei Zhao, Michael Roth:

CoMeDi Shared Task: Median Judgment Classification & Mean Disagreement Ranking with Ordinal Word-in-Context Judgments. 33-47 - Mikhail Kuklin, Nikolay Arefyev:

Deep-change at CoMeDi: the Cross-Entropy Loss is not All You Need. 48-64 - Tejaswi Choppa, Michael Roth, Dominik Schlechtweg:

Predicting Median, Disagreement and Noise Label in Ordinal Word-in-Context Data. 65-77 - David Alfter, Mattias Appelgren:

GRASP at CoMeDi Shared Task: Multi-Strategy Modeling of Annotator Behavior in Multi-Lingual Semantic Judgments. 78-89 - Olufunke Oluyemi Sarumi, Charles Welch, Lucie Flek, Jörg Schlötterer:

Funzac at CoMeDi Shared Task: Modeling Annotator Disagreement from Word-In-Context Perspectives. 90-96 - Phuoc Duong Huy Chu:

FuocChuVIP123 at CoMeDi Shared Task: Disagreement Ranking with XLM-Roberta Sentence Embeddings and Deep Neural Regression. 97-102 - Zhu Liu, Zhen Hu, Ying Liu:

JuniperLiu at CoMeDi Shared Task: Models as Annotators in Lexical Semantics Disagreements. 103-112 - Tai Duc Le, Thin Dang Van:

MMLabUIT at CoMeDiShared Task: Text Embedding Techniques versus Generation-Based NLI for Median Judgment Classification. 113-121 - Ying Xuan Loke, Dominik Schlechtweg, Wei Zhao:

ABDN-NLP at CoMeDi Shared Task: Predicting the Aggregated Human Judgment via Weighted Few-Shot Prompting. 122-128 - Adrien Bibal, Nathaniel Gerlek, Goran Muric, Elizabeth Boschee, Steven Fincke, Mike Ross, Steven N. Minton:

Automating Annotation Guideline Improvements using LLMs: A Case Study. 129-144 - Shira Wein:

Ambiguity and Disagreement in Abstract Meaning Representation. 145-154 - Alec Sánchez-Montero, Gemma Bel-Enguix, Sergio-Luis Ojeda-Trueba, Gerardo Sierra:

Disagreement in Metaphor Annotation of Mexican Spanish Science Tweets. 155-164
First Workshop of Evaluation of Multi-Modal Generation
- Wei Emma Zhang, Xiang Dai, Desmond Elliott, Byron Fang, Mong Yuan Sim, Haojie Zhuang, Weitong Chen:

Proceedings of the First Workshop of Evaluation of Multi-Modal Generation. - Sana Javaid Raja, Adeel Zafar, Aqsa Shoaib:

A Dataset for Programming-based Instructional Video Classification and Question Answering. 1-9 - Mohammad Javad Pirhadi, Motahhare Mirzaei, Sauleh Eetemadi:

CVT5: Using Compressed Video Encoder and UMT5 for Dense Video Captioning. 10-23 - Yuyu Bai, Sandro Pezzelle:

If I feel smart, I will do the right thing: Combining Complementary Multimodal Information in Visual Language Models. 24-39 - Tao Sun, Oliver Liu, JinJin Li, Lan Ma:

LLaVA-RE: Binary Image-Text Relevancy Evaluation with Multimodal Large Language Model. 40-51 - Farhan Farsi, Shahriar Shariati Motlagh, Shayan Bali, Sadra Sabouri, Saeedeh Momtazi:

Persian in a Court: Benchmarking VLMs In Persian Multi-Modal Tasks. 52-56 - Hsin-Yi Hsieh, Shang-Wei Liu, Chang Chih Meng, Shuo-Yueh Lin, Chien-Hua Chen, Hung-Ju Lin, Hen-Hsen Huang, I-Chen Wu:

TaiwanVQA: A Benchmark for Visual Question Answering for Taiwanese Daily Life. 57-75 - Neelabh Sinha, Vinija Jain, Aman Chadha:

Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types. 76-94
Joint Workshop of the 9th Financial Technology and Natural Language Processing (FinNLP), the 6th Financial Narrative Processing (FNP), and the 1st Workshop on Large Language Models for Finance and Legal (LLMFinLegal)
- Chung-Chi Chen, Antonio Moreno-Sandoval, Jimin Huang, Qianqian Xie, Sophia Ananiadou, Hsin-Hsi Chen:

Proceedings of the Joint Workshop of the 9th Financial Technology and Natural Language Processing (FinNLP), the 6th Financial Narrative Processing (FNP), and the 1st Workshop on Large Language Models for Finance and Legal (LLMFinLegal). - Claudia Biancotti, Carolina Camassa, Andrea Coletta, Oliver Giudice, Aldo Glielmo:

Chat Bankman-Fried: an Exploration of LLM Alignment in Finance. 1-22 - Neelesh K. Shukla, Prabhat Prabhakar, Sakthivel Thangaraj, Sandeep Singh, Weiyi Sun, C. Prasanna Venkatesan, Viji Krishnamurthy:

GraphRAG Analysis for Financial Narrative Summarization and A Framework for Optimizing Domain Adaptation. 23-34 - Dongsheng Wang, Ran Zmigrod, Mathieu Sibue, Yulong Pei, Petr Babkin, Ivan Brugere, Xiaomo Liu, Nacho Navarro, Antony Papadimitriou, William Watson, Zhiqiang Ma, Armineh Nourbakhsh, Sameena Shah:

BuDDIE: A Business Document Dataset for Multi-task Information Extraction. 35-47 - Xuanyu Zhang, Qing Yang:

FinMoE: A MoE-based Large Chinese Financial Language Model. 48-53 - Sunisth Kumar, Mohammed Elkholy, Davide Liu, Alexandre Boulenger:

Bridging the Gap: Efficient Cross-Lingual NER in Low-Resource Financial Domain. 54-62 - Alexei Gustavo Figueroa Rosero, Paul Grundmann, Julius Freidank, Wolfgang Nejdl, Alexander Löser:

Evaluating Financial Literacy of Large Language Models through Domain Specific Languages for Plain Text Accounting. 63-75 - Chetan Harsha, Karmvir Singh Phogat, Sridhar Dasaratha, Sai Akhil Puranam, Shashishekar Ramakrishna:

Synthetic Data Generation Using Large Language Models for Financial Question Answering. 76-95 - Cheng-Yu Lin, Jyh-Shing Roger Jang:

Concept-Based RAG Models: A High-Accuracy Fact Retrieval Approach. 96-100 - Benno Uthayasooriyar, Antoine Ly, Franck Vermet, Caio Corro:

Training LayoutLM from Scratch for Efficient Named-Entity Recognition in the Insurance Domain. 101-110 - Mateusz Klimaszewski, Pinzhen Chen, Liane Guillou, Ioannis Papaioannou, Barry Haddow, Alexandra Birch:

AveniBench: Accessible and Versatile Evaluation of Finance Intelligence. 111-117 - Felix Drinkall, Janet B. Pierrehumbert, Stefan Zohren:

Forecasting Credit Ratings: A Case Study where Traditional Methods Outperform Generative LLMs. 118-133 - Anushka Yadav, Sai Krishna Rallabandi, Parag Pravin Dakle, Preethi Raghavan:

Investigating the effectiveness of length based rewards in DPO for building Conversational Financial Question Answering Systems. 134-140 - Sixing Yan, Ting Zhu:

CreditLLM: Constructing Financial AI Assistant for Credit Products using Financial LLM and Few Data. 141-152 - Zhiyu Xu, Yi Liu, Yuchi Wang, Ruihan Bao, Keiko Harimoto, Xu Sun:

Modeling Interactions Between Stocks Using LLM-Enhanced Graphs for Volume Prediction. 153-163 - Yi-Te Lu, Yintong Huo:

Financial Named Entity Recognition: How Far Can LLM Go? 164-168 - Yuxiang Wang, Yuchi Wang, Yi Liu, Ruihan Bao, Keiko Harimoto, Xu Sun:

Proxy Tuning for Financial Sentiment Analysis: Overcoming Data Scarcity and Computational Barriers. 169-174 - Mohamed Ettaleb, Mouna Kamel, Nathalie Aussenac-Gilles, Véronique Moriceau:

The contribution of LLMs to relation extraction in the economic field. 175-183 - Shunsuke Nishida, Takehito Utsuro:

Generating Financial News Articles from Factors of Stock Price Rise / Decline by LLMs. 184-195 - Xinlin Wang, Mats Brorsson:

Can Large language model analyze financial statements well? 196-206 - Muhammad S. Abdo, Yash A. Hatekar, Damir Cavar:

AMWAL: Named Entity Recognition for Arabic Financial News. 207-213 - Antonio Moreno-Sandoval, Jordi Porta-Zamorano, Blanca Carbajo-Coronado, Yanco Torterolo, Doaa Samy:

The Financial Document Causality Detection Shared Task (FinCausal 2025). 214-221 - Neelesh K. Shukla, Sandeep Singh, Prabhat Kumar Prabhakar, Sakthivel Thangaraj, Weiyi Sun, C. Prasanna Venkatesan, Viji Krishnamurthy:

KULFi Framework: Knowledge Utilization for Optimizing Large Language Models for Financial Causal Reasoning. 222-229 - Ali Al-Laith:

Exploring the Effectiveness of Multilingual and Generative Large Language Models for Question Answering in Financial Texts. 230-235 - Vibhavkrishnan K. S., Pattabhi R. K. Rao, Sobha Lalitha Devi:

CLRG@FinCausal2025: Cause-Effect Extraction in Finance Domain. 236-241 - Avinash Trivedi, Gauri Toshniwal, Sangeetha Sivanesan, S. R. Balasundaram:

Sarang at FinCausal 2025: Contextual QA for Financial Causality Detection Combining Extractive and Generative Models. 242-247 - Pulkit Chatwal, Amit Agarwal, Ankush Mittal:

Enhancing Causal Relationship Detection Using Prompt Engineering and Large Language Models. 248-252 - Georg Niess, Houssam Razouk, Stasa Mandic, Roman Kern:

Addressing Hallucination in Causal Q&A: The Efficacy of Fine-tuning over Prompting in LLMs. 253-258 - Medha Jeenoor, Madiha Aziz, Saipriya Dipika Vaidyanathan, Avijit Samantraya, Sandeep Mathias:

PresiUniv at FinCausal 2025 Shared Task: Applying Fine-tuned Language Models to Explain Financial Cause and Effect with Zero-shot Learning. 259-264 - Marcelo Jose Moreno Aviles, Alejandro Vaca:

Extracting Financial Causality through QA: Insights from FinCausal 2025 Spanish Subtask. 265-270 - Zhiwei Liu, Keyi Wang, Zhuo Bao, Xin Zhang, Jiping Dong, Kailai Yang, Mohsinul Kabir, Polydoros Giannouris, Rui Xing, Seongchan Park, Jaehong Kim, Dong Li, Qianqian Xie, Sophia Ananiadou:

FinNLP-FNP-LLMFinLegal-2025 Shared Task: Financial Misinformation Detection Challenge Task. 271-276 - Zheyang Luo, Guangbin Zhang, Jiahao Xiao, Xuankang Zhang, Yulin Dou, Jiangming Liu:

FMD-Mllama at the Financial Misinformation Detection Challenge Task: Multimodal Reasoning and Evidence Generation. 277-282 - Sonal Singh, Rahul Mehta, Yadunath Gupta, Soudip Roy Chowdhury:

Ask Asper at the Financial Misinformation Detection Challenge Task: Enhancing Financial Decision-Making: A Dual Approach Using Explainable LLMs for Misinformation Detection. 283-287 - Ken Kawamura:

Team FMD LLM at the Financial Misinformation Detection Challenge Task: Exploring Task Structuring and Metadata Impact on Performance. 288-296 - Dongjun Lee, Heesoo Park:

Dunamu ML at the Financial Misinformation Detection Challenge Task: Improving Supervised Fine-Tuning with LLM-based Data Augmentation. 297-301 - Jebish Purbey, Siddhant Gupta, Nikhil Manali, Siddartha Pullakhandam, Drishti Sharma, Ashay Srivastava, Ram Mohan Rao Kadiyala:

1-800-SHARED-TASKS at the Financial Misinformation Detection Challenge Task: Sequential Learning for Claim Verification and Explanation Generation in Financial Domains. 302-307 - Alphaeus Dmonte, Roland Oruche, Marcos Zampieri, Eunmi Ko, Prasad Calyam:

GMU-MU at the Financial Misinformation Detection Challenge Task: Exploring LLMs for Financial Claim Verification. 308-312 - Harika Abburi, Alex Chandler, Edward Bowen, Sanmitra Bhattacharya, Nirmala Pudota:

Deloitte (Drocks) at the Financial Misinformation Detection Challenge Task: Enhancing Misinformation Detection through Instruction-Tuned Models. 313-320 - Yupeng Cao, Haohang Li, Yangyang Yu, Shashidhar Reddy Javaji:

Capybara at the Financial Misinformation Detection Challenge Task: Chain-of-Thought Enhanced Financial Misinformation Detection. 321-325 - Santiago Martínez, Juan Manuel Castañeda, Rubén Manrique:

A Scalable Framework for Legal Text Understanding in Regulatory and Financial Contexts. 326-334 - Jiajia Huang, Maowei Jiang, Haoran Zhu:

Audit-FT at the Regulations Challenge Task: An Open-Source Large Language Model for Audit. 335-348 - Pantid Chantangphol, Pornchanan Balee, Kantapong Sucharitpongpan, Chanatip Saetia, Tawunrat Chalothorn:

FinMind-Y-Me at the Regulations Challenge Task: Financial Mind Your Meaning based on THaLLE. 349-362 - Keyi Wang, Jaisal Patel, Charlie Shen, Daniel S. Kim, Andy Zhu, Alex Lin, Luca Borella, Cailean Osborne, Matt White, Steve Yang, Kairong Xiao, Xiao-Yang Liu:

FinNLP-FNP-LLMFinLegal-2025 Shared Task: Regulations Challenge. 363-370 - Shijia Jiang, Yongfu Dai, Haochen Jia, Yuxin Wang, Hao Wang:

IntelliChain Stars at the Regulations Challenge Task: A Large Language Model for Financial Regulation. 371-384 - Rungsiman Nararatwong, Natthawut Kertkeidkachorn, Hiroya Takamura, Ryutaro Ichise:

Fin-DBQA Shared-task: Database Querying and Reasoning. 385-391 - Jan Strich:

Adapt LLM for Multi-turn Reasoning QA using Tidy Data. 392-400 - Yangyang Yu, Haohang Li, Yupeng Cao, Keyi Wang, Zhiyang Deng, Zhiyuan Yao, Yuechen Jiang, Dong Li, Ruey-Ling Weng, Jordan W. Suchow:

FinNLP-FNP-LLMFinLegal @ COLING 2025 Shared Task: Agent-Based Single Cryptocurrency Trading Challenge. 401-406 - You Wang, Jingyi Wei, Mingsong Ye:

Sam's Fans at the Crypto Trading Challenge Task: A Threshold-Based Decision Approach Based on FinMem Framework. 407-413 - Artem Agarkov, Mihail Kulik, Leonid Shmyrkov:

300k/ns team at the Crypto Trading Challenge Task: Enhancing the justification of accurate trading decisions through parameter-efficient fine-tuning of reasoning models. 414-422
1st Workshop on GenAI Content Detection (GenAIDetect)
- Firoj Alam, Preslav Nakov, Nizar Habash, Iryna Gurevych, Shammur A. Chowdhury, Artem Shelmanov, Yuxia Wang, Ekaterina Artemova, Mucahid Kutlu, Georgios Mikros:

Proceedings of the 1st Workshop on GenAI Content Detection (GenAIDetect). - Aldan Creo, Shushanta Pudasaini:

SilverSpeak: Evading AI-Generated Text Detectors using Homoglyphs. 1-46 - Philipp Moeßner, Heike Adel:

Human vs. AI: A Novel Benchmark and a Comparative Study on the Detection of Generated Images and the Impact of Prompts. 47-58 - Josh Baradia, Shubham Gupta, Suman Kundu:

Mirror Minds : An Empirical Study on Detecting LLM-Generated Text via LLMs. 59-67 - Shushanta Pudasaini, Luis Miralles, David Lillis, Marisa Llorens-Salvador:

Benchmarking AI Text Detection: Assessing Detectors Against New Datasets, Evasion Tactics, and Enhanced LLMs. 68-77 - G. Charbel N. Kindji, Lina Maria Rojas-Barahona, Elisa Fromont, Tanguy Urvoy:

Cross-table Synthetic Tabular Data Detection. 78-84 - Hope McGovern, Rickard Stureborg, Yoshi Suhara, Dimitris Alikaniotis:

Your Large Language Models are Leaving Fingerprints. 85-95 - Ishika Rathi, Sydney Taylor, Benjamin Bergen, Cameron R. Jones:

GPT-4 is Judged More Human than Humans in Displaced and Inverted Turing Tests. 96-110 - Vasudha Varadarajan, Salvatore Giorgi, Siddharth Mangalik, Nikita Soni, David M. Markowitz, H. Andrew Schwartz:

The Consistent Lack of Variance of Psychological Factors Expressed by LLMs and Spambots. 111-119 - Elyas Masrour, Bradley Emi, Max Spero:

DAMAGE: Detecting Adversarially Modified AI Generated Text. 120-133 - Andric Valdez-Valenzuela, Helena Gómez-Adorno, Manuel Montes-y-Gómez:

Text Graph Neural Networks for Detecting AI-Generated Content. 134-139 - Kaan Efe Keles, Ömer Kaan Gürbüz, Mucahid Kutlu:

I Know You Did Not Write That! A Sampling Based Watermarking Method for Identifying Machine Generated Text. 140-149 - Zhaowen Zhang, Songhao Chen, Bingquan Liu:

DCBU at GenAI Detection Task 1: Enhancing Machine-Generated Text Detection with Semantic and Probabilistic Features. 150-154 - Hanh Thi Hong Tran, Tien Nam Nguyen:

L3i++ at GenAI Detection Task 1: Can Label-Supervised LLaMA Detect Machine-Generated Text? 155-160 - Gull Mehak, Amna Qasim, Abdul Gafar Manuel Meque, Nisar Hussain, Grigori Sidorov, Alexander F. Gelbukh:

TechExperts(IPN) at GenAI Detection Task 1: Detecting AI-Generated Text in English and Multilingual Contexts. 161-165 - Mihaly Kiss, Gábor Berend:

SzegedAI at GenAI Detection Task 1: Beyond Binary - Soft-Voting Multi-Class Classification for Binary Machine-Generated Text Detection Across Diverse Language Models. 166-172 - Claudiu Creanga, Teodor-George Marchitan, Liviu P. Dinu:

Team Unibuc - NLP at GenAI Detection Task 1: Qwen it detect machine-generated text? 173-177 - Karla Schäfer, Martin Steinebach:

Fraunhofer SIT at GenAI Detection Task 1: Adapter Fusion for AI-generated Text Detection. 178-183 - Shifali Agrahari, Sanasam Ranbir Singh:

OSINT at GenAI Detection Task 1: Multilingual MGT Detection: Leveraging Cross-Lingual Adaptation for Robust LLMs Text Identification. 184-190 - Hancheol Park, Jaeyeon Kim, Geonmin Kim, Tae-Ho Kim:

Nota AI at GenAI Detection Task 1: Unseen Language-Aware Detection System for Multilingual Machine-Generated Text. 191-196 - Annepaka Yadagiri, Sai Teja Lekkala, Mandadoddi Srikar Vardhan, Partha Pakray, Reddi Krishna:

CNLP-NITS-PP at GenAI Detection Task 1: AI-Generated Text Using Transformer-Based Approaches. 197-202 - Md Kamrujjaman Mobin, Md Saiful Islam:

LuxVeri at GenAI Detection Task 1: Inverse Perplexity Weighted Ensemble for Robust Detection of AI-Generated Text across English and Multilingual Contexts. 203-208 - Nhi Hoai Doan, Kentaro Inui:

Grape at GenAI Detection Task 1: Leveraging Compact Models and Linguistic Features for Robust Machine-Generated Text Detection. 209-217 - Avanti Bhandarkar, Ronald Wilson, Damon L. Woodard:

AAIG at GenAI Detection Task 1: Exploring Syntactically-Aware, Resource-Efficient Small Autoregressive Decoders for AI Content Detection. 218-224 - Kaan Efe Keles, Mucahid Kutlu:

TurQUaz at GenAI Detection Task 1: Dr. Perplexity or: How I Learned to Stop Worrying and Love the Finetuning. 225-229 - Azad Singh, Vishnu Tripathi, Ravindra Kumar Pandey, Pragyanand Saho, Prakhar Joshi, Neel Mani, Richa Alagh, Pallaw Mishra, Piyush Arora:

AI-Monitors at GenAI Detection Task 1: Fast and Scalable Machine Generated Text Detection. 230-235 - German Gritsai, Anastasia Voznyuk, Ildar A. Khabutdinov, Andrey V. Grabovoy:

Advacheck at GenAI Detection Task 1: AI Detection Powered by Domain-Aware Multi-Tasking. 236-243 - Yuxia Wang, Artem Shelmanov, Jonibek Mansurov, Akim Tsvigun, Vladislav Mikhailov, Rui Xing, Zhuohan Xie, Jiahui Geng, Giovanni Puccetti, Ekaterina Artemova, Jinyan Su, Minh Ngoc Ta, Mervat Abassy, Kareem Ashraf Elozeiri, Saad El Dine Ahmed El Etter, Maiya Goloburda, Tarek Mahmoud, Raj Vardhan Tomar, Nurkhan Laiyk, Osama Mohammed Afzal, Ryuto Koike, Masahiro Kaneko, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov:

GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human. 244-261 - Tolulope Olalekan Abiola, Tewodros Achamaleh Bizuneh, Fatima Uroosa, Nida Hafeez, Grigori Sidorov, Olga Kolesnikova, Olumide Ebenezer Ojo:

CIC-NLP at GenAI Detection Task 1: Advancing Multilingual Machine-Generated Text Detection. 262-270 - Tolulope Olalekan Abiola, Tewodros Achamaleh Bizuneh, Oluwatobi Joseph Abiola, Temitope Olasunkanmi Oladepo, Olumide Ebenezer Ojo, Grigori Sidorov, Olga Kolesnikova:

CIC-NLP at GenAI Detection Task 1: Leveraging DistilBERT for Detecting Machine-Generated Text in English. 271-277 - Sai Teja Lekkala, Annepaka Yadagiri, Mangadoddi Srikar Vardhan, Partha Pakray:

nits_teja_srikar at GenAI Detection Task 2: Distinguishing Human and AI-Generated Essays Using Machine Learning and Transformer Models. 278-283 - Mohammad Al-Smadi:

IntegrityAI at GenAI Detection Task 2: Detecting Machine-Generated Academic Essays in English and Arabic Using ELECTRA and Stylometry. 284-289 - Kaijie Jiao, Xingyu Yao, Shixuan Ma, Sifan Fang, Zikang Guo, Benfeng Xu, Licheng Zhang, Quan Wang, Yongdong Zhang, Zhendong Mao:

CMI-AIGCX at GenAI Detection Task 2: Leveraging Multilingual Proxy LLMs for Machine-Generated Text Detection in Academic Essays. 290-298 - Shifali Agrahari, Subhashi Jayant, Saurabh Kumar, Sanasam Ranbir Singh:

EssayDetect at GenAI Detection Task 2: Guardians of Academic Integrity: Multilingual Detection of AI-Generated Essays. 299-306 - Annepaka Yadagiri, Reddi Krishna, Partha Pakray:

CNLP-NITS-PP at GenAI Detection Task 2: Leveraging DistilBERT and XLM-RoBERTa for Multilingual AI-Generated Text Detection. 307-311 - Rana Gharib, Ahmed Elgendy:

RA at GenAI Detection Task 2: Fine-tuned Language Models For Detection of Academic Authenticity, Results and Thoughts. 312-316 - Vijayasaradhi Indurthi, Vasudeva Varma:

Tesla at GenAI Detection Task 2: Fast and Scalable Method for Detection of Academic Essay Authenticity. 317-322 - Shammur Absar Chowdhury, Hind A. Al-Merekhi, Mucahid Kutlu, Kaan Efe Keles, Fatema Ahmad, Tasnim Mohiuddin, Georgios Mikros, Firoj Alam:

GenAI Content Detection Task 2: AI vs. Human - Academic Essay Authenticity Challenge. 323-333 - Sai Teja Lekkala, Annepaka Yadagiri, Mangadoddi Srikar Vardhan, Partha Pakray:

CNLP-NITS-PP at GenAI Detection Task 3: Cross-Domain Machine-Generated Text Detection Using DistilBERT Techniques. 334-339 - Abishek R. Edikala, Gregorios A. Katsios, Noelie Creaghe, Ning Yu:

Leidos at GenAI Detection Task 3: A Weight-Balanced Transformer Approach for AI Generated Text Detection Across Domains. 340-346 - Bradley Emi, Max Spero, Elyas Masrour:

Pangram at GenAI Detection Task 3: An Active Learning Approach to Machine-Generated Text Detection. 347-351 - Md Kamrujjaman Mobin, Md Saiful Islam:

LuxVeri at GenAI Detection Task 3: Cross-Domain Detection of AI-Generated Text Using Inverse Perplexity-Weighted Ensemble of Fine-Tuned Transformer Models. 352-357 - Hemanth Kandula, Chak-Fai Li, Haoling Qiu, Damianos G. Karakos, Hieu Man, Thien Huu Nguyen, Brian Ulicny:

BBN-U.Oregon's ALERT system at GenAI Content Detection Task 3: Robust Authorship Style Representations for Cross-Domain Machine-Generated Text Detection. 358-364 - Shifali Agrahari, Prabhat Mishra, Sujit Kumar:

Random at GenAI Detection Task 3: A Hybrid Approach to Cross-Domain Detection of Machine-Generated Text with Adversarial Attack Mitigation. 365-370 - Matthieu Dubois, François Yvon, Pablo Piantanida:

MOSAIC at GENAI Detection Task 3 : Zero-Shot Detection Using an Ensemble of Models. 371-376 - Liam Dugan, Andrew Zhu, Firoj Alam, Preslav Nakov, Marianna Apidianaki, Chris Callison-Burch:

GenAI Content Detection Task 3: Cross-Domain Machine Generated Text Detection Challenge. 377-388
Workshop on Generative AI and Knowledge Graphs (GenAIK)
- Genet Asefa Gesese, Harald Sack, Heiko Paulheim, Albert Meroño-Peñuela, Lihu Chen:

Proceedings of the Workshop on Generative AI and Knowledge Graphs (GenAIK). - Pratik Saini, Tapas Nayak:

Effective Modeling of Generative Framework for Document-level Relational Triple Extraction. 1-12 - Anastasia Martynova, Vladislav Tishin, Natalia Semenova:

Learn Together: Joint Multitask Finetuning of Pretrained KG-enhanced LLM for Downstream Tasks. 13-19 - Samin Jamshidi, Yllias Chali:

GNET-QG: Graph Network for Multi-hop Question Generation. 20-26 - Aakash Mahalingam, Vinesh Kumar Gande, Aman Chadha, Vinija Jain, Divya Chaudhary:

SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval. 27-42 - Dmitrii Iarosh, Alexander Panchenko, Mikhail Salnikov:

On Reducing Factual Hallucinations in Graph-to-Text Generation Using Large Language Models. 43-53 - Mariam Barry, Gaëtan Caillaut, Pierre Halftermeyer, Raheel Qader, Mehdi Mouayad, Fabrice Le Deit, Dimitri Cariolaro, Joseph Gesnouin:

GraphRAG: Leveraging Graph-Based Efficiency to Minimize Hallucinations in LLM-Driven RAG for Finance Data. 54-65 - Farida Helmy Eldessouky, Nourhan Ehab, Carolin Schindler, Mervat Abuelkheir, Wolfgang Minker:

Structured Knowledge meets GenAI: A Framework for Logic-Driven Language Models. 66-68 - Thamer Mecharnia, Mathieu d'Aquin:

Performance and Limitations of Fine-Tuned LLMs in SPARQL Query Generation. 69-77 - Na Dong, Natthawut Kertkeidkachorn, Xin Liu, Kiyoaki Shirai:

Refining Noisy Knowledge Graph with Large Language Models. 78-86 - André Gomes Regino, Júlio Cesar dos Reis:

Can LLMs be Knowledge Graph Curators for Validating Triple Insertions? 87-99 - Makbule Gulcin Ozsoy, Leila Messallem, Jon Besga, Gianandrea Minneci:

Text2Cypher: Bridging Natural Language and Graph Databases. 100-108 - Anuj Kumar, Pardeep Kumar, Abhishek Yadav, Satyadev Ahlawat, Yamuna Prasad:

KGFakeNet: A Knowledge Graph-Enhanced Model for Fake News Detection. 109-122 - Martina Toshevska, Slobodan Kalajdziski, Sonja Gievska:

Style Knowledge Graph: Augmenting Text Style Transfer with Knowledge Graphs. 123-135 - Morteza Kamaladdini Ezzabady, Farah Benamara:

Entity Quality Enhancement in Knowledge Graphs through LLM-based Question Answering. 136-145 - Hamit Kavas, Marc Serra-Vidal, Leo Wanner:

Multilingual Skill Extraction for Job Vacancy-Job Seeker Matching in Knowledge Graphs. 146-155
First Workshop on Natural Language Processing for Indo-Aryan and Dravidian Languages
- Ruvan Weerasinghe, Isuri Anuradha, Deshan K. Sumanathilaka:

Proceedings of the First Workshop on Natural Language Processing for Indo-Aryan and Dravidian Languages. - Daisy Monika Lal, Paul Rayson, Mo El-Haj:

Hindi Reading Comprehension: Do Large Language Models Exhibit Semantic Understanding? 1-10 - Sandun Sameera Perera, Deshan Koshala Sumanathilaka:

Machine Translation and Transliteration for Indo-Aryan Languages: A Systematic Review. 11-21 - Atharva Mutsaddi, Anvi Jamkhande, Aryan Shirish Thakre, Yashodhara Haribhakta:

BERTopic for Topic Modeling of Hindi Short Texts: A Comparative Study. 22-32 - Muhammad Saad Amin, Luca Anselma, Alessandro Mazzei:

Evaluating Structural and Linguistic Quality in Urdu DRS Parsing and Generation through Bidirectional Evaluation. 33-43 - Rashi Goel, Fatiha Sadat:

Studying the Effect of Hindi Tokenizer Performance on Downstream Tasks. 44-49 - Raviraj Joshi, Kanishk Singla, Anusha Kamath, Raunak Kalani, Rakesh Paul, Utkarsh Vaidya, Sanjay Singh Chauhan, Niranjan Wartikar, Eileen Long:

Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus: A Case Study for Hindi LLMs. 50-57 - Shantipriya Parida, Shashikanta Sahoo, Sambit Sekhar, Kalyanamalini Sahoo, Ketan Kotwal, Sonal Khosla, Satya Ranjan Dash, Aneesh Bose, Guneet Singh Kohli, Smruti Smita Lenka, Ondrej Bojar:

OVQA: A Dataset for Visual Question Answering and Multimodal Research in Odia Language. 58-66 - Braveenan Sritharan, Uthayasanker Thayasivam:

Advancing Multilingual Speaker Identification and Verification for Indo-Aryan and Dravidian Languages. 67-73 - Isuru Bandaranayake, Hakim Usoof:

Sentiment Analysis of Sinhala News Comments Using Transformers. 74-82 - Riddhiman Swanan Debnath, Nahian Beente Firuj, Abdul Wadud Shakib, Sadia Sultana, Md Saiful Islam:

ExMute: A Context-Enriched Multimodal Dataset for Hateful Memes. 83-89 - Yash Kumar, Subhajit Roy:

Studying the capabilities of Large Language Models in solving Combinatorics Problems posed in Hindi. 90-99 - Hrithik Majumdar Shibu, Shrestha Datta, Md. Sumon Miah, Nasrullah Sami, Mahruba Sharmin Chowdhury, Md Saiful Islam:

From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs. 100-107 - Xinjie Zhao, Hao Wang, Shyaman Maduranga Sriwarnasinghe, Jiacheng Tang, Shiyun Wang, Sayaka Sugiyama, So Morikawa:

Enhancing Participatory Development Research in South Asia through LLM Agents System: An Empirically-Grounded Methodological Initiative from Field Evidence in Sri Lankan. 108-121 - Bharath Kancharla, Prabhjot Singh, Lohith Bhagavan Kancharla, Yashita Chama, Raksha Sharma:

Identifying Aggression and Offensive Language in Code-Mixed Tweets: A Multi-Task Transfer Learning Approach. 122-128 - Saurabh Kumar, Dhruvkumar Babubhai Kakadiya, Sanasam Ranbir Singh:

Team IndiDataMiner at IndoNLP 2025: Hindi Back Transliteration - Roman to Devanagari using LLaMa. 129-134 - Sandun Sameera Perera, Lahiru Prabhath Jayakodi, Deshan Koshala Sumanathilaka, Isuri Anuradha:

IndoNLP 2025 Shared Task: Romanized Sinhala to Sinhala Reverse Transliteration Using BERT. 135-140 - Samreen Kazi, Maria Rahim, Shakeel Ahmed Khoja:

Crossing Language Boundaries: Evaluation of Large Language Models on Urdu-English Question Answering. 141-151 - Sudhansu Bala Das, Samujjal Choudhury, Tapas Kumar Mishra, Bidyut Kr. Patra:

Investigating the Effect of Backtranslation for Indic Languages. 152-165 - Yomal De Mel, Kasun Wickramasinghe, Nisansa de Silva, Surangika Ranathunga:

Sinhala Transliteration: A Comparative Analysis Between Rule-based and Seq2Seq Approaches. 166-173 - Bajiyo Baiju, Kavya Manohar, Leena G. Pillai, Elizabeth Sherly:

Romanized to Native Malayalam Script Transliteration Using an Encoder-Decoder Framework. 174-178
First Workshop on Language Models for Low-Resource Languages
- Hansi Hettiarachchi, Tharindu Ranasinghe, Paul Rayson, Ruslan Mitkov, Mohamed Medhat Gaber, Damith Premasiri, Fiona Anting Tan, Lasitha Uyangodage:

Proceedings of the First Workshop on Language Models for Low-Resource Languages. - Hansi Hettiarachchi, Tharindu Ranasinghe, Paul Rayson, Ruslan Mitkov, Mohamed Medhat Gaber, Damith Premasiri, Fiona Anting Tan, Lasitha Randunu Chandrakantha Uyangodage:

Overview of the First Workshop on Language Models for Low-Resource Languages (LoResLM 2025). 1-8 - Guokan Shang, Hadi Abdine, Yousef Khoubrane, Amr Mohamed, Yassine Abbahaddou, Sofiane Ennadir, Imane Momayiz, Xuguang Ren, Eric Moulines, Preslav Nakov, Michalis Vazirgiannis, Eric P. Xing:

Atlas-Chat: Adapting Large Language Models for Low-Resource Moroccan Arabic Dialect. 9-30 - Hojjat Mokhtarabadi, Ziba Zamani, Abbas Maazallahi, Mohammad Hossein Manshaei:

Empowering Persian LLMs for Instruction Following: A Novel Dataset and Training Approach. 31-67 - Sadia Alam, Md Farhan Ishmam, Navid Hasin Alvee, Md Shahnewaz Siddique, Md Azam Hossain, Abu Raihan Mostofa Kamal:

BnSentMix: A Diverse Bengali-English Code-Mixed Dataset for Sentiment Analysis. 68-77 - Zahra Habibzadeh, Masoud Asadpour:

Using Language Models for assessment of users' satisfaction with their partner in Persian. 78-88 - Atharva Mutsaddi, Aditya Choudhary:

Enhancing Plagiarism Detection in Marathi with a Weighted Ensemble of TF-IDF and BERT Embeddings for Low-Resource Language Processing. 89-100 - Sani Abdullahi Sani, Shamsuddeen Hassan Muhammad, Devon Jarvis:

Investigating the Impact of Language-Adaptive Fine-Tuning on Sentiment Analysis in Hausa Language Using AfriBERTa. 101-111 - Anastasia Zhukova, Christian E. Matt, Bela Gipp:

Automated Collection of Evaluation Dataset for Semantic Search in Low-Resource Domain Language. 112-122 - Lance Calvin Lim Gamboa, Mark Lee:

Filipino Benchmarks for Measuring Sexist and Homophobic Bias in Multilingual Language Models from Southeast Asia. 123-134 - Van-Hien Tran, Raj Dabre, Hour Kaing, Haiyue Song, Hideki Tanaka, Masao Utiyama:

Exploiting Word Sense Disambiguation in Large Language Models for Machine Translation. 135-144 - Maciej Rapacz, Aleksander Smywinski-Pohl:

Low-Resource Interlinear Translation: Morphology-Enhanced Neural Models for Ancient Greek. 145-165 - Ibrahim Merad, Amos Wolf, Ziad Mazzawi, Yannick Léo:

Language verY Rare for All. 166-174 - Sundesh Donthi, Maximilian Spencer, Om Patel, Joon Yong Doh, Eid Rodan, Kevin Zhu, Sean O'Brien:

Improving LLM Abilities in Idiomatic Translation. 175-181 - Yifan Liu, Gelila Tilahun, Xinxiang Gao, Qianfeng Wen, Michael Gervers:

A Comparative Study of Static and Contextual Embeddings for Analyzing Semantic Changes in Medieval Latin Charters. 182-192 - Maïmouna Ouattara, Abdoul Kader Kaboré, Jacques Klein, Tegawendé F. Bissyandé:

Bridging Literacy Gaps in African Informal Business Management with Low-Resource Conversational Agents. 193-203 - Jayanta Sadhu, Maneesha Rani Saha, Rifat Shahriyar:

Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Religious Bias. 204-218 - Jan Christian Blaise Cruz:

Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation. 219-224 - Sina Bagheri Nezhad, Ameeta Agrawal, Rhitabrat Pokharel:

Beyond Data Quantity: Key Factors Driving Performance in Multilingual Language Models. 225-239 - Alexis Matzopoulos, Charl Hendriks, Hishaam Mahomed, Francois Meyer:

BabyLMs for isiXhosa: Data-Efficient Language Modelling in a Low-Resource Context. 240-248 - Andreea Ioana Tudor, Tsegaye Misikir Tashu:

Mapping Cross-Lingual Sentence Representations for Low-Resource Language Pairs Using Pre-trained Language Models. 249-257 - Anika Harju, Rob van der Goot:

How to age BERT Well: Continuous Training for Historical Language Adaptation. 258-267 - Muhammad Saad Amin, Luca Anselma, Alessandro Mazzei:

Exploiting Task Reversibility of DRS Parsing and Generation: Challenges and Insights from a Multi-lingual Perspective. 268-286 - Latofat Bobojonova, Arofat Akhundjanova, Phil Sidney Ostheimer, Sophie Fellenz:

BBPOS: BERT-based Part-of-Speech Tagging for Uzbek. 287-293 - Vikrant Dewangan, Bharath Raj S, Garvit Suri, Raghav Sonavane:

When Every Token Counts: Optimal Segmentation for Low-Resource Language Models. 294-308 - Yana Veitsman, Mareike Hartmann:

Recent Advancements and Challenges of Turkic Central Asian Language Processing. 309-324 - Uriel Anderson Lasheras, Vládia Pinheiro:

CaLQuest.PT: Towards the Collection and Evaluation of Natural Causal Ladder Questions in Portuguese for AI Agents. 325-343 - Kamyar Zeinalipour, Neda Jamshidi, Fahimeh Akbari, Marco Maggini, Monica Bianchini, Marco Gori:

PersianMCQ-Instruct: A Comprehensive Resource for Generating Multiple-Choice Questions in Persian. 344-372 - Galim Turumtaev:

Stop Jostling: Adaptive Negative Sampling Reduces the Marginalization of Low-Resource Language Tokens by Cross-Entropy Loss. 373-386 - Omer Nacar, Serry Taiseer Sibaee, Samar Ahmed, Safa Ben Atitallah, Adel Ammar, Yasser AlHabashi, Abdulrahman S. Al-Batati, Arwa Alsehibani, Nour Qandos, Omar Elshehy, Mohamed Abdelkader, Anis Koubaa:

Towards Inclusive Arabic LLMs: A Culturally Aligned Benchmark in Arabic Large Language Model Evaluation. 387-401 - Daria Kryvosheieva, Roger Levy:

Controlled Evaluation of Syntactic Knowledge in Multilingual Language Models. 402-413 - Hongpu Zhu, Yuqi Liang, Wenjing Xu, Hongzhi Xu:

Evaluating Large Language Models for In-Context Learning of Linguistic Patterns In Unseen Low Resource Languages. 414-426 - Yuqian Dai, Chun Fai Chan, Ying Ki Wong, Tsz Ho Pun:

Next-Level Cantonese-to-Mandarin Translation: Fine-Tuning and Post-Processing with LLMs. 427-436 - Archchana Sindhujan, Diptesh Kanojia, Constantin Orasan, Shenbin Qian:

When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages. 437-459 - Alphaeus Dmonte, Shrey Satapara, Rehab Alsudais, Tharindu Ranasinghe, Marcos Zampieri:

Does Machine Translation Impact Offensive Language Identification? The Case of Indo-Aryan Languages. 460-468 - Zola Mahlaza, C. Maria Keet, Imaan Sayed, Alexander Van Der Leek:

IsiZulu noun classification based on replicating the ensemble approach for Runyankore. 469-478 - Kamyar Zeinalipour, Moahmmad Saad, Marco Maggini, Marco Gori:

From Arabic Text to Puzzles: LLM-Driven Development of Arabic Educational Crosswords. 479-495
First Workshop on Multilingual Counterspeech Generation
- Helena Bonaldi, María Estrella Vallecillo Rodríguez, Irune Zubiaga, Arturo Montejo-Ráez, Aitor Soroa, María Teresa Martín-Valdivia, Marco Guerini, Rodrigo Agerri:

Proceedings of the First Workshop on Multilingual Counterspeech Generation. - Michael Bennie, Demi Zhang, Bushi Xiao, Jing Cao, Chryseis Xinyi Liu, Jian Meng, Alayo Tripp:

PANDA - Paired Anti-hate Narratives Dataset from Asia: Using an LLM-as-a-Judge to Create the First Chinese Counterspeech Dataset. 1-12 - Ravindran V.:

RSSN at Multilingual Counterspeech Generation: Leveraging Lightweight Transformers for Efficient and Context-Aware Counter-Narrative Generation. 13-18 - Sahil Wadhwa, Chengtian Xu, Haoming Chen, Aakash Mahalingam, Akankshya Kar, Divya Chaudhary:

Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization. 19-28 - David Salvador Preciado-Márquez, Helena Gómez-Adorno, Ilia Markov, Selene Baez Santamaría:

NLP@IIMAS-CLTL at Multilingual Counterspeech Generation: Combating Hate Speech Using Contextualized Knowledge Graph Representations and LLMs. 29-36 - Michael Bennie, Bushi Xiao, Chryseis Xinyi Liu, Demi Zhang, Jian Meng, Alayo Tripp:

CODEOFCONDUCT at Multilingual Counterspeech Generation: A Context-Aware Model for Robust Counterspeech Generation in Low-Resource Languages. 37-46 - Xinglin Lyu, Haolin Wang, Min Zhang, Hao Yang:

HW-TSC at Multilingual Counterspeech Generation. 47-55 - Emanuele Moscato, Arianna Muti, Debora Nozza:

MilaNLP@Multilingual Counterspeech Generation: Evaluating Translation and Background Knowledge Filtering. 56-64 - Md Shariq Farhan:

Hyderabadi Pearls at Multilingual Counterspeech Generation : HALT : Hate Speech Alleviation using Large Language Models and Transformers. 65-76 - Daniel Russo:

TrenTeam at Multilingual Counterspeech Generation: Multilingual Passage Re-Ranking Approaches for Knowledge-Driven Counterspeech Generation Against Hate. 77-91 - Helena Bonaldi, María Estrella Vallecillo Rodríguez, Irune Zubiaga, Arturo Montejo-Ráez, Aitor Soroa, María Teresa Martín-Valdivia, Marco Guerini, Rodrigo Agerri:

The First Workshop on Multilingual Counterspeech Generation at COLING 2025: Overview of the Shared Task. 92-107
First International Workshop on Nakba Narratives as Language Resources
- Mustafa Jarrar, Nizar Habash, Mo El-Haj:

Proceedings of the First International Workshop on Nakba Narratives as Language Resources. - Zainab Sabra:

Deciphering Implicatures: On NLP and Oral Testimonies. 1-8 - Terry Regier, Muhammad Ali Khalidi:

A cultural shift in Western perceptions of Palestine. 9-17 - Annie K. Lamar, Rick Castle, Carissa Chappell, Emmanouela Schoinoplokaki, Allene M. Seet, Amit Shilo, Chloe Nahas:

Cognitive Geographies of Catastrophe Narratives: Georeferenced Interview Transcriptions as Language Resource for Models of Forced Displacement. 18-29 - Huthaifa I. Ashqar:

Sentiment Analysis of Nakba Oral Histories: A Critical Study of Large Language Models. 30-36 - Izza AbuHaija, Salim Al Mandhari, Mo El-Haj, Jonas Sibony, Paul Rayson:

The Nakba Lexicon: Building a Comprehensive Dataset from Palestinian Literature. 37-47 - Osama Hamed, Nadeem Zaidkilani:

Arabic Topic Classification Corpus of the Nakba Short Stories. 48-55 - Osama Hamed, Nadeem Zaidkilani:

Exploring Author Style in Nakba Short Stories: A Comparative Study of Transformer-Based Models. 56-62 - Nada Hamarsheh, Zahia Elabour, Aya Murra, Adnan Yahya:

Detecting Inconsistencies in Narrative Elements of Cross Lingual Nakba Texts. 63-74 - Mohamed Ibrahim Ragab, Ensaf Hussein Mohamed, Walaa Medhat:

Multilingual Propaganda Detection: Exploring Transformer-Based Models mBERT, XLM-RoBERTa, and mT5. 75-82 - Ghadir A. Awad, Tamara N. Rayan, Lavinia Dunagan, David Gamba:

Collective Memory and Narrative Cohesion: A Computational Study of Palestinian Refugee Oral Histories in Lebanon. 83-102 - Paulina Garcia Corral, Hannah Béchara, Krishnamoorthy Manohara, Slava Jankin:

The Missing Cause: An Analysis of Causal Attributions in Reporting on Palestine. 103-113 - Marryam Yahya Mohammed, Esraa Ismail Mohamed, Mariam Nabil Esmat, Yomna Ashraf Nagib, Nada Ahmed Radwan, Ziad Mohamed Elshaer, Ensaf Hussein Mohamed:

Bias Detection in Media: Traditional Models vs. Transformers in Analyzing Social Media Coverage of the Israeli-Gaza Conflict. 114-121 - Esma Fatima Bilgin Tasdemir, Saziye Betül Özates:

NakbaTR: A Turkish NER Dataset for Nakba Narratives. 122-126 - Sara Nabhani, Claudia Borg, Kurt Micallef, Khalid Al-Khatib:

Integrating Argumentation Features for Enhanced Propaganda Detection in Arabic Narratives on the Israeli War on Gaza. 127-149
Bridging Neurons and Symbols for Natural Language Processing and Knowledge Graphs Reasoning @ COLING 2025
- Kang Liu, Yangqiu Song, Zhen Han, Rafet Sifa, Shizhu He, Yunfei Long:

Proceedings of Bridging Neurons and Symbols for Natural Language Processing and Knowledge Graphs Reasoning @ COLING 2025. - Kangil Lee, Jinwoo Jang, Youngjin Lim, Minsu Shin:

Chain of Knowledge Graph: Information-Preserving Multi-Document Summarization for Noisy Documents. 1-5 - Jinze Sun, Yongpan Sheng, Lirong He, Yongbin Qin, Ming Liu, Tao Jia:

CEGRL-TKGR: A Causal Enhanced Graph Representation Learning Framework for Temporal Knowledge Graph Reasoning. 6-17 - Yu Bai, Baoqiang Liu, Shuang Xue, Fang Cai, Na Ye, Guiping Zhang:

Reasoning Knowledge Filter for Logical Table-to-Text Generation. 18-30 - Wangtao Sun, Shizhu He, Jun Zhao, Kang Liu:

From Chain to Tree: Refining Chain-like Rules into Tree-like Rules on Knowledge Graphs. 31-39 - Rui Guo, Barry Devereux, Greg Farnan, Niall McLaughlin:

LAB-KG: A Retrieval-Augmented Generation Method with Knowledge Graphs for Medical Lab Test Interpretation. 40-50 - Tiansi Dong, Writwick Das, Rafet Sifa:

Bridging Language and Scenes through Explicit 3-D Model Construction. 51-60 - Yu Bai, Lianji Wang, Xiang Liu, Haifeng Chi, Guiping Zhang:

VCRMNER: Visual Cue Refinement in Multimodal NER using CLIP Prompts. 61-70 - Xin Kang, Veronika Shteyngardt, Yuhan Wang, Dov Dori:

Neuro-Conceptual Artificial Intelligence: Integrating OPM with Deep Learning to Enhance Question Answering Quality. 71-85 - Ali Al-Saeedi, Aki Härmä:

Emergence of symbolic abstraction heads for in-context learning in large language models. 86-96 - Yulia Zinova, David Arps, Katharina Spalek, Jacopo Romoli:

Linking language model predictions to human behaviour on scalar implicatures. 97-106 - Harish Tayyar Madabushi, Taylor Hudson, Claire Bonial:

Generative FrameNet: Scalable and Adaptive Frames for Interpretable Knowledge Storage and Retrieval for LLMs Powered by LLMs. 107-119
1st Regulatory NLP Workshop (RegNLP 2025)
- Tuba Gokhan, Kexin Wang, Iryna Gurevych, Ted Briscoe:

Proceedings of the 1st Regulatory NLP Workshop (RegNLP 2025). - Tuba Gokhan, Kexin Wang, Iryna Gurevych, Ted Briscoe:

Shared Task RIRAG-2025: Regulatory Information Retrieval and Answer Generation. 1-4 - Shriya Vaagdevi Chikati, Samuel Larkin, David Minicola, Chi-kiu Lo:

Challenges in Technical Regulatory Text Variation Detection. 5-9 - Ehsan Lotfi, Nikolay Banar, Nerses Yuzbashyan, Walter Daelemans:

Bilingual BSARD: Extending Statutory Article Retrieval to Dutch. 10-21 - Kishore Vanapalli, Aravind Kilaru, Omair Shafiq, Shahzad Khan:

Unifying Large Language Models and Knowledge Graphs for efficient Regulatory Information Retrieval and Answer Generation. 22-30 - Jhon Stewar Rayo Mosquera, Carlos Raul De La Rosa Peredo, Mario Garrido Cordoba:

A Hybrid Approach to Information Retrieval and Answer Generation for Regulatory Texts. 31-35 - Jebish Purbey, Drishti Sharma, Siddhant Gupta, Khawaja Murad, Siddartha Pullakhandam, Ram Mohan Rao Kadiyala:

1-800-SHARED-TASKS at RegNLP: Lexical Reranking of Semantic Retrieval (LeSeR) for Regulatory Question Answering. 36-40 - Yash H. Malviya, Karan Dhingra, Maneesh Singh:

MST-R: Multi-Stage Tuning for Retrieval Systems and Metric Evaluation. 41-51 - Ioannis Chasandras, Odysseas S. Chlapanis, Ion Androutsopoulos:

AUEB-Archimedes at RIRAG-2025: Is Obligation concatenation really all you need? 52-58 - Asim Abbas, Mark Lee, Niloofer Shanavas, Venelin Kovatchev, Mubashir Ali:

Structured Tender Entities Extraction from Complex Tables with Few-short Learning. 59-67 - Fengzhao Sun, Jun Yu, Jiaming Hou, Yutong Lin, Tianyu Liu:

A Two-Stage LLM System for Enhanced Regulatory Information Retrieval and Answer Generation. 68-72 - Mariam Babar Khan, Huma Ameer, Seemab Latif, Mehwish Fatima:

NUST Nova at RIRAG 2025: A Hybrid Framework for Regulatory Information Retrieval and Question Answering. 73-78 - Muhammad Rouhan Faisal, Muhammad Abdullah, Faizyaab Ali Shah, Shalina Riaz, Huma Ameer, Seemab Latif, Mehwish Fatima:

NUST Alpha at RIRAG 2025: Fusion RAG for Bridging Lexical and Semantic Retrieval and Question Answering. 79-84 - Huma Ameer, Muhammad Hannan Akram, Seemab Latif, Mehwish Fatima:

NUST Omega at RIRAG 2025: Investigating Context-aware Retrieval and Answer Generations-Lessons and Challenges. 85-90 - Kübranur Umar, Hakan Dogan, Onur Özcan, Ismail Karakaya, Alper Karamanlioglu, Berkan Demirel:

Enhancing Regulatory Compliance Through Automated Retrieval, Reranking, and Answer Generation. 91-96 - Ozan Bayer, Elif Nehir Ulu, Yasemin Sarkin, Ekrem Sütçü, Defne Buse Çelik, Alper Karamanlioglu, Ismail Karakaya, Berkan Demirel:

A REGNLP Framework: Developing Retrieval-Augmented Generation for Regulatory Document Analysis. 97-101 - Devin Quinn, Sumit Pai, Iman Yousfi, Nirmala Pudota, Sanmitra Bhattacharya:

Regulatory Question-Answering using Generative AI. 102-106 - Xinyan Zhang, Xiaobing Feng, Xiujuan Xu, Zhiliang Zheng, Kai Wu:

RIRAG: A Bi-Directional Retrieval-Enhanced Framework for Financial Legal QA in ObliQA Shared Task. 107-113 - Islam Aushev, Egor Kratkov, Evgenii Nikolaev, Andrei Glinskii, Vasilii Krikunov, Alexander Panchenko, Vasily Konovalov, Julia Belikova:

RAGulator: Effective RAG for Regulatory Question Answering. 114-120
Second Workshop in South East Asian Language Processing
- Derry Wijaya, Alham Fikri Aji, Clara Vania, Genta Indra Winata, Ayu Purwarianti:

Proceedings of the Second Workshop in South East Asian Language Processing. - Jacob Simon Bernardo, Maria Regina Justina E. Estuar:

bAI-bAI: A Context-Aware Transliteration System for Baybayin Scripts. 1-9 - Wilson Wongso, David Samuel Setiawan, Steven Limcorn, Ananto Joyoadikusumo:

NusaBERT: Teaching IndoBERT to be Multilingual and Multicultural. 10-26 - Pachara Boonsarngsuk, Pacharapon Arpanantikul, Supakorn Hiranwipas, Wipu Watcharakajorn, Ekapol Chuangsuwanich:

Evaluating Sampling Strategies for Similarity-Based Short Answer Scoring: a Case Study in Thailand. 27-41 - Phakphum Artkaew:

Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning. 42-51 - Sulthan Abiyyu Hakim, Rizal Setya Perdana, Tirana Noor Fatyanosa:

Anak Baik: A Low-Cost Approach to Curate Indonesian Ethical and Unethical Instructions. 52-62 - Rifqi Naufal Abdjul, Dessi Puji Lestari, Ayu Purwarianti, Candy Olivia Mawalim, Sakriani Sakti, Masashi Unoki:

Indonesian Speech Content De-Identification in Low Resource Transcripts. 63-71 - Ian Kamajaya, David Moeljadi:

IndoMorph: a Morphology Engine for Indonesian. 72-81 - Ayu Purwarianti, Dea Adhista, Agung Baptiso, Miftahul Mahfuzh, Yusrina Sabila, Aulia Adila, Samuel Cahyawijaya, Alham Fikri Aji:

NusaDialogue: Dialogue Summarization and Generation for Underrepresented and Extremely Low-Resource Languages. 82-100
Second Workshop on Scaling Up Multilingual & Multi-Cultural Evaluation
- Proceedings of the Second Workshop on Scaling Up Multilingual & Multi-Cultural Evaluation.

- Rodolfo Joel Zevallos Salazar, Annika Marie Schoene, John E. Ortega:

The First Multilingual Model For The Detection of Suicide Texts. 1-11 - Geyu Lin, Bin Wang, Zhengyuan Liu, Nancy F. Chen:

CrossIn: An Efficient Instruction Tuning Approach for Cross-Lingual Knowledge Alignment. 12-23 - Dipankar Srirag, Nihar Ranjan Sahoo, Aditya Joshi:

Evaluating Dialect Robustness of Language Models via Conversation Understanding. 24-38 - Tsegaye Misikir Tashu, Eduard-Raul Kontos, Matthia Sabatelli, Matias Valdenegro-Toro:

Cross-Lingual Document Recommendations with Transformer-Based Representations: Evaluating Multilingual Models and Mapping Techniques. 39-47 - Yuta Nozaki, Dai Nakashima, Ryo Sato, Naoki Asaba, Shintaro Kawamura:

VRCP: Vocabulary Replacement Continued Pretraining for Efficient Multilingual Language Models. 48-59
12th Workshop on NLP for Similar Languages, Varieties and Dialects
- Yves Scherrer, Tommi Jauhiainen, Nikola Ljubesic, Preslav Nakov, Jörg Tiedemann, Marcos Zampieri:

Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects. - Yves Scherrer, Rob van der Goot, Petter Mæhlum:

Findings of the VarDial Evaluation Campaign 2025: The NorSID Shared Task on Norwegian Slot, Intent and Dialect Identification. 1-8 - Diego Alves:

Information Theory and Linguistic Variation: A Study of Brazilian and European Portuguese. 9-19 - Yee Man Ng, Ilia Markov:

Leveraging Open-Source Large Language Models for Native Language Identification. 20-28 - Melissa Torgbi, Andrew Clayman, Jordan J. Speight, Harish Tayyar Madabushi:

Adapting Whisper for Regional Dialects: Enhancing Public Services for Vulnerable Populations in the United Kingdom. 29-38 - Md Mahfuz Ibn Alam, Antonios Anastasopoulos:

Large Language Models as a Normalizer for Transliteration and Dialectal Translation. 39-67 - Fahim Faisal, Antonios Anastasopoulos:

Testing the Boundaries of LLMs: Dialectal and Language-Variety Tasks. 68-92 - Alistair Plum, Tharindu Ranasinghe, Christoph Purschke:

Text Generation Models for Luxembourgish with Limited Data: A Balanced Multilingual Strategy. 93-104 - Piroska Lendvai, Uwe D. Reichel, Anna Jouravel, Achim Rabus, Elena Renje:

Retrieval of Parallelizable Texts Across Church Slavic Variants. 105-114 - Anne-Marie Lutgen, Alistair Plum, Christoph Purschke, Barbara Plank:

Neural Text Normalization for Luxembourgish Using Real-Life Variation Data. 115-127 - Xaver Maria Krückl, Verena Blaschke, Barbara Plank:

Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study. 128-146 - Steven Coats, Chloé Diskin-Holdaway, Debbie Loakes:

Regional Distribution of the /el/-/æl/ Merger in Australian English. 147-156 - Salam Khalifa, Abdelrahim Qaddoumi, Jordan Kodner, Owen Rambow:

Learning Cross-Dialectal Morphophonology with Syllable Structure Constraints. 157-167 - Javier A. Lopetegui, Arij Riabi, Djamé Seddah:

Common Ground, Diverse Roots: The Difficulty of Classifying Common Examples in Spanish Varieties. 168-181 - Verena Blaschke, Felicia Körner, Barbara Plank:

Add Noise, Tasks, or Layers? MaiNLP at the VarDial 2025 Shared Task on Norwegian Dialectal Slot and Intent Detection. 182-199 - Marthe Løken Midtgaard, Petter Mæhlum, Yves Scherrer:

LTG at VarDial 2025 NorSID: More and Better Training Data for Slot and Intent Detection. 200-208 - Jaione Bengoetxea, Mikel Zubillaga, Ekhi Azurmendi, Maite Heredia, Julen Etxaniz, Markel Ferro, Jeremy Barnes:

HiTZ at VarDial 2025 NorSID: Overcoming Data Scarcity with Language Transfer and Automatic Data Annotation. 209-219 - Michael Ibrahim:

CUFE@VarDial 2025 NorSID: Multilingual BERT for Norwegian Dialect Identification and Intent Detection. 220-223
4th Workshop on Arabic Corpus Linguistics (WACL-4)
- Saad Ezzini, Hamza Alami, Ismail Berrada, Abdessamad Benlahbib, Abdelkader El Mahdaouy, Salima Lamsiyah, Hatim Derrouz, Amal Haddad Haddad, Mustafa Jarrar, Mo El-Haj, Ruslan Mitkov, Paul Rayson:

Proceedings of the 4th Workshop on Arabic Corpus Linguistics (WACL-4). - Salima Lamsiyah, Kamyar Zeinalipour, Samir El-amrany, Matthias R. Brust, Marco Maggini, Pascal Bouvry, Christoph Schommer:

ArabicSense: A Benchmark for Evaluating Commonsense Reasoning in Arabic with Large Language Models. 1-11 - Mohamed Motasim Hamed, Muhammad Hreden, Khalil Hennara, Zeina Aldallal, Sara Chrouf, Safwan AlModhayan:

Lahjawi: Arabic Cross-Dialect Translator. 12-24 - Julien Bezançon, Rimane Karam, Gaël Lejeune:

Lost in Variation: An Unsupervised Methodology for Mining Lexico-syntactic Patterns in Middle Arabic Texts. 25-37 - Salwa Saad Alahmari:

SADSLyC: A Corpus for Saudi Arabian Multi-dialect Identification through Song Lyrics. 38-43 - Shehenaz Hossain, Fouad Shammary, Bahaulddin Shammary, Haithem Afli:

Enhancing Dialectal Arabic Intent Detection through Cross-Dialect Multilingual Input Augmentation. 44-49 - Abdullah Salem Khered, Youcef Benkhedda, Riza Batista-Navarro:

Dial2MSA-Verified: A Multi-Dialect Arabic Social Media Dataset for Neural Machine Translation to Modern Standard Arabic. 50-62 - Yousra El-Ghawi:

Web-Based Corpus Compilation of the Emirati Arabic Dialect. 63-67 - Ali Al-Laith, Rachida Kebdani:

Evaluating Calibration of Arabic Pre-trained Language Models on Dialectal Text. 68-76 - Azzedine Aftiss, Salima Lamsiyah, Christoph Schommer, Said Ouatik El Alaoui:

Empirical Evaluation of Pre-trained Language Models for Summarizing Moroccan Darija News Articles. 77-85 - Salmane Chafik, Saad Ezzini, Ismail Berrada:

Dialect2SQL: A Novel Text-to-SQL Dataset for Arabic Dialects with a Focus on Moroccan Darija. 86-92 - Alaa Bouomar, Noorhan Abbas:

AraSim: Optimizing Arabic Dialect Translation in Children's Literature with LLMs and Similarity Scores. 93-102 - Ahmed Haj Ahmed, Rui-Jie Yew, Xerxes Minocher, Suresh Venkatasubramanian:

Navigating Dialectal Bias and Ethical Complexities in Levantine Arabic Hate Speech Detection. 103-108
First Workshop on Writing Aids at the Crossroads of AI, Cognitive Science and NLP (WRAICOGS 2025)
- Michael Zock, Kentaro Inui, Zheng Yuan:

Proceedings of the First Workshop on Writing Aids at the Crossroads of AI, Cognitive Science and NLP (WRAICOGS 2025). - Ioana Buhnila, Georgeta Cislaru, Amalia Todirascu:

Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts. 1-15 - Ken Shi, Gerald Penn:

Semantic Masking in a Needle-in-a-haystack Test for Evaluating Large Language Model Long-Text Capabilities. 16-23 - Nouran Khallaf, Carlo Eugeni, Serge Sharoff:

Reading Between the Lines: A dataset and a study on why some texts are tougher than others. 24-34 - Léane Jourdan, Florian Boudin, Richard Dufour, Nicolas Hernandez, Akiko Aizawa:

ParaRev : Building a dataset for Scientific Paragraph Revision annotated with revision instruction. 35-44 - Chiara Maggi, Andrea Vitaletti:

Towards an operative definition of creative writing: a preliminary assessment of creativeness in AI and human texts. 45-52 - Anna Sato, Ichiro Kobayashi:

Decoding Semantic Representations in the Brain Under Language Stimuli with Large Language Models. 53-67
