


default search action
24th NoDaLiDa 2023: Tórshavn, Faroe Islands
- Tanel Alumäe, Mark Fishel:

Proceedings of the 24th Nordic Conference on Computational Linguistics, NoDaLiDa 2023, Tórshavn, Faroe Islands, May 22-24, 2023. University of Tartu Library 2023, ISBN 978-99-1621-999-7 - Frontmatter.

- Ghazaal Sheikhi, Samia Touileb, Sohail Ahmed Khan:

Automated Claim Detection for Fact-checking: A Case Study using Norwegian Pre-trained Language Models. 1-9 - Cedric Lothritz, Bertrand Lebichot, Kevin Allix, Saad Ezzini, Tegawendé F. Bissyandé, Jacques Klein, Andrey Boytsov, Clément Lefebvre, Anne Goujon:

Evaluating the Impact of Text De-Identification on Downstream NLP Tasks. 10-16 - Thór Sverrisson, Hafsteinn Einarsson:

Abstractive Text Summarization for Icelandic. 17-31 - Carlos Daniel Hernandez Mena, Annika Simonsen, Jón Guðnason:

ASR Language Resources for Faroese. 32-41 - Yuri Bizzoni, Pascale Moreira, Nicole Dwenger, Ida Marie Schytt Lassen, Mads Rosendahl Thomsen, Kristoffer L. Nielbo:

Good Reads and Easy Novels: Readability and Literary Quality in a Corpus of US-published Fiction. 42-51 - Maciej Janicki, Antti Kanner, Eetu Mäkelä:

Detection and attribution of quotes in Finnish news media: BERT vs. rule-based approach. 52-59 - Marina Björnsdóttir, Nora Hollenstein, Maria Barrett:

Dyslexia Prediction from Natural Reading of Danish Texts. 60-70 - Örvar Kárason, Hrafn Loftsson:

Is Part-of-Speech Tagging a Solved Problem for Icelandic? 71-79 - Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot, Barbara Plank:

Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction. 80-85 - Tiina Lindh-Knuutila, Hrafn Loftsson, Pedro Alonso Doval, Sebastian Andersson, Bjarni Barkarson, Héctor Cerezo-Costas, Jón Guðnason, Jökull Gylfason, Jarmo Hemminki, Heiki-Jaan Kaalep:

Microservices at Your Service: Bridging the Gap between NLP Research and Industry. 86-91 - Xiulin Yang, Jingyan Chen, Arjan van Eerden, Ahnaf Mozib Samin, Arianna Bisazza:

Slaapte or Sliep? Extending Neural-Network Simulations of English Past Tense Learning to Dutch and German. 92-102 - Denitsa Saynova, Bastiaan Bruinsma, Moa Johansson, Richard Johansson:

Class Explanations: the Role of Domain-Specific Content and Stop Words. 103-112 - Daniel Holmer, Evelina Rennes:

Constructing Pseudo-parallel Swedish Sentence Corpora for Automatic Text Simplification. 113-123 - Daniel Holmer, Lars Ahrenberg, Julius Monsen, Arne Jönsson, Mikael Apel, Marianna Grimaldi:

Who said what? Speaker Identification from Anonymous Minutes of Meetings. 124-134 - Luise Dürlich, Evangelia Gogoulou, Joakim Nivre:

On the Concept of Resource-Efficiency in NLP. 135-145 - Jeremy Barnes, Samia Touileb, Petter Mæhlum, Pierre Lison:

Identifying Token-Level Dialectal Features in Social Media. 146-158 - Sardana Ivanova, Fredrik Aas Andreassen, Matias Jentoft, Sondre Wold, Lilja Øvrelid:

NorQuAD: Norwegian Question Answering Dataset. 159-168 - Carl Börstell

:
Extracting Sign Language Articulation from Videos with MediaPipe. 169-178 - Kadri Muischnek, Kaili Müürisep:

Named Entity layer in Estonian UD treebanks. 179-184 - Dan Saattrup Nielsen

:
ScandEval: A Benchmark for Scandinavian Natural Language Processing. 185-201 - Lucas Georges Gabriel Charpentier, Sondre Wold, David Samuel, Egil Rønningstad:

BRENT: Bidirectional Retrieval Enhanced Norwegian Transformer. 202-214 - Anastassia Shaitarova, Anne Göhring, Martin Volk:

Machine vs. Human: Exploring Syntax and Lexicon in German Translations, with a Spotlight on Anglicisms. 215-227 - Bernt Ivar Utstøl Nødland:

Training and Evaluating Norwegian Sentence Embedding Models. 228-237 - Michele Boggia, Stig-Arne Grönroos, Niki Andreas Lopi, Timothee Mickus

, Alessandro Raganato, Jörg Tiedemann, Raúl Vázquez:
Dozens of Translation Directions or Millions of Shared Parameters? Comparing Two Types of Multilinguality in Modular Machine Translation. 238-247 - Sara Kolding, Katrine Nymann, Ida Hansen, Kenneth C. Enevoldsen, Ross Deans Kristensen-McLachlan:

DanSumT5: Automatic Abstractive Summarization for Danish. 248-264 - Nhan Phan, Tamás Grósz, Mikko Kurimo:

CaptainA - A mobile app for practising Finnish pronunciation. 265-270 - Kia Kirstein Hansen, Maria Barrett, Max Müller-Eberstein, Cathrine Damgaard, Trine Eriksen, Rob van der Goot:

DanTok: Domain Beats Language for Danish Social Media POS Tagging. 271-279 - Aleksei Dorkin, Kairit Sirts:

Comparison of Current Approaches to Lemmatization: A Case Study in Estonian. 280-285 - Atli Jasonarson, Steinþór Steingrímsson, Einar Freyr Sigurðsson, Árni Magnússon, Finnur Ingimundarson:

Generating Errors: OCR Post-Processing for Icelandic. 286-291 - Annika Willoch Olstad, Anthi Papadopoulou, Pierre Lison:

Generation of Replacement Options in Text Sanitization. 292-300 - Jannik S. Pedersen, Martin S. Laursen, Pernille J. Vinholt, Thiusius Rajeeth Savarimuthu:

MeDa-BERT: A medical Danish pretrained transformer model. 301-307 - Sandra Saxov Lamhauge, Iben Nyholm Debess, Carlos Daniel Hernandez Mena, Annika Simonsen

, Jón Guðnason:
Standardising Pronunciation for a Grapheme-to-Phoneme Converter for Faroese. 308-317 - Thomas Vakili, Hercules Dalianis:

Using Membership Inference Attacks to Evaluate Privacy-Preserving Language Modeling Fails for Pseudonymizing Data. 318-323 - Ali Allaith, Kirstine Nielsen Degn, Alexander Conroy, Bolette S. Pedersen, Jens Bjerring-Hansen, Daniel Hershcovich:

Sentiment Classification of Historical Danish and Norwegian Literary Texts. 324-334 - Sara Stymne, Carin Östman, David Håkansson:

Parser Evaluation for Analyzing Swedish 19th-20th Century Literature. 335-346 - Mehrdad Farahani, Richard Johansson:

An Empirical Study of Multitask Learning to Improve Open Domain Dialogue Systems. 347-357 - Aarne Talman, Hande Çelikkanat, Sami Virpioja, Markus Heinonen, Jörg Tiedemann:

Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging. 358-365 - Finn Nielsen:

Alignment of Wikidata lexemes and Det Centrale Ordregister. 366-370 - Ekaterina Artemova, Barbara Plank:

Low-resource Bilingual Dialect Lexicon Induction with Large Language Models. 371-385 - Anders Mølmen Høst, Pierre Lison, Leon Moonen:

Constructing a Knowledge Graph from Textual Descriptions of Software Vulnerabilities in the National Vulnerability Database. 386-391 - Verena Blaschke

, Hinrich Schütze, Barbara Plank:
A Survey of Corpora for Germanic Low-Resource Languages and Dialects. 392-414 - Aleksandrs Berdicevskis, Viktor Erbro:

You say tomato, I say the same: A large-scale study of linguistic accommodation in online communities. 415-424 - Dag Trygve Truslew Haug, Ahmet Yildirim, Kristin Hagen, Anders Nøklestad:

Integrating rules and neural nets for morphological tagging of Norwegian - Results and challenges. 425-435 - Laurent Prévot, Julie Hunter, Philippe Muller:

Comparing Methods for Segmenting Elementary Discourse Units in a French Conversational Corpus. 436-446 - José carlos Rosales Núñez, Djamé Seddah, Guillaume Wisniewski:

Multi-way Variational NMT for UGC: Improving Robustness in Zero-shot Scenarios via Mixture Density Networks. 447-459 - Rafal Cerniavski, Sara Stymne:

Multilingual Automatic Speech Recognition for Scandinavian Languages. 460-466 - Phoebe Parsons, Knut Kvale, Torbjørn Svendsen, Giampiero Salvi:

A character-based analysis of impacts of dialects on end-to-end Norwegian ASR. 467-476 - Dmytro Kalpakchi, Johan Boye:

Quasi: a synthetic Question-Answering dataset in Swedish using GPT-3 and zero-shot learning. 477-491 - Tanel Alumäe, Joonas Kalda, Külliki Bode, Martin Kaitsa:

Automatic Closed Captioning for Estonian Live Broadcasts. 492-499 - Steinunn Rut Friðriksdóttir, Hafsteinn Einarsson:

The Effect of Data Encoding on Relation Triplet Identification. 500-507 - Per Erik Solberg, Pablo Ortiz, Phoebe Parsons, Torbjørn Svendsen, Giampiero Salvi:

Improving Generalization of Norwegian ASR with Limited Linguistic Resources. 508-517 - Wei Zhou

, Nina Tahmasebi
, Haim Dubossarsky:
The Finer They Get: Combining Fine-Tuned Models For Better Semantic Change Detection. 518-528 - Ilmari Kylliäinen, Roman Yangarber:

Question Answering and Question Generation for Finnish. 529-540 - David Kletz, Marie Candito, Pascal Amsili:

Probing structural constraints of negation in Pretrained Language Models. 541-554 - Javier de la Rosa, Rolv-Arild Braaten, Per Egil Kummervold, Freddy Wetjen:

Boosting Norwegian Automatic Speech Recognition. 555-564 - Niklas Zechner:

Length Dependence of Vocabulary Richness. 565-573 - Arianna Masciolini:

A query engine for L1-L2 parallel dependency treebanks. 574-587 - Steinþór Steingrímsson, Hrafn Loftsson, Andy Way:

Filtering Matters: Experiments in Filtering Training Sets for Machine Translation. 588-600 - Luke O'Brien, Finnur Ingimundarson, Jón Guðnason, Steinþór Steingrímsson:

Gamli - Icelandic Oral History Corpus: Design, Collection and Evaluation. 601-609 - Matias Jentoft, David Samuel:

NoCoLA: The Norwegian Corpus of Linguistic Acceptability. 610-617 - David Samuel, Andrey Kutuzov, Samia Touileb, Erik Velldal, Lilja Øvrelid, Egil Rønningstad, Elina Sigdel, Anna Palatkina:

NorBench - A Benchmark for Norwegian Language Models. 618-633 - Oskar Holmström, Ehsan Doostmohammadi:

Making Instruction Finetuning Accessible to Non-English Languages: A Case Study on Swedish Models. 634-642 - Flammie A. Pirinen, Sjur N. Moshagen, Katri Hiovain-Asikainen:

GiellaLT - a stable infrastructure for Nordic minority languages and beyond. 643-649 - Kristjan Runarsson, Kristín Bjarnadóttir:

Adapting an Icelandic morphological database to Faroese. 650-654 - Martin S. Laursen, Jannik S. Pedersen, Rasmus Søgaard Hansen

, Thiusius Rajeeth Savarimuthu, Pernille J. Vinholt:
Danish Clinical Named Entity Recognition and Relation Extraction. 655-666 - Dimitrios Kokkinakis, Ricardo Muñoz Sánchez

, Mia-Marie Hammarlin:
Scaling-up the Resources for a Freely Available Swedish VADER (svVADER). 667-672 - Yiyi Chen, Russa Biswas, Johannes Bjerva:

Colex2Lang: Language Embeddings from Semantic Typology. 673-684 - Anni Eskelinen, Laura Silvala, Filip Ginter, Sampo Pyysalo, Veronika Laippala:

Toxicity Detection in Finnish Using Machine Translation. 685-697 - Thórunn Arnardóttir, Hinrik Hafsteinsson, Atli Jasonarson, Anton Ingaon, Steinþór Steingrímsson:

Evaluating a Universal Dependencies Conversion Pipeline for Icelandic. 698-704 - Agnes Luhtaru

, Rauno Jaaska, Karl Kruusamäe, Mark Fishel:
Automatic Transcription for Estonian Children's Speech. 705-709 - Hele-Andra Kuulmets, Mark Fishel:

Translated Benchmarks Can Be Misleading: the Case of Estonian Question Answering. 710-716 - Peter Vajdecka, Elena Callegari, Desara Xhura, Atli Ásmundsson:

Predicting the presence of inline citations in academic text using binary classification. 717-722 - Liisa Rätsep, Mark Fishel:

Neural Text-to-Speech Synthesis for Võro. 723-727 - Vésteinn Snæbjarnarson, Annika Simonsen, Goran Glavas, Ivan Vulic:

Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese. 728-737 - Anssi Moisio, Mathias Creutz, Mikko Kurimo:

Evaluating Morphological Generalisation in Machine Translation by Distribution-Based Compositionality Assessment. 738-751 - Kairit Sirts:

Estonian Named Entity Recognition: New Datasets and Models. 752-761 - Lisa Yankovskaya, Maali Tars, Andre Tättar, Mark Fishel:

Machine Translation for Low-resource Finno-Ugric Languages. 762-771 - Elizaveta Korotkova, Mark Fishel:

Distilling Estonian Text Domains for Production-Oriented Machine Translation. 772-781 - Kais Allkivi-Metsoja

, Jaagup Kippar:
Spelling Correction for Estonian Learner Language. 782-788

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














