Language Resources and Evaluation, Volume 57
Volume 57, Number 1, March 2023
- Nancy Ide, Nicoletta Calzolari:
  Editorial: LRE updates. 1-3
- Jan Wira Gotama Putra, Kana Matsumura, Simone Teufel, Takenobu Tokunaga:
  TIARA 2.0: an interactive tool for annotating discourse structure and text improvement. 5-29
- Yoshinao Sato, Kouki Miyazawa:
  Statistical quality estimation for partially subjective classification tasks through crowdsourcing. 31-56
- James F. Allen, Hannah An, Ritwik Bose, William de Beaumont, Choh Man Teng:
  COLLIE: a broad-coverage ontology and lexicon of verbs in English. 57-86
- James F. Allen, Hannah An, Ritwik Bose, William de Beaumont, Choh Man Teng:
  Correction: COLLIE: a broad-coverage ontology and lexicon of verbs in English. 87
- Michael Fell, Elena Cabrio, Maroua Tikat, Franck Michel, Michel Buffa, Fabien Gandon:
  The WASABI song corpus and knowledge graph for music lyrics analysis. 89-119
- Nico Blokker, André Blessing, Erenay Dayanik, Jonas Kuhn, Sebastian Padó, Gabriella Lapesa:
  Between welcome culture and border fence. 121-153
- Endang Wahyu Pamungkas, Valerio Basile, Viviana Patti:
  Investigating the role of swear words in abusive language detection tasks. 155-188
- Camiel Colruyt, Orphée De Clercq, Thierry Desot, Véronique Hoste:
  EventDNA: a dataset for Dutch news event extraction as a basis for news diversification. 189-221
- Kezban Basibüyük, Deniz Zeyrek:
  Usage disambiguation of Turkish discourse connectives. 223-256
- Zahra Rahimi, Mohammad Mehdi Homayounpour:
  The impact of preprocessing on word embedding quality: a comparative study. 257-291
- Johana Bodard, Céline Jost, Gérard Uzan, Isis Truck:
  Spelling errors made by people with dyslexia. 293-322
- Lu Li, Xiaotian Wang:
  Nonverbal communication with emojis in social media: dissociating hedonic intensity from frequency. 323-342
- Lucas Gautheron, Nicolas Rochat, Alejandrina Cristià:
  Managing, storing, and sharing long-form recordings and their annotations. 343-375
- Lenin Laitonjam, Sanasam Ranbir Singh:
  Manipuri-English comparable corpus for cross-lingual studies. 377-413
- Tomaz Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubesic, Kiril Simov, Andrej Pancur, Michal Rudolf, Matyás Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Çagri Çöltekin, Jesse de Does, Katrien Depuydt, Tommaso Agnoloni, Giulia Venturi, María Calzada Pérez, Luciana D. de Macedo, Costanza Navarretta, Giancarlo Luxardo, Matthew Coole, Paul Rayson, Vaidas Morkevicius, Tomas Krilavicius, Roberts Dargis, Orsolya Ring, Ruben van Heusden, Maarten Marx, Darja Fiser:
  The ParlaMint corpora of parliamentary proceedings. 415-448
- Çagri Çöltekin, A. Seza Dogruöz, Özlem Çetinoglu:
  Resources for Turkish natural language processing: A critical survey. 449-488
- Çagri Çöltekin, A. Seza Dogruöz, Özlem Çetinoglu:
  Correction to: Resources for Turkish natural language processing: A critical survey. 489
- Thipe I. Modipa, Marelie H. Davel:
  Correction to: Two Sepedi-English code-switched speech corpora. 491
Volume 57, Number 2, June 2023
- Manuela Sanguinetti, Cristina Bosco, Lauren Cassidy, Özlem Çetinoglu, Alessandra Teresa Cignarella, Teresa Lynn, Ines Rehbein, Josef Ruppenhofer, Djamé Seddah, Amir Zeldes:
  Treebanking user-generated content: a UD based overview of guidelines, corpora and unified recommendations. 493-544
- Özlem Çetinoglu, Çagri Çöltekin:
  Two languages, one treebank: building a Turkish-German code-switching treebank and its challenges. 545-579
- Krister Lindén, Tommi Jauhiainen, Sam Hardwick:
  FinnSentiment: a Finnish social media corpus for sentiment polarity annotation. 581-609
- Phuong Le-Hong, Erik Cambria:
  A semantics-aware approach for multilingual natural language inference. 611-639
- Izidor Mlakar, Darinka Verdonik, Simona Majhenic, Matej Rojc:
  Understanding conversational interaction in multiparty conversations: the EVA Corpus. 641-671
- Rafaël Poiret, Tak-Sum Wong, John Lee, Kim Gerdes, Herman Leung:
  Universal Dependencies for Mandarin Chinese. 673-710
- Rafaël Poiret, Tak-Sum Wong, John Lee, Kim Gerdes, Herman Leung:
  Correction to: Universal Dependencies for Mandarin Chinese. 711
- Muhammad Umair Arshad, Raza Ali, Mirza Omer Beg, Waseem Shahzad:
  UHated: hate speech detection in Urdu language using transfer learning. 713-732
- Manish Puri, Aparna S. Varde, Gerard de Melo:
  Commonsense based text mining on urban policy. 733-763
- Sourav Das, Soumitra Ghosh, Anup Kumar Kolya, Asif Ekbal:
  Unparalleled sarcasm: a framework of parallel deep LSTMs with cross activation functions towards detection and generation of sarcastic statements. 765-802
- Jiarui Li, Jian Liu, Yufeng Chen, Jinan Xu:
  Multi-domain adaptation for named entity recognition with multi-aspect relevance learning. 803-818
- Loic De Langhe, Orphée De Clercq, Véronique Hoste:
  Constructing a cross-document event coreference corpus for Dutch. 819-848
- Marco Rospocher:
  Detecting explicit lyrics: a case study in Italian music. 849-867
- Daan Schueler, Maarten Marx:
  Speech acts in the Dutch COVID-19 Press Conferences. 869-892
- Lucas Nunes Vieira, Carol O'Sullivan, Xiaochun Zhang, Minako O'Hagan:
  Machine translation in society: insights from UK users. 893-914
- Nimra Zaheer, Obaid Ullah Ahmad, Mudassir Shabbir, Agha Ali Raza:
  Speech emotion recognition for the Urdu language. 915-944
Volume 57, Number 3, September 2023
- Yubin Qian:
  Mining culture from professional discourse: a lexicon-based hybrid method. 945-961
- Martha Yifiru Tachbelie, Solomon Teferra Abate:
  Lexical modeling for the development of Amharic automatic speech recognition systems. 963-984
- Ege Kesim, Tuge Numanoglu, Öykü Zeynep Bayramoglu, Bekir Berker Turker, Nusrah Hussain, T. Metin Sezgin, Yucel Yemez, Engin Erzin:
  The eHRI database: a multimodal database of engagement in human-robot interactions. 985-1009
- Nabil Ababou, Azzeddine Mazroui, Rachid Belehbib:
  From extended chunking to dependency parsing using traditional Arabic grammar. 1011-1043
- Veronika Laippala, Samuel Rönnqvist, Miika Oinonen, Aki-Juhani Kyröläinen, Anna Salmela, Douglas Biber, Jesse Egbert, Sampo Pyysalo:
  Register identification from the unrestricted open Web using the Corpus of Online Registers of English. 1045-1079
- Sean Trott, Benjamin K. Bergen, Eva Wittenberg:
  Spontaneous, controlled acts of reference between friends and strangers. 1081-1105
- Tommaso Caselli, Johan Bos:
  Investigating interoperable event corpora: limitations of reusability of resources and portability of models. 1107-1137
- Arnaldo Cândido Júnior, Edresson Casanova, Anderson da Silva Soares, Frederico Santos de Oliveira, Lucas Oliveira, Ricardo Corso Fernandes Junior, Daniel Peixoto Pinto da Silva, Fernando Gorgulho Fayet, Bruno Baldissera Carlotto, Lucas Rafael Stefanel Gris, Sandra Maria Aluísio:
  CORAA ASR: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese. 1139-1171
- Oliver Hellwig, Sebastian Nehrdich, Sven Sellmer:
  Data-driven dependency parsing of Vedic Sanskrit. 1173-1206
- Natalie Weber, Tyler Brown, Joshua Celli, McKenzie Denham, Hailey Dykstra, Rodrigo Hernandez-Merlin, Evan Hochstein, Pinyu Hwang, Nico Kidd, Diana Kulmizev, Hannah Morrison, Matty Norris, Lena Venkatraman:
  Blackfoot Words: a database of Blackfoot lexical forms. 1207-1262
- Mingzhou Xu, Longyue Wang, Siyou Liu, Derek F. Wong, Shuming Shi, Zhaopeng Tu:
  A benchmark dataset and evaluation methodology for Chinese zero pronoun translation. 1263-1293
- Anssi Moisio, Dejan Porjazovski, Aku Rouhe, Yaroslav Getman, Anja Virkkunen, Ragheb Al-Ghezi, Mietta Lennes, Tamás Grósz, Krister Lindén, Mikko Kurimo:
  Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks. 1295-1327
- Yftah Ziser, Bonnie Webber, Shay B. Cohen:
  Rant or rave: variation over time in the language of online reviews. 1329-1359
- Rachana Gusain, Satya Ranjan Dash, Shantipriya Parida, Girish Nath Jha:
  Automatic language identification: a case study of Pahari languages. 1361-1387
- Chenhui Chu, Zhuoyuan Mao, Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohashi:
  SCTB-V2: the 2nd version of the Chinese treebank in the scientific domain. 1389-1403
- Brygida Sawicka-Stepinska:
  Design and construction of Guayaquil radio speech corpus (CHARG). 1405-1422
- Xi Huang, Hong Xu:
  Rei Miyata: controlled document authoring in a machine translation age. 1423-1430
Volume 57, Number 4, December 2023
- Angelina Gaspar, Ani Grubisic, Ines Saric-Grgic:
  Evaluation of a rule-based approach to automatic factual question generation using syntactic and semantic analysis. 1431-1461
- Dmitri Roussinov, Serge Sharoff, Nadezhda Puchnina:
  Fine-tuning language models to recognize semantic relations. 1463-1486
- Daniel García-Baena, Miguel Ángel García Cumbreras, Salud M. Jiménez-Zafra, José Antonio García-Díaz, Rafael Valencia-García:
  Hope speech detection in Spanish. 1487-1514
- Irina Bigoulaeva, Viktor Hangya, Iryna Gurevych, Alexander Fraser:
  Label modification and bootstrapping for zero-shot cross-lingual hate speech detection. 1515-1546
- Jakab Buda, Renáta Németh, Bori Simonovits, Gábor Simonovits:
  The language of discrimination: assessing attention discrimination by Hungarian local governments. 1547-1570
- Herbert Ullrich, Jan Drchal, Martin Rýpar, Hana Vincourová, Václav Moravec:
  CsFEVER and CTKFacts: acquiring Czech data for fact verification. 1571-1605
- Dolores Lemmenmeier-Batini, Josip Batinic, Anastasia Escher:
  Map Task Corpus of Heritage BCMS spoken by second-generation speakers in Switzerland. 1607-1644
- Anja Virkkunen, Aku Rouhe, Nhan Phan, Mikko Kurimo:
  Finnish parliament ASR corpus. 1645-1670
- Séamus Lankford, Haithem Afli, Andy Way:
  adaptNMT: an open-source, language-agnostic development environment for neural machine translation. 1671-1696
- Eric Sadit Tellez, Daniela Moctezuma, Sabino Miranda, Mario Graff, Guillermo Ruiz:
  Regionalized models for Spanish language variations based on Twitter. 1697-1727
- Neil Cohn, Bruno Cardoso, Bien Klomberg, Irmak Hacimusaoglu:
  The Visual Language Research Corpus (VLRC): an annotated corpus of comics from Asia, Europe, and the United States. 1729-1744
- Kabil Essefar, Hassan Ait Baha, Abdelkader El Mahdaouy, Abdellah El Mekki, Ismail Berrada:
  OMCD: Offensive Moroccan Comments Dataset. 1745-1765
- Tho Chi Luong, Phuong Le-Hong, Oanh Thi Tran:
  A rich task-oriented dialogue corpus in Vietnamese. 1767-1786
- Matthias Urban, Evelyn Michelle Aguilar Panchi, Saetbyul Lee, Evgenia Brodetsky:
  CINWA (database of terminology for cultivated plants in indigenous languages of northwestern South America): introducing a resource for research in ethnobiology, anthropology, historical linguistics, and interdisciplinary research on the neolithic transition in South America. 1787-1817
- Hongzheng Li, Ruojin Wang:
  Book Review: the Routledge Handbook of Translation and Ethics. 1819-1822