


26th SPECOM 2024: Belgrade, Serbia - Part II
- Alexey Karpov, Vlado Delic:
Speech and Computer - 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25-28, 2024, Proceedings, Part II. Lecture Notes in Computer Science 15300, Springer 2025, ISBN 978-3-031-78013-4
Computational Paralinguistics
- Denis Dresvyanskiy, Alexey Karpov, Wolfgang Minker:
A Cross-Multi-modal Fusion Approach for Enhanced Engagement Recognition. 3-17
- Gábor Gosztolya, András Bence Lázár, Ildikó Hoffmann, Otília Bagi, Fruzsina Fanni Farkas, Janka Gajdics, László Tóth, János Kálmán:
Automatic Assessment of Signs of Alcohol Dependency Syndrome from Spontaneous Speech. 18-29
- Aya Abdalla, Nada Sharaf, Caroline Sabty:
An Enhanced Compact Convolution Transformer for Age, Gender and Emotion Detection in Egyptian Arabic Speech. 30-42
- Elizaveta Vologina, Anastasiia Matveeva, Olesia Makhnytkina, Yuri Matveev, Nursaule Burambayeva:
RAG and Few-Shot Prompting in Emotional Text Generation. 43-53
- Ahmed Sherif, Caroline Sabty:
Sentiment Analysis for Egyptian Arabic-English Code-Switched Data Using Traditional Neural Models and Advanced Language Models. 54-69
- Uliana E. Kochetkova, Pavel A. Skrelin, Vera Evdokimova, Nikolay Borisov, Pavel Scherbakov, Petr Fedkin, Rada German:
Automatic Detection of Irony Based on Acoustic Features and Facial Expressions. 70-82
Affective Computing
- Olga V. Frolova, Anton Matveev, Elena E. Lyakso, Tamara Kuznetsova, Inna Golubeva:
Emotion Recognition by Vocalizations of Nonhuman Primates: Human and Automatic Classification. 85-94
- Aman Goel, Abhishek Poswal:
MMHS: Multimodal Model for Hate Speech Intensity Prediction. 95-108
- Tijana Durkic, Nikola Simic, Sinisa Suzic, Dragana Bajovic, Zoran Peric, Vlado Delic:
Multimodal Emotion Recognition Using Compressed Graph Neural Networks. 109-121
- Olesia Makhnytkina, Yuri Matveev, Alexander Zubakov, Anton Matveev:
Utilizing Speaker Models and Topic Markers for Emotion Recognition in Dialogues. 122-137
- Elena E. Lyakso, Olga V. Frolova, Aleksandr Nikolaev, Severin Grechanyi, Yulia Filatova, Ruban Nersisson:
How Children Recognize Emotions from Video and Audio. 138-153
Speaker Recognition
- Jahangir Alam, Md Shahidul Alam:
On the Influence of CNN-Based Feature Learning Modules in Neural Speaker Verification Framework. 157-170
- Jacek Kudera, Miriam Coccia, Sharifeh Fadaeijouybari, Till Preidt, Akshay Ranjan, Angelika Braun:
Voice Cloning and Mismatch Conditions in Forensic Automatic Speaker Recognition. 171-184
- Shalini Tomar, Shashidhar G. Koolagudi:
Transformation of Emotional Speech to Anger Speech to Reduce Mismatches in Testing and Enrollment Speech for Speaker Recognition System. 185-200
- Parth Sanjay Khadse, Sabyasachi Chandra, Puja Bharati, Debolina Pramanik, G. Satya Prasad, Aniket Aitawade, Shyamal Kumar Das Mandal:
Investigating Data Requirements for Hindi Speaker Recognition: A Comparative Study with English. 201-209
- Rodmonga Potapova, Vsevolod Potapov, Irina Kuryanova:
Practical Evaluation and Validation of Methods for Automatic Speaker Identification (as Applied to Various Languages). 210-223
Digital Speech Processing
- Branislav Gerazov, Paul Konstantin Krug, Daniel R. van Niekerk, Anqi Xu, Peter Birkholz, Yi Xu:
In Pursuit for the Best Error Metric for Optimisation of Articulatory Vowel Synthesis. 227-237
- Lukas Förner, Maximilian Dauner:
Exploring MetaConformer for Speech Enhancement. 238-249
- YingWei Tan:
Integration of Short-Term and Long-Term Harmonic Peaks in a Two-Level Discriminative Weight Training Framework for Voice Activity Detection. 250-263
- Anandakumar Singaravelan, Jia-Lien Hsu:
Separating Party Conversation by Applying Contrastive Learning Methodology. 264-276
- Himadri Mukherjee, Matteo Marciano, Ankita Dhar, Kaushik Roy:
DuFCALF: Instilling Sentience in Computerized Song Analysis. 277-292
Natural Language Processing
- Manar Ouled Ahmed, Zuheng Ming, Alice Othmani:
Harnessing Knowledge Distillation for Enhanced Text-to-Text Translation in Low-Resource Languages. 295-307
- Yasser Saeid, Thomas Kopinski:
Bias Unveiled: Enhancing Fairness in German Word Embeddings with Large Language Models. 308-325
- Prateek Verma:
Conformer LLM - Convolution Augmented Large Language Models. 326-333
- Valery Solovyev, Anna Ivleva:
How to Detect Imbalances in the Google Books Ngram Corpus? 334-348
- Vladimir V. Bochkarev, Andrey V. Savinkov, Anna V. Shevlyakova:
Predicting the Valence Rating of Russian Words Using Various Pre-trained Word Embeddings. 349-361
- Radek Marík, Renata Landgráfová, Jirí Liska:
Ancient Egyptian Hieroglyphic Texts Structure Identification. 362-377















