25th Interspeech 2024: Kos, Greece

Refine list

showing all ?? records

Keynote 1 ISCA Medallist

L2 Speech, Bilingualism and Code-Switching

Speaker Diarization 1

Speech and Audio Analysis and Representations

Acoustic Event Detection and Classification 2

Detection and Classification of Bioacoustic Signals

Acoustic Echo Cancellation

Speech Synthesis: Voice Conversion 1

Neural Network Architectures for ASR 2

Decoding Algorithms

Pronunciation Assessment

Spoken Language Processing

Spoken Machine Translation 2

Biosignal-enabled Spoken Communication

Individual and Social Factors in Phonetics

Paralinguistics

Speaker Recognition: Adversarial and Spoofing Attacks

Audio Event Detection and Classification 1

Source Separation 2

Noise Reduction, Dereverberation, and Echo Cancellation

Computationally-Efficient Speech Enhancement

Zero-shot TTS

Noise Robustness, Far-Field, and Multi-Talker ASR

Contextual Biasing and Adaptation

Spoken Language Understanding

Spoken Machine Translation 1

Hearing Disorders

Speech Disorders 2

TAUKADIAL Challenge: Speech-Based Cognitive Assessment in Chinese and English (Special Session)

Show and Tell 1

Keynote 2

Phonetics and Phonology of Second Language Acquisition

Corpora-based Approaches in Automatic Emotion Recognition

Analysis of Speakers States and Traits

Spoofing and Deepfake Detection

Audio Captioning, Tagging, and Audio-Text Retrieval

Generative Speech Enhancement

Speech Synthesis: Evaluation

Multilingual ASR

General Topics in ASR

Spoken Language Understanding

Speech and Multimodal Resources

Pathological Speech Analysis 1

Speech and Language in Health: from Remote Monitoring to Medical Conversations - 1 (Special Session)

Speech and Brain

Innovative Methods in Phonetics and Phonology

Voice, Tones and F0

Emotion Recognition: Resources and Benchmarks

Speaker and Language Identification and Diarization

Audio-Text Retrieval

Speech Enhancement

Speech Coding

Speech Synthesis: Expressivity and Emotion

Speech Synthesis: Tools and Data

Speech Synthesis: Singing Voice Synthesis

LLM in ASR

Vision and Speech

Spoken Document Summarization

Speech and Language in Health: from Remote Monitoring to Medical Conversations - 2 (Special Sessions)

Show and Tell 2

Prosody

Foundational Models for Deepfake and Spoofed Speech Detection

Speaker Recognition 1

Source Separation 1

Audio-Visual and Generative Speech Enhancement

Speech Privacy and Bandwidth Expansion

Speech Synthesis: Prosody

Accented Speech, Prosodic Features, Dialect, Emotion, Sound Classification

Neural Network Adaptation

ASR and LLMs

Pathological Speech Analysis 3

Speech Disorders 3

Speech Recognition with Large Pretrained Speech Models for Under-represented Languages (Special Session)

Speech Processing Using Discrete Speech Units (Special Session)

Keynote 3

Databases and Progress in Methodology

Articulation, Convergence and Perception

Speech Emotion Recognition

Self-Supervised Models in Speaker Recognition

Speech Quality Assessment

Privacy and Security in Speech Communication 1

Speech Synthesis: Voice Conversion 2

Speech Synthesis: Text Processing

Training Methods, Self-Supervised Learning, Adaptation

Novel Architectures for ASR

Multimodality and Foundation Models

Spoken Dialogue Systems and Conversational Analysis 1

Speech Technology