26th Interspeech 2025: Rotterdam, The Netherlands

Refine list

showing all ?? records

Keynote 1 - Roger Moore: From Talking and Listening Devices to Intelligent Communicative Machines

Spoken Machine Translation 1

Real-time Speech Enhancement

Multilinguality, Cross-linguistic Studies, L2 Speech

Speech Emotion Recognition 1

Multimodal Resources

Interpretability in Audio and Speech Technology

Summarization

Show and Tell 1: ASR / Tools

Models of Speech Production

Speech and Grammar/Articulatory Analyses

Speaking Styles, Register and Conversational Speech

Emotional Distress in Speech

Prosody in Speech Synthesis

Depression Detection and Assessment 1

Speech Analysis, Detection and Classification 1

Speech-based Cognitive Assessment 1

Large Language Models in Speech Recognition

Speech Coding and Echo Cancellation

Decoding Algorithms

Queer and Trans Speech Science and Technology

Tone

Cross-Lingual and Multilingual Processing

Echo Cancellation, Feedback Control, and Near-end Enhancement

Pathological Speech Analysis 1

Hearing Disorders

Interspeech 2025 URGENT Challenge

Spoken Machine Translation 2

Spatial Audio and Acoustics 1

Articulatory and Vocal Tract Modelling

Acoustic Assessment of Respiratory Health

Advances in Modelling and Imaging

Conversation, Communication and Interaction 1

Robust Speaker Verification

Multilingual ASR

Multi-channel Speech Enhancement

Self-supervised Learning

Singing Voice and Audio Synthesis

Acoustic and Articulatory Cues in Speech Perception

Audio Event Detection and Classification

Inclusivity

Voice Conversion 1

Speech-based Cognitive Assessment 2

Source Separation 1

Language and Accent Identification and Speaker Privacy

Source Tracing: The Origins of Synthetic or Manipulated Speech

Speaker Diarization 1

Multilingual Speech Synthesis and Special Applications 1

Characterization and Multimodal Approaches for Speaker Recognition

Acoustic Analysis and Bioacoustics

Keynote 2 - Alexander Waibel: From Speech Science to Language Transparence

Spoken Dialogue Systems 1

Speech Assessment

Audio-Visual ASR and Multimodal System

Speech and Voice Disorders 1

Multimodal Information Based Speech Processing (MISP) 2025 Challenge

Speaker Extraction 1

Low Resource Speech Recognition

Computational Resource Constrained ASR

Speech and Language Technology for Health Applications

Responsible Speech Foundation Models + SUPERB Challenge

Dysarthric Speech Assessment 1

Show and Tell 2: Speech Synthesis

Databases and Progress in Methodology

Novel Architectures for ASR

Deepfake Detection

Tools for Speech Analysis

Text Processing and Evaluation for Speech Synthesis 1

Segmental and Tonal Units

Speech Quality Assessment

Speech Enhancement

Language Learning and Assessment

Speech Synthesis Paradigms and Methods 1

Spatial Audio and Acoustics 2

Text Processing and Evaluation for Speech Synthesis 2

General Topics in ASR

Acoustic Event Detection and Classification

Keyword Spotting and Retrieval

Multimodal Systems

Dysarthric Speech Assessment 2

Dialect Identification in Different Languages

Connecting Speech Science and Speech Technology for Children's Speech

Brain and Cognition

Regional, Social and Diachronic Variation

Speaker Extraction 2

Multimodal Emotion Recognition

Conversation, Communication and Interaction 2