


20th SPECOM 2018: Leipzig, Germany
- Alexey Karpov, Oliver Jokisch, Rodmonga Potapova:

Speech and Computer - 20th International Conference, SPECOM 2018, Leipzig, Germany, September 18-22, 2018, Proceedings. Lecture Notes in Computer Science 11096, Springer 2018, ISBN 978-3-319-99578-6
- Oleg Akhtiamov, Vasily Palkov:

Gaze, Prosody and Semantics: Relevance of Various Multimodal Signals to Addressee Detection in Human-Human-Computer Conversations. 1-10
- Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:

A Continuous Vocoder Using Sinusoidal Model for Statistical Parametric Speech Synthesis. 11-20
- Sergei Astapov, Aleksandr Lavrentyev, Evgeniy Shuranov:

Far Field Speech Enhancement at Low SNR in Presence of Nonstationary Noise Based on Spectral Masking and MVDR Beamforming. 21-31
- Vladimir Bataev, Maxim Korenevsky, Ivan Medennikov, Alexander Zatvornitskiy:

Exploring End-to-End Techniques for Low-Resource Speech Recognition. 32-41
- Natalia Bogdanova-Beglarian, Tatiana Y. Sherstinova, Olga Blinova, Gregory Y. Martynenko, Ekaterina Baeva:

Towards a Description of Pragmatic Markers in Russian Everyday Speech. 42-48
- Christopher G. Buchanan, Matthew P. Aylett, David A. Braude:

Adding Personality to Neutral Speech Synthesis Voices. 49-57
- Martin Bulín, Lubos Smídl, Jan Svec:

Towards Network Simplification for Low-Cost Devices by Removing Synapses. 58-67
- Lukás Bures, Petr Neduchal, Miroslav Hlavác, Marek Hrúz:

Generation of Synthetic Images of Full-Text Documents. 68-75
- Felix Burkhardt, Benjamin Weiss:

Speech Synthesizing Simultaneous Emotion-Related States. 76-85
- Marco Canora, Fernando García-Granada, Emilio Sanchis, Encarna Segarra:

An Approach to Automatic Summarization of Television Programs. 86-93
- George Christodoulides:

The Prosody of Discourse Markers alors and et in French: A Corpus-Based Study on Multiple Speaking Styles. 94-102
- Adam Chýlek, Lubos Smídl, Jakub Nedved:

Choosing a Dialogue System's Modality in Order to Minimize User's Workload. 103-112
- Erik Edwards, Michael Brenndoerfer, Amanda Robinson, Najmeh Sadoughi, Greg P. Finley, Maxim Korenevsky, Nico Axtmann, Mark Miller, David Suendermann-Oeft:

A Free Synthetic Corpus for Speaker Diarization Research. 113-122
- Erik Edwards, Amanda Robinson, Najmeh Sadoughi, Greg P. Finley, Maxim Korenevsky, Michael Brenndoerfer, Nico Axtmann, Mark Miller, David Suendermann-Oeft:

Speaker Diarization: A Top-Down Approach Using Syllabic Phonology. 123-133
- Olga Egorow, Ingo Siegert, Andreas Wendemuth:

Improving Emotion Recognition Performance by Random-Forest-Based Feature Selection. 134-144
- Polina Eismont, Vladislav Metelyagin, Elena I. Riekhakaynen:

Coherence Understanding Through Cohesion Markers: The Case of Child Spoken Language. 145-154
- Dmitrii Fedotov, Heysem Kaya, Alexey Karpov:

Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup. 155-165
- Carlos Ferreira, Bruno Direito, Alexandre Sayal, Marco Simões, Inês Cadório, Paula Martins, Marisa Lousada, Daniela Figueiredo, Miguel Castelo-Branco, António J. S. Teixeira:

Functional Mapping of Inner Speech Areas: A Preliminary Study with Portuguese Speakers. 166-176
- Greg P. Finley, Erik Edwards, Wael Salloum, Amanda Robinson, Najmeh Sadoughi, Nico Axtmann, Maxim Korenevsky, Michael Brenndoerfer, Mark Miller, David Suendermann-Oeft:

Semi-Supervised Acoustic Model Retraining for Medical ASR. 177-187
- Jing Han, Maximilian Schmitt, Björn W. Schuller:

You Sound Like Your Counterpart: Interpersonal Speech Analysis. 188-197
- François Hernandez, Vincent Nguyen, Sahar Ghannay, Natalia A. Tomashenko, Yannick Estève:

TED-LIUM 3: Twice as Much Data and Corpus Repartition for Experiments on Speaker Adaptation. 198-208
- Miroslav Hlavác, Ivan Gruber, Milos Zelezný, Alexey Karpov:

LipsID Using 3D Convolutional Neural Networks. 209-214
- Rüdiger Hoffmann, Peter Birkholz, Falk Gabriel, Rainer Jäckel:

From Kratzenstein to the Soviet Vocoder: Some Results of a Historic Research Project in Speech Technology. 215-225
- Marek Hrúz, Miroslav Hlavác:

LSTM Neural Network for Speaker Change Detection in Telephone Conversations. 226-233
- Takuto Isoyama, Masashi Unoki:

Noise Suppression Method Based on Modulation Spectrum Analysis. 234-244
- Denis Ivanko, Dmitry Ryumin, Alexandr Axyonov, Milos Zelezný:

Designing Advanced Geometric Features for Automatic Russian Visual Speech Recognition. 245-254
- Markéta Juzová:

On the Comparison of Different Phrase Boundary Detection Approaches Trained on Czech TTS Speech Corpora. 255-263
- Tatiana Kachkovskaia, Mayya Nurislamova:

Word-Initial Consonant Lengthening in Stressed and Unstressed Syllables in Russian. 264-273
- Arman Kaliyev, Sergey V. Rybin, Yuri N. Matveev:

Phoneme Duration Prediction for Kazakh Language. 274-280
- Stamatis Karlos, Konstantinos Kaleris, Nikos Fazakis, Vasileios G. Kanas, Sotiris Kotsiantis:

Optimized Active Learning Strategy for Audiovisual Speaker Recognition. 281-290
- Irina S. Kipyatkova:

Improving Russian LVCSR Using Deep Neural Networks for Acoustic and Language Modeling. 291-300
- Daniil Kocharov, Vera Evdokimova, Karina Evgrafova, Mariia Morskovatykh:

Labialization of Unstressed Vowels in Russian: Phonetic and Perceptual Evidence. 301-310
- Liubov Kovriguina, Ivan Shilin, Alina Putintseva, Alexander Shipilo:

Multilevel Annotation in the Corpus for Parsing Russian Spontaneous Speech. 311-320
- Anat Lerner, Oren Miara, Sarit Malayev, Vered Silber-Varod:

The Influence of the Interlocutor's Gender on the Speaker's Role Identification. 321-330
- Tatiana Litvinova, Pavel Seredin, Olga Litvinova, Tatiana Dankova, Olga Zagorovskaya:

On the Stability of Some Idiolectal Features. 331-336
- Boris Lobanov, Vladimir Zhitko, Vadim Zahariev:

A Prototype of the Software System for Study, Training and Analysis of Speech Intonation. 337-346
- Elena E. Lyakso, Olga V. Frolova:

Speech Interaction in "Mother-Child" Dyads with 4-7 Years Old Typically Developing Children and Children with Autism Spectrum Disorders. 347-356
- Elena E. Lyakso, Olga V. Frolova, Aleksey Grigorev, Viktor Gorodnyi, Aleksandr Nikolaev, Yuri N. Matveev:

Speech Features of Adults with Autism Spectrum Disorders and Mental Retardation. 357-366
- Thomas Manzini, Alan W. Black:

Towards Improving Intelligibility of Black-Box Speech Synthesizers in Noise. 367-376
- Nikita Markovnikov, Irina S. Kipyatkova, Elena E. Lyakso:

End-to-End Speech Recognition in Russian. 377-386
- Martin Matura, Markéta Juzová:

Correction of Formal Prosodic Structures in Czech Corpora Using Legendre Polynomials. 387-397
- Martin Matura, Markéta Juzová, Jindrich Matousek:

On the Contribution of Articulatory Features to Speech Synthesis. 398-407
- Martin Meszaros, Franziska Trojahn, Michael Maruschke, Oliver Jokisch:

QuARTCS: A Tool Enabling End-to-Any Speech Quality Assessment of WebRTC-Based Calls. 408-418
- Petr Mizera, Petr Pollák:

Automatic Phonetic Segmentation and Pronunciation Detection with Various Approaches of Acoustic Modeling. 419-429
- Eduardo Mizraji, Andrés Pomi, Juan Lin:

Improving Neural Models of Language with Input-Output Tensor Contexts. 430-440
- Anfisa Naumova:

Sociolinguistic Variability of Predicate Groups in Colloquial Russian Speech. 441-450
- Thai Son Nguyen, Matthias Sperber, Sebastian Stüker, Alex Waibel:

Building Real-Time Speech Recognition Without CMVN. 451-460
- Dariya Novokhrestova, Evgeny Kostyuchenko, Roman V. Meshcheryakov:

Choice of Signal Short-Term Energy Parameter for Assessing Speech Intelligibility in the Process of Speech Rehabilitation. 461-469
- Jaromír Novotný, Pavel Ircing:

The Benefit of Document Embedding in Unsupervised Document Classification. 470-478
- Siham Ouamour, Halim Sayoud:

A Comparative Survey of Authorship Attribution on Short Arabic Texts. 479-489
- Vedhas Pandit, Maximilian Schmitt, Nicholas Cummins, Franz Graf, Lucas Paletta, Björn W. Schuller:

How Good Is Your Model 'Really'? On 'Wildness' of the In-the-Wild Speech-Based Affect Recognisers. 490-500
- Olga Perepelkina, Evdokia Kazimirova, Maria Konstantinova:

RAMAS: Russian Multimodal Corpus of Dyadic Interaction for Affective Computing. 501-510
- Gábor Pintér, Mira Schielke, Rico Petrick:

Investigating Word Segmentation Techniques for German Using Finite-State Transducers. 511-521
- Branislav M. Popovic, Edvin Pakoci, Darko Pekar:

A Comparison of Language Model Training Techniques in a Continuous Speech Recognition System for Serbian. 522-531
- Rodmonga Potapova, Liliya Komalova, Vsevolod Potapov:

Perceptual-Auditory Evaluation of the Aggressive Speech Behavior: Gender Aspect (on the Basis of Russian and Spanish Languages). 532-541
- Rodmonga Potapova, Vsevolod Potapov:

Main Determinants of the Acmeologic Personality Profiling. 542-551
- Eran Raveh, Ingmar Steiner, Iona Gessinger, Bernd Möbius:

Studying Mutual Phonetic Influence with a Web-Based Spoken Dialogue System. 552-562
- Najmeh Sadoughi, Greg P. Finley, Erik Edwards, Amanda Robinson, Maxim Korenevsky, Michael Brenndoerfer, Nico Axtmann, Mark Miller, David Suendermann-Oeft:

Detecting Section Boundaries in Medical Dictations: Toward Real-Time Conversion of Medical Dictations to Clinical Reports. 563-573
- Michelina Savino, Loredana Lapertosa, Mario Refice:

Seeing or Not Seeing Your Conversational Partner: The Influence of Interaction Modality on Prosodic Entrainment. 574-584
- Tina Schuh, Stephan Dreiseitl:

Evaluating Novel Features for Aggressive Language Detection. 585-595
- Tatiana Y. Sherstinova:

Quantitative Data on POS Distribution in the Beginnings and the Ends of Utterances in Everyday Russian Speech. 596-605
- Tatiana Shevchenko, Tatiana Sokoreva:

Corpus Data on Adult Life-Long Trajectory of Prosody Development in American English, with Special Reference to Middle Age. 606-614
- Nikolay Shilov, Alexey M. Kashevnik, Sergey Mikhailov:

Context-Aware Generation of Personalized Audio Tours: Approach and Evaluation. 615-624
- Ingo Siegert, Alicia Flores Lotz, Olga Egorow, Susann Wolff:

Utilizing Psychoacoustic Modeling to Improve Speech-Based Emotion Recognition. 625-635
- Vered Silber-Varod, Anat Lerner, Oliver Jokisch:

Prosodic Plot of Dialogues: A Conceptual Framework to Trace Speakers' Role. 636-645
- Lubos Smídl, Jan Svec, Ales Prazák, Jan Trmal:

Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition. 646-655
- Anton Stepikhov, Anastassia Loukina:

Personality, Working Memory Capacity and Expert Manual Annotation of German Spontaneous Speech. 656-666
- Mikhail Stolbov, Marina Tatarnikova, Quan Trong The:

Using Dual-Element Microphone Arrays for Automatic Keyword Recognition. 667-675
- Daniel Tihelka, Zdenek Hanzlícek, Markéta Juzová, Jindrich Matousek:

First Steps Towards Hybrid Speech Synthesis in Czech TTS System ARTIC. 676-686
- Maxim Tkachenko, Alexander Yamshinin, Mikhail Kotov, Marina Nastasenko:

Lightweight Embeddings for Speaker Verification. 687-696
- László Tóth, György Kovács, Dirk Van Compernolle:

A Perceptually Inspired Data Augmentation Method for Noise Robust CNN Acoustic Models. 697-706
- Constanze Tschöpe, Frank Duckhorn, Markus Huber, Werner Meyer, Matthias Wolff:

A Cognitive User Interface for a Multi-modal Human-Machine Interaction. 707-717
- Amir Vaheb, Ali Janalizadeh Choobbasti, S. H. E. Mortazavi Najafabadi, Saeid Safavi:

Investigating Language Variability on the Performance of Speaker Verification Systems. 718-727
- Jan Vanek, Josef Michálek, Josef Psutka:

Recurrent DNNs and Its Ensembles on the TIMIT Phone Recognition Task. 728-736
- Alena Velichko, Viktor Budkov, Ildar Kagirov, Alexey A. Karpov:

Comparative Analysis of Classification Methods for Automatic Deception Detection in Speech. 737-746
- Jochen Weiner, Tanja Schultz:

Selecting Features for Automatic Screening for Dementia Based on Speech. 747-756
- Matthias Wolff, Günther Wirsching, Markus Huber, Peter beim Graben, Ronald Römer, Ingo Schmitt:

A Fock Space Toolbox and Some Applications in Computational Cognition. 757-767
- Olga Yakovenko, Ivan Bondarenko, Mariya Borovikova, Daniil Vodolazsky:

Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems. 768-777
- Zbynek Zajíc, Lucie Zajícová, Josef V. Psutka, Petr Salajka, Jaromír Novotný, Ales Prazák, Ludek Müller:

First Insight into the Processing of the Language Consulting Center Data. 778-787
