


default search action
18. SPECOM 2016: Budapest, Hungary
- Andrey Ronzhin, Rodmonga Potapova, Géza Németh:

Speech and Computer - 18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings. Lecture Notes in Computer Science 9811, Springer 2016, ISBN 978-3-319-43957-0
Invited Talks
- Ralf Schlüter

, Patrick Doetsch, Pavel Golik
, Markus Kitza, Tobias Menne, Kazuki Irie, Zoltán Tüske, Albert Zeyer
:
Automatic Speech Recognition Based on Neural Networks. 3-17 - Nick Campbell:

Machine Processing of Dialogue States; Speculations on Conversational Entropy. 18-25 - Attila Vékony:

Speech Recognition Challenges in the Car Navigation Industry. 26-40
Conference Papers
- Elena E. Lyakso

, Olga V. Frolova
, Aleksey Grigorev
:
A Comparison of Acoustic Features of Speech of Typically Developing Children and Children with Autism Spectrum Disorders. 43-50 - Mohamed S. Elaraby, Mustafa Abdallah

, Sherif M. Abdou, Mohsen A. Rashwan:
A Deep Neural Networks (DNN) Based Models for a Computer Aided Pronunciation Learning System. 51-58 - Tijana Delic, Branislav Gerazov

, Branislav M. Popovic, Milan Secujski:
A Linguistic Interpretation of the Atom Decomposition of Fundamental Frequency Contour for American English. 59-66 - Edvin Pakoci, Branislav M. Popovic, Niksa Jakovljevic

, Darko Pekar, Fathy Yassa:
A Phonetic Segmentation Procedure Based on Hidden Markov Models. 67-74 - Yuyun Huang, Emer Gilmartin

, Benjamin R. Cowan
, Nick Campbell:
A Preliminary Exploration of Group Social Engagement Level Recognition in Multiparty Casual Conversation. 75-83 - Branislav Gerazov

, Philip N. Garner
:
An Agonist-Antagonist Pitch Production Model. 84-91 - Darko Pekar, Sinisa Suzic

, Robert Mak, Meir Friedlander, Milan Secujski:
An Algorithm for Phase Manipulation in a Speech Signal. 92-99 - Natalia Bogdanova-Beglarian, Tatiana Y. Sherstinova

, Olga Blinova
, Gregory Y. Martynenko
:
An Exploratory Study on Sociolinguistic Variation of Russian Everyday Speech. 100-107 - László Tóth

, Gábor Gosztolya:
Adaptation of DNN Acoustic Models Using KL-divergence Regularization and Multi-task Training. 108-115 - Ivan Medennikov

, Alexey Prudnikov:
Advances in STC Russian Spontaneous Speech Recognition System. 116-123 - Andrey Shulipa, Sergey Novoselov, Aleksandr Melnikov:

Approaches for Out-of-Domain Adaptation to Improve Speaker Recognition Performance. 124-130 - Alexander Sepúlveda-Sepúlveda

, Germán Castellanos-Domínguez:
Assessment of the Relation Between Low-Frequency Features and Velum Opening by Using Real Articulatory Data. 131-139 - András Beke, György Szaszák:

Automatic Summarization of Highly Spontaneous Speech. 140-147 - Michimasa Inaba, Kenichi Takahashi:

Backchanneling via Twitter Data for Conversational Dialogue Systems. 148-155 - Alexey A. Petrovsky, Vadzim Herasimovich

, Alexander A. Petrovsky:
Bio-Inspired Sparse Representation of Speech and Audio Using Psychoacoustic Adaptive Matching Pursuit. 156-164 - György Szaszák, Máté Ákos Tündik, Branislav Gerazov

, Aleksandar Gjoreski:
Combining Atom Decomposition of the F0 Track and HMM-based Phonological Phrase Modelling for Robust Stress Detection in Speech. 165-173 - Konstantin Simonchik, Sergey Novoselov, Galina Lavrentyeva:

Comparative Analysis of Classifiers for Automatic Language Recognition in Spontaneous Speech. 174-181 - Lucie Skorkovská

:
Comparison of Retrieval Approaches and Blind Relevance Feedback Methods Within the Czech Speech Information Retrieval. 182-190 - Marek Hrúz

, Marie Kunesová
:
Convolutional Neural Network in the Task of Speaker Change Detection. 191-198 - Milan Secujski, Branislav Gerazov

, Tamás Gábor Csapó
, Vlado Delic
, Philip N. Garner
, Aleksandar Gjoreski, David Guennec, Zoran A. Ivanovski, Aleksandar Melov, Géza Németh
, Ana Stojkovic
, György Szaszák:
Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer. 199-206 - Markéta Juzová, Daniel Tihelka

, Jindrich Matousek
:
Designing High-Coverage Multi-level Text Corpus for Non-professional-voice Conservation. 207-215 - Kseniya Proença, Kris Demuynck, Dirk Van Compernolle:

Designing Syllable Models for an HMM Based Speech Recognition System. 216-223 - Vasilisa Verkhodanova

, Vladimir Shapranov
:
Detecting Filled Pauses and Lengthenings in Russian Spontaneous Speech Using SVM. 224-231 - Gábor Gosztolya:

Detecting Laughter and Filler Events by Time Series Smoothing with Genetic Algorithms. 232-239 - Denis Gordeev

:
Detecting State of Aggression in Sentences Using CNN. 240-245 - Irina S. Kipyatkova, Alexey Karpov

:
DNN-Based Acoustic Modeling for Russian Speech Recognition Using Kaldi. 246-253 - Péter Nagy, Géza Németh

:
DNN-Based Duration Modeling for Synthesizing Short Sentences. 254-261 - Olga V. Frolova

, Elena E. Lyakso
:
Emotional Speech of 3-Years Old Children: Norm-Risk-Deprivation. 262-270 - Bálint Pál Tóth, Kornél István Kis, György Szaszák, Géza Németh

:
Ensemble Deep Neural Network Based Waveform-Driven Stress Model for Speech Synthesis. 271-278 - Hunor Nagy, György Wersényi

:
Evaluation of Response Times on a Touch Screen Using Stereo Panned Speech Command Auditory Feedback. 279-286 - Evgeny Kostyuchenko

, Roman V. Mescheryakov, Dariya Ignatieva, Alexander Pyatkov, Evgeny L. Choinzonov
, Lidiya N. Balatskaya:
Evaluation of the Speech Quality During Rehabilitation After Surgical Treatment of the Cancer of Oral Cavity and Oropharynx Based on a Comparison of the Fourier Spectra. 287-295 - Daniel Tihelka

, Martin Gruber
, Markéta Juzová:
Experiments with One-Class Classifier as a Predictor of Spectral Discontinuities in Unit Concatenation. 296-303 - Natalia A. Tomashenko

, Yuri Y. Khokhlov, Anthony Larcher, Yannick Estève:
Exploring GMM-derived Features for Unsupervised Adaptation of Deep Neural Network Acoustic Models. 304-311 - Maxim Korenevsky, Aleksei Romanenko

:
Feature Space VTS with Phase Term Modeling. 312-320 - Evgeniy Shuranov, Aleksandr Lavrentyev, Alexey Kozlyaev, Galina Lavrentyeva, Valeriya Volkovaya:

Finding Speaker Position Under Difficult Acoustic Conditions. 321-327 - Evaldas Vaiciukynas

, Antanas Verikas, Adas Gelzinis, Marija Bacauskiene, Kestutis Vaskevicius, Virgilijus Uloza, Evaldas Padervinskis, Jolita Ciceliene:
Fusing Various Audio Feature Sets for Detection of Parkinson's Disease from Sustained Voice and Speech Recordings. 328-337 - Vasilisa Verkhodanova

, Alexander L. Ronzhin, Irina S. Kipyatkova, Denis Ivanko
, Alexey Karpov
, Milos Zelezný:
HAVRUS Corpus: High-Speed Recordings of Audio-Visual Russian Speech. 338-345 - Alexander V. Smirnov, Alexey M. Kashevnik

, Igor Lashkov
:
Human-Smartphone Interaction for Dangerous Situation Detection and Recommendation Generation While Driving. 346-353 - Marvin Coto-Jiménez

, John Goddard Close, Fabiola Martínez Licona
:
Improving Automatic Speech Recognition Containing Additive Noise Using Deep Denoising Autoencoders of LSTM Networks. 354-361 - Maxim Korenevsky, Ivan Medennikov

, Vadim Shchemelinin:
Improving the Quality of Automatic Speech Recognition in Trucks. 362-369 - Chitralekha Bhat, Bhavik Vachhani, Sunil Kumar Kopparapu

:
Improving Recognition of Dysarthric Speech Using Severity Based Tempo Adaptation. 370-377 - Iosif Mporas, Saeid Safavi, Reza Sotudeh:

Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent and Text-Independent Operation Modalities. 378-385 - Bálint Pál Tóth, Balázs Szórádi, Géza Németh

:
Improvements to Prosodic Variation in Long Short-Term Memory Based Intonation Models Using Random Forest. 386-394 - André Mansikkaniemi, Mikko Kurimo, Krister Lindén

:
In-Document Adaptation for a Human Guided Automatic Transcription Service. 395-402 - Anastasiia Spirina

, Olesia Vaskovskaia, Maxim Sidorov, Alexander Schmitt:
Interaction Quality as a Human-Human Task-Oriented Conversation Performance. 403-410 - Zbynek Zajíc

, Marie Kunesová
, Vlasta Radová
:
Investigation of Segmentation in i-Vector Based Speaker Diarization of Telephone Speech. 411-418 - Victor Budkov

, Irina V. Vatamaniuk
, Vladimir V. Basov
, Daniyar Volf:
Investigation of Speech Signal Parameters Reflecting the Truth of Transmitted Information. 419-426 - Sai Sirisha Rallabandi, Sai Krishna Rallabandi, Naina Teertha, R. Kumaraswamy

, Suryakanth V. Gangashetty
:
Investigating Signal Correlation as Continuity Metric in a Syllable Based Unit Selection Synthesis System. 427-434 - Andrei Smirnov, Valentin Mendelev:

Knowledge Transfer for Utterance Classification in Low-Resource Languages. 435-442 - Maxim Tkachenko, Alexander Yamshinin, Nikolay Lyubimov, Mikhail Kotov, Marina Nastasenko:

Language Identification Using Time Delay Neural Network D-Vector on Short Utterances. 443-449 - Swaran Lata, Swati Arora, Simerjeet Kaur:

Lexical Stress in Punjabi and Its Representation in PLS. 450-460 - Anton Stepikhov

, Anastassia Loukina:
Low Inter-Annotator Agreement in Sentence Boundary Detection and Annotator Personality. 461-468 - Ivan Medennikov

, Anna Bulusheva:
LSTM-Based Language Models for Spontaneous Speech Recognition. 469-475 - Michelina Savino, Loredana Lapertosa, Alessandro O. Caffò

, Mario Refice:
Measuring Prosodic Entrainment in Italian Collaborative Game-Based Dialogues. 476-483 - Mikhail Stolbov, Sergei Aleinik:

Microphone Array Directivity Improvement in Low-Frequency Band for Speech Processing. 484-490 - Olga Blinova

:
Modeling Imperative Utterances in Russian Spoken Dialogue: Verb-Central Quantitative Approach. 491-498 - Rodmonga Potapova

, Liliya Komalova
:
Multimodal Perception of Aggressive Behavior. 499-506 - Rodmonga Potapova

, Vsevolod Potapov
:
On Individual Polyinformativity of Speech and Voice Regarding Speakers Auditive Attribution (Forensic Phonetic Aspect). 507-514 - Gerasimos Arvanitis

, Konstantinos Moustakas, Nikos Fakotakis:
Online Biometric Identification with Face Analysis in Web Applications. 515-522 - Sergei Aleinik:

Optimization of Zelinski Post-filtering Calculation. 523-530 - Vera Evdokimova, Pavel A. Skrelin

, Andrey Barabanov, Karina Evgrafova
:
Phonetic Aspects of High Level of Naturalness in Speech Synthesis. 531-538 - Rodmonga Potapova

, Vsevolod Potapov
:
Polybasic Attribution of Social Network Discourse. 539-546 - Andrey Barabanov, Valentin V. Magerkin, Evgenij Vikulov:

Precise Estimation of Harmonic Parameter Trend and Modification of a Speech Signal. 547-554 - Tatiana Litvinova

, Olga Zagorovskaya
, Olga Litvinova
, Pavel Seredin
:
Profiling a Set of Personality Traits of a Text's Author: A Corpus-Based Approach. 555-562 - Izzad Ramli, Noraini Seman

, Norizah Ardi, Nursuriati Jamil
:
Prosody Analysis of Malay Language Storytelling Corpus. 563-570 - Michael Maruschke, Oliver Jokisch

, Martin Meszaros, Franziska Trojahn, M. Hoffmann:
Quality Assessment of Two Fullband Audio Codecs Supporting Real-Time Communication. 571-579 - Surasak Boonkla, Masashi Unoki

, Stanislav S. Makhanov
:
Robust Speech Analysis Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition in Noisy Environments. 580-587 - Irina V. Vatamaniuk

, Dmitriy Levonevskiy, Anton I. Saveliev
, Alexander Denisov
:
Scenarios of Multimodal Information Navigation Services for Users in Cyberphysical Environment. 588-595 - Andrey Shulipa, Sergey Novoselov, Yuri Matveev

:
Scores Calibration in Speaker Recognition Systems. 596-603 - Lukás Bures, Ludek Müller

:
Selecting Keypoint Detector and Descriptor Combination for Augmented Reality Application. 604-612 - Elena Bulgakova, Aleksey Sholohov:

Semi-automatic Speaker Verification System Based on Analysis of Formant, Durational and Pitch Characteristics. 613-619 - Aleksei Romanenko

, Valentin Mendelev:
Speaker-Dependent Bottleneck Features for Egyptian Arabic Speech Recognition. 620-626 - Tatiana Y. Sherstinova

:
Speech Acts Annotation of Everyday Conversations in the ORD Сorpus of Spoken Russian. 627-635 - Mikhail Stolbov, Alexander Lavrentyev:

Speech Enhancement with Microphone Array Using a Multi Beam Adaptive Noise Suppressor. 636-644 - Ivan Rakhmanenko

, Roman V. Meshcheryakov
:
Speech Features Evaluation for Small Set Automatic Speaker Verification Using GMM-UBM System. 645-650 - Stamatis Karlos

, Nikos Fazakis
, Katerina Karanikola, Sotiris B. Kotsiantis
, Kyriakos N. Sgarbas
:
Speech Recognition Combining MFCCs and Image Features. 651-658 - Natalia Bogdanova-Beglarian, Tatiana Y. Sherstinova

, Olga Blinova
, Olga Ermolova, Ekaterina Baeva, Gregory Y. Martynenko
, Anastassia Ryko:
Sociolinguistic Extension of the ORD Corpus of Russian Everyday Speech. 659-666 - Miklós Gábriel Tulics, Ferenc Kazinczi, Klára Vicsi:

Statistical Analysis of Acoustical Parameters in the Voice of Children with Juvenile Dysphonia. 667-674 - Róbert Sabo

, Milan Rusko
, Andrej Ridzik, Jakub Rajcáni
:
Stress, Arousal, and Stress Detector Trained on Acted Speech Database. 675-682 - Yuto Tanaka, Mitsunori Mizumachi, Yoshihisa Nakatoh:

Study on the Improvement of Intelligibility for Elderly Speech Using Formant Frequency Shift Method. 683-690 - Ksenia Oskina:

Text Classification in the Domain of Applied Linguistics as Part of a Pre-editing Module for Machine Translation Systems. 691-698 - Nina B. Volskaya, Tatiana Kachkovskaia

:
Tonal Specification of Perceptually Prominent Non-nuclear Pitch Accents in Russian. 699-705 - Zdenek Krnoul, Pavel Jedlicka, Jakub Kanis, Milos Zelezný:

Toward Sign Language Motion Capture Dataset Building. 706-713 - Andrey Barabanov, Aleksandr Melnikov:

Trade-Off Between Speed and Accuracy for Noise Variance Minimization (NVM) Pitch Estimation Algorithm. 714-721 - Varvara Krayvanova

, Svetlana Duka:
Unsupervised Trained Functional Discourse Parser for e-Learning Materials Scaffolding. 722-728

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














