default search action
Paavo Alku
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j94]Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku:
Automatic classification of the severity level of Parkinson's disease: A comparison of speaking tasks, features, and classifiers. Comput. Speech Lang. 83: 101548 (2024) - [j93]Sudarsana Reddy Kadiri, Farhad Javanmardi, Paavo Alku:
Investigation of self-supervised pre-trained models for classification of voice quality from speech and neck surface accelerometer signals. Comput. Speech Lang. 83: 101550 (2024) - [j92]Farhad Javanmardi, Sudarsana Reddy Kadiri, Paavo Alku:
A comparison of data augmentation methods in voice pathology detection. Comput. Speech Lang. 83: 101552 (2024) - [j91]Paavo Alku, Manila Kodali, Laura Laaksonen, Sudarsana Reddy Kadiri:
AVID: A speech database for machine learning studies on vocal intensity. Speech Commun. 157: 103039 (2024) - [j90]Yagnavajjula Madhu Keerthana, Mittapalle Kiran Reddy, Paavo Alku, K. Sreenivasa Rao, Pabitra Mitra:
Automatic classification of neurological voice disorders using wavelet scattering features. Speech Commun. 157: 103040 (2024) - [j89]Farhad Javanmardi, Sudarsana Reddy Kadiri, Paavo Alku:
Pre-trained models for detection and severity level classification of dysarthria from speech. Speech Commun. 158: 103047 (2024) - [j88]Farhad Javanmardi, Sudarsana Reddy Kadiri, Paavo Alku:
Exploring the Impact of Fine-Tuning the Wav2vec2 Model in Database-Independent Detection of Dysarthric Speech. IEEE J. Biomed. Health Informatics 28(8): 4951-4962 (2024) - [i16]Anne-Maria Laukkanen, Sudarsana Reddy Kadiri, Shrikanth Narayanan, Paavo Alku:
Can a Machine Distinguish High and Low Amount of Social Creak in Speech? CoRR abs/2410.17028 (2024) - 2023
- [j87]Sudarsana Reddy Kadiri, Paavo Alku, B. Yegnanarayana:
Analysis of Instantaneous Frequency Components of Speech Signals for Epoch Extraction. Comput. Speech Lang. 78: 101443 (2023) - [j86]Paavo Alku, Sudarsana Reddy Kadiri, Dhananjaya Gowda:
Refining a deep learning-based formant tracker using linear prediction methods. Comput. Speech Lang. 81: 101515 (2023) - [j85]Mittapalle Kiran Reddy, Yagnavajjula Madhu Keerthana, Paavo Alku:
Classification of functional dysphonia using the tunable Q wavelet transform. Speech Commun. 155: 102989 (2023) - [j84]Yuanyuan Liu, Mittapalle Kiran Reddy, Nelly Penttilä, Tiina Ihalainen, Paavo Alku, Okko Räsänen:
Automatic Assessment of Parkinson's Disease Using Speech Representations of Phonation and Articulation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 242-255 (2023) - [j83]Mittapalle Kiran Reddy, Paavo Alku:
Exemplar-Based Sparse Representations for Detection of Parkinson's Disease From Speech. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1386-1396 (2023) - [c161]Farhad Javanmardi, Saska Tirronen, Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku:
Wav2vec-Based Detection and Severity Level Classification of Dysarthria From Speech. ICASSP 2023: 1-5 - [c160]Manila Kodali, Sudarsana Reddy Kadiri, Laura Laaksonen, Paavo Alku:
Automatic Classification of Vocal Intensity Category from Speech. ICASSP 2023: 1-5 - [c159]Saska Tirronen, Farhad Javanmardi, Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku:
Utilizing Wav2Vec In Database-Independent Voice Disorder Detection. ICASSP 2023: 1-5 - [c158]Sudarsana Reddy Kadiri, Manila Kodali, Paavo Alku:
Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features. INTERSPEECH 2023: 2393-2397 - [c157]Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku:
Classification of Vocal Intensity Category from Speech using the Wav2vec2 and Whisper Embeddings. INTERSPEECH 2023: 4134-4138 - [i15]Sudarsana Reddy Kadiri, Farhad Javanmardi, Paavo Alku:
Investigation of Self-supervised Pre-trained Models for Classification of Voice Quality from Speech and Neck Surface Accelerometer Signals. CoRR abs/2308.03226 (2023) - [i14]Sudarsana Reddy Kadiri, Manila Kodali, Paavo Alku:
Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features. CoRR abs/2308.09042 (2023) - [i13]Paavo Alku, Sudarsana Reddy Kadiri, Dhananjaya Gowda:
Refining a Deep Learning-based Formant Tracker using Linear Prediction Methods. CoRR abs/2308.09051 (2023) - [i12]Dhananjaya Gowda, Sudarsana Reddy Kadiri, Brad H. Story, Paavo Alku:
Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals. CoRR abs/2308.16540 (2023) - [i11]Sudarsana Reddy Kadiri, Paavo Alku:
Analysis and Detection of Pathological Voice using Glottal Source Features. CoRR abs/2309.14080 (2023) - [i10]Farhad Javanmardi, Saska Tirronen, Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku:
Wav2vec-based Detection and Severity Level Classification of Dysarthria from Speech. CoRR abs/2309.14107 (2023) - 2022
- [j82]Sudarsana Reddy Kadiri, Paavo Alku:
Subjective Evaluation of Basic Emotions from Audio-Visual Data. Sensors 22(13): 4931 (2022) - [j81]Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Paavo Alku, Mikko Kurimo:
A formant modification method for improved ASR of children's speech. Speech Commun. 136: 98-106 (2022) - [j80]Mittapalle Kiran Reddy, Hilla Pohjalainen, Pyry Helkkula, Kasimir Kaitue, Mikko Minkkinen, Heli Tolppanen, Tuomo Nieminen, Paavo Alku:
Glottal flow characteristics in vowels produced by speakers with heart failure. Speech Commun. 137: 35-43 (2022) - [j79]Mittapalle Kiran Reddy, Yagnavajjula Madhu Keerthana, Paavo Alku:
End-to-End Pathological Speech Detection Using Wavelet Scattering Network. IEEE Signal Process. Lett. 29: 1863-1867 (2022) - [c156]Farhad Javanmardi, Sudarsana Reddy Kadiri, Manila Kodali, Paavo Alku:
Comparing 1-dimensional and 2-dimensional spectral feature representations in voice pathology detection using machine learning and deep learning classifiers. INTERSPEECH 2022: 2173-2177 - [c155]Sudarsana Reddy Kadiri, Farhad Javanmardi, Paavo Alku:
Convolutional Neural Networks for Classification of Voice Qualities from Speech and Neck Surface Accelerometer Signals. INTERSPEECH 2022: 5253-5257 - [i9]Dhananjaya Gowda, Bajibabu Bollepalli, Sudarsana Reddy Kadiri, Paavo Alku:
Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks. CoRR abs/2201.01525 (2022) - 2021
- [j78]Mittapalle Kiran Reddy, Paavo Alku:
A Comparison of Cepstral Features in the Detection of Pathological Voices by Varying the Input and Filterbank of the Cepstrum Computation. IEEE Access 9: 135953-135963 (2021) - [j77]Dhananjaya N. Gowda, Bajibabu Bollepalli, Sudarsana Reddy Kadiri, Paavo Alku:
Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks. IEEE Access 9: 151631-151640 (2021) - [j76]N. P. Narendra, Paavo Alku:
Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features. Comput. Speech Lang. 65: 101117 (2021) - [j75]Mittapalle Kiran Reddy, Pyry Helkkula, Yagnavajjula Madhu Keerthana, Kasimir Kaitue, Mikko Minkkinen, Heli Tolppanen, Tuomo Nieminen, Paavo Alku:
The automatic detection of heart failure using speech signals. Comput. Speech Lang. 69: 101205 (2021) - [j74]Sudarsana Reddy Kadiri, Paavo Alku:
Glottal features for classification of phonation type from speech and neck surface accelerometer signals. Comput. Speech Lang. 70: 101232 (2021) - [j73]Sudarsana Reddy Kadiri, Paavo Alku, Bayya Yegnanarayana:
Extraction and Utilization of Excitation Information of Speech: A Review. Proc. IEEE 109(12): 1920-1941 (2021) - [j72]N. P. Narendra, Björn W. Schuller, Paavo Alku:
The Detection of Parkinson's Disease From Speech Using Voice Source Information. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1925-1936 (2021) - [c154]Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Paavo Alku, Mikko Kurimo:
Spectral modification for recognition of children's speech undermismatched conditions. NoDaLiDa 2021: 94-100 - 2020
- [j71]Mittapalle Kiran Reddy, Paavo Alku, Krothapalli Sreenivasa Rao:
Detection of Specific Language Impairment in Children Using Glottal Source Features. IEEE Access 8: 15273-15279 (2020) - [j70]Sudarsana Reddy Kadiri, Paavo Alku:
Excitation Features of Speech for Speaker-Specific Emotion Detection. IEEE Access 8: 60382-60391 (2020) - [j69]N. P. Narendra, Paavo Alku:
Glottal Source Information for Pathological Voice Detection. IEEE Access 8: 67745-67755 (2020) - [j68]Rashmi Kethireddy, Sudarsana Reddy Kadiri, Paavo Alku, Suryakanth V. Gangashetty:
Mel-Weighted Single Frequency Filtering Spectrogram for Dialect Identification. IEEE Access 8: 174871-174879 (2020) - [j67]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Zhen-Hua Ling:
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech. Comput. Speech Lang. 64: 101114 (2020) - [j66]Sudarsana Reddy Kadiri, P. Gangamohan, Suryakanth V. Gangashetty, Paavo Alku, B. Yegnanarayana:
Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference. Circuits Syst. Signal Process. 39(9): 4459-4481 (2020) - [j65]Sudarsana Reddy Kadiri, Paavo Alku:
Analysis and Detection of Pathological Voice Using Glottal Source Features. IEEE J. Sel. Top. Signal Process. 14(2): 367-379 (2020) - [j64]Sudarsana Reddy Kadiri, Paavo Alku, B. Yegnanarayana:
Analysis and classification of phonation types in speech and singing voice. Speech Commun. 118: 33-47 (2020) - [j63]N. P. Narendra, Paavo Alku:
Automatic intelligibility assessment of dysarthric speech using glottal parameters. Speech Commun. 123: 1-9 (2020) - [j62]Krishna Gurugubelli, Anil Kumar Vuppala, N. P. Narendra, Paavo Alku:
Duration of the rhotic approximant /ɹ/ in spastic dysarthria of different severity levels. Speech Commun. 125: 61-68 (2020) - [j61]Dhananjaya N. Gowda, Sudarsana Reddy Kadiri, Brad H. Story, Paavo Alku:
Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1901-1914 (2020) - [c153]Sudarsana Reddy Kadiri, Paavo Alku, B. Yegnanarayana:
Comparison of Glottal Closure Instants Detection Algorithms for Emotional Speech. ICASSP 2020: 7379-7383 - [c152]Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Paavo Alku, Mikko Kurimo:
Study of Formant Modification for Children ASR. ICASSP 2020: 7429-7433 - [c151]Sudarsana Reddy Kadiri, Rashmi Kethireddy, Paavo Alku:
Parkinson's Disease Detection from Speech Using Single Frequency Filtering Cepstral Coefficients. INTERSPEECH 2020: 4971-4975
2010 – 2019
- 2019
- [j60]Shreyas Seshadri, Lauri Juvela, Okko Räsänen, Paavo Alku:
Vocal Effort Based Speaking Style Conversion Using Vocoder Features and Parallel Learning. IEEE Access 7: 17230-17246 (2019) - [j59]Emma Jokinen, Rahim Saeidi, Tomi Kinnunen, Paavo Alku:
Vocal effort compensation for MFCC feature extraction in a shouted versus normal speaker recognition task. Comput. Speech Lang. 53: 1-11 (2019) - [j58]N. P. Narendra, Manu Airaksinen, Brad H. Story, Paavo Alku:
Estimation of the glottal source from coded telephone speech using deep neural networks. Speech Commun. 106: 95-104 (2019) - [j57]Paavo Alku, Tiina Murtola, Jarmo Malinen, Juha Kuortti, Brad H. Story, Manu Airaksinen, Mika Salmi, Erkki Vilkman, Ahmed Geneid:
OPENGLOT - An open environment for the evaluation of glottal inverse filtering. Speech Commun. 107: 38-47 (2019) - [j56]Tiina Murtola, Jarmo Malinen, Ahmed Geneid, Paavo Alku:
Analysis of phonation onsets in vowel production, using information from glottal area and flow estimate. Speech Commun. 109: 55-65 (2019) - [j55]N. P. Narendra, Paavo Alku:
Dysarthric speech classification from coded telephone speech using glottal features. Speech Commun. 110: 47-55 (2019) - [j54]Bajibabu Bollepalli, Lauri Juvela, Manu Airaksinen, Cassia Valentini-Botinhao, Paavo Alku:
Normal-to-Lombard adaptation of speech synthesis using long short-term memory recurrent neural networks. Speech Commun. 110: 64-75 (2019) - [j53]Lauri Juvela, Bajibabu Bollepalli, Vassilis Tsiaras, Paavo Alku:
GlotNet - A Raw Waveform Model for the Glottal Excitation in Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 27(6): 1019-1030 (2019) - [c150]Manu Airaksinen, Lauri Juvela, Paavo Alku, Okko Räsänen:
Data Augmentation Strategies for Neural Network F0 Estimation. ICASSP 2019: 6485-6489 - [c149]Shreyas Seshadri, Lauri Juvela, Junichi Yamagishi, Okko Räsänen, Paavo Alku:
Cycle-consistent Adversarial Networks for Non-parallel Vocal Effort Based Speaking Style Conversion. ICASSP 2019: 6835-6839 - [c148]Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
Waveform Generation for Text-to-speech Synthesis Using Pitch-synchronous Multi-scale Generative Adversarial Networks. ICASSP 2019: 6915-6919 - [c147]Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram. INTERSPEECH 2019: 694-698 - [c146]Sudarsana Reddy Kadiri, Paavo Alku:
Mel-Frequency Cepstral Coefficients of Voice Source Waveforms for Classification of Phonation Types in Speech. INTERSPEECH 2019: 2508-2512 - [c145]Bajibabu Bollepalli, Lauri Juvela, Paavo Alku:
Lombard Speech Synthesis Using Transfer Learning in a Tacotron Text-to-Speech System. INTERSPEECH 2019: 2833-2837 - [c144]Shreyas Seshadri, Lauri Juvela, Paavo Alku, Okko Räsänen:
Augmented CycleGANs for Continuous Scale Normal-to-Lombard Speaking Style Conversion. INTERSPEECH 2019: 2838-2842 - [i8]Bajibabu Bollepalli, Lauri Juvela, Paavo Alku:
Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis. CoRR abs/1903.05955 (2019) - [i7]Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-spectrogram. CoRR abs/1904.03976 (2019) - [i6]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019) - [i5]Thomas Drugman, Paavo Alku, Abeer Alwan, Bayya Yegnanarayana:
Glottal Source Processing: from Analysis to Applications. CoRR abs/1912.12604 (2019) - 2018
- [j52]Tiina Murtola, Paavo Alku, Jarmo Malinen, Ahmed Geneid:
Parameterization of a computational physical model for glottal flow using inverse filtering and high-speed videoendoscopy. Speech Commun. 96: 67-80 (2018) - [j51]Ville Vestman, Dhananjaya N. Gowda, Md. Sahidullah, Paavo Alku, Tomi Kinnunen:
Speaker recognition from whispered speech: A tutorial survey and an application of time-varying linear prediction. Speech Commun. 99: 62-79 (2018) - [j50]Sofoklis Kakouros, Okko Räsänen, Paavo Alku:
Comparison of spectral tilt measures for sentence prominence in speech - Effects of dimensionality and adverse noise conditions. Speech Commun. 103: 11-26 (2018) - [j49]Parham Mokhtari, Brad H. Story, Paavo Alku, Hiroshi Ando:
Estimation of the glottal flow from speech pressure signals: Evaluation of three variants of iterative adaptive inverse filtering using computational physical modelling of voice production. Speech Commun. 104: 24-38 (2018) - [j48]Manu Airaksinen, Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
A Comparison Between STRAIGHT, Glottal, and Sinusoidal Vocoding in Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 26(9): 1658-1670 (2018) - [c143]Lauri Juvela, Bajibabu Bollepalli, Xin Wang, Hirokazu Kameoka, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Speech Waveform Synthesis from MFCC Sequences with Generative Adversarial Networks. ICASSP 2018: 5679-5683 - [c142]Manu Airaksinen, Lauri Juvela, Okko Räsänen, Paavo Alku:
Time-regularized Linear Prediction for Noise-robust Extraction of the Spectral Envelope of Speech. INTERSPEECH 2018: 701-705 - [c141]Lauri Juvela, Vassilis Tsiaras, Bajibabu Bollepalli, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Speaker-independent Raw Waveform Model for Glottal Excitation. INTERSPEECH 2018: 2012-2016 - [c140]N. P. Narendra, Paavo Alku:
Dysarthric Speech Classification Using Glottal Features Computed from Non-words, Words and Sentences. INTERSPEECH 2018: 3403-3407 - [i4]Lauri Juvela, Bajibabu Bollepalli, Xin Wang, Hirokazu Kameoka, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Speech waveform synthesis from MFCC sequences with generative adversarial networks. CoRR abs/1804.00920 (2018) - [i3]Lauri Juvela, Vassilis Tsiaras, Bajibabu Bollepalli, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Speaker-independent raw waveform model for glottal excitation. CoRR abs/1804.09593 (2018) - [i2]Bajibabu Bollepalli, Lauri Juvela, Paavo Alku:
Speaking style adaptation in Text-To-Speech synthesis using Sequence-to-sequence models with attention. CoRR abs/1810.12051 (2018) - [i1]Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks. CoRR abs/1810.12598 (2018) - 2017
- [j47]Dong Liu, Elina Kankare, Anne-Maria Laukkanen, Paavo Alku:
Comparison of parametrization methods of electroglottographic and inverse filtered acoustic speech pressure signals in distinguishing between phonation types. Biomed. Signal Process. Control. 36: 183-193 (2017) - [j46]Manu Airaksinen, Bajibabu Bollepalli, Jouni Pohjalainen, Paavo Alku:
Glottal Vocoding With Frequency-Warped Time-Weighted Linear Prediction. IEEE Signal Process. Lett. 24(4): 446-450 (2017) - [j45]Manu Airaksinen, Tom Bäckström, Paavo Alku:
Quadratic Programming Approach to Glottal Inverse Filtering by Joint Norm-1 and Norm-2 Optimization. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 929-939 (2017) - [j44]Paavo Alku, Rahim Saeidi:
The Linear Predictive Modeling of Speech From Higher-Lag Autocorrelation Coefficients Applied to Noise-Robust Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(8): 1606-1617 (2017) - [j43]Emma Jokinen, Ulpu Remes, Paavo Alku:
Intelligibility Enhancement of Telephone Speech Using Gaussian Process Regression for Normal-to-Lombard Spectral Tilt Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1985-1996 (2017) - [c139]Ana Ramírez López, Rahim Saeidi, Lauri Juvela, Paavo Alku:
Normal-to-shouted speech spectral mapping for speaker recognition under vocal effort mismatch. ICASSP 2017: 4940-4944 - [c138]Bajibabu Bollepalli, Manu Airaksinen, Paavo Alku:
Lombard speech synthesis using long short-term memory recurrent neural networks. ICASSP 2017: 5505-5509 - [c137]Tomi Kinnunen, Lauri Juvela, Paavo Alku, Junichi Yamagishi:
Non-parallel voice conversion using i-vector PLDA: towards unifying speaker verification and transformation. ICASSP 2017: 5535-5539 - [c136]Manu Airaksinen, Bajibabu Bollepalli, Jouni Pohjalainen, Paavo Alku:
Frequency-warped time-weighted linear prediction for glottal vocoding. ICASSP 2017: 5630-5634 - [c135]Ana Ramírez López, Shreyas Seshadri, Lauri Juvela, Okko Räsänen, Paavo Alku:
Speaking Style Conversion from Normal to Lombard Speech Using a Glottal Vocoder and Bayesian GMMs. INTERSPEECH 2017: 1363-1367 - [c134]Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
Reducing Mismatch in Training of DNN-Based Glottal Excitation Models in a Statistical Parametric Text-to-Speech System. INTERSPEECH 2017: 1368-1372 - [c133]Ville Vestman, Dhananjaya N. Gowda, Md. Sahidullah, Paavo Alku, Tomi Kinnunen:
Time-Varying Autoregressions for Speaker Verification in Reverberant Conditions. INTERSPEECH 2017: 1512-1516 - [c132]Sofoklis Kakouros, Okko Räsänen, Paavo Alku:
Evaluation of Spectral Tilt Measures for Sentence Prominence Under Different Noise Conditions. INTERSPEECH 2017: 3211-3215 - [c131]Bajibabu Bollepalli, Lauri Juvela, Paavo Alku:
Generative Adversarial Network-Based Glottal Waveform Model for Statistical Parametric Speech Synthesis. INTERSPEECH 2017: 3394-3398 - [c130]N. P. Narendra, Manu Airaksinen, Paavo Alku:
Glottal Source Estimation from Coded Telephone Speech Using a Deep Neural Network. INTERSPEECH 2017: 3931-3935 - [c129]Manu Airaksinen, Paavo Alku:
Effects of Training Data Variety in Generating Glottal Pulses from Acoustic Features with DNNs. INTERSPEECH 2017: 3946-3950 - 2016
- [j42]Ulpu Remes, Ana Ramírez López, Lauri Juvela, Kalle J. Palomäki, Guy J. Brown, Paavo Alku, Mikko Kurimo:
Comparing human and automatic speech recognition in a perceptual restoration experiment. Comput. Speech Lang. 35: 14-31 (2016) - [j41]Maria Hakonen, Patrick J. C. May, Jussi Alho, Paavo Alku, Emma Jokinen, Iiro P. Jääskeläinen, Hannu Tiitinen:
Previous exposure to intact speech increases intelligibility of its digitally degraded counterpart as a function of stimulus complexity. NeuroImage 125: 131-143 (2016) - [j40]Tuomo Raitio, Lauri Juvela, Antti Suni, Martti Vainio, Paavo Alku:
Phase perception of the glottal excitation and its relevance in statistical parametric speech synthesis. Speech Commun. 81: 104-119 (2016) - [j39]Emma Jokinen, Hannu Pulakka, Paavo Alku:
Phase modification for increasing the intelligibility of telephone speech in near-end noise conditions - evaluation of two methods. Speech Commun. 83: 64-80 (2016) - [j38]Rahim Saeidi, Paavo Alku, Tom Bäckström:
Feature Extraction Using Power-Law Adjusted Linear Prediction With Application to Speaker Recognition Under Severe Vocal Effort Mismatch. IEEE ACM Trans. Audio Speech Lang. Process. 24(1): 42-53 (2016) - [c128]Dhananjaya N. Gowda, Manu Airaksinen, Paavo Alku:
Quasi closed phase analysis of speech signals using time varying weighted linear prediction for accurate formant tracking. ICASSP 2016: 4980-4984 - [c127]Lauri Juvela, Bajibabu Bollepalli, Manu Airaksinen, Paavo Alku:
High-pitched excitation generation for glottal vocoding in statistical parametric speech synthesis using a deep neural network. ICASSP 2016: 5120-5124 - [c126]