default search action
Tomi Kinnunen
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2024
- [j50]Tomi H. Kinnunen, Kong Aik Lee, Hemlata Tak, Nicholas W. D. Evans, Andreas Nautsch:
t-EER: Parameter-Free Tandem Evaluation of Countermeasures and Biometric Comparators. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 2622-2637 (2024) - [j49]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Generalizing Speaker Verification for Spoof Awareness in the Embedding Space. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1261-1273 (2024) - 2023
- [j48]Xuechen Liu, Xin Wang, Md. Sahidullah, Jose Patino, Héctor Delgado, Tomi Kinnunen, Massimiliano Todisco, Junichi Yamagishi, Nicholas W. D. Evans, Andreas Nautsch, Kong Aik Lee:
ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2507-2522 (2023) - [j47]Anssi Kanervisto, Tomi Kinnunen, Ville Hautamäki:
GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters. IEEE Trans. Games 15(4): 566-579 (2023) - 2022
- [j46]Lauri Tavi, Tomi Kinnunen, Rosa González Hautamäki:
Improving speaker de-identification with functional data analysis of f0 trajectories. Speech Commun. 140: 1-10 (2022) - [j45]Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, Junichi Yamagishi:
Optimizing Tandem Speaker Verification and Anti-Spoofing Systems. IEEE ACM Trans. Audio Speech Lang. Process. 30: 477-488 (2022) - 2021
- [j44]Kong Aik Lee, Ville Vestman, Tomi Kinnunen:
ASVtorch toolkit: Speaker verification with deep neural networks. SoftwareX 14: 100697 (2021) - [j43]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Optimizing Multi-Taper Features for Deep Speaker Verification. IEEE Signal Process. Lett. 28: 2187-2191 (2021) - [j42]Andreas Nautsch, Xin Wang, Nicholas W. D. Evans, Tomi H. Kinnunen, Ville Vestman, Massimiliano Todisco, Héctor Delgado, Md. Sahidullah, Junichi Yamagishi, Kong Aik Lee:
ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech. IEEE Trans. Biom. Behav. Identity Sci. 3(2): 252-265 (2021) - 2020
- [j41]Ville Vestman, Tomi Kinnunen, Rosa González Hautamäki, Md. Sahidullah:
Voice Mimicry Attacks Assisted by Automatic Speaker Verification. Comput. Speech Lang. 59: 36-54 (2020) - [j40]Jean-François Bonastre, Tomi Kinnunen, Anthony Larcher, Junichi Yamagishi:
Introduction to the special issue "Speaker and language characterization and recognition: Voice modeling, conversion, synthesis and ethical aspects". Comput. Speech Lang. 60 (2020) - [j39]Alexey Sholokhov, Tomi Kinnunen, Ville Vestman, Kong Aik Lee:
Voice biometrics security: Extrapolating false alarm rate via hierarchical Bayesian modeling of speaker verification scores. Comput. Speech Lang. 60 (2020) - [j38]Bhusan Chettri, Tomi Kinnunen, Emmanouil Benetos:
Deep generative variational autoencoding for replay spoof detection in automatic speaker verification. Comput. Speech Lang. 63: 101092 (2020) - [j37]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Zhen-Hua Ling:
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech. Comput. Speech Lang. 64: 101114 (2020) - [j36]Tomi Kinnunen, Héctor Delgado, Nicholas W. D. Evans, Kong Aik Lee, Ville Vestman, Andreas Nautsch, Massimiliano Todisco, Xin Wang, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2195-2210 (2020) - 2019
- [j35]Emma Jokinen, Rahim Saeidi, Tomi Kinnunen, Paavo Alku:
Vocal effort compensation for MFCC feature extraction in a shouted versus normal speaker recognition task. Comput. Speech Lang. 53: 1-11 (2019) - [j34]Akihiro Kato, Tomi H. Kinnunen:
Statistical Regression Models for Noise Robust F0 Estimation Using Recurrent Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2336-2349 (2019) - 2018
- [j33]Alexey Sholokhov, Md. Sahidullah, Tomi Kinnunen:
Semi-supervised speech activity detection with an application to automatic speaker verification. Comput. Speech Lang. 47: 132-156 (2018) - [j32]Ville Vestman, Dhananjaya N. Gowda, Md. Sahidullah, Paavo Alku, Tomi Kinnunen:
Speaker recognition from whispered speech: A tutorial survey and an application of time-varying linear prediction. Speech Commun. 99: 62-79 (2018) - [j31]Md. Sahidullah, Dennis Alexander Lehmann Thomsen, Rosa González Hautamäki, Tomi Kinnunen, Zheng-Hua Tan, Robert Parts, Martti Pitkänen:
Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones. IEEE ACM Trans. Audio Speech Lang. Process. 26(1): 44-56 (2018) - 2017
- [j30]Junichi Yamagishi, Tomi Kinnunen, Nicholas W. D. Evans, Phillip L. De Leon, Isabel Trancoso:
Introduction to the Issue on Spoofing and Countermeasures for Automatic Speaker Verification. IEEE J. Sel. Top. Signal Process. 11(4): 585-587 (2017) - [j29]Zhizheng Wu, Junichi Yamagishi, Tomi Kinnunen, Cemal Hanilçi, Md. Sahidullah, Aleksandr Sizov, Nicholas W. D. Evans, Massimiliano Todisco:
ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge. IEEE J. Sel. Top. Signal Process. 11(4): 588-604 (2017) - [j28]Rosa González Hautamäki, Md. Sahidullah, Ville Hautamäki, Tomi Kinnunen:
Acoustical and perceptual study of voice disguise by age modification in speaker verification. Speech Commun. 95: 1-15 (2017) - [j27]Aleksandr Sizov, Kong-Aik Lee, Tomi Kinnunen:
Direct Optimization of the Detection Cost for I-Vector-Based Spoken Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 588-597 (2017) - 2016
- [j26]Md. Sahidullah, Tomi Kinnunen:
Local spectral variability features for speaker verification. Digit. Signal Process. 50: 1-11 (2016) - [j25]Cemal Hanilçi, Tomi Kinnunen, Md. Sahidullah, Aleksandr Sizov:
Spoofing detection goes noisy: An analysis of synthetic speech detection in the presence of additive noise. Speech Commun. 85: 83-97 (2016) - [j24]Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Tomi Kinnunen, Chin-Hui Lee:
i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(1): 29-41 (2016) - 2015
- [j23]Hamid Behravan, Ville Hautamäki, Tomi Kinnunen:
Factors affecting i-vector based foreign accent recognition: A case study in spoken Finnish. Speech Commun. 66: 118-129 (2015) - [j22]Zhizheng Wu, Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, Haizhou Li:
Spoofing and countermeasures for speaker verification: A survey. Speech Commun. 66: 130-153 (2015) - [j21]Rosa González Hautamäki, Tomi Kinnunen, Ville Hautamäki, Anne-Maria Laukkanen:
Automatic versus human speaker verification: The case of voice mimicry. Speech Commun. 72: 13-31 (2015) - [j20]Aleksandr Sizov, Elie Khoury, Tomi Kinnunen, Zhizheng Wu, Sébastien Marcel:
Joint Speaker Verification and Antispoofing in the i-Vector Space. IEEE Trans. Inf. Forensics Secur. 10(4): 821-832 (2015) - 2014
- [j19]Padmanabhan Rajan, Anton Afanasyev, Ville Hautamäki, Tomi Kinnunen:
From single to multiple enrollment i-vectors: Practical PLDA scoring variants for speaker verification. Digit. Signal Process. 31: 93-101 (2014) - [j18]Cemal Hanilçi, Tomi Kinnunen:
Source cell-phone recognition from recorded speech using non-speech segments. Digit. Signal Process. 35: 75-85 (2014) - [j17]Jouni Pohjalainen, Cemal Hanilçi, Tomi Kinnunen, Paavo Alku:
Mixture Linear Prediction in Speaker Verification Under Vocal Effort Mismatch. IEEE Signal Process. Lett. 21(12): 1516-1520 (2014) - 2013
- [j16]Md. Jahangir Alam, Tomi Kinnunen, Patrick Kenny, Pierre Ouellet, Douglas D. O'Shaughnessy:
Multitaper MFCC and PLP features for speaker verification using i-vectors. Speech Commun. 55(2): 237-251 (2013) - [j15]Olaf Schleusing, Tomi Kinnunen, Brad H. Story, Jean-Marc Vesin:
Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution. IEEE Trans. Speech Audio Process. 21(8): 1560-1572 (2013) - [j14]Ville Hautamäki, Tomi Kinnunen, Filip Sedlak, Kong-Aik Lee, Bin Ma, Haizhou Li:
Sparse Classifier Fusion for Speaker Verification. IEEE Trans. Speech Audio Process. 21(8): 1622-1631 (2013) - 2012
- [j13]Cemal Hanilçi, Tomi Kinnunen, Figen Ertas, Rahim Saeidi, Jouni Pohjalainen, Paavo Alku:
Regularized All-Pole Models for Speaker Verification Under Noisy Environments. IEEE Signal Process. Lett. 19(3): 163-166 (2012) - [j12]Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for Voice Conversion. IEEE Signal Process. Lett. 19(12): 914-917 (2012) - [j11]Tomi Kinnunen, Rahim Saeidi, Filip Sedlak, Kong-Aik Lee, Johan Sandberg, Maria Hansson-Sandsten, Haizhou Li:
Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification. IEEE Trans. Speech Audio Process. 20(7): 1990-2001 (2012) - [j10]Pejman Mowlaee, Rahim Saeidi, Mads Græsbøll Christensen, Zheng-Hua Tan, Tomi Kinnunen, Pasi Fränti, Søren Holdt Jensen:
A Joint Approach for Single-Channel Speaker Identification and Speech Separation. IEEE Trans. Speech Audio Process. 20(9): 2586-2601 (2012) - 2011
- [j9]Tomi Kinnunen, Ilja Sidoroff, Marko Tuononen, Pasi Fränti:
Comparison of clustering methods: A case study of text-independent speaker modeling. Pattern Recognit. Lett. 32(13): 1604-1617 (2011) - [j8]Kong Aik Lee, Chang Huai You, Haizhou Li, Tomi Kinnunen, Khe Chai Sim:
Using Discrete Probabilities With Bhattacharyya Measure for SVM-Based Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 19(4): 861-870 (2011) - 2010
- [j7]Tomi Kinnunen, Haizhou Li:
An overview of text-independent speaker recognition: From features to supervectors. Speech Commun. 52(1): 12-40 (2010) - [j6]Johan Sandberg, Maria Hansson-Sandsten, Tomi Kinnunen, Rahim Saeidi, Patrick Flandrin, Pierre Borgnat:
Multitaper Estimation of Frequency-Warped Cepstra With Application to Speaker Verification. IEEE Signal Process. Lett. 17(4): 343-346 (2010) - [j5]Rahim Saeidi, Jouni Pohjalainen, Tomi Kinnunen, Paavo Alku:
Temporally Weighted Linear Prediction Features for Tackling Additive Noise in Speaker Verification. IEEE Signal Process. Lett. 17(6): 599-602 (2010) - 2009
- [j4]Tomi Kinnunen, Juhani Saastamoinen, Ville Hautamäki, Mikko Vinni, Pasi Fränti:
Comparative evaluation of maximum a Posteriori vector quantization and gaussian mixture models in speaker verification. Pattern Recognit. Lett. 30(4): 341-347 (2009) - 2008
- [j3]Ville Hautamäki, Tomi Kinnunen, Pasi Fränti:
Text-independent speaker recognition using graph matching. Pattern Recognit. Lett. 29(9): 1427-1432 (2008) - [j2]Ville Hautamäki, Tomi Kinnunen, Ismo Kärkkäinen, Juhani Saastamoinen, Marko Tuononen, Pasi Fränti:
Maximum a Posteriori Adaptation of the Centroid Model for Speaker Verification. IEEE Signal Process. Lett. 15: 162-165 (2008) - 2006
- [j1]Tomi Kinnunen, Evgeny Karpov, Pasi Fränti:
Real-time speaker identification and verification. IEEE Trans. Speech Audio Process. 14(1): 277-288 (2006)
Conference and Workshop Papers
- 2024
- [c144]Hye-jin Shim, Jee-weon Jung, Tomi Kinnunen, Nicholas W. D. Evans, Jean-François Bonastre, Itshak Lapidot:
a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification. Odyssey 2024: 158-164 - 2023
- [c143]Mark Anderson, Tomi Kinnunen, Naomi Harte:
Learnable Frontends That Do Not Learn: Quantifying Sensitivity To Filterbank Initialisation. ICASSP 2023: 1-5 - [c142]Hye-jin Shim, Rosa González Hautamäki, Md. Sahidullah, Tomi Kinnunen:
How to Construct Perfect and Worse-than-Coin-Flip Spoofing Countermeasures: A Word of Warning on Shortcut Learning. INTERSPEECH 2023: 785-789 - [c141]Vishwanath Pratap Singh, Md. Sahidullah, Tomi Kinnunen:
Speaker Verification Across Ages: Investigating Deep Speaker Embedding Sensitivity to Age Mismatch in Enrollment and Test Speech. INTERSPEECH 2023: 1948-1952 - [c140]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Speaker-Aware Anti-spoofing. INTERSPEECH 2023: 2498-2502 - [c139]Hye-jin Shim, Jee-weon Jung, Tomi Kinnunen:
Multi-Dataset Co-Training with Sharpness-Aware Optimization for Audio Anti-spoofing. INTERSPEECH 2023: 3804-3808 - [c138]Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md. Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung:
Towards Single Integrated Spoofing-aware Speaker Verification Embeddings. INTERSPEECH 2023: 3989-3993 - 2022
- [c137]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Learnable Nonlinear Compression for Robust Speaker Verification. ICASSP 2022: 7962-7966 - [c136]Jee-weon Jung, Hemlata Tak, Hye-jin Shim, Hee-Soo Heo, Bong-Jin Lee, Soo-Whan Chung, Ha-Jin Yu, Nicholas W. D. Evans, Tomi Kinnunen:
SASV 2022: The First Spoofing-Aware Speaker Verification Challenge. INTERSPEECH 2022: 2893-2897 - [c135]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation. Odyssey 2022: 85-91 - [c134]Alexey Sholokhov, Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Baselines and Protocols for Household Speaker Recognition. Odyssey 2022: 185-192 - [c133]Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans:
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. Odyssey 2022: 330-337 - [c132]Sandip Ghimire, Tomi Kinnunen, Rosa González Hautamäki:
Gamified Speaker Comparison by Listening. Odyssey 2022: 421-427 - [c131]Rhythm Bhatia, Tomi H. Kinnunen:
An Initial Study on Birdsong Re-synthesis Using Neural Vocoders. SPECOM 2022: 64-74 - 2021
- [c130]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Optimized Power Normalized Cepstral Coefficients Towards Robust Deep Speaker Verification. ASRU 2021: 185-190 - [c129]Khaled Hechmi, Trung Ngo Trong, Ville Hautamäki, Tomi Kinnunen:
Voxceleb Enrichment for Age and Gender Recognition. ASRU 2021: 687-693 - [c128]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Parameterized Channel Normalization for Far-Field Deep Speaker Verification. ASRU 2021: 1132-1138 - [c127]Bhusan Chettri, Rosa González Hautamäki, Md. Sahidullah, Tomi Kinnunen:
Data Quality as Predictor of Voice Anti-Spoofing Generalization. Interspeech 2021: 1659-1663 - [c126]Tomi Kinnunen, Andreas Nautsch, Md. Sahidullah, Nicholas W. D. Evans, Xin Wang, Massimiliano Todisco, Héctor Delgado, Junichi Yamagishi, Kong Aik Lee:
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing. Interspeech 2021: 4299-4303 - [c125]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Learnable MFCCs for Speaker Verification. ISCAS 2021: 1-5 - [c124]Md. Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent:
UIAI System for Short-Duration Speaker Verification Challenge 2020. SLT 2021: 323-329 - [c123]Lauri Tavi, Tomi Kinnunen, Einar Meister, Rosa González Hautamäki, Anton Malmi:
Articulation During Voice Disguise: A Pilot Study. SPECOM 2021: 680-691 - 2020
- [c122]Yi Zhao, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhen-Hua Ling, Tomoki Toda:
Voice Conversion Challenge 2020 -- Intra-lingual semi-parallel and cross-lingual voice conversion --. Blizzard Challenge / Voice Conversion Challenge 2020 - [c121]Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhen-Hua Ling, Junichi Yamagishi, Yi Zhao, Xiaohai Tian, Tomoki Toda:
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions. Blizzard Challenge / Voice Conversion Challenge 2020 - [c120]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings. INTERSPEECH 2020: 3221-3225 - [c119]Rohan Kumar Das, Xiaohai Tian, Tomi Kinnunen, Haizhou Li:
The Attacker's Perspective on Automatic Speaker Verification: An Overview. INTERSPEECH 2020: 4213-4217 - [c118]Alexey Sholokhov, Tomi Kinnunen, Ville Vestman, Kong Aik Lee:
Extrapolating False Alarm Rates in Automatic Speaker Verification. INTERSPEECH 2020: 4218-4222 - [c117]Rosa González Hautamäki, Tomi Kinnunen:
Why Did the x-Vector System Miss a Target Speaker? Impact of Acoustic Mismatch Upon Target Score on VoxCeleb Data. INTERSPEECH 2020: 4313-4317 - [c116]Ville Vestman, Kong Aik Lee, Tomi Kinnunen:
Neural i-vectors. Odyssey 2020: 67-74 - [c115]Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, Junichi Yamagishi:
An Initial Investigation on Optimizing Tandem Speaker Verification and Countermeasure Systems Using Reinforcement Learning. Odyssey 2020: 151-158 - [c114]Bhusan Chettri, Tomi Kinnunen, Emmanouil Benetos:
Subband Modeling for Spoofing Detection in Automatic Speaker Verification. Odyssey 2020: 341-348 - 2019
- [c113]Rosa González Hautamäki, Tomi H. Kinnunen:
Towards Controlling False Alarm - Miss Trade-Off in Perceptual Speaker Comparison via Non-Neutral Listening Task Framing. ASRU 2019: 749-756 - [c112]Ville Vestman, Bilal Soomro, Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen:
Who Do I Sound like? Showcasing Speaker Recognition Technology by Youtube Voice Search. ICASSP 2019: 5781-5785 - [c111]Tomi Kinnunen, Rosa González Hautamäki, Ville Vestman, Md. Sahidullah:
Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection. ICASSP 2019: 6146-6150 - [c110]Xiaoting Wu, Eric Granger, Tomi H. Kinnunen, Xiaoyi Feng, Abdenour Hadid:
Audio-Visual Kinship Verification in the Wild. ICB 2019: 1-8 - [c109]Ville Vestman, Kong Aik Lee, Tomi H. Kinnunen, Takafumi Koshinaka:
Unleashing the Unused Potential of i-Vectors Enabled by GPU Acceleration. INTERSPEECH 2019: 351-355 - [c108]Massimiliano Todisco, Xin Wang, Ville Vestman, Md. Sahidullah, Héctor Delgado, Andreas Nautsch, Junichi Yamagishi, Nicholas W. D. Evans, Tomi H. Kinnunen, Kong Aik Lee:
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. INTERSPEECH 2019: 1008-1012 - [c107]Kong Aik Lee, Ville Hautamäki, Tomi H. Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Massimiliano Todisco:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. INTERSPEECH 2019: 1497-1501 - 2018
- [c106]Massimiliano Todisco, Héctor Delgado, Kong-Aik Lee, Md. Sahidullah, Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi:
Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion. INTERSPEECH 2018: 77-81 - [c105]Akihiro Kato, Tomi Kinnunen:
Waveform to Single Sinusoid Regression to Estimate the F0 Contour from Noisy Speech Using Recurrent Deep Neural Networks. INTERSPEECH 2018: 327-331 - [c104]Tomi Kinnunen, Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Zhen-Hua Ling:
A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment. Odyssey 2018: 187-194 - [c103]Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Tomi Kinnunen, Zhen-Hua Ling:
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods. Odyssey 2018: 195-202 - [c102]Jaime Lorenzo-Trueba, Fuming Fang, Xin Wang, Isao Echizen, Junichi Yamagishi, Tomi Kinnunen:
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data. Odyssey 2018: 240-247 - [c101]Akihiro Kato, Tomi Kinnunen:
A Regression Model of Recurrent Deep Neural Networks for Noise Robust Estimation of the Fundamental Frequency Contour of Speech. Odyssey 2018: 275-282 - [c100]Héctor Delgado, Massimiliano Todisco, Md. Sahidullah, Nicholas W. D. Evans, Tomi Kinnunen, Kong-Aik Lee, Junichi Yamagishi:
ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements. Odyssey 2018: 296-303 - [c99]Tomi Kinnunen, Kong-Aik Lee, Héctor Delgado, Nicholas W. D. Evans, Massimiliano Todisco, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification. Odyssey 2018: 312-319 - [c98]Rosa González Hautamäki, Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen:
Perceptual Evaluation of the Effectiveness of Voice Disguise by Age Modification. Odyssey 2018: 320-326 - [c97]Ville Vestman, Tomi Kinnunen:
Supervector Compression Strategies to Speed up I-Vector System Development. Odyssey 2018: 357-364 - [c96]Fuming Fang, Junichi Yamagishi, Isao Echizen, Md. Sahidullah, Tomi Kinnunen:
Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems. WIFS 2018: 1-9 - 2017
- [c95]Héctor Delgado, Massimiliano Todisco, Nicholas W. D. Evans, Md. Sahidullah, Wei Ming Liu, Federico Alegre, Tomi Kinnunen, Benoit G. B. Fauve:
Impact of Bandwidth and Channel Variation on Presentation Attack Detection for Speaker Verification. BIOSIG 2017: 173-183 - [c94]Anssi Kanervisto, Ville Vestman, Md. Sahidullah, Ville Hautamäki, Tomi Kinnunen:
Effects of gender information in text-independent and text-dependent speaker verification. ICASSP 2017: 5360-5364 - [c93]Tomi Kinnunen, Md. Sahidullah, Mauro Falcone, Luca Costantini, Rosa González Hautamäki, Dennis Alexander Lehmann Thomsen, Achintya Kumar Sarkar, Zheng-Hua Tan, Héctor Delgado, Massimiliano Todisco, Nicholas W. D. Evans, Ville Hautamäki, Kong-Aik Lee:
RedDots replayed: A new replay spoofing attack corpus for text-dependent speaker verification research. ICASSP 2017: 5395-5399 - [c92]Tomi Kinnunen, Lauri Juvela, Paavo Alku, Junichi Yamagishi:
Non-parallel voice conversion using i-vector PLDA: towards unifying speaker verification and transformation. ICASSP 2017: 5535-5539 - [c91]Tomi Kinnunen, Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Nicholas W. D. Evans, Junichi Yamagishi, Kong-Aik Lee:
The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection. INTERSPEECH 2017: 2-6 - [c90]Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, Chunlei Zhang, Andreas Nautsch, Themos Stafylakis, Gang Liu, Mickaël Rouvier, Wei Rao, Federico Alegre, J. Ma, Man-Wai Mak, Achintya Kumar Sarkar, Héctor Delgado, Rahim Saeidi, Hagai Aronowitz, Aleksandr Sizov, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Bin Ma, Ville Vestman, Md. Sahidullah, M. Halonen, Anssi Kanervisto, Gaël Le Lan, Fahimeh Bahmaninezhad, Sergey Isadskiy, Christian Rathgeb, Christoph Busch, Georgios Tzimiropoulos, Q. Qian, Z. Wang, Q. Zhao, T. Wang, H. Li, J. Xue, S. Zhu, R. Jin, T. Zhao, Pierre-Michel Bousquet, Moez Ajili, Waad Ben Kheder, Driss Matrouf, Zhi Hao Lim, Chenglin Xu, Haihua Xu, Xiong Xiao, Eng Siong Chng, Benoit G. B. Fauve, Kaavya Sriskandaraja, Vidhyasaharan Sethu, W. W. Lin, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Massimiliano Todisco, Nicholas W. D. Evans, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Eliathamby Ambikairajah:
The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016. INTERSPEECH 2017: 1328-1332 - [c89]Ville Vestman, Dhananjaya N. Gowda, Md. Sahidullah, Paavo Alku, Tomi Kinnunen:
Time-Varying Autoregressions for Speaker Verification in Reverberant Conditions. INTERSPEECH 2017: 1512-1516 - [c88]Achintya Kumar Sarkar, Md. Sahidullah, Zheng-Hua Tan, Tomi Kinnunen:
Improving Speaker Verification Performance in Presence of Spoofing Attacks Using Out-of-Domain Spoofed Data. INTERSPEECH 2017: 2611-2615 - 2016
- [c87]Alexey Sholokhov, Tomi Kinnunen, Sandro Cumani:
Discriminative multi-domain PLDA for speaker verification. ICASSP 2016: 5030-5034 - [c86]Tomi Kinnunen, Md. Sahidullah, Ivan Kukanov, Héctor Delgado, Massimiliano Todisco, Achintya Kumar Sarkar, Nicolai Bæk Thomsen, Ville Hautamäki, Nicholas W. D. Evans, Zheng-Hua Tan:
Utterance Verification for Text-Dependent Speaker Recognition: A Comparative Assessment Using the RedDots Corpus. INTERSPEECH 2016: 430-434 - [c85]Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Hong Yu, Tomi Kinnunen, Nicholas W. D. Evans, Zheng-Hua Tan:
Integrated Spoofing Countermeasures and Automatic Speaker Verification: An Evaluation on ASVspoof 2015. INTERSPEECH 2016: 1700-1704 - [c84]Md. Sahidullah, Rosa González Hautamäki, Dennis Alexander Lehmann Thomsen, Tomi Kinnunen, Zheng-Hua Tan, Ville Hautamäki, Robert Parts, Martti Pitkänen:
Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech. INTERSPEECH 2016: 1720-1724 - [c83]Tomi Kinnunen, Alexey Sholokhov, Elie Khoury, Dennis Alexander Lehmann Thomsen, Md. Sahidullah, Zheng-Hua Tan:
HAPPY Team Entry to NIST OpenSAD Challenge: A Fusion of Short-Term Unsupervised and Segment i-Vector Based Speech Activity Detectors. INTERSPEECH 2016: 2992-2996 - [c82]Amir Hossein Poorjam, Rahim Saeidi, Tomi Kinnunen, Ville Hautamäki:
Incorporating uncertainty as a Quality Measure in I-Vector Based Language Recognition. Odyssey 2016: 74-80 - [c81]Aleksandr Sizov, Kong-Aik Lee, Tomi Kinnunen:
Discriminating Languages in a Probabilistic Latent Subspace. Odyssey 2016: 81-88 - [c80]Rosa González Hautamäki, Md. Sahidullah, Tomi Kinnunen, Ville Hautamäki:
Age-Related Voice Disguise and its Impact on Speaker Verification Accuracy. Odyssey 2016: 277-282 - [c79]Hamid Behravan, Tomi Kinnunen, Ville Hautamäki:
Out-of-Set i-Vector Selection for Open-set Language Identification. Odyssey 2016: 303-310 - [c78]Héctor Delgado, Massimiliano Todisco, Md. Sahidullah, Achintya Kumar Sarkar, Nicholas W. D. Evans, Tomi Kinnunen, Zheng-Hua Tan:
Further optimisations of constant Q cepstral processing for integrated utterance and text-dependent speaker verification. SLT 2016: 179-185 - [c77]Sami Sieranoja, Tomi Kinnunen, Pasi Fränti:
GPS Trajectory Biometrics: From Where You Were to How You Move. S+SSPR 2016: 450-460 - 2015
- [c76]Rahim Saeidi, Tuija Niemi, Hanna Karppelin, Jouni Pohjalainen, Tomi Kinnunen, Paavo Alku:
Speaker recognition for speech under face cover. INTERSPEECH 2015: 1012-1016 - [c75]Zhizheng Wu, Tomi Kinnunen, Nicholas W. D. Evans, Junichi Yamagishi, Cemal Hanilçi, Md. Sahidullah, Aleksandr Sizov:
ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge. INTERSPEECH 2015: 2037-2041 - [c74]Cemal Hanilçi, Tomi Kinnunen, Md. Sahidullah, Aleksandr Sizov:
Classifiers for synthetic speech detection: a comparison. INTERSPEECH 2015: 2057-2061 - [c73]Md. Sahidullah, Tomi Kinnunen, Cemal Hanilçi:
A comparison of features for synthetic speech detection. INTERSPEECH 2015: 2087-2091 - [c72]Anna Fedorova, Ondrej Glembek, Tomi Kinnunen, Pavel Matejka:
Exploring ANN back-ends for i-vector based speaker age estimation. INTERSPEECH 2015: 3036-3040 - [c71]Zhizheng Wu, Tomi Kinnunen:
Automatic speaker verification spoofing and countermeasures (ASVspoof 2015): introductory talk by the organizers. INTERSPEECH 2015 - 2014
- [c70]Alexey Sholokhov, Timur Pekhovsky, Oleg Kudashev, Andrey Shulipa, Tomi Kinnunen:
Bayesian analysis of similarity matrices for speaker diarization. ICASSP 2014: 106-110 - [c69]Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Tomi Kinnunen, Chin-Hui Lee:
Introducing attribute features to foreign accent recognition. ICASSP 2014: 5332-5336 - [c68]Elie Khoury, Tomi Kinnunen, Aleksandr Sizov, Zhizheng Wu, Sébastien Marcel:
Introducing i-vectors for joint anti-spoofing and speaker verification. INTERSPEECH 2014: 61-65 - [c67]Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Elie Khoury, Tommi Kurki, Tomi Kinnunen, Chin-Hui Lee:
Dialect levelling in Finnish: a universal speech attribute approach. INTERSPEECH 2014: 2165-2169 - [c66]Ville Hautamäki, Rosa González Hautamäki, Tomi Kinnunen, Anne-Maria Laukkanen:
Comparison of human listeners and speaker verification systems using voice mimicry data. Odyssey 2014: 137-144 - [c65]Alan McCree, Douglas A. Reynolds, Daniel Garcia-Romero, Tomi Kinnunen, Craig S. Greenberg, Désiré Bansé, George R. Doddington, John J. Godfrey, Alvin F. Martin, Mark A. Przybocki:
The NIST 2014 Speaker Recognition i-vector Machine Learning Challenge. Odyssey 2014: 224-230 - [c64]Ville Hautamäki, Antti Pöllänen, Tomi Kinnunen, Kong-Aik Lee, Haizhou Li, Pasi Fränti:
A Comparison of Categorical Attribute Data Clustering Methods. S+SSPR 2014: 53-62 - [c63]Aleksandr Sizov, Kong-Aik Lee, Tomi Kinnunen:
Unifying Probabilistic Linear Discriminant Analysis Variants in Biometric Authentication. S+SSPR 2014: 464-475 - 2013
- [c62]Tomi Kinnunen, Padmanabhan Rajan:
A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data. ICASSP 2013: 7229-7233 - [c61]Cemal Hanilçi, Tomi Kinnunen, Rahim Saeidi, Jouni Pohjalainen, Paavo Alku, Figen Ertas:
Speaker identification from shouted speech: Analysis and compensation. ICASSP 2013: 8027-8031 - [c60]Hamid Behravan, Ville Hautamäki, Tomi Kinnunen:
Foreign accent detection from spoken Finnish using i-vectors. INTERSPEECH 2013: 79-83 - [c59]Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi:
Spoofing and countermeasures for automatic speaker verification. INTERSPEECH 2013: 925-929 - [c58]Rosa González Hautamäki, Tomi Kinnunen, Ville Hautamäki, Timo Leino, Anne-Maria Laukkanen:
I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry. INTERSPEECH 2013: 930-934 - [c57]Zhizheng Wu, Anthony Larcher, Kong-Aik Lee, Engsiong Chng, Tomi Kinnunen, Haizhou Li:
Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints. INTERSPEECH 2013: 950-954 - [c56]Ville Hautamäki, Kong-Aik Lee, David A. van Leeuwen, Rahim Saeidi, Anthony Larcher, Tomi Kinnunen, Taufiq Hasan, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, John H. L. Hansen, Benoit G. B. Fauve:
Automatic regularization of cross-entropy cost for speaker recognition fusion. INTERSPEECH 2013: 1609-1613 - [c55]Rahim Saeidi, Kong-Aik Lee, Tomi Kinnunen, Tawfik Hasan, Benoit G. B. Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Jia Min Karen Kua, Changhuai You, Hanwu Sun, Anthony Larcher, Padmanabhan Rajan, Ville Hautamäki, Cemal Hanilçi, Billy Braithwaite, Rosa González Hautamäki, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, Navid Shokouhi, Driss Matrouf, Laurent El Shafey, Pejman Mowlaee, Julien Epps, Tharmarajah Thiruvaran, David A. van Leeuwen, Bin Ma, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Sébastien Marcel, John S. D. Mason, Eliathamby Ambikairajah:
I4u submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification. INTERSPEECH 2013: 1986-1990 - [c54]Padmanabhan Rajan, Tomi Kinnunen, Cemal Hanilçi, Jouni Pohjalainen, Paavo Alku:
Using group delay functions from all-pole models for speaker recognition. INTERSPEECH 2013: 2489-2493 - [c53]Rosa González Hautamäki, Ville Hautamäki, Padmanabhan Rajan, Tomi Kinnunen:
Merging human and automatic system decisions to improve speaker recognition performance. INTERSPEECH 2013: 2519-2523 - [c52]Cemal Hanilçi, Tomi Kinnunen, Padmanabhan Rajan, Jouni Pohjalainen, Paavo Alku, Figen Ertas:
Comparison of spectrum estimators in speaker verification: mismatch conditions induced by vocal effort. INTERSPEECH 2013: 2881-2885 - [c51]Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Exemplar-based unit selection for voice conversion utilizing temporal information. INTERSPEECH 2013: 3057-3061 - [c50]Tomi Kinnunen, Md. Jahangir Alam, Pavel Matejka, Patrick Kenny, Jan Cernocký, Douglas D. O'Shaughnessy:
Frequency warping and robust speaker verification: a comparison of alternative mel-scale representations. INTERSPEECH 2013: 3122-3126 - [c49]Padmanabhan Rajan, Tomi Kinnunen, Ville Hautamäki:
Effect of multicondition training on i-vector PLDA configurations for speaker recognition. INTERSPEECH 2013: 3694-3697 - [c48]Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, Haizhou Li:
Exemplar-based voice conversion using non-negative spectrogram deconvolution. SSW 2013: 201-206 - 2012
- [c47]Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li, Eliathamby Ambikairajah:
A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case. APSIPA 2012: 1-5 - [c46]Tomi Kinnunen, Henri Leisma, Monika Machunik, Tuomo Kakkonen, Jean-Luc LeBrun:
SWAN - Scientific Writing AssistaNt. A Tool for Helping Scholars to Write Reader-Friendly Manuscripts. EACL 2012: 20-24 - [c45]Tomi Kinnunen, Zhizheng Wu, Kong-Aik Lee, Filip Sedlak, Engsiong Chng, Haizhou Li:
Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech. ICASSP 2012: 4401-4404 - [c44]Cemal Hanilçi, Tomi Kinnunen, Rahim Saeidi, Jouni Pohjalainen, Paavo Alku, Figen Ertas, Johan Sandberg, Maria Hansson-Sandsten:
Comparing spectrum estimators in speaker verification under additive noise degradation. ICASSP 2012: 4769-4772 - [c43]Sadjad Siddiq, Tomi Kinnunen, Martti Vainio, Stefan Werner:
Intonational speaker verification: A study on parameters and performance under noisy conditions. ICASSP 2012: 4777-4780 - [c42]Cemal Hanilçi, Tomi Kinnunen, Rahim Saeidi, Jouni Pohjalainen, Paavo Alku, Figen Ertas:
Regularization of all-pole models for speaker verification under additive noise. Odyssey 2012: 236-242 - [c41]Ville Hautamäki, Kong-Aik Lee, Anthony Larcher, Tomi Kinnunen, Bin Ma, Haizhou Li:
Variational Bayes logistic regression as regularized fusion for NIST SRE 2010. Odyssey 2012: 268-274 - [c40]Tomi Kinnunen, Rahim Saeidi, Jussi Leppänen, Jukka Saarinen:
Audio context recognition in variable mobile environments from short segments using speaker and language recognizers. Odyssey 2012: 304-311 - 2011
- [c39]Luis Javier Rodríguez, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel, David Martínez González, Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida, Alberto Abad, Oscar Koller, Isabel Trancoso, Paula Lopez-Otero, Laura Docío Fernández, Carmen García-Mateo, Rahim Saeidi, Mehdi Soufifar, Tomi Kinnunen, Torbjørn Svendsen, Pasi Fränti:
Multi-site heterogeneous system fusions for the Albayzin 2010 Language Recognition Evaluation. ASRU 2011: 377-382 - [c38]Md. Jahangir Alam, Tomi Kinnunen, Patrick Kenny, Pierre Ouellet, Douglas D. O'Shaughnessy:
Multi-taper MFCC features for speaker verification using I-vectors. ASRU 2011: 547-552 - [c37]Filip Sedlak, Tomi Kinnunen, Ville Hautamäki, Kong-Aik Lee, Haizhou Li:
Classifier subset selection and fusion for speaker verification. ICASSP 2011: 4544-4547 - [c36]Jouni Pohjalainen, Paavo Alku, Tomi Kinnunen:
Shout detection in noise. ICASSP 2011: 4968-4971 - [c35]Pejman Mowlaee, Rahim Saeidi, Zheng-Hua Tan, Mads Græsbøll Christensen, Tomi Kinnunen, Pasi Fränti, Søren Holdt Jensen:
Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge. INTERSPEECH 2011: 677-680 - [c34]Ville Hautamäki, Kong-Aik Lee, Tomi Kinnunen, Bin Ma, Haizhou Li:
Regularized Logistic Regression Fusion for Speaker Verification. INTERSPEECH 2011: 2745-2748 - 2010
- [c33]Tomi Kinnunen, Filip Sedlak, Roman Bednarik:
Towards task-independent person authentication using eye movement signals. ETRA 2010: 187-190 - [c32]Kong-Aik Lee, Haizhou Li, Chang Huai You, Tomi Kinnunen, Khe Chai Sim:
Discrete expected likelihood kernel for SVM-based speaker verification. EUSIPCO 2010: 591-595 - [c31]Rahim Saeidi, Tomi Kinnunen, Hamid Reza Sadegh Mohammadi, Robert D. Rodman, Pasi Fränti:
Joint frame and Gaussian selection for text independent speaker verification. ICASSP 2010: 4530-4533 - [c30]Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen, Pasi Fränti:
Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals. ICPR 2010: 4565-4568 - [c29]Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen, Pasi Fränti:
Improving monaural speaker identification by double-talk detection. INTERSPEECH 2010: 1069-1072 - [c28]Ville Hautamäki, Tomi Kinnunen, Mohaddeseh Nosratighods, Kong-Aik Lee, Bin Ma, Haizhou Li:
Approaching human listener accuracy with modern speaker verification. INTERSPEECH 2010: 1473-1476 - [c27]Jouni Pohjalainen, Rahim Saeidi, Tomi Kinnunen, Paavo Alku:
Extended weighted linear prediction (XLP) analysis of speech and its application to speaker verification in adverse conditions. INTERSPEECH 2010: 1477-1480 - [c26]Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Text-independent F0 transformation with non-parallel data for voice conversion. INTERSPEECH 2010: 1732-1735 - [c25]Tomi Kinnunen, Rahim Saeidi, Johan Sandberg, Maria Hansson-Sandsten:
What else is new than the hamming window? robust MFCCs for speaker recognition via multitapering. INTERSPEECH 2010: 2734-2737 - [c24]Rahim Saeidi, Jouni Pohjalainen, Tomi Kinnunen, Paavo Alku:
Temporally Weighted Linear Prediction Features for Speaker Verification in Additive Noise. Odyssey 2010: 8 - 2009
- [c23]Pasi Fränti, Juhani Saastamoinen, Ismo Kärkkäinen, Tomi Kinnunen, Ville Hautamäki, Ilja Sidoroff:
Developing Speaker Recognition System: From Prototype to Practical Application. e-Forensics 2009: 102-115 - [c22]Tomi Kinnunen, Juhani Saastamoinen, Ville Hautamäki, Mikko Vinni, Pasi Fränti:
Comparing maximum a posteriori vector quantization and Gaussian mixture models in speaker verification. ICASSP 2009: 4229-4232 - [c21]Tomi Kinnunen, Paavo Alku:
On separating glottal source and vocal tract information in telephony speaker verification. ICASSP 2009: 4545-4548 - 2008
- [c20]Kong-Aik Lee, Changhuai You, Haizhou Li, Tomi Kinnunen, Donglai Zhu:
Characterizing speech utterances for speaker verification with sequence kernel SVM. INTERSPEECH 2008: 1397-1400 - [c19]Tomi Kinnunen, Kong-Aik Lee, Haizhou Li:
Dimension reduction of the modulation spectrogram for speaker verification. Odyssey 2008: 30 - 2007
- [c18]Rahim Saeidi, H. R. S. Mohammadi, Robert D. Rodman, Tomi Kinnunen:
A New Segmentation Algorithm Combined with Transient Frames Power for Text Independent Speaker Verification. ICASSP (4) 2007: 305-308 - [c17]Tomi Kinnunen, Bingjun Zhang, Jia Zhu, Ye Wang:
Speaker Verification with Adaptive Spectral Subband Centroids. ICB 2007: 58-66 - [c16]Kong-Aik Lee, Changhuai You, Haizhou Li, Tomi Kinnunen:
A GMM-based probabilistic sequence kernel for speaker verification. INTERSPEECH 2007: 294-297 - 2006
- [c15]Tomi Kinnunen:
Joint Acoustic-Modulation Frequency for Speaker Recognition. ICASSP (1) 2006: 665-668 - [c14]Tomi Kinnunen, Ville Hautamäki, Pasi Fränti:
On the Use of Long-Term Average Spectrum in Automatic Speaker Recognition. ISCSLP 2006 - [c13]Tomi Kinnunen, Chin-Wei Eugene Koh, Lei Wang, Haizhou Li, Eng Siong Chng:
Temporal Discrete Cosine Transform: Towards Longer Term Temporal Features for Speaker Verification. ISCSLP 2006 - [c12]Kong-Aik Lee, Hanwu Sun, Rong Tong, Bin Ma, Minghui Dong, Changhuai You, Donglai Zhu, Chin-Wei Eugene Koh, Lei Wang, Tomi Kinnunen, Chng Eng Siong, Haizhou Li:
The IIR Submission to CSLP 2006 Speaker Recognition Evaluation. ISCSLP (Selected Papers) 2006: 494-505 - [c11]Rong Tong, Bin Ma, Kong-Aik Lee, Changhuai You, Donglai Zhu, Tomi Kinnunen, Hanwu Sun, Minghui Dong, Chng Eng Siong, Haizhou Li:
Fusion of Acoustic and Tokenization Features for Speaker Recognition. ISCSLP (Selected Papers) 2006: 566-577 - 2005
- [c10]Roman Bednarik, Tomi Kinnunen, Andrei Mihaila, Pasi Fränti:
Eye-Movements as a Biometric. SCIA 2005: 780-789 - [c9]Ville Hautamäki, Svetlana Cherednichenko, Ismo Kärkkäinen, Tomi Kinnunen, Pasi Fränti:
Improving K-Means by Outlier Removal. SCIA 2005: 978-987 - 2004
- [c8]Pasi Fränti, Evgeny Karpov, Tomi Kinnunen:
Real-time speaker identification. INTERSPEECH 2004: 1805-1808 - [c7]Tomi Kinnunen, Evgeny Karpov, Pasi Fränti:
Efficient online cohort selection method for speaker verification. INTERSPEECH 2004: 2401-2404 - 2003
- [c6]Tomi Kinnunen, Evgeny Karpov, Pasi Fränti:
A Speaker Pruning Algorithm for Real-Time Speaker Identification. AVBPA 2003: 639-646 - [c5]Tomi Kinnunen, Ville Hautamäki, Pasi Fränti:
On the fusion of dissimilarity-based classifiers for speaker identification. INTERSPEECH 2003: 2641-2644 - 2002
- [c4]Tomi Kinnunen:
Designing a speaker-discriminative adaptive filter bank for speaker recognition. INTERSPEECH 2002: 2325-2328 - [c3]Tomi Kinnunen, Ismo Kärkkäinen:
Class-Discriminative Weighted Distortion Measure for VQ-based Speaker Identification. SSPR/SPR 2002: 681-688 - 2001
- [c2]Tomi Kinnunen, Pasi Fränti:
Speaker Discriminative Weighting Method for VQ-Based Speaker Identification. AVBPA 2001: 150-156 - [c1]Tomi Kinnunen, Ismo Kärkkäinen, Pasi Fränti:
Is speech data clustered? - statistical analysis of cepstral features. INTERSPEECH 2001: 2627-2630
Parts in Books or Collections
- 2019
- [p2]Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Tomi Kinnunen, Nicholas W. D. Evans, Junichi Yamagishi, Kong-Aik Lee:
Introduction to Voice Presentation Attack Detection and Recent Advances. Handbook of Biometric Anti-Spoofing, 2nd Ed. 2019: 321-361 - 2014
- [p1]Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi, Zhizheng Wu, Federico Alegre, Phillip L. De Leon:
Speaker Recognition Anti-spoofing. Handbook of Biometric Anti-Spoofing 2014: 125-146
Editorship
- 2020
- [e1]Junichi Yamagishi, Zhenhua Ling, Rohan Kumar Das, Simon King, Tomi Kinnunen, Tomoki Toda, Wen-Chin Huang, Xiao Zhou, Xiaohai Tian, Yi Zhao:
Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, Shanghai, China, October 30, 2020. ISCA 2020 [contents]
Reference Works
- 2015
- [r2]Nicholas W. D. Evans, Federico Alegre, Zhizheng Wu, Tomi Kinnunen:
Anti-spoofing, Voice Conversion. Encyclopedia of Biometrics 2015: 115-122 - [r1]Nicholas W. D. Evans, Federico Alegre, Tomi Kinnunen, Zhizheng Wu, Junichi Yamagishi:
Anti-spoofing, Voice Databases. Encyclopedia of Biometrics 2015: 123-128
Informal and Other Publications
- 2024
- [i67]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Generalizing Speaker Verification for Spoof Awareness in the Embedding Space. CoRR abs/2401.11156 (2024) - [i66]Vishwanath Pratap Singh, Md. Sahidullah, Tomi Kinnunen:
ChildAugment: Data Augmentation Methods for Zero-Resource Children's Speaker Verification. CoRR abs/2402.15214 (2024) - [i65]Hye-jin Shim, Jee-weon Jung, Tomi Kinnunen, Nicholas W. D. Evans, Jean-François Bonastre, Itshak Lapidot:
a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification. CoRR abs/2403.01355 (2024) - [i64]Xin Wang, Tomi Kinnunen, Kong Aik Lee, Paul-Gauthier Noé, Junichi Yamagishi:
Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis. CoRR abs/2406.10836 (2024) - [i63]Hye-jin Shim, Md. Sahidullah, Jee-weon Jung, Shinji Watanabe, Tomi Kinnunen:
Beyond Silence: Bias Analysis through Loss and Asymmetric Approach in Audio Anti-Spoofing. CoRR abs/2406.17246 (2024) - [i62]Xin Wang, Héctor Delgado, Hemlata Tak, Jee-weon Jung, Hye-jin Shim, Massimiliano Todisco, Ivan Kukanov, Xuechen Liu, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans, Kong Aik Lee, Junichi Yamagishi:
ASVspoof 5: Crowdsourced Speech Data, Deepfakes, and Adversarial Attacks at Scale. CoRR abs/2408.08739 (2024) - [i61]Manasi Chhibber, Jagabandhu Mishra, Hye-Jin Shim, Tomi H. Kinnunen:
An Explainable Probabilistic Attribute Embedding Approach for Spoofed Speech Characterization. CoRR abs/2409.11027 (2024) - 2023
- [i60]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Distilling Multi-Level X-vector Knowledge for Small-footprint Speaker Verification. CoRR abs/2303.01125 (2023) - [i59]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Speaker-Aware Anti-Spoofing. CoRR abs/2303.01126 (2023) - [i58]Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md. Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung:
Towards single integrated spoofing-aware speaker verification embeddings. CoRR abs/2305.19051 (2023) - [i57]Hye-jin Shim, Jee-weon Jung, Tomi Kinnunen:
Multi-Dataset Co-Training with Sharpness-Aware Optimization for Audio Anti-spoofing. CoRR abs/2305.19953 (2023) - [i56]Hye-jin Shim, Rosa González Hautamäki, Md. Sahidullah, Tomi Kinnunen:
How to Construct Perfect and Worse-than-Coin-Flip Spoofing Countermeasures: A Word of Warning on Shortcut Learning. CoRR abs/2306.00044 (2023) - [i55]Vishwanath Pratap Singh, Md. Sahidullah, Tomi Kinnunen:
Speaker Verification Across Ages: Investigating Deep Speaker Embedding Sensitivity to Age Mismatch in Enrollment and Test Speech. CoRR abs/2306.07501 (2023) - [i54]Tomi Kinnunen, Kong Aik Lee, Hemlata Tak, Nicholas W. D. Evans, Andreas Nautsch:
t-EER: Parameter-Free Tandem Evaluation of Countermeasures and Biometric Comparators. CoRR abs/2309.12237 (2023) - 2022
- [i53]Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, Junichi Yamagishi:
Optimizing Tandem Speaker Verification and Anti-Spoofing Systems. CoRR abs/2201.09709 (2022) - [i52]Jee-weon Jung, Hemlata Tak, Hye-jin Shim, Hee-Soo Heo, Bong-Jin Lee, Soo-Whan Chung, Hong-Goo Kang, Ha-Jin Yu, Nicholas W. D. Evans, Tomi Kinnunen:
SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan. CoRR abs/2201.10283 (2022) - [i51]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Learnable Nonlinear Compression for Robust Speaker Verification. CoRR abs/2202.05236 (2022) - [i50]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation. CoRR abs/2203.10992 (2022) - [i49]Lauri Tavi, Tomi Kinnunen, Rosa González Hautamäki:
Improving speaker de-identification with functional data analysis of f0 trajectories. CoRR abs/2203.16738 (2022) - [i48]Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans:
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. CoRR abs/2204.09976 (2022) - [i47]Alexey Sholokhov, Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Baselines and Protocols for Household Speaker Recognition. CoRR abs/2205.00288 (2022) - [i46]Sandip Ghimire, Tomi Kinnunen, Rosa González Hautamäki:
Gamified Speaker Comparison by Listening. CoRR abs/2205.04923 (2022) - [i45]Anssi Kanervisto, Tomi Kinnunen, Ville Hautamäki:
GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters. CoRR abs/2205.07060 (2022) - [i44]Rhythm Bhatia, Tomi H. Kinnunen:
An Initial study on Birdsong Re-synthesis Using Neural Vocoders. CoRR abs/2209.10479 (2022) - [i43]Xuechen Liu, Xin Wang, Md. Sahidullah, Jose Patino, Héctor Delgado, Tomi Kinnunen, Massimiliano Todisco, Junichi Yamagishi, Nicholas W. D. Evans, Andreas Nautsch, Kong Aik Lee:
ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild. CoRR abs/2210.02437 (2022) - [i42]Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Deldago, Xuechen Liu, Md. Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera:
I4U System Description for NIST SRE'20 CTS Challenge. CoRR abs/2211.01091 (2022) - 2021
- [i41]Andreas Nautsch, Xin Wang, Nicholas W. D. Evans, Tomi Kinnunen, Ville Vestman, Massimiliano Todisco, Héctor Delgado, Md. Sahidullah, Junichi Yamagishi, Kong Aik Lee:
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech. CoRR abs/2102.05889 (2021) - [i40]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Learnable MFCCs for Speaker Verification. CoRR abs/2102.10322 (2021) - [i39]Bhusan Chettri, Rosa González Hautamäki, Md. Sahidullah, Tomi Kinnunen:
Data Quality as Predictor of Voice Anti-Spoofing Generalization. CoRR abs/2103.14602 (2021) - [i38]Tomi Kinnunen, Andreas Nautsch, Md. Sahidullah, Nicholas W. D. Evans, Xin Wang, Massimiliano Todisco, Héctor Delgado, Junichi Yamagishi, Kong Aik Lee:
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing. CoRR abs/2106.06362 (2021) - [i37]Jean-François Bonastre, Héctor Delgado, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier Noé, Jose Patino, Md. Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia A. Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi:
Benchmarking and challenges in security and privacy for voice biometrics. CoRR abs/2109.00281 (2021) - [i36]Héctor Delgado, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Jose Patino, Md. Sahidullah, Massimiliano Todisco, Xin Wang, Junichi Yamagishi:
ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan. CoRR abs/2109.00535 (2021) - [i35]Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md. Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas W. D. Evans, Héctor Delgado:
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection. CoRR abs/2109.00537 (2021) - [i34]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Parameterized Channel Normalization for Far-field Deep Speaker Verification. CoRR abs/2109.12056 (2021) - [i33]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification. CoRR abs/2109.12058 (2021) - [i32]Khaled Hechmi, Trung Ngo Trong, Ville Hautamäki, Tomi Kinnunen:
VoxCeleb Enrichment for Age and Gender Recognition. CoRR abs/2109.13510 (2021) - [i31]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
Optimizing Multi-Taper Features for Deep Speaker Verification. CoRR abs/2110.10983 (2021) - 2020
- [i30]Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, Junichi Yamagishi:
An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning. CoRR abs/2002.03801 (2020) - [i29]Ville Vestman, Kong Aik Lee, Tomi H. Kinnunen:
Neural i-vectors. CoRR abs/2004.01559 (2020) - [i28]Rohan Kumar Das, Xiaohai Tian, Tomi Kinnunen, Haizhou Li:
The Attacker's Perspective on Automatic Speaker Verification: An Overview. CoRR abs/2004.08849 (2020) - [i27]Tomi Kinnunen, Héctor Delgado, Nicholas W. D. Evans, Kong Aik Lee, Ville Vestman, Andreas Nautsch, Massimiliano Todisco, Xin Wang, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals. CoRR abs/2007.05979 (2020) - [i26]Md. Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent:
UIAI System for Short-Duration Speaker Verification Challenge 2020. CoRR abs/2007.13118 (2020) - [i25]Xuechen Liu, Md. Sahidullah, Tomi Kinnunen:
A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings. CoRR abs/2007.15283 (2020) - [i24]Alexey Sholokhov, Tomi Kinnunen, Ville Vestman, Kong Aik Lee:
Extrapolating false alarm rates in automatic speaker verification. CoRR abs/2008.03590 (2020) - [i23]Rosa González Hautamäki, Tomi Kinnunen:
Why Did the x-Vector System Miss a Target Speaker? Impact of Acoustic Mismatch Upon Target Score on VoxCeleb Data. CoRR abs/2008.04578 (2020) - [i22]Yi Zhao, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhen-Hua Ling, Tomoki Toda:
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion. CoRR abs/2008.12527 (2020) - [i21]Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhen-Hua Ling, Junichi Yamagishi, Yi Zhao, Xiaohai Tian, Tomoki Toda:
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions. CoRR abs/2009.03554 (2020) - [i20]Anssi Kanervisto, Tomi Kinnunen, Ville Hautamäki:
Policy Supervectors: General Characterization of Agents by their Behaviour. CoRR abs/2012.01244 (2020) - 2019
- [i19]Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Tomi Kinnunen, Nicholas W. D. Evans, Junichi Yamagishi, Kong-Aik Lee:
Introduction to Voice Presentation Attack Detection and Recent Advances. CoRR abs/1901.01085 (2019) - [i18]Massimiliano Todisco, Xin Wang, Ville Vestman, Md. Sahidullah, Héctor Delgado, Andreas Nautsch, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee:
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. CoRR abs/1904.05441 (2019) - [i17]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019) - [i16]Ville Vestman, Tomi Kinnunen, Rosa González Hautamäki, Md. Sahidullah:
Voice Mimicry Attacks Assisted by Automatic Speaker Verification. CoRR abs/1906.01454 (2019) - [i15]Ville Vestman, Kong Aik Lee, Tomi H. Kinnunen, Takafumi Koshinaka:
Unleashing the Unused Potential of I-Vectors Enabled by GPU Acceleration. CoRR abs/1906.08556 (2019) - [i14]Alexey Sholokhov, Tomi Kinnunen, Ville Vestman, Kong Aik Lee:
Voice Biometrics Security: Extrapolating False Alarm Rate via Hierarchical Bayesian Modeling of Speaker Verification Scores. CoRR abs/1911.01182 (2019) - [i13]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019) - 2018
- [i12]Jaime Lorenzo-Trueba, Fuming Fang, Xin Wang, Isao Echizen, Junichi Yamagishi, Tomi Kinnunen:
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data. CoRR abs/1803.00860 (2018) - [i11]Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Tomi Kinnunen, Zhen-Hua Ling:
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods. CoRR abs/1804.04262 (2018) - [i10]Tomi Kinnunen, Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Zhen-Hua Ling:
A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment. CoRR abs/1804.08438 (2018) - [i9]Rosa González Hautamäki, Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen:
Perceptual Evaluation of the Effectiveness of Voice Disguise by Age Modification. CoRR abs/1804.08910 (2018) - [i8]Tomi Kinnunen, Kong-Aik Lee, Héctor Delgado, Nicholas W. D. Evans, Massimiliano Todisco, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification. CoRR abs/1804.09618 (2018) - [i7]Ville Vestman, Tomi Kinnunen:
Supervector Compression Strategies to Speed up I-Vector System Development. CoRR abs/1805.01156 (2018) - [i6]Akihiro Kato, Tomi Kinnunen:
A Regression Model of Recurrent Deep Neural Networks for Noise Robust Estimation of the Fundamental Frequency Contour of Speech. CoRR abs/1805.02958 (2018) - [i5]Akihiro Kato, Tomi Kinnunen:
Waveform to Single Sinusoid Regression to Estimate the F0 Contour from Noisy Speech Using Recurrent Deep Neural Networks. CoRR abs/1807.00752 (2018) - [i4]Fuming Fang, Junichi Yamagishi, Isao Echizen, Md. Sahidullah, Tomi Kinnunen:
Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems. CoRR abs/1809.04274 (2018) - [i3]Ville Vestman, Bilal Soomro, Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen:
Who Do I Sound Like? Showcasing Speaker Recognition Technology by YouTube Voice Search. CoRR abs/1811.03293 (2018) - [i2]Tomi Kinnunen, Rosa González Hautamäki, Ville Vestman, Md. Sahidullah:
Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection. CoRR abs/1811.03790 (2018) - 2016
- [i1]Cemal Hanilçi, Tomi Kinnunen, Md. Sahidullah, Aleksandr Sizov:
Spoofing Detection Goes Noisy: An Analysis of Synthetic Speech Detection in the Presence of Additive Noise. CoRR abs/1603.03947 (2016)
Coauthor Index
aka: Jee-weon Jung
aka: Kong Aik Lee
aka: Zhenhua Ling
aka: Hye-jin Shim
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-22 20:17 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint