Kong-Aik Lee
Kong Aik Lee
Person information
- affiliation: Institute for Infocomm Research, Singapore
2020 – today
- 2024
- [j41]Qiongqiong Wang, Hardik B. Sailor, Kong Aik Lee, Kai Ma, Kim Huat Goh, Wai Fong Boh:
Using Twitter Dataset for Social Listening in Singapore. IEEE Access 12: 100015-100025 (2024) - [j40]Tomi H. Kinnunen, Kong Aik Lee, Hemlata Tak, Nicholas W. D. Evans, Andreas Nautsch:
t-EER: Parameter-Free Tandem Evaluation of Countermeasures and Biometric Comparators. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 2622-2637 (2024) - [j39]Qiongqiong Wang, Kong Aik Lee:
Cosine Scoring With Uncertainty for Neural Speaker Embedding. IEEE Signal Process. Lett. 31: 845-849 (2024) - [j38]Turghun Tayir, Lin Li, Bei Li, Jianquan Liu, Kong Aik Lee:
Encoder-Decoder Calibration for Multimodal Machine Translation. IEEE Trans. Artif. Intell. 5(8): 3965-3973 (2024) - [j37]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Generalizing Speaker Verification for Spoof Awareness in the Embedding Space. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1261-1273 (2024) - [j36]Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li:
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2324-2337 (2024) - [c147]Duc-Tuan Truong, Ruijie Tao, Jia Qi Yip, Kong Aik Lee, Eng Siong Chng:
Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification. ICASSP 2024: 10336-10340 - [c146]Linjuan Zhang, Kong Aik Lee, Lin Zhang, Longbiao Wang, Baoning Niu:
CPAUG: Refining Copy-Paste Augmentation for Speech Anti-Spoofing. ICASSP 2024: 10996-11000 - [c145]Yi Ma, Kong Aik Lee, Ville Hautamäki, Meng Ge, Haizhou Li:
Gradient Weighting for Speaker Verification in Extremely Low Signal-to-Noise Ratio. ICASSP 2024: 11311-11315 - [c144]Shihao Chen, Liping Chen, Jie Zhang, Kong-Aik Lee, Zhenhua Ling, Lirong Dai:
Adversarial Speech for Voice Privacy Protection from Personalized Speech Generation. ICASSP 2024: 11411-11415 - [c143]Liping Chen, Kong Aik Lee, Wu Guo, Zhen-Hua Ling:
Modeling Pseudo-Speaker Uncertainty in Voice Anonymization. ICASSP 2024: 11601-11605 - [c142]Xingmei Wang, Jiaxiang Meng, Kong Aik Lee, Boquan Li, Jinghan Liu:
Two-stage Semi-supervised Speaker Recognition with Gated Label Learning. IJCAI 2024: 6495-6503 - [i70]Yi Ma, Kong Aik Lee, Ville Hautamäki, Meng Ge, Haizhou Li:
Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio. CoRR abs/2401.02626 (2024) - [i69]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Generalizing Speaker Verification for Spoof Awareness in the Embedding Space. CoRR abs/2401.11156 (2024) - [i68]Shihao Chen, Liping Chen, Jie Zhang, Kong-Aik Lee, Zhenhua Ling, Lirong Dai:
Adversarial speech for voice privacy protection from Personalized Speech generation. CoRR abs/2401.11857 (2024) - [i67]Weiwei Lin, Chenhang He, Man-Wai Mak, Jiachen Lian, Kong Aik Lee:
VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis. CoRR abs/2403.00529 (2024) - [i66]Qiongqiong Wang, Kong Aik Lee:
Cosine Scoring with Uncertainty for Neural Speaker Embedding. CoRR abs/2403.06404 (2024) - [i65]Hossein Zeinali, Kong Aik Lee, Jahangir Alam, Lukás Burget:
Text-dependent Speaker Verification (TdSV) Challenge 2024: Challenge Evaluation Plan. CoRR abs/2404.13428 (2024) - [i64]Rui Wang, Liping Chen, Kong-Aik Lee, Zhen-Hua Ling:
Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding. CoRR abs/2406.08200 (2024) - [i63]Xin Wang, Tomi Kinnunen, Kong Aik Lee, Paul-Gauthier Noé, Junichi Yamagishi:
Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis. CoRR abs/2406.10836 (2024) - [i62]Duc-Tuan Truong, Ruijie Tao, Tuan Nguyen, Hieu-Thi Luong, Kong Aik Lee, Eng Siong Chng:
Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection. CoRR abs/2406.17376 (2024) - [i61]Shuai Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li:
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. CoRR abs/2407.15188 (2024) - [i60]Xin Wang, Héctor Delgado, Hemlata Tak, Jee-weon Jung, Hye-jin Shim, Massimiliano Todisco, Ivan Kukanov, Xuechen Liu, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans, Kong Aik Lee, Junichi Yamagishi:
ASVspoof 5: Crowdsourced Speech Data, Deepfakes, and Adversarial Attacks at Scale. CoRR abs/2408.08739 (2024) - [i59]Massimiliano Todisco, Michele Panariello, Xin Wang, Héctor Delgado, Kong Aik Lee, Nicholas W. D. Evans:
Malacopula: adversarial automatic speaker verification attacks using a neural-based generalised Hammerstein model. CoRR abs/2408.09300 (2024) - [i58]Tianchi Liu, Ivan Kukanov, Zihan Pan, Qiongqiong Wang, Hardik B. Sailor, Kong Aik Lee:
Towards Quantifying and Reducing Language Mismatch Effects in Cross-Lingual Speech Anti-Spoofing. CoRR abs/2409.08346 (2024) - [i57]Junjie Li, Ke Zhang, Shuai Wang, Haizhou Li, Man-Wai Mak, Kong Aik Lee:
On the effectiveness of enrollment speech augmentation for Target Speaker Extraction. CoRR abs/2409.09589 (2024) - [i56]Hieu-Thi Luong, Duc-Tuan Truong, Kong Aik Lee, Eng Siong Chng:
Room Impulse Responses help attackers to evade Deep Fake Detection. CoRR abs/2409.14712 (2024) - [i55]Hieu-Thi Luong, Haoyang Li, Lin Zhang, Kong Aik Lee, Eng Siong Chng:
LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation. CoRR abs/2409.14743 (2024) - [i54]Nikita Kuzmin, Hieu-Thi Luong, Jixun Yao, Lei Xie, Kong Aik Lee, Eng Siong Chng:
NTU-NPU System for Voice Privacy 2024 Challenge. CoRR abs/2410.02371 (2024)
- 2023
- [j35]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
A Dual Latent Variable Personalized Dialogue Agent. SN Comput. Sci. 4(2): 159 (2023) - [j34]Hanyi Zhang, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang, Helen Meng:
Meta-Generalization for Domain-Invariant Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1024-1036 (2023) - [j33]Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1706-1719 (2023) - [j32]Xuechen Liu, Xin Wang, Md. Sahidullah, Jose Patino, Héctor Delgado, Tomi Kinnunen, Massimiliano Todisco, Junichi Yamagishi, Nicholas W. D. Evans, Andreas Nautsch, Kong Aik Lee:
ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2507-2522 (2023) - [j31]Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Takafumi Koshinaka:
Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition. IEEE Trans. Inf. Forensics Secur. 18: 3936-3947 (2023) - [c141]Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR. ASRU 2023: 1-8 - [c140]Hui Chen, Hanyi Zhang, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang:
Self-Supervised Audio-Visual Speaker Representation with Co-Meta Learning. ICASSP 2023: 1-5 - [c139]Xiaohui Liu, Meng Liu, Longbiao Wang, Kong Aik Lee, Hanyi Zhang, Jianwu Dang:
Leveraging Positional-Related Local-Global Dependency for Synthetic Speech Detection. ICASSP 2023: 1-5 - [c138]Meng Liu, Kong Aik Lee, Longbiao Wang, Hanyi Zhang, Chang Zeng, Jianwu Dang:
Cross-Modal Audio-Visual Co-Learning for Text-Independent Speaker Verification. ICASSP 2023: 1-5 - [c137]Alexey Sholokhov, Nikita Kuzmin, Kong Aik Lee, Eng Siong Chng:
Probabilistic Back-ends for Online Speaker Recognition and Clustering. ICASSP 2023: 1-5 - [c136]Yao Sun, Hanyi Zhang, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang:
Noise-Disentanglement Metric Learning for Robust Speaker Verification. ICASSP 2023: 1-5 - [c135]Ruijie Tao, Kong Aik Lee, Zhan Shi, Haizhou Li:
Speaker Recognition with Two-Step Multi-Modal Deep Cleansing. ICASSP 2023: 1-5 - [c134]Qiongqiong Wang, Kong Aik Lee, Tianchi Liu:
Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification. ICASSP 2023: 1-5 - [c133]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Speaker-Aware Anti-spoofing. INTERSPEECH 2023: 2498-2502 - [c132]Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md. Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung:
Towards Single Integrated Spoofing-aware Speaker Verification Embeddings. INTERSPEECH 2023: 3989-3993 - [c131]Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li:
Disentangling Voice and Content with Self-Supervision for Speaker Recognition. NeurIPS 2023 - [c130]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
Partially Randomizing Transformer Weights for Dialogue Response Diversity. PACLIC 2023: 486-498 - [i53]Alexey Sholokhov, Nikita Kuzmin, Kong Aik Lee, Eng Siong Chng:
Probabilistic Back-ends for Online Speaker Recognition and Clustering. CoRR abs/2302.09523 (2023) - [i52]Meng Liu, Kong Aik Lee, Longbiao Wang, Hanyi Zhang, Chang Zeng, Jianwu Dang:
Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification. CoRR abs/2302.11254 (2023) - [i51]Qiongqiong Wang, Kong Aik Lee, Tianchi Liu:
Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification. CoRR abs/2302.11763 (2023) - [i50]Xuechen Liu, Md. Sahidullah, Kong Aik Lee, Tomi Kinnunen:
Speaker-Aware Anti-Spoofing. CoRR abs/2303.01126 (2023) - [i49]Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md. Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung:
Towards single integrated spoofing-aware speaker verification embeddings. CoRR abs/2305.19051 (2023) - [i48]Tomi Kinnunen, Kong Aik Lee, Hemlata Tak, Nicholas W. D. Evans, Andreas Nautsch:
t-EER: Parameter-Free Tandem Evaluation of Countermeasures and Biometric Comparators. CoRR abs/2309.12237 (2023) - [i47]Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The second multi-channel multi-party meeting transcription challenge (M2MeT 2.0): A benchmark for speaker-attributed ASR. CoRR abs/2309.13573 (2023) - [i46]Duc-Tuan Truong, Ruijie Tao, Jia Qi Yip, Kong Aik Lee, Eng Siong Chng:
Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification. CoRR abs/2309.14838 (2023) - [i45]Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li:
Disentangling Voice and Content with Self-Supervision for Speaker Recognition. CoRR abs/2310.01128 (2023) - [i44]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
Partially Randomizing Transformer Weights for Dialogue Response Diversity. CoRR abs/2311.10943 (2023) - [i43]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
An Empirical Bayes Framework for Open-Domain Dialogue Generation. CoRR abs/2311.10945 (2023) - [i42]Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li:
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification. CoRR abs/2312.03620 (2023)
- 2022
- [j30]Hongning Zhu, Kong Aik Lee, Haizhou Li:
Discriminative speaker embedding with serialized multi-layer multi-head attention. Speech Commun. 144: 89-100 (2022) - [j29]Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention Mask. IEEE Signal Process. Lett. 29: 782-786 (2022) - [c129]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
A Randomized Link Transformer for Diverse Open-Domain Dialogue Generation. ConvAI@ACL 2022: 1-11 - [c128]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
DLVGen: A Dual Latent Variable Approach to Personalized Dialogue Generation. ICAART (2) 2022: 193-202 - [c127]Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Speaker Recognition with Loss-Gated Learning. ICASSP 2022: 6142-6146 - [c126]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
Improving Contextual Coherence in Variational Personalized and Empathetic Dialogue Agents. ICASSP 2022: 7052-7056 - [c125]Hanyi Zhang, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang, Hui Chen:
Learning Domain-Invariant Transformation for Speaker Verification. ICASSP 2022: 7177-7181 - [c124]Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-Scale Frequency-Channel Attention for Text-Independent Speaker Verification with Short Utterances. ICASSP 2022: 7517-7521 - [c123]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160 - [c122]Qiongqiong Wang, Kong Aik Lee, Tianchi Liu:
Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA? INTERSPEECH 2022: 600-604 - [c121]Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. ISCSLP 2022: 488-492 - [c120]Xiaohui Liu, Meng Liu, Lin Zhang, Linjuan Zhang, Chang Zeng, Kai Li, Nan Li, Kong Aik Lee, Longbiao Wang, Jianwu Dang:
Deep Spectro-temporal Artifacts for Detecting Synthesized Speech. DDAM@MM 2022: 69-75 - [c119]Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans:
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. Odyssey 2022: 330-337 - [c118]Lin Li, Kaixi Hu, Turghun Tayir, Jianquan Liu, Kong Aik Lee:
Noise-Robust Semi-supervised Multi-modal Machine Translation. PRICAI (2) 2022: 155-168 - [e3]Kong Aik Lee, Hung-yi Lee, Yanfeng Lu, Minghui Dong:
13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022, Singapore, December 11-14, 2022. IEEE 2022, ISBN 979-8-3503-9796-3 - [i41]Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances. CoRR abs/2202.01624 (2022) - [i40]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022) - [i39]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
Improving Contextual Coherence in Variational Personalized and Empathetic Dialogue Agents. CoRR abs/2202.05971 (2022) - [i38]Qiongqiong Wang, Kong Aik Lee, Tianchi Liu:
Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA? CoRR abs/2204.03965 (2022) - [i37]Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans:
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. CoRR abs/2204.09976 (2022) - [i36]Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. CoRR abs/2208.08042 (2022) - [i35]Xuechen Liu, Xin Wang, Md. Sahidullah, Jose Patino, Héctor Delgado, Tomi Kinnunen, Massimiliano Todisco, Junichi Yamagishi, Nicholas W. D. Evans, Andreas Nautsch, Kong Aik Lee:
ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild. CoRR abs/2210.02437 (2022) - [i34]Xiaohui Liu, Meng Liu, Lin Zhang, Linjuan Zhang, Chang Zeng, Kai Li, Nan Li, Kong Aik Lee, Longbiao Wang, Jianwu Dang:
Deep Spectro-temporal Artifacts for Detecting Synthesized Speech. CoRR abs/2210.05254 (2022) - [i33]Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs. CoRR abs/2210.15385 (2022) - [i32]Ruijie Tao, Kong Aik Lee, Zhan Shi, Haizhou Li:
Speaker recognition with two-step multi-modal deep cleansing. CoRR abs/2210.15903 (2022) - [i31]Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Delgado, Xuechen Liu, Md. Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera:
I4U System Description for NIST SRE'20 CTS Challenge. CoRR abs/2211.01091 (2022)
- 2021
- [j28]Meng Liu, Longbiao Wang, Jianwu Dang, Kong Aik Lee, Seiichi Nakagawa:
Replay attack detection using variable-frequency resolution phase and magnitude features. Comput. Speech Lang. 66: 101161 (2021) - [j27]Kong Aik Lee, Ville Vestman, Tomi Kinnunen:
ASVtorch toolkit: Speaker verification with deep neural networks. SoftwareX 14: 100697 (2021) - [j26]Kong Aik Lee, Qiongqiong Wang, Takafumi Koshinaka:
Xi-Vector Embedding for Speaker Recognition. IEEE Signal Process. Lett. 28: 1385-1389 (2021) - [j25]Andreas Nautsch, Xin Wang, Nicholas W. D. Evans, Tomi H. Kinnunen, Ville Vestman, Massimiliano Todisco, Héctor Delgado, Md. Sahidullah, Junichi Yamagishi, Kong Aik Lee:
ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech. IEEE Trans. Biom. Behav. Identity Sci. 3(2): 252-265 (2021) - [c117]Yi Ma, Kong Aik Lee, Ville Hautamäki, Haizhou Li:
PL-EESR: Perceptual Loss Based End-to-End Robust Speaker Representation Extraction. ASRU 2021: 106-113 - [c116]Meng Liu, Longbiao Wang, Kong Aik Lee, Hanyi Zhang, Chang Zeng, Jianwu Dang:
DeepLip: A Benchmark for Deep Learning-Based Audio-Visual Lip Biometrics. ASRU 2021: 122-129 - [c115]Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka, Koji Okabe, Hitoshi Yamamoto:
Task-aware Warping Factors in Mask-based Speech Enhancement. EUSIPCO 2021: 476-480 - [c114]Lin Li, Kaixi Hu, Yunpei Zheng, Jianquan Liu, Kong Aik Lee:
COOPNet: Multi-Modal Cooperative Gender Prediction in Social Media User Profiling. ICASSP 2021: 4310-4314 - [c113]Hanyi Zhang, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang, Hui Chen:
Meta-Learning for Cross-Channel Speaker Verification. ICASSP 2021: 5839-5843 - [c112]Meng Liu, Longbiao Wang, Kong Aik Lee, Xuanda Chen, Jianwu Dang:
Replay-Attack Detection Using Features With Adaptive Spectro-Temporal Resolution. ICASSP 2021: 6374-6378 - [c111]Hongning Zhu, Kong Aik Lee, Haizhou Li:
Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding. Interspeech 2021: 106-110 - [c110]Yibo Wu, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang:
Joint Feature Enhancement and Speaker Recognition with Multi-Objective Task-Oriented Network. Interspeech 2021: 1089-1093 - [c109]Li Zhang, Qing Wang, Kong Aik Lee, Lei Xie, Haizhou Li:
Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification. Interspeech 2021: 1094-1098 - [c108]Tomi Kinnunen, Andreas Nautsch, Md. Sahidullah, Nicholas W. D. Evans, Xin Wang, Massimiliano Todisco, Héctor Delgado, Junichi Yamagishi, Kong Aik Lee:
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing. Interspeech 2021: 4299-4303 - [i30]Andreas Nautsch, Xin Wang, Nicholas W. D. Evans, Tomi Kinnunen, Ville Vestman, Massimiliano Todisco, Héctor Delgado, Md. Sahidullah, Junichi Yamagishi, Kong Aik Lee:
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech. CoRR abs/2102.05889 (2021) - [i29]Meng Liu, Longbiao Wang, Kong Aik Lee, Hanyi Zhang, Chang Zeng, Jianwu Dang:
Exploring Deep Learning for Joint Audio-Visual Lip Biometrics. CoRR abs/2104.08510 (2021) - [i28]Tomi Kinnunen, Andreas Nautsch, Md. Sahidullah, Nicholas W. D. Evans, Xin Wang, Massimiliano Todisco, Héctor Delgado, Junichi Yamagishi, Kong Aik Lee:
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing. CoRR abs/2106.06362 (2021) - [i27]Li Zhang, Qing Wang, Kong Aik Lee, Lei Xie, Haizhou Li:
Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification. CoRR abs/2106.09320 (2021) - [i26]Hongning Zhu, Kong Aik Lee, Haizhou Li:
Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding. CoRR abs/2107.06493 (2021) - [i25]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
Generating Personalized Dialogue via Multi-Task Meta-Learning. CoRR abs/2108.03377 (2021) - [i24]Kong Aik Lee, Qiongqiong Wang, Takafumi Koshinaka:
Xi-Vector Embedding for Speaker Recognition. CoRR abs/2108.05679 (2021) - [i23]Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka, Koji Okabe, Hitoshi Yamamoto:
Task-aware Warping Factors in Mask-based Speech Enhancement. CoRR abs/2108.12128 (2021) - [i22]Jean-François Bonastre, Héctor Delgado, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier Noé, Jose Patino, Md. Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia A. Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi:
Benchmarking and challenges in security and privacy for voice biometrics. CoRR abs/2109.00281 (2021) - [i21]Héctor Delgado, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Jose Patino, Md. Sahidullah, Massimiliano Todisco, Xin Wang, Junichi Yamagishi:
ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan. CoRR abs/2109.00535 (2021) - [i20]Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md. Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas W. D. Evans, Héctor Delgado:
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection. CoRR abs/2109.00537 (2021) - [i19]