


Остановите войну!
for scientists:


default search action
Haizhou Li 0001
李海洲
Person information

- unicode name: 李海洲
- affiliation: Chinese University of Hong Kong (Shenzhen), China
- affiliation: National University of Singapore, Department of Electrical and Computer Engineering, Singapore
- affiliation (2006 - 2016): Nanyang Technological University, Singapore
- affiliation (2003 - 2016): Institute for Infocomm Research, A*STAR, Singapore
- affiliation (2011): University of New South Wales, Sydney, Australia
- affiliation (2009): University of Eastern Finland, Kuopio, Finland
- affiliation (PhD 1990): South China University of Technology, Guangzhou, China
Other persons with the same name
- Haizhou Li 0002 — Blaise Pascal University, Clermont-Ferrand, France
- Haizhou Li 0003 — City University of Hong Kong, Department of Computer Science, Hong Kong
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j145]Hui Tian
, Yiqin Qiu
, Wojciech Mazurczyk
, Haizhou Li
, Zhenxing Qian
:
STFF-SM: Steganalysis Model Based on Spatial and Temporal Feature Fusion for Speech Streams. IEEE ACM Trans. Audio Speech Lang. Process. 31: 277-289 (2023) - [j144]Qiquan Zhang
, Xinyuan Qian
, Zhaoheng Ni, Aaron Nicolson, Eliathamby Ambikairajah
, Haizhou Li:
A Time-Frequency Attention Module for Neural Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 31: 462-475 (2023) - [j143]Xinyuan Qian
, Zhengdong Wang, Jiadong Wang
, Guohui Guan, Haizhou Li
:
Audio-Visual Cross-Attention Network for Robotic Speaker Tracking. IEEE ACM Trans. Audio Speech Lang. Process. 31: 550-562 (2023) - [j142]Jibin Wu
, Yansong Chua
, Malu Zhang
, Guoqi Li
, Haizhou Li
, Kay Chen Tan:
A Tandem Learning Rule for Effective Training and Rapid Inference of Deep Spiking Neural Networks. IEEE Trans. Neural Networks Learn. Syst. 34(1): 446-460 (2023) - 2022
- [j141]Xianghu Yue
, Jingru Lin, Fabian Ritter Gutierrez, Haizhou Li:
Self-Supervised Learning With Segmental Masking for Speech Representation. IEEE J. Sel. Top. Signal Process. 16(6): 1367-1379 (2022) - [j140]Hongqiang Du
, Lei Xie, Haizhou Li:
Noise-robust voice conversion with domain adversarial training. Neural Networks 148: 74-84 (2022) - [j139]Jibin Wu
, Chenglin Xu, Xiao Han, Daquan Zhou, Malu Zhang
, Haizhou Li
, Kay Chen Tan
:
Progressive Tandem Learning for Pattern Recognition With Deep Spiking Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 7824-7840 (2022) - [j138]Kun Zhou
, Berrak Sisman
, Rui Liu
, Haizhou Li
:
Emotional voice conversion: Theory, databases and ESD. Speech Commun. 137: 1-18 (2022) - [j137]Hongning Zhu
, Kong Aik Lee
, Haizhou Li
:
Discriminative speaker embedding with serialized multi-layer multi-head attention. Speech Commun. 144: 89-100 (2022) - [j136]Tianchi Liu
, Rohan Kumar Das
, Kong Aik Lee
, Haizhou Li
:
Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention Mask. IEEE Signal Process. Lett. 29: 782-786 (2022) - [j135]Zexu Pan
, Xinyuan Qian
, Haizhou Li
:
Speaker Extraction With Co-Speech Gestures Cue. IEEE Signal Process. Lett. 29: 1467-1471 (2022) - [j134]Haizhou Li:
A Unique ICASSP 2022: During an Unusual Time [Conference Highlights]. IEEE Signal Process. Mag. 39(2): 159-160 (2022) - [j133]Zexu Pan
, Ruijie Tao, Chenglin Xu
, Haizhou Li
:
Selective Listening by Synchronizing Speech With Lips. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1650-1664 (2022) - [j132]Rui Liu
, Berrak Sisman
, Guanglai Gao, Haizhou Li
:
Decoding Knowledge Transfer for Neural Text-to-Speech Training. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1789-1802 (2022) - [j131]Zexu Pan
, Meng Ge
, Haizhou Li
:
USEV: Universal Speaker Extraction With Visual Cue. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3032-3045 (2022) - [j130]Enze Su
, Siqi Cai
, Longhan Xie
, Haizhou Li
, Tanja Schultz
:
STAnet: A Spatiotemporal Attention Network for Decoding Auditory Spatial Attention From EEG. IEEE Trans. Biomed. Eng. 69(7): 2233-2242 (2022) - [j129]Siqi Cai
, Enze Su
, Longhan Xie
, Haizhou Li
:
EEG-Based Auditory Attention Detection via Frequency and Channel Neural Attention. IEEE Trans. Hum. Mach. Syst. 52(2): 256-266 (2022) - [j128]Malu Zhang
, Jiadong Wang
, Jibin Wu
, Ammar Belatreche
, Burin Amornpaisannon, Zhixuan Zhang, Venkata Pavan Kumar Miriyala
, Hong Qu
, Yansong Chua
, Trevor E. Carlson
, Haizhou Li
:
Rectified Linear Postsynaptic Potential Function for Backpropagation in Deep Spiking Neural Networks. IEEE Trans. Neural Networks Learn. Syst. 33(5): 1947-1958 (2022) - [c633]Chen Zhang, Luis Fernando D'Haro, Thomas Friedrichs, Haizhou Li:
MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation. AAAI 2022: 11657-11666 - [c632]Jinming Zhao, Tenggan Zhang, Jingwen Hu, Yuchen Liu, Qin Jin, Xinchao Wang, Haizhou Li:
M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database. ACL (1) 2022: 5699-5710 - [c631]Bin Wang, C.-C. Jay Kuo, Haizhou Li:
Just Rank: Rethinking Evaluation with Word and Sentence Similarities. ACL (1) 2022: 6060-6077 - [c630]Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova
, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Ziwei Zhao, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Christian Fuegen, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3, 000 Hours of Egocentric Video. CVPR 2022: 18973-18990 - [c629]Xiaoxue Gao
, Chitralekha Gupta, Haizhou Li:
Genre-Conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music. ICASSP 2022: 791-795 - [c628]Marvin Borsdorf, Kevin Scheck, Haizhou Li, Tanja Schultz:
Experts Versus All-Rounders: Target Language Extraction for Multiple Target Languages. ICASSP 2022: 846-850 - [c627]Jinming Zhao, Ruichen Li, Qin Jin, Xinchao Wang, Haizhou Li:
Memobert: Pre-Training Model with Prompt-Based Learning for Multimodal Emotion Recognition. ICASSP 2022: 4703-4707 - [c626]Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Speaker Recognition with Loss-Gated Learning. ICASSP 2022: 6142-6146 - [c625]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. ICASSP 2022: 7287-7291 - [c624]Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-Scale Frequency-Channel Attention for Text-Independent Speaker Verification with Short Utterances. ICASSP 2022: 7517-7521 - [c623]Qiquan Zhang, Qi Song, Zhaoheng Ni, Aaron Nicolson, Haizhou Li:
Time-Frequency Attention for Monaural Speech Enhancement. ICASSP 2022: 7852-7856 - [c622]Junchen Lu, Berrak Sisman, Rui Liu, Mingyang Zhang, Haizhou Li:
Visualtts: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over. ICASSP 2022: 8032-8036 - [c621]Jiadong Wang, Jibin Wu, Malu Zhang, Qi Liu, Haizhou Li:
A Hybrid Learning Framework for Deep Spiking Neural Networks with One-Spike Temporal Coding. ICASSP 2022: 8942-8946 - [c620]Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li:
ADD 2022: the first Audio Deep Synthesis Detection Challenge. ICASSP 2022: 9216-9220 - [c619]Marvin Borsdorf, Kevin Scheck, Haizhou Li, Tanja Schultz:
Blind Language Separation: Disentangling Multilingual Cocktail Party Voices by Language. INTERSPEECH 2022: 256-260 - [c618]Rui Wang, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang, Tom Ko, Haizhou Li:
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT. INTERSPEECH 2022: 1686-1690 - [c617]Zexu Pan, Meng Ge, Haizhou Li:
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction. INTERSPEECH 2022: 1786-1790 - [c616]Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion. INTERSPEECH 2022: 2603-2607 - [c615]Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. INTERSPEECH 2022: 2658-2662 - [c614]Qu Yang, Qi Liu, Haizhou Li:
Deep residual spiking neural network for keyword spotting in low-resource settings. INTERSPEECH 2022: 3023-3027 - [c613]Zeyang Song, Qi Liu, Qu Yang, Haizhou Li:
Knowledge distillation for In-memory keyword spotting model. INTERSPEECH 2022: 4128-4132 - [c612]Rui Liu, Berrak Sisman, Björn W. Schuller, Guanglai Gao, Haizhou Li:
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning. INTERSPEECH 2022: 5493-5497 - [c611]Jianhua Tao, Jiangyan Yi, Cunhang Fan, Ruibo Fu, Shan Liang, Pengyuan Zhang, Haizhou Li, Helen Meng, Dong Yu, Masato Akagi:
DDAM '22: 1st International Workshop on Deepfake Detection for Audio Multimedia. ACM Multimedia 2022: 7405-7406 - [c610]Peiwen Li, Enze Su, Jia Li, Siqi Cai, Longhan Xie, Haizhou Li:
Esaa: An Eeg-Speech Auditory Attention Detection Database. O-COCOSDA 2022 2022: 1-6 - [e23]Rong Tong, Yanfeng Lu, Minghui Dong, Wengao Gong, Haizhou Li:
International Conference on Asian Language Processing, IALP 2022, Singapore, October 27-28, 2022. IEEE 2022, ISBN 978-1-6654-7674-4 [contents] - [e22]Svetlana Stoyanchev, Stefan Ultes, Haizhou Li
:
Conversational AI for Natural Human-Centric Interaction - 12th International Workshop on Spoken Dialogue System Technology, IWSDS 2021, Singapore. Lecture Notes in Electrical Engineering 943, Springer 2022, ISBN 978-981-19-5537-2 [contents] - [e21]Jianhua Tao, Haizhou Li, Helen Meng, Dong Yu, Masato Akagi, Jiangyan Yi, Cunhang Fan, Ruibo Fu, Shan Lian, Pengyuan Zhang:
DDAM@MM 2022: Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia, Lisboa, Portugal, 14 October 2022. ACM 2022, ISBN 978-1-4503-9496-3 [contents] - [i125]Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Emotion Intensity and its Control for Emotional Voice Conversion. CoRR abs/2201.03967 (2022) - [i124]Hongqiang Du, Lei Xie, Haizhou Li:
Noise-robust voice conversion with domain adversarial training. CoRR abs/2201.10693 (2022) - [i123]Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances. CoRR abs/2202.01624 (2022) - [i122]Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li, Zheng Lian, Bin Liu:
ADD 2022: the First Audio Deep Synthesis Detection Challenge. CoRR abs/2202.08433 (2022) - [i121]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. CoRR abs/2202.09995 (2022) - [i120]Bin Wang, C.-C. Jay Kuo, Haizhou Li:
Just Rank: Rethinking Evaluation with Word and Sentence Similarities. CoRR abs/2203.02679 (2022) - [i119]Rui Wang, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang, Tom Ko, Haizhou Li:
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT. CoRR abs/2203.15610 (2022) - [i118]Zexu Pan, Xinyuan Qian, Haizhou Li:
Speaker Extraction with Co-Speech Gestures Cue. CoRR abs/2203.16840 (2022) - [i117]Zexu Pan, Meng Ge, Haizhou Li:
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction. CoRR abs/2203.16843 (2022) - [i116]Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. CoRR abs/2203.17113 (2022) - [i115]Xiaoxue Gao, Chitralekha Gupta, Haizhou Li:
Genre-conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music. CoRR abs/2204.03307 (2022) - [i114]Jinming Zhao, Tenggan Zhang, Jingwen Hu, Yuchen Liu, Qin Jin, Xinchao Wang, Haizhou Li:
M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database. CoRR abs/2205.10237 (2022) - [i113]Rui Liu, Berrak Sisman, Björn W. Schuller, Guanglai Gao, Haizhou Li:
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning. CoRR abs/2206.07229 (2022) - [i112]Xiaoxue Gao, Chitralekha Gupta, Haizhou Li:
PoLyScribers: Joint Training of Vocal Extractor and Lyrics Transcriber for Polyphonic Music. CoRR abs/2207.07336 (2022) - [i111]Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Speech Synthesis with Mixed Emotions. CoRR abs/2208.05890 (2022) - [i110]Jiadong Wang, Xinyuan Qian, Haizhou Li:
Predict-and-Update Network: Audio-Visual Speech Recognition Inspired by Human Speech Perception. CoRR abs/2209.01768 (2022) - [i109]Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li:
Controllable Accented Text-to-Speech Synthesis. CoRR abs/2209.10804 (2022) - [i108]Qutang Cai, Guoqiang Hong, Zhijian Ye, Ximin Li, Haizhou Li:
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022. CoRR abs/2209.11433 (2022) - [i107]Bin Wang, Chen Zhang, Chengwei Wei, Haizhou Li:
A Focused Study on Sequence Length for Dialogue Summarization. CoRR abs/2209.11910 (2022) - [i106]Chutong Meng, Junyi Ao, Tom Ko, Mingxuan Wang, Haizhou Li:
CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning. CoRR abs/2210.04062 (2022) - [i105]Qu Yang, Jibin Wu, Malu Zhang, Yansong Chua, Xinchao Wang, Haizhou Li:
Training Spiking Neural Networks with Local Tandem Learning. CoRR abs/2210.04532 (2022) - [i104]Bin Wang, Chen Zhang, Yan Zhang, Yiming Chen, Haizhou Li:
Analyzing and Evaluating Faithfulness in Dialogue Summarization. CoRR abs/2210.11777 (2022) - [i103]Kun Zhou, Berrak Sisman, Carlos Busso, Haizhou Li:
Mixed Emotion Modelling for Emotional Voice Conversion. CoRR abs/2210.13756 (2022) - [i102]Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li:
FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation. CoRR abs/2210.13832 (2022) - [i101]Haolin Zuo, Rui Liu, Jinming Zhao, Guanglai Gao, Haizhou Li:
Exploiting modality-invariant feature for robust multimodal emotion recognition with missing modalities. CoRR abs/2210.15359 (2022) - [i100]Yifan Hu, Rui Liu, Guanglai Gao, Haizhou Li:
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis. CoRR abs/2210.15360 (2022) - [i99]Rui Liu, Haolin Zuo, De Hu, Guanglai Gao, Haizhou Li:
Explicit Intensity Control for Accented Text-to-speech. CoRR abs/2210.15364 (2022) - [i98]Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs. CoRR abs/2210.15385 (2022) - [i97]Ruijie Tao, Kong Aik Lee, Zhan Shi, Haizhou Li:
Speaker recognition with two-step multi-modal deep cleansing. CoRR abs/2210.15903 (2022) - [i96]Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li:
token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text. CoRR abs/2210.16755 (2022) - [i95]Yiming Chen, Yan Zhang, Bin Wang, Zuozhu Liu, Haizhou Li:
Generate, Discriminate and Contrast: A Semi-Supervised Sentence Representation Learning Framework. CoRR abs/2210.16798 (2022) - [i94]Zexu Pan, Wupeng Wang, Marvin Borsdorf, Haizhou Li:
ImagineNET: Target Speaker Extraction with Intermittent Visual Cue through Embedding Inpainting. CoRR abs/2211.00109 (2022) - [i93]Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Deldago, Xuechen Liu, Md. Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera:
I4U System Description for NIST SRE'20 CTS Challenge. CoRR abs/2211.01091 (2022) - [i92]Xiaoxue Gao, Xianghu Yue, Haizhou Li:
Self-Transriber: Few-shot Lyrics Transcription with Self-training. CoRR abs/2211.10152 (2022) - [i91]Jiawei Du, Yidi Jiang, Vincent Y. F. Tan, Joey Tianyi Zhou, Haizhou Li:
Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation. CoRR abs/2211.11004 (2022) - [i90]Bin Wang, Haizhou Li:
Relational Sentence Embedding for Flexible Semantic Matching. CoRR abs/2212.08802 (2022) - [i89]Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li:
PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment. CoRR abs/2212.08992 (2022) - 2021
- [j127]Jibin Wu, Qi Liu, Malu Zhang, Zihan Pan, Haizhou Li, Kay Chen Tan:
HuRAI: A brain-inspired computational model for human-robot auditory interface. Neurocomputing 465: 103-113 (2021) - [j126]Rui Liu
, Berrak Sisman
, Yixing Lin, Haizhou Li:
FastTalker: A neural text-to-speech architecture with shallow and group autoregression. Neural Networks 141: 306-314 (2021) - [j125]Hongqiang Du
, Xiaohai Tian, Lei Xie, Haizhou Li
:
Factorized WaveNet for voice conversion with limited data. Speech Commun. 130: 45-54 (2021) - [j124]Tharshini Gunendradasan, Eliathamby Ambikairajah
, Julien Epps, Vidhyasaharan Sethu
, Haizhou Li:
An adaptive transmission line cochlear model based front-end for replay attack detection. Speech Commun. 132: 114-122 (2021) - [j123]Bidisha Sharma, Xiaoxue Gao, Karthika Vijayan, Xiaohai Tian, Haizhou Li
:
NHSS: A speech and singing parallel database. Speech Commun. 133: 9-22 (2021) - [j122]Xinyuan Qian
, Qi Liu
, Jiadong Wang, Haizhou Li
:
Three-Dimensional Speaker Localization: Audio-Refined Visual Scaling Factor Estimation. IEEE Signal Process. Lett. 28: 1405-1409 (2021) - [j121]Berrak Sisman
, Junichi Yamagishi
, Simon King
, Haizhou Li
:
An Overview of Voice Conversion and Its Challenges: From Statistical Modeling to Deep Learning. IEEE ACM Trans. Audio Speech Lang. Process. 29: 132-157 (2021) - [j120]Rui Liu
, Berrak Sisman
, Feilong Bao, Jichen Yang
, Guanglai Gao, Haizhou Li
:
Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 29: 274-285 (2021) - [j119]Mingyang Zhang
, Yi Zhou, Li Zhao, Haizhou Li
:
Transfer Learning From Speech Synthesis to Voice Conversion With Non-Parallel Training Data. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1290-1302 (2021) - [j118]Rui Liu
, Berrak Sisman
, Guanglai Gao, Haizhou Li
:
Expressive TTS Training With Frame and Style Reconstruction Loss. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1806-1818 (2021) - [j117]Chen Zhang
, Grandee Lee
, Luis Fernando D'Haro
, Haizhou Li
:
D-Score: Holistic Dialogue Evaluation Without Reference. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2502-2516 (2021) - [j116]Zihan Pan
, Malu Zhang
, Jibin Wu
, Jiadong Wang, Haizhou Li
:
Multi-Tone Phase Coding of Interaural Time Difference for Sound Source Localization With Spiking Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2656-2670 (2021) - [j115]Chenglin Xu
, Wei Rao
, Jibin Wu
, Haizhou Li
:
Target Speaker Verification With Selective Auditory Attention for Single and Multi-Talker Speech. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2696-2709 (2021) - [j114]Yi Zhou
, Xiaohai Tian
, Haizhou Li
:
Language Agnostic Speaker Embedding for Cross-Lingual Personalized Speech Generation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3427-3439 (2021) - [c609]Yan Zhang, Ruidan He, Zuozhu Liu, Lidong Bing, Haizhou Li:
Bootstrapped Unsupervised Sentence Representation Learning. ACL/IJCNLP (1) 2021: 5168-5180 - [c608]Chen Zhang
, Yiming Chen, Luis Fernando D'Haro, Yan Zhang, Thomas Friedrichs, Grandee Lee, Haizhou Li:
DynaEval: Unifying Turn and Dialogue Level Evaluation. ACL/IJCNLP (1) 2021: 5676-5689 - [c607]Jinhu Li, Chitralekha Gupta, Haizhou Li:
Training Explainable Singing Quality Assessment Network with Augmented Data. APSIPA ASC 2021: 904-911 - [c606]Chitralekha Gupta, Jinhu Li, Haizhou Li:
Towards Reference-Independent Rhythm Assessment of Solo Singing. APSIPA ASC 2021: 912-919 - [c605]Yi Ma, Kong Aik Lee, Ville Hautamäki, Haizhou Li:
PL-EESR: Perceptual Loss Based End-to-End Robust Speaker Representation Extraction. ASRU 2021: 106-113 - [c604]Bidisha Sharma, Maulik C. Madhavi, Xuehao Zhou, Haizhou Li:
Exploring Teacher-Student Learning Approach for Multi-Lingual Speech-to-Intent Classification. ASRU 2021: 419-426 - [c603]Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer. ASRU 2021: 594-601 - [c602]Sergey Nikonorov, Berrak Sisman, Mingyang Zhang, Haizhou Li:
DEEPA: A Deep Neural Analyzer for Speech and Singing Vocoding. ASRU 2021: 618-625 - [c601]Marvin Borsdorf, Haizhou Li, Tanja Schultz:
Target Language Extraction at Multilingual Cocktail Parties. ASRU 2021: 717-724 - [c600]Enze Su, Siqi Cai, Peiwen Li, Longhan Xie, Haizhou Li:
Auditory Attention Detection with EEG Channel Attention. EMBC 2021: 5804-5807 - [c599]Siqi Cai, Pengcheng Sun, Tanja Schultz, Haizhou Li:
Low-Latency Auditory Spatial Attention Detection Based on Spectro-Spatial Features from EEG. EMBC 2021: 5812-5815 - [c598]Yiming Chen, Yan Zhang, Chen Zhang, Grandee Lee, Ran Cheng, Haizhou Li:
Revisiting Self-training for Few-shot Learning of Language Model. EMNLP (1) 2021: 9125-9135 - [c597]Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Learning Disentangled Feature Representations for Speech Enhancement Via Adversarial Training. ICASSP 2021: 666-670 - [c596]