


Остановите войну!
for scientists:


default search action
Lei Xie 0001
Person information

- affiliation: Northwestern Polytechnical University, School of Computer Science, Xi'an, China
- affiliation (2006 - 2007): The Chinese University of Hong Kong, Department of Systems Engineering and Engineering Management, Hong Kong
- affiliation (2004 - 2006): City University of Hong Kong, School of Creative Media, Hong Kong
- affiliation (PhD 2004): Northwestern Polytechnical University, Xi'an, China
- affiliation (2001 - 2002): Vrije Universiteit Brussel, Department of Electronics and Information Processing, Belgium
Other persons with the same name
- Lei Xie — disambiguation page
- Lei Xie 0002 — Xi'an Jiaotong University, China
- Lei Xie 0003
— Zhejiang University, College of Information Science and Electronic Engineering, Hangzhou, China
- Lei Xie 0004
— Nanjing University, State Key Laboratory for Novel Software Technology, China
- Lei Xie 0005
— Delft University of Technology, Laboratory of Computer Engineering, The Netherlands
- Lei Xie 0006
— City University of New York, Department of Computer Science, Hunter College, NY, USA (and 1 more)
- Lei Xie 0007
— Zhejiang University, State Key Laboratory of Industrial Control Technology, Hangzhou, China (and 2 more)
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j58]Xiang Hao
, Chenglin Xu, Lei Xie:
Neural speech enhancement with unsupervised pre-training and mixture training. Neural Networks 158: 216-227 (2023) - [j57]Zhichao Wang
, Yuanzhe Chen, Lei Xie
, Qiao Tian, Yuping Wang:
LM-VC: Zero-Shot Voice Conversion via Speech Generation Based on Language Models. IEEE Signal Process. Lett. 30: 1157-1161 (2023) - [i116]Zhanheng Yang, Sining Sun, Xiong Wang, Yike Zhang, Long Ma, Lei Xie:
Two Stage Contextual Word Filtering for Context bias in Unified Streaming and Non-streaming Transducer. CoRR abs/2301.06735 (2023) - [i115]Ao Zhang, He Wang, Pengcheng Guo, Yihui Fu, Lei Xie, Yingying Gao, Shilei Zhang, Junlan Feng:
VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting. CoRR abs/2302.13523 (2023) - [i114]Li Zhang, Qing Wang, Hongji Wang, Yue Li, Wei Rao, Yannan Wang, Lei Xie:
Distance-based Weight Transfer from Near-field to Far-field Speaker Verification. CoRR abs/2303.00264 (2023) - [i113]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi:
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding. CoRR abs/2305.12425 (2023) - [i112]Li Zhang, Huan Zhao, Yue Li, Bowen Pang, Yannan Wang, Hongji Wang, Wei Rao, Qing Wang, Lei Xie:
The FlySpeech Audio-Visual Speaker Diarization System for MISP Challenge 2022. CoRR abs/2307.15400 (2023) - [i111]Jixun Yao, Yuguang Yang, Yi Lei, Ziqian Ning, Yanni Hu, Yu Pan, Jingjing Yin, Hongbin Zhou, Heng Lu, Lei Xie:
PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts. CoRR abs/2309.09262 (2023) - [i110]Dake Guo, Xinfa Zhu, Liumeng Xue, Tao Li, Yuanjun Lv, Yuepeng Jiang, Lei Xie:
HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS. CoRR abs/2309.13907 (2023) - 2022
- [j56]Hongqiang Du
, Lei Xie, Haizhou Li:
Noise-robust voice conversion with domain adversarial training. Neural Networks 148: 74-84 (2022) - [j55]Chenggang Mi
, Lei Xie, Yanning Zhang:
Improving data augmentation for low resource speech-to-text translation with diverse paraphrasing. Neural Networks 148: 194-205 (2022) - [j54]Jingyong Hou
, Lei Xie, Shilei Zhang:
Two-stage streaming keyword detection and localization with multi-scale depthwise temporal convolution. Neural Networks 150: 28-42 (2022) - [j53]Yi Lei
, Shan Yang, Xinfa Zhu, Lei Xie
, Dan Su:
Cross-Speaker Emotion Transfer Through Information Perturbation in Emotional Speech Synthesis. IEEE Signal Process. Lett. 29: 1948-1952 (2022) - [j52]Xiaochun An, Frank K. Soong, Lei Xie
:
Disentangling Style and Speaker Attributes for TTS Style Transfer. IEEE ACM Trans. Audio Speech Lang. Process. 30: 646-658 (2022) - [j51]Yi Lei, Shan Yang, Xinsheng Wang
, Lei Xie
:
MsEmoTTS: Multi-Scale Emotion Transfer, Prediction, and Control for Emotional Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 30: 853-864 (2022) - [j50]Tao Li
, Xinsheng Wang
, Qicong Xie, Zhichao Wang, Lei Xie
:
Cross-Speaker Emotion Disentangling and Transfer for End-to-End Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1448-1460 (2022) - [j49]Liumeng Xue
, Frank K. Soong, Shaofei Zhang, Lei Xie
:
ParaTTS: Learning Linguistic and Prosodic Cross-Sentence Information in Paragraph-Based TTS. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2854-2864 (2022) - [c210]Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2Met: The Icassp 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. ICASSP 2022: 6167-6171 - [c209]Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng:
WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition. ICASSP 2022: 6182-6186 - [c208]Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma:
Conversational Speech Recognition by Learning Conversation-Level Characteristics. ICASSP 2022: 6752-6756 - [c207]Zhichao Wang, Qicong Xie, Tao Li, Hongqiang Du, Lei Xie, Pengcheng Zhu, Mengxiao Bi:
One-Shot Voice Conversion For Style Transfer Based On Speaker Adaptation. ICASSP 2022: 6792-6796 - [c206]Yongmao Zhang, Jian Cong, Heyang Xue, Lei Xie, Pengcheng Zhu, Mengxiao Bi:
VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis. ICASSP 2022: 7237-7241 - [c205]Yihui Fu, Yun Liu, Jingdong Li, Dawei Luo, Shubo Lv, Yukai Jv, Lei Xie:
Uformer: A Unet Based Dilated Complex & Real Dual-Path Conformer Network for Simultaneous Speech Enhancement and Dereverberation. ICASSP 2022: 7417-7421 - [c204]Shubo Lv, Yihui Fu, Mengtao Xing, Jiayao Sun, Lei Xie, Jun Huang, Yannan Wang, Tao Yu:
S-DCCRN: Super Wide Band DCCRN with Learnable Complex Feature for Speech Enhancement. ICASSP 2022: 7767-7771 - [c203]Shimin Zhang
, Ziteng Wang, Jiayao Sun, Yihui Fu, Biao Tian, Qiang Fu, Lei Xie:
Multi-Task Deep Residual Echo Suppression with Echo-Aware Loss. ICASSP 2022: 9127-9131 - [c202]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160 - [c201]Yukai Ju, Wei Rao, Xiaopeng Yan, Yihui Fu, Shubo Lv, Luyao Cheng, Yannan Wang, Lei Xie, Shidong Shang:
TEA-PSE: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System for ICASSP 2022 DNS Challenge. ICASSP 2022: 9291-9295 - [c200]Fan Yu, Zhihao Du, Shiliang Zhang, Yuxiao Lin, Lei Xie:
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings. INTERSPEECH 2022: 560-564 - [c199]Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma:
Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR. INTERSPEECH 2022: 1016-1020 - [c198]Binbin Zhang, Di Wu, Zhendong Peng, Xingchen Song, Zhuoyuan Yao, Hang Lv, Lei Xie, Chao Yang, Fuping Pan, Jianwei Niu:
WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit. INTERSPEECH 2022: 1661-1665 - [c197]Zhanheng Yang, Sining Sun, Jin Li, Xiaoming Zhang, Xiong Wang, Long Ma, Lei Xie:
CaTT-KWS: A Multi-stage Customized Keyword Spotting Framework based on Cascaded Transducer-Transformer. INTERSPEECH 2022: 1681-1685 - [c196]Shimin Zhang, Ziteng Wang, Yukai Ju, Yihui Fu, Yueyue Na, Qiang Fu, Lei Xie:
Personalized Acoustic Echo Cancellation for Full-duplex Communications. INTERSPEECH 2022: 2518-2522 - [c195]Liumeng Xue, Shan Yang, Na Hu, Dan Su, Lei Xie:
Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers. INTERSPEECH 2022: 2548-2552 - [c194]Yi Lei, Shan Yang, Jian Cong, Lei Xie, Dan Su:
Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion. INTERSPEECH 2022: 2563-2567 - [c193]Zhanheng Yang, Hang Lv, Xiong Wang, Ao Zhang, Lei Xie:
Minimizing Sequential Confusion Error in Speech Command Recognition. INTERSPEECH 2022: 3193-3197 - [c192]Qijie Shao, Jinghao Yan, Jian Kang, Pengcheng Guo, Xian Shi, Pengfei Hu, Lei Xie:
Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition. INTERSPEECH 2022: 3719-3723 - [c191]Yu Wang, Xinsheng Wang, Pengcheng Zhu, Jie Wu, Hanzhao Li, Heyang Xue, Yongmao Zhang, Lei Xie, Mengxiao Bi:
Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis. INTERSPEECH 2022: 4242-4246 - [c190]Heyang Xue, Xinsheng Wang, Yongmao Zhang, Lei Xie, Pengcheng Zhu, Mengxiao Bi:
Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher. INTERSPEECH 2022: 4267-4271 - [c189]Li Zhang, Yue Li, Huan Zhao, Qing Wang, Lei Xie:
Backend Ensemble for Speaker Verification and Spoofing Countermeasure. INTERSPEECH 2022: 4381-4385 - [c188]Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, Mingqi Jiang, Lei Xie:
Cross-speaker Emotion Transfer Based On Prosody Compensation for End-to-End Speech Synthesis. INTERSPEECH 2022: 5498-5502 - [c187]Qicong Xie, Tao Li, Xinsheng Wang, Zhichao Wang, Lei Xie, Guoqiao Yu, Guanglu Wan:
Multi-speaker Multi-style Text-to-speech Synthesis with Single-speaker Single-style Training Data Scenarios. ISCSLP 2022: 66-70 - [c186]Kun Song, Jian Cong, Xinsheng Wang, Yongmao Zhang, Lei Xie, Ning Jiang, Haiying Wu:
Robust MelGAN: A robust universal neural vocoder for high-fidelity TTS. ISCSLP 2022: 71-75 - [c185]Yongmao Zhang, Zhichao Wang, Peiji Yang, Hongshen Sun, Zhisheng Wang, Lei Xie:
AccentSpeech: Learning Accent from Crowd-sourced Data for Target Speaker TTS with Accents. ISCSLP 2022: 76-80 - [c184]Qicong Xie, Shan Yang, Yi Lei, Lei Xie, Dan Su:
End-to-End Voice Conversion with Information Perturbation. ISCSLP 2022: 91-95 - [c183]Kun Song, Heyang Xue, Xinsheng Wang, Jian Cong, Yongmao Zhang, Lei Xie, Bing Yang, Xiong Zhang, Dan Su:
AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation. ISCSLP 2022: 319-323 - [c182]Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. ISCSLP 2022: 488-492 - [c181]Bowen Pang, Huan Zhao, Gaosheng Zhang, Xiaoyue Yang, Yang Sun, Li Zhang, Qing Wang, Lei Xie:
TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge. ISCSLP 2022: 502-506 - [c180]Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu:
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results. ISCSLP 2022: 507-511 - [c179]Yuhao Liang, Peikun Chen, Fan Yu, Xinfa Zhu, Tianyi Xu, Yingying Gao, Lei Xie:
The NPU-ASLP System for The ISCSLP 2022 Magichub Code-Swiching ASR Challenge. ISCSLP 2022: 532-536 - [c178]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yuhao Liang, Zhihao Du, Yuxiao Lin, Lei Xie:
MFCCA:Multi-Frame Cross-Channel Attention for Multi-Speaker ASR in Multi-Party Meeting Scenario. SLT 2022: 144-151 - [c177]Shubo Lv, Yihui Fu, Yukai Jv, Lei Xie, Weixin Zhu, Wei Rao, Yannan Wang:
Spatial-DCCRN: DCCRN Equipped with Frame-Level Angle Feature and Hybrid Filtering for Multi-Channel Speech Enhancement. SLT 2022: 436-443 - [c176]Yukai Ju, Shimin Zhang, Wei Rao, Yannan Wang, Tao Yu, Lei Xie, Shidong Shang:
TEA-PSE 2.0: Sub-Band Network for Real-Time Personalized Speech Enhancement. SLT 2022: 472-479 - [i109]Wendong Gan, Bolong Wen, Ying Yan, Haitao Chen, Zhichao Wang, Hongqiang Du, Lei Xie, Kaixuan Guo, Hai Li:
IQDUBBING: Prosody modeling based on discrete self-supervised speech representation for expressive voice conversion. CoRR abs/2201.00269 (2022) - [i108]Yi Lei, Shan Yang, Xinsheng Wang, Lei Xie:
MsEmoTTS: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis. CoRR abs/2201.06460 (2022) - [i107]Yu Wang, Xinsheng Wang, Pengcheng Zhu, Jie Wu, Hanzhao Li, Heyang Xue, Yongmao Zhang, Lei Xie, Mengxiao Bi:
Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis. CoRR abs/2201.07429 (2022) - [i106]Xiaochun An, Frank K. Soong, Lei Xie:
Disentangling Style and Speaker Attributes for TTS Style Transfer. CoRR abs/2201.09472 (2022) - [i105]Hongqiang Du, Lei Xie, Haizhou Li:
Noise-robust voice conversion with domain adversarial training. CoRR abs/2201.10693 (2022) - [i104]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022) - [i103]Shimin Zhang, Ziteng Wang, Jiayao Sun, Yihui Fu, Biao Tian, Qiang Fu, Lei Xie:
Multi-Task Deep Residual Echo Suppression with Echo-aware Loss. CoRR abs/2202.06850 (2022) - [i102]Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma:
Conversational Speech Recognition By Learning Conversation-level Characteristics. CoRR abs/2202.07855 (2022) - [i101]Binbin Zhang, Di Wu, Zhendong Peng, Xingchen Song, Zhuoyuan Yao, Hang Lv, Lei Xie, Chao Yang, Fuping Pan, Jianwei Niu:
WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit. CoRR abs/2203.15455 (2022) - [i100]Heyang Xue, Xinsheng Wang, Yongmao Zhang, Lei Xie, Pengcheng Zhu, Mengxiao Bi:
Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher. CoRR abs/2203.16408 (2022) - [i99]Fan Yu, Zhihao Du, Shiliang Zhang, Yuxiao Lin, Lei Xie:
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings. CoRR abs/2203.16834 (2022) - [i98]Qijie Shao, Jinghao Yan, Jian Kang, Pengcheng Guo, Xian Shi, Pengfei Hu, Lei Xie:
Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition. CoRR abs/2204.03398 (2022) - [i97]Shimin Zhang, Ziteng Wang, Yukai Ju, Yihui Fu, Yueyue Na, Qiang Fu, Lei Xie:
Personalized Acoustic Echo Cancellation for Full-duplex Communications. CoRR abs/2205.15195 (2022) - [i96]Kun Song, Heyang Xue, Xinsheng Wang, Jian Cong, Yongmao Zhang, Lei Xie, Bing Yang, Xiong Zhang, Dan Su:
AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation. CoRR abs/2206.00208 (2022) - [i95]Qicong Xie, Shan Yang, Yi Lei, Lei Xie, Dan Su:
End-to-End Voice Conversion with Information Perturbation. CoRR abs/2206.07569 (2022) - [i94]Liumeng Xue, Shan Yang, Na Hu, Dan Su, Lei Xie:
Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers. CoRR abs/2207.00756 (2022) - [i93]Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma:
Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR. CoRR abs/2207.01039 (2022) - [i92]Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, Mingqi Jiang, Lei Xie:
Cross-speaker Emotion Transfer Based On Prosody Compensation for End-to-End Speech Synthesis. CoRR abs/2207.01198 (2022) - [i91]Zhanheng Yang, Hang Lv, Xiong Wang, Ao Zhang, Lei Xie:
Minimizing Sequential Confusion Error in Speech Command Recognition. CoRR abs/2207.01261 (2022) - [i90]Zhanheng Yang, Sining Sun, Jin Li, Xiaoming Zhang, Xiong Wang, Long Ma, Lei Xie:
CaTT-KWS: A Multi-stage Customized Keyword Spotting Framework based on Cascaded Transducer-Transformer. CoRR abs/2207.01267 (2022) - [i89]Li Zhang, Yue Li, Huan Zhao, Qing Wang, Lei Xie:
Backend Ensemble for Speaker Verification and Spoofing Countermeasure. CoRR abs/2207.01802 (2022) - [i88]Yi Lei, Shan Yang, Jian Cong, Lei Xie, Dan Su:
Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion. CoRR abs/2207.01832 (2022) - [i87]Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. CoRR abs/2208.08042 (2022) - [i86]Liumeng Xue, Frank K. Soong, Shaofei Zhang, Lei Xie:
ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS. CoRR abs/2209.06484 (2022) - [i85]Jixun Yao, Qing Wang, Li Zhang, Pengcheng Guo, Yuhao Liang, Lei Xie:
NWPU-ASLP System for the VoicePrivacy 2022 Challenge. CoRR abs/2209.11969 (2022) - [i84]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yuhao Liang, Zhihao Du, Yuxiao Lin, Lei Xie:
MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario. CoRR abs/2210.05265 (2022) - [i83]Shubo Lv, Yihui Fu, Yukai Jv, Lei Xie, Weixin Zhu, Wei Rao, Yannan Wang:
spatial-dccrn: dccrn equipped with frame-level angle feature and hybrid filtering for multi-channel speech enhancement. CoRR abs/2210.08802 (2022) - [i82]Yuhao Liang, Peikun Chen, Fan Yu, Xinfa Zhu, Tianyi Xu, Lei Xie:
The NPU-ASLP System for The ISCSLP 2022 Magichub Code-Swiching ASR Challenge. CoRR abs/2210.14448 (2022) - [i81]Bowen Pang, Huan Zhao, Gaosheng Zhang, Xiaoyue Yang, Yang Sun, Li Zhang, Qing Wang, Lei Xie:
TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge. CoRR abs/2210.14653 (2022) - [i80]Jie Wang, Menglong Xu, Jingyong Hou, Binbin Zhang, Xiao-Lei Zhang, Lei Xie, Fuping Pan:
WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit. CoRR abs/2210.16743 (2022) - [i79]Yongmao Zhang, Zhichao Wang, Peiji Yang, Hongshen Sun, Zhisheng Wang, Lei Xie:
AccentSpeech: Learning Accent from Crowd-sourced Data for Target Speaker TTS with Accents. CoRR abs/2210.17305 (2022) - [i78]Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu:
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results. CoRR abs/2211.01585 (2022) - [i77]Yongmao Zhang, Heyang Xue, Hanzhao Li, Lei Xie, Tingwei Guo, Ruixiong Zhang, Caixia Gong:
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer. CoRR abs/2211.02903 (2022) - [i76]Jixun Yao, Yi Lei, Qing Wang, Pengcheng Guo, Ziqian Ning, Lei Xie, Hai Li, Junhui Liu, Danming Xie:
Preserving background sound in noise-robust voice conversion via multi-task learning. CoRR abs/2211.03036 (2022) - [i75]Jixun Yao, Qing Wang, Yi Lei, Pengcheng Guo, Lei Xie, Namin Wang, Jie Liu:
Distinguishable Speaker Anonymization based on Formant and Fundamental Frequency Scaling. CoRR abs/2211.03038 (2022) - [i74]Ziqian Ning, Qicong Xie, Pengcheng Zhu, Zhichao Wang, Liumeng Xue, Jixun Yao, Lei Xie, Mengxiao Bi:
Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features. CoRR abs/2211.04710 (2022) - [i73]Zhuoyuan Yao, Shuo Ren, Sanyuan Chen, Ziyang Ma, Pengcheng Guo, Lei Xie:
TESSP: Text-Enhanced Self-Supervised Speech Pre-training. CoRR abs/2211.13443 (2022) - [i72]Yue Li, Li Zhang, Namin Wang, Jie Liu, Lei Xie:
MSV Challenge 2022: NPU-HC Speaker Verification System for Low-resource Indian Languages. CoRR abs/2211.16694 (2022) - 2021
- [j48]Liumeng Xue
, Shifeng Pan
, Lei He, Lei Xie, Frank K. Soong:
Cycle consistent network for end-to-end style transfer TTS training. Neural Networks 140: 223-236 (2021) - [j47]Xiaochun An
, Frank K. Soong
, Shan Yang, Lei Xie:
Effective and direct control of neural TTS prosody by removing interactions between different attributes. Neural Networks 143: 250-260 (2021) - [j46]Hongqiang Du
, Xiaohai Tian, Lei Xie, Haizhou Li
:
Factorized WaveNet for voice conversion with limited data. Speech Commun. 130: 45-54 (2021) - [j45]Hang Lv
, Daniel Povey, Mahsa Yarmohammadi, Ke Li, Yiming Wang
, Lei Xie, Sanjeev Khudanpur
:
LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation. IEEE Signal Process. Lett. 28: 703-707 (2021) - [c175]Qijie Shao, Jingyong Hou, Yanxin Hu, Qing Wang, Lei Xie, Xin Lei:
Target Speaker Extraction for Customizable Query-by-Example Keyword Spotting. APSIPA ASC 2021: 672-678 - [c174]Li Zhang, Qing Wang, Lei Xie:
Duality Temporal-Channel-Frequency Attention Enhanced Speaker Representation Learning. ASRU 2021: 206-213 - [c173]Fan Yu, Haoneng Luo, Pengcheng Guo, Yuhao Liang, Zhuoyuan Yao, Lei Xie, Yingying Gao, Leijing Hou, Shilei Zhang:
Boundary and Context Aware Training for CIF-Based Non-Autoregressive End-to-End ASR. ASRU 2021: 328-334 - [c172]Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe
, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang:
Conferencingspeech Challenge: Towards Far-Field Multi-Channel Speech Enhancement for Video Conferencing. ASRU 2021: 679-686 - [c171]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Streaming Transformers. ICASSP 2021: 5864-5868 - [c170]Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
An Asynchronous WFST-Based Decoder for Automatic Speech Recognition. ICASSP 2021: 6019-6023 - [c169]Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie:
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods. ICASSP 2021: 6918-6922 - [c168]Qicong Xie, Xiaohai Tian, Guanghou Liu, Kun Song, Lei Xie, Zhiyong Wu, Hai Li, Song Shi, Haizhou Li, Fen Hong, Hui Bu, Xin Xu:
The Multi-Speaker Multi-Style Voice Cloning Challenge 2021. ICASSP 2021: 8613-8617 - [c167]Xian Shi, Pan Zhou, Wei Chen, Lei Xie:
Efficient Gradient-Based Neural Architecture Search For End-to-End ASR. ICMI Companion 2021: 91-96 - [c166]Zhiwei Chen
, Weizhao Yang, Jinrong Li, Jiale Wang, Shuai Li, Ziwen Wang, Lei Xie:
A Web-Based Longitudinal Mental Health Monitoring System. ICMI Companion 2021: 121-125 - [c165]Yi Chen, Shan Yang, Na Hu, Lei Xie, Dan Su:
TeNC: Low Bit-Rate Speech Coding with VQ-VAE and GAN. ICMI Companion 2021: 126-130 - [c164]Heyang Xue, Xiao Zhang, Jie Wu, Jian Luan
, Yujun Wang, Lei Xie:
Noise Robust Singing Voice Synthesis Using Gaussian Mixture Variational Autoencoder. ICMI Companion 2021: 131-136 - [c163]Dongyan Huang, Björn W. Schuller, Jianhua Tao, Lei Xie, Jie Yang:
ASMMC21: The 6th International Workshop on Affective Social Multimedia Computing. ICMI 2021: 864-867 - [c162]Zhichao Wang, Xinyong Zhou, Fengyu Yang, Tao Li, Hongqiang Du, Lei Xie, Wendong Gan, Haitao Chen, Hai Li:
Enriching Source Style Transfer in Recognition-Synthesis Based Non-Parallel Voice Conversion. Interspeech 2021: 831-835 - [c161]