


Fenglong Xie
2020 – today
- 2025
- [c18] Yujia Xiao, Lei He, Haohan Guo, Fenglong Xie, Tan Lee:
PodAgent: A Comprehensive Framework for Podcast Generation. ACL (Findings) 2025: 23923-23937
- [c17] Haohan Guo, Fenglong Xie, Dongchao Yang, Xixin Wu, Helen Meng:
Speaking from Coarse to Fine: Improving Neural Codec Language Model via Multi-Scale Speech Coding and Generation. ICASSP 2025: 1-5
- [i13] Kaituo Xu, Feng-Long Xie, Xu Tang, Yao Hu:
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration. CoRR abs/2501.14350 (2025)
- [i12] Yujia Xiao, Lei He, Haohan Guo, Fenglong Xie, Tan Lee:
PodAgent: A Comprehensive Framework for Podcast Generation. CoRR abs/2503.00455 (2025)
- [i11] Haohan Guo, Kun Xie, Yi-Chen Wu, Feng-Long Xie, Xu Tang, Yao Hu:
FireRedTTS-1S: An Upgraded Streamable Foundation Text-to-Speech System. CoRR abs/2503.20499 (2025)
- 2024
- [c16] Haohan Guo, Fenglong Xie, Dongchao Yang, Hui Lu, Xixin Wu, Helen Meng:
Addressing Index Collapse of Large-Codebook Speech Tokenizer With Dual-Decoding Product-Quantized Variational Auto-Encoder. SLT 2024: 548-553
- [c15] Haohan Guo, Fenglong Xie, Kun Xie, Dongchao Yang, Dake Guo, Xixin Wu, Helen Meng:
SoCodec: A Semantic-Ordered Multi-Stream Speech Codec For Efficient Language Model Based Text-to-Speech Synthesis. SLT 2024: 645-651
- [i10] Haohan Guo, Fenglong Xie, Dongchao Yang, Hui Lu, Xixin Wu, Helen Meng:
Addressing Index Collapse of Large-Codebook Speech Tokenizer with Dual-Decoding Product-Quantized Variational Auto-Encoder. CoRR abs/2406.02940 (2024)
- [i9] Haohan Guo, Fenglong Xie, Kun Xie, Dongchao Yang, Dake Guo, Xixin Wu, Helen Meng:
SoCodec: A Semantic-Ordered Multi-Stream Speech Codec for Efficient Language Model Based Text-to-Speech Synthesis. CoRR abs/2409.00933 (2024)
- [i8] Haohan Guo, Kun Liu, Feiyu Shen, Yi-Chen Wu, Feng-Long Xie, Kun Xie, Kaituo Xu:
FireRedTTS: A Foundation Text-To-Speech Framework for Industry-Level Generative Speech Applications. CoRR abs/2409.03283 (2024)
- [i7] Haohan Guo, Fenglong Xie, Dongchao Yang, Xixin Wu, Helen Meng:
Speaking from Coarse to Fine: Improving Neural Codec Language Model via Multi-Scale Speech Coding and Generation. CoRR abs/2409.11630 (2024)
- 2023
- [j2] Haohan Guo, Fenglong Xie, Xixin Wu, Frank K. Soong, Helen Meng:
MSMC-TTS: Multi-Stage Multi-Codebook VQ-VAE Based Neural TTS. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1811-1824 (2023)
- [c14] Kun Xie, Yi-Chen Wu, Feng-Long Xie:
FireRedTTS: The Xiaohongshu Speech Synthesis System for Blizzard Challenge 2023. Blizzard Challenge 2023
- [i6] Haohan Guo, Fenglong Xie, Jiawen Kang, Yujia Xiao, Xixin Wu, Helen Meng:
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning. CoRR abs/2309.00126 (2023)
- 2022
- [c13] Haohan Guo, Feng-Long Xie, Frank K. Soong, Xixin Wu, Helen Meng:
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS. INTERSPEECH 2022: 1611-1615
- [i5] Haohan Guo, Feng-Long Xie, Frank K. Soong, Xixin Wu, Helen Meng:
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS. CoRR abs/2209.10887 (2022)
- [i4] Haohan Guo, Fenglong Xie, Xixin Wu, Hui Lu, Helen Meng:
Towards High-Quality Neural TTS for Low-Resource Languages by Learning Compact Speech Representations. CoRR abs/2210.15131 (2022)
- 2021
- [c12] Shilun Lin, Wen-Chao Su, Li Meng, Fenglong Xie, Xinhui Li, Li Lu:
Nana-HDR: A Non-attentive Non-autoregressive Hybrid Model for TTS. Blizzard Challenge 2021
- [c11] Feng-Long Xie, Xinhui Li, Wen-Chao Su, Li Lu, Frank K. Soong:
A New High Quality Trajectory Tiling Based Hybrid TTS In Real Time. ICASSP 2021: 5704-5708
- [c10] Shilun Lin, Fenglong Xie, Li Meng, Xinhui Li, Li Lu:
Triple M: A Practical Text-to-Speech Synthesis System with Multi-Guidance Attention and Multi-Band Multi-Time LPCNet. Interspeech 2021: 3640-3644
- [i3] Shilun Lin, Fenglong Xie, Xinhui Li, Li Lu:
Triple M: A Practical Neural Text-to-speech System With Multi-guidance Attention And Multi-band Multi-time Lpcnet. CoRR abs/2102.00247 (2021)
- [i2] Shilun Lin, Wen-Chao Su, Li Meng, Fenglong Xie, Xinhui Li, Li Lu:
Nana-HDR: A Non-attentive Non-autoregressive Hybrid Model for TTS. CoRR abs/2109.13673 (2021)
- 2020
- [c9] Yibin Zheng, Xinhui Li, Fenglong Xie, Li Lu:
Improving End-to-End Speech Synthesis with Local Recurrent Neural Network Enhanced Transformer. ICASSP 2020: 6734-6738
- [c8] Feng-Long Xie, Xinhui Li, Bo Liu, Yibin Zheng, Li Meng, Li Lu, Frank K. Soong:
An Improved Frame-Unit-Selection Based Voice Conversion System Without Parallel Training Data. ICASSP 2020: 7754-7758
2010 – 2019
- 2019
- [j1] Feng-Long Xie, Frank K. Soong, Haifeng Li:
Voice conversion with SI-DNN and KL divergence based mapping without parallel training data. Speech Commun. 106: 57-67 (2019)
- 2018
- [c7] Feng-Long Xie, Frank K. Soong, Xi Wang, Lei He, Haifeng Li:
Frame Selection in SI-DNN Phonetic Space with WaveNet Vocoder for Voice Conversion without Parallel Training Data. ISCSLP 2018: 56-60
- [i1] Min-Jae Hwang, Frank K. Soong, Feng-Long Xie, Xi Wang, Hong-Goo Kang:
LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis. CoRR abs/1811.11913 (2018)
- 2016
- [c6] Feng-Long Xie, Frank K. Soong, Haifeng Li:
A KL divergence and DNN approach to cross-lingual TTS. ICASSP 2016: 5515-5519
- [c5] Feng-Long Xie, Frank K. Soong, Haifeng Li:
A KL Divergence and DNN-Based Approach to Voice Conversion without Parallel Training Sentences. INTERSPEECH 2016: 287-291
- 2014
- [c4] Yuchen Fan, Yao Qian, Feng-Long Xie, Frank K. Soong:
TTS synthesis with bidirectional LSTM based recurrent neural networks. INTERSPEECH 2014: 1964-1968
- [c3] Feng-Long Xie, Yao Qian, Yuchen Fan, Frank K. Soong, Haifeng Li:
Sequence error (SE) minimization training of neural network for voice conversion. INTERSPEECH 2014: 2283-2287
- [c2] Feng-Long Xie, Yao Qian, Frank K. Soong, Haifeng Li:
Pitch transformation in neural network based voice conversion. ISCSLP 2014: 197-200
- 2012
- [c1] Feng-Long Xie, Yi-Jian Wu, Frank K. Soong:
Cross validation and Minimum Generation Error for improved model clustering in HMM-based TTS. ISCSLP 2012: 60-63
last updated on 2025-07-30 19:42 CEST by the dblp team
all metadata released as open data under CC0 1.0 license