Fenglong Xie

Feng-Long Xie

2020 – today

2025
[c18]
Yujia Xiao, Lei He, Haohan Guo, Fenglong Xie, Tan Lee:
PodAgent: A Comprehensive Framework for Podcast Generation. ACL (Findings) 2025: 23923-23937
[c17]
Haohan Guo, Fenglong Xie, Dongchao Yang, Xixin Wu, Helen Meng:
Speaking from Coarse to Fine: Improving Neural Codec Language Model via Multi-Scale Speech Coding and Generation. ICASSP 2025: 1-5
[i15]
Kaituo Xu, Feng-Long Xie, Xu Tang, Yao Hu:
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration. CoRR abs/2501.14350 (2025)
[i14]
Yujia Xiao, Lei He, Haohan Guo, Fenglong Xie, Tan Lee:
PodAgent: A Comprehensive Framework for Podcast Generation. CoRR abs/2503.00455 (2025)
[i13]
Haohan Guo, Kun Xie, Yi-Chen Wu, Feng-Long Xie, Xu Tang, Yao Hu:
FireRedTTS-1S: An Upgraded Streamable Foundation Text-to-Speech System. CoRR abs/2503.20499 (2025)
[i12]
Kun Xie, Feiyu Shen, Junjie Li, Fenglong Xie, Xu Tang, Yao Hu:
FireRedTTS-2: Towards Long Conversational Speech Generation for Podcast and Chatbot. CoRR abs/2509.02020 (2025)
[i11]
Junjie Chen, Yao Hu, Junjie Li, Kangyue Li, Kun Liu, Wenpeng Li, Xu Li, Ziyuan Li, Feiyu Shen, Xu Tang, Manzhen Wei, Yichen Wu, Fenglong Xie, Kaituo Xu, Kun Xie:
FireRedChat: A Pluggable, Full-Duplex Voice Interaction System with Cascaded and Semi-Cascaded Implementations. CoRR abs/2509.06502 (2025)
2024
[c16]
Haohan Guo, Fenglong Xie, Dongchao Yang, Hui Lu, Xixin Wu, Helen Meng:
Addressing Index Collapse of Large-Codebook Speech Tokenizer With Dual-Decoding Product-Quantized Variational Auto-Encoder. SLT 2024: 548-553
[c15]
Haohan Guo, Fenglong Xie, Kun Xie, Dongchao Yang, Dake Guo, Xixin Wu, Helen Meng:
SoCodec: A Semantic-Ordered Multi-Stream Speech Codec For Efficient Language Model Based Text-to-Speech Synthesis. SLT 2024: 645-651
[i10]
Haohan Guo, Fenglong Xie, Dongchao Yang, Hui Lu, Xixin Wu, Helen Meng:
Addressing Index Collapse of Large-Codebook Speech Tokenizer with Dual-Decoding Product-Quantized Variational Auto-Encoder. CoRR abs/2406.02940 (2024)
[i9]
Haohan Guo, Fenglong Xie, Kun Xie, Dongchao Yang, Dake Guo, Xixin Wu, Helen Meng:
SoCodec: A Semantic-Ordered Multi-Stream Speech Codec for Efficient Language Model Based Text-to-Speech Synthesis. CoRR abs/2409.00933 (2024)
[i8]
Haohan Guo, Kun Liu, Feiyu Shen, Yi-Chen Wu, Feng-Long Xie, Kun Xie, Kaituo Xu:
FireRedTTS: A Foundation Text-To-Speech Framework for Industry-Level Generative Speech Applications. CoRR abs/2409.03283 (2024)
[i7]
Haohan Guo, Fenglong Xie, Dongchao Yang, Xixin Wu, Helen Meng:
Speaking from Coarse to Fine: Improving Neural Codec Language Model via Multi-Scale Speech Coding and Generation. CoRR abs/2409.11630 (2024)
2023
[j2]
Haohan Guo, Fenglong Xie, Xixin Wu, Frank K. Soong, Helen Meng:
MSMC-TTS: Multi-Stage Multi-Codebook VQ-VAE Based Neural TTS. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1811-1824 (2023)
[c14]
Kun Xie, Yi-Chen Wu, Feng-Long Xie:
FireRedTTS: The Xiaohongshu Speech Synthesis System for Blizzard Challenge 2023. Blizzard Challenge 2023
[i6]
Haohan Guo, Fenglong Xie, Jiawen Kang, Yujia Xiao, Xixin Wu, Helen Meng:
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning. CoRR abs/2309.00126 (2023)
2022
[c13]
Haohan Guo, Feng-Long Xie, Frank K. Soong, Xixin Wu, Helen Meng:
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS. INTERSPEECH 2022: 1611-1615
[i5]
Haohan Guo, Feng-Long Xie, Frank K. Soong, Xixin Wu, Helen Meng:
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS. CoRR abs/2209.10887 (2022)
[i4]
Haohan Guo, Fenglong Xie, Xixin Wu, Hui Lu, Helen Meng:
Towards High-Quality Neural TTS for Low-Resource Languages by Learning Compact Speech Representations. CoRR abs/2210.15131 (2022)
2021
[c12]
Shilun Lin, Wen-Chao Su, Li Meng, Fenglong Xie, Xinhui Li, Li Lu:
Nana-HDR: A Non-attentive Non-autoregressive Hybrid Model for TTS. Blizzard Challenge 2021
[c11]
Feng-Long Xie, Xinhui Li, Wen-Chao Su, Li Lu, Frank K. Soong:
A New High Quality Trajectory Tiling Based Hybrid TTS In Real Time. ICASSP 2021: 5704-5708
[c10]
Shilun Lin, Fenglong Xie, Li Meng, Xinhui Li, Li Lu:
Triple M: A Practical Text-to-Speech Synthesis System with Multi-Guidance Attention and Multi-Band Multi-Time LPCNet. Interspeech 2021: 3640-3644
[i3]
Shilun Lin, Fenglong Xie, Xinhui Li, Li Lu:
Triple M: A Practical Neural Text-to-speech System With Multi-guidance Attention And Multi-band Multi-time Lpcnet. CoRR abs/2102.00247 (2021)
[i2]
Shilun Lin, Wen-Chao Su, Li Meng, Fenglong Xie, Xinhui Li, Li Lu:
Nana-HDR: A Non-attentive Non-autoregressive Hybrid Model for TTS. CoRR abs/2109.13673 (2021)
2020
[c9]
Yibin Zheng, Xinhui Li, Fenglong Xie, Li Lu:
Improving End-to-End Speech Synthesis with Local Recurrent Neural Network Enhanced Transformer. ICASSP 2020: 6734-6738
[c8]
Feng-Long Xie, Xinhui Li, Bo Liu, Yibin Zheng, Li Meng, Li Lu, Frank K. Soong:
An Improved Frame-Unit-Selection Based Voice Conversion System Without Parallel Training Data. ICASSP 2020: 7754-7758

2010 – 2019

2019
[j1]
Feng-Long Xie, Frank K. Soong, Haifeng Li:
Voice conversion with SI-DNN and KL divergence based mapping without parallel training data. Speech Commun. 106: 57-67 (2019)
2018
[c7]
Feng-Long Xie, Frank K. Soong, Xi Wang, Lei He, Haifeng Li:
Frame Selection in SI-DNN Phonetic Space with WaveNet Vocoder for Voice Conversion without Parallel Training Data. ISCSLP 2018: 56-60
[i1]
Min-Jae Hwang, Frank K. Soong, Feng-Long Xie, Xi Wang, Hong-Goo Kang:
LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis. CoRR abs/1811.11913 (2018)
2016
[c6]
Feng-Long Xie, Frank K. Soong, Haifeng Li:
A KL divergence and DNN approach to cross-lingual TTS. ICASSP 2016: 5515-5519
[c5]
Feng-Long Xie, Frank K. Soong, Haifeng Li:
A KL Divergence and DNN-Based Approach to Voice Conversion without Parallel Training Sentences. INTERSPEECH 2016: 287-291
2014
[c4]
Yuchen Fan, Yao Qian, Feng-Long Xie, Frank K. Soong:
TTS synthesis with bidirectional LSTM based recurrent neural networks. INTERSPEECH 2014: 1964-1968
[c3]
Feng-Long Xie, Yao Qian, Yuchen Fan, Frank K. Soong, Haifeng Li:
Sequence error (SE) minimization training of neural network for voice conversion. INTERSPEECH 2014: 2283-2287
[c2]
Feng-Long Xie, Yao Qian, Frank K. Soong, Haifeng Li:
Pitch transformation in neural network based voice conversion. ISCSLP 2014: 197-200
2012
[c1]
Feng-Long Xie, Yi-Jian Wu, Frank K. Soong:
Cross validation and Minimum Generation Error for improved model clustering in HMM-based TTS. ISCSLP 2012: 60-63
