default search action

combined dblp search
author search
venue search
publication search

ask others

Xixin Wu

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Journal Articles

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/access/FengWM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/FengWM24
Xiaohan Feng, Xixin Wu, Helen Meng:
Injecting Linguistic Knowledge Into BERT for Dialogue State Tracking. IEEE Access 12: 93761-93770 (2024)
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/eaai/YangWHG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/eaai/YangWHG24
Zihao Yang, Xixin Wu, Xindang He, Xiaofei Guan:
A multiscale analysis-assisted two-stage reduced-order deep learning approach for effective thermal conductivity of arbitrary contrast heterogeneous materials. Eng. Appl. Artif. Intell. 136: 108916 (2024)
2023
[j9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taffco/WuZWW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taffco/WuZWW23
Wen Wu, Chao Zhang, Xixin Wu, Philip C. Woodland:
Estimating the Uncertainty in Emotion Class Labels With Utterance-Specific Dirichlet Priors. IEEE Trans. Affect. Comput. 14(4): 2810-2822 (2023)
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/GuoXWSM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GuoXWSM23
Haohan Guo, Fenglong Xie, Xixin Wu, Frank K. Soong, Helen Meng:
MSMC-TTS: Multi-Stage Multi-Codebook VQ-VAE Based Neural TTS. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1811-1824 (2023)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LeiZCWWKM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LeiZCWWKM23
Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Xixin Wu, Shiyin Kang, Helen Meng:
MSStyleTTS: Multi-Scale Style Modeling With Hierarchical Context Information for Expressive Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3290-3303 (2023)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WuLLWLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuLLWLM23
Xixin Wu, Hui Lu, Kun Li, Zhiyong Wu, Xunying Liu, Helen Meng:
Hiformer: Sequence Modeling Networks With Hierarchical Attention Mechanisms. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3993-4003 (2023)
2021
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WuCLLKWLM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuCLLKWLM21
Xixin Wu, Yuewen Cao, Hui Lu, Songxiang Liu, Shiyin Kang, Zhiyong Wu, Xunying Liu, Helen Meng:
Exemplar-Based Emotive Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 29: 874-886 (2021)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuCWWLM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuCWWLM21
Songxiang Liu, Yuewen Cao, Disong Wang, Xixin Wu, Xunying Liu, Helen Meng:
Any-to-Many Voice Conversion With Location-Relative Sequence-to-Sequence Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1717-1728 (2021)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WuCLLWWLM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuCLLWWLM21
Xixin Wu, Yuewen Cao, Hui Lu, Songxiang Liu, Disong Wang, Zhiyong Wu, Xunying Liu, Helen Meng:
Speech Emotion Recognition Using Sequential Capsule Networks. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3280-3291 (2021)
2017
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/LiWM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/LiWM17
Kun Li, Xixin Wu, Helen M. Meng:
Intonation classification for L2 English speech using multi-distribution deep neural networks. Comput. Speech Lang. 43: 18-33 (2017)
2015
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/mta/WuZWLM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mta/WuZWLM15
Zhiyong Wu, Kai Zhao, Xixin Wu, Xinyu Lan, Helen Meng:
Acoustic to articulatory mapping with deep neural network. Multim. Tools Appl. 74(22): 9889-9907 (2015)

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c74]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/Tang0WHCLM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Tang0WHCLM24
Boshi Tang, Zhiyong Wu, Xixin Wu, Qiaochu Huang, Jun Chen, Shun Lei, Helen Meng:
SimCalib: Graph Neural Network Calibration Based on Similarity between Nodes. AAAI 2024: 15267-15275
[c73]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuWGL0M24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuWGL0M24
Hui Lu, Xixin Wu, Haohan Guo, Songxiang Liu, Zhiyong Wu, Helen Meng:
Unifying One-Shot Voice Conversion and Cloning with Disentangled Speech Representations. ICASSP 2024: 11141-11145
[c72]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0002MCGWLM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0002MCGWLM24
Jiawen Kang, Lingwei Meng, Mingyu Cui, Haohan Guo, Xixin Wu, Xunying Liu, Helen Meng:
Cross-Speaker Encoding Network for Multi-Talker Speech Recognition. ICASSP 2024: 11986-11990
[c71]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangWWMM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangWWMM24
Yuejiao Wang, Xixin Wu, Disong Wang, Lingwei Meng, Helen Meng:
UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization. ICASSP 2024: 12306-12310
[c70]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenWZ00WM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenWZ00WM24
Xueyuan Chen, Xi Wang, Shaofei Zhang, Lei He, Zhiyong Wu, Xixin Wu, Helen Meng:
Stylespeech: Self-Supervised Style Enhancing with VQ-VAE-Based Pre-Training for Expressive Audiobook Speech Synthesis. ICASSP 2024: 12316-12320
[c69]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenWWW0LM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenWWW0LM24
Xueyuan Chen, Yuejiao Wang, Xixin Wu, Disong Wang, Zhiyong Wu, Xunying Liu, Helen Meng:
Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction. ICASSP 2024: 12341-12345
[c68]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Lei0CLWWKJZ0M24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Lei0CLWWKJZ0M24
Shun Lei, Yixuan Zhou, Liyang Chen, Dan Luo, Zhiyong Wu, Xixin Wu, Shiyin Kang, Tao Jiang, Yahui Zhou, Yuxing Han, Helen Meng:
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts. ICASSP 2024: 12662-12666
[c67]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/YangT0HLGCSZ0ZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangT0HLGCSZ0ZW24
Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Haohan Guo, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Zhou Zhao, Xixin Wu, Helen M. Meng:
UniAudio: Towards Universal Audio Generation with Large Language Models. ICML 2024
[c66]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/WuCWLM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/WuCWLM24
Wenxuan Wu, Xueyuan Chen, Xixin Wu, Haizhou Li, Helen Meng:
Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy. IJCNN 2024: 1-8
[c65]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/ZhouHLZWKM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/ZhouHLZWKM24
Jingyan Zhou, Minda Hu, Junan Li, Xiaoying Zhang, Xixin Wu, Irwin King, Helen Meng:
Rethinking Machine Ethics - Can LLMs Perform Moral Reasoning through the Lens of Moral Theories? NAACL-HLT (Findings) 2024: 2227-2242
[c64]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/ZhangGLCGGKWMG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/ZhangGLCGGKWMG24
Tianhua Zhang, Jiaxin Ge, Hongyin Luo, Yung-Sung Chuang, Mingye Gao, Yuan Gong, Yoon Kim, Xixin Wu, Helen Meng, Jim Glass:
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning. NAACL-HLT (Findings) 2024: 4131-4155
2023
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/LuoZCGKWMG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/LuoZCGKWMG23
Hongyin Luo, Tianhua Zhang, Yung-Sung Chuang, Yuan Gong, Yoon Kim, Xixin Wu, Helen Meng, James R. Glass:
Search Augmented Instruction Learning. EMNLP (Findings) 2023: 3717-3729
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiSLZLWLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiSLZLWLM23
Jinchao Li, Kaitao Song, Junan Li, Bo Zheng, Dongsheng Li, Xixin Wu, Xunying Liu, Helen Meng:
Leveraging Pretrained Representations With Task-Related Keywords for Alzheimer's Disease Detection. ICASSP 2023: 1-5
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiWSLLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiWSLLM23
Jinchao Li, Xixin Wu, Kaitao Song, Dongsheng Li, Xunying Liu, Helen Meng:
A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition. ICASSP 2023: 1-5
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuGWWLD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuGWWLD23
Yuhao Liu, Cheng Gong, Longbiao Wang, Xixin Wu, Qiuyu Liu, Jianwu Dang:
VF-Taco2: Towards Fast and Lightweight Synthesis for Autoregressive Models with Variation Autoencoder and Feature Distillation. ICASSP 2023: 1-5
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengKCWWM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengKCWWM23
Lingwei Meng, Jiawen Kang, Mingyu Cui, Yuejiao Wang, Xixin Wu, Helen Meng:
A Sidecar Separator Can Convert A Single-Talker Speech Recognition System to A Multi-Talker One. ICASSP 2023: 1-5
[c58]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengMMFGKLMWWWW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengMMFGKLMWWWW23
Helen Meng, Brian Mak, Man-Wai Mak, Helene H. Fung, Xianmin Gong, Timothy C. Y. Kwok, Xunying Liu, Vincent C. T. Mok, Patrick C. M. Wong, Jean Woo, Xixin Wu, Ka Ho Wong, Sean Shensheng Xu, Naijun Zheng, Ranzo Huang, Jiawen Kang, Xiaoquan Ke, Junan Li, Jinchao Li, Yi Wang:
Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders. INTERSPEECH 2023: 1713-1717
[c57]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiLWM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiLWM23
Yunxiang Li, Pengfei Liu, Xixin Wu, Helen Meng:
PunCantonese: A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts. INTERSPEECH 2023: 2183-2187
[c56]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengKCWWM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengKCWWM23
Lingwei Meng, Jiawen Kang, Mingyu Cui, Haibin Wu, Xixin Wu, Helen Meng:
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator. INTERSPEECH 2023: 3467-3471
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LuW0M23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LuW0M23
Hui Lu, Xixin Wu, Zhiyong Wu, Helen Meng:
SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody. ACM Multimedia 2023: 2829-2837
2022
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/acl-dialdoc/LiZTLLWM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl-dialdoc/LiZTLLWM22
Kun Li, Tianhua Zhang, Liping Tang, Junan Li, Hongyuan Lu, Xixin Wu, Helen Meng:
Grounded Dialogue Generation with Cross-encoding Re-ranker, Grounding Span Prediction, and Passage Dropout. DialDoc@ACL 2022: 123-129
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuZLWLM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuZLWLM22
Haibin Wu, Bo Zheng, Xu Li, Xixin Wu, Hung-Yi Lee, Helen Meng:
Characterizing the Adversarial Vulnerability of Speech self-Supervised Learning. ICASSP 2022: 3164-3168
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLWLSLM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLWLSLM22
Disong Wang, Songxiang Liu, Xixin Wu, Hui Lu, Lifa Sun, Xunying Liu, Helen Meng:
Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation. ICASSP 2022: 6677-6681
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuHWLM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuHWLM22
Xixin Wu, Shoukang Hu, Zhiyong Wu, Xunying Liu, Helen Meng:
Neural Architecture Search for Speech Emotion Recognition. ICASSP 2022: 6902-6906
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SuZDLWLM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SuZDLWLM22
Hang Su, Danyang Zhao, Long Dang, Minglei Li, Xixin Wu, Xunying Liu, Helen Meng:
A Multitask Learning Framework for Speaker Change Detection with Content Information from Unsupervised Speech Decomposition. ICASSP 2022: 8087-8091
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhengLWMKWWSM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhengLWMKWWSM22
Naijun Zheng, Na Li, Xixin Wu, Lingwei Meng, Jiawen Kang, Haibin Wu, Chao Weng, Dan Su, Helen Meng:
The CUHK-Tencent Speaker Diarization System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. ICASSP 2022: 9161-9165
[c48]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenSTWK0M22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenSTWK0M22
Jie Chen, Changhe Song, Deyi Tuo, Xixin Wu, Shiyin Kang, Zhiyong Wu, Helen Meng:
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information. INTERSPEECH 2022: 426-430
[c47]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuoLWM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuoLWM22
Haohan Guo, Hui Lu, Xixin Wu, Helen Meng:
A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS. INTERSPEECH 2022: 1566-1570
[c46]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuoXSWM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuoXSWM22
Haohan Guo, Feng-Long Xie, Frank K. Soong, Xixin Wu, Helen Meng:
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS. INTERSPEECH 2022: 1611-1615
[c45]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangWYMHWLM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangWYMHWLM22
Yi Wang, Tianzi Wang, Zi Ye, Lingwei Meng, Shoukang Hu, Xixin Wu, Xunying Liu, Helen Meng:
Exploring linguistic feature and model combination for speech recognition based automatic AD detection. INTERSPEECH 2022: 3328-3332
[c44]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuMKLLWLM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuMKLLWLM22
Haibin Wu, Lingwei Meng, Jiawen Kang, Jinchao Li, Xu Li, Xixin Wu, Hung-yi Lee, Helen Meng:
Spoofing-Aware Speaker Verification by Multi-Level Fusion. INTERSPEECH 2022: 4357-4361
[c43]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ChungLLLWM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ChungLLLWM22
HoLam Chung, Junan Li, Pengfei Liu, Wai-Kim Leung, Xixin Wu, Helen Meng:
Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition. ISCSLP 2022: 26-30
[c42]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ChenHWWM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ChenHWWM22
Xueyuan Chen, Qiaochu Huang, Xixin Wu, Zhiyong Wu, Helen Meng:
HILvoice:Human-in-the-Loop Style Selection for Elder-Facing Speech Synthesis. ISCSLP 2022: 86-90
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiMW0JMTWW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiMW0JMTWW22
Jingbei Li, Yi Meng, Xixin Wu, Zhiyong Wu, Jia Jia, Helen Meng, Qiao Tian, Yuping Wang, Yuxuan Wang:
Inferring Speaking Styles from Multi-modal Conversational Context by Multi-scale Relational Graph Convolutional Networks. ACM Multimedia 2022: 5811-5820
[c40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/WuKMZW0LM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/WuKMZW0LM22
Haibin Wu, Jiawen Kang, Lingwei Meng, Yang Zhang, Xixin Wu, Zhiyong Wu, Hung-yi Lee, Helen Meng:
Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion. Odyssey 2022: 92-99
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/robio/LiHNCWDMHLCNNCL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/robio/LiHNCWDMHLCNNCL22
Jixiu Li, Yisen Huang, Wing Yin Ng, Truman Cheng, Xixin Wu, Qi Dou, Helen Meng, Pheng-Ann Heng, Yunhui Liu, Shannon Melissa Chan, David Navarro-Alarcon, Calvin Sze Hang Ng, Philip Wai Yan Chiu, Zheng Li:
Speech-Vision Based Multi-Modal AI Control of a Magnetic Anchored and Actuated Endoscope. ROBIO 2022: 403-408
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LuWWWLM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LuWWWLM22
Hui Lu, Disong Wang, Xixin Wu, Zhiyong Wu, Xunying Liu, Helen Meng:
Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE. SLT 2022: 814-821
2021
[c37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DouWWLG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DouWWLG21
Qingyun Dou, Xixin Wu, Moquan Wan, Yiting Lu, Mark J. F. Gales:
Deliberation-Based Multi-Pass Speech Synthesis. Interspeech 2021: 136-140
[c36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Lu0WLKLM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Lu0WLKLM21
Hui Lu, Zhiyong Wu, Xixin Wu, Xu Li, Shiyin Kang, Xunying Liu, Helen Meng:
VAENAR-TTS: Variational Auto-Encoder Based Non-AutoRegressive Text-to-Speech Synthesis. Interspeech 2021: 3775-3779
[c35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiWLLM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiWLLM21
Xu Li, Xixin Wu, Hui Lu, Xunying Liu, Helen Meng:
Channel-Wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks. Interspeech 2021: 4314-4318
[c34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLSWLM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLSWLM21
Disong Wang, Songxiang Liu, Lifa Sun, Xixin Wu, Xunying Liu, Helen Meng:
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion. Interspeech 2021: 4813-4817
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangYWSLM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangYWSLM21
Disong Wang, Jianwei Yu, Xixin Wu, Lifa Sun, Xunying Liu, Helen Meng:
Improved End-to-End Dysarthric Speech Recognition via Meta-learning Based Model Re-initialization. ISCSLP 2021: 1-5
2020
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuWCSWKWLSYM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuWCSWKWLSYM20
Songxiang Liu, Disong Wang, Yuewen Cao, Lifa Sun, Xixin Wu, Shiyin Kang, Zhiyong Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng:
End-To-End Accent Conversion Without Using Native Utterances. ICASSP 2020: 6289-6293
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiZWYLM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiZWYLM20
Xu Li, Jinghua Zhong, Xixin Wu, Jianwei Yu, Xunying Liu, Helen Meng:
Adversarial Attacks on GMM I-Vector Based Speaker Verification Systems. ICASSP 2020: 6579-6583
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/CaoLWKLWLSYM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CaoLWKLWLSYM20
Yuewen Cao, Songxiang Liu, Xixin Wu, Shiyin Kang, Peng Liu, Zhiyong Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng:
Code-Switched Speech Synthesis Using Bilingual Phonetic Posteriorgram with Only Monolingual Corpora. ICASSP 2020: 7619-7623
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangYWLSLM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangYWLSLM20
Disong Wang, Jianwei Yu, Xixin Wu, Songxiang Liu, Lifa Sun, Xunying Liu, Helen Meng:
End-To-End Voice Conversion Via Cross-Modal Knowledge Distillation for Dysarthric Speech Reconstruction. ICASSP 2020: 7744-7748
[c28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KnillW0WG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KnillW0WG20
Kate M. Knill, Linlin Wang, Yu Wang, Xixin Wu, Mark J. F. Gales:
Non-Native Children's Automatic Speech Recognition: The INTERSPEECH 2020 Shared Task ALTA Systems. INTERSPEECH 2020: 255-259
[c27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiLZWLSYM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiLZWLSYM20
Xu Li, Na Li, Jinghua Zhong, Xixin Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng:
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification. INTERSPEECH 2020: 1540-1544
[c26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhengWZLM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhengWZLM20
Naijun Zheng, Xixin Wu, Jinghua Zhong, Xunying Liu, Helen Meng:
Speaker-Aware Linear Discriminant Analysis in Speaker Verification. INTERSPEECH 2020: 3012-3016
[c25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuKGM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuKGM20
Xixin Wu, Kate M. Knill, Mark J. F. Gales, Andrey Malinin:
Ensemble Approaches for Uncertainty in Spoken Language Assessment. INTERSPEECH 2020: 3860-3864
[c24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/LiZYHWLM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/LiZYHWLM20
Xu Li, Jinghua Zhong, Jianwei Yu, Shoukang Hu, Xixin Wu, Xunying Liu, Helen Meng:
Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification. Odyssey 2020: 365-371
2019
[c23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/emnlp/LiaoLZWWW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/LiaoLZWWW19
Ming Liao, Jing Li, Haisong Zhang, Lingzhi Wang, Xixin Wu, Kam-Fai Wong:
Coupling Global and Local Context for Unsupervised Aspect Extraction. EMNLP/IJCNLP (1) 2019: 4578-4588
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuLXLYWLM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuLXLYWLM19
Shoukang Hu, Max W. Y. Lam, Xurong Xie, Shansong Liu, Jianwei Yu, Xixin Wu, Xunying Liu, Helen Meng:
Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition. ICASSP 2019: 6555-6559
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuLCLYDMHWLM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuLCLYDMHWLM19
Xixin Wu, Songxiang Liu, Yuewen Cao, Xu Li, Jianwei Yu, Dongyang Dai, Xi Ma, Shoukang Hu, Zhiyong Wu, Xunying Liu, Helen Meng:
Speech Emotion Recognition Using Capsule Networks. ICASSP 2019: 6695-6699
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/CaoWLYLWLM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CaoWLYLWLM19
Yuewen Cao, Xixin Wu, Songxiang Liu, Jianwei Yu, Xu Li, Zhiyong Wu, Xunying Liu, Helen Meng:
End-to-end Code-switched TTS with Mix of Monolingual Recordings. ICASSP 2019: 6935-6939
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangWWKTLSYM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangWWKTLSYM19
Mu Wang, Xixin Wu, Zhiyong Wu, Shiyin Kang, Deyi Tuo, Guangzhi Li, Dan Su, Dong Yu, Helen Meng:
Quasi-fully Convolutional Neural Network with Variational Inference for Speech Synthesis. ICASSP 2019: 7060-7064
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuLCHLWLM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuLCHLWLM19
Jianwei Yu, Max W. Y. Lam, Xie Chen, Shoukang Hu, Songxiang Liu, Xixin Wu, Xunying Liu, Helen Meng:
Recurrent Neural Network Language Model Training Using Natural Gradient. ICASSP 2019: 7260-7264
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DaiWLW0M19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DaiWLW0M19
Dongyang Dai, Zhiyong Wu, Runnan Li, Xixin Wu, Jia Jia, Helen Meng:
Learning Discriminative Features from Spectrograms Using Center Loss for Speech Emotion Recognition. ICASSP 2019: 7405-7409
[c16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuCWSLM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuCWSLM19
Songxiang Liu, Yuewen Cao, Xixin Wu, Lifa Sun, Xunying Liu, Helen Meng:
Jointly Trained Conversion Model and WaveNet Vocoder for Non-Parallel Voice Conversion Using Mel-Spectrograms and Phonetic Posteriorgrams. INTERSPEECH 2019: 714-718
[c15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DaiWKW0S0M19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DaiWKW0S0M19
Dongyang Dai, Zhiyong Wu, Shiyin Kang, Xixin Wu, Jia Jia, Dan Su, Dong Yu, Helen Meng:
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT. INTERSPEECH 2019: 2090-2094
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuXLLYWLM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuXLLYWLM19
Shoukang Hu, Xurong Xie, Shansong Liu, Max W. Y. Lam, Jianwei Yu, Xixin Wu, Xunying Liu, Helen Meng:
LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition. INTERSPEECH 2019: 2793-2797
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SuDWLM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SuDWLM19
Hang Su, Borislav Dzodzo, Xixin Wu, Xunying Liu, Helen Meng:
Unsupervised Methods for Audio Classification from Lecture Discussion Recordings. INTERSPEECH 2019: 3347-3351
[c12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuLHWLCLM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuLHWLCLM19
Jianwei Yu, Max W. Y. Lam, Shoukang Hu, Xixin Wu, Xu Li, Yuewen Cao, Xunying Liu, Helen Meng:
Comparative Study of Parametric and Representation Uncertainty Modeling for Recurrent Neural Network Language Models. INTERSPEECH 2019: 3510-3514
2018
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuSKLWLM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuSKLWLM18
Xixin Wu, Lifa Sun, Shiyin Kang, Songxiang Liu, Zhiyong Wu, Xunying Liu, Helen Meng:
Feature Based Adaptation for Speaking Style Synthesis. ICASSP 2018: 5304-5308
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/MaoWLLWM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/MaoWLLWM18
Shaoguang Mao, Zhiyong Wu, Xu Li, Runnan Li, Xixin Wu, Helen Meng:
Integrating Articulatory Features into Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech. ICME 2018: 1-6
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuZSWLM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuZSWLM18
Songxiang Liu, Jinghua Zhong, Lifa Sun, Xixin Wu, Xunying Liu, Helen Meng:
Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance. INTERSPEECH 2018: 496-500
[c8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiMWLLM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiMWLLM18
Xu Li, Shaoguang Mao, Xixin Wu, Kun Li, Xunying Liu, Helen Meng:
Unsupervised Discovery of Non-native Phonetic Patterns in L2 English Speech for Mispronunciation Detection and Diagnosis. INTERSPEECH 2018: 2554-2558
[c7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuXLHLWWLM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuXLHLWWLM18
Jianwei Yu, Xurong Xie, Shansong Liu, Shoukang Hu, Max W. Y. Lam, Xixin Wu, Ka Ho Wong, Xunying Liu, Helen Meng:
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus. INTERSPEECH 2018: 2938-2942
[c6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuCWLKWLSYM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuCWLKWLSYM18
Xixin Wu, Yuewen Cao, Mu Wang, Songxiang Liu, Shiyin Kang, Zhiyong Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng:
Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis. INTERSPEECH 2018: 3072-3076
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangWKW0SYM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangWKW0SYM18
Mu Wang, Zhiyong Wu, Shiyin Kang, Xixin Wu, Jia Jia, Dan Su, Dong Yu, Helen Meng:
Speech Super-Resolution Using Parallel WaveNet. ISCSLP 2018: 260-264
[c4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/LiuSWLM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/LiuSWLM18
Songxiang Liu, Lifa Sun, Xixin Wu, Xunying Liu, Helen Meng:
The HCCL-CUHK System for the Voice Conversion Challenge 2018. Odyssey 2018: 248-254
2015
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/acii/WuWNJCM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acii/WuWNJCM15
Xixin Wu, Zhiyong Wu, Yishuang Ning, Jia Jia, Lianhong Cai, Helen M. Meng:
Understanding speaking styles of internet speech data with LSTM and low-resource training. ACII 2015: 815-820
2014
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WuWJMCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WuWJMCL14
Xixin Wu, Zhiyong Wu, Jia Jia, Helen M. Meng, Lianhong Cai, Weifeng Li:
Automatic speech data clustering with human perception based weighted distance. ISCSLP 2014: 216-220
2012
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WuWJC12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WuWJC12
Xixin Wu, Zhiyong Wu, Jia Jia, Lianhong Cai:
Adaptive named entity recognition based on conditional random fields with automatic updated dynamic gazetteers. ISCSLP 2012: 363-367

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i63]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-04152
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-04152
Jiawen Kang, Lingwei Meng, Mingyu Cui, Haohan Guo, Xixin Wu, Xunying Liu, Helen Meng:
Cross-Speaker Encoding Network for Multi-Talker Speech Recognition. CoRR abs/2401.04152 (2024)
[i62]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-14664
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-14664
Yuejiao Wang, Xixin Wu, Disong Wang, Lingwei Meng, Helen Meng:
UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization. CoRR abs/2401.14664 (2024)
[i61]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-17796
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-17796
Xueyuan Chen, Yuejiao Wang, Xixin Wu, Disong Wang, Zhiyong Wu, Xunying Liu, Helen Meng:
Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction. CoRR abs/2401.17796 (2024)
[i60]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-16078
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-16078
Wenxuan Wu, Xueyuan Chen, Xixin Wu, Haizhou Li, Helen Meng:
Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy. CoRR abs/2403.16078 (2024)
[i59]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02328
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02328
Dongchao Yang, Dingdong Wang, Haohan Guo, Xueyuan Chen, Xixin Wu, Helen Meng:
SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models. CoRR abs/2406.02328 (2024)
[i58]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02940
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02940
Haohan Guo, Fenglong Xie, Dongchao Yang, Hui Lu, Xixin Wu, Helen Meng:
Addressing Index Collapse of Large-Codebook Speech Tokenizer with Dual-Decoding Product-Quantized Variational Auto-Encoder. CoRR abs/2406.02940 (2024)
[i57]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-08336
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-08336
Xueyuan Chen, Dongchao Yang, Dingdong Wang, Xixin Wu, Zhiyong Wu, Helen Meng:
CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction. CoRR abs/2406.08336 (2024)
[i56]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10056
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10056
Dongchao Yang, Haohan Guo, Yuanyuan Wang, Rongjie Huang, Xiang Li, Xu Tan, Xixin Wu, Helen Meng:
UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner. CoRR abs/2406.10056 (2024)
[i55]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10991
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10991
Tianhua Zhang, Kun Li, Hongyin Luo, Xixin Wu, James R. Glass, Helen Meng:
Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers. CoRR abs/2406.10991 (2024)
[i54]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-14092
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-14092
Jing Xu, Minglin Wu, Xixin Wu, Helen Meng:
Seamless Language Expansion: Enhancing Multilingual Mastery in Self-Supervised Models. CoRR abs/2406.14092 (2024)
[i53]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-01850
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-01850
Jingyan Zhou, Kun Li, Junan Li, Jiawen Kang, Minda Hu, Xixin Wu, Helen Meng:
Purple-teaming LLMs with Adversarial Defender Training. CoRR abs/2407.01850 (2024)
[i52]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-08551
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-08551
Lingwei Meng, Long Zhou, Shujie Liu, Sanyuan Chen, Bing Han, Shujie Hu, Yanqing Liu, Jinyu Li, Sheng Zhao, Xixin Wu, Helen Meng, Furu Wei:
Autoregressive Speech Synthesis without Vector Quantization. CoRR abs/2407.08551 (2024)
[i51]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-09817
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-09817
Lingwei Meng, Jiawen Kang, Yuejiao Wang, Zengrui Jin, Xixin Wu, Xunying Liu, Helen Meng:
Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System. CoRR abs/2407.09817 (2024)
[i50]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-10376
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-10376
Yuejiao Wang, Xianmin Gong, Lingwei Meng, Xixin Wu, Helen Meng:
Large Language Model-based FMRI Encoding of Language Functions for Subjects with Neurocognitive Disorder. CoRR abs/2407.10376 (2024)
[i49]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-13509
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-13509
Weiqin Li, Peiji Yang, Yicheng Zhong, Yixuan Zhou, Zhisheng Wang, Zhiyong Wu, Xixin Wu, Helen Meng:
Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models. CoRR abs/2407.13509 (2024)
[i48]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-13893
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-13893
Dongchao Yang, Rongjie Huang, Yuanyuan Wang, Haohan Guo, Dading Chong, Songxiang Liu, Xixin Wu, Helen Meng:
SimpleSpeech 2: Towards Simple and Efficient Text-to-Speech with Flow-based Scalar Latent Transformer Diffusion Models. CoRR abs/2408.13893 (2024)
[i47]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-00933
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-00933
Haohan Guo, Fenglong Xie, Kun Xie, Dongchao Yang, Dake Guo, Xixin Wu, Helen Meng:
SoCodec: A Semantic-Ordered Multi-Stream Speech Codec for Efficient Language Model Based Text-to-Speech Synthesis. CoRR abs/2409.00933 (2024)
[i46]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-08596
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-08596
Lingwei Meng, Shujie Hu, Jiawen Kang, Zhaoqing Li, Yuejiao Wang, Wenxuan Wu, Xixin Wu, Xunying Liu, Helen Meng:
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions. CoRR abs/2409.08596 (2024)
[i45]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-11630
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-11630
Haohan Guo, Fenglong Xie, Dongchao Yang, Xixin Wu, Helen Meng:
Speaking from Coarse to Fine: Improving Neural Codec Language Model via Multi-Scale Speech Coding and Generation. CoRR abs/2409.11630 (2024)
[i44]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-12388
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-12388
Jiawen Kang, Lingwei Meng, Mingyu Cui, Yuejiao Wang, Xixin Wu, Xunying Liu, Helen Meng:
Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC. CoRR abs/2409.12388 (2024)
[i43]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-12560
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-12560
Yuanyuan Wang, Hangting Chen, Dongchao Yang, Zhiyong Wu, Helen Meng, Xixin Wu:
AudioComposer: Towards Fine-grained Audio Generation with Natural Language Descriptions. CoRR abs/2409.12560 (2024)
[i42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-16322
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-16322
Jiawen Kang, Dongrui Han, Lingwei Meng, Jingyan Zhou, Jinchao Li, Xixin Wu, Helen Meng:
Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech. CoRR abs/2409.16322 (2024)
[i41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-18786
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-18786
Siheng Li, Cheng Yang, Taiqiang Wu, Chufan Shi, Yuji Zhang, Xinyu Zhu, Zesen Cheng, Deng Cai, Mo Yu, Lemao Liu, Jie Zhou, Yujiu Yang, Ngai Wong, Xixin Wu, Wai Lam:
A Survey on the Honesty of Large Language Models. CoRR abs/2409.18786 (2024)
2023
[i40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-00836
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-00836
HoLam Chung, Junan Li, Pengfei Liu, Wai-Kim Leung, Xixin Wu, Helen Meng:
Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition. CoRR abs/2302.00836 (2023)
[i39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-09908
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-09908
Lingwei Meng, Jiawen Kang, Mingyu Cui, Yuejiao Wang, Xixin Wu, Helen Meng:
A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One. CoRR abs/2302.09908 (2023)
[i38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-08019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-08019
Jinchao Li, Kaitao Song, Junan Li, Bo Zheng, Dongsheng Li, Xixin Wu, Xunying Liu, Helen Meng:
Leveraging Pretrained Representations with Task-related Keywords for Alzheimer's Disease Detection. CoRR abs/2303.08019 (2023)
[i37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-08027
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-08027
Jinchao Li, Xixin Wu, Kaitao Song, Dongsheng Li, Xunying Liu, Helen Meng:
A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition. CoRR abs/2303.08027 (2023)
[i36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-03728
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-03728
Tianhua Zhang, Hongyin Luo, Yung-Sung Chuang, Wei Fang, Luc Gaitskell, Thomas Hartvigsen, Xixin Wu, Danny Fox, Helen Meng, James R. Glass:
Interpretable Unified Language Checking. CoRR abs/2304.03728 (2023)
[i35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-15225
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15225
Hongyin Luo, Yung-Sung Chuang, Yuan Gong, Tianhua Zhang, Yoon Kim, Xixin Wu, Danny Fox, Helen Meng, James R. Glass:
SAIL: Search-Augmented Instruction Learning. CoRR abs/2305.15225 (2023)
[i34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-16263
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-16263
Lingwei Meng, Jiawen Kang, Mingyu Cui, Haibin Wu, Xixin Wu, Helen Meng:
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator. CoRR abs/2305.16263 (2023)
[i33]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-16012
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-16012
Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Xixin Wu, Shiyin Kang, Helen Meng:
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis. CoRR abs/2307.16012 (2023)
[i32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-15399
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-15399
Jingyan Zhou, Minda Hu, Junan Li, Xiaoying Zhang, Xixin Wu, Irwin King, Helen Meng:
Rethinking Machine Ethics - Can LLMs Perform Moral Reasoning through the Lens of Moral Theories? CoRR abs/2308.15399 (2023)
[i31]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-16577
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-16577
Jie Chen, Changhe Song, Deyi Tuo, Xixin Wu, Shiyin Kang, Zhiyong Wu, Helen Meng:
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information. CoRR abs/2308.16577 (2023)
[i30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-00126
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-00126
Haohan Guo, Fenglong Xie, Jiawen Kang, Yujia Xiao, Xixin Wu, Helen Meng:
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning. CoRR abs/2309.00126 (2023)
[i29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-10814
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10814
Tianhua Zhang, Jiaxin Ge, Hongyin Luo, Yung-Sung Chuang, Mingye Gao, Yuan Gong, Xixin Wu, Yoon Kim, Helen Meng, James R. Glass:
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning. CoRR abs/2309.10814 (2023)
[i28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-11977
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-11977
Shun Lei, Yixuan Zhou, Liyang Chen, Dan Luo, Zhiyong Wu, Xixin Wu, Shiyin Kang, Tao Jiang, Yahui Zhou, Yuxing Han, Helen Meng:
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts. CoRR abs/2309.11977 (2023)
[i27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-00704
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00704
Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Xixin Wu, Zhou Zhao, Shinji Watanabe, Helen Meng:
UniAudio: An Audio Foundation Model Toward Universal Audio Generation. CoRR abs/2310.00704 (2023)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-15623
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-15623
Xiaohan Feng, Xixin Wu, Helen Meng:
Injecting linguistic knowledge into BERT for Dialogue State Tracking. CoRR abs/2311.15623 (2023)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-11858
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-11858
Boshi Tang, Zhiyong Wu, Xixin Wu, Qiaochu Huang, Jun Chen, Shun Lei, Helen Meng:
SimCalib: Graph Neural Network Calibration based on Similarity between Nodes. CoRR abs/2312.11858 (2023)
[i24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-12181
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-12181
Xueyuan Chen, Xi Wang, Shaofei Zhang, Lei He, Zhiyong Wu, Xixin Wu, Helen Meng:
StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis. CoRR abs/2312.12181 (2023)
2022
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-01986
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-01986
Naijun Zheng, Na Li, Xixin Wu, Lingwei Meng, Jiawen Kang, Haibin Wu, Chao Weng, Dan Su, Helen Meng:
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge. CoRR abs/2202.01986 (2022)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-09082
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-09082
Disong Wang, Songxiang Liu, Xixin Wu, Hui Lu, Lifa Sun, Xunying Liu, Helen Meng:
Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation. CoRR abs/2202.09082 (2022)
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-01080
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-01080
Haohan Guo, Hui Lu, Xixin Wu, Helen Meng:
A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS. CoRR abs/2203.01080 (2022)
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-04443
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-04443
Wen Wu, Chao Zhang, Xixin Wu, Philip C. Woodland:
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors. CoRR abs/2203.04443 (2022)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15377
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15377
Haibin Wu, Lingwei Meng, Jiawen Kang, Jinchao Li, Xu Li, Xixin Wu, Hung-yi Lee, Helen Meng:
Spoofing-Aware Speaker Verification by Multi-Level Fusion. CoRR abs/2203.15377 (2022)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-16928
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16928
Xixin Wu, Shoukang Hu, Zhiyong Wu, Xunying Liu, Helen Meng:
Neural Architecture Search for Speech Emotion Recognition. CoRR abs/2203.16928 (2022)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-09131
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-09131
Haibin Wu, Jiawen Kang, Lingwei Meng, Yang Zhang, Xixin Wu, Zhiyong Wu, Hung-yi Lee, Helen Meng:
Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion. CoRR abs/2206.09131 (2022)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-13758
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-13758
Yi Wang, Tianzi Wang, Zi Ye, Lingwei Meng, Shoukang Hu, Xixin Wu, Xunying Liu, Helen Meng:
Exploring linguistic feature and model combination for speech recognition based automatic AD detection. CoRR abs/2206.13758 (2022)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-10887
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-10887
Haohan Guo, Feng-Long Xie, Frank K. Soong, Xixin Wu, Helen Meng:
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS. CoRR abs/2209.10887 (2022)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-13771
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-13771
Hui Lu, Disong Wang, Xixin Wu, Zhiyong Wu, Xunying Liu, Helen Meng:
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE. CoRR abs/2210.13771 (2022)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15131
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15131
Haohan Guo, Fenglong Xie, Xixin Wu, Hui Lu, Helen Meng:
Towards High-Quality Neural TTS for Low-Resource Languages by Learning Compact Speech Representations. CoRR abs/2210.15131 (2022)
2021
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2101-05397
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-05397
Xixin Wu, Mark J. F. Gales:
Should Ensemble Members Be Calibrated? CoRR abs/2101.05397 (2021)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-01264
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-01264
Qingyun Dou, Yiting Lu, Potsawee Manakul, Xixin Wu, Mark J. F. Gales:
Attention Forcing for Machine Translation. CoRR abs/2104.01264 (2021)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-03298
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-03298
Hui Lu, Zhiyong Wu, Xixin Wu, Xu Li, Shiyin Kang, Xunying Liu, Helen Meng:
VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis. CoRR abs/2107.03298 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-08803
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-08803
Xu Li, Xixin Wu, Hui Lu, Xunying Liu, Helen Meng:
Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks. CoRR abs/2107.08803 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-04330
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-04330
Haibin Wu, Bo Zheng, Xu Li, Xixin Wu, Hung-yi Lee, Helen Meng:
Characterizing the adversarial vulnerability of speech self-supervised learning. CoRR abs/2111.04330 (2021)
2020
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-00205
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-00205
Xu Li, Xixin Wu, Xunying Liu, Helen Meng:
Deep segmental phonetic posterior-grams based discovery of non-categories in L2 English speech. CoRR abs/2002.00205 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2004-04014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-04014
Xu Li, Jinghua Zhong, Jianwei Yu, Shoukang Hu, Xixin Wu, Xunying Liu, Helen Meng:
Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification. CoRR abs/2004.04014 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2006-06186
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-06186
Xu Li, Na Li, Jinghua Zhong, Xixin Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng:
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification. CoRR abs/2006.06186 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2009-02725
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-02725
Songxiang Liu, Yuewen Cao, Disong Wang, Xixin Wu, Xunying Liu, Helen Meng:
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling. CoRR abs/2009.02725 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-01678
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-01678
Disong Wang, Songxiang Liu, Lifa Sun, Xixin Wu, Xunying Liu, Helen Meng:
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion. CoRR abs/2011.01678 (2020)
2019
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-01145
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-01145
Peng Liu, Xixin Wu, Shiyin Kang, Guangzhi Li, Dan Su, Dong Yu:
Maximizing Mutual Information for Tacotron. CoRR abs/1909.01145 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1911-03078
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-03078
Xu Li, Jinghua Zhong, Xixin Wu, Jianwei Yu, Xunying Liu, Helen Meng:
Adversarial Attacks on GMM i-vector based Speaker Verification Systems. CoRR abs/1911.03078 (2019)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.