


Остановите войну!
for scientists:


default search action
Chng Eng Siong
Engsiong Chng – Eng Siong Chng
Person information

- affiliation: Nanyang Technological University, Singapore
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [i65]Shangeth Rajaa, Kriti Anandan, Swaraj Dalmia, Tarun Gupta, Eng Siong Chng:
Improving Spoken Language Identification with Map-Mix. CoRR abs/2302.08229 (2023) - [i64]Alexey Sholokhov, Nikita Kuzmin, Kong Aik Lee, Eng Siong Chng:
Probabilistic Back-ends for Online Speaker Recognition and Clustering. CoRR abs/2302.09523 (2023) - [i63]Yuchen Hu, Chen Chen, Heqing Zou, Xionghu Zhong, Eng Siong Chng:
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation. CoRR abs/2302.11131 (2023) - [i62]Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition. CoRR abs/2302.11362 (2023) - [i61]Chen Chen, Yuchen Hu, Heqing Zou, Linhui Sun, Eng Siong Chng:
Unsupervised Noise adaptation using Data Simulation. CoRR abs/2302.11981 (2023) - [i60]Chen Chen, Yuchen Hu, Weiwei Weng, Eng Siong Chng:
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model. CoRR abs/2302.11989 (2023) - [i59]Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition. CoRR abs/2302.14597 (2023) - [i58]Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng:
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR. CoRR abs/2304.04974 (2023) - [i57]Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng, Bin Ma:
Contrastive Speech Mixup for Low-resource Keyword Spotting. CoRR abs/2305.01170 (2023) - [i56]Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng:
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. CoRR abs/2305.09212 (2023) - [i55]Heqing Zou, Meng Shen, Chen Chen, Yuchen Hu, Deepu Rajan, Eng Siong Chng:
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning. CoRR abs/2305.09299 (2023) - [i54]Zizheng Zhang, Chen Chen, Xiang Liu, Yuchen Hu, Eng Siong Chng:
Noise-aware Speech Separation with Contrastive Learning. CoRR abs/2305.10761 (2023) - [i53]Jia Qi Yip, Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention. CoRR abs/2305.12121 (2023) - [i52]Leander Melroy Maben, Zixun Guo, Chen Chen, Utkarsh Chudiwal, Chng Eng Siong:
Study of GANs for Noisy Speech Simulation from Clean Speech. CoRR abs/2305.12460 (2023) - 2022
- [j36]Hexin Liu
, Leibny Paola García-Perera
, Andy W. H. Khong
, Eng Siong Chng, Suzy J. Styles, Sanjeev Khudanpur
:
Efficient Self-Supervised Learning Representations for Spoken Language Identification. IEEE J. Sel. Top. Signal Process. 16(6): 1296-1307 (2022) - [j35]Lili Guo
, Longbiao Wang
, Jianwu Dang, Eng Siong Chng, Seiichi Nakagawa:
Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition. Speech Commun. 136: 118-127 (2022) - [c221]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning for On-Ddevice Environmental Sound Classification. DCASE 2022 - [c220]Dianwen Ng, Yunqi Chen, Biao Tian, Qiang Fu, Eng Siong Chng:
Convmixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-Field Keyword Spotting. ICASSP 2022: 3603-3607 - [c219]Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng:
Self-Critical Sequence Training for Automatic Speech Recognition. ICASSP 2022: 3688-3692 - [c218]Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng:
Noise-Robust Speech Recognition With 10 Minutes Unparalleled In-Domain Data. ICASSP 2022: 4298-4302 - [c217]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition. ICASSP 2022: 6292-6296 - [c216]Fuzhao Xue, Aixin Sun, Hao Zhang
, Jinjie Ni, Eng Siong Chng:
An Embarrassingly Simple Model for Dialogue Relation Extraction. ICASSP 2022: 6707-6711 - [c215]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. ICASSP 2022: 7287-7291 - [c214]Heqing Zou, Yuke Si, Chen Chen, Deepu Rajan, Eng Siong Chng:
Speech Emotion Recognition with Co-Attention Based Multi-Level Acoustic Information. ICASSP 2022: 7367-7371 - [c213]Andrew Koh, Fuzhao Xue, Chng Eng Siong:
Automated Audio Captioning Using Transfer Learning and Reconstruction Latent Space Similarity Regularization. ICASSP 2022: 7722-7726 - [c212]Yizhou Peng, Jicheng Zhang, Haihua Xu, Hao Huang, Eng Siong Chng:
Minimum Word Error Training For Non-Autoregressive Transformer-Based Code-Switching ASR. ICASSP 2022: 7807-7811 - [c211]Tarun Gupta, Duc-Tuan Truong, Tran The Anh, Eng Siong Chng:
Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model. INTERSPEECH 2022: 1978-1982 - [c210]Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng:
Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning. INTERSPEECH 2022: 2773-2777 - [c209]Yang Xiao, Nana Hou, Eng Siong Chng:
Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting. INTERSPEECH 2022: 3764-3768 - [c208]Zixun Guo, Chen Chen, Eng Siong Chng:
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition. INTERSPEECH 2022: 3799-3803 - [c207]Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu:
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results. ISCSLP 2022: 507-511 - [i51]Dianwen Ng, Yunqi Chen, Biao Tian, Qiang Fu, Eng Siong Chng:
ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword Spotting. CoRR abs/2201.05863 (2022) - [i50]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. CoRR abs/2202.09995 (2022) - [i49]Tarun Gupta, Duc-Tuan Truong, Tran The Anh, Chng Eng Siong:
Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model. CoRR abs/2203.11774 (2022) - [i48]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition. CoRR abs/2203.14838 (2022) - [i47]Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng:
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data. CoRR abs/2203.15321 (2022) - [i46]Heqing Zou, Yuke Si, Chen Chen, Deepu Rajan, Eng Siong Chng:
Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information. CoRR abs/2203.15326 (2022) - [i45]Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng:
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning. CoRR abs/2203.15526 (2022) - [i44]Yang Xiao, Nana Hou, Eng Siong Chng:
Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting. CoRR abs/2203.16361 (2022) - [i43]Dianwen Ng, Jin Hui Pang, Yang Xiao, Biao Tian, Qiang Fu, Eng Siong Chng:
Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness. CoRR abs/2204.05445 (2022) - [i42]Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng:
Self-critical Sequence Training for Automatic Speech Recognition. CoRR abs/2204.06260 (2022) - [i41]Andrew Koh, Soham Tiwari, Chng Eng Siong:
Automated Audio Captioning with Epochal Difficult Captions for Curriculum Learning. CoRR abs/2206.01918 (2022) - [i40]Andrew Koh, Eng Siong Chng:
Language-Based Audio Retrieval with Converging Tied Layers and Contrastive Loss. CoRR abs/2206.14659 (2022) - [i39]Yizhou Peng, Yufei Liu, Jicheng Zhang, Haihua Xu, Yi He, Hao Huang, Eng Siong Chng:
Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition. CoRR abs/2207.04176 (2022) - [i38]Jicheng Zhang, Yizhou Peng, Haihua Xu, Yi He, Eng Siong Chng, Hao Huang:
Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder. CoRR abs/2207.04177 (2022) - [i37]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning For On-Device Environmental Sound Classification. CoRR abs/2207.07429 (2022) - [i36]Zixun Guo, Chen Chen, Eng Siong Chng:
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition. CoRR abs/2208.00987 (2022) - [i35]Dianwen Ng, Jia Qi Yip, Tanmay Surana, Zhao Yang, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization. CoRR abs/2209.06360 (2022) - [i34]Yuhang Yang, Haihua Xu, Hao Huang, Eng Siong Chng, Sheng Li
:
Speech-text based multi-modal training with bidirectional attention for improved speech recognition. CoRR abs/2211.00325 (2022) - [i33]Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu:
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results. CoRR abs/2211.01585 (2022) - [i32]Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng:
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning. CoRR abs/2212.05301 (2022) - [i31]Abhinav Rao, Thi-Nga Ho, Eng Siong Chng:
Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin. CoRR abs/2212.05356 (2022) - 2021
- [c206]Fuzhao Xue, Aixin Sun, Hao Zhang
, Eng Siong Chng:
GDPNet: Refining Latent Multi-View Graph for Relation Extraction. AAAI 2021: 14194-14202 - [c205]Manav Kaushik, Van Tung Pham, Tran The Anh, Eng Siong Chng:
End-to-End Speaker Age and Height Estimation using Attention Mechanism and Triplet Loss. APSIPA ASC 2021: 1-8 - [c204]Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Multitask-based joint learning approach to robust ASR for radio communication speech. APSIPA ASC 2021: 497-502 - [c203]Chen Chen, Nana Hou, Duo Ma, Eng Siong Chng:
Time Domain Speech Enhancement With Attentive Multi-scale Approach. APSIPA ASC 2021: 679-683 - [c202]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named Entities for Improved Speech Recognition. APSIPA ASC 2021: 1021-1025 - [c201]Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Sheng Li, Eng Siong Chng:
Multilingual Approach to Joint Speech and Accent Recognition with DNN-HMM Framework. APSIPA ASC 2021: 1043-1048 - [c200]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. EMNLP (1) 2021: 9339-9349 - [c199]Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Learning Disentangled Feature Representations for Speech Enhancement Via Adversarial Training. ICASSP 2021: 666-670 - [c198]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
Multi-Stage Speaker Extraction with Utterance and Frame-Level Reference Signals. ICASSP 2021: 6109-6113 - [c197]Lili Guo, Longbiao Wang, Chenglin Xu, Jianwu Dang, Eng Siong Chng, Haizhou Li:
Representation Learning with Spectro-Temporal-Channel Attention for Speech Emotion Recognition. ICASSP 2021: 6304-6308 - [c196]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Preventing Early Endpointing for Online Automatic Speech Recognition. ICASSP 2021: 6813-6817 - [c195]Jicheng Zhang, Yizhou Peng, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
E2E-Based Multi-Task Learning Approach to Joint Speech and Accent Recognition. Interspeech 2021: 1519-1523 - [c194]Weiguang Chen, Van Tung Pham, Eng Siong Chng, Xionghu Zhong:
Overlapped Speech Detection Based on Spectral and Spatial Feature Fusion. Interspeech 2021: 4189-4193 - [c193]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. ISCSLP 2021: 1-5 - [c192]Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. ISCSLP 2021: 1-5 - [i30]Manav Kaushik, Van Tung Pham, Eng Siong Chng:
End-to-End Speaker Height and age estimation using Attention Mechanism with LSTM-RNN. CoRR abs/2101.05056 (2021) - [i29]Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech. CoRR abs/2107.10701 (2021) - [i28]Andrew Koh, Fuzhao Xue, Eng Siong Chng:
Automated Audio Captioning using Transfer Learning and Reconstruction Latent Space Similarity Regularization. CoRR abs/2108.04692 (2021) - [i27]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition. CoRR abs/2110.05267 (2021) - [i26]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. CoRR abs/2110.08545 (2021) - [i25]Shangeth Rajaa, Van Tung Pham, Chng Eng Siong:
Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling. CoRR abs/2110.13653 (2021) - 2020
- [j34]Chenglin Xu
, Wei Rao
, Eng Siong Chng
, Haizhou Li
:
SpEx: Multi-Scale Time Domain Speaker Extraction Network. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1370-1384 (2020) - [c191]Boon Peng Yap, Andrew Koh, Eng Siong Chng:
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences. EMNLP (Findings) 2020: 41-46 - [c190]Xiang Hao, Chenglin Xu, Nana Hou, Lei Xie, Eng Siong Chng, Haizhou Li:
Time-Domain Neural Network Approach for Speech Bandwidth Extension. ICASSP 2020: 866-870 - [c189]Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent Language Modeling Architecture for End-To-End ASR. ICASSP 2020: 7059-7063 - [c188]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Speech Transformer with Speaker Aware Persistent Memory. INTERSPEECH 2020: 1261-1265 - [c187]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
SpEx+: A Complete Time Domain Speaker Extraction Network. INTERSPEECH 2020: 1406-1410 - [c186]Haobo Zhang, Haihua Xu, Van Tung Pham, Hao Huang, Eng Siong Chng:
Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition. INTERSPEECH 2020: 2392-2396 - [c185]Nana Hou, Chenglin Xu, Van Tung Pham, Joey Tianyi Zhou, Eng Siong Chng, Haizhou Li:
Speaker and Phoneme-Aware Speech Bandwidth Extension with Residual Dual-Path Network. INTERSPEECH 2020: 4064-4068 - [c184]Nana Hou, Chenglin Xu, Joey Tianyi Zhou, Eng Siong Chng, Haizhou Li:
Multi-Task Learning for End-to-End Noise-Robust Bandwidth Extension. INTERSPEECH 2020: 4069-4073 - [c183]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Universal Speech Transformer. INTERSPEECH 2020: 5021-5025 - [c182]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Cross Attention with Monotonic Alignment for Speech Transformer. INTERSPEECH 2020: 5031-5035 - [i24]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
SpEx: Multi-Scale Time Domain Speaker Extraction Network. CoRR abs/2004.08326 (2020) - [i23]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Time-domain speaker extraction network. CoRR abs/2004.14762 (2020) - [i22]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
SpEx+: A Complete Time Domain Speaker Extraction Network. CoRR abs/2005.04686 (2020) - [i21]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. CoRR abs/2005.08742 (2020) - [i20]Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. CoRR abs/2005.10407 (2020) - [i19]Boon Peng Yap, Andrew Koh, Eng Siong Chng:
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences. CoRR abs/2009.11795 (2020) - [i18]Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Eng Siong Chng:
A multilingual approach to joint Speech and Accent Recognition with DNN-HMM framework. CoRR abs/2010.11483 (2020) - [i17]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance. CoRR abs/2010.12143 (2020) - [i16]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals. CoRR abs/2011.09624 (2020) - [i15]Fuzhao Xue, Aixin Sun, Hao Zhang, Eng Siong Chng:
GDPNet: Refining Latent Multi-View Graph for Relation Extraction. CoRR abs/2012.06780 (2020) - [i14]Fuzhao Xue, Aixin Sun, Hao Zhang, Eng Siong Chng:
An Embarrassingly Simple Model for Dialogue Relation Extraction. CoRR abs/2012.13873 (2020)
2010 – 2019
- 2019
- [c181]Thi-Ly Vu, Zhiping Zeng, Haihua Xu, Eng Siong Chng:
Audio Codec Simulation based Data Augmentation for Telephony Speech Recognition. APSIPA 2019: 198-203 - [c180]Karan Makhija, Thi-Nga Ho, Eng Siong Chng:
Transfer Learning for Punctuation Prediction. APSIPA 2019: 268-273 - [c179]Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Domain Adversarial Training for Speech Enhancement. APSIPA 2019: 667-672 - [c178]Duo Ma, Guanyu Li, Haihua Xu, Eng Siong Chng:
Improving code-switching speech recognition with data augmentation and system combination. APSIPA 2019: 1308-1312 - [c177]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Time-Domain Speaker Extraction Network. ASRU 2019: 327-334 - [c176]Chenglin Xu, Wei Rao, Eng Siong Chng
, Haizhou Li
:
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss. ICASSP 2019: 6990-6994 - [c175]Trang M. Nguyen, Van-Lien Tran, Duy-Cat Can, Quang-Thuy Ha
, Ly T. Vu, Engsiong Chng
:
QASA: Advanced Document Retriever for Open-Domain Question Answering by Learning to Rank Question-Aware Self-Attentive Document Representations. ICMLSC 2019: 221-225 - [c174]Xiaohai Tian, Eng Siong Chng
, Haizhou Li
:
A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data. INTERSPEECH 2019: 201-205 - [c173]Wei Rao, Chenglin Xu, Eng Siong Chng
, Haizhou Li
:
Target Speaker Extraction for Multi-Talker Speaker Verification. INTERSPEECH 2019: 1273-1277 - [c172]Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng
, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. INTERSPEECH 2019: 2160-2164 - [c171]Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng
, Haizhou Li
:
On the End-to-End Solution to Mandarin-English Code-Switching Speech Recognition. INTERSPEECH 2019: 2165-2169 - [c170]Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong Chng
:
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation. INTERSPEECH 2019: 3505-3509 - [c169]Thi-Ly Vu, Zin Tun Kyaw, Chng Eng Siong, Rafael E. Banchs:
Online FAQ Chatbot for Customer Support. IWSDS 2019: 251-259 - [i13]Wei Rao, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Target Speaker Extraction for Overlapped Multi-Talker Speaker Verification. CoRR abs/1902.02546 (2019) - [i12]Xiaohai Tian, Eng Siong Chng, Haizhou Li:
A Vocoder-free WaveNet Voice Conversion with Non-Parallel Data. CoRR abs/1902.03705 (2019) - [i11]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss. CoRR abs/1903.09952 (2019) - [i10]Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation. CoRR abs/1904.03799 (2019) - [i9]Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. CoRR abs/1904.03802 (2019) - [i8]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019)