


Chng Eng Siong
Engsiong Chng – Eng Siong Chng
Person information

- affiliation: Nanyang Technological University, Singapore
2020 – today
- 2022
- [j35]Lili Guo, Longbiao Wang, Jianwu Dang, Eng Siong Chng, Seiichi Nakagawa:
Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition. Speech Commun. 136: 118-127 (2022) - [i40]Dianwen Ng, Yunqi Chen, Biao Tian, Qiang Fu, Eng Siong Chng:
ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword Spotting. CoRR abs/2201.05863 (2022) - [i39]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. CoRR abs/2202.09995 (2022) - [i38]Tarun Gupta, Duc-Tuan Truong, Tran The Anh, Chng Eng Siong:
Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model. CoRR abs/2203.11774 (2022) - [i37]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition. CoRR abs/2203.14838 (2022) - [i36]Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng:
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data. CoRR abs/2203.15321 (2022) - [i35]Heqing Zou, Yuke Si, Chen Chen, Deepu Rajan, Eng Siong Chng:
Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information. CoRR abs/2203.15326 (2022) - [i34]Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng:
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning. CoRR abs/2203.15526 (2022) - [i33]Yang Xiao, Nana Hou, Eng Siong Chng:
Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting. CoRR abs/2203.16361 (2022) - [i32]Dianwen Ng, Jin Hui Pang, Yang Xiao, Biao Tian, Qiang Fu, Eng Siong Chng:
Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness. CoRR abs/2204.05445 (2022) - [i31]Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng:
Self-critical Sequence Training for Automatic Speech Recognition. CoRR abs/2204.06260 (2022) - 2021
- [c206]Fuzhao Xue, Aixin Sun, Hao Zhang, Eng Siong Chng:
GDPNet: Refining Latent Multi-View Graph for Relation Extraction. AAAI 2021: 14194-14202 - [c205]Manav Kaushik, Van Tung Pham, Tran The Anh, Eng Siong Chng:
End-to-End Speaker Age and Height Estimation using Attention Mechanism and Triplet Loss. APSIPA ASC 2021: 1-8 - [c204]Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Multitask-based joint learning approach to robust ASR for radio communication speech. APSIPA ASC 2021: 497-502 - [c203]Chen Chen, Nana Hou, Duo Ma, Eng Siong Chng:
Time Domain Speech Enhancement With Attentive Multi-scale Approach. APSIPA ASC 2021: 679-683 - [c202]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named Entities for Improved Speech Recognition. APSIPA ASC 2021: 1021-1025 - [c201]Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Sheng Li, Eng Siong Chng:
Multilingual Approach to Joint Speech and Accent Recognition with DNN-HMM Framework. APSIPA ASC 2021: 1043-1048 - [c200]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. EMNLP (1) 2021: 9339-9349 - [c199]Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Learning Disentangled Feature Representations for Speech Enhancement Via Adversarial Training. ICASSP 2021: 666-670 - [c198]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
Multi-Stage Speaker Extraction with Utterance and Frame-Level Reference Signals. ICASSP 2021: 6109-6113 - [c197]Lili Guo, Longbiao Wang, Chenglin Xu, Jianwu Dang, Eng Siong Chng, Haizhou Li:
Representation Learning with Spectro-Temporal-Channel Attention for Speech Emotion Recognition. ICASSP 2021: 6304-6308 - [c196]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Preventing Early Endpointing for Online Automatic Speech Recognition. ICASSP 2021: 6813-6817 - [c195]Jicheng Zhang, Yizhou Peng, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
E2E-Based Multi-Task Learning Approach to Joint Speech and Accent Recognition. Interspeech 2021: 1519-1523 - [c194]Weiguang Chen, Van Tung Pham, Eng Siong Chng, Xionghu Zhong:
Overlapped Speech Detection Based on Spectral and Spatial Feature Fusion. Interspeech 2021: 4189-4193 - [c193]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. ISCSLP 2021: 1-5 - [c192]Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. ISCSLP 2021: 1-5 - [i30]Manav Kaushik, Van Tung Pham, Eng Siong Chng:
End-to-End Speaker Height and age estimation using Attention Mechanism with LSTM-RNN. CoRR abs/2101.05056 (2021) - [i29]Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech. CoRR abs/2107.10701 (2021) - [i28]Andrew Koh, Fuzhao Xue, Eng Siong Chng:
Automated Audio Captioning using Transfer Learning and Reconstruction Latent Space Similarity Regularization. CoRR abs/2108.04692 (2021) - [i27]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition. CoRR abs/2110.05267 (2021) - [i26]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. CoRR abs/2110.08545 (2021) - [i25]Shangeth Rajaa, Van Tung Pham, Chng Eng Siong:
Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling. CoRR abs/2110.13653 (2021) - 2020
- [j34]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
SpEx: Multi-Scale Time Domain Speaker Extraction Network. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1370-1384 (2020) - [c191]Boon Peng Yap, Andrew Koh, Eng Siong Chng:
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences. EMNLP (Findings) 2020: 41-46 - [c190]Xiang Hao, Chenglin Xu, Nana Hou, Lei Xie, Eng Siong Chng, Haizhou Li:
Time-Domain Neural Network Approach for Speech Bandwidth Extension. ICASSP 2020: 866-870 - [c189]Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent Language Modeling Architecture for End-To-End ASR. ICASSP 2020: 7059-7063 - [c188]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Speech Transformer with Speaker Aware Persistent Memory. INTERSPEECH 2020: 1261-1265 - [c187]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
SpEx+: A Complete Time Domain Speaker Extraction Network. INTERSPEECH 2020: 1406-1410 - [c186]Haobo Zhang, Haihua Xu, Van Tung Pham, Hao Huang, Eng Siong Chng:
Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition. INTERSPEECH 2020: 2392-2396 - [c185]Nana Hou, Chenglin Xu, Van Tung Pham, Joey Tianyi Zhou, Eng Siong Chng, Haizhou Li:
Speaker and Phoneme-Aware Speech Bandwidth Extension with Residual Dual-Path Network. INTERSPEECH 2020: 4064-4068 - [c184]Nana Hou, Chenglin Xu, Joey Tianyi Zhou, Eng Siong Chng, Haizhou Li:
Multi-Task Learning for End-to-End Noise-Robust Bandwidth Extension. INTERSPEECH 2020: 4069-4073 - [c183]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Universal Speech Transformer. INTERSPEECH 2020: 5021-5025 - [c182]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Cross Attention with Monotonic Alignment for Speech Transformer. INTERSPEECH 2020: 5031-5035 - [i24]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
SpEx: Multi-Scale Time Domain Speaker Extraction Network. CoRR abs/2004.08326 (2020) - [i23]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Time-domain speaker extraction network. CoRR abs/2004.14762 (2020) - [i22]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
SpEx+: A Complete Time Domain Speaker Extraction Network. CoRR abs/2005.04686 (2020) - [i21]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. CoRR abs/2005.08742 (2020) - [i20]Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. CoRR abs/2005.10407 (2020) - [i19]Boon Peng Yap, Andrew Koh, Eng Siong Chng:
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences. CoRR abs/2009.11795 (2020) - [i18]Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Eng Siong Chng:
A multilingual approach to joint Speech and Accent Recognition with DNN-HMM framework. CoRR abs/2010.11483 (2020) - [i17]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance. CoRR abs/2010.12143 (2020) - [i16]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals. CoRR abs/2011.09624 (2020) - [i15]Fuzhao Xue, Aixin Sun, Hao Zhang, Eng Siong Chng:
GDPNet: Refining Latent Multi-View Graph for Relation Extraction. CoRR abs/2012.06780 (2020) - [i14]Fuzhao Xue, Aixin Sun, Hao Zhang, Eng Siong Chng:
An Embarrassingly Simple Model for Dialogue Relation Extraction. CoRR abs/2012.13873 (2020)
2010 – 2019
- 2019
- [c181]Thi-Ly Vu, Zhiping Zeng, Haihua Xu, Eng Siong Chng:
Audio Codec Simulation based Data Augmentation for Telephony Speech Recognition. APSIPA 2019: 198-203 - [c180]Karan Makhija, Thi-Nga Ho, Eng Siong Chng:
Transfer Learning for Punctuation Prediction. APSIPA 2019: 268-273 - [c179]Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Domain Adversarial Training for Speech Enhancement. APSIPA 2019: 667-672 - [c178]Duo Ma, Guanyu Li, Haihua Xu, Eng Siong Chng:
Improving code-switching speech recognition with data augmentation and system combination. APSIPA 2019: 1308-1312 - [c177]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Time-Domain Speaker Extraction Network. ASRU 2019: 327-334 - [c176]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss. ICASSP 2019: 6990-6994 - [c175]Trang M. Nguyen, Van-Lien Tran, Duy-Cat Can, Quang-Thuy Ha, Ly T. Vu, Engsiong Chng:
QASA: Advanced Document Retriever for Open-Domain Question Answering by Learning to Rank Question-Aware Self-Attentive Document Representations. ICMLSC 2019: 221-225 - [c174]Xiaohai Tian, Eng Siong Chng, Haizhou Li:
A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data. INTERSPEECH 2019: 201-205 - [c173]Wei Rao, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Target Speaker Extraction for Multi-Talker Speaker Verification. INTERSPEECH 2019: 1273-1277 - [c172]Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. INTERSPEECH 2019: 2160-2164 - [c171]Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng, Haizhou Li:
On the End-to-End Solution to Mandarin-English Code-Switching Speech Recognition. INTERSPEECH 2019: 2165-2169 - [c170]Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation. INTERSPEECH 2019: 3505-3509 - [c169]Thi-Ly Vu, Zin Tun Kyaw, Chng Eng Siong, Rafael E. Banchs:
Online FAQ Chatbot for Customer Support. IWSDS 2019: 251-259 - [i13]Wei Rao, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Target Speaker Extraction for Overlapped Multi-Talker Speaker Verification. CoRR abs/1902.02546 (2019) - [i12]Xiaohai Tian, Eng Siong Chng, Haizhou Li:
A Vocoder-free WaveNet Voice Conversion with Non-Parallel Data. CoRR abs/1902.03705 (2019) - [i11]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss. CoRR abs/1903.09952 (2019) - [i10]Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation. CoRR abs/1904.03799 (2019) - [i9]Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. CoRR abs/1904.03802 (2019) - [i8]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019) - [i7]Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent language modeling architecture for end-to-end ASR. CoRR abs/1912.00863 (2019) - 2018
- [j33]Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng:
Learning distributed sentence representations for story segmentation. Signal Process. 142: 403-411 (2018) - [j32]Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li:
Re-ranking spoken term detection with acoustic exemplars of keywords. Speech Commun. 104: 12-23 (2018) - [c168]Zhongwei Li, Xuancong Wang, AiTi Aw, Eng Siong Chng, Haizhou Li:
Named-Entity Tagging and Domain adaptation for Better Customized Translation. NEWS@ACL 2018: 41-46 - [c167]Duy-Cat Can, Thi-Nga Ho, Eng Siong Chng:
A Hybrid Deep Learning Architecture for Sentence Unit Detection. IALP 2018: 129-132 - [c166]Thi-Nga Ho, Duy-Cat Can, Engsiong Chng:
An Investigation of Word Embeddings with Deep Bidirectional LSTM for Sentence Unit Detection in Automatic Speech Transcription. IALP 2018: 139-142 - [c165]Chenglin Xu, Wei Rao, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Single Channel Speech Separation with Constrained Utterance Level Permutation Invariant Training Using Grid LSTM. ICASSP 2018: 6-10 - [c164]Qing Wang, Wei Rao, Sining Sun, Lei Xie, Eng Siong Chng, Haizhou Li:
Unsupervised Domain Adaptation via Domain Adversarial Training for Speaker Recognition. ICASSP 2018: 4889-4893 - [c163]Haihua Xu, Van Tung Pham, Zin Tun Kyaw, Zhi Hao Lim, Eng Siong Chng, Haizhou Li:
Mandarin-English Code-switching Speech Recognition. INTERSPEECH 2018: 554-555 - [c162]Pengcheng Guo, Haihua Xu, Lei Xie, Eng Siong Chng:
Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition. INTERSPEECH 2018: 1928-1932 - [c161]Yerbolat Khassanov, Eng Siong Chng:
Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR. INTERSPEECH 2018: 3343-3347 - [c160]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning. INTERSPEECH 2018: 3479-3483 - [c159]Xiaohai Tian, Junchao Wang, Haihua Xu, Eng Siong Chng, Haizhou Li:
Average Modeling Approach to Voice Conversion with Non-Parallel Data. Odyssey 2018: 227-232 - [i6]Pengcheng Guo, Haihua Xu, Lei Xie, Eng Siong Chng:
Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition. CoRR abs/1806.06200 (2018) - [i5]Yerbolat Khassanov, Eng Siong Chng:
Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR. CoRR abs/1806.10306 (2018) - [i4]Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng, Haizhou Li:
On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition. CoRR abs/1811.00241 (2018) - 2017
- [j31]Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng:
A hybrid neural network hidden Markov model approach for automatic story segmentation. J. Ambient Intell. Humaniz. Comput. 8(6): 925-936 (2017) - [j30]Xiaohai Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng, Haizhou Li:
An Exemplar-Based Approach to Frequency Warping for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1863-1876 (2017) - [c158]Yerbolat Khassanov, Tze Yuang Chong, Benjamin Bigot, Eng Siong Chng:
Unsupervised Language Model Adaptation by Data Selection for Speech Recognition. ACIIDS (1) 2017: 508-517 - [c157]Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng:
An end-to-end neural network approach to story segmentation. APSIPA 2017: 171-176 - [c156]Nancy F. Chen, Boon Pang Lim, Van Hai Do, Van Tung Pham, Chongjia Ni, Haihua Xu, Mark Hasegawa-Johnson, Wenda Chen, Xiong Xiao, Sunil Sivadas, Eng Siong Chng, Bin Ma, Haizhou Li:
Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory. APSIPA 2017: 1322-1327 - [c155]Zhi Hao Lim, Xiaohai Tian, Wei Rao, Eng Siong Chng:
An investigation of spectral feature partitioning for replay attacks detection. APSIPA 2017: 1570-1573 - [c154]Zhiping Zeng, Haihua Xu, Tze Yuang Chong, Eng Siong Chng, Haizhou Li:
Improving N-gram language modeling for code-switching speech recognition. APSIPA 2017: 1596-1601 - [c153]Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng:
Topic embedding of sentences for story segmentation. APSIPA 2017: 1602-1607 - [c152]Xiaohai Tian, Lei Meng, Siyuan Liu, Zhiqi Shen, Eng Siong Chng, Cyril Leung, Frank Guan, Chunyan Miao:
Novel Functional Technologies for Age-Friendly E-commerce. HCI (28) 2017: 150-158 - [c151]Nana Hou, Xiaohai Tian, Eng Siong Chng, Bin Ma, Haizhou Li:
Improving air traffic control speech intelligibility by reducing speaking rate effectively. IALP 2017: 197-200 - [c150]Grandee Lee, Thi-Nga Ho, Eng Siong Chng, Haizhou Li:
A review of the mandarin-english code-switching corpus: SEAME. IALP 2017: 210-213 - [c149]Zhongwei Li, Eng Siong Chng, Haizhou Li:
Named entity transliteration with sequence-to-sequence neural network. IALP 2017: 374-378 - [c148]Xiong Xiao, Shengkui Zhao, Douglas L. Jones, Eng Siong Chng, Haizhou Li:
On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition. ICASSP 2017: 3246-3250 - [c147]Lei Meng, Nguyen Quy Hy, Xiaohai Tian, Zhiqi Shen, Eng Siong Chng, Frank Yunqing Guan, Chunyan Miao, Cyril Leung:
Towards Age-friendly E-commerce Through Crowd-Improved Speech Recognition, Multimodal Search, and Personalized Speech Feedback. ICCSE 2017: 127-135 - [c146]Chenglin Xu, Xiong Xiao, Sining Sun, Wei Rao, Eng Siong Chng, Haizhou Li:
Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source. INTERSPEECH 2017: 1894-1898 - [c145]Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng:
Pruning Strategies for Partial Search in Spoken Term Detection. SoICT 2017: 114-119 - 2016
- [j29]Xiong Xiao, Shengkui Zhao, Duc Hoang Ha Nguyen, Xionghu Zhong, Douglas L. Jones, Eng Siong Chng, Haizhou Li:
Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation. EURASIP J. Adv. Signal Process. 2016: 4 (2016) - [j28]Nguyen Quy Hy, Siu Wa Lee, Xiaohai Tian, Minghui Dong, Eng Siong Chng:
High quality voice conversion using prosodic and high-resolution spectral features. Multim. Tools Appl. 75(9): 5265-5285 (2016) - [j27]Duc Hoang Ha Nguyen, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Feature Adaptation Using Linear Spectro-Temporal Transform for Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(6): 1006-1019 (2016) - [j26]Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng, Haizhou Li:
Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization. J. Signal Process. Syst. 82(2): 151-161 (2016) - [c144]Thi-Nga Ho, Tze Yuang Chong, Van Hai Do, Van Tung Pham, Eng Siong Chng:
Improving Efficiency of Sentence Boundary Detection by Feature Selection. ACIIDS (2) 2016: 594-603 - [c143]Su Jun Leow, Eng Siong Chng, Chin-Hui Lee:
Zero resource anti-spoofing detection for unit selection based synthetic speech using image spectrogram artifacts. APSIPA 2016: 1-6 - [c142]Xiaohai Tian, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing speech detection using temporal convolutional neural network. APSIPA 2016: 1-6 - [c141]Xiong Xiao, Shinji Watanabe, Eng Siong Chng, Haizhou Li:
Beamforming networks using spatial covariance features for far-field speech recognition. APSIPA 2016: 1-6 - [c140]Haihua Xu, Wei Rao, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li:
I-vector based deep neural network acoustic model adaptation using multilingual language resource. APSIPA 2016: 1-5 - [c139]Thanh T. Vu, Benjamin Bigot, Eng Siong Chng:
Combining non-negative matrix factorization and deep neural networks for speech enhancement and automatic speech recognition. ICASSP 2016: 499-503 - [c138]