default search action

combined dblp search
author search
venue search
publication search

ask others

Chng Eng Siong

Engsiong Chng – Eng Siong Chng

> Home > Persons

Person information

affiliation: Nanyang Technological University, Singapore

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j41]
- view
  authority control:
- export record
  dblp key:
  - journals/dsp/SunZGYLC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/dsp/SunZGYLC25
Linhui Sun, Xiaolong Zhou, Aifei Gong, Lei Ye, Pingan Li, Eng Siong Chng:
Noise-aware network with shared channel-attention encoder and joint constraint for noisy speech separation. Digit. Signal Process. 157: 104891 (2025)
[j40]
- view
  authority control:
- export record
  dblp key:
  - journals/jclc/DhingraAVCT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jclc/DhingraAVCT25
Priyanshu Dhingra, Satyam Agrawal, Chandra Sekar Veerappan, Eng Siong Chng, Rong Tong:
Leveraging Large Language Models for Speech De-Identification. Int. J. Asian Lang. Process. 35(1): 2450014:1-2450014:18 (2025)
[j39]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/ChenZYCZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ChenZYCZ25
Weiguang Chen, Junjie Zhang, Jielong Yang, Eng Siong Chng, Xionghu Zhong:
UniArray: Unified Spectral-Spatial Modeling for Array-Geometry-Agnostic Speech Separation. IEEE Signal Process. Lett. 32: 2164-2168 (2025)
[c311]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiYFC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiYFC25
Haoyang Li, Jia Qi Yip, Tianyu Fan, Eng Siong Chng:
Speech Enhancement Using Continuous Embeddings of Neural Audio Codec. ICASSP 2025: 1-5
[c310]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuongLZLC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuongLZLC25
Hieu-Thi Luong, Haoyang Li, Lin Zhang, Kong Aik Lee, Eng Siong Chng:
LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation. ICASSP 2025: 1-5
[c309]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Yuen0YCKC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Yuen0YCKC25
Kwok Chin Yuen, Sheng Li, Jia Qi Yip, Chenhui Chu, Tatsuya Kawahara, Eng Siong Chng:
Extending Whisper for Emotion Prediction Using Word-level Pseudo Labels. ICASSP 2025: 1-5
[c308]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/0075HWWCZYC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0075HWWCZYC25
Chen Chen, Yuchen Hu, Siyin Wang, Helin Wang, Zhehuai Chen, Chao Zhang, Chao-Han Huck Yang, Eng Siong Chng:
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators. ICLR 2025
[c307]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/YaoL0HCX25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YaoL0HCX25
Jixun Yao, Hexin Liu, Chen Chen, Yuchen Hu, Eng Siong Chng, Lei Xie:
GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling. ICLR 2025
[c306]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/XuQFCLMS25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/XuQFCLMS25
Yu Xu, Xiaokai Qin, Tianyu Fan, Eng Siong Chng, Sheng Li, Nobuaki Minematsu, Daisuke Saito:
Bandwidth Extension System for Throat Microphone Speech Reconstruction. ICMEW 2025: 1-2
[c305]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/ZouLZCR25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/ZouLZCR25
Heqing Zou, Fengmao Lv, Desheng Zheng, Eng Siong Chng, Deepu Rajan:
Large Language Models Meet Contrastive Learning: Zero-Shot Emotion Recognition Across Languages. ICME 2025: 1-6
[c304]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/Ng0CX0C25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Ng0CX0C25
Dianwen Ng, Kun Zhou, Yi-Wen Chao, Zhiwei Xiong, Bin Ma, Engsiong Chng:
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding. ICML 2025
[c303]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChaoPNMNCC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChaoPNMNCC25
Yi-Wen Chao, Yizhou Peng, Dianwen Ng, Yukun Ma, Chongjia Ni, Eng Siong Chng, Eng Siong Chng:
A-SMiLE: Affective Sparse Mixture-of-Experts Adapter with Multi-Task Learning for Spoken Dialogue Models. INTERSPEECH 2025
[c302]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChaoPNMNCC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChaoPNMNCC25
Yi-Wen Chao, Yizhou Peng, Dianwen Ng, Yukun Ma, Chongjia Ni, Eng Siong Chng, Eng Siong Chng:
A-SMiLE: Affective Sparse Mixture-of-Experts Adapter with Multi-Task Learning for Spoken Dialogue Models. INTERSPEECH 2025
[c301]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DaoVHAGYLCY25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DaoVHAGYLCY25
Alan Dao, Dinh Bach Vu, Huy Hoang Ha, Tuan Le Duc Anh, Shreyas Gopal, Yue Heng Yeo, Warren Keng Hoong Low, Eng Siong Chng, Jia Qi Yip:
Speechless: Speech Instruction Training Without Speech for Low Resource Languages. INTERSPEECH 2025
[c300]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GutierrezLWWCCL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GutierrezLWWCCL25
Fabian Ritter Gutierrez, Yi-Cheng Lin, Jui-Chiang Wei, Jeremy H. M. Wong, Eng Siong Chng, Nancy F. Chen, Hung-yi Lee:
Distilling a speech and music encoder with task arithmetic. INTERSPEECH 2025
[c299]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiHCSLC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiHCSLC25
Haoyang Li, Yuchen Hu, Chen Chen, Sabato Marco Siniscalchi, Songting Liu, Eng Siong Chng:
From KAN to GR-KAN: Advancing Speech Enhancement with KAN-Based Methodology. INTERSPEECH 2025
[c298]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Ng00C25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Ng00C25
Dianwen Ng, Kun Zhou, Bin Ma, Eng Siong Chng:
Thinking Fast and Slow: Robust Speech Recognition via Deep Filter-Tuning. INTERSPEECH 2025
[c297]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PengCNMN0C25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PengCNMN0C25
Yizhou Peng, Yi-Wen Chao, Dianwen Ng, Yukun Ma, Chongjia Ni, Bin Ma, Eng Siong Chng:
FD-Bench: A Full-Duplex Benchmarking Pipeline Designed for Full Duplex Spoken Dialogue Systems. INTERSPEECH 2025
[c296]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YaoLCX25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YaoLCX25
Jixun Yao, Hexin Liu, Eng Siong Chng, Lei Xie:
EASY: Emotion-aware Speaker Anonymization via Factorized Distillation. INTERSPEECH 2025
[c295]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuenYC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuenYC25
Kwok Chin Yuen, Jia Qi Yip, Eng Siong Chng:
Improving Synthetic Data Training for Contextual Biasing Models with a Keyword-Aware Cost Function. INTERSPEECH 2025
[c294]
- view
  authority control:
- export record
  dblp key:
  - conf/naacl/SureshWPC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/SureshWPC25
Sathya Krishnan Suresh, Mengjun Wu, Tushar Pranav, Engsiong Chng:
DiaSynth: Synthetic Dialogue Generation Framework for Low Resource Dialogue Applications. NAACL (Findings) 2025: 673-690
[i132]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-07246
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-07246
Ziyang Ma, Zhuo Chen, Yuping Wang, Eng Siong Chng, Xie Chen:
Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model. CoRR abs/2501.07246 (2025)
[i131]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-07875
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-07875
Kwok Chin Yuen, Jia Qi Yip, Eng Siong Chng:
Continual Learning with Embedding Layer Surgery and Task-wise Beam Search using Whisper. CoRR abs/2501.07875 (2025)
[i130]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-17202
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-17202
Chen Chen, Yuchen Hu, Siyin Wang, Helin Wang, Zhehuai Chen, Chao Zhang, Chao-Han Huck Yang, Eng Siong Chng:
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators. CoRR abs/2501.17202 (2025)
[i129]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-02942
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-02942
Jixun Yao, Hexin Liu, Chen Chen, Yuchen Hu, Chng Eng Siong, Lei Xie:
GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling. CoRR abs/2502.02942 (2025)
[i128]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-16240
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-16240
Haoyang Li, Jia Qi Yip, Tianyu Fan, Eng Siong Chng:
Speech Enhancement Using Continuous Embeddings of Neural Audio Codec. CoRR abs/2502.16240 (2025)
[i127]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-05110
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-05110
Weiguang Chen, Junjie Zhang, Jielong Yang, Eng Siong Chng, Xionghu Zhong:
UniArray: Unified Spectral-Spatial Modeling for Array-Geometry-Agnostic Speech Separation. CoRR abs/2503.05110 (2025)
[i126]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-21806
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-21806
Heqing Zou, Fengmao Lv, Desheng Zheng, Eng Siong Chng, Deepu Rajan:
Large Language Models Meet Contrastive Learning: Zero-Shot Emotion Recognition Across Languages. CoRR abs/2503.21806 (2025)
[i125]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-07235
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-07235
Dianwen Ng, Kun Zhou, Yi-Wen Chao, Zhiwei Xiong, Bin Ma, Eng Siong Chng:
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding. CoRR abs/2505.07235 (2025)
[i124]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-13032
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-13032
Ziyang Ma, Yinghao Ma, Yanqiao Zhu, Chen Yang, Yi-Wen Chao, Ruiyang Xu, Wenxi Chen, Yuanzhe Chen, Zhuo Chen, Jian Cong, Kai Li, Keliang Li, Siyou Li, Xinfeng Li, Xiquan Li, Zheng Lian, Yuzhe Liang, Minghao Liu, Zhikang Niu, Tianrui Wang, Yuping Wang, Yuxuan Wang, Yihao Wu, Guanrou Yang, Jianwei Yu, Ruibin Yuan, Zhisheng Zheng, Ziya Zhou, Haina Zhu, Wei Xue, Emmanouil Benetos, Kai Yu, Chng Eng Siong, Xie Chen:
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix. CoRR abs/2505.13032 (2025)
[i123]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-13270
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-13270
Fabian Ritter Gutierrez, Yi-Cheng Lin, Jui-Chiang Wei, Jeremy H. M. Wong, Eng Siong Chng, Nancy F. Chen, Hung-yi Lee:
Distilling a speech and music encoder with task arithmetic. CoRR abs/2505.13270 (2025)
[i122]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-13559
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-13559
Sathya Krishnan Suresh, Tanmay Surana, Lim Zhi Hao, Eng Siong Chng:
CS-Sum: A Benchmark for Code-Switching Dialogue Summarization and the Limits of Large Language Models. CoRR abs/2505.13559 (2025)
[i121]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-15004
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-15004
Jixun Yao, Hexin Liu, Eng Siong Chng, Lei Xie:
EASY: Emotion-aware Speaker Anonymization via Factorized Distillation. CoRR abs/2505.15004 (2025)
[i120]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-17076
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-17076
Haoyang Zhang, Hexin Liu, Xiangyu Zhang, Qiquan Zhang, Yuchen Hu, Junqi Zhao, Fei Tian, Xuerui Yang, Eng Siong Chng:
Impact of Frame Rates on Speech Tokenizer: A Case Study on Mandarin and English. CoRR abs/2505.17076 (2025)
[i119]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-17417
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-17417
Alan Dao, Dinh Bach Vu, Huy Hoang Ha, Tuan Le Duc Anh, Shreyas Gopal, Yue Heng Yeo, Warren Keng Hoong Low, Eng Siong Chng, Jia Qi Yip:
Speechless: Speech Instruction Training Without Speech for Low Resource Languages. CoRR abs/2505.17417 (2025)
[i118]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-11403
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-11403
Fabian Ritter Gutierrez, Yi-Cheng Lin, Jeremy H. M. Wong, Hung-yi Lee, Eng Siong Chng, Nancy F. Chen:
A correlation-permutation approach for speech-music encoders model merging. CoRR abs/2506.11403 (2025)
[i117]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-13339
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-13339
Yizhou Peng, Bin Wang, Yi-Wen Chao, Ziyang Ma, Haoyang Zhang, Hexin Liu, Xie Chen, Eng Siong Chng:
NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025. CoRR abs/2506.13339 (2025)
[i116]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-13396
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-13396
Yizhou Peng, Hexin Liu, Eng Siong Chng:
Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR. CoRR abs/2506.13396 (2025)
[i115]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-03468
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-03468
Hieu-Thi Luong, Inbal Rimon, Haim H. Permuter, Kong Aik Lee, Eng Siong Chng:
Robust Localization of Partially Fake Speech: Metrics, Models, and Out-of-Domain Evaluation. CoRR abs/2507.03468 (2025)
[i114]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-09929
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-09929
Haoyang Li, Nana Hou, Yuchen Hu, Jixun Yao, Sabato Marco Siniscalchi, Eng Siong Chng:
Aligning Generative Speech Enhancement with Human Preferences via Direct Preference Optimization. CoRR abs/2507.09929 (2025)
[i113]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-19040
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-19040
Yizhou Peng, Yi-Wen Chao, Dianwen Ng, Yukun Ma, Chongjia Ni, Bin Ma, Eng Siong Chng:
FD-Bench: A Full-Duplex Benchmarking Pipeline Designed for Full Duplex Spoken Dialogue Systems. CoRR abs/2507.19040 (2025)
[i112]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-17796
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-17796
Changsong Liu, Yizhou Peng, Eng Siong Chng:
Zero-shot Context Biasing with Trie-based Decoding using Synthetic Multi-Pronunciation. CoRR abs/2508.17796 (2025)
[i111]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-02771
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-02771
Nirmalya Mallick Thakur, Jia Qi Yip, Eng Siong Chng:
Analysis of Speaker Verification Performance Trade-offs with Neural Audio Codec Transmission. CoRR abs/2509.02771 (2025)
[i110]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-09197
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-09197
Kwok Chin Yuen, Jia Qi Yip, Eng Siong Chng:
Improving Synthetic Data Training for Contextual Biasing Models with a Keyword-Aware Cost Function. CoRR abs/2509.09197 (2025)
[i109]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-13785
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-13785
Bingshen Mu, Pengcheng Guo, Zhaokai Sun, Shuai Wang, Hexin Liu, Mingchen Shao, Lei Xie, Eng Siong Chng, Longshuai Xiao, Qiangze Feng, Daliang Wang:
Summary on The Multilingual Conversational Speech Language Model Challenge: Datasets, Tasks, Baselines, and Methods. CoRR abs/2509.13785 (2025)
[i108]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-20679
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-20679
Duc-Tuan Truong, Tianchi Liu, Ruijie Tao, Junjie Li, Kong Aik Lee, Eng Siong Chng:
QAMO: Quality-aware Multi-centroid One-class Learning For Speech Deepfake Detection. CoRR abs/2509.20679 (2025)
[i107]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-20682
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-20682
Duc-Tuan Truong, Tianchi Liu, Junjie Li, Ruijie Tao, Kong Aik Lee, Eng Siong Chng:
Addressing Gradient Misalignment in Data-Augmented Training for Robust Speech Deepfake Detection. CoRR abs/2509.20682 (2025)
[i106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-24629
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-24629
Tianrui Wang, Haoyu Wang, Meng Ge, Cheng Gong, Chunyu Qiang, Ziyang Ma, Zikang Huang, Guanrou Yang, Xiaobao Wang, Eng Siong Chng, Xie Chen, Longbiao Wang, Jianwu Dang:
Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis. CoRR abs/2509.24629 (2025)
[i105]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-05150
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-05150
Donghang Wu, Haoyang Zhang, Chen Chen, Tianyu Zhang, Fei Tian, Xuerui Yang, Gang Yu, Hexin Liu, Nana Hou, Yuchen Hu, Eng Siong Chng:
Chronological Thinking in Full-Duplex Spoken Dialogue Language Models. CoRR abs/2510.05150 (2025)
[i104]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-08593
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-08593
Yuxin Li, Eng Siong Chng, Cuntai Guan:
Hierarchical Self-Supervised Representation Learning for Depression Detection from Speech. CoRR abs/2510.08593 (2025)
[i103]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-09592
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-09592
Donghang Wu, Haoyang Zhang, Jun Chen, Xiangyu Tony Zhang, Hexin Liu, Eng Siong Chng, Fei Tian, Xuerui Yang, Xiangyu Zhang, Daxin Jiang, Gang Yu:
Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models. CoRR abs/2510.09592 (2025)
[i102]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-12720
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-12720
Ziyang Ma, Ruiyang Xu, Zhenghao Xing, Yunfei Chu, Yuxuan Wang, Jinzheng He, Jin Xu, Pheng-Ann Heng, Kai Yu, Junyang Lin, Eng Siong Chng, Xie Chen:
Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception. CoRR abs/2510.12720 (2025)
[i101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-25150
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-25150
Shreyas Gopal, Ashutosh Anshul, Haoyang Li, Yue Heng Yeo, Hexin Liu, Eng Siong Chng:
Explainable Disentanglement on Discrete Speech Representations for Noise-Robust ASR. CoRR abs/2510.25150 (2025)
2024
[j38]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/HuCZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HuCZC24
Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng:
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1145-1156 (2024)
[j37]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SunYGYC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SunYGYC24
Linhui Sun, Shuo Yuan, Aifei Gong, Lei Ye, Eng Siong Chng:
Dual-Branch Modeling Based on State-Space Model for Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1457-1467 (2024)
[c293]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/Hu0Y0ZCC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Hu0Y0ZCC24
Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, Engsiong Chng:
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators. ACL (1) 2024: 74-90
[c292]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/Hu0QZC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Hu0QZC024
Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, Engsiong Chng, Ruizhe Li:
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models. ACL (Findings) 2024: 666-679
[c291]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/PengC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/PengC24
Yizhou Peng, Eng Siong Chng:
Optimizing Multi-Speaker Speech Recognition with Online Decoding and Data Augmentation. APSIPA 2024: 1-6
[c290]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YangP0CZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YangP0CZ24
Yuhang Yang, Yizhou Peng, Hao Huang, Eng Siong Chng, Xionghu Zhong:
Adapting OpenAI's Whisper for Speech Recognition on Code-Switch Mandarin-English SEAME and ASRU2019 Datasets. APSIPA 2024: 1-6
[c289]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YipY0C24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YipY0C24
Jia Qi Yip, Kwok Chin Yuen, Bin Ma, Engsiong Chng:
Speech Separation using Neural Audio Codecs with Embedding Loss. APSIPA 2024: 1-6
[c288]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/Yuen0YC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/Yuen0YC24
Kwok Chin Yuen, Sheng Li, Jia Qi Yip, Engsiong Chng:
Low-resource Language Adaptation with Ensemble of PEFT Approaches. APSIPA 2024: 1-6
[c287]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZhangLLZMGCY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhangLLZMGCY24
Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola García-Perera, Engsiong Chng, Lina Yao:
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model. EMNLP 2024: 159-171
[c286]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/DhingraAVHCT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/DhingraAVHCT24
Priyanshu Dhingra, Satyam Agrawal, Chandra Sekar Veerappan, Thi-Nga Ho, Eng Siong Chng, Rong Tong:
Speech de-identification data augmentation leveraging large language model. IALP 2024: 97-102
[c285]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/YuenYC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/YuenYC24
Kwok Chin Yuen, Jia Qi Yip, Eng Siong Chng:
Low Resource Language Adaptation using Two-stage Regularization for Multilingual ASR. IALP 2024: 332-337
[c284]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangGLZSXCZBXZCWWCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangGLZSXCZBXZCWWCL24
He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li:
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge. ICASSP Workshops 2024: 63-64
[c283]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YipZMNZ000NC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YipZMNZ000NC024
Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance. ICASSP 2024: 326-330
[c282]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GutierrezHNWLCC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GutierrezHNWLCC24
Fabian Ritter Gutierrez, Kuan-Po Huang, Dianwen Ng, Jeremy H. M. Wong, Hung-Yi Lee, Eng Siong Chng, Nancy F. Chen:
Noise Robust Distillation of Self-Supervised Speech Models via Correlation Metrics. ICASSP Workshops 2024: 495-499
[c281]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangCCLHC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangCCLHC24
Zizheng Zhang, Chen Chen, Hsin-Hung Chen, Xiang Liu, Yuchen Hu, Eng Siong Chng:
Noise-Aware Speech Separation with Contrastive Learning. ICASSP 2024: 1381-1385
[c280]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Zou0H0CR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Zou0H0CR24
Heqing Zou, Meng Shen, Yuchen Hu, Chen Chen, Eng Siong Chng, Deepu Rajan:
Cross-Modality and Within-Modality Regularization for Audio-Visual Deepfake Detection. ICASSP 2024: 4900-4904
[c279]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TruongTYLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TruongTYLC24
Duc-Tuan Truong, Ruijie Tao, Jia Qi Yip, Kong Aik Lee, Eng Siong Chng:
Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification. ICASSP 2024: 10336-10340
[c278]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NgZZMGNNZC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NgZZMGNNZC024
Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
Are Soft Prompts Good Zero-Shot Learners for Speech Recognition? ICASSP 2024: 10366-10370
[c277]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenAZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenAZC24
Weiguang Chen, Tran The Anh, Xionghu Zhong, Eng Siong Chng:
Enhancing Low-Latency Speaker Diarization with Spatial Dictionary Learning. ICASSP 2024: 11371-11375
[c276]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/00750HSCCY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/00750HSCCY24
Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Engsiong Chng, Chao-Han Huck Yang:
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition. ICLR 2024
[c275]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/Hu0Y00CC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Hu0Y00CC24
Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, Engsiong Chng:
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition. ICLR 2024
[c274]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GutierrezHWNLCC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GutierrezHWNLCC24
Fabian Ritter Gutierrez, Kuan-Po Huang, Jeremy H. M. Wong, Dianwen Ng, Hung-yi Lee, Nancy F. Chen, Eng Siong Chng:
Dataset-Distillation Generative Model for Speech Emotion Recognition. INTERSPEECH 2024
[c273]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Hu0LZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Hu0LZC24
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Noise-aware Speech Enhancement using Diffusion Probabilistic Model. INTERSPEECH 2024
[c272]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiCYCCK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiCYCCK24
Sheng Li, Chen Chen, Kwok Chin Yuen, Chenhui Chu, Eng Siong Chng, Hisashi Kawai:
Investigating ASR Error Correction with Large Language Model and Multilingual 1-best Hypotheses. INTERSPEECH 2024
[c271]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TruongTNLLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TruongTNLLC24
Duc-Tuan Truong, Ruijie Tao, Tuan Nguyen, Hieu-Thi Luong, Kong Aik Lee, Eng Siong Chng:
Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection. INTERSPEECH 2024
[c270]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YipZNC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YipZNC024
Jia Qi Yip, Shengkui Zhao, Dianwen Ng, Eng Siong Chng, Bin Ma:
Towards Audio Codec-based Speech Separation. INTERSPEECH 2024
[c269]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuenYC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuenYC24
Kwok Chin Yuen, Jia Qi Yip, Eng Siong Chng:
Continual Learning Optimizations for Auto-regressive Decoder of Multilingual ASR systems. INTERSPEECH 2024
[c268]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/YangPCZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/YangPCZ24
Yuhang Yang, Yizhou Peng, Eng Siong Chng, Xionghu Zhong:
Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs. ISCSLP 2024: 646-650
[c267]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TaoSJTCA024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TaoSJTCA024
Ruijie Tao, Zhan Shi, Yidi Jiang, Duc-Tuan Truong, Eng Siong Chng, Massimo Alioto, Haizhou Li:
Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization. ACM Multimedia 2024: 11342-11347
[c266]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/HuCYQCCZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HuCYQCCZ24
Yuchen Hu, Chen Chen, Chao-Han Yang, Chengwei Qin, Pin-Yu Chen, Engsiong Chng, Chao Zhang:
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models. NeurIPS 2024
[c265]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/YuenYC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/YuenYC24
Kwok Chin Yuen, Jia Qi Yip, Eng Siong Chng:
Continual Learning With Embedding Layer Surgery and Task-Wise Beam Search Using Whisper. SLT 2024: 140-146
[c264]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/YangPGLCLCHDZZCTBGSCBLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/YangPGLCLCHDZZCTBGSCBLW24
Chao-Han Huck Yang, Taejin Park, Yuan Gong, Yuanchao Li, Zhehuai Chen, Yen-Ting Lin, Chen Chen, Yuchen Hu, Kunal Dhawan, Piotr Zelasko, Chao Zhang, Yun-Nung Chen, Yu Tsao, Jagadeesh Balam, Boris Ginsburg, Sabato Marco Siniscalchi, Eng Siong Chng, Peter Bell, Catherine Lai, Shinji Watanabe, Andreas Stolcke:
Large Language Model Based Generative Error Correction: A Challenge and Baselines For Speech Recognition, Speaker Tagging, and Emotion Recognition. SLT 2024: 371-378
[c263]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LuongTLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LuongTLC24
Hieu-Thi Luong, Duc-Tuan Truong, Kong Aik Lee, Eng Siong Chng:
Room Impulse Responses Help Attackers to Evade Deep Fake Detection. SLT 2024: 623-629
[c262]
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/YuenYC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/YuenYC24
Kwok Chin Yuen, Jia Qi Yip, Eng Siong Chng:
Improved Alignment for Score Combination of RNN-T and CTC Decoder for Online Decoding. TSD (2) 2024: 70-80
[i100]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-03473
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-03473
He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li:
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge. CoRR abs/2401.03473 (2024)
[i99]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-05746
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-05746
Heqing Zou, Meng Shen, Yuchen Hu, Chen Chen, Eng Siong Chng, Deepu Rajan:
Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection. CoRR abs/2401.05746 (2024)
[i98]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-10446
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-10446
Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, Eng Siong Chng:
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition. CoRR abs/2401.10446 (2024)
[i97]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-05457
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-05457
Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng, Chao-Han Huck Yang:
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition. CoRR abs/2402.05457 (2024)
[i96]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-06894
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-06894
Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, Eng Siong Chng:
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators. CoRR abs/2402.06894 (2024)
[i95]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-10642
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-10642
Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola García, Eng Siong Chng, Lina Yao:
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model. CoRR abs/2402.10642 (2024)
[i94]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-10025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-10025
Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng, Ruizhe Li:
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models. CoRR abs/2405.10025 (2024)
[i93]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-14161
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-14161
Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Chengwei Qin, Pin-Yu Chen, Eng Siong Chng, Chao Zhang:
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models. CoRR abs/2405.14161 (2024)
[i92]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-00654
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-00654
Chen Chen, Yuchen Hu, Wen Wu, Helin Wang, Eng Siong Chng, Chao Zhang:
Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback. CoRR abs/2406.00654 (2024)
[i91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02963
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02963
Fabian Ritter Gutierrez, Kuan-Po Huang, Jeremy H. M. Wong, Dianwen Ng, Hung-yi Lee, Nancy F. Chen, Eng Siong Chng:
Dataset-Distillation Generative Model for Speech Emotion Recognition. CoRR abs/2406.02963 (2024)
[i90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-12434
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-12434
Jia Qi Yip, Shengkui Zhao, Dianwen Ng, Eng Siong Chng, Bin Ma:
Towards Audio Codec-based Speech Separation. CoRR abs/2406.12434 (2024)
[i89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-17376
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-17376
Duc-Tuan Truong, Ruijie Tao, Tuan Nguyen, Hieu-Thi Luong, Kong Aik Lee, Eng Siong Chng:
Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection. CoRR abs/2406.17376 (2024)
[i88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-02243
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-02243
Yuchen Hu, Chen Chen, Siyin Wang, Eng Siong Chng, Chao Zhang:
Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization. CoRR abs/2407.02243 (2024)
[i87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-03645
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-03645
Kwok Chin Yuen, Jia Qi Yip, Eng Siong Chng:
Continual Learning Optimizations for Auto-regressive Decoder of Multilingual ASR systems. CoRR abs/2407.03645 (2024)
[i86]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-14841
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-14841
Bo Han, Heqing Zou, Haoyang Li, Guangcong Wang, Chng Eng Siong:
Text-based Talking Video Editing with Cascaded Conditional Diffusion. CoRR abs/2407.14841 (2024)
[i85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-09785
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-09785
Chao-Han Huck Yang, Taejin Park, Yuan Gong, Yuanchao Li, Zhehuai Chen, Yen-Ting Lin, Chen Chen, Yuchen Hu, Kunal Dhawan, Piotr Zelasko, Chao Zhang, Yun-Nung Chen, Yu Tsao, Jagadeesh Balam, Boris Ginsburg, Sabato Marco Siniscalchi, Eng Siong Chng, Peter Bell, Catherine Lai, Shinji Watanabe, Andreas Stolcke:
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition. CoRR abs/2409.09785 (2024)
[i84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-14712
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-14712
Hieu-Thi Luong, Duc-Tuan Truong, Kong Aik Lee, Eng Siong Chng:
Room Impulse Responses help attackers to evade Deep Fake Detection. CoRR abs/2409.14712 (2024)
[i83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-14743
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-14743
Hieu-Thi Luong, Haoyang Li, Lin Zhang, Kong Aik Lee, Eng Siong Chng:
LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation. CoRR abs/2409.14743 (2024)
[i82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-16005
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-16005
Yuhang Yang, Yizhou Peng, Eng Siong Chng, Xionghu Zhong:
Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs. CoRR abs/2409.16005 (2024)
[i81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-19020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-19020
Sathya Krishnan Suresh, Mengjun Wu, Tushar Pranav, Eng Siong Chng:
DiaSynth - Synthetic Dialogue Generation Framework. CoRR abs/2409.19020 (2024)
[i80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-02371
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-02371
Nikita Kuzmin, Hieu-Thi Luong, Jixun Yao, Lei Xie, Kong Aik Lee, Eng Siong Chng:
NTU-NPU System for Voice Privacy 2024 Challenge. CoRR abs/2410.02371 (2024)
[i79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-19770
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-19770
Haorui He, Yuchen Song, Yuancheng Wang, Haoyang Li, Xueyao Zhang, Li Wang, Gongping Huang, Eng Siong Chng, Zhizheng Wu:
Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities. CoRR abs/2411.19770 (2024)
[i78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-17778
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-17778
Haoyang Li, Yuchen Hu, Chen Chen, Eng Siong Chng:
An Investigation on the Potential of KAN in Speech Enhancement. CoRR abs/2412.17778 (2024)
2023
[c261]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ChenHZZZC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ChenHZZZC23
Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng:
Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning. AAAI 2023: 12607-12615
[c260]
- view
  authority control:
- export record
  dblp key:
  - conf/aciids/LiuHC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aciids/LiuHC23
Changsong Liu, Thi-Nga Ho, Eng Siong Chng:
An Empirical Study on Punctuation Restoration for English, Mandarin, and Code-Switching Speech. ACIIDS (2) 2023: 286-296
[c259]
- view
  authority control:
- export record
  dblp key:
  - conf/aciids/PrachasereeGHPTCC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aciids/PrachasereeGHPTCC23
Chaiyasait Prachaseree, Kshitij Gupta, Thi-Nga Ho, Yizhou Peng, Kyaw Zin Tun, Eng Siong Chng, G. S. S. Chalapthi:
Adapting Code-Switching Language Models with Statistical-Based Text Augmentation. ACIIDS (2) 2023: 310-322
[c258]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ZouSCHRC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZouSCHRC23
Heqing Zou, Meng Shen, Chen Chen, Yuchen Hu, Deepu Rajan, Eng Siong Chng:
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning. ACL (Findings) 2023: 659-672
[c257]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HuCLZC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuCLZC23
Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng:
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition. ACL (1) 2023: 11610-11625
[c256]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HuLCQZC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuLCQZC23
Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiu-Shi Zhu, Eng Siong Chng:
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition. ACL (1) 2023: 15213-15232
[c255]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/JiangHC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/JiangHC23
Yufei Jiang, Thi-Nga Ho, Eng Siong Chng:
Adopting Neural Translation Model in Data Generation for Inverse Text Normalization. APSIPA ASC 2023: 38-45
[c254]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/MabenGCCS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/MabenGCCS23
Leander Melroy Maben, Zixun Guo, Chen Chen, Utkarsh Chudiwal, Chng Eng Siong:
Study of Generative Adversarial Networks for Noisy Speech Simulation from Clean Speech. APSIPA ASC 2023: 1143-1149
[c253]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YuenLS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YuenLS23
Kwok Chin Yuen, Haoyang Li, Chng Eng Siong:
ASR Model Adaptation for Rare Words Using Synthetic Data Generated by Multiple Text-To-Speech Systems. APSIPA ASC 2023: 1771-1778
[c252]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YipNMS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YipNMS23
Jia Qi Yip, Dianwen Ng, Bin Ma, Chng Eng Siong:
Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures. APSIPA ASC 2023: 2002-2007
[c251]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/SuranaHTC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/SuranaHTC23
Tanmay Surana, Thi-Nga Ho, Kyaw Zin Tun, Eng Siong Chng:
CASSI: Contextual and Semantic Structure-based Interpolation Augmentation for Low-Resource NER. EMNLP (Findings) 2023: 9729-9742
[c250]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/GuptaPHTKTCG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/GuptaPHTKTCG23
Kshitij Gupta, Chaiyasait Prachaseree, Thi-Nga Ho, Kyaw Zin Tun, Jia Xin Koh, Ying Ying Tan, Eng Siong Chng, Chalapathi GSS:
Singaporean Conversational English-Malay Code-Switching Speech: An Analysis Based on Code-switching Points and Part -of-Speech. IALP 2023: 95-99
[c249]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenHWC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenHWC23
Chen Chen, Yuchen Hu, Weiwei Weng, Eng Siong Chng:
Metric-Oriented Speech Enhancement Using Diffusion Probabilistic Model. ICASSP 2023: 1-5
[c248]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenHZSC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenHZSC23
Chen Chen, Yuchen Hu, Heqing Zou, Linhui Sun, Eng Siong Chng:
Unsupervised Noise Adaptation Using Data Simulation. ICASSP 2023: 1-5
[c247]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuCLZC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuCLZC23
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition. ICASSP 2023: 1-5
[c246]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuCZZC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuCZZC23
Yuchen Hu, Chen Chen, Heqing Zou, Xionghu Zhong, Eng Siong Chng:
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation. ICASSP 2023: 1-5
[c245]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NgZYYNZMNCM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NgZYYNZMNCM23
Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
De'hubert: Disentangling Noise in a Self-Supervised Model for Robust Speech Recognition. ICASSP 2023: 1-5
[c244]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NgZYZMNNCM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NgZYZMNNCM23
Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng, Bin Ma:
Contrastive Speech Mixup for Low-Resource Keyword Spotting. ICASSP 2023: 1-5
[c243]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RajaaADGC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RajaaADGC23
Shangeth Rajaa, Kriti Anandan, Swaraj Dalmia, Tarun Gupta, Eng Siong Chng:
Improving Spoken Language Identification with Map-Mix. ICASSP 2023: 1-5
[c242]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SholokhovKLC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SholokhovKLC23
Alexey Sholokhov, Nikita Kuzmin, Kong Aik Lee, Eng Siong Chng:
Probabilistic Back-ends for Online Speaker Recognition and Clustering. ICASSP 2023: 1-5
[c241]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangXHCL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangXHCL23
Yuhang Yang, Haihua Xu, Hao Huang, Eng Siong Chng, Sheng Li:
Speech-Text Based Multi-Modal Training with Bidirectional Attention for Improved Speech Recognition. ICASSP 2023: 1-5
[c240]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/HuLCZZC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/HuLCZZC23
Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng:
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. IJCAI 2023: 5076-5084
[c239]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/GuoQHS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/GuoQHS23
Yachao Guo, Zhibin Qiu, Hao Huang, Chng Eng Siong:
Improved Keyword Recognition Based on Aho-Corasick Automaton. IJCNN 2023: 1-7
[c238]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/SiZLWWDCL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/SiZLWWDCL23
Yuke Si, Yan Zhang, Yuhang Li, Xiaobao Wang, Longbiao Wang, Jianwu Dang, Eng Siong Chng, Haizhou Li:
Local and Global Context Modeling with Relation Matching Task for Dialog Act Recognition. IJCNN 2023: 1-8
[c237]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YangN0FJXMNC0Z23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YangN0FJXMNC0Z23
Zhao Yang, Dianwen Ng, Chong Zhang, Xiao Fu, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma, Jizhong Zhao:
Dual Acoustic Linguistic Self-supervised Representation Learning for Cross-Domain Speech Recognition. INTERSPEECH 2023: 72-76
[c236]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NgXYYTFC023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NgXYYTFC023
Dianwen Ng, Yang Xiao, Jia Qi Yip, Zhao Yang, Biao Tian, Qiang Fu, Eng Siong Chng, Bin Ma:
Small Footprint Multi-channel Network for Keyword Spotting with Centroid Based Awareness. INTERSPEECH 2023: 296-300
[c235]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Ng0ZM0NZ0WC023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Ng0ZM0NZ0WC023
Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Qian Chen, Wen Wang, Eng Siong Chng, Bin Ma:
Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition. INTERSPEECH 2023: 1319-1323
[c234]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YipTN0M0NZC023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YipTN0M0NZC023
Jia Qi Yip, Duc-Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention. INTERSPEECH 2023: 1938-1942
[c233]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiXXPLHC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiXXPLHC23
Rui Li, Zhiwei Xie, Haihua Xu, Yizhou Peng, Hexin Liu, Hao Huang, Eng Siong Chng:
Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memory. INTERSPEECH 2023: 1968-1972
[c232]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiaoXLCCFZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiaoXLCCFZ23
Zhiheng Liao, Feifei Xiong, Juan Luo, Minjie Cai, Eng Siong Chng, Jinwei Feng, Xionghu Zhong:
Blind Estimation of Room Impulse Response from Monaural Reverberant Speech with Segmental Generative Neural Network. INTERSPEECH 2023: 2723-2727
[c231]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuH0C23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuH0C23
Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition. INTERSPEECH 2023: 2918-2922
[c230]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YangNL0JXMNZ0C23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YangNL0JXMNZ0C23
Zhao Yang, Dianwen Ng, Xizhe Li, Chong Zhang, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Jizhong Zhao, Bin Ma, Eng Siong Chng:
Dual-Memory Multi-Modal Learning for Continual Spoken Keyword Spotting with Confidence Selection and Diversity Enhancement. INTERSPEECH 2023: 3774-3778
[c229]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0075YLHKC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0075YLHKC23
Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng:
A Neural State-Space Modeling Approach to Efficient Speech Separation. INTERSPEECH 2023: 3784-3788
[c228]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YangN0JXMNZ0C23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YangN0JXMNZ0C23
Zhao Yang, Dianwen Ng, Chong Zhang, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Jizhong Zhao, Bin Ma, Eng Siong Chng:
A Unified Recognition and Correction Model under Noisy and Accent Speech Conditions. INTERSPEECH 2023: 4953-4957
[c227]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0075HYSCS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0075HYSCS23
Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Chng Eng Siong:
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models. NeurIPS 2023
[c226]
- view
  authority control:
- export record
  dblp key:
  - conf/ssp/KhandelwalDKC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssp/KhandelwalDKC23
Tanmay Khandelwal, Rohan Kumar Das, Andrew Koh, Eng Siong Chng:
Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions. SSP 2023: 329-333
[i77]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08229
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08229
Shangeth Rajaa, Kriti Anandan, Swaraj Dalmia, Tarun Gupta, Eng Siong Chng:
Improving Spoken Language Identification with Map-Mix. CoRR abs/2302.08229 (2023)
[i76]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-09523
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-09523
Alexey Sholokhov, Nikita Kuzmin, Kong Aik Lee, Eng Siong Chng:
Probabilistic Back-ends for Online Speaker Recognition and Clustering. CoRR abs/2302.09523 (2023)
[i75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-11131
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-11131
Yuchen Hu, Chen Chen, Heqing Zou, Xionghu Zhong, Eng Siong Chng:
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation. CoRR abs/2302.11131 (2023)
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-11362
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-11362
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition. CoRR abs/2302.11362 (2023)
[i73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-11981
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-11981
Chen Chen, Yuchen Hu, Heqing Zou, Linhui Sun, Eng Siong Chng:
Unsupervised Noise adaptation using Data Simulation. CoRR abs/2302.11981 (2023)
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-11989
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-11989
Chen Chen, Yuchen Hu, Weiwei Weng, Eng Siong Chng:
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model. CoRR abs/2302.11989 (2023)
[i71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-14597
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-14597
Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition. CoRR abs/2302.14597 (2023)
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-04974
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-04974
Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng:
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR. CoRR abs/2304.04974 (2023)
[i69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-01170
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-01170
Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng, Bin Ma:
Contrastive Speech Mixup for Low-resource Keyword Spotting. CoRR abs/2305.01170 (2023)
[i68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-09212
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-09212
Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng:
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. CoRR abs/2305.09212 (2023)
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-09299
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-09299
Heqing Zou, Meng Shen, Chen Chen, Yuchen Hu, Deepu Rajan, Eng Siong Chng:
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning. CoRR abs/2305.09299 (2023)
[i66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-10761
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-10761
Zizheng Zhang, Chen Chen, Xiang Liu, Yuchen Hu, Eng Siong Chng:
Noise-aware Speech Separation with Contrastive Learning. CoRR abs/2305.10761 (2023)
[i65]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12121
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12121
Jia Qi Yip, Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention. CoRR abs/2305.12121 (2023)
[i64]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12460
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12460
Leander Melroy Maben, Zixun Guo, Chen Chen, Utkarsh Chudiwal, Chng Eng Siong:
Study of GANs for Noisy Speech Simulation from Clean Speech. CoRR abs/2305.12460 (2023)
[i63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-16932
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-16932
Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng:
A Neural State-Space Model Approach to Efficient Speech Separation. CoRR abs/2305.16932 (2023)
[i62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-10563
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-10563
Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng:
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition. CoRR abs/2306.10563 (2023)
[i61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-10567
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-10567
Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng:
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition. CoRR abs/2306.10567 (2023)
[i60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-08029
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-08029
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Noise-aware Speech Enhancement using Diffusion Probabilistic Model. CoRR abs/2307.08029 (2023)
[i59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-07458
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07458
Jia Qi Yip, Dianwen Ng, Bin Ma, Chng Eng Siong:
Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures. CoRR abs/2309.07458 (2023)
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-07466
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07466
Ansh Mishra, Jia Qi Yip, Eng Siong Chng:
Codec Data Augmentation for Time-domain Heart Sound Classification. CoRR abs/2309.07466 (2023)
[i57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-09413
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-09413
Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
Are Soft Prompts Good Zero-shot Learners for Speech Recognition? CoRR abs/2309.09413 (2023)
[i56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-12608
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12608
Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for enhanced speech separation performance. CoRR abs/2309.12608 (2023)
[i55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-14838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-14838
Duc-Tuan Truong, Ruijie Tao, Jia Qi Yip, Kong Aik Lee, Eng Siong Chng:
Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification. CoRR abs/2309.14838 (2023)
[i54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-15701
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-15701
Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng:
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models. CoRR abs/2309.15701 (2023)
[i53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-13013
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-13013
Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng:
Generative error correction for code-switching speech recognition using large language models. CoRR abs/2310.13013 (2023)
[i52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-12153
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-12153
Fabian Ritter Gutierrez, Kuan-Po Huang, Dianwen Ng, Jeremy Heng Meng Wong, Hung-yi Lee, Eng Siong Chng, Nancy F. Chen:
Noise robust distillation of self-supervised speech models via correlation metrics. CoRR abs/2312.12153 (2023)
2022
[j36]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/LiuGKCSK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/LiuGKCSK22
Hexin Liu, Leibny Paola García-Perera, Andy W. H. Khong, Eng Siong Chng, Suzy J. Styles, Sanjeev Khudanpur:
Efficient Self-Supervised Learning Representations for Spoken Language Identification. IEEE J. Sel. Top. Signal Process. 16(6): 1296-1307 (2022)
[j35]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/GuoWDCN22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/GuoWDCN22
Lili Guo, Longbiao Wang, Jianwu Dang, Eng Siong Chng, Seiichi Nakagawa:
Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition. Speech Commun. 136: 118-127 (2022)
[c225]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/XiaoLKSCPW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/XiaoLKSCPW22
Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning for On-Ddevice Environmental Sound Classification. DCASE 2022
[c224]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NgCTFC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NgCTFC22
Dianwen Ng, Yunqi Chen, Biao Tian, Qiang Fu, Eng Siong Chng:
Convmixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-Field Keyword Spotting. ICASSP 2022: 3603-3607
[c223]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenHHQZC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenHHQZC22
Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng:
Self-Critical Sequence Training for Automatic Speech Recognition. ICASSP 2022: 3688-3692
[c222]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenHHSC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenHHSC22
Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng:
Noise-Robust Speech Recognition With 10 Minutes Unparalleled In-Domain Data. ICASSP 2022: 4298-4302
[c221]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuHCC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuHCC22
Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition. ICASSP 2022: 6292-6296
[c220]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XueSZNC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XueSZNC22
Fuzhao Xue, Aixin Sun, Hao Zhang, Jinjie Ni, Eng Siong Chng:
An Embarrassingly Simple Model for Dialogue Relation Extraction. ICASSP 2022: 6707-6711
[c219]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GeXWCDL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GeXWCDL22
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. ICASSP 2022: 7287-7291
[c218]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZouSCRC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZouSCRC22
Heqing Zou, Yuke Si, Chen Chen, Deepu Rajan, Eng Siong Chng:
Speech Emotion Recognition with Co-Attention Based Multi-Level Acoustic Information. ICASSP 2022: 7367-7371
[c217]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KohXS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KohXS22
Andrew Koh, Fuzhao Xue, Chng Eng Siong:
Automated Audio Captioning Using Transfer Learning and Reconstruction Latent Space Similarity Regularization. ICASSP 2022: 7722-7726
[c216]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PengZXHC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PengZXHC22
Yizhou Peng, Jicheng Zhang, Haihua Xu, Hao Huang, Eng Siong Chng:
Minimum Word Error Training For Non-Autoregressive Transformer-Based Code-Switching ASR. ICASSP 2022: 7807-7811
[c215]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuptaTAC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuptaTAC22
Tarun Gupta, Duc-Tuan Truong, Tran The Anh, Eng Siong Chng:
Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model. INTERSPEECH 2022: 1978-1982
[c214]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenHHZQC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenHHZQC22
Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng:
Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning. INTERSPEECH 2022: 2773-2777
[c213]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoHC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoHC22
Yang Xiao, Nana Hou, Eng Siong Chng:
Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting. INTERSPEECH 2022: 3764-3768
[c212]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuoCC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuoCC22
Zixun Guo, Chen Chen, Eng Siong Chng:
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition. INTERSPEECH 2022: 3799-3803
[c211]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZhangYHXWCBZCX22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhangYHXWCBZCX22
Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu:
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results. ISCSLP 2022: 507-511
[i51]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-05863
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-05863
Dianwen Ng, Yunqi Chen, Biao Tian, Qiang Fu, Eng Siong Chng:
ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword Spotting. CoRR abs/2201.05863 (2022)
[i50]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-09995
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-09995
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. CoRR abs/2202.09995 (2022)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-11774
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-11774
Tarun Gupta, Duc-Tuan Truong, Tran The Anh, Chng Eng Siong:
Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model. CoRR abs/2203.11774 (2022)
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-14838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-14838
Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition. CoRR abs/2203.14838 (2022)
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15321
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15321
Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng:
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data. CoRR abs/2203.15321 (2022)
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15326
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15326
Heqing Zou, Yuke Si, Chen Chen, Deepu Rajan, Eng Siong Chng:
Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information. CoRR abs/2203.15326 (2022)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15526
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15526
Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng:
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning. CoRR abs/2203.15526 (2022)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-16361
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16361
Yang Xiao, Nana Hou, Eng Siong Chng:
Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting. CoRR abs/2203.16361 (2022)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-05445
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-05445
Dianwen Ng, Jin Hui Pang, Yang Xiao, Biao Tian, Qiang Fu, Eng Siong Chng:
Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness. CoRR abs/2204.05445 (2022)
[i42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-06260
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-06260
Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng:
Self-critical Sequence Training for Automatic Speech Recognition. CoRR abs/2204.06260 (2022)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-01918
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-01918
Andrew Koh, Soham Tiwari, Chng Eng Siong:
Automated Audio Captioning with Epochal Difficult Captions for Curriculum Learning. CoRR abs/2206.01918 (2022)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-14659
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-14659
Andrew Koh, Eng Siong Chng:
Language-Based Audio Retrieval with Converging Tied Layers and Contrastive Loss. CoRR abs/2206.14659 (2022)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-04176
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-04176
Yizhou Peng, Yufei Liu, Jicheng Zhang, Haihua Xu, Yi He, Hao Huang, Eng Siong Chng:
Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition. CoRR abs/2207.04176 (2022)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-04177
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-04177
Jicheng Zhang, Yizhou Peng, Haihua Xu, Yi He, Eng Siong Chng, Hao Huang:
Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder. CoRR abs/2207.04177 (2022)
[i37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-07429
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-07429
Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning For On-Device Environmental Sound Classification. CoRR abs/2207.07429 (2022)
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-00987
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-00987
Zixun Guo, Chen Chen, Eng Siong Chng:
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition. CoRR abs/2208.00987 (2022)
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-06360
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-06360
Dianwen Ng, Jia Qi Yip, Tanmay Surana, Zhao Yang, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization. CoRR abs/2209.06360 (2022)
[i34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-00325
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00325
Yuhang Yang, Haihua Xu, Hao Huang, Eng Siong Chng, Sheng Li:
Speech-text based multi-modal training with bidirectional attention for improved speech recognition. CoRR abs/2211.00325 (2022)
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-01585
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01585
Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu:
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results. CoRR abs/2211.01585 (2022)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-05301
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-05301
Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng:
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning. CoRR abs/2212.05301 (2022)
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-05356
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-05356
Abhinav Rao, Thi-Nga Ho, Eng Siong Chng:
Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin. CoRR abs/2212.05356 (2022)
2021
[c210]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/XueSZC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XueSZC21
Fuzhao Xue, Aixin Sun, Hao Zhang, Eng Siong Chng:
GDPNet: Refining Latent Multi-View Graph for Relation Extraction. AAAI 2021: 14194-14202
[c209]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/KaushikPAC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/KaushikPAC21
Manav Kaushik, Van Tung Pham, Tran The Anh, Eng Siong Chng:
End-to-End Speaker Age and Height Estimation using Attention Mechanism and Triplet Loss. APSIPA ASC 2021: 1-8
[c208]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/MaHPXC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/MaHPXC21
Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Multitask-based joint learning approach to robust ASR for radio communication speech. APSIPA ASC 2021: 497-502
[c207]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/ChenHMC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ChenHMC21
Chen Chen, Nana Hou, Duo Ma, Eng Siong Chng:
Time Domain Speech Enhancement With Attentive Multi-scale Approach. APSIPA ASC 2021: 679-683
[c206]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/MaoKPXHWC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/MaoKPXHWC21
Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named Entities for Improved Speech Recognition. APSIPA ASC 2021: 1021-1025
[c205]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/PengZZXHLC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/PengZZXHLC21
Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Sheng Li, Eng Siong Chng:
Multilingual Approach to Joint Speech and Accent Recognition with DNN-HMM Framework. APSIPA ASC 2021: 1043-1048
[c204]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZhaoNLJCM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhaoNLJCM21
Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. EMNLP (1) 2021: 9339-9349
[c203]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HouXC021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HouXC021
Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Learning Disentangled Feature Representations for Speech Enhancement Via Adversarial Training. ICASSP 2021: 666-670
[c202]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GeXWCD021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GeXWCD021
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
Multi-Stage Speaker Extraction with Utterance and Frame-Level Reference Signals. ICASSP 2021: 6109-6113
[c201]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GuoWXDC021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuoWXDC021
Lili Guo, Longbiao Wang, Chenglin Xu, Jianwu Dang, Eng Siong Chng, Haizhou Li:
Representation Learning with Spectro-Temporal-Channel Attention for Speech Emotion Recognition. ICASSP 2021: 6304-6308
[c200]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhaoNLJCM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoNLJCM21
Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Preventing Early Endpointing for Online Automatic Speech Recognition. ICASSP 2021: 6813-6817
[c199]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangPPXHC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangPPXHC21
Jicheng Zhang, Yizhou Peng, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
E2E-Based Multi-Task Learning Approach to Joint Speech and Accent Recognition. Interspeech 2021: 1519-1523
[c198]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenPCZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenPCZ21
Weiguang Chen, Van Tung Pham, Eng Siong Chng, Xionghu Zhong:
Overlapped Speech Detection Based on Spectral and Spatial Feature Fusion. Interspeech 2021: 4189-4193
[c197]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/MaoKPXHC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/MaoKPXHC21
Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. ISCSLP 2021: 1-5
[c196]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZengPXKCNM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZengPXKCNM21
Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. ISCSLP 2021: 1-5
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2101-05056
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-05056
Manav Kaushik, Van Tung Pham, Eng Siong Chng:
End-to-End Speaker Height and age estimation using Attention Mechanism with LSTM-RNN. CoRR abs/2101.05056 (2021)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-10701
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-10701
Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech. CoRR abs/2107.10701 (2021)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-04692
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-04692
Andrew Koh, Fuzhao Xue, Eng Siong Chng:
Automated Audio Captioning using Transfer Learning and Reconstruction Latent Space Similarity Regularization. CoRR abs/2108.04692 (2021)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05267
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05267
Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition. CoRR abs/2110.05267 (2021)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-08545
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-08545
Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. CoRR abs/2110.08545 (2021)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-13653
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-13653
Shangeth Rajaa, Van Tung Pham, Chng Eng Siong:
Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling. CoRR abs/2110.13653 (2021)
2020
[j34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/XuRCL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XuRCL20
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
SpEx: Multi-Scale Time Domain Speaker Extraction Network. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1370-1384 (2020)
[c195]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/YapKC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/YapKC20
Boon Peng Yap, Andrew Koh, Eng Siong Chng:
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences. EMNLP (Findings) 2020: 41-46
[c194]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HaoXHXC020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HaoXHXC020
Xiang Hao, Chenglin Xu, Nana Hou, Lei Xie, Eng Siong Chng, Haizhou Li:
Time-Domain Neural Network Approach for Speech Bandwidth Extension. ICASSP 2020: 866-870
[c193]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PhamXKZCNM020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PhamXKZCNM020
Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent Language Modeling Architecture for End-To-End ASR. ICASSP 2020: 7059-7063
[c192]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhaoNLJCM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhaoNLJCM20
Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Speech Transformer with Speaker Aware Persistent Memory. INTERSPEECH 2020: 1261-1265
[c191]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GeXWCD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GeXWCD020
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
SpEx+: A Complete Time Domain Speaker Extraction Network. INTERSPEECH 2020: 1406-1410
[c190]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangXPHC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangXPHC20
Haobo Zhang, Haihua Xu, Van Tung Pham, Hao Huang, Eng Siong Chng:
Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition. INTERSPEECH 2020: 2392-2396
[c189]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HouXPZC020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HouXPZC020
Nana Hou, Chenglin Xu, Van Tung Pham, Joey Tianyi Zhou, Eng Siong Chng, Haizhou Li:
Speaker and Phoneme-Aware Speech Bandwidth Extension with Residual Dual-Path Network. INTERSPEECH 2020: 4064-4068
[c188]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HouXZC020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HouXZC020
Nana Hou, Chenglin Xu, Joey Tianyi Zhou, Eng Siong Chng, Haizhou Li:
Multi-Task Learning for End-to-End Noise-Robust Bandwidth Extension. INTERSPEECH 2020: 4069-4073
[c187]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhaoNLJCM20a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhaoNLJCM20a
Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Universal Speech Transformer. INTERSPEECH 2020: 5021-5025
[c186]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhaoNLJCM20b
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhaoNLJCM20b
Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Cross Attention with Monotonic Alignment for Speech Transformer. INTERSPEECH 2020: 5031-5035
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-08326
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-08326
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
SpEx: Multi-Scale Time Domain Speaker Extraction Network. CoRR abs/2004.08326 (2020)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-14762
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-14762
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Time-domain speaker extraction network. CoRR abs/2004.14762 (2020)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-04686
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-04686
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
SpEx+: A Complete Time Domain Speaker Extraction Network. CoRR abs/2005.04686 (2020)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-08742
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08742
Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. CoRR abs/2005.08742 (2020)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-10407
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-10407
Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. CoRR abs/2005.10407 (2020)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-11795
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-11795
Boon Peng Yap, Andrew Koh, Eng Siong Chng:
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences. CoRR abs/2009.11795 (2020)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11483
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11483
Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Eng Siong Chng:
A multilingual approach to joint Speech and Accent Recognition with DNN-HMM framework. CoRR abs/2010.11483 (2020)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-12143
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-12143
Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance. CoRR abs/2010.12143 (2020)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-09624
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-09624
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals. CoRR abs/2011.09624 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-06780
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-06780
Fuzhao Xue, Aixin Sun, Hao Zhang, Eng Siong Chng:
GDPNet: Refining Latent Multi-View Graph for Relation Extraction. CoRR abs/2012.06780 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-13873
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-13873
Fuzhao Xue, Aixin Sun, Hao Zhang, Eng Siong Chng:
An Embarrassingly Simple Model for Dialogue Relation Extraction. CoRR abs/2012.13873 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c185]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/VuZXC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/VuZXC19
Thi-Ly Vu, Zhiping Zeng, Haihua Xu, Eng Siong Chng:
Audio Codec Simulation based Data Augmentation for Telephony Speech Recognition. APSIPA 2019: 198-203
[c184]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/MakhijaHC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/MakhijaHC19
Karan Makhija, Thi-Nga Ho, Eng Siong Chng:
Transfer Learning for Punctuation Prediction. APSIPA 2019: 268-273
[c183]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/HouXC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/HouXC019
Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Domain Adversarial Training for Speech Enhancement. APSIPA 2019: 667-672
[c182]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/MaLXC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/MaLXC19
Duo Ma, Guanyu Li, Haihua Xu, Eng Siong Chng:
Improving code-switching speech recognition with data augmentation and system combination. APSIPA 2019: 1308-1312
[c181]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/XuRCL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/XuRCL19
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Time-Domain Speaker Extraction Network. ASRU 2019: 327-334
[c180]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/LiXC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/LiXC19
Wenjie Li, Haihua Xu, Eng Siong Chng:
The TL@NTU Text-to-speech System for the Blizzard Challenge 2019. Blizzard Challenge 2019
[c179]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuRC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuRC019
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss. ICASSP 2019: 6990-6994
[c178]
- view
  authority control:
- export record
  dblp key:
  - conf/icmlsc/NguyenTCHVC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmlsc/NguyenTCHVC19
Trang M. Nguyen, Van-Lien Tran, Duy-Cat Can, Quang-Thuy Ha, Ly T. Vu, Engsiong Chng:
QASA: Advanced Document Retriever for Open-Domain Question Answering by Learning to Rank Question-Aware Self-Attentive Document Representations. ICMLSC 2019: 221-225
[c177]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TianC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TianC019
Xiaohai Tian, Eng Siong Chng, Haizhou Li:
A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data. INTERSPEECH 2019: 201-205
[c176]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RaoXC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RaoXC019
Wei Rao, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Target Speaker Extraction for Multi-Talker Speaker Verification. INTERSPEECH 2019: 1273-1277
[c175]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KhassanovXPZCNM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KhassanovXPZCNM19
Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. INTERSPEECH 2019: 2160-2164
[c174]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZengKPXC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZengKPXC019
Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng, Haizhou Li:
On the End-to-End Solution to Mandarin-English Code-Switching Speech Recognition. INTERSPEECH 2019: 2165-2169
[c173]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KhassanovZPXC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KhassanovZPXC19
Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation. INTERSPEECH 2019: 3505-3509
[c172]
- view
  authority control:
- export record
  dblp key:
  - conf/iwsds/VuKSB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/VuKSB19
Thi-Ly Vu, Zin Tun Kyaw, Chng Eng Siong, Rafael E. Banchs:
Online FAQ Chatbot for Customer Support. IWSDS 2019: 251-259
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-02546
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-02546
Wei Rao, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Target Speaker Extraction for Overlapped Multi-Talker Speaker Verification. CoRR abs/1902.02546 (2019)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-03705
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-03705
Xiaohai Tian, Eng Siong Chng, Haizhou Li:
A Vocoder-free WaveNet Voice Conversion with Non-Parallel Data. CoRR abs/1902.03705 (2019)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-09952
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-09952
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss. CoRR abs/1903.09952 (2019)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-03799
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-03799
Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation. CoRR abs/1904.03799 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-03802
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-03802
Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. CoRR abs/1904.03802 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-07386
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-07386
Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-00863
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-00863
Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent language modeling architecture for end-to-end ASR. CoRR abs/1912.00863 (2019)
2018
[j33]
- view
  authority control:
- export record
  dblp key:
  - journals/sigpro/YuXXC18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/sigpro/YuXXC18
Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng:
Learning distributed sentence representations for story segmentation. Signal Process. 142: 403-411 (2018)
[j32]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/PhamXXCCL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/PhamXXCCL18
Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li:
Re-ranking spoken term detection with acoustic exemplars of keywords. Speech Commun. 104: 12-23 (2018)
[c171]
- view
  authority control:
- export record
  dblp key:
  - conf/aclnews/LiWACL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/LiWACL18
Zhongwei Li, Xuancong Wang, AiTi Aw, Eng Siong Chng, Haizhou Li:
Named-Entity Tagging and Domain adaptation for Better Customized Translation. NEWS@ACL 2018: 41-46
[c170]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/CanHC18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/CanHC18
Duy-Cat Can, Thi-Nga Ho, Eng Siong Chng:
A Hybrid Deep Learning Architecture for Sentence Unit Detection. IALP 2018: 129-132
[c169]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/HoCC18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/HoCC18
Thi-Nga Ho, Duy-Cat Can, Engsiong Chng:
An Investigation of Word Embeddings with Deep Bidirectional LSTM for Sentence Unit Detection in Automatic Speech Transcription. IALP 2018: 139-142
[c168]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuRXC018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuRXC018
Chenglin Xu, Wei Rao, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Single Channel Speech Separation with Constrained Utterance Level Permutation Invariant Training Using Grid LSTM. ICASSP 2018: 6-10
[c167]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangRSXCL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangRSXCL18
Qing Wang, Wei Rao, Sining Sun, Lei Xie, Eng Siong Chng, Haizhou Li:
Unsupervised Domain Adaptation via Domain Adversarial Training for Speaker Recognition. ICASSP 2018: 4889-4893
[c166]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/XuPKLCL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuPKLCL18
Haihua Xu, Van Tung Pham, Zin Tun Kyaw, Zhi Hao Lim, Eng Siong Chng, Haizhou Li:
Mandarin-English Code-switching Speech Recognition. INTERSPEECH 2018: 554-555
[c165]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuoXXC18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuoXXC18
Pengcheng Guo, Haihua Xu, Lei Xie, Eng Siong Chng:
Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition. INTERSPEECH 2018: 1928-1932
[c164]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KhassanovC18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KhassanovC18
Yerbolat Khassanov, Eng Siong Chng:
Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR. INTERSPEECH 2018: 3343-3347
[c163]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuRCL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuRCL18
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning. INTERSPEECH 2018: 3479-3483
[c162]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/TianWXC018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/TianWXC018
Xiaohai Tian, Junchao Wang, Haihua Xu, Eng Siong Chng, Haizhou Li:
Average Modeling Approach to Voice Conversion with Non-Parallel Data. Odyssey 2018: 227-232
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1806-06200
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-06200
Pengcheng Guo, Haihua Xu, Lei Xie, Eng Siong Chng:
Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition. CoRR abs/1806.06200 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1806-10306
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-10306
Yerbolat Khassanov, Eng Siong Chng:
Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR. CoRR abs/1806.10306 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-00241
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-00241
Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng, Haizhou Li:
On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition. CoRR abs/1811.00241 (2018)
2017
[j31]
- view
  authority control:
- export record
  dblp key:
  - journals/jaihc/YuXXC17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jaihc/YuXXC17
Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng:
A hybrid neural network hidden Markov model approach for automatic story segmentation. J. Ambient Intell. Humaniz. Comput. 8(6): 925-936 (2017)
[j30]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TianLWCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TianLWCL17
Xiaohai Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng, Haizhou Li:
An Exemplar-Based Approach to Frequency Warping for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1863-1876 (2017)
[c161]
- view
  authority control:
- export record
  dblp key:
  - conf/aciids/KhassanovCBC17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aciids/KhassanovCBC17
Yerbolat Khassanov, Tze Yuang Chong, Benjamin Bigot, Eng Siong Chng:
Unsupervised Language Model Adaptation by Data Selection for Speech Recognition. ACIIDS (1) 2017: 508-517
[c160]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YuXXC17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YuXXC17
Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng:
An end-to-end neural network approach to story segmentation. APSIPA 2017: 171-176
[c159]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ChenLDPNXHCXSCM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ChenLDPNXHCXSCM17
Nancy F. Chen, Boon Pang Lim, Van Hai Do, Van Tung Pham, Chongjia Ni, Haihua Xu, Mark Hasegawa-Johnson, Wenda Chen, Xiong Xiao, Sunil Sivadas, Eng Siong Chng, Bin Ma, Haizhou Li:
Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory. APSIPA 2017: 1322-1327
[c158]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/LimTRC17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LimTRC17
Zhi Hao Lim, Xiaohai Tian, Wei Rao, Eng Siong Chng:
An investigation of spectral feature partitioning for replay attacks detection. APSIPA 2017: 1570-1573
[c157]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ZengXCCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZengXCCL17
Zhiping Zeng, Haihua Xu, Tze Yuang Chong, Eng Siong Chng, Haizhou Li:
Improving N-gram language modeling for code-switching speech recognition. APSIPA 2017: 1596-1601
[c156]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YuXXC17a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YuXXC17a
Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng:
Topic embedding of sentences for story segmentation. APSIPA 2017: 1602-1607
[c155]
- view
  authority control:
- export record
  dblp key:
  - conf/hci/TianMLSCLGM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hci/TianMLSCLGM17
Xiaohai Tian, Lei Meng, Siyuan Liu, Zhiqi Shen, Eng Siong Chng, Cyril Leung, Frank Guan, Chunyan Miao:
Novel Functional Technologies for Age-Friendly E-commerce. HCI (28) 2017: 150-158
[c154]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/HouTCML17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/HouTCML17
Nana Hou, Xiaohai Tian, Eng Siong Chng, Bin Ma, Haizhou Li:
Improving air traffic control speech intelligibility by reducing speaking rate effectively. IALP 2017: 197-200
[c153]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/LeeHCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/LeeHCL17
Grandee Lee, Thi-Nga Ho, Eng Siong Chng, Haizhou Li:
A review of the mandarin-english code-switching corpus: SEAME. IALP 2017: 210-213
[c152]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/LiCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/LiCL17
Zhongwei Li, Eng Siong Chng, Haizhou Li:
Named entity transliteration with sequence-to-sequence neural network. IALP 2017: 374-378
[c151]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoZJCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoZJCL17
Xiong Xiao, Shengkui Zhao, Douglas L. Jones, Eng Siong Chng, Haizhou Li:
On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition. ICASSP 2017: 3246-3250
[c150]
- view
  authority control:
- export record
  dblp key:
  - conf/iccse/MengHTSCGML17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccse/MengHTSCGML17
Lei Meng, Nguyen Quy Hy, Xiaohai Tian, Zhiqi Shen, Eng Siong Chng, Frank Yunqing Guan, Chunyan Miao, Cyril Leung:
Towards Age-friendly E-commerce Through Crowd-Improved Speech Recognition, Multimodal Search, and Personalized Speech Feedback. ICCSE 2017: 127-135
[c149]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeHKLa17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeHKLa17
Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, Chunlei Zhang, Andreas Nautsch, Themos Stafylakis, Gang Liu, Mickaël Rouvier, Wei Rao, Federico Alegre, J. Ma, Man-Wai Mak, Achintya Kumar Sarkar, Héctor Delgado, Rahim Saeidi, Hagai Aronowitz, Aleksandr Sizov, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Bin Ma, Ville Vestman, Md. Sahidullah, M. Halonen, Anssi Kanervisto, Gaël Le Lan, Fahimeh Bahmaninezhad, Sergey Isadskiy, Christian Rathgeb, Christoph Busch, Georgios Tzimiropoulos, Q. Qian, Z. Wang, Q. Zhao, T. Wang, H. Li, J. Xue, S. Zhu, R. Jin, T. Zhao, Pierre-Michel Bousquet, Moez Ajili, Waad Ben Kheder, Driss Matrouf, Zhi Hao Lim, Chenglin Xu, Haihua Xu, Xiong Xiao, Eng Siong Chng, Benoit G. B. Fauve, Kaavya Sriskandaraja, Vidhyasaharan Sethu, W. W. Lin, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Massimiliano Todisco, Nicholas W. D. Evans, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Eliathamby Ambikairajah:
The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016. INTERSPEECH 2017: 1328-1332
[c148]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuXSRCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuXSRCL17
Chenglin Xu, Xiong Xiao, Sining Sun, Wei Rao, Eng Siong Chng, Haizhou Li:
Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source. INTERSPEECH 2017: 1894-1898
[c147]
- view
  authority control:
- export record
  dblp key:
  - conf/soict/PhamXXCC17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/soict/PhamXXCC17
Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng:
Pruning Strategies for Partial Search in Spoken Term Detection. SoICT 2017: 114-119
2016
[j29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasp/XiaoZNZJCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasp/XiaoZNZJCL16
Xiong Xiao, Shengkui Zhao, Duc Hoang Ha Nguyen, Xionghu Zhong, Douglas L. Jones, Eng Siong Chng, Haizhou Li:
Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation. EURASIP J. Adv. Signal Process. 2016: 4 (2016)
[j28]
- view
  authority control:
- export record
  dblp key:
  - journals/mta/HyLTDC16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mta/HyLTDC16
Nguyen Quy Hy, Siu Wa Lee, Xiaohai Tian, Minghui Dong, Eng Siong Chng:
High quality voice conversion using prosodic and high-resolution spectral features. Multim. Tools Appl. 75(9): 5265-5285 (2016)
[j27]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/NguyenXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/NguyenXCL16
Duc Hoang Ha Nguyen, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Feature Adaptation Using Linear Spectro-Temporal Transform for Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(6): 1006-1019 (2016)
[j26]
- view
  authority control:
- export record
  dblp key:
  - journals/vlsisp/UedaWKXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/vlsisp/UedaWKXCL16
Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng, Haizhou Li:
Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization. J. Signal Process. Syst. 82(2): 151-161 (2016)
[c146]
- view
  authority control:
- export record
  dblp key:
  - conf/aciids/HoCDPC16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aciids/HoCDPC16
Thi-Nga Ho, Tze Yuang Chong, Van Hai Do, Van Tung Pham, Eng Siong Chng:
Improving Efficiency of Sentence Boundary Detection by Feature Selection. ACIIDS (2) 2016: 594-603
[c145]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/LeowCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LeowCL16
Su Jun Leow, Eng Siong Chng, Chin-Hui Lee:
Zero resource anti-spoofing detection for unit selection based synthetic speech using image spectrogram artifacts. APSIPA 2016: 1-6
[c144]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/TianXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/TianXCL16
Xiaohai Tian, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing speech detection using temporal convolutional neural network. APSIPA 2016: 1-6
[c143]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/XiaoWCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/XiaoWCL16
Xiong Xiao, Shinji Watanabe, Eng Siong Chng, Haizhou Li:
Beamforming networks using spatial covariance features for far-field speech recognition. APSIPA 2016: 1-6
[c142]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/XuRXHCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/XuRXHCL16
Haihua Xu, Wei Rao, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li:
I-vector based deep neural network acoustic model adaptation using multilingual language resource. APSIPA 2016: 1-5
[c141]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VuBC16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VuBC16
Thanh T. Vu, Benjamin Bigot, Eng Siong Chng:
Combining non-negative matrix factorization and deep neural networks for speech enhancement and automatic speech recognition. ICASSP 2016: 499-503
[c140]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TianWXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TianWXCL16
Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing detection from a feature representation perspective. ICASSP 2016: 2119-2123
[c139]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenLCMLD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenLCMLD16
Liping Chen, Kong-Aik Lee, Eng Siong Chng, Bin Ma, Haizhou Li, Li-Rong Dai:
Content-aware local variability vector for speaker verification with short utterance. ICASSP 2016: 5485-5489
[c138]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuHXPLWDLXMCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuHXPLWDLXMCL16
Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Van Hai Do, Hang Lv, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li:
Approximate search of audio queries by using DTW with phone time boundary and data augmentation. ICASSP 2016: 6030-6034
[c137]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PhamXXCCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PhamXXCCL16
Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li:
Keyword search using query expansion for graph-based rescoring of hypothesized detections. ICASSP 2016: 6035-6039
[c136]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenPXXDNCSLCML16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenPXXDNCSLCML16
Nancy F. Chen, Van Tung Pham, Haihua Xu, Xiong Xiao, Van Hai Do, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Chin-Hui Lee, Eng Siong Chng, Bin Ma, Haizhou Li:
Exemplar-inspired strategies for low-resource spoken keyword search in Swahili. ICASSP 2016: 6040-6044
[c135]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoZNJCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoZNJCL16
Xiong Xiao, Shengkui Zhao, Thi Ngoc Tho Nguyen, Douglas L. Jones, Eng Siong Chng, Haizhou Li:
An expectation-maximization eigenvector clustering approach to direction of arrival estimation of multiple speech sources. ICASSP 2016: 6330-6334
[c134]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PhamXXCCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PhamXXCCL16
Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li:
Rescoring Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples. INTERSPEECH 2016: 933-937
[c133]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuSNXHCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuSNXHCL16
Haihua Xu, Hang Su, Chongjia Ni, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li:
Semi-Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models Under Low-Resource Conditions. INTERSPEECH 2016: 1315-1319
[c132]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuXXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuXXCL16
Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng, Haizhou Li:
A DNN-HMM Approach to Story Segmentation. INTERSPEECH 2016: 1527-1531
[c131]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TianWXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TianWXCL16
Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions. INTERSPEECH 2016: 1715-1719
[c130]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeLDHRXLSNWSCK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeLDHRXLSNWSCK16
Kong-Aik Lee, Haizhou Li, Li Deng, Ville Hautamäki, Wei Rao, Xiong Xiao, Anthony Larcher, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Aleksandr Sizov, Jianshu Chen, Ivan Kukanov, Amir Hossein Poorjam, Trung Ngo Trong, Chenglin Xu, Haihua Xu, Bin Ma, Eng Siong Chng, Sylvain Meignier:
The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS. INTERSPEECH 2016: 3211-3215
[c129]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeungWXHPLXXNMC16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeungWXHPLXXNMC16
Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, Lei Xie, Xiong Xiao, Chongjia Ni, Bin Ma, Eng Siong Chng, Haizhou Li:
Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis. INTERSPEECH 2016: 3703-3707
[c128]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/RaoXXXLCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/RaoXXXLCL16
Wei Rao, Xiong Xiao, Chenglin Xu, Haihua Xu, Kong-Aik Lee, Eng Siong Chng, Haizhou Li:
Neural networks based channel compensation for i-vector speaker verification. ISCSLP 2016: 1-5
[c127]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZhangXWDICL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhangXWDICL16
Zhaofeng Zhang, Xiong Xiao, Longbiao Wang, Jianwu Dang, Masahiro Iwahashi, Eng Siong Chng, Haizhou Li:
Multi-channel feature adaptation for robust speech recognition. ISCSLP 2016: 1-5
[c126]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mediaeval/WangNLYXXXNCML16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mediaeval/WangNLYXXXNCML16
Lei Wang, Chongjia Ni, Cheung-Chi Leung, Changhuai You, Lei Xie, Haihua Xu, Xiong Xiao, Tin Lay Nwe, Eng Siong Chng, Bin Ma, Haizhou Li:
The NNI Vietnamese Speech Recognition System for MediaEval 2016. MediaEval 2016
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/TianWXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/TianWXCL16
Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing detection under noisy conditions: a preliminary investigation and an initial database. CoRR abs/1602.02950 (2016)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/ZhangXWCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ZhangXWCL16
Zhaofeng Zhang, Xiong Xiao, Longbiao Wang, Eng Siong Chng, Haizhou Li:
Noise Robust Speech Recognition Using Multi-Channel Based Channel Selection And ChannelWeighting. CoRR abs/1604.03276 (2016)
2015
[j25]
- view
  - electronic edition @ colips.org
  - details & citations
- export record
  dblp key:
  - journals/jclc/DoXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jclc/DoXCL15
Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Context-dependent Phone Mapping for Acoustic Modeling of Under-resourced Languages. Int. J. Asian Lang. Process. 23(1): 21-33 (2015)
[j24]
- view
  authority control:
- export record
  dblp key:
  - journals/lre/LyuTCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/lre/LyuTCL15
Dau-Cheng Lyu, Tien Ping Tan, Engsiong Chng, Haizhou Li:
Mandarin-English code-switching speech corpus in South-East Asia: SEAME. Lang. Resour. Evaluation 49(3): 581-600 (2015)
[j23]
- view
  authority control:
- export record
  dblp key:
  - journals/mta/WuCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mta/WuCL15
Zhizheng Wu, Engsiong Chng, Haizhou Li:
Exemplar-based voice conversion using joint nonnegative matrix factorization. Multim. Tools Appl. 74(22): 9943-9958 (2015)
[j22]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ChongBCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ChongBCL15
Tze Yuang Chong, Rafael E. Banchs, Engsiong Chng, Haizhou Li:
Decoupling Word-Pair Distance and Co-occurrence Information for Effective Long History Context Language Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 23(7): 1221-1232 (2015)
[c125]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DoXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DoXCL15
Van Hai Do, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Distance metric learning for kernel density-based acoustic model under limited training data conditions. APSIPA 2015: 54-58
[c124]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YuXXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YuXXCL15
Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, Haizhou Li:
A density peak clustering approach to unsupervised acoustic subword units discovery. APSIPA 2015: 178-183
[c123]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ZhangHXCLD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZhangHXCLD15
Shaofei Zhang, Dong-Yan Huang, Lei Xie, Eng Siong Chng, Haizhou Li, Minghui Dong:
Non-negative matrix factorization using stable alternating direction method of multipliers for source separation. APSIPA 2015: 222-228
[c122]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/PhamXDCXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/PhamXDCXCL15
Van Tung Pham, Haihua Xu, Van Hai Do, Tze Yuang Chong, Xiong Xiao, Eng Siong Chng, Haizhou Li:
On the study of very low-resource language keyword search. APSIPA 2015: 358-364
[c121]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DoXXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DoXXCL15
Van Hai Do, Xiong Xiao, Haihua Xu, Eng Siong Chng, Haizhou Li:
Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation. APSIPA 2015: 594-98
[c120]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/VuBC15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/VuBC15
Thanh T. Vu, Benjamin Bigot, Engsiong Chng:
Speech enhancement using beamforming and non negative matrix factorization for robust speech recognition in the CHiME-3 challenge. ASRU 2015: 423-429
[c119]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ZhaoXZNZRWJCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ZhaoXZNZRWJCL15
Shengkui Zhao, Xiong Xiao, Zhaofeng Zhang, Thi Ngoc Tho Nguyen, Xionghu Zhong, Bo Ren, Longbiao Wang, Douglas L. Jones, Engsiong Chng, Haizhou Li:
Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction. ASRU 2015: 460-467
[c118]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/XuXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/XuXCL15
Haihua Xu, Xiong Xiao, Engsiong Chng, Haizhou Li:
On statistical machine translation method for lexicon refinement in speech recognition. ChinaSIP 2015: 25-29
[c117]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/TianDXXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/TianDXXCL15
Xiaohai Tian, Steven Du, Xiong Xiao, Haihua Xu, Engsiong Chng, Haizhou Li:
Detecting synthetic speech using long term magnitude and phase information. ChinaSIP 2015: 611-615
[c116]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/DuXC15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/DuXC15
Steven Du, Xiong Xiao, Engsiong Chng:
DNN feature compensation for noise robust speaker verification. ChinaSIP 2015: 871-875
[c115]
- view
  authority control:
- export record
  dblp key:
  - conf/cicling/ChikersalPCGS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cicling/ChikersalPCGS15
Prerna Chikersal, Soujanya Poria, Erik Cambria, Alexander F. Gelbukh, Chng Eng Siong:
Modelling Public Sentiment in Twitter: Using Linguistic Patterns to Enhance Supervised Learning. CICLing (2) 2015: 49-65
[c114]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoZZJCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoZZJCL15
Xiong Xiao, Shengkui Zhao, Xionghu Zhong, Douglas L. Jones, Engsiong Chng, Haizhou Li:
A learning-based approach to direction of arrival estimation in noisy and reverberant environments. ICASSP 2015: 2814-2818
[c113]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TianWLHCD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TianWLHCD15
Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Nguyen Quy Hy, Engsiong Chng, Minghui Dong:
Sparse representation for frequency warping based voice conversion. ICASSP 2015: 4235-4239
[c112]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuYXXLCYLWLMCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuYXXLCYLWLMCL15
Haihua Xu, Peng Yang, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Engsiong Chng, Haizhou Li:
Language independent query-by-example spoken term detection using N-best phone sequences and partial matching. ICASSP 2015: 5191-5195
[c111]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenNCSPXXLLLL015
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenNCSPXXLLLL015
Nancy F. Chen, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Van Tung Pham, Haihua Xu, Xiong Xiao, Tze Siong Lau, Su Jun Leow, Boon Pang Lim, Cheung-Chi Leung, Lei Wang, Chin-Hui Lee, Alvina Goh, Engsiong Chng, Bin Ma, Haizhou Li:
Low-resource keyword search strategies for tamil. ICASSP 2015: 5366-5370
[c110]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LeowCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LeowCL15
Su Jun Leow, Engsiong Chng, Chin-Hui Lee:
Language-resource independent speech segmentation using cues from a spectrogram image. ICASSP 2015: 5813-5817
[c109]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChongBCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChongBCL15
Tze Yuang Chong, Rafael E. Banchs, Engsiong Chng, Haizhou Li:
TDTO language modeling with feedforward neural networks. INTERSPEECH 2015: 1458-1462
[c108]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangHXCLD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangHXCLD15
Shaofei Zhang, Dong-Yan Huang, Lei Xie, Engsiong Chng, Haizhou Li, Minghui Dong:
Regularized non-negative matrix factorization using alternating direction method of multipliers and its application to source separation. INTERSPEECH 2015: 1498-1502
[c107]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoTDXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoTDXCL15
Xiong Xiao, Xiaohai Tian, Steven Du, Haihua Xu, Engsiong Chng, Haizhou Li:
Spoofing speech detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015 challenge. INTERSPEECH 2015: 2052-2056
[c106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuDXC15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuDXC15
Haihua Xu, Van Hai Do, Xiong Xiao, Engsiong Chng:
A comparative study of BNF and DNN multilingual training on cross-lingual low-resource speech recognition. INTERSPEECH 2015: 2132-2136
[c105]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TianWLHDC15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TianWLHDC15
Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Nguyen Quy Hy, Minghui Dong, Engsiong Chng:
System fusion for high-performance voice conversion. INTERSPEECH 2015: 2759-2763
[c104]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoZZJCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoZZJCL15
Xiong Xiao, Shengkui Zhao, Xionghu Zhong, Douglas L. Jones, Engsiong Chng, Haizhou Li:
Learning to estimate reverberation time in noisy and reverberant rooms. INTERSPEECH 2015: 3431-3435
[c103]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mediaeval/HouPL0XLXFNXCZS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mediaeval/HouPL0XLXFNXCZS15
Jingyong Hou, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Haihua Xu, Hang Lv, Lei Xie, Zhonghua Fu, Chongjia Ni, Xiong Xiao, Hongjie Chen, Shaofei Zhang, Sining Sun, Yougen Yuan, Pengcheng Li, Tin Lay Nwe, Sunil Sivadas, Bin Ma, Engsiong Chng, Haizhou Li:
The NNI Query-by-Example System for MediaEval 2015. MediaEval 2015
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/HyLTDC15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HyLTDC15
Nguyen Quy Hy, Siu Wa Lee, Xiaohai Tian, Minghui Dong, Engsiong Chng:
High quality voice conversion using prosodic and high-resolution spectral features. CoRR abs/1512.01809 (2015)
2014
[j21]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/DoXCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/DoXCL14
Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages. IEICE Trans. Inf. Syst. 97-D(2): 285-295 (2014)
[j20]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WuVCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuVCL14
Zhizheng Wu, Tuomas Virtanen, Engsiong Chng, Haizhou Li:
Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 22(10): 1506-1521 (2014)
[c102]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/HuangXXXSL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/HuangXXXSL14
Guangpu Huang, Chenglin Xu, Xiong Xiao, Lei Xie, Chng Eng Siong, Haizhou Li:
Multi-view features in a DNN-CRF model for improved sentence unit detection on English broadcast news. APSIPA 2014: 1-9
[c101]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WuGCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WuGCL14
Zhizheng Wu, Sheng Gao, Engsiong Chng, Haizhou Li:
A study on replay attack and anti-spoofing for text-dependent speaker verification. APSIPA 2014: 1-5
[c100]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/XuPCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/XuPCL14
Haihua Xu, Van Tung Pham, Engsiong Chng, Haizhou Li:
Towards better keyword search performance on Malay broadcast news data. APSIPA 2014: 1-5
[c99]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/fusion/ZhongWNC14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/fusion/ZhongWNC14
Xionghu Zhong, Wenwu Wang, Syed Mohsen Naqvi, Engsiong Chng:
A Bayesian performance bound for time-delay of arrival based acoustic source tracking in a reverberant environment. FUSION 2014: 1-8
[c98]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoLCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoLCL14
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li:
Feature compensation using linear combination of speaker and environment dependent correction vectors. ICASSP 2014: 1720-1724
[c97]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NguyenXCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NguyenXCL14
Duc Hoang Ha Nguyen, Xiong Xiao, Engsiong Chng, Haizhou Li:
Generalization of temporal filter and linear transformation for robust speech recognition. ICASSP 2014: 1730-1734
[c96]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DennisDLC14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DennisDLC14
Jonathan William Dennis, Tran Huy Dat, Haizhou Li, Engsiong Chng:
A discriminatively trained Hough Transform for frame-level phoneme recognition. ICASSP 2014: 2514-2518
[c95]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChongBCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChongBCL14
Tze Yuang Chong, Rafael E. Banchs, Engsiong Chng, Haizhou Li:
Improving language modeling by using distance and co-occurrence information of word-pairs and its application to LVCSR. ICASSP 2014: 4883-4887
[c94]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PhamXCSLCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PhamXCSLCL14
Van Tung Pham, Haihua Xu, Nancy F. Chen, Sunil Sivadas, Boon Pang Lim, Engsiong Chng, Haizhou Li:
Discriminative score normalization for keyword search decision. ICASSP 2014: 7078-7082
[c93]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoXSL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoXSL14
Van Hai Do, Xiong Xiao, Chng Eng Siong, Haizhou Li:
Kernel density-based acoustic model with cross-lingual bottleneck features for resource limited LVCSR. INTERSPEECH 2014: 6-10
[c92]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuSSL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuSSL14
Haihua Xu, Hang Su, Chng Eng Siong, Haizhou Li:
Semi-supervised training for bottle-neck feature based DNN-HMM hybrid systems. INTERSPEECH 2014: 2078-2082
[c91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuSL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuSL14
Zhizheng Wu, Chng Eng Siong, Haizhou Li:
Joint nonnegative matrix factorization for exemplar-based voice conversion. INTERSPEECH 2014: 2509-2513
[c90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DennisDS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DennisDS14
Jonathan William Dennis, Tran Huy Dat, Chng Eng Siong:
Analysis of spectrogram image methods for sound event classification. INTERSPEECH 2014: 2533-2537
[c89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuXHXCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuXHXCL14
Chenglin Xu, Lei Xie, Guangpu Huang, Xiong Xiao, Engsiong Chng, Haizhou Li:
A deep neural network approach for sentence boundary detection in broadcast news. INTERSPEECH 2014: 2887-2891
[c88]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/TianWLC14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/TianWLC14
Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Engsiong Chng:
Correlation-based frequency warping for voice conversion. ISCSLP 2014: 211-215
[c87]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/UedaWKXCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/UedaWKXCL14
Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng, Haizhou Li:
Single-channel dereverberation for distant-talking speech recognition by combining denoising autoencoder and temporal structure normalization. ISCSLP 2014: 379-383
[c86]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mediaeval/YangXXXLCYL0LMSL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mediaeval/YangXXXLCYL0LMSL14
Peng Yang, Haihua Xu, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Chng Eng Siong, Haizhou Li:
The NNI Query-by-Example System for MediaEval 2014. MediaEval 2014
[c85]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/PhamCSXCNCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/PhamCSXCNCL14
Van Tung Pham, Nancy F. Chen, Sunil Sivadas, Haihua Xu, I-Fan Chen, Chongjia Ni, Engsiong Chng, Haizhou Li:
System and keyword dependent fusion for spoken term detection. SLT 2014: 430-435
[e2]
- view
  authority control:
- export record
  dblp key:
  - conf/interspeech/2014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/2014
Haizhou Li, Helen M. Meng, Bin Ma, Engsiong Chng, Lei Xie:
15th Annual Conference of the International Speech Communication Association, INTERSPEECH 2014, Singapore, September 14-18, 2014. ISCA 2014 [contents]
2013
[j19]
- view
  authority control:
- export record
  dblp key:
  - journals/prl/DennisTC13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/prl/DennisTC13
Jonathan William Dennis, Tran Huy Dat, Engsiong Chng:
Overlapping sound event recognition using local spectrogram features and the generalised hough transform. Pattern Recognit. Lett. 34(9): 1085-1093 (2013)
[j18]
- view
  authority control:
- export record
  dblp key:
  - journals/spe/TanTCLLDCXN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spe/TanTCLLDCXN13
Yu Shyang Tan, Jiaqi Tan, Engsiong Chng, Bu-Sung Lee, Jiaming Li, Susumu Date, Hui Ping Chak, Xiong Xiao, Atsushi Narishige:
Hadoop framework: impact of data organization on performance. Softw. Pract. Exp. 43(11): 1241-1260 (2013)
[j17]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/DennisDC13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/DennisDC13
Jonathan William Dennis, Tran Huy Dat, Engsiong Chng:
Image Feature Representation of the Subband Power Distribution for Robust Sound Event Classification. IEEE Trans. Speech Audio Process. 21(2): 367-377 (2013)
[c84]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ChongBCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ChongBCL13
Tze Yuang Chong, Rafael E. Banchs, Engsiong Chng, Haizhou Li:
Modeling of term-distance and term-occurrence information for improving n-gram language model performance. ACL (2) 2013: 233-237
[c83]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/NgDDS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/NgDDS13
Wen Zheng Terence Ng, Tran Huy Dat, Jonathan William Dennis, Chng Eng Siong:
A robust sound event recognition framework under TV playing conditions. APSIPA 2013: 1-5
[c82]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/NgDHS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/NgDHS13
Wen Zheng Terence Ng, Tran Huy Dat, Huynh Thai Hoa, Chng Eng Siong:
Adaptive semi-supervised tree SVM for sound event recognition in home environments. APSIPA 2013: 1-4
[c81]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/NguyenMXCLL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/NguyenMXCLL13
Duc Hoang Ha Nguyen, Aleem Mushtaq, Xiong Xiao, Engsiong Chng, Haizhou Li, Chin-Hui Lee:
A particle filter compensation approach to robust LVCSR. APSIPA 2013: 1-7
[c80]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/TianWC13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/TianWC13
Xiaohai Tian, Zhizheng Wu, Engsiong Chng:
Local partial least square regression for spectral mapping in voice conversion. APSIPA 2013: 1-6
[c79]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/WuCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/WuCL13
Zhizheng Wu, Engsiong Chng, Haizhou Li:
Conditional restricted Boltzmann machine for voice conversion. ChinaSIP 2013: 104-108
[c78]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/LyuCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/LyuCL13
Dau-Cheng Lyu, Engsiong Chng, Haizhou Li:
Language diarization for conversational code-switch speech with pronunciation dictionary adaptation. ChinaSIP 2013: 147-150
[c77]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/NgDDS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/NgDDS13
Wen Zheng Terence Ng, Tran Huy Dat, Jonathan William Dennis, Chng Eng Siong:
Robust sound event recognition under TV playing conditions. ChinaSIP 2013: 332-336
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/XiaoCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/XiaoCL13
Xiong Xiao, Engsiong Chng, Haizhou Li:
Constrained adaptation of histogram equalization for robust speech recognition. ChinaSIP 2013: 360-364
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuXCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuXCL13
Zhizheng Wu, Xiong Xiao, Engsiong Chng, Haizhou Li:
Synthetic speech detection using temporal modulation feature. ICASSP 2013: 7234-7238
[c74]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LyuCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LyuCL13
Dau-Cheng Lyu, Engsiong Chng, Haizhou Li:
Language diarization for code-switch conversational speech. ICASSP 2013: 7314-7318
[c73]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoCL13
Xiong Xiao, Engsiong Chng, Haizhou Li:
Temporal filter design by minimum KL divergence criterion for robust speech recognition. ICASSP 2013: 7908-7912
[c72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoXCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoXCL13
Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Context-dependent phone mapping for LVCSR of under-resourced languages. INTERSPEECH 2013: 500-504
[c71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoCL13
Xiong Xiao, Engsiong Chng, Haizhou Li:
Attribute-based histogram equalization (HEQ) and its adaptation for robust speech recognition. INTERSPEECH 2013: 876-880
[c70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuLLCKL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuLLCKL13
Zhizheng Wu, Anthony Larcher, Kong-Aik Lee, Engsiong Chng, Tomi Kinnunen, Haizhou Li:
Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints. INTERSPEECH 2013: 950-954
[c69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuVKCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuVKCL13
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Exemplar-based unit selection for voice conversion utilizing temporal information. INTERSPEECH 2013: 3057-3061
[c68]
- view
  authority control:
- export record
  dblp key:
  - conf/ococosda/ChongXXTPLSL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ococosda/ChongXXTPLSL13
Tze Yuang Chong, Xiong Xiao, Haihua Xu, Tien Ping Tan, Chau Khoa Pham, Dau-Cheng Lyu, Chng Eng Siong, Haizhou Li:
The development and analysis of a Malay broadcasr news corpus. O-COCOSDA/CASLRE 2013: 1-5
[c67]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ssw/WuVKCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/WuVKCL13
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, Haizhou Li:
Exemplar-based voice conversion using non-negative spectrogram deconvolution. SSW 2013: 201-206
2012
[j16]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/WangXLMCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/WangXLMCL12
Xiaoxuan Wang, Lei Xie, Mimi Lu, Bin Ma, Engsiong Chng, Haizhou Li:
Broadcast News Story Segmentation Using Conditional Random Fields and Multimodal Features. IEICE Trans. Inf. Syst. 95-D(5): 1206-1215 (2012)
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/prl/DehzangiMCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/prl/DehzangiMCL12
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Discriminative feature extraction for speech recognition using continuous output codes. Pattern Recognit. Lett. 33(13): 1703-1709 (2012)
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/WuKCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/WuKCL12
Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for Voice Conversion. IEEE Signal Process. Lett. 19(12): 914-917 (2012)
[c66]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/WuKCLA12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WuKCLA12
Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li, Eliathamby Ambikairajah:
A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case. APSIPA 2012: 1-5
[c65]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/hytra/ChongBC12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hytra/ChongBC12
Tze Yuang Chong, Rafael E. Banchs, Eng Siong Chng:
An Empirical Evaluation of Stop Word Removal in Statistical Machine Translation. ESIRMT/HyTra@EACL 2012: 30-37
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/DoXCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/DoXCL12
Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
A Phone Mapping Technique for Acoustic Modeling of Under-Resourced Languages. IALP 2012: 233-236
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoLCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoLCL12
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li:
Lasso environment model combination for robust speech recognition. ICASSP 2012: 4305-4308
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoCL12
Xiong Xiao, Engsiong Chng, Haizhou Li:
Joint spectral and temporal normalization of features for robust recognition of noisy and reverberated speech. ICASSP 2012: 4325-4328
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KinnunenWLSCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KinnunenWLSCL12
Tomi Kinnunen, Zhizheng Wu, Kong-Aik Lee, Filip Sedlak, Engsiong Chng, Haizhou Li:
Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech. ICASSP 2012: 4401-4404
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VuLWTSBCSL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VuLWTSBCSL12
Ngoc Thang Vu, Dau-Cheng Lyu, Jochen Weiner, Dominic Telaar, Tim Schlippe, Fabian Blaicher, Engsiong Chng, Tanja Schultz, Haizhou Li:
A first speech recognition system for Mandarin-English code-switch conversational speech. ICASSP 2012: 4889-4892
[c59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuSL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuSL12
Zhizheng Wu, Chng Eng Siong, Haizhou Li:
Detecting Converted Speech and Natural Speech for anti-Spoofing Attack in Speaker Recognition. INTERSPEECH 2012: 1700-1703
[c58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DennisDC12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DennisDC12
Jonathan William Dennis, Tran Huy Dat, Engsiong Chng:
Overlapping Sound Event Recognition using Local Spectrogram Features with the Generalised Hough Transform. INTERSPEECH 2012: 2266-2269
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/DoXCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/DoXCL12
Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Context dependant phone mapping for cross-lingual acoustic modeling. ISCSLP 2012: 16-20
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/NguyenXCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/NguyenXCL12
Duc Hoang Ha Nguyen, Xiong Xiao, Chng Eng Siong, Haizhou Li:
An analysis of vector Taylor series model compensation for non-stationary noise in speech recognition. ISCSLP 2012: 131-135
[c55]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/sltu/WeinerVTMSLCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sltu/WeinerVTMSLCL12
Jochen Weiner, Ngoc Thang Vu, Dominic Telaar, Florian Metze, Tanja Schultz, Dau-Cheng Lyu, Engsiong Chng, Haizhou Li:
Integration of language identification into a recognition system for spoken conversations containing code-Switches. SLTU 2012: 76-79
2011
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/DehzangiMCL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/DehzangiMCL11
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Error Corrective Fusion of Classifier Scores for Spoken Language Recognition. IEICE Trans. Inf. Syst. 94-D(12): 2503-2512 (2011)
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoLCL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoLCL11
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li:
Maximum likelihood adaptation of histogram equalization with constraint for robust speech recognition. ICASSP 2011: 5480-5483
[c53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoLSL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoLSL11
Xiong Xiao, Jinyu Li, Chng Eng Siong, Haizhou Li:
Feature Normalization Using Structured Full Transforms for Robust Speech Recognition. INTERSPEECH 2011: 693-696
[c52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TongMLS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TongMLS11
Rong Tong, Bin Ma, Haizhou Li, Chng Eng Siong:
Target-Aware Lattice Rescoring for Dialect Recognition. INTERSPEECH 2011: 733-736
[c51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SamXBCLS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SamXBCLS11
Sethserey Sam, Xiong Xiao, Laurent Besacier, Eric Castelli, Haizhou Li, Chng Eng Siong:
Speech Modulation Features for Robust Nonnative Speech Accent Detection. INTERSPEECH 2011: 2417-2420
[c50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MehtaPS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MehtaPS11
Kannu Mehta, Chau Khoa Pham, Chng Eng Siong:
Linear Dynamic Models for Voice Activity Detection. INTERSPEECH 2011: 2617-2620
2010
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/prl/WangCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/prl/WangCL10
Lei Wang, Engsiong Chng, Haizhou Li:
A tree-construction search approach for multivariate time series motifs discovery. Pattern Recognit. Lett. 31(9): 869-875 (2010)
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/XiaoLCLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XiaoLCLL10
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li, Chin-Hui Lee:
A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 18(6): 1158-1169 (2010)
[c49]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/ZhangZLC10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhangZLC10
Hui Zhang, Min Zhang, Haizhou Li, Engsiong Chng:
Non-Isomorphic Forest Pair Translation. EMNLP 2010: 440-450
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DehzangiMCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DehzangiMCL10
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Error corrective classifier fusion for spoken Language Recognition. ICASSP 2010: 1994-1997
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/DehzangiMCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/DehzangiMCL10
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Framewise Phone Classification Using Weighted Fuzzy Classification Rules. ICPR 2010: 4186-4189
[c46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TongMLC10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TongMLC10
Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng:
Selecting phonotactic features for language recognition. INTERSPEECH 2010: 737-740
[c45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangXMCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangXMCL10
Xiaoxuan Wang, Lei Xie, Bin Ma, Engsiong Chng, Haizhou Li:
Phoneme lattice based texttiling towards multilingual story segmentation. INTERSPEECH 2010: 1305-1308
[c44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuKCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuKCL10
Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Text-independent F0 transformation with non-parallel data for voice conversion. INTERSPEECH 2010: 1732-1735
[c43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LyuTCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LyuTCL10
Dau-Cheng Lyu, Tien Ping Tan, Engsiong Chng, Haizhou Li:
SEAME: a Mandarin-English code-switching speech corpus in south-east asia. INTERSPEECH 2010: 1986-1989
[c42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DehzangiMCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DehzangiMCL10
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
A discriminative performance metric for GMM-UBM speaker identification. INTERSPEECH 2010: 2114-2117

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TongMLS09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TongMLS09
Rong Tong, Bin Ma, Haizhou Li, Chng Eng Siong:
A Target-Oriented Phonotactic Front-End for Spoken Language Recognition. IEEE Trans. Speech Audio Process. 17(7): 1335-1347 (2009)
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/XiaoLCLL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/XiaoLCLL09
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li, Chin-Hui Lee:
A study on hidden Markov model's generalization capability for speech recognition. ASRU 2009: 255-260
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NguyenLS09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NguyenLS09
Trung Hieu Nguyen, Haizhou Li, Chng Eng Siong:
Cluster criterion functions in spectral subspace and their application in speaker clustering. ICASSP 2009: 4085-4088
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiMLSZSYTKHPGLDNTEASSJ09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiMLSZSYTKHPGLDNTEASSJ09
Haizhou Li, Bin Ma, Kong-Aik Lee, Hanwu Sun, Donglai Zhu, Khe Chai Sim, Changhuai You, Rong Tong, Ismo Kärkkäinen, Chien-Lin Huang, Vladimir Pervouchine, Wu Guo, Yijie Li, Li-Rong Dai, Mohaddeseh Nosratighods, Tharmarajah Thiruvaran, Julien Epps, Eliathamby Ambikairajah, Chng Eng Siong, Tanja Schultz, Qin Jin:
The I4U system in NIST 2008 speaker recognition evaluation. ICASSP 2009: 4201-4204
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LongMLGSD09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LongMLGSD09
Yanhua Long, Bin Ma, Haizhou Li, Wu Guo, Chng Eng Siong, Li-Rong Dai:
Exploiting prosodic information for Speaker Recognition. ICASSP 2009: 4225-4228
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/WangSL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/WangSL09
Lei Wang, Chng Eng Siong, Haizhou Li:
Efficient sparse self-similarity matrix construction for repeating sequence detection. ICME 2009: 458-461
[c36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TongMLCL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TongMLCL09
Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng, Kong-Aik Lee:
Target-aware language models for spoken language recognition. INTERSPEECH 2009: 200-203
[c35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DehzangiMCL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DehzangiMCL09
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Discriminative feature transformation using output coding for speech recognition. INTERSPEECH 2009: 2979-2982
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/ism/YounessianRS09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ism/YounessianRS09
Ehsan Younessian, Deepu Rajan, Chng Eng Siong:
Improved Keypoint Matching Method for Near-Duplicate Keyframe Retrieval. ISM 2009: 298-303
2008
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/asc/CheokZC08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/asc/CheokZC08
Adrian David Cheok, Jian Zhang, Chng Eng Siong:
Efficient mobile phone Chinese optical character recognition systems by use of heuristic fuzzy rules and bigram Markov language models. Appl. Soft Comput. 8(2): 1005-1017 (2008)
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/mms/WangXCLT08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mms/WangXCLT08
Jinjun Wang, Changsheng Xu, Engsiong Chng, Hanqing Lu, Qi Tian:
Automatic composition of broadcast sports video. Multim. Syst. 14(4): 179-193 (2008)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/XiaoSL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XiaoSL08
Xiong Xiao, Chng Eng Siong, Haizhou Li:
Normalization of the Speech Modulation Spectra for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 16(8): 1662-1674 (2008)
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/csse/TanTCG08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/csse/TanTCG08
Choon-Ching Tan, Su-Lim Tan, Chng Eng Siong, Wooi-Boon Goh:
MICRO-EBLOCK: A Modular Platform for Embedded System Education. CSSE (5) 2008: 299-303
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TongMLC08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TongMLC08
Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng:
Target-oriented phone tokenizers for spoken language recognition. ICASSP 2008: 4221-4224
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/DehzangiMCL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/DehzangiMCL08
Omid Dehzangi, Bin Ma, Chng Eng Siong, Haizhou Li:
Fuzzy rule selection using Iterative Rule Learning for speech data classification. ICPR 2008: 1-4
[c30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NguyenCL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NguyenCL08
Trung Hieu Nguyen, Engsiong Chng, Haizhou Li:
T-test distance and clustering criterion for speaker diarization. INTERSPEECH 2008: 36-39
[c29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TongMLC08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TongMLC08
Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng:
Target-oriented phone selection from universal phone set for spoken language recognition. INTERSPEECH 2008: 715-718
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/XiaoSL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XiaoSL08
Xiong Xiao, Chng Eng Siong, Haizhou Li:
Effect of Feature Smoothing for Robust Speech Recognition. ISCSLP 2008: 73-76
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/DehzangiMSL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/DehzangiMSL08
Omid Dehzangi, Bin Ma, Chng Eng Siong, Haizhou Li:
Discriminative Output Coding Features for Speech Recognition. ISCSLP 2008: 89-92
2007
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/XiaoSL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/XiaoSL07
Xiong Xiao, Chng Eng Siong, Haizhou Li:
Temporal Structure Normalization of Speech Feature for Robust Speech Recognition. IEEE Signal Process. Lett. 14(7): 500-503 (2007)
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/WangCXLT07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/WangCXLT07
Jinjun Wang, Engsiong Chng, Changsheng Xu, Hanqing Lu, Qi Tian:
Generation of Personalized Music Sports Video Using Multimodal Cues. IEEE Trans. Multim. 9(3): 576-588 (2007)
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/clear/KohSNNMCLR07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/clear/KohSNNMCLR07
Chin-Wei Eugene Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Chng Eng Siong, Haizhou Li, Susanto Rahardja:
Speaker Diarization Using Direction of Arrival Estimate and Acoustic Feature Information: The I2R-NTU Submission for the NIST RT 2007 Evaluation. CLEAR 2007: 484-496
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TongLMCC07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TongLMCC07
Rong Tong, Haizhou Li, Bin Ma, Engsiong Chng, Siu-Yeung Cho:
Spoken Language Recognition with Relevance Feedback. ICASSP (4) 2007: 861-864
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoCL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoCL07
Xiong Xiao, Engsiong Chng, Haizhou Li:
Normalizing the Speech Modulation Spectrum for Robust Speech Recognition. ICASSP (4) 2007: 1021-1024
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/WangLC07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/WangLC07
Lei Wang, Haizhou Li, Engsiong Chng:
A Vector-Based Approach to Broadcast Audio Database Indexing and Retrieval. ICME 2007: 512-515
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icpads/BaiCB07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpads/BaiCB07
Yunfei Bai, Chng Eng Siong, Gorthi Prashant Bhanu:
An MCU description methodology for initialization code generation software. ICPADS 2007: 1-7
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoCL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoCL07
Xiong Xiao, Engsiong Chng, Haizhou Li:
Evaluating the temporal structure normalisation technique on the Aurora-4 task. INTERSPEECH 2007: 1070-1073
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KohSNNMCLR07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KohSNNMCLR07
Chin-Wei Eugene Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Engsiong Chng, Haizhou Li, Susanto Rahardja:
Using direction of arrival estimate and acoustic feature information in speaker diarization. INTERSPEECH 2007: 2149-2152
2006
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TongMZLC06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TongMZLC06
Rong Tong, Bin Ma, Donglai Zhu, Haizhou Li, Engsiong Chng:
Integrating Acoustic, Prosodic and Phonotactic Features for Spoken Language Identification. ICASSP (1) 2006: 205-208
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/WangCXLT06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/WangCXLT06
Jinjun Wang, Engsiong Chng, Changsheng Xu, Hanqing Lu, Xiaofeng Tong:
Identify Sports Video Shots with "Happy" or "Sad" Emotions. ICME 2006: 877-880
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/WangCX06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/WangCX06
Jinjun Wang, Engsiong Chng, Changsheng Xu:
Fully and Semi-Automatic Music Sports Video Composition. ICME 2006: 1897-1900
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/WangXC06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/WangXC06
Jinjun Wang, Changsheng Xu, Engsiong Chng:
Automatic Sports Video Genre Classification using Pseudo-2D-HMM. ICPR (4) 2006: 778-781
[c15]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/KinnunenK00C06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/KinnunenK00C06
Tomi Kinnunen, Chin-Wei Eugene Koh, Lei Wang, Haizhou Li, Eng Siong Chng:
Temporal Discrete Cosine Transform: Towards Longer Term Temporal Features for Speaker Verification. ISCSLP 2006
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/XiaoLC06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XiaoLC06
Xiong Xiao, Haizhou Li, Engsiong Chng:
Vector Autoregressive Model for Missing Feature Reconstruction. ISCSLP (Selected Papers) 2006: 315-324
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LeeSTMDYZKWKEL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LeeSTMDYZKWKEL06
Kong-Aik Lee, Hanwu Sun, Rong Tong, Bin Ma, Minghui Dong, Changhuai You, Donglai Zhu, Chin-Wei Eugene Koh, Lei Wang, Tomi Kinnunen, Chng Eng Siong, Haizhou Li:
The IIR Submission to CSLP 2006 Speaker Recognition Evaluation. ISCSLP (Selected Papers) 2006: 494-505
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/TongMLYZKSDCL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/TongMLYZKSDCL06
Rong Tong, Bin Ma, Kong-Aik Lee, Changhuai You, Donglai Zhu, Tomi Kinnunen, Hanwu Sun, Minghui Dong, Chng Eng Siong, Haizhou Li:
Fusion of Acoustic and Tokenization Features for Speaker Recognition. ISCSLP (Selected Papers) 2006: 566-577
[e1]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/2006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/2006
Qiang Huo, Bin Ma, Chng Eng Siong, Haizhou Li:
Chinese Spoken Language Processing, 5th International Symposium, ISCSLP 2006, Singapore, December 13-16, 2006, Selected Papers. Lecture Notes in Computer Science 4274, Springer 2006, ISBN 3-540-49665-3 [contents]
2005
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/ijautcomp/ChngC05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijautcomp/ChngC05
Eng Siong Chng, Sheng Chen:
Determining the optimal decision delay parameter for a linear equalizer. Int. J. Autom. Comput. 2(1): 20-24 (2005)
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangCX05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangCX05
Jinjun Wang, Engsiong Chng, Changsheng Xu:
Soccer replay detection using scene transition structure analysis. ICASSP (2) 2005: 433-436
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/YuHYC05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/YuHYC05
Xinguo Yu, Tze Sen Hay, Xin Yan, Engsiong Chng:
A Player-Possession Acquisition System for Broadcast Soccer Video. ICME 2005: 522-525
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangXSDWT05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangXSDWT05
Jinjun Wang, Changsheng Xu, Chng Eng Siong, Ling-Yu Duan, Kongwah Wan, Qi Tian:
Automatic generation of personalized music sports video. ACM Multimedia 2005: 735-744
2004
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icc/ChenC04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icc/ChenC04
Sheng Chen, Engsiong Chng:
Concurrent constant modulus algorithm and soft decision directed scheme for fractionally-spaced blind equalization. ICC 2004: 2342-2346
[c7]
- no documents available
  - details & citations
- export record
  dblp key:
  - conf/icip/WangXSYT04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icip/WangXSYT04
Jinjun Wang, Changsheng Xu, Chng Eng Siong, Xinguo Yu, Qi Tian:
Event detection based on non-broadcast sports video. ICIP 2004: 1637-1640
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/WangT04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/WangT04
Jinjun Wang, Changsheng Xu, Chng Eng Siong, Qi Tian:
Sports highlight detection from keyword sequences using HMM. ICME 2004: 599-602
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/XuGSRTW04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/XuGSRTW04
Wenjie Xu, Cuntai Guan, Chng Eng Siong, S. Ranganatha, M. Thulasidas, Jiankang Wu:
High Accuracy Classification of EEG Signal. ICPR (2) 2004: 391-394
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangXSWT04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangXSWT04
Jinjun Wang, Changsheng Xu, Chng Eng Siong, Kongwah Wan, Qi Tian:
Automatic replay generation for soccer video broadcasting. ACM Multimedia 2004: 32-39
2000
[c3]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/0005C000
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/0005C000
Min Zhang, Engsiong Chng, Haizhou Li:
Semi-class-based N-gram Language Modeling for Chinese Dictation. ISCSLP 2000

1990 – 1999

see FAQ

What is the meaning of the colors in the publication lists?

1996
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/ChngYB96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ChngYB96
Eng Siong Chng, Howard Hua Yang, Siegfried Bös:
Orthogonal least-squares learning algorithm with local adaptation process for the radial basis function networks. IEEE Signal Process. Lett. 3(8): 253-255 (1996)
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/ChngCM96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/ChngCM96
Engsiong Chng, Sheng Chen, Bernard Mulgrew:
Gradient radial basis function networks for nonlinear and nonstationary time series prediction. IEEE Trans. Neural Networks 7(1): 190-194 (1996)
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icnn/BosC96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icnn/BosC96
Siegfried Bös, Eng Siong Chng:
Using weight decay to optimize the generalization ability of a perceptron. ICNN 1996: 241-246
1995
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tsp/ChngCM95
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tsp/ChngCM95
Engsiong Chng, Sheng Chen, Bernard Mulgrew:
Efficient computational schemes for the orthogonal least squares algorithm. IEEE Trans. Signal Process. 43(1): 373-376 (1995)
1994
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Chng0M94
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Chng0M94
Engsiong Chng, Sheng Chen, Bernard Mulgrew:
Reducing the computational requirement of the orthogonal least squares algorithm. ICASSP (3) 1994: 529-532

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.