


Остановите войну!
for scientists:


default search action
Chin-Hui Lee
Person information

Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2022
- [c349]Hu Hu, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. ICASSP 2022: 4041-4045 - [c348]Hengshun Zhou, Jun Du, Chao-Han Huck Yang, Shifu Xiong, Chin-Hui Lee:
A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning. ICASSP 2022: 7572-7576 - [c347]Shutong Niu, Jun Du, Lei Sun, Chin-Hui Lee:
Improving Separation-Based Speaker Diarization Via Iterative Model Refinement And Speaker Embedding Based Post-Processing. ICASSP 2022: 8387-8391 - [c346]Maokui He, Xiang Lv, Weilin Zhou, Jingjing Yin, Xiaoqi Zhang, Yuxuan Wang, Shutong Niu, Yuhang Cao, Heng Lu, Jun Du, Chin-Hui Lee:
The USTC-Ximalaya System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription (M2met) Challenge. ICASSP 2022: 9166-9170 - [c345]Hang Chen, Hengshun Zhou, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe
, Sabato Marco Siniscalchi, Odette Scharenborg
, Diyuan Liu, Bao-Cai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results. ICASSP 2022: 9266-9270 - [c344]Hengshun Zhou, Jun Du, Gongzhen Zou, Zhaoxu Nian, Chin-Hui Lee, Sabato Marco Siniscalchi, Shinji Watanabe, Odette Scharenborg, Jingdong Chen, Shifu Xiong, Jianqing Gao:
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis. INTERSPEECH 2022: 1111-1115 - [c343]Mao-Kui He, Jun Du, Chin-Hui Lee:
End-to-End Audio-Visual Neural Speaker Diarization. INTERSPEECH 2022: 1461-1465 - [c342]Hang Chen, Jun Du, Yusheng Dai, Chin-Hui Lee, Sabato Marco Siniscalchi, Shinji Watanabe, Odette Scharenborg, Jingdong Chen, Baocai Yin, Jia Pan:
Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis. INTERSPEECH 2022: 1766-1770 - [c341]Yajian Wang, Jun Du, Hang Chen, Qing Wang, Chin-Hui Lee:
Deep Segment Model for Acoustic Scene Classification. INTERSPEECH 2022: 4177-4181 - [i33]Maokui He, Xiang Lv, Weilin Zhou, Jingjing Yin, Xiaoqi Zhang, Yuxuan Wang, Shutong Niu, Yuhang Cao, Heng Lu, Jun Du, Chin-Hui Lee:
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge. CoRR abs/2202.04855 (2022) - [i32]Hengshun Zhou, Jun Du, Chao-Han Huck Yang, Shifu Xiong, Chin-Hui Lee:
A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning. CoRR abs/2202.08509 (2022) - [i31]Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification. CoRR abs/2203.04114 (2022) - [i30]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. CoRR abs/2210.05614 (2022) - [i29]Chao-Han Huck Yang, Jun Qi, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition. CoRR abs/2210.06382 (2022) - [i28]Qing Wang, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee:
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function. CoRR abs/2210.14581 (2022) - [i27]Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. CoRR abs/2211.01263 (2022) - 2021
- [j124]Hang Chen
, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating subword articulation with lip shapes for embedding aware audio-visual speech enhancement. Neural Networks 143: 171-182 (2021) - [j123]Li Chai
, Jun Du
, Qing-Feng Liu, Chin-Hui Lee
:
A Cross-Entropy-Guided Measure (CEGM) for Assessing Speech Recognition Performance and Optimizing DNN-Based Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 29: 106-117 (2021) - [j122]Hengshun Zhou
, Jun Du
, Yuanyuan Zhang, Qing Wang
, Qing-Feng Liu, Chin-Hui Lee
:
Information Fusion in Attention Networks Using Adaptive and Multi-Level Factorized Bilinear Pooling for Audio-Visual Emotion Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2617-2629 (2021) - [c340]Koen Oostermeijer, Jun Du, Qing Wang, Chin-Hui Lee:
Speech Enhancement Autoencoder with Hierarchical Latent Structure. ICASSP 2021: 671-675 - [c339]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
A Two-Stage Approach to Device-Robust Acoustic Scene Classification. ICASSP 2021: 845-849 - [c338]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition. ICASSP 2021: 6523-6527 - [c337]Zhaoxu Nian, Yan-Hui Tu, Jun Du, Chin-Hui Lee:
A Progressive Learning Approach to Adaptive Noise and Speech Estimation for Speech Enhancement and Noisy Speech Recognition. ICASSP 2021: 6913-6917 - [c336]Hengshun Zhou, Jun Du, Hang Chen, Zijun Jing, Shifu Xiong, Chin-Hui Lee:
Audio-Visual Information Fusion Using Cross-Modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments. Interspeech 2021: 341-345 - [c335]Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification. Interspeech 2021: 881-885 - [c334]Xiaoqi Zhang, Jun Du, Li Chai, Chin-Hui Lee:
A Maximum Likelihood Approach to SNR-Progressive Learning Using Generalized Gaussian Distribution for LSTM-Based Speech Enhancement. Interspeech 2021: 2701-2705 - [c333]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries. Interspeech 2021: 3001-3005 - [c332]Yu-Xuan Wang, Jun Du, Maokui He, Shutong Niu, Lei Sun, Chin-Hui Lee:
Scenario-Dependent Speaker Diarization for DIHARD-III Challenge. Interspeech 2021: 3106-3110 - [c331]Qing Wang, Huaxin Wu, Zijun Jing, Feng Ma, Yi Fang, Yuxuan Wang, Tairan Chen, Jia Pan, Jun Du, Chin-Hui Lee:
A Model Ensemble Approach for Sound Event Localization and Detection. ISCSLP 2021: 1-5 - [c330]Siyuan Zheng, Jun Du, Hengshun Zhou, Xue Bai, Chin-Hui Lee, Shipeng Li:
Speech Emotion Recognition Based on Acoustic Segment Model. ISCSLP 2021: 1-5 - [c329]Li Chai, Jun Du, Diyuan Liu, Yanhui Tu, Chin-Hui Lee:
Acoustic Modeling for Multi-Array Conversational Speech Recognition in the Chime-6 Challenge. SLT 2021: 912-918 - [c328]Zhengyu Zhou, In Gyu Choi, Yongliang He, Vikas Yadav, Chin-Hui Lee:
Using Paralinguistic Information to Disambiguate User Intentions for Distinguishing Phrase Structure and Sarcasm in Spoken Dialog Systems. SLT 2021: 1020-1027 - [i26]Qing Wang, Jun Du, Huaxin Wu, Jia Pan, Feng Ma, Chin-Hui Lee:
A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection. CoRR abs/2101.02919 (2021) - [i25]Yuxuan Wang, Mao-Kui He, Shutong Niu, Lei Sun, Tian Gao, Xin Fang, Jia Pan, Jun Du, Chin-Hui Lee:
USTC-NELSLIP System Description for DIHARD-III Challenge. CoRR abs/2103.10661 (2021) - [i24]Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification. CoRR abs/2104.01271 (2021) - [i23]Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Qing Wang, Yuyang Wang, Xianjun Xia, Yuanjun Zhao, Yuzhong Wu, Yannan Wang, Jun Du, Chin-Hui Lee:
A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification. CoRR abs/2107.01461 (2021) - [i22]Shutong Niu, Jun Du, Lei Sun, Chin-Hui Lee:
Separation Guided Speaker Diarization in Realistic Mismatched Conditions. CoRR abs/2107.02357 (2021) - [i21]Hu Hu, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. CoRR abs/2110.08598 (2021) - [i20]Hengshun Zhou, Jun Du, Yuanyuan Zhang, Qing Wang, Qing-Feng Liu, Chin-Hui Lee:
Information Fusion in Attention Networks Using Adaptive and Multi-level Factorized Bilinear Pooling for Audio-visual Emotion Recognition. CoRR abs/2111.08910 (2021) - 2020
- [j121]Jun Qi
, Jun Du
, Sabato Marco Siniscalchi
, Xiaoli Ma
, Chin-Hui Lee
:
On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression. IEEE Signal Process. Lett. 27: 1485-1489 (2020) - [j120]Yanhui Tu
, Jun Du
, Tian Gao
, Chin-Hui Lee
:
A Multi-Target SNR-Progressive Learning Approach to Regression Based Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1608-1619 (2020) - [j119]Jun Qi
, Jun Du
, Sabato Marco Siniscalchi
, Xiaoli Ma
, Chin-Hui Lee
:
Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network-Based Vector-to-Vector Regression. IEEE Trans. Signal Process. 68: 3411-3422 (2020) - [c327]Jun Qi, Xiaoli Ma, Chin-Hui Lee, Jun Du, Sabato Marco Siniscalchi:
Performance Analysis for Tensor-Train Decomposition to Deep Neural Network Based Vector-to-Vector Regression. CISS 2020: 1-6 - [c326]Xue Bai, Jun Du, Jia Pan, Hengshun Zhou, Yanhui Tu, Chin-Hui Lee:
High-Resolution Attention Network with Acoustic Segment Model for Acoustic Scene Classification. ICASSP 2020: 656-660 - [c325]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Chin-Hui Lee:
Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement. ICASSP 2020: 3107-3111 - [c324]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Yi Ouyang, I-Te Danny Hung, Chin-Hui Lee, Xiaoli Ma:
Enhanced Adversarial Strategically-Timed Attacks Against Deep Reinforcement Learning. ICASSP 2020: 3407-3411 - [c323]Sicheng Wang, Wei Li, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Cross-Task Transfer Learning Approach to Adapting Deep Speech Enhancement Models to Unseen Background Noise Using Paired Senone Classifiers. ICASSP 2020: 6219-6223 - [c322]Shutong Niu, Jun Du, Li Chai, Chin-Hui Lee:
A Maximum Likelihood Approach to Multi-Objective Learning Using Generalized Gaussian Distributions for Dnn-Based Speech Enhancement. ICASSP 2020: 6229-6233 - [c321]Yanhui Tu, Jun Du, Chin-Hui Lee:
2D-to-2D Mask Estimation for Speech Enhancement Based on Fully Convolutional Neural Network. ICASSP 2020: 6664-6668 - [c320]Lei Sun, Jun Du, Xueyang Zhang, Tian Gao, Xin Fang, Chin-Hui Lee:
Progressive Multi-Target Network Based Speech Enhancement with Snr-Preselection for Robust Speaker Diarization. ICASSP 2020: 7099-7103 - [c319]Xin Wang, Jun Du, Alejandrina Cristià, Lei Sun, Chin-Hui Lee:
A Study of Child Speech Extraction Using Joint Speech Enhancement and Separation in Realistic Conditions. ICASSP 2020: 7304-7308 - [c318]Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee:
L-Vector: Neural Label Embedding for Domain Adaptation. ICASSP 2020: 7389-7393 - [c317]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Tensor-To-Vector Regression for Multi-Channel Speech Enhancement Based on Tensor-Train Network. ICASSP 2020: 7504-7508 - [c316]Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee:
Geometry Constrained Progressive Learning for Lstm-Based Speech Enhancement. ICASSP 2020: 7514-7518 - [c315]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement. INTERSPEECH 2020: 76-80 - [c314]Yanhui Tu, Jun Du, Lei Sun, Feng Ma, Jia Pan, Chin-Hui Lee:
A Space-and-Speaker-Aware Iterative Mask Estimation Approach to Multi-Channel Speech Recognition in the CHiME-6 Challenge. INTERSPEECH 2020: 96-100 - [c313]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification. INTERSPEECH 2020: 1196-1200 - [c312]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Xue Bai, Jun Du, Chin-Hui Lee:
An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances. INTERSPEECH 2020: 1201-1205 - [c311]Hengshun Zhou, Jun Du, Yanhui Tu, Chin-Hui Lee:
Using Speech Enhancement Preprocessing for Speech Emotion Recognition in Realistic Noisy Conditions. INTERSPEECH 2020: 4098-4102 - [c310]Yu-Xuan Wang, Jun Du, Li Chai, Chin-Hui Lee, Jia Pan:
A Noise-Aware Memory-Attention Network Architecture for Regression-Based Speech Enhancement. INTERSPEECH 2020: 4501-4505 - [i19]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Tensor-to-Vector Regression for Multi-channel Speech Enhancement based on Tensor-Train Network. CoRR abs/2002.00544 (2020) - [i18]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Yi Ouyang, I-Te Danny Hung, Chin-Hui Lee, Xiaoli Ma:
Enhanced Adversarial Strategically-Timed Attacks against Deep Reinforcement Learning. CoRR abs/2002.09027 (2020) - [i17]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Chin-Hui Lee:
Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement. CoRR abs/2003.13917 (2020) - [i16]Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee:
L-Vector: Neural Label Embedding for Domain Adaptation. CoRR abs/2004.13480 (2020) - [i15]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation. CoRR abs/2007.08389 (2020) - [i14]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement. CoRR abs/2007.13024 (2020) - [i13]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Xue Bai, Jun Du, Chin-Hui Lee:
An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances. CoRR abs/2008.00107 (2020) - [i12]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification. CoRR abs/2008.00110 (2020) - [i11]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression. CoRR abs/2008.05459 (2020) - [i10]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression. CoRR abs/2008.07281 (2020) - [i9]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement. CoRR abs/2009.09561 (2020) - [i8]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition. CoRR abs/2010.13309 (2020) - [i7]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
A Two-Stage Approach to Device-Robust Acoustic Scene Classification. CoRR abs/2011.01447 (2020) - [i6]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Chin-Hui Lee, Bao-Cai Yin:
Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention. CoRR abs/2012.14360 (2020)
2010 – 2019
- 2019
- [j118]Lei Sun
, Jun Du
, Tian Gao
, Yi Fang, Feng Ma, Chin-Hui Lee:
A Speaker-Dependent Approach to Separation of Far-Field Multi-Talker Microphone Array Speech for Front-End Processing in the CHiME-5 Challenge. IEEE J. Sel. Top. Signal Process. 13(4): 827-840 (2019) - [j117]Yanhui Tu, Jun Du, Lei Sun, Feng Ma, Hai-Kun Wang, Jingdong Chen
, Chin-Hui Lee:
An iterative mask estimation approach to deep learning based multi-channel speech recognition. Speech Commun. 106: 31-43 (2019) - [j116]Li Chai
, Jun Du
, Qing-Feng Liu, Chin-Hui Lee
:
Using Generalized Gaussian Distributions to Improve Regression Error Modeling for Deep Learning-Based Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 1919-1931 (2019) - [j115]Jun Qi
, Jun Du
, Sabato Marco Siniscalchi
, Chin-Hui Lee
:
A Theory on Deep Neural Network Based Vector-to-Vector Regression With an Illustration of Its Expressive Power in Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 1932-1943 (2019) - [j114]Wei Li
, Nancy F. Chen
, Sabato Marco Siniscalchi
, Chin-Hui Lee:
Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2012-2024 (2019) - [j113]Yanhui Tu
, Jun Du
, Chin-Hui Lee
:
Speech Enhancement Based on Teacher-Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2080-2091 (2019) - [c309]Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee:
A LSTM-Based Joint Progressive Learning Framework for Simultaneous Speech Dereverberation and Denoising. APSIPA 2019: 274-278 - [c308]Nan Zhou, Jun Du, Yanhui Tu, Tian Gao, Chin-Hui Lee:
A Speech Enhancement Neural Network Architecture with SNR-Progressive Multi-Target Learning for Robust Speech Recognition. APSIPA 2019: 873-877 - [c307]Yanhui Tu, Jun Du, Chin-Hui Lee:
DNN Training Based on Classic Gain Function for Single-channel Speech Enhancement and Recognition. ICASSP 2019: 910-914 - [c306]Wei Li, Sicheng Wang, Ming Lei, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Audio-visual Speech Recognition Performance with Cross-modal Student-teacher Training. ICASSP 2019: 6560-6564 - [c305]Lei Sun, Jun Du, Tian Gao, Yi Fang, Feng Ma, Jia Pan, Chin-Hui Lee:
A Two-stage Single-channel Speaker-dependent Speech Separation Approach for Chime-5 Challenge. ICASSP 2019: 6650-6654 - [c304]Feng Ma, Li Chai, Jun Du, Diyuan Liu, Zhongfu Ye, Chin-Hui Lee:
Acoustic Model Ensembling Using Effective Data Augmentation for CHiME-5 Challenge. INTERSPEECH 2019: 1258-1262 - [c303]Li Chai, Jun Du, Chin-Hui Lee:
KL-Divergence Regularized Deep Neural Network Adaptation for Low-Resource Speaker-Dependent Speech Enhancement. INTERSPEECH 2019: 1806-1810 - [c302]Li Chai, Jun Du, Chin-Hui Lee:
A Cross-Entropy-Guided (CEG) Measure for Speech Enhancement Front-End Assessing Performances of Back-End Automatic Speech Recognition. INTERSPEECH 2019: 3431-3435 - [c301]Xue Bai, Jun Du, Zi-Rui Wang, Chin-Hui Lee:
A Hybrid Approach to Acoustic Scene Classification Based on Universal Acoustic Models. INTERSPEECH 2019: 3619-3623 - 2018
- [j112]Jing Zhang, Yakun Mu, Sheng-Wei Feng, Kehuang Li, Yubo Yuan, Chin-Hui Lee:
Image region annotation based on segmentation and semantic correlation analysis. IET Image Process. 12(8): 1331-1337 (2018) - [j111]Qing Wang
, Jun Du
, Li-Rong Dai, Chin-Hui Lee:
A Multiobjective Learning and Ensembling Approach to High-Performance Speech Enhancement With Compact Neural Network Architectures. IEEE ACM Trans. Audio Speech Lang. Process. 26(7): 1181-1193 (2018) - [j110]Sadia Shakil
, Jacob C. Billings
, Shella D. Keilholz, Chin-Hui Lee:
Parametric Dependencies of Sliding Window Correlation. IEEE Trans. Biomed. Eng. 65(2): 254-263 (2018) - [j109]Yanhui Tu, Jun Du, Chin-Hui Lee:
A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech. J. Signal Process. Syst. 90(7): 963-973 (2018) - [j108]Zhengqi Wen, Kehuang Li, Zhen Huang, Chin-Hui Lee, Jianhua Tao:
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning. J. Signal Process. Syst. 90(7): 1025-1037 (2018) - [j107]Ju Lin, Wei Li, Yingming Gao, Yanlu Xie, Nancy F. Chen, Sabato Marco Siniscalchi, Jinsong Zhang
, Chin-Hui Lee:
Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks. J. Signal Process. Syst. 90(7): 1077-1087 (2018) - [c300]Yanhui Tu, Jun Du, Nan Zhou, Chin-Hui Lee:
Online LSTM-based Iterative Mask Estimation for Multi-Channel Speech Enhancement and ASR. APSIPA 2018: 362-366 - [c299]Han Zhao, Shuayb Zarar, Ivan Tashev, Chin-Hui Lee:
Convolutional-Recurrent Neural Networks for Speech Enhancement. ICASSP 2018: 2401-2405 - [c298]Yanhui Tu, Ivan Tashev, Shuayb Zarar, Chin-Hui Lee:
A Hybrid Approach to Combining Conventional and Deep Learning Techniques for Single-Channel Speech Enhancement and Recognition. ICASSP 2018: 2531-2535 - [c297]Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Densely Connected Progressive Learning for LSTM-Based Speech Enhancement. ICASSP 2018: 5054-5058 - [c296]Lei Sun, Jun Du, Tian Gao, Yu-Ding Lu, Yu Tsao
, Chin-Hui Lee, Neville Ryant:
A Novel LSTM-Based Speech Preprocessor for Speaker Diarization in Realistic Mismatch Conditions. ICASSP 2018: 5234-5238 - [c295]Wei Li, Nancy F. Chen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Mandarin Tone Mispronunciation Detection for Non-Native Learners with Soft-Target Tone Labels and BLSTM-Based Deep Models. ICASSP 2018: 6249-6253 - [c294]Lei Sun, Jun Du, Chao Jiang, Xueyang Zhang, Shan He, Bing Yin, Chin-Hui Lee:
Speaker Diarization with Enhancing Speech for the First DIHARD Challenge. INTERSPEECH 2018: 2793-2797 - [c293]Li Chai, Jun Du, Chin-Hui Lee:
Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement. INTERSPEECH 2018: 3269-3273 - [c292]Quandong Wang, Sicheng Wang, Fengpei Ge, Chang Woo Han, Jaewon Lee, Lianghao Guo, Chin-Hui Lee:
Two-Stage Enhancement of Noisy and Reverberant Microphone Array Speech for Automatic Speech Recognition Systems Trained with Only Clean Speech. ISCSLP 2018: 21-25 - [c291]Xin Wang, Jun Du, Lei Sun, Qing Wang, Chin-Hui Lee:
A Progressive Deep Learning Approach to Child Speech Separation. ISCSLP 2018: 76-80 - [c290]Qing Wang, Jun Du, Li Chai, Li-Rong Dai, Chin-Hui Lee:
A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network. ISCSLP 2018: 295-299 - [i5]Han Zhao, Shuayb Zarar, Ivan Tashev, Chin-Hui Lee:
Convolutional-Recurrent Neural Networks for Speech Enhancement. CoRR abs/1805.00579 (2018) - [i4]Li Chai, Jun Du, Chin-Hui Lee:
Acoustics-guided evaluation (AGE): a new measure for estimating performance of speech enhancement algorithms for robust ASR. CoRR abs/1811.11517 (2018) - 2017
- [j106]