Остановите войну!
for scientists:
default search action
Chin-Hui Lee 0001
Person information
- affiliation: Georgia Institute of Technology, School of Electrical and Computer Engineering, USA
- affiliation (1981-2001): Bell Laboratories, Dialogue Systems Research Department, Murray Hill, New Jersey, NY, USA
Other persons with the same name
- Chin-Hui Lee — disambiguation page
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i39]Hu Hu, Sabato Marco Siniscalchi, Chin-Hui Lee:
Bayesian adaptive learning to latent variables via Variational Bayes and Maximum a Posteriori. CoRR abs/2401.13766 (2024) - 2023
- [j44]Shi Cheng, Jun Du, Shutong Niu, Alejandrina Cristià, Xin Wang, Qing Wang, Chin-Hui Lee:
Using iterative adaptation and dynamic mask for child speech extraction under real-world multilingual conditions. Speech Commun. 152: 102956 (2023) - [j43]Li Chai, Hang Chen, Jun Du, Qing-Feng Liu, Chin-Hui Lee:
Space-and-speaker-aware acoustic modeling with effective data augmentation for recognition of multi-array conversational speech. Speech Commun. 153: 102958 (2023) - [j42]Shutong Niu, Jun Du, Lei Sun, Yu Hu, Chin-Hui Lee:
QDM-SSD: Quality-Aware Dynamic Masking for Separation-Based Speaker Diarization. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1037-1049 (2023) - [j41]Qing Wang, Jun Du, Huaxin Wu, Jia Pan, Feng Ma, Chin-Hui Lee:
A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1251-1264 (2023) - [j40]Mao-Kui He, Jun Du, Qing-Feng Liu, Chin-Hui Lee:
ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1561-1573 (2023) - [c141]Hang Chen, Jun Du, Zhe Wang, Chenxi Wang, Yuling Ren, Qinglong Li, Ruibo Liu, Chin-Hui Lee:
Correlated Multi-Level Speech Enhancement for Robust Real-World ASR Applications Using Mask-Waveform-Feature Optimization. APSIPA ASC 2023: 96-101 - [c140]Chang Wang, Jun Du, Hang Chen, Ruoyu Wang, Chao-Han Huck Yang, Jiangjiang Zhao, Yuling Ren, Qinglong Li, Chin-Hui Lee:
Enhancing Privacy Preservation with Quantum Computing for Word-Level Audio-Visual Speech Recognition. APSIPA ASC 2023: 635-642 - [c139]Shi Cheng, Jun Du, Qing Wang, Ya Jiang, Zhaoxu Nian, Shutong Niu, Chin-Hui Lee, Yu Gao, Wenbin Zhang:
Improving Sound Event Localization and Detection with Class-Dependent Sound Separation for Real-World Scenarios. APSIPA ASC 2023: 2068-2073 - [c138]Shilong Wu, Jun Du, Mao-Kui He, Shutong Niu, Hang Chen, Haitao Tang, Chin-Hui Lee:
Semi-Supervised Multi-Channel Speaker Diarization With Cross-Channel Attention. ASRU 2023: 1-8 - [c137]Hang Chen, Shilong Wu, Yusheng Dai, Zhe Wang, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Bao-Cai Yin, Jia Pan, Jianqing Gao, Cong Liu:
Summary on the Multimodal Information Based Speech Processing (MISP) 2022 Challenge. ICASSP 2023: 1-2 - [c136]Ya Jiang, Hang Chen, Jun Du, Qing Wang, Chin-Hui Lee:
Incorporating Lip Features into Audio-Visual Multi-Speaker DOA Estimation by Gated Fusion. ICASSP 2023: 1-5 - [c135]Shutong Niu, Jun Du, Qing Wang, Li Chai, Huaxin Wu, Zhaoxu Nian, Lei Sun, Yi Fang, Jia Pan, Chin-Hui Lee:
An Experimental Study on Sound Event Localization and Detection Under Realistic Testing Conditions. ICASSP 2023: 1-5 - [c134]Qing Wang, Jun Du, Zhaoxu Nian, Shutong Niu, Li Chai, Huaxin Wu, Jia Pan, Chin-Hui Lee:
Loss Function Design for DNN-Based Sound Event Localization and Detection on Low-Resource Realistic Data. ICASSP 2023: 1-5 - [c133]Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition. ICASSP 2023: 1-5 - [c132]Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. ICASSP 2023: 1-5 - [c131]Chenyue Zhang, Hang Chen, Jun Du, Bao-Cai Yin, Jia Pan, Chin-Hui Lee:
Incorporating Visual Information Reconstruction into Progressive Learning for Optimizing audio-visual Speech Enhancement. ICASSP 2023: 1-5 - [c130]Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee:
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder. ICME 2023: 2627-2632 - [i38]Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The Multimodal Information based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition. CoRR abs/2303.06326 (2023) - [i37]Pin-Jui Ku, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models. CoRR abs/2306.00331 (2023) - [i36]Zilu Guo, Jun Du, Chin-Hui Lee, Yu Gao, Wenbin Zhang:
Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement. CoRR abs/2306.08527 (2023) - [i35]Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee:
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder. CoRR abs/2308.08488 (2023) - [i34]Ruoyu Wang, Maokui He, Jun Du, Hengshun Zhou, Shutong Niu, Hang Chen, Yanyan Yue, Gaobin Yang, Shilong Wu, Lei Sun, Yanhui Tu, Haitao Tang, Shuangqing Qian, Tian Gao, Mengzhi Wang, Genshun Wan, Jia Pan, Jianqing Gao, Chin-Hui Lee:
The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge. CoRR abs/2308.14638 (2023) - [i33]Shilong Wu, Chenxi Wang, Hang Chen, Yusheng Dai, Chenyue Zhang, Ruoyu Wang, Hongbo Lan, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Zhong-Qiu Wang, Jia Pan, Jianqing Gao:
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction. CoRR abs/2309.08348 (2023) - [i32]Hao Yen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints. CoRR abs/2309.08828 (2023) - [i31]Gaobin Yang, Maokui He, Shutong Niu, Ruoyu Wang, Yanyan Yue, Shuangqing Qian, Shilong Wu, Jun Du, Chin-Hui Lee:
Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture. CoRR abs/2309.09180 (2023) - 2022
- [c129]Hu Hu, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. ICASSP 2022: 4041-4045 - [c128]Hengshun Zhou, Jun Du, Chao-Han Huck Yang, Shifu Xiong, Chin-Hui Lee:
A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning. ICASSP 2022: 7572-7576 - [c127]Shutong Niu, Jun Du, Lei Sun, Chin-Hui Lee:
Improving Separation-Based Speaker Diarization Via Iterative Model Refinement And Speaker Embedding Based Post-Processing. ICASSP 2022: 8387-8391 - [c126]Maokui He, Xiang Lv, Weilin Zhou, Jingjing Yin, Xiaoqi Zhang, Yuxuan Wang, Shutong Niu, Yuhang Cao, Heng Lu, Jun Du, Chin-Hui Lee:
The USTC-Ximalaya System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription (M2met) Challenge. ICASSP 2022: 9166-9170 - [c125]Hang Chen, Hengshun Zhou, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Bao-Cai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results. ICASSP 2022: 9266-9270 - [c124]Hengshun Zhou, Jun Du, Gongzhen Zou, Zhaoxu Nian, Chin-Hui Lee, Sabato Marco Siniscalchi, Shinji Watanabe, Odette Scharenborg, Jingdong Chen, Shifu Xiong, Jianqing Gao:
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis. INTERSPEECH 2022: 1111-1115 - [c123]Mao-Kui He, Jun Du, Chin-Hui Lee:
End-to-End Audio-Visual Neural Speaker Diarization. INTERSPEECH 2022: 1461-1465 - [c122]Hang Chen, Jun Du, Yusheng Dai, Chin-Hui Lee, Sabato Marco Siniscalchi, Shinji Watanabe, Odette Scharenborg, Jingdong Chen, Baocai Yin, Jia Pan:
Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis. INTERSPEECH 2022: 1766-1770 - [c121]Yajian Wang, Jun Du, Hang Chen, Qing Wang, Chin-Hui Lee:
Deep Segment Model for Acoustic Scene Classification. INTERSPEECH 2022: 4177-4181 - [c120]Chao-Han Huck Yang, Jun Qi, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition. ISCSLP 2022: 1-5 - [c119]Qing Wang, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee:
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function. ISCSLP 2022: 250-254 - [c118]Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification. ISCSLP 2022: 453-457 - [c117]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. SLT 2022: 1074-1080 - [i30]Maokui He, Xiang Lv, Weilin Zhou, Jingjing Yin, Xiaoqi Zhang, Yuxuan Wang, Shutong Niu, Yuhang Cao, Heng Lu, Jun Du, Chin-Hui Lee:
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge. CoRR abs/2202.04855 (2022) - [i29]Hengshun Zhou, Jun Du, Chao-Han Huck Yang, Shifu Xiong, Chin-Hui Lee:
A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning. CoRR abs/2202.08509 (2022) - [i28]Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification. CoRR abs/2203.04114 (2022) - [i27]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. CoRR abs/2210.05614 (2022) - [i26]Chao-Han Huck Yang, Jun Qi, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition. CoRR abs/2210.06382 (2022) - [i25]Qing Wang, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee:
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function. CoRR abs/2210.14581 (2022) - [i24]Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. CoRR abs/2211.01263 (2022) - 2021
- [j39]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating subword articulation with lip shapes for embedding aware audio-visual speech enhancement. Neural Networks 143: 171-182 (2021) - [j38]Li Chai, Jun Du, Qing-Feng Liu, Chin-Hui Lee:
A Cross-Entropy-Guided Measure (CEGM) for Assessing Speech Recognition Performance and Optimizing DNN-Based Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 29: 106-117 (2021) - [j37]Hengshun Zhou, Jun Du, Yuanyuan Zhang, Qing Wang, Qing-Feng Liu, Chin-Hui Lee:
Information Fusion in Attention Networks Using Adaptive and Multi-Level Factorized Bilinear Pooling for Audio-Visual Emotion Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2617-2629 (2021) - [c116]Koen Oostermeijer, Jun Du, Qing Wang, Chin-Hui Lee:
Speech Enhancement Autoencoder with Hierarchical Latent Structure. ICASSP 2021: 671-675 - [c115]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
A Two-Stage Approach to Device-Robust Acoustic Scene Classification. ICASSP 2021: 845-849 - [c114]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition. ICASSP 2021: 6523-6527 - [c113]Zhaoxu Nian, Yan-Hui Tu, Jun Du, Chin-Hui Lee:
A Progressive Learning Approach to Adaptive Noise and Speech Estimation for Speech Enhancement and Noisy Speech Recognition. ICASSP 2021: 6913-6917 - [c112]Hengshun Zhou, Jun Du, Hang Chen, Zijun Jing, Shifu Xiong, Chin-Hui Lee:
Audio-Visual Information Fusion Using Cross-Modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments. Interspeech 2021: 341-345 - [c111]Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification. Interspeech 2021: 881-885 - [c110]Xiaoqi Zhang, Jun Du, Li Chai, Chin-Hui Lee:
A Maximum Likelihood Approach to SNR-Progressive Learning Using Generalized Gaussian Distribution for LSTM-Based Speech Enhancement. Interspeech 2021: 2701-2705 - [c109]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries. Interspeech 2021: 3001-3005 - [c108]Yu-Xuan Wang, Jun Du, Maokui He, Shutong Niu, Lei Sun, Chin-Hui Lee:
Scenario-Dependent Speaker Diarization for DIHARD-III Challenge. Interspeech 2021: 3106-3110 - [c107]Qing Wang, Huaxin Wu, Zijun Jing, Feng Ma, Yi Fang, Yuxuan Wang, Tairan Chen, Jia Pan, Jun Du, Chin-Hui Lee:
A Model Ensemble Approach for Sound Event Localization and Detection. ISCSLP 2021: 1-5 - [c106]Siyuan Zheng, Jun Du, Hengshun Zhou, Xue Bai, Chin-Hui Lee, Shipeng Li:
Speech Emotion Recognition Based on Acoustic Segment Model. ISCSLP 2021: 1-5 - [c105]Li Chai, Jun Du, Diyuan Liu, Yanhui Tu, Chin-Hui Lee:
Acoustic Modeling for Multi-Array Conversational Speech Recognition in the Chime-6 Challenge. SLT 2021: 912-918 - [i23]Qing Wang, Jun Du, Huaxin Wu, Jia Pan, Feng Ma, Chin-Hui Lee:
A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection. CoRR abs/2101.02919 (2021) - [i22]Yuxuan Wang, Mao-Kui He, Shutong Niu, Lei Sun, Tian Gao, Xin Fang, Jia Pan, Jun Du, Chin-Hui Lee:
USTC-NELSLIP System Description for DIHARD-III Challenge. CoRR abs/2103.10661 (2021) - [i21]Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification. CoRR abs/2104.01271 (2021) - [i20]Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Qing Wang, Yuyang Wang, Xianjun Xia, Yuanjun Zhao, Yuzhong Wu, Yannan Wang, Jun Du, Chin-Hui Lee:
A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification. CoRR abs/2107.01461 (2021) - [i19]Shutong Niu, Jun Du, Lei Sun, Chin-Hui Lee:
Separation Guided Speaker Diarization in Realistic Mismatched Conditions. CoRR abs/2107.02357 (2021) - [i18]Hu Hu, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. CoRR abs/2110.08598 (2021) - [i17]Hengshun Zhou, Jun Du, Yuanyuan Zhang, Qing Wang, Qing-Feng Liu, Chin-Hui Lee:
Information Fusion in Attention Networks Using Adaptive and Multi-level Factorized Bilinear Pooling for Audio-visual Emotion Recognition. CoRR abs/2111.08910 (2021) - 2020
- [j36]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression. IEEE Signal Process. Lett. 27: 1485-1489 (2020) - [j35]Yanhui Tu, Jun Du, Tian Gao, Chin-Hui Lee:
A Multi-Target SNR-Progressive Learning Approach to Regression Based Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1608-1619 (2020) - [j34]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network-Based Vector-to-Vector Regression. IEEE Trans. Signal Process. 68: 3411-3422 (2020) - [c104]Jun Qi, Xiaoli Ma, Chin-Hui Lee, Jun Du, Sabato Marco Siniscalchi:
Performance Analysis for Tensor-Train Decomposition to Deep Neural Network Based Vector-to-Vector Regression. CISS 2020: 1-6 - [c103]Xue Bai, Jun Du, Jia Pan, Hengshun Zhou, Yanhui Tu, Chin-Hui Lee:
High-Resolution Attention Network with Acoustic Segment Model for Acoustic Scene Classification. ICASSP 2020: 656-660 - [c102]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Chin-Hui Lee:
Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement. ICASSP 2020: 3107-3111 - [c101]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Yi Ouyang, I-Te Danny Hung, Chin-Hui Lee, Xiaoli Ma:
Enhanced Adversarial Strategically-Timed Attacks Against Deep Reinforcement Learning. ICASSP 2020: 3407-3411 - [c100]Sicheng Wang, Wei Li, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Cross-Task Transfer Learning Approach to Adapting Deep Speech Enhancement Models to Unseen Background Noise Using Paired Senone Classifiers. ICASSP 2020: 6219-6223 - [c99]Shutong Niu, Jun Du, Li Chai, Chin-Hui Lee:
A Maximum Likelihood Approach to Multi-Objective Learning Using Generalized Gaussian Distributions for Dnn-Based Speech Enhancement. ICASSP 2020: 6229-6233 - [c98]Yanhui Tu, Jun Du, Chin-Hui Lee:
2D-to-2D Mask Estimation for Speech Enhancement Based on Fully Convolutional Neural Network. ICASSP 2020: 6664-6668 - [c97]Lei Sun, Jun Du, Xueyang Zhang, Tian Gao, Xin Fang, Chin-Hui Lee:
Progressive Multi-Target Network Based Speech Enhancement with Snr-Preselection for Robust Speaker Diarization. ICASSP 2020: 7099-7103 - [c96]Xin Wang, Jun Du, Alejandrina Cristià, Lei Sun, Chin-Hui Lee:
A Study of Child Speech Extraction Using Joint Speech Enhancement and Separation in Realistic Conditions. ICASSP 2020: 7304-7308 - [c95]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Tensor-To-Vector Regression for Multi-Channel Speech Enhancement Based on Tensor-Train Network. ICASSP 2020: 7504-7508 - [c94]Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee:
Geometry Constrained Progressive Learning for Lstm-Based Speech Enhancement. ICASSP 2020: 7514-7518 - [c93]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement. INTERSPEECH 2020: 76-80 - [c92]Yanhui Tu, Jun Du, Lei Sun, Feng Ma, Jia Pan, Chin-Hui Lee:
A Space-and-Speaker-Aware Iterative Mask Estimation Approach to Multi-Channel Speech Recognition in the CHiME-6 Challenge. INTERSPEECH 2020: 96-100 - [c91]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification. INTERSPEECH 2020: 1196-1200 - [c90]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Xue Bai, Jun Du, Chin-Hui Lee:
An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances. INTERSPEECH 2020: 1201-1205 - [c89]Hengshun Zhou, Jun Du, Yanhui Tu, Chin-Hui Lee:
Using Speech Enhancement Preprocessing for Speech Emotion Recognition in Realistic Noisy Conditions. INTERSPEECH 2020: 4098-4102 - [c88]Yu-Xuan Wang, Jun Du, Li Chai, Chin-Hui Lee, Jia Pan:
A Noise-Aware Memory-Attention Network Architecture for Regression-Based Speech Enhancement. INTERSPEECH 2020: 4501-4505 - [i16]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Tensor-to-Vector Regression for Multi-channel Speech Enhancement based on Tensor-Train Network. CoRR abs/2002.00544 (2020) - [i15]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Yi Ouyang, I-Te Danny Hung, Chin-Hui Lee, Xiaoli Ma:
Enhanced Adversarial Strategically-Timed Attacks against Deep Reinforcement Learning. CoRR abs/2002.09027 (2020) - [i14]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Chin-Hui Lee:
Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement. CoRR abs/2003.13917 (2020) - [i13]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation. CoRR abs/2007.08389 (2020) - [i12]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement. CoRR abs/2007.13024 (2020) - [i11]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Xue Bai, Jun Du, Chin-Hui Lee:
An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances. CoRR abs/2008.00107 (2020) - [i10]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification. CoRR abs/2008.00110 (2020) - [i9]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression. CoRR abs/2008.05459 (2020) - [i8]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression. CoRR abs/2008.07281 (2020) - [i7]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement. CoRR abs/2009.09561 (2020) - [i6]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition. CoRR abs/2010.13309 (2020) - [i5]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
A Two-Stage Approach to Device-Robust Acoustic Scene Classification. CoRR abs/2011.01447 (2020) - [i4]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Chin-Hui Lee, Bao-Cai Yin:
Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention. CoRR abs/2012.14360 (2020)
2010 – 2019
- 2019
- [j33]Lei Sun, Jun Du, Tian Gao, Yi Fang, Feng Ma, Chin-Hui Lee:
A Speaker-Dependent Approach to Separation of Far-Field Multi-Talker Microphone Array Speech for Front-End Processing in the CHiME-5 Challenge. IEEE J. Sel. Top. Signal Process. 13(4): 827-840 (2019) - [j32]Yanhui Tu, Jun Du, Lei Sun, Feng Ma, Hai-Kun Wang, Jingdong Chen, Chin-Hui Lee:
An iterative mask estimation approach to deep learning based multi-channel speech recognition. Speech Commun. 106: 31-43 (2019) - [j31]Li Chai, Jun Du, Qing-Feng Liu, Chin-Hui Lee:
Using Generalized Gaussian Distributions to Improve Regression Error Modeling for Deep Learning-Based Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 1919-1931 (2019) - [j30]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Theory on Deep Neural Network Based Vector-to-Vector Regression With an Illustration of Its Expressive Power in Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 1932-1943 (2019) - [j29]Wei Li, Nancy F. Chen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2012-2024 (2019) - [j28]Yanhui Tu, Jun Du, Chin-Hui Lee:
Speech Enhancement Based on Teacher-Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2080-2091 (2019) - [c87]Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee:
A LSTM-Based Joint Progressive Learning Framework for Simultaneous Speech Dereverberation and Denoising. APSIPA 2019: 274-278 - [c86]Nan Zhou, Jun Du, Yanhui Tu, Tian Gao, Chin-Hui Lee:
A Speech Enhancement Neural Network Architecture with SNR-Progressive Multi-Target Learning for Robust Speech Recognition. APSIPA 2019: 873-877 - [c85]Yanhui Tu, Jun Du, Chin-Hui Lee:
DNN Training Based on Classic Gain Function for Single-channel Speech Enhancement and Recognition. ICASSP 2019: 910-914 - [c84]Wei Li, Sicheng Wang, Ming Lei, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Audio-visual Speech Recognition Performance with Cross-modal Student-teacher Training. ICASSP 2019: 6560-6564 - [c83]Lei Sun, Jun Du, Tian Gao, Yi Fang, Feng Ma, Jia Pan, Chin-Hui Lee:
A Two-stage Single-channel Speaker-dependent Speech Separation Approach for Chime-5 Challenge. ICASSP 2019: 6650-6654 - [c82]Feng Ma, Li Chai, Jun Du, Diyuan Liu, Zhongfu Ye, Chin-Hui Lee:
Acoustic Model Ensembling Using Effective Data Augmentation for CHiME-5 Challenge. INTERSPEECH 2019: 1258-1262 - [c81]Li Chai, Jun Du, Chin-Hui Lee:
KL-Divergence Regularized Deep Neural Network Adaptation for Low-Resource Speaker-Dependent Speech Enhancement. INTERSPEECH 2019: 1806-1810 - [c80]Li Chai, Jun Du, Chin-Hui Lee:
A Cross-Entropy-Guided (CEG) Measure for Speech Enhancement Front-End Assessing Performances of Back-End Automatic Speech Recognition. INTERSPEECH 2019: 3431-3435 - [c79]Xue Bai, Jun Du, Zi-Rui Wang, Chin-Hui Lee:
A Hybrid Approach to Acoustic Scene Classification Based on Universal Acoustic Models. INTERSPEECH 2019: 3619-3623 - 2018
- [j27]Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Multiobjective Learning and Ensembling Approach to High-Performance Speech Enhancement With Compact Neural Network Architectures. IEEE ACM Trans. Audio Speech Lang. Process. 26(7): 1181-1193 (2018) - [j26]Yanhui Tu, Jun Du, Chin-Hui Lee:
A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech. J. Signal Process. Syst. 90(7): 963-973 (2018) - [j25]Zhengqi Wen, Kehuang Li, Zhen Huang, Chin-Hui Lee, Jianhua Tao:
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning. J. Signal Process. Syst. 90(7): 1025-1037 (2018) - [j24]Ju Lin, Wei Li, Yingming Gao, Yanlu Xie, Nancy F. Chen, Sabato Marco Siniscalchi, Jinsong Zhang, Chin-Hui Lee:
Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks. J. Signal Process. Syst. 90(7): 1077-1087 (2018) - [c78]Yanhui Tu, Jun Du, Nan Zhou, Chin-Hui Lee:
Online LSTM-based Iterative Mask Estimation for Multi-Channel Speech Enhancement and ASR. APSIPA 2018: 362-366 - [c77]Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Densely Connected Progressive Learning for LSTM-Based Speech Enhancement. ICASSP 2018: 5054-5058 - [c76]Lei Sun, Jun Du, Tian Gao, Yu-Ding Lu, Yu Tsao, Chin-Hui Lee, Neville Ryant:
A Novel LSTM-Based Speech Preprocessor for Speaker Diarization in Realistic Mismatch Conditions. ICASSP 2018: 5234-5238 - [c75]Wei Li, Nancy F. Chen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Mandarin Tone Mispronunciation Detection for Non-Native Learners with Soft-Target Tone Labels and BLSTM-Based Deep Models. ICASSP 2018: 6249-6253 - [c74]Lei Sun, Jun Du, Chao Jiang, Xueyang Zhang, Shan He, Bing Yin, Chin-Hui Lee:
Speaker Diarization with Enhancing Speech for the First DIHARD Challenge. INTERSPEECH 2018: 2793-2797 - [c73]Li Chai, Jun Du, Chin-Hui Lee:
Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement. INTERSPEECH 2018: 3269-3273 - [c72]Xin Wang, Jun Du, Lei Sun, Qing Wang, Chin-Hui Lee:
A Progressive Deep Learning Approach to Child Speech Separation. ISCSLP 2018: 76-80 - [c71]Qing Wang, Jun Du, Li Chai, Li-Rong Dai, Chin-Hui Lee:
A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network. ISCSLP 2018: 295-299 - [i3]Li Chai, Jun Du, Chin-Hui Lee:
Acoustics-guided evaluation (AGE): a new measure for estimating performance of speech enhancement algorithms for robust ASR. CoRR abs/1811.11517 (2018) - 2017
- [j23]Yanhui Tu, Jun Du, Qing Wang, Xiao Bao, Li-Rong Dai, Chin-Hui Lee:
An information fusion framework with multi-channel feature concatenation and multi-perspective system combination for the deep-learning-based robust recognition of microphone array speech. Comput. Speech Lang. 46: 517-534 (2017) - [j22]Bo Wu, Minglei Yang, Kehuang Li, Zhen Huang, Sabato Marco Siniscalchi, Tong Wang, Chin-Hui Lee:
A reverberation-time-aware DNN approach leveraging spatial information for microphone array dereverberation. EURASIP J. Adv. Signal Process. 2017: 81 (2017) - [j21]Bo Wu, Kehuang Li, Fengpei Ge, Zhen Huang, Minglei Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition. IEEE J. Sel. Top. Signal Process. 11(8): 1289-1300 (2017) - [j20]Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation. Pattern Recognit. Lett. 98: 1-7 (2017) - [j19]Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A unified DNN approach to speaker-dependent simultaneous speech enhancement and speech separation in low SNR environments. Speech Commun. 95: 28-39 (2017) - [j18]Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Bayesian Unsupervised Batch and Online Speaker Adaptation of Activation Function Parameters in Deep Models for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 60-71 (2017) - [j17]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Gender Mixture Detection Approach to Unsupervised Single-Channel Speech Separation Based on Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 25(7): 1535-1546 (2017) - [c70]Yanhui Tu, Jun Du, Lei Sun, Chin-Hui Lee:
LSTM-based iterative mask estimation and post-processing for multi-channel speech enhancement. APSIPA 2017: 488-491 - [c69]Bo Wu, Kehuang Li, Zhen Huang, Sabato Marco Siniscalchi, Minglei Yang, Chin-Hui Lee:
A unified deep modeling approach to simultaneous speech dereverberation and recognition for the reverb challenge. HSCMA 2017: 36-40 - [c68]Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Joint noise and mask aware training for DNN-based speech enhancement with SUB-band features. HSCMA 2017: 101-105 - [c67]Lei Sun, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Multiple-target deep learning for LSTM-RNN based speech enhancement. HSCMA 2017: 136-140 - [c66]Sicheng Wang, Kehuang Li, Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
A transfer learning and progressive stacking approach to reducing deep model sizes with an application to speech enhancement. ICASSP 2017: 5575-5579 - [c65]Yanhui Tu, Jun Du, Lei Sun, Feng Ma, Chin-Hui Lee:
On Design of Robust Deep Models for CHiME-4 Multi-Channel Speech Recognition with Multiple Configurations of Array Microphones. INTERSPEECH 2017: 394-398 - [c64]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation. INTERSPEECH 2017: 1178-1182 - [c63]Wei Li, Nancy F. Chen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Mispronunciation Detection for Non-Native Learners with Multisource Information and LSTM-Based Deep Models. INTERSPEECH 2017: 2759-2763 - [c62]Fengpei Ge, Kehuang Li, Bo Wu, Sabato Marco Siniscalchi, Yonghong Yan, Chin-Hui Lee:
Joint Training of Multi-Channel-Condition Dereverberation and Acoustic Modeling of Microphone Array Speech for Robust Distant Speech Recognition. INTERSPEECH 2017: 3847-3851 - [c61]Shi-Xue Wen, Jun Du, Chin-Hui Lee:
On generating mixing noise signals with basis functions for simulating noisy speech and learning dnn-based speech enhancement models. MLSP 2017: 1-6 - [i2]Yong Xu, Jun Du, Zhen Huang, Li-Rong Dai, Chin-Hui Lee:
Multi-Objective Learning and Mask-Based Post-Processing for Deep Neural Network Based Speech Enhancement. CoRR abs/1703.07172 (2017) - 2016
- [j16]Tian Gao, Jun Du, Yong Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee:
Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition. EURASIP J. Adv. Signal Process. 2016: 86 (2016) - [j15]Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
A unified approach to transfer learning of deep neural networks with applications to speaker adaptation in automatic speech recognition. Neurocomputing 218: 448-459 (2016) - [j14]Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Tomi Kinnunen, Chin-Hui Lee:
i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(1): 29-41 (2016) - [j13]Jun Du, Yanhui Tu, Li-Rong Dai, Chin-Hui Lee:
A Regression Approach to Single-Channel Speech Separation Via High-Resolution Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 24(8): 1424-1437 (2016) - [c60]Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Chin-Hui Lee:
Towards a direct Bayesian adaptation framework for deep models. APSIPA 2016: 1-4 - [c59]Wei Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee:
Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations. APSIPA 2016: 1-4 - [c58]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Unsupervised single-channel speech separation via deep neural network for different gender mixtures. APSIPA 2016: 1-4 - [c57]Wei Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee:
Improving non-native mispronunciation detection and enriching diagnostic feedback with DNN-based speech attribute modeling. ICASSP 2016: 6135-6139 - [c56]Jianqing Gao, Jun Du, Changqing Kong, Huaifang Lu, Enhong Chen, Chin-Hui Lee:
An experimental study on joint modeling of mixed-bandwidth data via deep neural networks for robust speech recognition. IJCNN 2016: 588-594 - [c55]Wei Li, Kehuang Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee:
Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees. INTERSPEECH 2016: 3127-3131 - [c54]Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement. INTERSPEECH 2016: 3713-3717 - [c53]Yanhui Tu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A speaker-dependent deep learning approach to joint speech separation and acoustic modeling for multi-talker automatic speech recognition. ISCSLP 2016: 1-5 - [c52]Zhengqi Wen, Kehuang Li, Zhen Huang, Jianhua Tao, Chin-Hui Lee:
Learning auxiliary categorical information for speech synthesis based on deep and recurrent neural networks. ISCSLP 2016: 1-5 - 2015
- [j12]Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Regression Approach to Speech Enhancement Based on Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 23(1): 7-19 (2015) - [c51]Jun Du, Qing Wang, Yanhui Tu, Xiao Bao, Li-Rong Dai, Chin-Hui Lee:
An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework. ASRU 2015: 430-435 - [c50]Tian Gao, Jun Du, Li Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee:
A unified speaker-dependent speech separation and enhancement system based on deep neural networks. ChinaSIP 2015: 687-691 - [c49]Tian Gao, Jun Du, Yong Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee:
Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments. LVA/ICA 2015: 75-82 - [c48]Yanhui Tu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Speech Separation based on signal-noise-dependent deep neural networks for robust speech recognition. ICASSP 2015: 61-65 - [c47]Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Joint training of front-end and back-end deep neural networks for robust speech recognition. ICASSP 2015: 4375-4379 - [c46]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
High-resolution acoustic modeling and compact language modeling of language-universal speech attributes for spoken language identification. INTERSPEECH 2015: 992-996 - [c45]Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Jinyu Li, Jiadong Wu, Chin-Hui Lee:
Maximum a posteriori adaptation of network parameters in deep models. INTERSPEECH 2015: 1076-1080 - [c44]Yong Xu, Jun Du, Zhen Huang, Li-Rong Dai, Chin-Hui Lee:
Multi-objective learning and mask-based post-processing for deep neural network based speech enhancement. INTERSPEECH 2015: 1508-1512 - [c43]Qing Wang, Jun Du, Xiao Bao, Zi-Rui Wang, Li-Rong Dai, Chin-Hui Lee:
A universal VAD based on jointly trained deep neural networks. INTERSPEECH 2015: 2282-2286 - [c42]Kehuang Li, Zhen Huang, Yong Xu, Chin-Hui Lee:
DNN-based speech bandwidth expansion and its application to adding high-frequency missing features for automatic speech recognition of narrowband speech. INTERSPEECH 2015: 2578-2582 - [c41]Zhen Huang, Jinyu Li, Sabato Marco Siniscalchi, I-Fan Chen, Ji Wu, Chin-Hui Lee:
Rapid adaptation for deep neural networks through multi-task learning. INTERSPEECH 2015: 3625-3629 - [i1]Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Jiadong Wu, Chin-Hui Lee:
Maximum a Posteriori Adaptation of Network Parameters in Deep Models. CoRR abs/1503.02108 (2015) - 2014
- [j11]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
An artificial neural network approach to automatic speech processing. Neurocomputing 140: 326-338 (2014) - [j10]Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
An Experimental Study on Speech Enhancement Based on Deep Neural Networks. IEEE Signal Process. Lett. 21(1): 65-68 (2014) - [c40]Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Global variance equalization for improving deep neural network based speech enhancement. ChinaSIP 2014: 71-75 - [c39]Zhen Huang, Chao Weng, Kehuang Li, You-Chi Cheng, Chin-Hui Lee:
Deep learning vector quantization for acoustic information retrieval. ICASSP 2014: 1350-1354 - [c38]I-Fan Chen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Attribute based lattice rescoring in spontaneous speech recognition. ICASSP 2014: 3325-3329 - [c37]Kehuang Li, Zhen Huang, You-Chi Cheng, Chin-Hui Lee:
A maximal figure-of-merit learning approach to maximizing mean average precision with deep neural network based classifiers. ICASSP 2014: 4503-4507 - [c36]Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Tomi Kinnunen, Chin-Hui Lee:
Introducing attribute features to foreign accent recognition. ICASSP 2014: 5332-5336 - [c35]You-Chi Cheng, Ville Hautamäki, Zhen Huang, Kehuang Li, Chin-Hui Lee:
An i-vector based descriptor for alphabetical gesture recognition. ICASSP 2014: 6593-6597 - [c34]Jun Du, Qing Wang, Tian Gao, Yong Xu, Li-Rong Dai, Chin-Hui Lee:
Robust speech recognition with speech enhanced deep neural networks. INTERSPEECH 2014: 616-620 - [c33]Zhen Huang, Jinyu Li, Chao Weng, Chin-Hui Lee:
Beyond cross-entropy: towards better frame-level objective functions for deep neural network training in automatic speech recognition. INTERSPEECH 2014: 1214-1218 - [c32]Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Elie Khoury, Tommi Kurki, Tomi Kinnunen, Chin-Hui Lee:
Dialect levelling in Finnish: a universal speech attribute approach. INTERSPEECH 2014: 2165-2169 - [c31]Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Dynamic noise aware training for speech enhancement based on deep neural networks. INTERSPEECH 2014: 2670-2674 - [c30]Zhen Huang, Jinyu Li, Sabato Marco Siniscalchi, I-Fan Chen, Chao Weng, Chin-Hui Lee:
Feature space maximum a posteriori linear regression for adaptation of deep neural networks. INTERSPEECH 2014: 2992-2996 - [c29]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A fusion approach to spoken language identification based on combining multiple phone recognizers and speech attribute detectors. ISCSLP 2014: 158-162 - [c28]Yanhui Tu, Jun Du, Yong Xu, Li-Rong Dai, Chin-Hui Lee:
Speech separation based on improved deep neural networks with dual outputs of speech features for both target and interfering speakers. ISCSLP 2014: 250-254 - [c27]Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Cross-language transfer learning for deep neural network based speech enhancement. ISCSLP 2014: 336-340 - 2013
- [j9]Sabato Marco Siniscalchi, Jeremy Reed, Torbjørn Svendsen, Chin-Hui Lee:
Universal attribute characterization of spoken languages for automatic spoken language recognition. Comput. Speech Lang. 27(1): 209-227 (2013) - [j8]Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
Model-based margin estimation for hidden Markov model learning and generalisation. IET Signal Process. 7(8): 704-709 (2013) - [j7]Sabato Marco Siniscalchi, Dong Yu, Li Deng, Chin-Hui Lee:
Exploiting deep neural networks for detection-based speech recognition. Neurocomputing 106: 148-157 (2013) - [j6]Chin-Hui Lee, Sabato Marco Siniscalchi:
An Information-Extraction Approach to Speech Processing: Analysis, Detection, Verification, and Recognition. Proc. IEEE 101(5): 1089-1115 (2013) - [j5]Sabato Marco Siniscalchi, Dong Yu, Li Deng, Chin-Hui Lee:
Speech Recognition Using Long-Span Temporal Patterns in a Deep Network Model. IEEE Signal Process. Lett. 20(3): 201-204 (2013) - [j4]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition. IEEE Trans. Speech Audio Process. 21(4): 786-797 (2013) - [j3]Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems. IEEE Trans. Speech Audio Process. 21(10): 2152-2161 (2013) - [c26]I-Fan Chen, Sabato Marco Siniscalchi, Seokyong Moon, Daejin Shin, Myoung-Wan Koo, Minhwa Chung, Chin-Hui Lee:
An experimental study on structural-MAP approaches to implementing very large vocabulary speech recognition systems for real-world tasks. APSIPA 2013: 1-10 - [c25]Chen-Yu Chiang, Sabato Marco Siniscalchi, Sin-Horng Chen, Chin-Hui Lee:
Knowledge integration for improving performance in LVCSR. INTERSPEECH 2013: 1786-1790 - [c24]Zhen Huang, You-Chi Cheng, Kehuang Li, Ville Hautamäki, Chin-Hui Lee:
A blind segmentation approach to acoustic event detection based on i-vector. INTERSPEECH 2013: 2282-2286 - [c23]Sangmin Oh, A. G. Amitha Perera, Ilseo Kim, Megha Pandey, Kevin J. Cannons, Hossein Hajimirsadeghi, Arash Vahdat, Greg Mori, Ben Miller, Scott McCloskey, You-Chi Cheng, Zhen Huang, Chin-Hui Lee, Chenliang Xu, Rohit Kumar, Wei Chen, Jason J. Corso, Li Fei-Fei, Daphne Koller, Vignesh Ramanathan, Kevin Tang, Armand Joulin, Alexandre Alahi:
TRECVID 2013 GENIE: Multimedia Event Detection and Recounting. TRECVID 2013 - 2012
- [j2]Sabato Marco Siniscalchi, Dau-Cheng Lyu, Torbjørn Svendsen, Chin-Hui Lee:
Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data. IEEE Trans. Speech Audio Process. 20(3): 875-887 (2012) - [c22]Dong Yu, Sabato Marco Siniscalchi, Li Deng, Chin-Hui Lee:
Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition. ICASSP 2012: 4169-4172 - [c21]Byungki Byun, Ilseo Kim, Sabato Marco Siniscalchi, Chin-Hui Lee:
Consumer-level multimedia event detection through unsupervised audio signal modeling. INTERSPEECH 2012: 2081-2084 - [c20]Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
Hermitian based Hidden Activation Functions for Adaptation of Hybrid HMM/ANN Models. INTERSPEECH 2012: 2590-2593 - [c19]Su Jun Leow, Tze Siong Lau, Alvina Goh, Han Meng Peh, Teck Khim Ng, Sabato Marco Siniscalchi, Chin-Hui Lee:
A new confidence measure combining Hidden Markov Models and Artificial Neural Networks of phonemes for effective keyword spotting. ISCSLP 2012: 112-116 - [c18]Chen-Yu Chiang, Sabato Marco Siniscalchi, Yih-Ru Wang, Sin-Horng Chen, Chin-Hui Lee:
A study on cross-language knowledge integration in Mandarin LVCSR. ISCSLP 2012: 315-319 - [c17]A. G. Amitha Perera, Sangmin Oh, Megha Pandey, Tianyang Ma, Anthony Hoogs, Arash Vahdat, Kevin J. Cannons, Hossein Hajimirsadeghi, Greg Mori, Scott McCloskey, Ben Miller, Sharath Venkatesha, Pedro Davalos, Pradipto Das, Chenliang Xu, Jason J. Corso, Rohini K. Srihari, Ilseo Kim, You-Chi Cheng, Zhen Huang, Chin-Hui Lee, Kevin Tang, Li Fei-Fei, Daphne Koller:
TRECVID 2012 GENIE: Multimedia Event Detection and Recounting. TRECVID 2012 - 2011
- [c16]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines. INTERSPEECH 2011: 901-904 - 2010
- [c15]Sabato Marco Siniscalchi, Torbjørn Svendsen, Filippo Sorbello, Chin-Hui Lee:
Experimental studies on continuous speech recognition using neural architectures with "adaptive" hidden activation functions. ICASSP 2010: 4882-4885 - [c14]Sabato Marco Siniscalchi, Jeremy Reed, Torbjørn Svendsen, Chin-Hui Lee:
Exploiting context-dependency and acoustic resolution of universal speech attribute models in spoken language recognition. INTERSPEECH 2010: 2718-2721 - [c13]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A survey on recent progress in the ASAT/SIRKUS paradigm. ISCSLP 2010: 465-470
2000 – 2009
- 2009
- [j1]Sabato Marco Siniscalchi, Chin-Hui Lee:
A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition. Speech Commun. 51(11): 1139-1153 (2009) - [c12]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A phonetic feature based lattice rescoring approach to LVCSR. ICASSP 2009: 3865-3868 - [c11]Sabato Marco Siniscalchi, Jeremy Reed, Torbjørn Svendsen, Chin-Hui Lee:
Exploring universal attribute characterization of spoken languages for spoken language recognition. INTERSPEECH 2009: 168-171 - [c10]Jeremy Reed, Yushi Ueda, Sabato Marco Siniscalchi, Yuuki Uchiyama, Shigeki Sagayama, Chin-Hui Lee:
Minimum Classification Error Training to Improve Isolated Chord Recognition. ISMIR 2009: 609-614 - 2008
- [c9]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
Toward a detector-based universal phone recognizer. ICASSP 2008: 4261-4264 - [c8]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A penalized logistic regression approach to detection based phone classification. INTERSPEECH 2008: 2390-2393 - [c7]Dau-Cheng Lyu, Sabato Marco Siniscalchi, Tae-Yoon Kim, Chin-Hui Lee:
Continuous phone recognition without target language training data. INTERSPEECH 2008: 2687-2690 - 2007
- [c6]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
Towards bottom-up continuous phone recognition. ASRU 2007: 566-569 - [c5]Jinyu Li, Sabato Marco Siniscalchi, Chin-Hui Lee:
Approximate Test Risk Minimization Through Soft Margin Estimation. ICASSP (4) 2007: 653-656 - [c4]Sabato Marco Siniscalchi, Petr Schwarz, Chin-Hui Lee:
High-Accuracy Phone Recognition By Combining High-Performance Lattice Generation and Knowledge Based Rescoring. ICASSP (4) 2007: 869-872 - [c3]Filippo Vella, Chin-Hui Lee, Salvatore Gaglio:
Boosting of Maximal Figure of Merit Classifiers for Automatic Image Annotation. ICIP (2) 2007: 217-220 - [c2]Filippo Vella, Chin-Hui Lee:
Information fusion techniques for automatic image annotation. VISAPP (2) 2007: 60-67 - 2006
- [c1]Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
A study on lattice rescoring with knowledge scores for automatic speech recognition. INTERSPEECH 2006
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-04-25 05:43 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint