


Li-Rong Dai 0001
Lirong Dai 0001
Person information

- affiliation: University of Science and Technology of China, National Engineering Laboratory for Speech and Language Information Processing, Hefei, China
Other persons with the same name
- Li-Rong Dai (aka: Lirong Dai) — disambiguation page
- Li-Rong Dai 0002 (aka: Lirong Dai 0002) — Seattle University, USA (and 1 more)
2020 – today
- 2022
- [j48]Jiajia Wu, Jun Du, Fengren Wang, Chen Yang, Xinzhe Jiang, Jinshui Hu, Bing Yin, Jianshu Zhang, Lirong Dai:
A multimodal attention fusion network with a dynamic vocabulary for TextVQA. Pattern Recognit. 122: 108214 (2022) - [i32]Qiu-Shi Zhu, Jie Zhang, Zi-qiang Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai:
A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition. CoRR abs/2201.08930 (2022) - [i31]Xing-Yu Chen, Qiu-Shi Zhu, Jie Zhang, Li-Rong Dai:
Supervised and Self-supervised Pretraining Based COVID-19 Detection Using Acoustic Breathing/Cough/Speech Signals. CoRR abs/2201.08934 (2022) - [i30]Zi-qiang Zhang, Jie Zhang, Jian-Shu Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai:
Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition. CoRR abs/2202.07428 (2022) - [i29]Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. CoRR abs/2203.17113 (2022) - [i28]Ye-Qian Du, Jie Zhang, Qiu-Shi Zhu, Li-Rong Dai, Ming-Hui Wu, Xin Fang, Zhou-Wang Yang:
A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition. CoRR abs/2204.02023 (2022) - 2021
- [j47]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating subword articulation with lip shapes for embedding aware audio-visual speech enhancement. Neural Networks 143: 171-182 (2021) - [j46]Jie Zhang, Huawei Chen, Li-Rong Dai, Richard Christian Hendriks:
A Study on Reference Microphone Selection for Multi-Microphone Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 29: 671-683 (2021) - [j45]Jie Zhang, Jun Du, Li-Rong Dai:
Sensor Selection for Relative Acoustic Transfer Function Steered Linearly-Constrained Beamformers. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1220-1232 (2021) - [j44]Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai:
UnitNet: A Sequence-to-Sequence Acoustic Model for Concatenative Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2643-2655 (2021) - [j43]Jian Tang, Jie Zhang, Yan Song, Ian McLoughlin, Li-Rong Dai:
Multi-Granularity Sequence Alignment Mapping for Encoder-Decoder Based End-to-End ASR. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2816-2828 (2021) - [j42]Jianshu Zhang, Jun Du, Yongxin Yang, Yi-Zhe Song, Lirong Dai:
SRD: A Tree Structure Based Decoder for Online Handwritten Mathematical Expression Recognition. IEEE Trans. Multim. 23: 2471-2480 (2021) - [c216]Jing-Xuan Zhang, Korin Richmond, Zhen-Hua Ling, Lirong Dai:
TaLNet: Voice Reconstruction from Tongue and Lip Articulation with Transfer Learning from Text-to-Speech Synthesis. AAAI 2021: 14402-14410 - [c215]Xu Zheng, Yan Song, Ian McLoughlin, Lin Liu, Li-Rong Dai:
An Improved Mean Teacher Based Method for Large Scale Weakly Labeled Semi-Supervised Sound Event Detection. ICASSP 2021: 356-360 - [c214]Ying Liu, Yan Song, Ian McLoughlin, Lin Liu, Li-Rong Dai:
An Effective Deep Embedding Learning Method Based on Dense-Residual Networks for Speaker Verification. ICASSP 2021: 6683-6687 - [c213]Xu Zheng, Yan Song, Li-Rong Dai, Ian McLoughlin, Lin Liu:
An Effective Mutual Mean Teaching Based Domain Adaptation Method for Sound Event Detection. Interspeech 2021: 556-560 - [c212]Hui Wang, Lin Liu, Yan Song, Lei Fang, Ian McLoughlin, Li-Rong Dai:
A Weight Moving Average Based Alternate Decoupled Learning Algorithm for Long-Tailed Language Identification. Interspeech 2021: 1499-1503 - [c211]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries. Interspeech 2021: 3001-3005 - [c210]Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai:
UnitNet-Based Hybrid Speech Synthesis. Interspeech 2021: 4119-4123 - [c209]Qiu-Shi Zhu, Jie Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai:
An Improved Wav2Vec 2.0 Pre-Training Approach Using Enhanced Local Dependency Modeling for Speech Recognition. Interspeech 2021: 4334-4338 - [i27]Zi-qiang Zhang, Yan Song, Ming-Hui Wu, Xin Fang, Li-Rong Dai:
XLST: Cross-lingual Self-training to Learn Multilingual Representation for Low Resource Speech Recognition. CoRR abs/2103.08207 (2021) - 2020
- [j41]Jian Tang, Junfeng Hou, Yan Song, Li-Rong Dai, Ian McLoughlin:
Effective Exploitation of Posterior Information for Attention-Based Speech Recognition. IEEE Access 8: 108988-108999 (2020) - [j40]Junfeng Hou, Wu Guo, Yan Song, Li-Rong Dai:
Segment boundary detection directed attention for online end-to-end speech recognition. EURASIP J. Audio Speech Music. Process. 2020(1): 3 (2020) - [j39]Jianshu Zhang, Jun Du, Lirong Dai:
Radical analysis network for learning hierarchies of Chinese characters. Pattern Recognit. 103: 107305 (2020) - [j38]Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai:
Learning and Modeling Unit Embeddings Using Deep Neural Networks for Unit-Selection-Based Mandarin Speech Synthesis. ACM Trans. Asian Low Resour. Lang. Inf. Process. 19(3): 38:1-38:14 (2020) - [j37]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Non-Parallel Sequence-to-Sequence Voice Conversion With Disentangled Linguistic and Speaker Representations. IEEE ACM Trans. Audio Speech Lang. Process. 28: 540-552 (2020) - [c208]Liangfa Wei, Jie Zhang, Junfeng Hou, Lirong Dai:
Attentive Fusion Enhanced Audio-Visual Encoding for Transformer Based Robust Speech Recognition. APSIPA 2020: 638-643 - [c207]Jie Yan, Yan Song, Li-Rong Dai, Ian McLoughlin:
Task-Aware Mean Teacher Method for Large Scale Weakly Labeled Semi-Supervised Sound Event Detection. ICASSP 2020: 326-330 - [c206]Hui Wang, Yan Song, Zengxi Li, Ian McLoughlin, Li-Rong Dai:
An Online Speaker-aware Speech Separation Approach Based on Time-domain Representation. ICASSP 2020: 6379-6383 - [c205]Bin Gu, Wu Guo, Lirong Dai, Jun Du:
An Improved Deep Neural Network for Modeling Speaker Characteristics at Different Temporal Scales. ICASSP 2020: 6814-6818 - [c204]Fenglin Ding, Wu Guo, Lirong Dai, Jun Du:
Attention-Based Gated Scaling Adaptive Acoustic Model for CTC-Based Speech Recognition. ICASSP 2020: 7404-7408 - [c203]Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai:
Extracting Unit Embeddings Using Sequence-To-Sequence Acoustic Models for Unit Selection Speech Synthesis. ICASSP 2020: 7659-7663 - [c202]Jianshu Zhang, Jun Du, Yongxin Yang, Yi-Zhe Song, Si Wei, Lirong Dai:
A Tree-Structured Decoder for Image-to-Markup Generation. ICML 2020: 11076-11085 - [c201]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning. INTERSPEECH 2020: 771-775 - [c200]Xu Zheng, Yan Song, Jie Yan, Li-Rong Dai, Ian McLoughlin, Lin Liu:
An Effective Perturbation Based Semi-Supervised Learning Method for Sound Event Detection. INTERSPEECH 2020: 841-845 - [c199]Ying Liu, Yan Song, Yiheng Jiang, Ian McLoughlin, Lin Liu, Li-Rong Dai:
An Effective Speaker Recognition Method Based on Joint Identification and Verification Supervisions. INTERSPEECH 2020: 3007-3011 - [c198]Zi-qiang Zhang, Yan Song, Jian-Shu Zhang, Ian McLoughlin, Li-Rong Dai:
Semi-Supervised End-to-End ASR via Teacher-Student Learning with Conditional Posterior Distribution. INTERSPEECH 2020: 3580-3584 - [i26]Fenglin Ding, Wu Guo, Lirong Dai, Jun Du:
Attentive batch normalization for lstm-based acoustic modeling of speech recognition. CoRR abs/2001.00129 (2020) - [i25]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning. CoRR abs/2008.02371 (2020) - [i24]Liangfa Wei, Jie Zhang, Junfeng Hou, Lirong Dai:
Attentive Fusion Enhanced Audio-Visual Encoding for Transformer Based Robust Speech Recognition. CoRR abs/2008.02686 (2020) - [i23]Jing-Xuan Zhang, Li-Juan Liu, Yan-Nian Chen, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling, Li-Rong Dai:
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer. CoRR abs/2009.01475 (2020) - [i22]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement. CoRR abs/2009.09561 (2020) - [i21]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Chin-Hui Lee, Bao-Cai Yin:
Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention. CoRR abs/2012.14360 (2020)
2010 – 2019
- 2019
- [j36]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Li-Rong Dai:
Sequence-to-Sequence Acoustic Modeling for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 27(3): 631-644 (2019) - [j35]Zengxi Li, Yan Song, Li-Rong Dai, Ian McLoughlin:
Listening and Grouping: An Online Autoregressive Approach for Monaural Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 27(4): 692-703 (2019) - [j34]Jianshu Zhang, Jun Du, Lirong Dai:
Track, Attend, and Parse (TAP): An End-to-End Framework for Online Handwritten Mathematical Expression Recognition. IEEE Trans. Multim. 21(1): 221-233 (2019) - [c197]Yuxuan Xi, Pengcheng Li, Yan Song, Yiheng Jiang, Lirong Dai:
Speaker to Emotion: Domain Adaptation for Speech Emotion Recognition with Residual Adapters. APSIPA 2019: 513-518 - [c196]Peng-Fei Wu, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Hong-Chuan Wu, Lirong Dai:
End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training. APSIPA 2019: 623-627 - [c195]Jingyi Xu, Junfeng Hou, Yan Song, Wu Guo, Lirong Dai:
Knowledge Distillation from Multilingual and Monolingual Teachers for End-to-End Multilingual Speech Recognition. APSIPA 2019: 844-849 - [c194]Rui Na, Junfeng Hou, Wu Guo, Yan Song, Lirong Dai:
Learning Adaptive Downsampling Encoding for Online End-to-End Speech Recognition. APSIPA 2019: 850-854 - [c193]Yiheng Jiang, Yan Song, Jie Yan, Lirong Dai, Ian McLoughlin:
Triplet-Center Loss Based Deep Embedding Learning Method for Speaker Verification. APSIPA 2019: 1625-1629 - [c192]Jie Yan, Yan Song, Wu Guo, Li-Rong Dai, Ian McLoughlin, Liang Chen:
A Region Based Attention Method for Weakly Supervised Sound Event Detection and Classification. ICASSP 2019: 755-759 - [c191]Jing-Xuan Zhang, Zhen-Hua Ling, Yuan Jiang, Li-Juan Liu, Chen Liang, Li-Rong Dai:
Improving Sequence-to-sequence Voice Conversion by Adding Text-supervision. ICASSP 2019: 6785-6789 - [c190]Zhifu Gao, Yan Song, Ian McLoughlin, Pengcheng Li, Yiheng Jiang, Li-Rong Dai:
Improving Aggregation and Loss Function for Better Embedding Learning in End-to-End Speaker Verification System. INTERSPEECH 2019: 361-365 - [c189]Lanhua You, Wu Guo, Li-Rong Dai, Jun Du:
Multi-Task Learning with High-Order Statistics for x-Vector Based Text-Independent Speaker Verification. INTERSPEECH 2019: 1158-1162 - [c188]Lanhua You, Wu Guo, Li-Rong Dai, Jun Du:
Deep Neural Network Embeddings with Gating Mechanisms for Text-Independent Speaker Verification. INTERSPEECH 2019: 1168-1172 - [c187]Jia-Xiang Chen, Zhen-Hua Ling, Li-Rong Dai:
A Chinese Dataset for Identifying Speakers in Novels. INTERSPEECH 2019: 1561-1565 - [c186]Yuan-Hao Yi, Yang Ai, Zhen-Hua Ling, Li-Rong Dai:
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling. INTERSPEECH 2019: 2593-2597 - [c185]Yiheng Jiang, Yan Song, Ian McLoughlin, Zhifu Gao, Li-Rong Dai:
An Effective Deep Embedding Learning Architecture for Speaker Verification. INTERSPEECH 2019: 4040-4044 - [c184]Zhi Chen, Wu Guo, Li-Rong Dai, Zhen-Hua Ling, Jun Du:
Neural Text Clustering with Document-Level Attention Based on Dynamic Soft Labels. INTERSPEECH 2019: 4225-4229 - [i20]Lanhua You, Wu Guo, Lirong Dai, Jun Du:
Deep Neural Network Embedding Learning with High-Order Statistics for Text-Independent Speaker Verification. CoRR abs/1903.12058 (2019) - [i19]Lanhua You, Wu Guo, Lirong Dai, Jun Du:
Deep Neural Network Embeddings with Gating Mechanisms for Text-Independent Speaker Verification. CoRR abs/1903.12092 (2019) - [i18]Yuan-Hao Yi, Yang Ai, Zhen-Hua Ling, Li-Rong Dai:
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling. CoRR abs/1906.08977 (2019) - [i17]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations. CoRR abs/1906.10508 (2019) - [i16]Peng-Fei Wu, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Hong-Chuan Wu, Li-Rong Dai:
End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training. CoRR abs/1906.10859 (2019) - 2018
- [j33]Zengxi Li, Li-Rong Dai, Yan Song, Ian McLoughlin:
A Conditional Generative Model for Speech Enhancement. Circuits Syst. Signal Process. 37(11): 5005-5022 (2018) - [j32]Zheng-Chen Liu, Zhen-Hua Ling, Li-Rong Dai:
Articulatory-to-acoustic conversion using BLSTM-RNNs with augmented input representation. Speech Commun. 99: 161-172 (2018) - [j31]Zheng-Chen Liu, Zhen-Hua Ling, Li-Rong Dai:
Statistical Parametric Speech Synthesis Using Generalized Distillation Framework. IEEE Signal Process. Lett. 25(5): 695-699 (2018) - [j30]Ma Jin, Yan Song, Ian McLoughlin, Li-Rong Dai:
LID-Senones and Their Statistics for Language Identification. IEEE ACM Trans. Audio Speech Lang. Process. 26(1): 171-183 (2018) - [j29]Zhen-Hua Ling, Yang Ai, Yu Gu, Li-Rong Dai:
Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension. IEEE ACM Trans. Audio Speech Lang. Process. 26(5): 883-894 (2018) - [j28]Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Multiobjective Learning and Ensembling Approach to High-Performance Speech Enhancement With Compact Neural Network Architectures. IEEE ACM Trans. Audio Speech Lang. Process. 26(7): 1181-1193 (2018) - [j27]Junhua Liu, Zhen-Hua Ling, Si Wei, Guoping Hu, Li-Rong Dai:
Improving the Decoding Efficiency of Deep Neural Network Acoustic Models by Cluster-Based Senone Selection. J. Signal Process. Syst. 90(7): 999-1011 (2018) - [c183]Yaming Liu, Jian Tang, Yan Song, Lirong Dai:
A Capsule based Approach for Polyphonic Sound Event Detection. APSIPA 2018: 1853-1857 - [c182]Zengxi Li, Yan Song, Li-Rong Dai, Ian McLoughlin:
Source-Aware Context Network for Single-Channel Multi-Speaker Speech Separation. ICASSP 2018: 681-685 - [c181]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Forward Attention in Sequence-To-Sequence Acoustic Modeling for Speech Synthesis. ICASSP 2018: 4789-4793 - [c180]Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Densely Connected Progressive Learning for LSTM-Based Speech Enhancement. ICASSP 2018: 5054-5058 - [c179]Shiliang Zhang, Ming Lei, Zhijie Yan, Lirong Dai:
Deep-FSMN for Large Vocabulary Continuous Speech Recognition. ICASSP 2018: 5869-5873 - [c178]Peixin Chen, Wu Guo, Lirong Dai, Zhenhua Ling:
Pseudo-Supervised Approach for Text Clustering Based on Consensus Analysis. ICASSP 2018: 6184-6188 - [c177]Jianshu Zhang, Yixing Zhu, Jun Du, Lirong Dai:
Radical Analysis Network for Zero-Shot Learning in Printed Chinese Character Recognition. ICME 2018: 1-6 - [c176]Jianshu Zhang, Jun Du, Lirong Dai:
Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition. ICPR 2018: 2245-2250 - [c175]Jianshu Zhang, Yixing Zhu, Jun Du, Lirong Dai:
Trajectory-based Radical Analysis Network for Online Handwritten Chinese Character Recognition. ICPR 2018: 3681-3686 - [c174]Jian Tang, Yan Song, Lirong Dai, Ian McLoughlin:
Acoustic Modeling with Densely Connected Residual Network for Multichannel Speech Recognition. INTERSPEECH 2018: 1783-1787 - [c173]Li-Juan Liu, Zhen-Hua Ling, Yuan Jiang, Ming Zhou, Li-Rong Dai:
WaveNet Vocoder with Limited Training Data for Voice Conversion. INTERSPEECH 2018: 1983-1987 - [c172]Xiao Zhou, Zhen-Hua Ling, Zhi-Ping Zhou, Li-Rong Dai:
Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis. INTERSPEECH 2018: 2509-2513 - [c171]Pengcheng Li, Yan Song, Ian McLoughlin, Wu Guo, Lirong Dai:
An Attention Pooling Based Representation Learning Method for Speech Emotion Recognition. INTERSPEECH 2018: 3087-3091 - [c170]Zhifu Gao, Yan Song, Ian McLoughlin, Wu Guo, Lirong Dai:
An Improved Deep Embedding Learning Method for Short Duration Speaker Verification. INTERSPEECH 2018: 3578-3582 - [c169]Qing Wang, Jun Du, Li Chai, Li-Rong Dai, Chin-Hui Lee:
A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network. ISCSLP 2018: 295-299 - [i15]Jianshu Zhang, Jun Du, Lirong Dai:
Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition. CoRR abs/1801.03530 (2018) - [i14]Zhen-Hua Ling, Yang Ai, Yu Gu, Li-Rong Dai:
Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension. CoRR abs/1801.07910 (2018) - [i13]Jianshu Zhang, Yixing Zhu, Jun Du, Lirong Dai:
Trajectory-based Radical Analysis Network for Online Handwritten Chinese Character Recognition. CoRR abs/1801.10109 (2018) - [i12]Shiliang Zhang, Ming Lei, Zhijie Yan, Lirong Dai:
Deep-FSMN for Large Vocabulary Continuous Speech Recognition. CoRR abs/1803.05030 (2018) - [i11]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis. CoRR abs/1807.06736 (2018) - [i10]Yaming Liu, Jian Tang, Yan Song, Lirong Dai:
A Capsule based Approach for Polyphonic Sound Event Detection. CoRR abs/1807.07436 (2018) - [i9]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Li-Rong Dai:
Sequence-to-Sequence Acoustic Modeling for Voice Conversion. CoRR abs/1810.06865 (2018) - [i8]Jing-Xuan Zhang, Zhen-Hua Ling, Yuan Jiang, Li-Juan Liu, Chen Liang, Li-Rong Dai:
Improving Sequence-to-Sequence Acoustic Modeling by Adding Text-Supervision. CoRR abs/1811.08111 (2018) - 2017
- [j26]Yanhui Tu, Jun Du, Qing Wang, Xiao Bao, Li-Rong Dai, Chin-Hui Lee:
An information fusion framework with multi-channel feature concatenation and multi-perspective system combination for the deep-learning-based robust recognition of microphone array speech. Comput. Speech Lang. 46: 517-534 (2017) - [j25]Yonghong Tian, Xilin Chen, Hongkai Xiong, Hong-Liang Li, Li-Rong Dai, Jing Chen, Junliang Xing, Jing Chen, Xihong Wu, Weiming Hu, Yu Hu, Tiejun Huang, Wen Gao:
Towards human-like and transhuman perception in AI 2.0: a review. Frontiers Inf. Technol. Electron. Eng. 18(1): 58-67 (2017) - [j24]Jianshu Zhang, Jun Du, Shiliang Zhang, Dan Liu, Yulong Hu, Jin-Shui Hu, Si Wei, Li-Rong Dai:
Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition. Pattern Recognit. 71: 196-206 (2017) - [j23]Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A unified DNN approach to speaker-dependent simultaneous speech enhancement and speech separation in low SNR environments. Speech Commun. 95: 28-39 (2017) - [j22]Shiliang Zhang, Cong Liu, Hui Jiang, Si Wei, Li-Rong Dai, Yu Hu:
Nonrecurrent Neural Structure for Long-Term Dependence. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 871-884 (2017) - [j21]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Gender Mixture Detection Approach to Unsupervised Single-Channel Speech Separation Based on Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 25(7): 1535-1546 (2017) - [c168]Junfeng Hou, Shiliang Zhang, Li-Rong Dai, Hui Jiang:
Feedforward sequential memory networks based encoder-decoder model for machine translation. APSIPA 2017: 622-625 - [c167]Huang Chen, Shiliang Zhang, Junfeng Hou, Lirong Dai:
Learning the number of nodes in DNNs with activation mask. APSIPA 2017: 1218-1221 - [c166]Shumin An, Zhenhua Ling, Lirong Dai:
Emotional statistical parametric speech synthesis using LSTM-RNNs. APSIPA 2017: 1613-1616 - [c165]Ya-Jun Hu, Li-Juan Liu, Chuang Ding, Zhen-Hua Ling, Li-Rong Dai:
The USTC system for blizzard machine learning challenge 2017-ES2. ASRU 2017: 650-656 - [c164]Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Joint noise and mask aware training for DNN-based speech enhancement with SUB-band features. HSCMA 2017: 101-105 - [c163]Lei Sun, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Multiple-target deep learning for LSTM-RNN based speech enhancement. HSCMA 2017: 136-140 - [c162]Ya-Jun Hu, Zhen-Hua Ling, Li-Rong Dai:
Extracting structural spectral features using what-where auto-encoders for statistical parametric speech synthesis. ICASSP 2017: 4915-4919 - [c161]Liping Chen, Kong-Aik Lee, Bin Ma, Long Ma, Haizhou Li, Li-Rong Dai:
Adaptation of PLDA for multi-source text-independent speaker verification. ICASSP 2017: 5380-5384 - [c160]Jianshu Zhang, Jun Du, Lirong Dai:
A GRU-Based Encoder-Decoder Approach with Attention for Online Handwritten Mathematical Expression Recognition. ICDAR 2017: 902-907 - [c159]Xiao Bao, Tian Gao, Jun Du, Li-Rong Dai:
An investigation of high-resolution modeling units of deep neural networks for acoustic scene classification. IJCNN 2017: 3028-3035 - [c158]Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation. INTERSPEECH 2017: 1178-1182 - [c157]Ma Jin, Yan Song, Ian Vince McLoughlin, Wu Guo, Li-Rong Dai:
End-to-End Language Identification Using High-Order Utterance Representation with Bilinear Pooling. INTERSPEECH 2017: 2571-2575 - [c156]Junfeng Hou, Shiliang Zhang, Li-Rong Dai:
Gaussian Prediction Based Attention for Online End-to-End Speech Recognition. INTERSPEECH 2017: 3692-3696 - [i7]Junbei Zhang, Xiao-Dan Zhu, Qian Chen, Li-Rong Dai, Si Wei, Hui Jiang:
Exploring Question Understanding and Adaptation in Neural-Network-Based Question Answering. CoRR abs/1703.04617 (2017) - [i6]