default search action
Philip C. Woodland
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j44]Guangzhi Sun, Chao Zhang, Ivan Vulic, Pawel Budzianowski, Philip C. Woodland:
Knowledge-aware audio-grounded generative slot filling for limited annotated data. Comput. Speech Lang. 89: 101707 (2025) - 2024
- [j43]Keqi Deng, Philip C. Woodland:
Decoupled structure for improved adaptability of end-to-end models. Speech Commun. 163: 103109 (2024) - [j42]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Graph Neural Networks for Contextual ASR With the Tree-Constrained Pointer Generator. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2407-2417 (2024) - [j41]Keqi Deng, Philip C. Woodland:
Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3507-3516 (2024) - [c192]Wen Wu, Wenlin Chen, Chao Zhang, Philip C. Woodland:
Modelling Variability in Human Annotator Simulation. ACL (Findings) 2024: 1139-1157 - [c191]Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N. Sainath, Philip C. Woodland:
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation. ACL (1) 2024: 2078-2093 - [c190]Guangzhi Sun, Shutong Feng, Dongcheng Jiang, Chao Zhang, Milica Gasic, Philip C. Woodland:
Speech-based Slot Filling using Large Language Models. ACL (Findings) 2024: 6351-6362 - [c189]Keqi Deng, Philip C. Woodland:
Label-Synchronous Neural Transducer for E2E Simultaneous Speech Translation. ACL (1) 2024: 8235-8251 - [c188]Nineli Lashkarashvili, Wen Wu, Guangzhi Sun, Philip C. Woodland:
Parameter Efficient Finetuning for Speech Emotion Recognition and Domain Adaptation. ICASSP 2024: 10986-10990 - [c187]Keqi Deng, Philip C. Woodland:
FastInject: Injecting Unpaired Text Data into CTC-Based ASR Training. ICASSP 2024: 11836-11840 - [c186]Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua D. Reiss, Lin Wang, Philip C. Woodland, Xie Chen, Huy Phan, Thomas Hain:
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem. Odyssey 2024: 260-265 - [i54]Nineli Lashkarashvili, Wen Wu, Guangzhi Sun, Philip C. Woodland:
Parameter Efficient Finetuning for Speech Emotion Recognition and Domain Adaptation. CoRR abs/2402.11747 (2024) - [i53]Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N. Sainath, Philip C. Woodland:
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation. CoRR abs/2402.12862 (2024) - [i52]Guangzhi Sun, Potsawee Manakul, Adian Liusie, Kunat Pipatanakul, Chao Zhang, Philip C. Woodland, Mark J. F. Gales:
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models. CoRR abs/2405.13684 (2024) - [i51]Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua D. Reiss, Lin Wang, Philip C. Woodland, Xie Chen, Huy Phan, Thomas Hain:
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem. CoRR abs/2405.20064 (2024) - [i50]Keqi Deng, Guangzhi Sun, Philip C. Woodland:
Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning. CoRR abs/2406.00522 (2024) - [i49]Keqi Deng, Philip C. Woodland:
Label-Synchronous Neural Transducer for E2E Simultaneous Speech Translation. CoRR abs/2406.04541 (2024) - [i48]Xiaodong Wu, Wenyi Yu, Chao Zhang, Philip C. Woodland:
An Improved Empirical Fisher Approximation for Natural Gradient Descent. CoRR abs/2406.06420 (2024) - [i47]Wen Wu, Chao Zhang, Philip C. Woodland:
Confidence Estimation for Automatic Detection of Depression and Alzheimer's Disease Based on Clinical Interviews. CoRR abs/2407.19984 (2024) - [i46]Xiaoyu Yang, Qiujia Li, Chao Zhang, Philip C. Woodland:
MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events. CoRR abs/2409.17010 (2024) - [i45]Guangzhi Sun, Anmol Kagrecha, Potsawee Manakul, Philip C. Woodland, Mark J. F. Gales:
SkillAggregation: Reference-free LLM-Dependent Aggregation. CoRR abs/2410.10215 (2024) - 2023
- [j40]Qiujia Li, Chao Zhang, Philip C. Woodland:
Combining hybrid DNN-HMM ASR systems with attention-based models using lattice rescoring. Speech Commun. 147: 12-21 (2023) - [j39]Wen Wu, Chao Zhang, Xixin Wu, Philip C. Woodland:
Estimating the Uncertainty in Emotion Class Labels With Utterance-Specific Dirichlet Priors. IEEE Trans. Affect. Comput. 14(4): 2810-2822 (2023) - [j38]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Minimising Biasing Word Errors for Contextual ASR With the Tree-Constrained Pointer Generator. IEEE ACM Trans. Audio Speech Lang. Process. 31: 345-354 (2023) - [c185]Wen Wu, Chao Zhang, Philip C. Woodland:
Estimating the Uncertainty in Emotion Attributes using Deep Evidential Regression. ACL (1) 2023: 15681-15695 - [c184]Keqi Deng, Philip C. Woodland:
Adaptable End-to-End ASR Models Using Replaceable Internal LMs and Residual Softmax. ICASSP 2023: 1-5 - [c183]Evonne P. C. Lee, Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Spectral Clustering-Aware Learning of Embeddings for Speaker Diarisation. ICASSP 2023: 1-5 - [c182]Yuang Li, Xianrui Zheng, Philip C. Woodland:
Self-Supervised Learning-Based Source Separation for Meeting Data. ICASSP 2023: 1-5 - [c181]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
End-to-End Spoken Language Understanding with Tree-Constrained Pointer Generator. ICASSP 2023: 1-5 - [c180]Wen Wu, Chao Zhang, Philip C. Woodland:
Self-Supervised Representations in Speech-Based Depression Detection. ICASSP 2023: 1-5 - [c179]Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Can Contextual Biasing Remain Effective with Whisper and GPT-2? INTERSPEECH 2023: 1289-1293 - [c178]Dongcheng Jiang, Chao Zhang, Philip C. Woodland:
A Neural Time Alignment Module for End-to-End Automatic Speech Recognition. INTERSPEECH 2023: 1374-1378 - [c177]Wen Wu, Chao Zhang, Philip C. Woodland:
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations. INTERSPEECH 2023: 3607-3611 - [c176]Florian L. Kreyssig, Yangyang Shi, Jinxi Guo, Leda Sari, Abdel-rahman Mohamed, Philip C. Woodland:
Biased Self-supervised Learning for ASR. INTERSPEECH 2023: 4948-4952 - [i44]Keqi Deng, Philip C. Woodland:
Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax. CoRR abs/2302.08579 (2023) - [i43]Xiaoyu Yang, Qiujia Li, Chao Zhang, Philip C. Woodland:
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition. CoRR abs/2303.10917 (2023) - [i42]Wen Wu, Chao Zhang, Philip C. Woodland:
Self-supervised representations in speech-based depression detection. CoRR abs/2305.12263 (2023) - [i41]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator. CoRR abs/2305.18824 (2023) - [i40]Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Can Contextual Biasing Remain Effective with Whisper and GPT-2? CoRR abs/2306.01942 (2023) - [i39]Wen Wu, Chao Zhang, Philip C. Woodland:
Estimating the Uncertainty in Emotion Attributes using Deep Evidential Regression. CoRR abs/2306.06760 (2023) - [i38]Guangzhi Sun, Chao Zhang, Ivan Vulic, Pawel Budzianowski, Philip C. Woodland:
Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data. CoRR abs/2307.01764 (2023) - [i37]Wen Wu, Chao Zhang, Philip C. Woodland:
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations. CoRR abs/2308.07145 (2023) - [i36]Keqi Deng, Philip C. Woodland:
Decoupled Structure for Improved Adaptability of End-to-End Models. CoRR abs/2308.13345 (2023) - [i35]Wen Wu, Wenlin Chen, Chao Zhang, Philip C. Woodland:
It HAS to be Subjective: Human Annotator Simulation via Zero-shot Density Estimation. CoRR abs/2310.00486 (2023) - [i34]Theodor Nguyen, Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Conditional Diffusion Model for Target Speaker Extraction. CoRR abs/2310.04791 (2023) - [i33]Guangzhi Sun, Shutong Feng, Dongcheng Jiang, Chao Zhang, Milica Gasic, Philip C. Woodland:
Speech-based Slot Filling using Large Language Models. CoRR abs/2311.07418 (2023) - [i32]Keqi Deng, Philip C. Woodland:
FastInject: Injecting Unpaired Text Data into CTC-based ASR training. CoRR abs/2312.09100 (2023) - 2022
- [j37]Cai Wingfield, Chao Zhang, Barry Devereux, Elisabeth Fonteneau, Andrew Thwaites, Xunying Liu, Philip C. Woodland, William D. Marslen-Wilson, Li Su:
On the similarities of representations in artificial and brain neural networks for speech recognition. Frontiers Comput. Neurosci. 16 (2022) - [c175]Qiujia Li, Yu Zhang, David Qiu, Yanzhang He, Liangliang Cao, Philip C. Woodland:
Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition. ICASSP 2022: 6537-6541 - [c174]Xiaoyu Yang, Qiujia Li, Philip C. Woodland:
Knowledge Distillation for Neural Transducers from Large Self-Supervised Pre-Trained Models. ICASSP 2022: 8527-8531 - [c173]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition. INTERSPEECH 2022: 2043-2047 - [c172]Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription. INTERSPEECH 2022: 3844-3848 - [c171]Wen Wu, Chao Zhang, Philip C. Woodland:
Distribution-Based Emotion Recognition in Conversation. SLT 2022: 860-867 - [i31]Wen Wu, Chao Zhang, Xixin Wu, Philip C. Woodland:
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors. CoRR abs/2203.04443 (2022) - [i30]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator. CoRR abs/2205.09058 (2022) - [i29]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition. CoRR abs/2207.00857 (2022) - [i28]Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription. CoRR abs/2207.03852 (2022) - [i27]Evonne P. C. Lee, Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Spectral Clustering-aware Learning of Embeddings for Speaker Diarisation. CoRR abs/2210.13576 (2022) - [i26]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator. CoRR abs/2210.16554 (2022) - [i25]Florian L. Kreyssig, Yangyang Shi, Jinxi Guo, Leda Sari, Abdelrahman Mohamed, Philip C. Woodland:
Biased Self-supervised learning for ASR. CoRR abs/2211.02536 (2022) - [i24]Wen Wu, Chao Zhang, Philip C. Woodland:
Distribution-based Emotion Recognition in Conversation. CoRR abs/2211.04834 (2022) - 2021
- [j36]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Combination of deep speaker embeddings for diarisation. Neural Networks 141: 372-384 (2021) - [j35]Adnan Haider, Chao Zhang, Florian L. Kreyssig, Philip C. Woodland:
A distributed optimisation framework combining natural gradient with Hessian-free for discriminative sequence training. Neural Networks 143: 537-549 (2021) - [c170]Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition. ASRU 2021: 162-168 - [c169]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Tree-Constrained Pointer Generator for End-to-End Contextual Speech Recognition. ASRU 2021: 780-787 - [c168]Wen Wu, Chao Zhang, Philip C. Woodland:
Emotion Recognition by Fusing Time Synchronous and Time Asynchronous Representations. ICASSP 2021: 6269-6273 - [c167]Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman:
Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition. ICASSP 2021: 6388-6392 - [c166]Guangzhi Sun, D. Liu, Chao Zhang, Philip C. Woodland:
Content-Aware Speaker Embeddings for Speaker Diarisation. ICASSP 2021: 7168-7172 - [c165]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Transformer Language Models with LSTM-Based Cross-Utterance Information Representation. ICASSP 2021: 7363-7367 - [c164]Dongcheng Jiang, Chao Zhang, Philip C. Woodland:
Variable Frame Rate Acoustic Models Using Minimum Error Reinforcement Learning. Interspeech 2021: 2601-2605 - [c163]Qiujia Li, Yu Zhang, Bo Li, Liangliang Cao, Philip C. Woodland:
Residual Energy-Based Models for End-to-End Speech Recognition. Interspeech 2021: 4069-4073 - [c162]Qiujia Li, Florian L. Kreyssig, Chao Zhang, Philip C. Woodland:
Discriminative Neural Clustering for Speaker Diarisation. SLT 2021: 574-581 - [i23]Guangzhi Sun, D. Liu, Chao Zhang, Philip C. Woodland:
Content-Aware Speaker Embeddings for Speaker Diarisation. CoRR abs/2102.06467 (2021) - [i22]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Transformer Language Models with LSTM-based Cross-utterance Information Representation. CoRR abs/2102.06474 (2021) - [i21]Adnan Haider, Chao Zhang, Florian L. Kreyssig, Philip C. Woodland:
A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training. CoRR abs/2103.07554 (2021) - [i20]Qiujia Li, Yu Zhang, Bo Li, Liangliang Cao, Philip C. Woodland:
Residual Energy-Based Models for End-to-End Speech Recognition. CoRR abs/2103.14152 (2021) - [i19]Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition. CoRR abs/2108.07789 (2021) - [i18]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition. CoRR abs/2109.00627 (2021) - [i17]Qiujia Li, Yu Zhang, David Qiu, Yanzhang He, Liangliang Cao, Philip C. Woodland:
Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition. CoRR abs/2110.03327 (2021) - 2020
- [c161]Yassir Fathullah, Chao Zhang, Philip C. Woodland:
Improved Large-Margin Softmax Loss for Speaker Diarisation. ICASSP 2020: 7104-7108 - [c160]Florian L. Kreyssig, Philip C. Woodland:
Cosine-Distance Virtual Adversarial Training for Semi-Supervised Speaker-Discriminative Acoustic Embeddings. INTERSPEECH 2020: 3241-3245 - [i16]Florian L. Kreyssig, Philip C. Woodland:
Cosine-Distance Virtual Adversarial Training for Semi-Supervised Speaker-Discriminative Acoustic Embeddings. CoRR abs/2008.03756 (2020) - [i15]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Cross-Utterance Language Models with Acoustic Error Sampling. CoRR abs/2009.01008 (2020) - [i14]Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman:
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition. CoRR abs/2010.11428 (2020) - [i13]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Combination of Deep Speaker Embeddings for Diarisation. CoRR abs/2010.12025 (2020) - [i12]Wen Wu, Chao Zhang, Philip C. Woodland:
Emotion recognition by fusing time synchronous and time asynchronous representations. CoRR abs/2010.14102 (2020)
2010 – 2019
- 2019
- [c159]Qiujia Li, Chao Zhang, Philip C. Woodland:
Integrating Source-Channel and Attention-Based Sequence-to-Sequence Models for Speech Recognition. ASRU 2019: 39-46 - [c158]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Speaker Diarisation Using 2D Self-attentive Combination of Embeddings. ICASSP 2019: 5801-5805 - [c157]Chao Zhang, Florian L. Kreyssig, Qiujia Li, Philip C. Woodland:
PyHTK: Python Library and ASR Pipelines for HTK. ICASSP 2019: 6470-6474 - [c156]Patrick von Platen, Chao Zhang, Philip C. Woodland:
Multi-Span Acoustic Modelling Using Raw Waveform Signals. INTERSPEECH 2019: 1393-1397 - [i11]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Speaker diarisation using 2D self-attentive combination of embeddings. CoRR abs/1902.03190 (2019) - [i10]Patrick von Platen, Chao Zhang, Philip C. Woodland:
Multi-Span Acoustic Modelling using Raw Waveform Signals. CoRR abs/1906.11047 (2019) - [i9]Qiujia Li, Chao Zhang, Philip C. Woodland:
Integrating Source-channel and Attention-based Sequence-to-sequence Models for Speech Recognition. CoRR abs/1909.06614 (2019) - [i8]Qiujia Li, Florian L. Kreyssig, Chao Zhang, Philip C. Woodland:
Discriminative Neural Clustering for Speaker Diarisation. CoRR abs/1910.09703 (2019) - [i7]Yassir Fathullah, Chao Zhang, Philip C. Woodland:
Improved Large-margin Softmax Loss for Speaker Diarisation. CoRR abs/1911.03970 (2019) - 2018
- [c155]Florian L. Kreyssig, Chao Zhang, Philip C. Woodland:
Improved Tdnns Using Deep Kernels and Frequency Dependent Grid-RNNS. ICASSP 2018: 4864-4868 - [c154]Chao Zhang, Philip C. Woodland:
High Order Recurrent Neural Networks for Acoustic Modelling. ICASSP 2018: 5849-5853 - [c153]Yu Wang, Chao Zhang, Mark J. F. Gales, Philip C. Woodland:
Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems. INTERSPEECH 2018: 872-876 - [c152]Chao Zhang, Philip C. Woodland:
Semi-tied Units for Efficient Gating in LSTM and Highway Networks. INTERSPEECH 2018: 1773-1777 - [c151]Adnan Haider, Philip C. Woodland:
Combining Natural Gradient with Hessian Free Methods for Sequence Training. INTERSPEECH 2018: 2918-2922 - [i6]Florian Kreyssig, Chao Zhang, Philip C. Woodland:
Improved TDNNs using Deep Kernels and Frequency Dependent Grid-RNNs. CoRR abs/1802.06412 (2018) - [i5]Chao Zhang, Philip C. Woodland:
High Order Recurrent Neural Networks for Acoustic Modelling. CoRR abs/1802.08314 (2018) - [i4]Adnan Haider, Philip C. Woodland:
Sequence Training of DNN Acoustic Models With Natural Gradient. CoRR abs/1804.02204 (2018) - [i3]Chao Zhang, Philip C. Woodland:
Semi-tied Units for Efficient Gating in LSTM and Highway Networks. CoRR abs/1806.06513 (2018) - [i2]Adnan Haider, Philip C. Woodland:
Combining Natural Gradient with Hessian Free Methods for Sequence Training. CoRR abs/1810.01873 (2018) - 2017
- [j34]Cai Wingfield, Li Su, Xunying Liu, Chao Zhang, Philip C. Woodland, Andrew Thwaites, Elisabeth Fonteneau, William D. Marslen-Wilson:
Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem. PLoS Comput. Biol. 13(9) (2017) - [j33]Penny Karanasou, Chunyang Wu, Mark J. F. Gales, Philip C. Woodland:
I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 818-828 (2017) - [c150]Adnan Haider, Philip C. Woodland:
Sequence training of DNN acoustic models with natural gradient. ASRU 2017: 178-184 - [c149]Chao Zhang, Philip C. Woodland:
Joint optimisation of tandem systems using Gaussian mixture density neural network discriminative sequence training. ICASSP 2017: 5015-5019 - 2016
- [j32]Xunying Liu, Xie Chen, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland:
Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models. IEEE ACM Trans. Audio Speech Lang. Process. 24(8): 1438-1449 (2016) - [j31]Xie Chen, Xunying Liu, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland:
Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 2146-2157 (2016) - [c148]Chao Zhang, Philip C. Woodland:
DNN speaker adaptation using parameterised sigmoid and ReLU hidden activation functions. ICASSP 2016: 5300-5304 - [c147]J. Yang, Chao Zhang, Anton Ragni, Mark J. F. Gales, Philip C. Woodland:
System combination with log-linear models. ICASSP 2016: 5675-5679 - [c146]Linlin Wang, Chao Zhang, Philip C. Woodland, Mark J. F. Gales, Panagiota Karanasou, Pierre Lanchantin, Xunying Liu, Yanmin Qian:
Improved DNN-based segmentation for multi-genre broadcast audio. ICASSP 2016: 5700-5704 - [c145]Xie Chen, Xunying Liu, Y. Qian, Mark J. F. Gales, Philip C. Woodland:
CUED-RNNLM - An open-source toolkit for efficient training and evaluation of recurrent neural network language models. ICASSP 2016: 6000-6004 - [c144]Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, Xunying Liu, Yanman Qian, Linlin Wang, Philip C. Woodland, Chao Zhang:
Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems. INTERSPEECH 2016: 3057-3061 - [c143]Yanmin Qian, Philip C. Woodland:
Very deep convolutional neural networks for robust speech recognition. SLT 2016: 481-488 - [i1]