default search action

combined dblp search
author search
venue search
publication search

ask others

Yanzhang He

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DingQRHRLPWSHLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DingQRHRLPWSHLY24
Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Zhonglin Han, Jian Li, Amir Yazdanbakhsh, Shivani Agrawal:
USM-Lite: Quantization and Sparsity Aware Fine-Tuning for Speech Recognition with Universal Speech Models. ICASSP 2024: 10756-10760
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PrabhavalkarMWS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PrabhavalkarMWS24
Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. ICASSP 2024: 11816-11820
[c58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangWCMSRPSPMZS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangWCMSRPSPMZS24
Weiran Wang, Zelin Wu, Diamantino Caseiro, Tsendsuren Munkhdalai, Khe Chai Sim, Pat Rondon, Golan Pundak, Gan Song, Rohit Prabhavalkar, Zhong Meng, Ding Zhao, Tara Sainath, Yanzhang He, Pedro Moreno Mengibar:
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm. INTERSPEECH 2024
[c57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/WangPSMHLS0QCSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/WangPSMHLS0QCSZ24
Weiran Wang, Rohit Prabhavalkar, Haozhe Shan, Zhong Meng, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Chengjian Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Speech Recognition Models with Time Reduction. NAACL-HLT 2024: 6206-6217
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/QiuRDRH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/QiuRDRH24
David Qiu, David Rim, Shaojin Ding, Oleg Rybakov, Yanzhang He:
Rand: Robustness Aware Norm Decay for Quantized Neural Networks. SLT 2024: 1023-1030
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-17184
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-17184
Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. CoRR abs/2402.17184 (2024)
2023
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/CaiQDHWBPSH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/CaiQDHWBPSH23
Xingyu Cai, David Qiu, Shaojin Ding, Dongseong Hwang, Weiran Wang, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He:
Efficient Cascaded Streaming ASR System Via Frame Rate Reduction. ASRU 2023: 1-8
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/QiuDH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/QiuDH23
David Qiu, Shaojin Ding, Yanzhang He:
The Role of Feature Correlation on Quantized Neural Networks. ASRU 2023: 1-7
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HernandezZDBPSHM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HernandezZDBPSHM23
Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw:
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models. ICASSP 2023: 1-5
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangCSHRDPAPS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangCSHRDPAPS23
W. Ronny Huang, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, David Rybach, Robert David, Rohit Prabhavalkar, Cyril Allauzen, Cal Peyser, Trevor D. Strohman:
E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model. ICASSP 2023: 1-5
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/OMalleyDNWRLHM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/OMalleyDNWRLHM23
Tom O'Malley, Shaojin Ding, Arun Narayanan, Quan Wang, Rajeev Rikhye, Qiao Liang, Yanzhang He, Ian McGraw:
Conditional Conformer: Improving Speaker Modulation For Single And Multi-User Speech Enhancement. ICASSP 2023: 1-5
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangZDZCRSHMK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangZDZCRSHMK23
Weiran Wang, Ding Zhao, Shaojin Ding, Hao Zhang, Shuo-Yiin Chang, David Rybach, Tara N. Sainath, Yanzhang He, Ian McGraw, Shankar Kumar:
Multi-Output RNN-T Joint Networks for Multi-Task Learning of ASR and Auxiliary Tasks. ICASSP 2023: 1-5
[c49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RybakovMDQLRH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RybakovMDQLRH23
Oleg Rybakov, Phoenix Meadowlark, Shaojin Ding, David Qiu, Jian Li, David Rim, Yanzhang He:
2-bit Conformer quantization for automatic speech recognition. INTERSPEECH 2023: 4908-4912
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-08343
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-08343
Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw:
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models. CoRR abs/2303.08343 (2023)
[i34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-15536
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15536
David Qiu, David Rim, Shaojin Ding, Oleg Rybakov, Yanzhang He:
RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models. CoRR abs/2305.15536 (2023)
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-12963
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12963
Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Models for Short Search Queries. CoRR abs/2309.12963 (2023)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-08553
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-08553
Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Shivani Agrawal, Zhonglin Han, Jian Li, Amir Yazdanbakhsh:
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models. CoRR abs/2312.08553 (2023)
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-09463
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09463
Antoine Bruguier, David Qiu, Yanzhang He:
Partial Rewriting for Multi-Stage ASR. CoRR abs/2312.09463 (2023)
2022
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiZQHCW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiZQHCW22
Qiujia Li, Yu Zhang, David Qiu, Yanzhang He, Liangliang Cao, Philip C. Woodland:
Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition. ICASSP 2022: 6537-6541
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HwangMHSGQSSBH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HwangMHSGQSSBH22
Dongseong Hwang, Ananya Misra, Zhouyuan Huo, Nikhil Siddhartha, Shefali Garg, David Qiu, Khe Chai Sim, Trevor Strohman, Françoise Beaufays, Yanzhang He:
Large-Scale ASR Domain Adaptation Using Self- and Semi-Supervised Learning. ICASSP 2022: 6627-6631
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SainathHNBWQCPG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathHNBWQCPG22
Tara N. Sainath, Yanzhang He, Arun Narayanan, Rami Botros, Weiran Wang, David Qiu, Chung-Cheng Chiu, Rohit Prabhavalkar, Alexander Gruenstein, Anmol Gulati, Bo Li, David Rybach, Emmanuel Guzman, Ian McGraw, James Qin, Krzysztof Choromanski, Qiao Liang, Robert David, Ruoming Pang, Shuo-Yiin Chang, Trevor Strohman, W. Ronny Huang, Wei Han, Yonghui Wu, Yu Zhang:
Improving The Latency And Quality Of Cascaded Encoders. ICASSP 2022: 8112-8116
[c45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangCSVPHRGMPSH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangCSVPHRGMPSH22
Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, W. Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach:
Improving Rare Word Recognition with LM-aware MWER Training. INTERSPEECH 2022: 1031-1035
[c44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DingWZSHDBWPLHM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DingWZSHDBWPLHM22
Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman:
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes. INTERSPEECH 2022: 1706-1710
[c43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DingMHLAR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DingMHLAR22
Shaojin Ding, Phoenix Meadowlark, Yanzhang He, Lukasz Lew, Shivani Agrawal, Oleg Rybakov:
4-bit Conformer with Native Quantization Aware Training for Speech Recognition. INTERSPEECH 2022: 1711-1715
[c42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChangLSZSLH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChangLSZSLH22
Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Trevor Strohman, Qiao Liang, Yanzhang He:
Turn-Taking Prediction for Natural Conversational Speech. INTERSPEECH 2022: 1821-1825
[c41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiSPCXSCLLHHB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiSPCXSCLLHHB22
Bo Li, Tara N. Sainath, Ruoming Pang, Shuo-Yiin Chang, Qiumin Xu, Trevor Strohman, Vince Chen, Qiao Liang, Heguang Liu, Yanzhang He, Parisa Haghani, Sameer Bidichandani:
A Language Agnostic Multilingual Streaming On-Device ASR System. INTERSPEECH 2022: 3188-3192
[c40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DingRLHWNOM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DingRLHWNOM22
Shaojin Ding, Rajeev Rikhye, Qiao Liang, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw:
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition. INTERSPEECH 2022: 3744-3748
[c39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuSHPSMW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuSHPSMW22
Ke Hu, Tara N. Sainath, Yanzhang He, Rohit Prabhavalkar, Trevor Strohman, Sepand Mavandadi, Weiran Wang:
Improving Deliberation by Text-Only and Semi-Supervised Training. INTERSPEECH 2022: 4940-4944
[c38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/RikhyeWLHM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/RikhyeWLHM22
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw:
Closing the Gap Between Single-User and Multi-User VoiceFilter-Lite. Odyssey 2022: 294-300
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/QiuMHS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/QiuMHS22
David Qiu, Tsendsuren Munkhdalai, Yanzhang He, Khe Chai Sim:
Context-Aware Neural Confidence Estimation for Rare Word Speech Recognition. SLT 2022: 31-37
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/BruguierQSH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/BruguierQSH22
Antoine Bruguier, David Qiu, Trevor Strohman, Yanzhang He:
Flickering Reduction with Partial Hypothesis Reranking for Streaming ASR. SLT 2022: 38-45
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/BijwadiaCLSZH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/BijwadiaCLSZH22
Shaan Bijwadia, Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Yanzhang He:
Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems. SLT 2022: 310-316
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-12169
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-12169
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw:
Closing the Gap between Single-User and Multi-User VoiceFilter-Lite. CoRR abs/2202.12169 (2022)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15952
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15952
Shaojin Ding, Phoenix Meadowlark, Yanzhang He, Lukasz Lew, Shivani Agrawal, Oleg Rybakov:
4-bit Conformer with Native Quantization Aware Training for Speech Recognition. CoRR abs/2203.15952 (2022)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-03793
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-03793
Shaojin Ding, Rajeev Rikhye, Qiao Liang, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw:
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition. CoRR abs/2204.03793 (2022)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-06164
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-06164
Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman:
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes. CoRR abs/2204.06164 (2022)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-07553
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-07553
Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, W. Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach:
Improving Rare Word Recognition with LM-aware MWER Training. CoRR abs/2204.07553 (2022)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-14716
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-14716
Ke Hu, Tara N. Sainath, Yanzhang He, Rohit Prabhavalkar, Trevor Strohman, Sepand Mavandadi, Weiran Wang:
Improving Deliberation by Text-Only and Semi-Supervised Training. CoRR abs/2206.14716 (2022)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-13321
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-13321
Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Trevor Strohman, Qiao Liang, Yanzhang He:
Turn-Taking Prediction for Natural Conversational Speech. CoRR abs/2208.13321 (2022)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-13916
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-13916
Bo Li, Tara N. Sainath, Ruoming Pang, Shuo-Yiin Chang, Qiumin Xu, Trevor Strohman, Vince Chen, Qiao Liang, Heguang Liu, Yanzhang He, Parisa Haghani, Sameer Bidichandani:
A Language Agnostic Multilingual Streaming On-Device ASR System. CoRR abs/2208.13916 (2022)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-00786
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00786
Shaan Bijwadia, Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Yanzhang He:
Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems. CoRR abs/2211.00786 (2022)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-15432
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-15432
W. Ronny Huang, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, David Rybach, Robert David, Rohit Prabhavalkar, Cyril Allauzen, Cal Peyser, Trevor D. Strohman:
E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model. CoRR abs/2211.15432 (2022)
2021
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/RikhyeWLHM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/RikhyeWLHM21
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw:
Multi-User Voicefilter-Lite via Attentive Speaker Embedding. ASRU 2021: 275-282
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/NarayananCOWH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/NarayananCOWH21
Arun Narayanan, Chung-Cheng Chiu, Tom O'Malley, Quan Wang, Yanzhang He:
Cross-Attention Conformer for Context Modeling in Speech Enhancement for ASR. ASRU 2021: 312-319
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiGYSCNCPHQ0LZS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiGYSCNCPHQ0LZS21
Bo Li, Anmol Gulati, Jiahui Yu, Tara N. Sainath, Chung-Cheng Chiu, Arun Narayanan, Shuo-Yiin Chang, Ruoming Pang, Yanzhang He, James Qin, Wei Han, Qiao Liang, Yu Zhang, Trevor Strohman, Yonghui Wu:
A Better and Faster end-to-end Model for Streaming ASR. ICASSP 2021: 5634-5638
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PrabhavalkarHRC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PrabhavalkarHRC21
Rohit Prabhavalkar, Yanzhang He, David Rybach, Sean Campbell, Arun Narayanan, Trevor Strohman, Tara N. Sainath:
Less is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging. ICASSP 2021: 5659-5663
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuCLCSHNHGWP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuCLCSHNHGWP21
Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, Arun Narayanan, Wei Han, Anmol Gulati, Yonghui Wu, Ruoming Pang:
FastEmit: Low-Latency Streaming ASR with Sequence-Level Emission Regularization. ICASSP 2021: 6004-6008
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiQZLHWCS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiQZLHWCS21
Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman:
Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition. ICASSP 2021: 6388-6392
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/QiuLHZLCPBLHSM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QiuLHZLCPBLHSM21
David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw:
Learning Word-Level Confidence for Subword End-To-End ASR. ICASSP 2021: 6393-6397
[c27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SainathHNBPRAVQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathHNBPRAVQ21
Tara N. Sainath, Yanzhang He, Arun Narayanan, Rami Botros, Ruoming Pang, David Rybach, Cyril Allauzen, Ehsan Variani, James Qin, Quoc-Nam Le-The, Shuo-Yiin Chang, Bo Li, Anmol Gulati, Jiahui Yu, Chung-Cheng Chiu, Diamantino Caseiro, Wei Li, Qiao Liang, Pat Rondon:
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling. Interspeech 2021: 1777-1781
[c26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/QiuHLZCM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QiuHLZCM21
David Qiu, Yanzhang He, Qiujia Li, Yu Zhang, Liangliang Cao, Ian McGraw:
Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction. Interspeech 2021: 4074-4078
[c25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RikhyeWLHZHNM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RikhyeWLHZHNM21
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ding Zhao, Yiteng Huang, Arun Narayanan, Ian McGraw:
Personalized Keyphrase Detection Using Speaker and Environment Information. Interspeech 2021: 4204-4208
[c24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BotrosSDG0H21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BotrosSDG0H21
Rami Botros, Tara N. Sainath, Robert David, Emmanuel Guzman, Wei Li, Yanzhang He:
Tied & Reduced RNN-T Decoder. Interspeech 2021: 4563-4567
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-06716
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-06716
David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw:
Learning Word-Level Confidence For Subword End-to-End ASR. CoRR abs/2103.06716 (2021)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-12870
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-12870
David Qiu, Yanzhang He, Qiujia Li, Yu Zhang, Liangliang Cao, Ian McGraw:
Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction. CoRR abs/2104.12870 (2021)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-13970
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-13970
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ding Zhao, Yiteng Huang, Arun Narayanan, Ian McGraw:
Personalized Keyphrase Detection using Speaker and Environment Information. CoRR abs/2104.13970 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-01201
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-01201
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw:
Multi-user VoiceFilter-Lite via Attentive Speaker Embedding. CoRR abs/2107.01201 (2021)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-07513
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-07513
Rami Botros, Tara N. Sainath, Robert David, Emmanuel Guzman, Wei Li, Yanzhang He:
Tied & Reduced RNN-T Decoder. CoRR abs/2109.07513 (2021)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-00165
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-00165
Dongseong Hwang, Ananya Misra, Zhouyuan Huo, Nikhil Siddhartha, Shefali Garg, David Qiu, Khe Chai Sim, Trevor Strohman, Françoise Beaufays, Yanzhang He:
Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning. CoRR abs/2110.00165 (2021)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-03327
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-03327
Qiujia Li, Yu Zhang, David Qiu, Yanzhang He, Liangliang Cao, Philip C. Woodland:
Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition. CoRR abs/2110.03327 (2021)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-00127
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-00127
Arun Narayanan, Chung-Cheng Chiu, Tom O'Malley, Quan Wang, Yanzhang He:
Cross-attention conformer for context modeling in speech enhancement for ASR. CoRR abs/2111.00127 (2021)
2020
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SainathHLNPBCLA20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathHLNPBCLA20
Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alexander Gruenstein, Ke Hu, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirkó Visontai, Yonghui Wu, Yu Zhang, Ding Zhao:
A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency. ICASSP 2020: 6059-6063
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiCSPHSW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiCSPHSW20
Bo Li, Shuo-Yiin Chang, Tara N. Sainath, Ruoming Pang, Yanzhang He, Trevor Strohman, Yonghui Wu:
Towards Fast and Accurate Streaming End-To-End ASR. ICASSP 2020: 6069-6073
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SainathPWHCS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathPWHCS20
Tara N. Sainath, Ruoming Pang, Ron J. Weiss, Yanzhang He, Chung-Cheng Chiu, Trevor Strohman:
An Attention-Based Joint Acoustic and Text on-Device End-To-End Model. ICASSP 2020: 7039-7043
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShangguanKHMB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShangguanKHMB20
Yuan Shangguan, Kate Knister, Yanzhang He, Ian McGraw, Françoise Beaufays:
Analyzing the Quality and Stability of a Streaming End-to-End On-Device Speech Recognizer. INTERSPEECH 2020: 591-595
[c19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Chang0RHLSS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Chang0RHLSS20
Shuo-Yiin Chang, Bo Li, David Rybach, Yanzhang He, Wei Li, Tara N. Sainath, Trevor Strohman:
Low Latency Speech Recognition Using End-to-End Prefetching. INTERSPEECH 2020: 1962-1966
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiQCPH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiQCPH20
Wei Li, James Qin, Chung-Cheng Chiu, Ruoming Pang, Yanzhang He:
Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition. INTERSPEECH 2020: 2122-2126
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLSWCLHLPNG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLSWCLHLPNG20
Quan Wang, Ignacio López-Moreno, Mert Saglam, Kevin W. Wilson, Alan Chiao, Renjie Liu, Yanzhang He, Wei Li, Jason Pelecanos, Marily Nika, Alexander Gruenstein:
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition. INTERSPEECH 2020: 2677-2681
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-12710
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-12710
Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alexander Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirkó Visontai, Yonghui Wu, Yu Zhang, Ding Zhao:
A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency. CoRR abs/2003.12710 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-01416
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-01416
Yuan Shangguan, Kate Knister, Yanzhang He, Ian McGraw, Françoise Beaufays:
Analyzing the Quality and Stability of a Streaming End-to-End On-Device Speech Recognizer. CoRR abs/2006.01416 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-13093
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-13093
Wei Li, James Qin, Chung-Cheng Chiu, Ruoming Pang, Yanzhang He:
Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition. CoRR abs/2008.13093 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-04323
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-04323
Quan Wang, Ignacio López-Moreno, Mert Saglam, Kevin W. Wilson, Alan Chiao, Renjie Liu, Yanzhang He, Wei Li, Jason Pelecanos, Marily Nika, Alexander Gruenstein:
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition. CoRR abs/2009.04323 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11148
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11148
Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, Arun Narayanan, Wei Han, Anmol Gulati, Yonghui Wu, Ruoming Pang:
FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization. CoRR abs/2010.11148 (2020)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11428
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11428
Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman:
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition. CoRR abs/2010.11428 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-10798
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-10798
Bo Li, Anmol Gulati, Jiahui Yu, Tara N. Sainath, Chung-Cheng Chiu, Arun Narayanan, Shuo-Yiin Chang, Ruoming Pang, Yanzhang He, James Qin, Wei Han, Qiao Liang, Yu Zhang, Trevor Strohman, Yonghui Wu:
A Better and Faster End-to-End Model for Streaming ASR. CoRR abs/2011.10798 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-06749
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-06749
Rohit Prabhavalkar, Yanzhang He, David Rybach, Sean Campbell, Arun Narayanan, Trevor Strohman, Tara N. Sainath:
Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging. CoRR abs/2012.06749 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChangPHSS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChangPHSS19
Shuo-Yiin Chang, Rohit Prabhavalkar, Yanzhang He, Tara N. Sainath, Gabor Simko:
Joint Endpointing and Decoding with End-to-end Models. ICASSP 2019: 5626-5630
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeSPMAZRKWPLBSL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeSPMAZRKWPLBSL19
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition for Mobile Devices. ICASSP 2019: 6381-6385
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SainathPRHPLVLS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathPRHPLVLS19
Tara N. Sainath, Ruoming Pang, David Rybach, Yanzhang He, Rohit Prabhavalkar, Wei Li, Mirkó Visontai, Qiao Liang, Trevor Strohman, Yonghui Wu, Ian McGraw, Chung-Cheng Chiu:
Two-Pass End-to-End Speech Recognition. INTERSPEECH 2019: 2773-2777
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-08295
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-08295
Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia Xu Chen, Ye Jia, Anjuli Kannan, Tara N. Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George F. Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel Bacchiani, Thomas B. Jablin, Robert Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon:
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling. CoRR abs/1902.08295 (2019)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-10992
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-10992
Tara N. Sainath, Ruoming Pang, David Rybach, Yanzhang He, Rohit Prabhavalkar, Wei Li, Mirkó Visontai, Qiao Liang, Trevor Strohman, Yonghui Wu, Ian McGraw, Chung-Cheng Chiu:
Two-Pass End-to-End Speech Recognition. CoRR abs/1908.10992 (2019)
2018
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-06621
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-06621
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition For Mobile Devices. CoRR abs/1811.06621 (2018)
2017
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/appt/HeJDF17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/appt/HeJDF17
Yanzhang He, Xiaohong Jiang, Changbo Dai, Zikun Fan:
Self-adaptive Failure Detector for Peer-to-Peer Distributed System Considering the Link Faults. APPT 2017: 64-75
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HePRLBM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HePRLBM17
Yanzhang He, Rohit Prabhavalkar, Kanishka Rao, Wei Li, Anton Bakhtin, Ian McGraw:
Streaming small-footprint keyword spotting using sequence-to-sequence models. ASRU 2017: 474-481
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1710-09617
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-09617
Yanzhang He, Rohit Prabhavalkar, Kanishka Rao, Wei Li, Anton Bakhtin, Ian McGraw:
Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models. CoRR abs/1710.09617 (2017)
2016
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/HeBFHJOFP16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HeBFHJOFP16
Yanzhang He, Peter Baumann, Hao Fang, Brian Hutchinson, Aaron Jaech, Mari Ostendorf, Eric Fosler-Lussier, Janet B. Pierrehumbert:
Using Pronunciation-Based Morphological Subword Units to Improve OOV Handling in Keyword Search. IEEE ACM Trans. Audio Speech Lang. Process. 24(1): 79-92 (2016)
2015
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/BagchiMWHPF15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/BagchiMWHPF15
Deblin Bagchi, Michael I. Mandel, Zhongqiu Wang, Yanzhang He, Andrew R. Plummer, Eric Fosler-Lussier:
Combining spectral feature mapping and multi-channel model-based source separation for noise-robust automatic speech recognition. ASRU 2015: 496-503
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SuPHH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SuPHH15
Hang Su, Van Tung Pham, Yanzhang He, James Hieronymus:
Improvements on transducing syllable lattice to word lattice for keyword search. ICASSP 2015: 4729-4733
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HanHBFW15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HanHBFW15
Kun Han, Yanzhang He, Deblin Bagchi, Eric Fosler-Lussier, DeLiang Wang:
Deep neural network based spectral feature mapping for robust speech recognition. INTERSPEECH 2015: 2484-2488
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HeF15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HeF15
Yanzhang He, Eric Fosler-Lussier:
Segmental conditional random fields with deep neural networks as acoustic models for first-pass word recognition. INTERSPEECH 2015: 2640-2644
2014
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/IEEEcloud/HeJWYC14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/IEEEcloud/HeJWYC14
Yanzhang He, Xiaohong Jiang, Zhaohui Wu, Kejiang Ye, Zhongzhong Chen:
Scalability Analysis and Improvement of Hadoop Virtual Cluster with Cost Consideration. IEEE CLOUD 2014: 594-601
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/hpcc/LiJH14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hpcc/LiJH14
Xiang Li, Xiaohong Jiang, Yanzhang He:
Virtual Machine Scheduling Considering Both Computing and Cooling Energy. HPCC/CSS/ICESS 2014: 244-247
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeHBOFP14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeHBOFP14
Yanzhang He, Brian Hutchinson, Peter Baumann, Mari Ostendorf, Eric Fosler-Lussier, Janet B. Pierrehumbert:
Subword-based modeling for handling OOV words inkeyword spotting. ICASSP 2014: 7864-7868
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SuHHFW14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SuHHFW14
Hang Su, James Hieronymus, Yanzhang He, Eric Fosler-Lussier, Steven Wegmann:
Syllable based keyword search: Transducing syllable lattices to word lattices. SLT 2014: 489-494
2013
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/pieee/Fosler-LussierHJP13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pieee/Fosler-LussierHJP13
Eric Fosler-Lussier, Yanzhang He, Preethi Jyothi, Rohit Prabhavalkar:
Conditional Random Fields in Speech, Audio, and Language Processing. Proc. IEEE 101(5): 1054-1075 (2013)
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/appt/HeJYML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/appt/HeJYML13
Yanzhang He, Xiaohong Jiang, Kejiang Ye, Ran Ma, Xiang Li:
HPACS: A High Privacy and Availability Cloud Storage Platform with Matrix Encryption. APPT 2013: 132-145
2012
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/cluster/YeJHLYH12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cluster/YeJHLYH12
Kejiang Ye, Xiaohong Jiang, Yanzhang He, Xiang Li, Haiming Yan, Peng Huang:
vHadoop: A Scalable Hadoop Virtual Cluster Platform for MapReduce-Based Parallel Machine Learning with Performance Consideration. CLUSTER Workshops 2012: 152-160
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HeF12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HeF12
Yanzhang He, Eric Fosler-Lussier:
Efficient Segmental Conditional Random Fields for One-Pass Phone Recognition. INTERSPEECH 2012: 1898-1901

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.