
Tara N. Sainath

Person information

affiliation: Google Inc., New York, NY, USA
affiliation: IBM T. J. Watson Research Center, Yorktown Heights, NY, USA

2020 – today

2024
[j11]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/PrabhavalkarHSSW24
Rohit Prabhavalkar, Takaaki Hori, Tara N. Sainath, Ralf Schlüter, Shinji Watanabe:
End-to-End Speech Recognition: A Survey. IEEE ACM Trans. Audio Speech Lang. Process. 32: 325-351 (2024)
[c183]
  persistent URL:
  - https://dblp.org/rec/conf/acl/WuLZCLBSW24
Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N. Sainath, Philip C. Woodland:
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation. ACL (1) 2024: 2078-2093
[c182]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SimHMSSM0S24
Khe Chai Sim, Zhouyuan Huo, Tsendsuren Munkhdalai, Nikhil Siddhartha, Adam Stooke, Zhong Meng, Bo Li, Tara N. Sainath:
A Comparison of Parameter-Efficient ASR Domain Adaptation Methods for Universal Speech and Language Models. ICASSP 2024: 6900-6904
[c181]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DingQRHRLPWSHLY24
Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Zhonglin Han, Jian Li, Amir Yazdanbakhsh, Shivani Agrawal:
USM-Lite: Quantization and Sparsity Aware Fine-Tuning for Speech Recognition with Universal Speech Models. ICASSP 2024: 10756-10760
[c180]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Bai0LSS24
Junwen Bai, Bo Li, Qiujia Li, Tara N. Sainath, Trevor Strohman:
Efficient Adapter Finetuning for Tail Languages in Streaming Multilingual ASR. ICASSP 2024: 10841-10845
[c179]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PrabhavalkarMWS24
Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. ICASSP 2024: 11816-11820
[c178]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShanGMWCS24
Haozhe Shan, Albert Gu, Zhong Meng, Weiran Wang, Krzysztof Choromanski, Tara N. Sainath:
Augmenting Conformers With Structured State-Space Sequence Models For Online Speech Recognition. ICASSP 2024: 12221-12225
[c177]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GargHSSCAMKWMHS24
Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara N. Sainath, Françoise Beaufays, Pedro Moreno Mengibar:
Improving Speech Recognition for African American English with Audio Classification. ICASSP 2024: 12356-12360
[c176]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangACGHQ0WCS24
W. Ronny Huang, Cyril Allauzen, Tongzhou Chen, Kilol Gupta, Ke Hu, James Qin, Yu Zhang, Yongqiang Wang, Shuo-Yiin Chang, Tara N. Sainath:
Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study. ICASSP 2024: 13306-13310
[c175]
  persistent URL:
  - https://dblp.org/rec/conf/naacl/WangPSMHLS0QCSZ24
Weiran Wang, Rohit Prabhavalkar, Haozhe Shan, Zhong Meng, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Chengjian Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Speech Recognition Models with Time Reduction. NAACL-HLT 2024: 6206-6217
[i96]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-08992
Junwen Bai, Bo Li, Qiujia Li, Tara N. Sainath, Trevor Strohman:
Efficient Adapter Finetuning for Tail Languages in Streaming Multilingual ASR. CoRR abs/2401.08992 (2024)
[i95]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-12789
W. Ronny Huang, Cyril Allauzen, Tongzhou Chen, Kilol Gupta, Ke Hu, James Qin, Yu Zhang, Yongqiang Wang, Shuo-Yiin Chang, Tara N. Sainath:
Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study. CoRR abs/2401.12789 (2024)
[i94]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-12862
Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N. Sainath, Philip C. Woodland:
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation. CoRR abs/2402.12862 (2024)
[i93]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-17184
Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. CoRR abs/2402.17184 (2024)
[i92]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-19709
Tsendsuren Munkhdalai, Youzheng Chen, Khe Chai Sim, Fadi Biadsy, Tara N. Sainath, Pedro Moreno Mengibar:
Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models. CoRR abs/2403.19709 (2024)
[i91]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02921
Zhong Meng, Zelin Wu, Rohit Prabhavalkar, Cal Peyser, Weiran Wang, Nanxin Chen, Tara N. Sainath, Bhuvana Ramabhadran:
Text Injection for Neural Contextual Biasing. CoRR abs/2406.02921 (2024)
2023
[c174]
  persistent URL:
  - https://dblp.org/rec/conf/asru/ArumugamCSPWB23
Guru Prakash Arumugam, Shuo-Yiin Chang, Tara N. Sainath, Rohit Prabhavalkar, Quan Wang, Shaan Bijwadia:
Improved Long-Form Speech Recognition By Jointly Modeling The Primary And Non-Primary Speakers. ASRU 2023: 1-8
[c173]
  persistent URL:
  - https://dblp.org/rec/conf/asru/CaiQDHWBPSH23
Xingyu Cai, David Qiu, Shaojin Ding, Dongseong Hwang, Weiran Wang, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He:
Efficient Cascaded Streaming ASR System Via Frame Rate Reduction. ASRU 2023: 1-8
[c172]
  persistent URL:
  - https://dblp.org/rec/conf/asru/HuSLZCWZL23
Ke Hu, Tara N. Sainath, Bo Li, Yu Zhang, Yong Cheng, Tao Wang, Yujing Zhang, Frederick Liu:
Improving Multilingual and Code-Switching ASR Using Large Language Model Generated Text. ASRU 2023: 1-7
[c171]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BotrosPSCSB23
Rami Botros, Rohit Prabhavalkar, Johan Schalkwyk, Ciprian Chelba, Tara N. Sainath, Françoise Beaufays:
Lego-Features: Exporting Modular Encoder Features for Streaming and Deliberation ASR. ICASSP 2023: 1-5
[c170]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChangZSLS23
Shuo-Yiin Chang, Chao Zhang, Tara N. Sainath, Bo Li, Trevor Strohman:
Context-Aware end-to-end ASR Using Self-Attentive Embedding and Tensor Fusion. ICASSP 2023: 1-5
[c169]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HernandezZDBPSHM23
Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw:
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models. ICASSP 2023: 1-5
[c168]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuSLDHDZCCS23
Ke Hu, Tara N. Sainath, Bo Li, Nan Du, Yanping Huang, Andrew M. Dai, Yu Zhang, Rodrigo Cabrera, Zhifeng Chen, Trevor Strohman:
Massively Multilingual Shallow Fusion with Large Language Models. ICASSP 2023: 1-5
[c167]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangCSHRDPAPS23
W. Ronny Huang, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, David Rybach, Robert David, Rohit Prabhavalkar, Cyril Allauzen, Cal Peyser, Trevor D. Strohman:
E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model. ICASSP 2023: 1-5
[c166]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuoSLHSS23
Zhouyuan Huo, Khe Chai Sim, Bo Li, Dongseong Hwang, Tara N. Sainath, Trevor Strohman:
Resource-Efficient Transfer Learning from Speech Foundation Model Using Hierarchical Feature Fusion. ICASSP 2023: 1-5
[c165]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiHHBPSSZHSB23
Bo Li, Dongseong Hwang, Zhouyuan Huo, Junwen Bai, Guru Prakash, Tara N. Sainath, Khe Chai Sim, Yu Zhang, Wei Han, Trevor Strohman, Françoise Beaufays:
Efficient Domain Adaptation for Speech Foundation Models. ICASSP 2023: 1-5
[c164]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengWPSCVZLRR23
Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. ICASSP 2023: 1-5
[c163]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PeyserPCPHS23
Cal Peyser, Michael Picheny, Kyunghyun Cho, Rohit Prabhavalkar, W. Ronny Huang, Tara N. Sainath:
A Comparison of Semi-Supervised Learning Techniques for Streaming ASR at Scale. ICASSP 2023: 1-5
[c162]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathPCRA23
Tara N. Sainath, Rohit Prabhavalkar, Diamantino Caseiro, Pat Rondon, Cyril Allauzen:
Improving Contextual Biasing with Text Injection. ICASSP 2023: 1-5
[c161]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangZDZCRSHMK23
Weiran Wang, Ding Zhao, Shaojin Ding, Hao Zhang, Shuo-Yiin Chang, David Rybach, Tara N. Sainath, Yanzhang He, Ian McGraw, Shankar Kumar:
Multi-Output RNN-T Joint Networks for Multi-Task Learning of ASR and Auxiliary Tasks. ICASSP 2023: 1-5
[c160]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangLZCPSS23
Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Rohit Prabhavalkar, Tara N. Sainath, Trevor Strohman:
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition. ICASSP 2023: 1-5
[c159]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangLZCSSL23
Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. ICASSP 2023: 1-5
[c158]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangLSSC23
Chao Zhang, Bo Li, Tara N. Sainath, Trevor Strohman, Shuo-Yiin Chang:
UML: A Universal Monolingual Output Layer For Multilingual ASR. ICASSP 2023: 1-5
[c157]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenY00CCPLS23
Zih-Ching Chen, Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Shuo-Yiin Chang, Rohit Prabhavalkar, Hung-yi Lee, Tara N. Sainath:
How to Estimate Model Transferability of Pre-Trained Speech Models? INTERSPEECH 2023: 456-460
[c156]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuoSHMSM23
Zhouyuan Huo, Khe Chai Sim, Dongseong Hwang, Tsendsuren Munkhdalai, Tara N. Sainath, Pedro Moreno Mengibar:
Re-investigating the Efficient Transfer Learning of Speech Foundation Model using Feature Fusion Methods. INTERSPEECH 2023: 556-560
[c155]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PeyserMPRSPCH23
Cal Peyser, Zhong Meng, Rohit Prabhavalkar, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho, Ke Hu:
Improving Joint Speech-Text Representations Without Alignment. INTERSPEECH 2023: 1354-1358
[c154]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangZKCS23
W. Ronny Huang, Hao Zhang, Shankar Kumar, Shuo-Yiin Chang, Tara N. Sainath:
Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR. INTERSPEECH 2023: 2778-2782
[c153]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Hu0S0B23
Ke Hu, Bo Li, Tara N. Sainath, Yu Zhang, Françoise Beaufays:
Mixture-of-Expert Conformer for Streaming Multilingual ASR. INTERSPEECH 2023: 3327-3331
[c152]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Li0HSM23
Qiujia Li, Bo Li, Dongseong Hwang, Tara N. Sainath, Pedro Moreno Mengibar:
Modular Domain Adaptation for Conformer-Based Streaming ASR. INTERSPEECH 2023: 3357-3361
[i90]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-04327
Cal Peyser, W. Ronny Huang, Tara N. Sainath, Rohit Prabhavalkar, Michael Picheny, Kyunghyun Cho:
Dual Learning for Large Vocabulary On-Device ASR. CoRR abs/2301.04327 (2023)
[i89]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-07851
Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Rohit Prabhavalkar, Tara N. Sainath, Trevor Strohman:
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition. CoRR abs/2301.07851 (2023)
[i88]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-01496
Bo Li, Dongseong Hwang, Zhouyuan Huo, Junwen Bai, Guru Prakash, Tara N. Sainath, Khe Chai Sim, Yu Zhang, Wei Han, Trevor Strohman, Françoise Beaufays:
Efficient Domain Adaptation for Speech Foundation Models. CoRR abs/2302.01496 (2023)
[i87]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08583
Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. CoRR abs/2302.08583 (2023)
[i86]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08917
Ke Hu, Tara N. Sainath, Bo Li, Nan Du, Yanping Huang, Andrew M. Dai, Yu Zhang, Rodrigo Cabrera, Zhifeng Chen, Trevor Strohman:
Massively Multilingual Shallow Fusion with Large Language Models. CoRR abs/2302.08917 (2023)
[i85]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-11186
Chao Zhang, Bo Li, Tara N. Sainath, Trevor Strohman, Shuo-Yiin Chang:
UML: A Universal Monolingual Output Layer for Multilingual ASR. CoRR abs/2302.11186 (2023)
[i84]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-01037
Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna, Zhehuai Chen, Nanxin Chen, Bo Li, Vera Axelrod, Gary Wang, Zhong Meng, Ke Hu, Andrew Rosenberg, Rohit Prabhavalkar, Daniel S. Park, Parisa Haghani, Jason Riesa, Ginger Perng, Hagen Soltau, Trevor Strohman, Bhuvana Ramabhadran, Tara N. Sainath, Pedro J. Moreno, Chung-Cheng Chiu, Johan Schalkwyk, Françoise Beaufays, Yonghui Wu:
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages. CoRR abs/2303.01037 (2023)
[i83]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-03329
Rohit Prabhavalkar, Takaaki Hori, Tara N. Sainath, Ralf Schlüter, Shinji Watanabe:
End-to-End Speech Recognition: A Survey. CoRR abs/2303.03329 (2023)
[i82]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-08343
Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw:
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models. CoRR abs/2303.08343 (2023)
[i81]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-15293
Sepand Mavandadi, Tara N. Sainath, Ke Hu, Zelin Wu:
A Deliberation-based Joint Acoustic and Text Decoder. CoRR abs/2303.15293 (2023)
[i80]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-00171
Rami Botros, Anmol Gulati, Tara N. Sainath, Krzysztof Choromanski, Ruoming Pang, Trevor Strohman, Weiran Wang, Jiahui Yu:
Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR. CoRR abs/2304.00171 (2023)
[i79]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-00173
Rami Botros, Rohit Prabhavalkar, Johan Schalkwyk, Ciprian Chelba, Tara N. Sainath, Françoise Beaufays:
Lego-Features: Exporting modular encoder features for streaming and deliberation ASR. CoRR abs/2304.00173 (2023)
[i78]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-11053
Cal Peyser, Michael Picheny, Kyunghyun Cho, Rohit Prabhavalkar, W. Ronny Huang, Tara N. Sainath:
A Comparison of Semi-Supervised Learning Techniques for Streaming ASR at Scale. CoRR abs/2304.11053 (2023)
[i77]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13408
Qiujia Li, Bo Li, Dongseong Hwang, Tara N. Sainath, Pedro Moreno Mengibar:
Modular Domain Adaptation for Conformer-Based Streaming ASR. CoRR abs/2305.13408 (2023)
[i76]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15663
Ke Hu, Bo Li, Tara N. Sainath, Yu Zhang, Françoise Beaufays:
Mixture-of-Expert Conformer for Streaming Multilingual ASR. CoRR abs/2305.15663 (2023)
[i75]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18419
W. Ronny Huang, Hao Zhang, Shankar Kumar, Shuo-Yiin Chang, Tara N. Sainath:
Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR. CoRR abs/2305.18419 (2023)
[i74]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-01015
Zih-Ching Chen, Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Shuo-Yiin Chang, Rohit Prabhavalkar, Hung-yi Lee, Tara N. Sainath:
How to Estimate Model Transferability of Pre-Trained Speech Models? CoRR abs/2306.01015 (2023)
[i73]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-12925
Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara N. Sainath, Johan Schalkwyk, Matthew Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirovic, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats, Neil Zeghidour, Yu Zhang, Zhishuai Zhang, Lukas Zilka, Christian Havnø Frank:
AudioPaLM: A Large Language Model That Can Speak and Listen. CoRR abs/2306.12925 (2023)
[i72]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-06125
Cal Peyser, Zhong Meng, Ke Hu, Rohit Prabhavalkar, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho:
Improving Joint Speech-Text Representations Without Alignment. CoRR abs/2308.06125 (2023)
[i71]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-07395
Shaan Bijwadia, Shuo-Yiin Chang, Weiran Wang, Zhong Meng, Hao Zhang, Tara N. Sainath:
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models. CoRR abs/2308.07395 (2023)
[i70]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08551
Haozhe Shan, Albert Gu, Zhong Meng, Weiran Wang, Krzysztof Choromanski, Tara N. Sainath:
Augmenting conformers with structured state space models for online speech recognition. CoRR abs/2309.08551 (2023)
[i69]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-09996
Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara N. Sainath, Françoise Beaufays, Pedro Moreno Mengibar:
Improving Speech Recognition for African American English With Audio Classification. CoRR abs/2309.09996 (2023)
[i68]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12963
Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Models for Short Search Queries. CoRR abs/2309.12963 (2023)
[i67]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00178
Weiran Wang, Zelin Wu, Diamantino Caseiro, Tsendsuren Munkhdalai, Khe Chai Sim, Pat Rondon, Golan Pundak, Gan Song, Rohit Prabhavalkar, Zhong Meng, Ding Zhao, Tara N. Sainath, Pedro Moreno Mengibar:
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm. CoRR abs/2310.00178 (2023)
[i66]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-08553
Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Shivani Agrawal, Zhonglin Han, Jian Li, Amir Yazdanbakhsh:
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models. CoRR abs/2312.08553 (2023)
[i65]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-11123
Guru Prakash Arumugam, Shuo-Yiin Chang, Tara N. Sainath, Rohit Prabhavalkar, Quan Wang, Shaan Bijwadia:
Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers. CoRR abs/2312.11123 (2023)
2022
[j10]
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/LeeWLMS22
Hung-Yi Lee, Shinji Watanabe, Karen Livescu, Abdelrahman Mohamed, Tara N. Sainath:
Editorial of Special Issue on Self-Supervised Learning for Speech and Audio Processing. IEEE J. Sel. Top. Signal Process. 16(6): 1174-1178 (2022)
[j9]
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/MohamedLBHEIKLL22
Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe:
Self-Supervised Speech Representation Learning: A Review. IEEE J. Sel. Top. Signal Process. 16(6): 1179-1210 (2022)
[j8]
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/ZhangPHQGSJXHWZ22
Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu:
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition. IEEE J. Sel. Top. Signal Process. 16(6): 1519-1532 (2022)
[c151]
  - https://dblp.org/rec/conf/icassp/LiPZSSHZFGP22
Bo Li, Ruoming Pang, Yu Zhang, Tara N. Sainath, Trevor Strohman, Parisa Haghani, Yun Zhu, Brian Farris, Neeraj Gaur, Manasa Prasad:
Massively Multilingual ASR: A Lifelong Learning Solution. ICASSP 2022: 6397-6401
[c150]
  - https://dblp.org/rec/conf/icassp/BaiLZBSSS22
Junwen Bai, Bo Li, Yu Zhang, Ankur Bapna, Nikhil Siddhartha, Khe Chai Sim, Tara N. Sainath:
Joint Unsupervised and Supervised Training for Multilingual ASR. ICASSP 2022: 6402-6406
[c149]
  - https://dblp.org/rec/conf/icassp/WangHS22a
Weiran Wang, Ke Hu, Tara N. Sainath:
Deliberation of Streaming RNN-Transducer by Non-Autoregressive Decoding. ICASSP 2022: 7452-7456
[c148]
  - https://dblp.org/rec/conf/icassp/HuSNPS22
Ke Hu, Tara N. Sainath, Arun Narayanan, Ruoming Pang, Trevor Strohman:
Transducer-Based Streaming Deliberation for Cascaded Encoders. ICASSP 2022: 8107-8111
[c147]
  - https://dblp.org/rec/conf/icassp/SainathHNBWQCPG22
Tara N. Sainath, Yanzhang He, Arun Narayanan, Rami Botros, Weiran Wang, David Qiu, Chung-Cheng Chiu, Rohit Prabhavalkar, Alexander Gruenstein, Anmol Gulati, Bo Li, David Rybach, Emmanuel Guzman, Ian McGraw, James Qin, Krzysztof Choromanski, Qiao Liang, Robert David, Ruoming Pang, Shuo-Yiin Chang, Trevor Strohman, W. Ronny Huang, Wei Han, Yonghui Wu, Yu Zhang:
Improving The Latency And Quality Of Cascaded Encoders. ICASSP 2022: 8112-8116
[c146]
  - https://dblp.org/rec/conf/icassp/ZhangLLSC22
Chao Zhang, Bo Li, Zhiyun Lu, Tara N. Sainath, Shuo-Yiin Chang:
Improving the Fusion of Acoustic and Text Representations in RNN-T. ICASSP 2022: 8117-8121
[c145]
  - https://dblp.org/rec/conf/interspeech/HuangPSPSK22
W. Ronny Huang, Cal Peyser, Tara N. Sainath, Ruoming Pang, Trevor D. Strohman, Shankar Kumar:
Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition. INTERSPEECH 2022: 689-693
[c144]
  - https://dblp.org/rec/conf/interspeech/WangCSVPHRGMPSH22
Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, W. Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach:
Improving Rare Word Recognition with LM-aware MWER Training. INTERSPEECH 2022: 1031-1035
[c143]
  - https://dblp.org/rec/conf/interspeech/WangHS22
Weiran Wang, Ke Hu, Tara N. Sainath:
Streaming Align-Refine for Non-autoregressive Deliberation. INTERSPEECH 2022: 1696-1700
[c142]
  - https://dblp.org/rec/conf/interspeech/DingWZSHDBWPLHM22
Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman:
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes. INTERSPEECH 2022: 1706-1710
[c141]
  - https://dblp.org/rec/conf/interspeech/ChangLSZSLH22
Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Trevor Strohman, Qiao Liang, Yanzhang He:
Turn-Taking Prediction for Natural Conversational Speech. INTERSPEECH 2022: 1821-1825
[c140]
  - https://dblp.org/rec/conf/interspeech/ChangPWS0LSUFS22
Shuo-Yiin Chang, Guru Prakash, Zelin Wu, Tara N. Sainath, Bo Li, Qiao Liang, Adam Stambler, Shyam Upadhyay, Manaal Faruqui, Trevor Strohman:
Streaming Intended Query Detection using E2E Modeling for Continued Conversation. INTERSPEECH 2022: 1826-1830
[c139]
  - https://dblp.org/rec/conf/interspeech/LiSPCXSCLLHHB22
Bo Li, Tara N. Sainath, Ruoming Pang, Shuo-Yiin Chang, Qiumin Xu, Trevor Strohman, Vince Chen, Qiao Liang, Heguang Liu, Yanzhang He, Parisa Haghani, Sameer Bidichandani:
A Language Agnostic Multilingual Streaming On-Device ASR System. INTERSPEECH 2022: 3188-3192
[c138]
  - https://dblp.org/rec/conf/interspeech/ZhangLSSMCH22
Chao Zhang, Bo Li, Tara N. Sainath, Trevor Strohman, Sepand Mavandadi, Shuo-Yiin Chang, Parisa Haghani:
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification. INTERSPEECH 2022: 3223-3227
[c137]
  - https://dblp.org/rec/conf/interspeech/PeyserHRSPC22
Cal Peyser, W. Ronny Huang, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho:
Towards Disentangled Speech Representations. INTERSPEECH 2022: 3603-3607
[c136]
  - https://dblp.org/rec/conf/interspeech/HuSHPSMW22
Ke Hu, Tara N. Sainath, Yanzhang He, Rohit Prabhavalkar, Trevor Strohman, Sepand Mavandadi, Weiran Wang:
Improving Deliberation by Text-Only and Semi-Supervised Training. INTERSPEECH 2022: 4940-4944
[c135]
  - https://dblp.org/rec/conf/interspeech/HuangCRSPPLA22
W. Ronny Huang, Shuo-Yiin Chang, David Rybach, Tara N. Sainath, Rohit Prabhavalkar, Cal Peyser, Zhiyun Lu, Cyril Allauzen:
E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR. INTERSPEECH 2022: 4995-4999
[c134]
  - https://dblp.org/rec/conf/slt/SainathPBZHCLWS22
Tara N. Sainath, Rohit Prabhavalkar, Ankur Bapna, Yu Zhang, Zhouyuan Huo, Zhehuai Chen, Bo Li, Weiran Wang, Trevor Strohman:
JOIST: A Joint Speech and Text Streaming Model for ASR. SLT 2022: 52-59
[c133]
  - https://dblp.org/rec/conf/slt/MunkhdalaiWPSLRS22
Tsendsuren Munkhdalai, Zelin Wu, Golan Pundak, Khe Chai Sim, Jiayang Li, Pat Rondon, Tara N. Sainath:
NAM+: Towards Scalable End-to-End Contextual Biasing for Adaptive ASR. SLT 2022: 190-196
[c132]
  - https://dblp.org/rec/conf/slt/PeyserHSPPC22
Cal Peyser, W. Ronny Huang, Tara N. Sainath, Rohit Prabhavalkar, Michael Picheny, Kyunghyun Cho:
Dual Learning for Large Vocabulary On-Device ASR. SLT 2022: 245-251
[c131]
  - https://dblp.org/rec/conf/slt/BijwadiaCLSZH22
Shaan Bijwadia, Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Yanzhang He:
Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems. SLT 2022: 310-316
[c130]
  - https://dblp.org/rec/conf/slt/HuLS22
Ke Hu, Bo Li, Tara N. Sainath:
Scaling Up Deliberation For Multilingual ASR. SLT 2022: 771-776
[c129]
  - https://dblp.org/rec/conf/slt/MavandadiLZFSS22
Sepand Mavandadi, Bo Li, Chao Zhang, Brian Farris, Tara N. Sainath, Trevor Strohman:
A Truly Multilingual First Pass and Monolingual Second Pass Streaming on-Device ASR System. SLT 2022: 838-845
[i64]
  - https://dblp.org/rec/journals/corr/abs-2201-10240
Chao Zhang, Bo Li, Zhiyun Lu, Tara N. Sainath, Shuo-Yiin Chang:
Improving the fusion of acoustic and text representations in RNN-T. CoRR abs/2201.10240 (2022)
[i63]
  - https://dblp.org/rec/journals/corr/abs-2203-05008
W. Ronny Huang, Cal Peyser, Tara N. Sainath, Ruoming Pang, Trevor Strohman, Shankar Kumar:
Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition. CoRR abs/2203.05008 (2022)
[i62]
  - https://dblp.org/rec/journals/corr/abs-2204-06164
Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman:
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes. CoRR abs/2204.06164 (2022)
[i61]
  - https://dblp.org/rec/journals/corr/abs-2204-07553
Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, W. Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach:
Improving Rare Word Recognition with LM-aware MWER Training. CoRR abs/2204.07553 (2022)
[i60]
  - https://dblp.org/rec/journals/corr/abs-2204-07556
Weiran Wang, Ke Hu, Tara N. Sainath:
Streaming Align-Refine for Non-autoregressive Deliberation. CoRR abs/2204.07556 (2022)
[i59]
  - https://dblp.org/rec/journals/corr/abs-2204-10749
W. Ronny Huang, Shuo-Yiin Chang, David Rybach, Rohit Prabhavalkar, Tara N. Sainath, Cyril Allauzen, Cal Peyser, Zhiyun Lu:
E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR. CoRR abs/2204.10749 (2022)
[i58]
  - https://dblp.org/rec/journals/corr/abs-2205-10643
Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe:
Self-Supervised Speech Representation Learning: A Review. CoRR abs/2205.10643 (2022)
[i57]
  - https://dblp.org/rec/journals/corr/abs-2206-14716
Ke Hu, Tara N. Sainath, Yanzhang He, Rohit Prabhavalkar, Trevor Strohman, Sepand Mavandadi, Weiran Wang:
Improving Deliberation by Text-Only and Semi-Supervised Training. CoRR abs/2206.14716 (2022)
[i56]
  - https://dblp.org/rec/journals/corr/abs-2208-13191
Cal Peyser, W. Ronny Huang, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho:
Towards Disentangled Speech Representations. CoRR abs/2208.13191 (2022)
[i55]
  - https://dblp.org/rec/journals/corr/abs-2208-13321
Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Trevor Strohman, Qiao Liang, Yanzhang He:
Turn-Taking Prediction for Natural Conversational Speech. CoRR abs/2208.13321 (2022)
[i54]
  - https://dblp.org/rec/journals/corr/abs-2208-13322
Shuo-Yiin Chang, Guru Prakash, Zelin Wu, Qiao Liang, Tara N. Sainath, Bo Li, Adam Stambler, Shyam Upadhyay, Manaal Faruqui, Trevor Strohman:
Streaming Intended Query Detection using E2E Modeling for Continued Conversation. CoRR abs/2208.13322 (2022)
[i53]
  - https://dblp.org/rec/journals/corr/abs-2208-13916
Bo Li, Tara N. Sainath, Ruoming Pang, Shuo-Yiin Chang, Qiumin Xu, Trevor Strohman, Vince Chen, Qiao Liang, Heguang Liu, Yanzhang He, Parisa Haghani, Sameer Bidichandani:
A Language Agnostic Multilingual Streaming On-Device ASR System. CoRR abs/2208.13916 (2022)
[i52]
  - https://dblp.org/rec/journals/corr/abs-2209-06058
Chao Zhang, Bo Li, Tara N. Sainath, Trevor Strohman, Sepand Mavandadi, Shuo-Yiin Chang, Parisa Haghani:
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification. CoRR abs/2209.06058 (2022)
[i51]
  - https://dblp.org/rec/journals/corr/abs-2210-05785
Ke Hu, Bo Li, Tara N. Sainath:
Scaling Up Deliberation for Multilingual ASR. CoRR abs/2210.05785 (2022)
[i50]
  - https://dblp.org/rec/journals/corr/abs-2210-07353
Tara N. Sainath, Rohit Prabhavalkar, Ankur Bapna, Yu Zhang, Zhouyuan Huo, Zhehuai Chen, Bo Li, Weiran Wang, Trevor Strohman:
JOIST: A Joint Speech and Text Streaming Model For ASR. CoRR abs/2210.07353 (2022)
[i49]
  - https://dblp.org/rec/journals/corr/abs-2211-00786
Shaan Bijwadia, Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Yanzhang He:
Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems. CoRR abs/2211.00786 (2022)
[i48]
  - https://dblp.org/rec/journals/corr/abs-2211-01263
Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. CoRR abs/2211.01263 (2022)
[i47]
  - https://dblp.org/rec/journals/corr/abs-2211-02712
Zhouyuan Huo, Khe Chai Sim, Bo Li, Dongseong Hwang, Tara N. Sainath, Trevor Strohman:
Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion. CoRR abs/2211.02712 (2022)
[i46]
  - https://dblp.org/rec/journals/corr/abs-2211-15432
W. Ronny Huang, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, David Rybach, Robert David, Rohit Prabhavalkar, Cyril Allauzen, Cal Peyser, Trevor D. Strohman:
E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model. CoRR abs/2211.15432 (2022)
2021
[c128]
  - https://dblp.org/rec/conf/asru/LiPSGZQHHMB21
Bo Li, Ruoming Pang, Tara N. Sainath, Anmol Gulati, Yu Zhang, James Qin, Parisa Haghani, W. Ronny Huang, Min Ma, Junwen Bai:
Scaling End-to-End Models for Large-Scale Multilingual ASR. ASRU 2021: 1011-1018
[c127]
  - https://dblp.org/rec/conf/icassp/NarayananSPYCPV21
Arun Narayanan, Tara N. Sainath, Ruoming Pang, Jiahui Yu, Chung-Cheng Chiu, Rohit Prabhavalkar, Ehsan Variani, Trevor Strohman:
Cascaded Encoders for Unifying Streaming and Non-Streaming ASR. ICASSP 2021: 5629-5633
[c126]
  - https://dblp.org/rec/conf/icassp/LiGYSCNCPHQ0LZS21
Bo Li, Anmol Gulati, Jiahui Yu, Tara N. Sainath, Chung-Cheng Chiu, Arun Narayanan, Shuo-Yiin Chang, Ruoming Pang, Yanzhang He, James Qin, Wei Han, Qiao Liang, Yu Zhang, Trevor Strohman, Yonghui Wu:
A Better and Faster end-to-end Model for Streaming ASR. ICASSP 2021: 5634-5638
[c125]
  - https://dblp.org/rec/conf/icassp/PrabhavalkarHRC21
Rohit Prabhavalkar, Yanzhang He, David Rybach, Sean Campbell, Arun Narayanan, Trevor Strohman, Tara N. Sainath:
Less is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging. ICASSP 2021: 5659-5663
[c124]
  - https://dblp.org/rec/conf/icassp/ShrivastavaGCZS21
Harsh Shrivastava, Ankush Garg, Yuan Cao, Yu Zhang, Tara N. Sainath:
Echo State Speech Recognition. ICASSP 2021: 5669-5673
[c123]
  - https://dblp.org/rec/conf/icassp/YuCLCSHNHGWP21
Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, Arun Narayanan, Wei Han, Anmol Gulati, Yonghui Wu, Ruoming Pang:
FastEmit: Low-Latency Streaming ASR with Sequence-Level Emission Regularization. ICASSP 2021: 6004-6008
[c122]
  - https://dblp.org/rec/conf/icassp/QiuLHZLCPBLHSM21
David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw:
Learning Word-Level Confidence for Subword End-To-End ASR. ICASSP 2021: 6393-6397
[c121]
  - https://dblp.org/rec/conf/iclr/YuHGCLSWP21
Jiahui Yu, Wei Han, Anmol Gulati, Chung-Cheng Chiu, Bo Li, Tara N. Sainath, Yonghui Wu, Ruoming Pang:
Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling. ICLR 2021
[c120]
  - https://dblp.org/rec/conf/interspeech/SainathHNBPRAVQ21
Tara N. Sainath, Yanzhang He, Arun Narayanan, Rami Botros, Ruoming Pang, David Rybach, Cyril Allauzen, Ehsan Variani, James Qin, Quoc-Nam Le-The, Shuo-Yiin Chang, Bo Li, Anmol Gulati, Jiahui Yu, Chung-Cheng Chiu, Diamantino Caseiro, Wei Li, Qiao Liang, Pat Rondon:
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling. Interspeech 2021: 1777-1781
[c119]
  - https://dblp.org/rec/conf/interspeech/HuangSPKRS21
W. Ronny Huang, Tara N. Sainath, Cal Peyser, Shankar Kumar, David Rybach, Trevor Strohman:
Lookup-Table Recurrent Language Models for Long Tail Speech Recognition. Interspeech 2021: 2002-2006
[c118]
  - https://dblp.org/rec/conf/interspeech/MavandadiSHW21
Sepand Mavandadi, Tara N. Sainath, Ke Hu, Zelin Wu:
A Deliberation-Based Joint Acoustic and Text Decoder. Interspeech 2021: 2057-2061
[c117]
  - https://dblp.org/rec/conf/interspeech/WangSW21
Peidong Wang, Tara N. Sainath, Ron J. Weiss:
Multitask Training with Text Data for End-to-End Speech Recognition. Interspeech 2021: 2566-2570
[c116]
  - https://dblp.org/rec/conf/interspeech/BotrosSDG0H21
Rami Botros, Tara N. Sainath, Robert David, Emmanuel Guzman, Wei Li, Yanzhang He:
Tied & Reduced RNN-T Decoder. Interspeech 2021: 4563-4567
[c115]
  - https://dblp.org/rec/conf/slt/HuPSS21
Ke Hu, Ruoming Pang, Tara N. Sainath, Trevor Strohman:
Transformer Based Deliberation for Two-Pass Speech Recognition. SLT 2021: 68-74
[c114]
  - https://dblp.org/rec/conf/slt/ChiuNHPZJPSNCW21
Chung-Cheng Chiu, Arun Narayanan, Wei Han, Rohit Prabhavalkar, Yu Zhang, Navdeep Jaitly, Ruoming Pang, Tara N. Sainath, Patrick Nguyen, Liangliang Cao, Yonghui Wu:
RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions. SLT 2021: 873-880
[i45]
  - https://dblp.org/rec/journals/corr/abs-2101-11577
Ke Hu, Ruoming Pang, Tara N. Sainath, Trevor Strohman:
Transformer Based Deliberation for Two-Pass Speech Recognition. CoRR abs/2101.11577 (2021)
[i44]
  - https://dblp.org/rec/journals/corr/abs-2102-09114
Harsh Shrivastava, Ankush Garg, Yuan Cao, Yu Zhang, Tara N. Sainath:
Echo State Speech Recognition. CoRR abs/2102.09114 (2021)
[i43]
  - https://dblp.org/rec/journals/corr/abs-2103-06716
David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw:
Learning Word-Level Confidence For Subword End-to-End ASR. CoRR abs/2103.06716 (2021)
[i42]
  - https://dblp.org/rec/journals/corr/abs-2104-04552
W. Ronny Huang, Tara N. Sainath, Cal Peyser, Shankar Kumar, David Rybach, Trevor Strohman:
Lookup-Table Recurrent Language Models for Long Tail Speech Recognition. CoRR abs/2104.04552 (2021)
[i41]
  - https://dblp.org/rec/journals/corr/abs-2104-14830
Bo Li, Ruoming Pang, Tara N. Sainath, Anmol Gulati, Yu Zhang, James Qin, Parisa Haghani, W. Ronny Huang, Min Ma:
Scaling End-to-End Models for Large-Scale Multilingual ASR. CoRR abs/2104.14830 (2021)
[i40]
  - https://dblp.org/rec/journals/corr/abs-2109-07513
Rami Botros, Tara N. Sainath, Robert David, Emmanuel Guzman, Wei Li, Yanzhang He:
Tied & Reduced RNN-T Decoder. CoRR abs/2109.07513 (2021)
[i39]
  - https://dblp.org/rec/journals/corr/abs-2109-13226
Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu:
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition. CoRR abs/2109.13226 (2021)
[i38]
  - https://dblp.org/rec/journals/corr/abs-2111-08137
Junwen Bai, Bo Li, Yu Zhang, Ankur Bapna, Nikhil Siddhartha, Khe Chai Sim, Tara N. Sainath:
Joint Unsupervised and Supervised Training for Multilingual ASR. CoRR abs/2111.08137 (2021)
[i37]
  - https://dblp.org/rec/journals/corr/abs-2112-11442
Weiran Wang, Ke Hu, Tara N. Sainath:
Deliberation of Streaming RNN-Transducer by Non-autoregressive Decoding. CoRR abs/2112.11442 (2021)
2020
[c113]
  - https://dblp.org/rec/conf/icassp/SainathHLNPBCLA20
Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alexander Gruenstein, Ke Hu, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirkó Visontai, Yonghui Wu, Yu Zhang, Ding Zhao:
A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency. ICASSP 2020: 6059-6063
[c112]
  - https://dblp.org/rec/conf/icassp/LiCSPHSW20
Bo Li, Shuo-Yiin Chang, Tara N. Sainath, Ruoming Pang, Yanzhang He, Trevor Strohman, Yonghui Wu:
Towards Fast and Accurate Streaming End-To-End ASR. ICASSP 2020: 6069-6073
[c111]
  - https://dblp.org/rec/conf/icassp/SainathPWHCS20
Tara N. Sainath, Ruoming Pang, Ron J. Weiss, Yanzhang He, Chung-Cheng Chiu, Trevor Strohman:
An Attention-Based Joint Acoustic and Text on-Device End-To-End Model. ICASSP 2020: 7039-7043
[c110]
  - https://dblp.org/rec/conf/icassp/PeyserSP20
Cal Peyser, Tara N. Sainath, Golan Pundak:
Improving Proper Noun Recognition in End-To-End ASR by Customization of the MWER Loss Criterion. ICASSP 2020: 7789-7793
[c109]
  - https://dblp.org/rec/conf/icassp/HuSPP20
Ke Hu, Tara N. Sainath, Ruoming Pang, Rohit Prabhavalkar:
Deliberation Model Based Two-Pass End-To-End Speech Recognition. ICASSP 2020: 7799-7803
[c108]
dblp key: conf/icassp/WuLZAS20
persistent URL: https://dblp.org/rec/conf/icassp/WuLZAS20
Zelin Wu, Bo Li, Yu Zhang, Petar S. Aleksic, Tara N. Sainath:
Multistate Encoding with End-To-End Speech RNN Transducer Network. ICASSP 2020: 7819-7823
[c107]
dblp key: conf/interspeech/Chang0RHLSS20
persistent URL: https://dblp.org/rec/conf/interspeech/Chang0RHLSS20
Shuo-Yiin Chang, Bo Li, David Rybach, Yanzhang He, Wei Li, Tara N. Sainath, Trevor Strohman:
Low Latency Speech Recognition Using End-to-End Prefetching. INTERSPEECH 2020: 1962-1966
[c106]
dblp key: conf/interspeech/SainathPRGS20
persistent URL: https://dblp.org/rec/conf/interspeech/SainathPRGS20
Tara N. Sainath, Ruoming Pang, David Rybach, Basi García, Trevor Strohman:
Emitting Word Timings with End-to-End Models. INTERSPEECH 2020: 3615-3619
[c105]
dblp key: conf/interspeech/PeyserMSAPK20
persistent URL: https://dblp.org/rec/conf/interspeech/PeyserMSAPK20
Cal Peyser, Sepand Mavandadi, Tara N. Sainath, James Apfel, Ruoming Pang, Shankar Kumar:
Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus. INTERSPEECH 2020: 4921-4925
[i36]
dblp key: journals/corr/abs-2003-07962
persistent URL: https://dblp.org/rec/journals/corr/abs-2003-07962
Ke Hu, Tara N. Sainath, Ruoming Pang, Rohit Prabhavalkar:
Deliberation Model Based Two-Pass End-to-End Speech Recognition. CoRR abs/2003.07962 (2020)
[i35]
dblp key: journals/corr/abs-2003-12710
persistent URL: https://dblp.org/rec/journals/corr/abs-2003-12710
Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alexander Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirkó Visontai, Yonghui Wu, Yu Zhang, Ding Zhao:
A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency. CoRR abs/2003.12710 (2020)
[i34]
dblp key: journals/corr/abs-2005-03271
persistent URL: https://dblp.org/rec/journals/corr/abs-2005-03271
Chung-Cheng Chiu, Arun Narayanan, Wei Han, Rohit Prabhavalkar, Yu Zhang, Navdeep Jaitly, Ruoming Pang, Tara N. Sainath, Patrick Nguyen, Liangliang Cao, Yonghui Wu:
RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions. CoRR abs/2005.03271 (2020)
[i33]
dblp key: journals/corr/abs-2005-09756
persistent URL: https://dblp.org/rec/journals/corr/abs-2005-09756
Cal Peyser, Tara N. Sainath, Golan Pundak:
Improving Proper Noun Recognition in End-to-End ASR By Customization of the MWER Loss Criterion. CoRR abs/2005.09756 (2020)
[i32]
dblp key: journals/corr/abs-2008-10491
persistent URL: https://dblp.org/rec/journals/corr/abs-2008-10491
Cal Peyser, Sepand Mavandadi, Tara N. Sainath, James Apfel, Ruoming Pang, Shankar Kumar:
Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus. CoRR abs/2008.10491 (2020)
[i31]
dblp key: journals/corr/abs-2010-06030
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-06030
Jiahui Yu, Wei Han, Anmol Gulati, Chung-Cheng Chiu, Bo Li, Tara N. Sainath, Yonghui Wu, Ruoming Pang:
Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling. CoRR abs/2010.06030 (2020)
[i30]
dblp key: journals/corr/abs-2010-11148
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-11148
Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, Arun Narayanan, Wei Han, Anmol Gulati, Yonghui Wu, Ruoming Pang:
FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization. CoRR abs/2010.11148 (2020)
[i29]
dblp key: journals/corr/abs-2010-14318
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-14318
Peidong Wang, Tara N. Sainath, Ron J. Weiss:
Multitask Training with Text Data for End-to-End Speech Recognition. CoRR abs/2010.14318 (2020)
[i28]
dblp key: journals/corr/abs-2010-14606
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-14606
Arun Narayanan, Tara N. Sainath, Ruoming Pang, Jiahui Yu, Chung-Cheng Chiu, Rohit Prabhavalkar, Ehsan Variani, Trevor Strohman:
Cascaded encoders for unifying streaming and non-streaming ASR. CoRR abs/2010.14606 (2020)
[i27]
dblp key: journals/corr/abs-2011-10798
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-10798
Bo Li, Anmol Gulati, Jiahui Yu, Tara N. Sainath, Chung-Cheng Chiu, Arun Narayanan, Shuo-Yiin Chang, Ruoming Pang, Yanzhang He, James Qin, Wei Han, Qiao Liang, Yu Zhang, Trevor Strohman, Yonghui Wu:
A Better and Faster End-to-End Model for Streaming ASR. CoRR abs/2011.10798 (2020)
[i26]
dblp key: journals/corr/abs-2012-06749
persistent URL: https://dblp.org/rec/journals/corr/abs-2012-06749
Rohit Prabhavalkar, Yanzhang He, David Rybach, Sean Campbell, Arun Narayanan, Trevor Strohman, Tara N. Sainath:
Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging. CoRR abs/2012.06749 (2020)

2010 – 2019

2019
[j7]
dblp key: journals/jstsp/PurwinsLVSCS19
persistent URL: https://dblp.org/rec/journals/jstsp/PurwinsLVSCS19
Hendrik Purwins, Bo Li, Tuomas Virtanen, Jan Schlüter, Shuo-Yiin Chang, Tara N. Sainath:
Deep Learning for Audio Signal Processing. IEEE J. Sel. Top. Signal Process. 13(2): 206-219 (2019)
[c104]
dblp key: conf/asru/ChiuKPCSWHZPKNN19
persistent URL: https://dblp.org/rec/conf/asru/ChiuKPCSWHZPKNN19
Chung-Cheng Chiu, Anjuli Kannan, Rohit Prabhavalkar, Zhifeng Chen, Tara N. Sainath, Yonghui Wu, Wei Han, Yu Zhang, Ruoming Pang, Sergey Kishchenko, Patrick Nguyen, Arun Narayanan, Hank Liao, Shuyuan Zhang:
A Comparison of End-to-End Models for Long-Form Speech Recognition. ASRU 2019: 889-896
[c103]
dblp key: conf/asru/NarayananPCRSS19
persistent URL: https://dblp.org/rec/conf/asru/NarayananPCRSS19
Arun Narayanan, Rohit Prabhavalkar, Chung-Cheng Chiu, David Rybach, Tara N. Sainath, Trevor Strohman:
Recognizing Long-Form Speech Using Streaming End-to-End Models. ASRU 2019: 920-927
[c102]
dblp key: conf/icassp/0028SPW19
persistent URL: https://dblp.org/rec/conf/icassp/0028SPW19
Bo Li, Tara N. Sainath, Ruoming Pang, Zelin Wu:
Semi-supervised Training for End-to-end Models via Weak Distillation. ICASSP 2019: 2837-2841
[c101]
dblp key: conf/icassp/LiZSWC19
persistent URL: https://dblp.org/rec/conf/icassp/LiZSWC19
Bo Li, Yu Zhang, Tara N. Sainath, Yonghui Wu, William Chan:
Bytes Are All You Need: End-to-end Multilingual Speech Recognition and Synthesis with Bytes. ICASSP 2019: 5621-5625
[c100]
dblp key: conf/icassp/ChangPHSS19
persistent URL: https://dblp.org/rec/conf/icassp/ChangPHSS19
Shuo-Yiin Chang, Rohit Prabhavalkar, Yanzhang He, Tara N. Sainath, Gabor Simko:
Joint Endpointing and Decoding with End-to-end Models. ICASSP 2019: 5626-5630
[c99]
dblp key: conf/icassp/GuoSW19
persistent URL: https://dblp.org/rec/conf/icassp/GuoSW19
Jinxi Guo, Tara N. Sainath, Ron J. Weiss:
A Spelling Correction Model for End-to-end Speech Recognition. ICASSP 2019: 5651-5655
[c98]
dblp key: conf/icassp/BruguierPPS19
persistent URL: https://dblp.org/rec/conf/icassp/BruguierPPS19
Antoine Bruguier, Rohit Prabhavalkar, Golan Pundak, Tara N. Sainath:
Phoebe: Pronunciation-aware Contextualization for End-to-end Speech Recognition. ICASSP 2019: 6171-6175
[c97]
dblp key: conf/icassp/HeSPMAZRKWPLBSL19
persistent URL: https://dblp.org/rec/conf/icassp/HeSPMAZRKWPLBSL19
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition for Mobile Devices. ICASSP 2019: 6381-6385
[c96]
dblp key: conf/icassp/0002PS19
persistent URL: https://dblp.org/rec/conf/icassp/0002PS19
Uri Alon, Golan Pundak, Tara N. Sainath:
Contextual Speech Recognition with Difficult Negative Training Examples. ICASSP 2019: 6440-6444
[c95]
dblp key: conf/interspeech/ZhaoSRRBLP19
persistent URL: https://dblp.org/rec/conf/interspeech/ZhaoSRRBLP19
Ding Zhao, Tara N. Sainath, David Rybach, Pat Rondon, Deepti Bhatia, Bo Li, Ruoming Pang:
Shallow-Fusion End-to-End Contextual Biasing. INTERSPEECH 2019: 1418-1422
[c94]
dblp key: conf/interspeech/KannanDSWRWBCL19
persistent URL: https://dblp.org/rec/conf/interspeech/KannanDSWRWBCL19
Anjuli Kannan, Arindrima Datta, Tara N. Sainath, Eugene Weinstein, Bhuvana Ramabhadran, Yonghui Wu, Ankur Bapna, Zhifeng Chen, Seungji Lee:
Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model. INTERSPEECH 2019: 2130-2134
[c93]
dblp key: conf/interspeech/HuBSPP19
persistent URL: https://dblp.org/rec/conf/interspeech/HuBSPP19
Ke Hu, Antoine Bruguier, Tara N. Sainath, Rohit Prabhavalkar, Golan Pundak:
Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models. INTERSPEECH 2019: 2155-2159
[c92]
dblp key: conf/interspeech/PeyserZSW19
persistent URL: https://dblp.org/rec/conf/interspeech/PeyserZSW19
Cal Peyser, Hao Zhang, Tara N. Sainath, Zelin Wu:
Improving Performance of End-to-End ASR on Numeric Sequences. INTERSPEECH 2019: 2185-2189
[c91]
dblp key: conf/interspeech/SainathPRHPLVLS19
persistent URL: https://dblp.org/rec/conf/interspeech/SainathPRHPLVLS19
Tara N. Sainath, Ruoming Pang, David Rybach, Yanzhang He, Rohit Prabhavalkar, Wei Li, Mirkó Visontai, Qiao Liang, Trevor Strohman, Yonghui Wu, Ian McGraw, Chung-Cheng Chiu:
Two-Pass End-to-End Speech Recognition. INTERSPEECH 2019: 2773-2777
[i25]
dblp key: journals/corr/abs-1902-07178
persistent URL: https://dblp.org/rec/journals/corr/abs-1902-07178
Jinxi Guo, Tara N. Sainath, Ron J. Weiss:
A spelling correction model for end-to-end speech recognition. CoRR abs/1902.07178 (2019)
[i24]
dblp key: journals/corr/abs-1902-08295
persistent URL: https://dblp.org/rec/journals/corr/abs-1902-08295
Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia Xu Chen, Ye Jia, Anjuli Kannan, Tara N. Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George F. Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel Bacchiani, Thomas B. Jablin, Robert Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon:
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling. CoRR abs/1902.08295 (2019)
[i23]
dblp key: journals/corr/abs-1905-00078
persistent URL: https://dblp.org/rec/journals/corr/abs-1905-00078
Hendrik Purwins, Bo Li, Tuomas Virtanen, Jan Schlüter, Shuo-Yiin Chang, Tara N. Sainath:
Deep Learning for Audio Signal Processing. CoRR abs/1905.00078 (2019)
[i22]
dblp key: journals/corr/abs-1906-09292
persistent URL: https://dblp.org/rec/journals/corr/abs-1906-09292
Ke Hu, Antoine Bruguier, Tara N. Sainath, Rohit Prabhavalkar, Golan Pundak:
Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models. CoRR abs/1906.09292 (2019)
[i21]
dblp key: journals/corr/abs-1907-01372
persistent URL: https://dblp.org/rec/journals/corr/abs-1907-01372
Cal Peyser, Hao Zhang, Tara N. Sainath, Zelin Wu:
Improving Performance of End-to-End ASR on Numeric Sequences. CoRR abs/1907.01372 (2019)
[i20]
dblp key: journals/corr/abs-1908-10992
persistent URL: https://dblp.org/rec/journals/corr/abs-1908-10992
Tara N. Sainath, Ruoming Pang, David Rybach, Yanzhang He, Rohit Prabhavalkar, Wei Li, Mirkó Visontai, Qiao Liang, Trevor Strohman, Yonghui Wu, Ian McGraw, Chung-Cheng Chiu:
Two-Pass End-to-End Speech Recognition. CoRR abs/1908.10992 (2019)
[i19]
dblp key: journals/corr/abs-1909-05330
persistent URL: https://dblp.org/rec/journals/corr/abs-1909-05330
Anjuli Kannan, Arindrima Datta, Tara N. Sainath, Eugene Weinstein, Bhuvana Ramabhadran, Yonghui Wu, Ankur Bapna, Zhifeng Chen, Seungji Lee:
Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model. CoRR abs/1909.05330 (2019)
[i18]
dblp key: journals/corr/abs-1910-11455
persistent URL: https://dblp.org/rec/journals/corr/abs-1910-11455
Arun Narayanan, Rohit Prabhavalkar, Chung-Cheng Chiu, David Rybach, Tara N. Sainath, Trevor Strohman:
Recognizing long-form speech using streaming end-to-end models. CoRR abs/1910.11455 (2019)
[i17]
dblp key: journals/corr/abs-1911-02242
persistent URL: https://dblp.org/rec/journals/corr/abs-1911-02242
Chung-Cheng Chiu, Wei Han, Yu Zhang, Ruoming Pang, Sergey Kishchenko, Patrick Nguyen, Arun Narayanan, Hank Liao, Shuyuan Zhang, Anjuli Kannan, Rohit Prabhavalkar, Zhifeng Chen, Tara N. Sainath, Yonghui Wu:
A comparison of end-to-end models for long-form speech recognition. CoRR abs/1911.02242 (2019)
2018
[c90]
dblp key: conf/icassp/LiSSBWNCWR18
persistent URL: https://dblp.org/rec/conf/icassp/LiSSBWNCWR18
Bo Li, Tara N. Sainath, Khe Chai Sim, Michiel Bacchiani, Eugene Weinstein, Patrick Nguyen, Zhifeng Chen, Yonghui Wu, Kanishka Rao:
Multi-Dialect Speech Recognition with a Single Sequence-to-Sequence Model. ICASSP 2018: 4749-4753
[c89]
dblp key: conf/icassp/ChiuSWPNCKWRGJL18
persistent URL: https://dblp.org/rec/conf/icassp/ChiuSWPNCKWRGJL18
Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Ekaterina Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski, Michiel Bacchiani:
State-of-the-Art Speech Recognition with Sequence-to-Sequence Models. ICASSP 2018: 4774-4778
[c88]
dblp key: conf/icassp/PrabhavalkarSWN18
persistent URL: https://dblp.org/rec/conf/icassp/PrabhavalkarSWN18
Rohit Prabhavalkar, Tara N. Sainath, Yonghui Wu, Patrick Nguyen, Zhifeng Chen, Chung-Cheng Chiu, Anjuli Kannan:
Minimum Word Error Rate Training for Attention-Based Sequence-to-Sequence Models. ICASSP 2018: 4839-4843
[c87]
dblp key: conf/icassp/ToshniwalSWLMWR18
persistent URL: https://dblp.org/rec/conf/icassp/ToshniwalSWLMWR18
Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition with a Single End-to-End Model. ICASSP 2018: 4904-4908
[c86]
dblp key: conf/icassp/ChangLSSTOV18
persistent URL: https://dblp.org/rec/conf/icassp/ChangLSSTOV18
Shuo-Yiin Chang, Bo Li, Gabor Simko, Tara N. Sainath, Anshuman Tripathi, Aäron van den Oord, Oriol Vinyals:
Temporal Modeling Using Dilated Convolution and Gating for Voice-Activity-Detection. ICASSP 2018: 5549-5553
[c85]
dblp key: conf/icassp/KimSNMNB18
persistent URL: https://dblp.org/rec/conf/icassp/KimSNMNB18
Chanwoo Kim, Tara N. Sainath, Arun Narayanan, Ananya Misra, Rajeev C. Nongpiur, Michiel Bacchiani:
Spectral Distortion Model for Training Phase-Sensitive Deep-Neural Networks for Far-Field Speech Recognition. ICASSP 2018: 5729-5733
[c84]
dblp key: conf/icassp/KannanWNSCP18
persistent URL: https://dblp.org/rec/conf/icassp/KannanWNSCP18
Anjuli Kannan, Yonghui Wu, Patrick Nguyen, Tara N. Sainath, Zhifeng Chen, Rohit Prabhavalkar:
An Analysis of Incorporating an External Language Model into a Sequence-to-Sequence Model. ICASSP 2018: 5824-5828
[c83]
dblp key: conf/icassp/SainathPKLKRSNL18
persistent URL: https://dblp.org/rec/conf/icassp/SainathPKLKRSNL18
Tara N. Sainath, Rohit Prabhavalkar, Shankar Kumar, Seungji Lee, Anjuli Kannan, David Rybach, Vlad Schogol, Patrick Nguyen, Bo Li, Yonghui Wu, Zhifeng Chen, Chung-Cheng Chiu:
No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models. ICASSP 2018: 5859-5863
[c82]
dblp key: conf/icassp/SainathCPKWNC18
persistent URL: https://dblp.org/rec/conf/icassp/SainathCPKWNC18
Tara N. Sainath, Chung-Cheng Chiu, Rohit Prabhavalkar, Anjuli Kannan, Yonghui Wu, Patrick Nguyen, Zhifeng Chen:
Improving the Performance of Online Neural Transducer Models. ICASSP 2018: 5864-5868
[c81]
dblp key: conf/icassp/HeymannBS18
persistent URL: https://dblp.org/rec/conf/icassp/HeymannBS18
Jahn Heymann, Michiel Bacchiani, Tara N. Sainath:
Performance of Mask Based Statistical Beamforming in a Smart Home Scenario. ICASSP 2018: 6722-6726
[c80]
dblp key: conf/interspeech/PangSPGWZC18
persistent URL: https://dblp.org/rec/conf/interspeech/PangSPGWZC18
Ruoming Pang, Tara N. Sainath, Rohit Prabhavalkar, Suyog Gupta, Yonghui Wu, Shuyuan Zhang, Chung-Cheng Chiu:
Compression of End-to-End Models. INTERSPEECH 2018: 27-31
[c79]
dblp key: conf/interspeech/SimNMTPSHLB18
persistent URL: https://dblp.org/rec/conf/interspeech/SimNMTPSHLB18
Khe Chai Sim, Arun Narayanan, Ananya Misra, Anshuman Tripathi, Golan Pundak, Tara N. Sainath, Parisa Haghani, Bo Li, Michiel Bacchiani:
Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition. INTERSPEECH 2018: 892-896
[c78]
dblp key: conf/interspeech/WilliamsKARS18
persistent URL: https://dblp.org/rec/conf/interspeech/WilliamsKARS18
Ian Williams, Anjuli Kannan, Petar S. Aleksic, David Rybach, Tara N. Sainath:
Contextual Speech Recognition in End-to-end Neural Network Systems Using Beam Search. INTERSPEECH 2018: 2227-2231
[c77]
dblp key: conf/slt/ToshniwalKCWSL18
persistent URL: https://dblp.org/rec/conf/slt/ToshniwalKCWSL18
Shubham Toshniwal, Anjuli Kannan, Chung-Cheng Chiu, Yonghui Wu, Tara N. Sainath, Karen Livescu:
A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition. SLT 2018: 369-375
[c76]
dblp key: conf/slt/PundakSPKZ18
persistent URL: https://dblp.org/rec/conf/slt/PundakSPKZ18
Golan Pundak, Tara N. Sainath, Rohit Prabhavalkar, Anjuli Kannan, Ding Zhao:
Deep Context: End-to-end Contextual Speech Recognition. SLT 2018: 418-425
[i16]
dblp key: journals/corr/abs-1807-10857
persistent URL: https://dblp.org/rec/journals/corr/abs-1807-10857
Shubham Toshniwal, Anjuli Kannan, Chung-Cheng Chiu, Yonghui Wu, Tara N. Sainath, Karen Livescu:
A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition. CoRR abs/1807.10857 (2018)
[i15]
dblp key: journals/corr/abs-1808-02480
persistent URL: https://dblp.org/rec/journals/corr/abs-1808-02480
Golan Pundak, Tara N. Sainath, Rohit Prabhavalkar, Anjuli Kannan, Ding Zhao:
Deep context: end-to-end contextual speech recognition. CoRR abs/1808.02480 (2018)
[i14]
dblp key: journals/corr/abs-1810-12170
persistent URL: https://dblp.org/rec/journals/corr/abs-1810-12170
Uri Alon, Golan Pundak, Tara N. Sainath:
Contextual Speech Recognition with Difficult Negative Training Examples. CoRR abs/1810.12170 (2018)
[i13]
dblp key: journals/corr/abs-1811-06621
persistent URL: https://dblp.org/rec/journals/corr/abs-1811-06621
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition For Mobile Devices. CoRR abs/1811.06621 (2018)
[i12]
dblp key: journals/corr/abs-1811-09021
persistent URL: https://dblp.org/rec/journals/corr/abs-1811-09021
Bo Li, Yu Zhang, Tara N. Sainath, Yonghui Wu, William Chan:
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes. CoRR abs/1811.09021 (2018)
2017
[j6]
dblp key: journals/taslp/SainathWWLNVBSS17
persistent URL: https://dblp.org/rec/journals/taslp/SainathWWLNVBSS17
Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Bo Li, Arun Narayanan, Ehsan Variani, Michiel Bacchiani, Izhak Shafran, Andrew W. Senior, Kean K. Chin, Ananya Misra, Chanwoo Kim:
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 965-979 (2017)
[j5]
dblp key: journals/tpds/ChungSRPGACK17
persistent URL: https://dblp.org/rec/journals/tpds/ChungSRPGACK17
I-Hsin Chung, Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Vernon Austel, Upendra V. Chaudhari, Brian Kingsbury:
Parallel Deep Neural Network Training for Big Data on Blue Gene/Q. IEEE Trans. Parallel Distributed Syst. 28(6): 1703-1714 (2017)
[c75]
dblp key: conf/asru/SimNBSB17
persistent URL: https://dblp.org/rec/conf/asru/SimNBSB17
Khe Chai Sim, Arun Narayanan, Tom Bagby, Tara N. Sainath, Michiel Bacchiani:
Improving the efficiency of forward-backward algorithm using batched computation in TensorFlow. ASRU 2017: 258-264
[c74]
dblp key: conf/interspeech/KimMCHNSB17
persistent URL: https://dblp.org/rec/conf/interspeech/KimMCHNSB17
Chanwoo Kim, Ananya Misra, Kean K. Chin, Thad Hughes, Arun Narayanan, Tara N. Sainath, Michiel Bacchiani:
Generation of Large-Scale Simulated Utterances in Virtual Rooms to Train Deep-Neural Networks for Far-Field Speech Recognition in Google Home. INTERSPEECH 2017: 379-383
[c73]
dblp key: conf/interspeech/LiSNCBMSSPCSWWV17
persistent URL: https://dblp.org/rec/conf/interspeech/LiSNCBMSSPCSWWV17
Bo Li, Tara N. Sainath, Arun Narayanan, Joe Caroselli, Michiel Bacchiani, Ananya Misra, Izhak Shafran, Hasim Sak, Golan Pundak, Kean K. Chin, Khe Chai Sim, Ron J. Weiss, Kevin W. Wilson, Ehsan Variani, Chanwoo Kim, Olivier Siohan, Mitchel Weintraub, Erik McDermott, Richard Rose, Matt Shannon:
Acoustic Modeling for Google Home. INTERSPEECH 2017: 399-403
[c72]
dblp key: conf/interspeech/PrabhavalkarRSL17
persistent URL: https://dblp.org/rec/conf/interspeech/PrabhavalkarRSL17
Rohit Prabhavalkar, Kanishka Rao, Tara N. Sainath, Bo Li, Leif Johnson, Navdeep Jaitly:
A Comparison of Sequence-to-Sequence Models for Speech Recognition. INTERSPEECH 2017: 939-943
[c71]
dblp key: conf/interspeech/LiS17
persistent URL: https://dblp.org/rec/conf/interspeech/LiS17
Bo Li, Tara N. Sainath:
Reducing the Computational Complexity of Two-Dimensional LSTMs. INTERSPEECH 2017: 964-968
[c70]
dblp key: conf/interspeech/PundakS17
persistent URL: https://dblp.org/rec/conf/interspeech/PundakS17
Golan Pundak, Tara N. Sainath:
Highway-LSTM and Recurrent Highway Networks for Speech Recognition. INTERSPEECH 2017: 1303-1307
[c69]
dblp key: conf/interspeech/SainathPSN17
persistent URL: https://dblp.org/rec/conf/interspeech/SainathPSN17
Tara N. Sainath, Vijayaditya Peddinti, Olivier Siohan, Arun Narayanan:
Annealed f-Smoothing as a Mechanism to Speed up Neural Network Training. INTERSPEECH 2017: 3542-3546
[c68]
dblp key: conf/interspeech/PrabhavalkarSLR17
persistent URL: https://dblp.org/rec/conf/interspeech/PrabhavalkarSLR17
Rohit Prabhavalkar, Tara N. Sainath, Bo Li, Kanishka Rao, Navdeep Jaitly:
An Analysis of "Attention" in Sequence-to-Sequence Models. INTERSPEECH 2017: 3702-3706
[c67]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChangLSSP17
Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Gabor Simko, Carolina Parada:
Endpoint Detection Using Grid Long Short-Term Memory Networks for Streaming Speech Recognition. INTERSPEECH 2017: 3812-3816
[p1]
  persistent URL:
  - https://dblp.org/rec/books/sp/17/SainathWWNBLVSSCMK17
Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Arun Narayanan, Michiel Bacchiani, Bo Li, Ehsan Variani, Izhak Shafran, Andrew W. Senior, Kean K. Chin, Ananya Misra, Chanwoo Kim:
Raw Multichannel Processing Using Deep Neural Networks. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 105-133
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-01694
Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition With A Single End-To-End Model. CoRR abs/1711.01694 (2017)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-01541
Bo Li, Tara N. Sainath, Khe Chai Sim, Michiel Bacchiani, Eugene Weinstein, Patrick Nguyen, Zhifeng Chen, Yonghui Wu, Kanishka Rao:
Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model. CoRR abs/1712.01541 (2017)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-01769
Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Katya Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski, Michiel Bacchiani:
State-of-the-art Speech Recognition With Sequence-to-Sequence Models. CoRR abs/1712.01769 (2017)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-01807
Tara N. Sainath, Chung-Cheng Chiu, Rohit Prabhavalkar, Anjuli Kannan, Yonghui Wu, Patrick Nguyen, Zhifeng Chen:
Improving the Performance of Online Neural Transducer Models. CoRR abs/1712.01807 (2017)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-01818
Rohit Prabhavalkar, Tara N. Sainath, Yonghui Wu, Patrick Nguyen, Zhifeng Chen, Chung-Cheng Chiu, Anjuli Kannan:
Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models. CoRR abs/1712.01818 (2017)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-01864
Tara N. Sainath, Rohit Prabhavalkar, Shankar Kumar, Seungji Lee, Anjuli Kannan, David Rybach, Vlad Schogol, Patrick Nguyen, Bo Li, Yonghui Wu, Zhifeng Chen, Chung-Cheng Chiu:
No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models. CoRR abs/1712.01864 (2017)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-01996
Anjuli Kannan, Yonghui Wu, Patrick Nguyen, Tara N. Sainath, Zhifeng Chen, Rohit Prabhavalkar:
An analysis of incorporating an external language model into a sequence-to-sequence model. CoRR abs/1712.01996 (2017)
2016
[c66]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathWWNB16
Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Arun Narayanan, Michiel Bacchiani:
Factored spatial and spectral multichannel raw waveform CLDNNs. ICASSP 2016: 5075-5079
[c65]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuSS16
Zhiyun Lu, Vikas Sindhwani, Tara N. Sainath:
Learning compact recurrent neural networks. ICASSP 2016: 5960-5964
[c64]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PundakS16
Golan Pundak, Tara N. Sainath:
Lower Frame Rate Neural Network Acoustic Models. INTERSPEECH 2016: 22-26
[c63]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VarianiSSB16
Ehsan Variani, Tara N. Sainath, Izhak Shafran, Michiel Bacchiani:
Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling. INTERSPEECH 2016: 808-812
[c62]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathL16
Tara N. Sainath, Bo Li:
Modeling Time-Frequency Patterns with LSTM vs. Convolutional Architectures for LVCSR Tasks. INTERSPEECH 2016: 813-817
[c61]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathNWVWBS16
Tara N. Sainath, Arun Narayanan, Ron J. Weiss, Ehsan Variani, Kevin W. Wilson, Michiel Bacchiani, Izhak Shafran:
Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction. INTERSPEECH 2016: 1971-1975
[c60]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiSWWB16
Bo Li, Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Michiel Bacchiani:
Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition. INTERSPEECH 2016: 1976-1980
[c59]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZazoSSP16
Rubén Zazo, Tara N. Sainath, Gabor Simko, Carolina Parada:
Feature Learning with Raw-Waveform CLDNNs for Voice Activity Detection. INTERSPEECH 2016: 3668-3672
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LuSS16
Zhiyun Lu, Vikas Sindhwani, Tara N. Sainath:
Learning Compact Recurrent Neural Networks. CoRR abs/1604.02594 (2016)
2015
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/nn/SainathKSSMDR15
Tara N. Sainath, Brian Kingsbury, George Saon, Hagen Soltau, Abdel-rahman Mohamed, George E. Dahl, Bhuvana Ramabhadran:
Deep Convolutional Neural Networks for Large-scale Speech Tasks. Neural Networks 64: 39-48 (2015)
[c58]
  persistent URL:
  - https://dblp.org/rec/conf/asru/SainathWWNBS15
Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Arun Narayanan, Michiel Bacchiani, Andrew W. Senior:
Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms. ASRU 2015: 30-36
[c57]
  persistent URL:
  - https://dblp.org/rec/conf/asru/SeniorSQSR15
Andrew W. Senior, Hasim Sak, Felix de Chaumont Quitry, Tara N. Sainath, Kanishka Rao:
Acoustic modelling with CD-CTC-SMBR LSTM RNNS. ASRU 2015: 604-609
[c56]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathVSS15
Tara N. Sainath, Oriol Vinyals, Andrew W. Senior, Hasim Sak:
Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks. ICASSP 2015: 4580-4584
[c55]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PrabhavalkarAPN15
Rohit Prabhavalkar, Raziel Alvarez, Carolina Parada, Preetum Nakkiran, Tara N. Sainath:
Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks. ICASSP 2015: 4704-4708
[c54]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenPS15
Guoguo Chen, Carolina Parada, Tara N. Sainath:
Query-by-example keyword spotting using long short-term memory networks. ICASSP 2015: 5236-5240
[c53]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathWSWV15
Tara N. Sainath, Ron J. Weiss, Andrew W. Senior, Kevin W. Wilson, Oriol Vinyals:
Learning the speech front-end with raw waveform CLDNNs. INTERSPEECH 2015: 1-5
[c52]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenLSVAP15
Yu-hsin Chen, Ignacio López-Moreno, Tara N. Sainath, Mirkó Visontai, Raziel Alvarez, Carolina Parada:
Locally-connected and convolutional neural networks for small footprint speaker recognition. INTERSPEECH 2015: 1136-1140
[c51]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathP15
Tara N. Sainath, Carolina Parada:
Convolutional neural networks for small-footprint keyword spotting. INTERSPEECH 2015: 1478-1482
[c50]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiaoPSCCJSSBB15
Hank Liao, Golan Pundak, Olivier Siohan, Melissa K. Carroll, Noah Coccaro, Qi-Ming Jiang, Tara N. Sainath, Andrew W. Senior, Françoise Beaufays, Michiel Bacchiani:
Large vocabulary automatic speech recognition for children. INTERSPEECH 2015: 1611-1615
[c49]
  persistent URL:
  - https://dblp.org/rec/conf/nips/SindhwaniSK15
Vikas Sindhwani, Tara N. Sainath, Sanjiv Kumar:
Structured Transforms for Small-Footprint Deep Learning. NIPS 2015: 3088-3096
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/SindhwaniSK15
Vikas Sindhwani, Tara N. Sainath, Sanjiv Kumar:
Structured Transforms for Small-Footprint Deep Learning. CoRR abs/1510.01722 (2015)
2014
[c48]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangASSR14
Po-Sen Huang, Haim Avron, Tara N. Sainath, Vikas Sindhwani, Bhuvana Ramabhadran:
Kernel methods match Deep Neural Networks on TIMIT. ICASSP 2014: 205-209
[c47]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PeddintiSMRNG14
Vijayaditya Peddinti, Tara N. Sainath, Shay Maymon, Bhuvana Ramabhadran, David Nahamoo, Vaibhava Goel:
Deep Scattering Spectrum with deep neural networks. ICASSP 2014: 210-214
[c46]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SoltauSS14
Hagen Soltau, George Saon, Tara N. Sainath:
Joint training of convolutional and non-convolutional neural networks. ICASSP 2014: 5572-5576
[c45]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathKMSR14
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George Saon, Bhuvana Ramabhadran:
Improvements to filterbank and delta learning within a deep neural network framework. ICASSP 2014: 6839-6843
[c44]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathPKFRN14
Tara N. Sainath, Vijayaditya Peddinti, Brian Kingsbury, Petr Fousek, Bhuvana Ramabhadran, David Nahamoo:
Deep scattering spectra with deep neural networks for LVCSR tasks. INTERSPEECH 2014: 900-904
[c43]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathCRPGKSAC14
Tara N. Sainath, I-Hsin Chung, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Brian Kingsbury, George Saon, Vernon Austel, Upendra V. Chaudhari:
Parallel deep neural network training for LVCSR tasks using Blue Gene/Q. INTERSPEECH 2014: 1048-1052
[c42]
  persistent URL:
  - https://dblp.org/rec/conf/sc/ChungSRPGACK14
I-Hsin Chung, Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Vernon Austel, Upendra V. Chaudhari, Brian Kingsbury:
Parallel Deep Neural Network Training for Big Data on Blue Gene/Q. SC 2014: 745-753
2013
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SainathKSR13
Tara N. Sainath, Brian Kingsbury, Hagen Soltau, Bhuvana Ramabhadran:
Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks. IEEE Trans. Audio Speech Lang. Process. 21(11): 2267-2276 (2013)
[c41]
  persistent URL:
  - https://dblp.org/rec/conf/asru/SainathKMR13
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, Bhuvana Ramabhadran:
Learning filter banks within a deep neural network framework. ASRU 2013: 297-302
[c40]
  persistent URL:
  - https://dblp.org/rec/conf/asru/SainathHKAR13
Tara N. Sainath, Lior Horesh, Brian Kingsbury, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Accelerating Hessian-free optimization for Deep Neural Networks by implicit preconditioning and sampling. ASRU 2013: 303-308
[c39]
  persistent URL:
  - https://dblp.org/rec/conf/asru/SainathKMDSSBAR13
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to Deep Convolutional Neural Networks for LVCSR. ASRU 2013: 315-320
[c38]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathKSAR13
Tara N. Sainath, Brian Kingsbury, Vikas Sindhwani, Ebru Arisoy, Bhuvana Ramabhadran:
Low-rank matrix factorization for Deep Neural Network training with high-dimensional output targets. ICASSP 2013: 6655-6659
[c37]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CuiCRKKMMPSS13
Jia Cui, Xiaodong Cui, Bhuvana Ramabhadran, Janice Kim, Brian Kingsbury, Jonathan Mamou, Lidia Mangu, Michael Picheny, Tara N. Sainath, Abhinav Sethy:
Developing speech recognition systems for corpus indexing under the IARPA Babel program. ICASSP 2013: 6753-6757
[c36]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PrabhavalkarSNRK13
Rohit Prabhavalkar, Tara N. Sainath, David Nahamoo, Bhuvana Ramabhadran, Dimitri Kanevsky:
An evaluation of posterior modeling techniques for phonetic recognition. ICASSP 2013: 7165-7169
[c35]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DahlSH13
George E. Dahl, Tara N. Sainath, Geoffrey E. Hinton:
Improving deep neural networks for LVCSR using rectified linear units and dropout. ICASSP 2013: 8609-8613
[c34]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathMKR13
Tara N. Sainath, Abdel-rahman Mohamed, Brian Kingsbury, Bhuvana Ramabhadran:
Deep convolutional neural networks for LVCSR. ICASSP 2013: 8614-8618
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/SainathKMDSSBAR13
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to deep convolutional neural networks for LVCSR. CoRR abs/1309.1501 (2013)
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/SainathHKAR13
Tara N. Sainath, Lior Horesh, Brian Kingsbury, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improving training time of Hessian-free optimization for deep neural networks using preconditioning and sampling. CoRR abs/1309.1508 (2013)
2012
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/spm/SainathRNKCDGBS12
Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo, Dimitri Kanevsky, Dirk Van Compernolle, Kris Demuynck, Jort F. Gemmeke, Jerome R. Bellegarda, Shiva Sundaram:
Exemplar-Based Processing for Speech Recognition: An Overview. IEEE Signal Process. Mag. 29(6): 98-113 (2012)
[c33]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ItohSJZR12
Nobuyasu Itoh, Tara N. Sainath, Dan-Ning Jiang, Jie Zhou, Bhuvana Ramabhadran:
N-best entropy based data selection for acoustic modeling. ICASSP 2012: 4133-4136
[c32]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathKR12
Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran:
Auto-encoder bottleneck features using deep belief networks. ICASSP 2012: 4153-4156
[c31]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PlahlSRN12
Christian Plahl, Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo:
Improved pre-training of Deep Belief Networks using Sparse Encoding Symmetric Machines. ICASSP 2012: 4165-4168
[c30]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KingsburySS12
Brian Kingsbury, Tara N. Sainath, Hagen Soltau:
Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization. INTERSPEECH 2012: 10-13
[c29]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathNKR12
Tara N. Sainath, David Nahamoo, Dimitri Kanevsky, Bhuvana Ramabhadran:
Enhancing Exemplar-Based Posteriors for Speech Recognition Tasks. INTERSPEECH 2012: 2130-2133
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/naacl/ArisoySKR12
Ebru Arisoy, Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran:
Deep Neural Network Language Models. WLM@NAACL-HLT 2012: 20-28
2011
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SainathRPNK11
Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, David Nahamoo, Dimitri Kanevsky:
Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR. IEEE ACM Trans. Audio Speech Lang. Process. 19(8): 2598-2613 (2011)
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/asru/SainathKRFNM11
Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran, Petr Fousek, Petr Novák, Abdel-rahman Mohamed:
Making Deep Belief Networks effective for large vocabulary continuous speech recognition. ASRU 2011: 30-35
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/asru/SainathNKRS11
Tara N. Sainath, David Nahamoo, Dimitri Kanevsky, Bhuvana Ramabhadran, Parikshit M. Shah:
A convex hull approach to sparse representations for exemplar-based speech recognition. ASRU 2011: 59-64
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathNRKGS11
Tara N. Sainath, David Nahamoo, Bhuvana Ramabhadran, Dimitri Kanevsky, Vaibhava Goel, Parikshit M. Shah:
Exemplar-based Sparse Representation phone identification features. ICASSP 2011: 4492-4495
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangSSR11
Bin Zhang, Abhinav Sethy, Tara N. Sainath, Bhuvana Ramabhadran:
Application specific loss minimization using gradient boosting. ICASSP 2011: 4880-4883
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MohamedSDRHP11
Abdel-rahman Mohamed, Tara N. Sainath, George E. Dahl, Bhuvana Ramabhadran, Geoffrey E. Hinton, Michael A. Picheny:
Deep Belief Networks using discriminative features for phone recognition. ICASSP 2011: 5060-5063
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KanevskyNSRO11
Dimitri Kanevsky, David Nahamoo, Tara N. Sainath, Bhuvana Ramabhadran, Peder A. Olsen:
A-Functions: A generalization of Extended Baum-Welch transformations to convex optimization. ICASSP 2011: 5164-5167
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathRNK11
Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo, Dimitri Kanevsky:
Reducing Computational Complexities of Exemplar-Based Sparse Representations with Applications to Large Vocabulary Speech Recognition. INTERSPEECH 2011: 785-788
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KanevskyNSR11
Dimitri Kanevsky, David Nahamoo, Tara N. Sainath, Bhuvana Ramabhadran:
Convergence of Line Search A-Function Methods. INTERSPEECH 2011: 997-1000
2010
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/fusion/KanevskyCHGRS10
Dimitri Kanevsky, Avishy Carmi, Lior Horesh, Pini Gurfil, Bhuvana Ramabhadran, Tara N. Sainath:
Kalman filtering for compressed sensing. FUSION 2010: 1-8
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CarmiSGKNR10
Avishy Carmi, Tara N. Sainath, Pini Gurfil, Dimitri Kanevsky, David Nahamoo, Bhuvana Ramabhadran:
The Use of isometric transformations and Bayesian estimation in compressive sensing for fMRI classification. ICASSP 2010: 493-496
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathCKR10
Tara N. Sainath, Avishy Carmi, Dimitri Kanevsky, Bhuvana Ramabhadran:
Bayesian compressive sensing for phonetic classification. ICASSP 2010: 4370-4373
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/icra/TellerWACDFFGHHJKLRS10
Seth J. Teller, Matthew R. Walter, Matthew E. Antone, Andrew Correa, Randall Davis, Luke Fletcher, Emilio Frazzoli, Jim Glass, Jonathan P. How, Albert S. Huang, Jeong hwan Jeon, Sertac Karaman, Brandon Luders, Nicholas Roy, Tara N. Sainath:
A voice-commandable robotic forklift working alongside humans in minimally-prepared outdoor environments. ICRA 2010: 526-533
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GoelSRONK10
Vaibhava Goel, Tara N. Sainath, Bhuvana Ramabhadran, Peder A. Olsen, David Nahamoo, Dimitri Kanevsky:
Incorporating sparse representation phone identification features in automatic speech recognition using exponential families. INTERSPEECH 2010: 1345-1348
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathRNKS10
Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo, Dimitri Kanevsky, Abhinav Sethy:
Sparse representation features for speech recognition. INTERSPEECH 2010: 2254-2257
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SethySRK10
Abhinav Sethy, Tara N. Sainath, Bhuvana Ramabhadran, Dimitri Kanevsky:
Data selection for language modeling using sparse representations. INTERSPEECH 2010: 2258-2261
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathMKRNH10
Tara N. Sainath, Sameer Maskey, Dimitri Kanevsky, Bhuvana Ramabhadran, David Nahamoo, Julia Hirschberg:
Sparse representations for text categorization. INTERSPEECH 2010: 2266-2269
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KanevskySRN10
Dimitri Kanevsky, Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo:
An analysis of sparseness and regularization in exemplar-based methods for speech classification. INTERSPEECH 2010: 2842-2845

2000 – 2009

2009
[b1]
  persistent URL:
  - https://dblp.org/rec/phd/ndltd/Sainath09
Tara N. Sainath:
Applications of broad class knowledge for noise robust speech recognition. Massachusetts Institute of Technology, Cambridge, MA, USA, 2009
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/asru/Sainath09
Tara N. Sainath:
Island-driven search using broad phonetic classes. ASRU 2009: 287-292
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/asru/SainathRP09
Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny:
An exploration of large vocabulary tools for small vocabulary phonetic recognition. ASRU 2009: 359-364
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KanevskySR09
Dimitri Kanevsky, Tara N. Sainath, Bhuvana Ramabhadran:
A generalized family of parameter estimation techniques. ICASSP 2009: 1725-1728
2008
[c7]
Tara N. Sainath, Dimitri Kanevsky, Bhuvana Ramabhadran:
Gradient steepness metrics using extended Baum-Welch transformations for universal pattern recognition tasks. ICASSP 2008: 4533-4536
[c6]
Dimitri Kanevsky, Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo:
Generalization of extended Baum-Welch parameter estimation for discriminative training and decoding. INTERSPEECH 2008: 277-280
[c5]
Tara N. Sainath, Victor Zue:
A comparison of broad phonetic and acoustic units for noise robust segment-based phonetic recognition. INTERSPEECH 2008: 2378-2381
2007
[c4]
Tara N. Sainath, Dimitri Kanevsky, Bhuvana Ramabhadran:
Broad phonetic class recognition in a Hidden Markov model framework using extended Baum-Welch transformations. ASRU 2007: 306-311
[c3]
Tara N. Sainath, Dimitri Kanevsky, Giridharan Iyengar:
Unsupervised Audio Segmentation using Extended Baum-Welch Transformations. ICASSP (1) 2007: 209-212
[c2]
Tara N. Sainath, Victor Zue, Dimitri Kanevsky:
Audio classification using extended Baum-Welch transformations. INTERSPEECH 2007: 2969-2972
2006
[c1]
Tara N. Sainath, Timothy J. Hazen:
A Sinusoidal Model Approach to Acoustic Landmark Detection and Segmentation for Robust Segment-Based Speech Recognition. ICASSP (1) 2006: 525-528
