Jay Mahadeokar

2020 – today

2025
[c33]
https://dblp.org/rec/conf/icassp/JiaKZL0WSMK25
Junteng Jia, Gil Keren, Wei Zhou, Egor Lakomkin, Xiaohui Zhang, Chunyang Wu, Frank Seide, Jay Mahadeokar, Ozlem Kalinli:
Efficient Streaming LLM for Speech Recognition. ICASSP 2025: 1-5
[c32]
https://dblp.org/rec/conf/icassp/RajKJMK25
Desh Raj, Gil Keren, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Faster Speech-LLaMA Inference with Multi-token Prediction. ICASSP 2025: 1-5
[c31]
https://dblp.org/rec/conf/icassp/YadavKRZJLXWMK25
Amit Kumar Singh Yadav, Gil Keren, Desh Raj, Wei Zhou, Junteng Jia, Ke Li, Ying Xu, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli:
Speech-N-LlaMA: Improving Speech LLMs with Multi-Pass Training. ICASSP 2025: 1-5
[c30]
https://dblp.org/rec/conf/icassp/YangRLMJKLHDMK25
Yufeng Yang, Desh Raj, Ju Lin, Niko Moritz, Junteng Jia, Gil Keren, Egor Lakomkin, Yiteng Huang, Jacob Donley, Jay Mahadeokar, Ozlem Kalinli:
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses. ICASSP 2025: 1-5
[c29]
https://dblp.org/rec/conf/icassp/ZhouJSMK25
Wei Zhou, Junteng Jia, Leda Sari, Jay Mahadeokar, Ozlem Kalinli:
CJST: CTC Compressor based Joint Speech and Text Training for Decoder-Only ASR. ICASSP 2025: 1-5
2024
[c28]
https://dblp.org/rec/conf/icassp/ShangguanYLWFWD24
Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-Device ASR Models. ICASSP 2024: 10216-10220
[c27]
https://dblp.org/rec/conf/icassp/XieLGTSSWJMK24
Jiamin Xie, Ke Li, Jinxi Guo, Andros Tjandra, Yuan Shangguan, Leda Sari, Chunyang Wu, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of a Multilingual ASR Model. ICASSP 2024: 12201-12205
[c26]
https://dblp.org/rec/conf/icassp/GuoMMSWMKFS24
Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective Internal Language Model Training and Fusion for Factorized Transducer Model. ICASSP 2024: 12687-12691
[c25]
https://dblp.org/rec/conf/icassp/FathullahWLJSLG24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. ICASSP 2024: 13351-13355
[c24]
https://dblp.org/rec/conf/naacl/FathullahWLLJSM24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Ke Li, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs. NAACL-HLT 2024: 5522-5532
[i35]
https://dblp.org/rec/journals/corr/abs-2404-01716
Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective internal language model training and fusion for factorized transducer model. CoRR abs/2404.01716 (2024)
[i34]
https://dblp.org/rec/journals/corr/abs-2407-16664
Laxmi Pandey, Ke Li, Jinxi Guo, Debjyoti Paul, Arthur Guo, Jay Mahadeokar, Xuedong Zhang:
Towards scalable efficient on-device ASR with transfer learning. CoRR abs/2407.16664 (2024)
[i33]
https://dblp.org/rec/journals/corr/abs-2407-21783
Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurélien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Rozière, Bethany Biron, Binh Tang, Bobbie Chern, Charlotte Caucheteux, Chaya Nayak, Chloe Bi, Chris Marra, Chris McConnell, Christian Keller, Christophe Touret, Chunyang Wu, Corinne Wong, Cristian Canton Ferrer, Cyrus Nikolaidis, Damien Allonsius, Daniel Song, Danielle Pintz, Danny Livshits, David Esiobu, Dhruv Choudhary, Dhruv Mahajan, Diego Garcia-Olano, Diego Perino, Dieuwke Hupkes, Egor Lakomkin, Ehab AlBadawy, Elina Lobanova, Emily Dinan, Eric Michael Smith, Filip Radenovic, Frank Zhang, Gabriel Synnaeve, Gabrielle Lee, Georgia Lewis Anderson, Graeme Nail, Grégoire Mialon, Guan Pang, Guillem Cucurell, Hailey Nguyen, Hannah Korevaar, Hu Xu, Hugo Touvron, Iliyan Zarov, Imanol Arrieta Ibarra, Isabel M. Kloumann, Ishan Misra, Ivan Evtimov, Jade Copet, Jaewon Lee, Jan Geffert, Jana Vranes, Jason Park, Jay Mahadeokar, Jeet Shah, Jelmer van der Linde, Jennifer Billock, Jenny Hong, Jenya Lee, Jeremy Fu, Jianfeng Chi, Jianyu Huang, Jiawen Liu, Jie Wang, Jiecao Yu, Joanna Bitton, Joe Spisak, Jongsoo Park, Joseph Rocca, Joshua Johnstun, Joshua Saxe, Junteng Jia, Kalyan Vasuden Alwala, Kartikeya Upasani, Kate Plawiak, Ke Li, Kenneth Heafield, Kevin Stone, et al.:
The Llama 3 Herd of Models. CoRR abs/2407.21783 (2024)
[i32]
https://dblp.org/rec/journals/corr/abs-2409-08148
Desh Raj, Gil Keren, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Faster Speech-LLaMA Inference with Multi-token Prediction. CoRR abs/2409.08148 (2024)
[i31]
https://dblp.org/rec/journals/corr/abs-2409-11494
Yufeng Yang, Desh Raj, Ju Lin, Niko Moritz, Junteng Jia, Gil Keren, Egor Lakomkin, Yiteng Huang, Jacob Donley, Jay Mahadeokar, Ozlem Kalinli:
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses. CoRR abs/2409.11494 (2024)
[i30]
https://dblp.org/rec/journals/corr/abs-2410-01162
Wonjune Kang, Junteng Jia, Chunyang Wu, Wei Zhou, Egor Lakomkin, Yashesh Gaur, Leda Sari, Suyoun Kim, Ke Li, Jay Mahadeokar, Ozlem Kalinli:
Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech. CoRR abs/2410.01162 (2024)
[i29]
https://dblp.org/rec/journals/corr/abs-2410-03752
Junteng Jia, Gil Keren, Wei Zhou, Egor Lakomkin, Xiaohui Zhang, Chunyang Wu, Frank Seide, Jay Mahadeokar, Ozlem Kalinli:
Efficient Streaming LLM for Speech Recognition. CoRR abs/2410.03752 (2024)
[i28]
https://dblp.org/rec/journals/corr/abs-2411-07607
Wei Zhou, Junteng Jia, Leda Sari, Jay Mahadeokar, Ozlem Kalinli:
CJST: CTC Compressor based Joint Speech and Text Training for Decoder-Only ASR. CoRR abs/2411.07607 (2024)
2023
[c23]
https://dblp.org/rec/conf/asru/JiaLMMMKS23
Junteng Jia, Ke Li, Mani Malek, Kshitiz Malik, Jay Mahadeokar, Ozlem Kalinli, Frank Seide:
Joint Federated Learning and Personalization for on-Device ASR. ASRU 2023: 1-8
[c22]
https://dblp.org/rec/conf/icassp/LiMGSKKSL23
Ke Li, Jay Mahadeokar, Jinxi Guo, Yangyang Shi, Gil Keren, Ozlem Kalinli, Michael L. Seltzer, Duc Le:
Improving fast-slow Encoder based Transducer with Streaming Deliberation. ICASSP 2023: 1-5
[c21]
https://dblp.org/rec/conf/icassp/LiangSSMPZTS23
Dawei Liang, Hang Su, Tarun Singh, Jay Mahadeokar, Shanil Puri, Jiedan Zhu, Edison Thomaz, Mike Seltzer:
Dynamic Speech Endpoint Detection with Regression Targets. ICASSP 2023: 1-5
[c20]
https://dblp.org/rec/conf/icassp/RajJMWMZK23
Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. ICASSP 2023: 1-5
[c19]
https://dblp.org/rec/conf/interspeech/FathullahWSJXML23
Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. INTERSPEECH 2023: 241-245
[c18]
https://dblp.org/rec/conf/nips/LeVSKSMWMAMH23
Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu:
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale. NeurIPS 2023
[i27]
https://dblp.org/rec/journals/corr/abs-2305-12498
Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. CoRR abs/2305.12498 (2023)
[i26]
https://dblp.org/rec/journals/corr/abs-2306-00998
Shuo Liu, Leda Sari, Chunyang Wu, Gil Keren, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli:
Towards Selection of Text-to-speech Data to Augment ASR Training. CoRR abs/2306.00998 (2023)
[i25]
https://dblp.org/rec/journals/corr/abs-2306-15687
Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu:
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale. CoRR abs/2306.15687 (2023)
[i24]
https://dblp.org/rec/journals/corr/abs-2307-11795
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. CoRR abs/2307.11795 (2023)
[i23]
https://dblp.org/rec/journals/corr/abs-2309-01947
Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models. CoRR abs/2309.01947 (2023)
[i22]
https://dblp.org/rec/journals/corr/abs-2309-13018
Jiamin Xie, Ke Li, Jinxi Guo, Andros Tjandra, Yuan Shangguan, Leda Sari, Chunyang Wu, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model. CoRR abs/2309.13018 (2023)
[i21]
https://dblp.org/rec/journals/corr/abs-2311-06753
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data. CoRR abs/2311.06753 (2023)
2022
[c17]
https://dblp.org/rec/conf/icassp/ShiWWXMZLLSNKS22
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution. ICASSP 2022: 8277-8281
[c16]
https://dblp.org/rec/conf/interspeech/JiaMZSKS22
Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem Kalinli, Frank Seide:
Federated Domain Adaptation for ASR with Full Self-Supervision. INTERSPEECH 2022: 536-540
[c15]
https://dblp.org/rec/conf/interspeech/MahadeokarSLLZC22
Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast slow cascaded encoders. INTERSPEECH 2022: 2083-2087
[c14]
https://dblp.org/rec/conf/slt/MoritzSLMF22
Niko Moritz, Frank Seide, Duc Le, Jay Mahadeokar, Christian Fuegen:
An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition. SLT 2022: 324-330
[i20]
https://dblp.org/rec/journals/corr/abs-2203-15773
Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast-slow cascaded encoders. CoRR abs/2203.15773 (2022)
[i19]
https://dblp.org/rec/journals/corr/abs-2203-15966
Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem Kalinli, Frank Seide:
Federated Domain Adaptation for ASR with Full Self-Supervision. CoRR abs/2203.15966 (2022)
[i18]
https://dblp.org/rec/journals/corr/abs-2204-08858
Niko Moritz, Frank Seide, Duc Le, Jay Mahadeokar, Christian Fuegen:
An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition. CoRR abs/2204.08858 (2022)
[i17]
https://dblp.org/rec/journals/corr/abs-2210-11588
Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. CoRR abs/2210.11588 (2022)
[i16]
https://dblp.org/rec/journals/corr/abs-2210-14252
Dawei Liang, Hang Su, Tarun Singh, Jay Mahadeokar, Shanil Puri, Jiedan Zhu, Edison Thomaz, Mike Seltzer:
Dynamic Speech Endpoint Detection with Regression Targets. CoRR abs/2210.14252 (2022)
2021
[c13]
https://dblp.org/rec/conf/icassp/KimSMBFSL21
Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le:
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer. ICASSP 2021: 7333-7337
[c12]
https://dblp.org/rec/conf/icassp/VenkateshVMSFSC21
Ganesh Venkatesh, Alagappan Valliappan, Jay Mahadeokar, Yuan Shangguan, Christian Fuegen, Michael L. Seltzer, Vikas Chandra:
Memory-Efficient Speech Recognition on Smart Devices. ICASSP 2021: 8368-8372
[c11]
https://dblp.org/rec/conf/interspeech/LeJKKSMCSFKSS21
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. Interspeech 2021: 1772-1776
[c10]
https://dblp.org/rec/conf/interspeech/ShiNWMLPXYCFKS21
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency. Interspeech 2021: 2042-2046
[c9]
https://dblp.org/rec/conf/interspeech/MahadeokarSSWXS21
Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. Interspeech 2021: 2107-2111
[c8]
https://dblp.org/rec/conf/interspeech/ShangguanPSMSZW21
Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. Interspeech 2021: 4553-4557
[c7]
https://dblp.org/rec/conf/slt/MahadeokarSLKSL21
Jay Mahadeokar, Yuan Shangguan, Duc Le, Gil Keren, Hang Su, Thong Le, Ching-Feng Yeh, Christian Fuegen, Michael L. Seltzer:
Alignment Restricted Streaming Recurrent Neural Network Transducer. SLT 2021: 52-59
[c6]
https://dblp.org/rec/conf/slt/LeKCMFS21
Duc Le, Gil Keren, Julian Chan, Jay Mahadeokar, Christian Fuegen, Michael L. Seltzer:
Deep Shallow Fusion for RNN-T Personalization. SLT 2021: 251-257
[i15]
https://dblp.org/rec/journals/corr/abs-2102-11531
Ganesh Venkatesh, Alagappan Valliappan, Jay Mahadeokar, Yuan Shangguan, Christian Fuegen, Michael L. Seltzer, Vikas Chandra:
Memory-efficient Speech Recognition on Smart Devices. CoRR abs/2102.11531 (2021)
[i14]
https://dblp.org/rec/journals/corr/abs-2104-02176
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency. CoRR abs/2104.02176 (2021)
[i13]
https://dblp.org/rec/journals/corr/abs-2104-02194
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. CoRR abs/2104.02194 (2021)
[i12]
https://dblp.org/rec/journals/corr/abs-2104-02207
Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. CoRR abs/2104.02207 (2021)
[i11]
https://dblp.org/rec/journals/corr/abs-2104-02232
Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. CoRR abs/2104.02232 (2021)
[i10]
https://dblp.org/rec/journals/corr/abs-2110-05241
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution. CoRR abs/2110.05241 (2021)
[i9]
https://dblp.org/rec/journals/corr/abs-2110-15018
Yao-Yuan Yang, Moto Hira, Zhaoheng Ni, Anjali Chourdia, Artyom Astafurov, Caroline Chen, Ching-Feng Yeh, Christian Puhrsch, David Pollack, Dmitriy Genzel, Donny Greenberg, Edward Z. Yang, Jason Lian, Jay Mahadeokar, Jeff Hwang, Ji Chen, Peter Goldsborough, Prabhat Roy, Sean Narenthiran, Shinji Watanabe, Soumith Chintala, Vincent Quenneville-Bélair, Yangyang Shi:
TorchAudio: Building Blocks for Audio and Speech Processing. CoRR abs/2110.15018 (2021)
2020
[c5]
https://dblp.org/rec/conf/icassp/WangMLLXMHTZZFZ20
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-Based Acoustic Modeling for Hybrid Speech Recognition. ICASSP 2020: 6874-6878
[c4]
https://dblp.org/rec/conf/icassp/HeLZMKF20
Weipeng He, Lu Lu, Biqiao Zhang, Jay Mahadeokar, Kaustubh Kalgaonkar, Christian Fuegen:
Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks. ICASSP 2020: 7499-7503
[c3]
https://dblp.org/rec/conf/interspeech/JainKMZMS20
Mahaveer Jain, Gil Keren, Jay Mahadeokar, Geoffrey Zweig, Florian Metze, Yatharth Saraf:
Contextual RNN-T for Open Domain ASR. INTERSPEECH 2020: 11-15
[i8]
https://dblp.org/rec/journals/corr/abs-2006-03411
Mahaveer Jain, Gil Keren, Jay Mahadeokar, Yatharth Saraf:
Contextual RNN-T For Open Domain ASR. CoRR abs/2006.03411 (2020)
[i7]
https://dblp.org/rec/journals/corr/abs-2010-13878
Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le:
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer. CoRR abs/2010.13878 (2020)
[i6]
https://dblp.org/rec/journals/corr/abs-2011-03072
Jay Mahadeokar, Yuan Shangguan, Duc Le, Gil Keren, Hang Su, Thong Le, Ching-Feng Yeh, Christian Fuegen, Michael L. Seltzer:
Alignment Restricted Streaming Recurrent Neural Network Transducer. CoRR abs/2011.03072 (2020)
[i5]
https://dblp.org/rec/journals/corr/abs-2011-07754
Duc Le, Gil Keren, Julian Chan, Jay Mahadeokar, Christian Fuegen, Michael L. Seltzer:
Deep Shallow Fusion for RNN-T Personalization. CoRR abs/2011.07754 (2020)

2010 – 2019

2019
[i4]
https://dblp.org/rec/journals/corr/abs-1910-09799
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-based Acoustic Modeling for Hybrid Speech Recognition. CoRR abs/1910.09799 (2019)
[i3]
https://dblp.org/rec/journals/corr/abs-1910-12977
Ching-Feng Yeh, Jay Mahadeokar, Kaustubh Kalgaonkar, Yongqiang Wang, Duc Le, Mahaveer Jain, Kjell Schubert, Christian Fuegen, Michael L. Seltzer:
Transformer-Transducer: End-to-End Speech Recognition with Self-Attention. CoRR abs/1910.12977 (2019)
[i2]
https://dblp.org/rec/journals/corr/abs-1911-01629
Mahaveer Jain, Kjell Schubert, Jay Mahadeokar, Ching-Feng Yeh, Kaustubh Kalgaonkar, Anuroop Sriram, Christian Fuegen, Michael L. Seltzer:
RNN-T For Latency Controlled ASR With Improved Beam Search. CoRR abs/1911.01629 (2019)
[i1]
https://dblp.org/rec/journals/corr/abs-1911-02115
Weipeng He, Lu Lu, Biqiao Zhang, Jay Mahadeokar, Kaustubh Kalgaonkar, Christian Fuegen:
Spatial Attention for Far-field Speech Recognition with Deep Beamforming Neural Networks. CoRR abs/1911.02115 (2019)
2014
[j2]
https://dblp.org/rec/journals/jco/MahadeokarS14
Jay Mahadeokar, Sanjeev Saxena:
Faster algorithm to find anti-risk path between two nodes of an undirected graph. J. Comb. Optim. 27(4): 798-807 (2014)
[c2]
https://dblp.org/rec/conf/www/JainM14
Vidit Jain, Jay Mahadeokar:
Short-text representation using diffusion wavelets. WWW (Companion Volume) 2014: 301-302
2013
[j1]
https://dblp.org/rec/journals/jda/MahadeokarS13
Jay Mahadeokar, Sanjeev Saxena:
Faster replacement paths algorithms in case of edge or node failure for undirected, positive integer weighted graphs. J. Discrete Algorithms 23: 54-62 (2013)
2012
[c1]
https://dblp.org/rec/conf/iwoca/MahadeokarS12
Jay Mahadeokar, Sanjeev Saxena:
Faster Replacement Paths Algorithm for Undirected, Positive Integer Weighted Graphs with Small Diameter. IWOCA 2012: 81-85
