default search action
Sanjeev Khudanpur
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2023
- [j27]Matthew Maciejewski, Jing Shi, Shinji Watanabe, Sanjeev Khudanpur:
A dilemma of ground truth in noisy speech separation and an approach to lessen the impact of imperfect training data. Comput. Speech Lang. 77: 101410 (2023) - [j26]Desh Raj, Daniel Povey, Sanjeev Khudanpur:
SURT 2.0: Advances in Transducer-Based Multi-Talker Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3800-3813 (2023) - 2022
- [j25]Amy Lynne Shelton, E. Emory Davis, Cathryn S. Cortesa, Jonathan D. Jones, Gregory D. Hager, Sanjeev Khudanpur, Barbara Landau:
Characterizing the Details of Spatial Construction: Cognitive Constraints and Variability. Cogn. Sci. 46(1) (2022) - [j24]Zili Huang, Marc Delcroix, Leibny Paola García-Perera, Shinji Watanabe, Desh Raj, Sanjeev Khudanpur:
Joint speaker diarization and speech recognition based on region proposal networks. Comput. Speech Lang. 72: 101316 (2022) - [j23]Hexin Liu, Leibny Paola García-Perera, Andy W. H. Khong, Eng Siong Chng, Suzy J. Styles, Sanjeev Khudanpur:
Efficient Self-Supervised Learning Representations for Spoken Language Identification. IEEE J. Sel. Top. Signal Process. 16(6): 1296-1307 (2022) - 2021
- [j22]Jonathan D. Jones, Cathryn S. Cortesa, Amy Lynne Shelton, Barbara Landau, Sanjeev Khudanpur, Gregory D. Hager:
Fine-Grained Activity Recognition for Assembly Videos. IEEE Robotics Autom. Lett. 6(2): 3728-3735 (2021) - [j21]Hang Lv, Daniel Povey, Mahsa Yarmohammadi, Ke Li, Yiming Wang, Lei Xie, Sanjeev Khudanpur:
LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation. IEEE Signal Process. Lett. 28: 703-707 (2021) - 2018
- [j20]Vijayaditya Peddinti, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
Low Latency Acoustic Modeling Using Temporal Convolution and LSTMs. IEEE Signal Process. Lett. 25(3): 373-377 (2018) - [j19]Hossein Hadian, Hossein Sameti, Daniel Povey, Sanjeev Khudanpur:
Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR. IEEE ACM Trans. Audio Speech Lang. Process. 26(11): 1949-1961 (2018) - 2017
- [j18]Narges Ahmidi, Lingling Tao, Shahin Sefati, Yixin Gao, Colin Lea, Benjamín Béjar Haro, Luca Zappella, Sanjeev Khudanpur, René Vidal, Gregory D. Hager:
A Dataset and Benchmarks for Segmentation and Recognition of Gestures in Robotic Surgery. IEEE Trans. Biomed. Eng. 64(9): 2025-2041 (2017) - 2016
- [j17]Yixin Gao, S. Swaroop Vedula, Gyusung I. Lee, Mija R. Lee, Sanjeev Khudanpur, Gregory D. Hager:
Query-by-example surgical activity detection. Int. J. Comput. Assist. Radiol. Surg. 11(6): 987-996 (2016) - [j16]Scott Novotney, Richard M. Schwartz, Sanjeev Khudanpur:
Getting more from automatic transcripts for semi-supervised language modeling. Comput. Speech Lang. 36: 93-109 (2016) - 2011
- [j15]Balakrishnan Varadarajan, Sanjeev Khudanpur, Trac D. Tran:
Stepwise Optimal Subspace Pursuit for Improving Sparse Recovery. IEEE Signal Process. Lett. 18(1): 27-30 (2011) - 2010
- [j14]Christopher M. White, Sanjeev Khudanpur, Patrick J. Wolfe:
Likelihood-Based Semi-Supervised Model Selection With Applications to Speech Processing. IEEE J. Sel. Top. Signal Process. 4(6): 1016-1026 (2010) - 2009
- [j13]Zhifei Li, Chris Callison-Burch, Sanjeev Khudanpur, Wren N. G. Thornton:
Decoding in JoshuaOpen Source, Parsing-Based Machine Translation. Prague Bull. Math. Linguistics 91: 47-56 (2009) - [j12]Janet M. Baker, Li Deng, James R. Glass, Sanjeev Khudanpur, Chin-Hui Lee, Nelson Morgan, Douglas D. O'Shaughnessy:
Developments and directions in speech recognition and understanding, Part 1 [DSP Education]. IEEE Signal Process. Mag. 26(3): 75-80 (2009) - [j11]Janet M. Baker, Li Deng, Sanjeev Khudanpur, Chin-Hui Lee, James R. Glass, Nelson Morgan, Douglas D. O'Shaughnessy:
Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education]. IEEE Signal Process. Mag. 26(4): 78-85 (2009) - 2005
- [j10]Bruno Jedynak, Sanjeev Khudanpur:
Maximum Likelihood Set for Estimating a Probability Mass Function. Neural Comput. 17(7): 1508-1530 (2005) - 2004
- [j9]Sanjeev Khudanpur, Woosung Kim:
Contemporaneous text as side-information in statistical language modeling. Comput. Speech Lang. 18(2): 143-162 (2004) - [j8]Helen M. Meng, Berlin Chen, Sanjeev Khudanpur, Gina-Anne Levow, Wai Kit Lo, Douglas W. Oard, Patrick Schone, Karen Tang, Hsin-Min Wang, Jianqiang Wang:
Mandarin-English Information (MEI): investigating translingual speech retrieval. Comput. Speech Lang. 18(2): 163-179 (2004) - [j7]Murat Saraclar, Sanjeev Khudanpur:
Pronunciation change in conversational speech and its implications for automatic speech recognition. Comput. Speech Lang. 18(4): 375-395 (2004) - [j6]Woosung Kim, Sanjeev Khudanpur:
Lexical triggers and latent semantic analysis for cross-lingual language model adaptation. ACM Trans. Asian Lang. Inf. Process. 3(2): 94-112 (2004) - 2003
- [j5]Daqing He, Douglas W. Oard, Jianqiang Wang, Jun Luo, Dina Demner-Fushman, Kareem Darwish, Philip Resnik, Sanjeev Khudanpur, Michael Nossal, Michael Subotin, Anton Leuski:
Making MIRACLEs: Interactive translingual search for Cebuano and Hindi. ACM Trans. Asian Lang. Inf. Process. 2(3): 219-244 (2003) - 2002
- [j4]Sanjeev Khudanpur, Prakash Narayan:
Order estimation for a special class of hidden Markov sources and binary renewal processes. IEEE Trans. Inf. Theory 48(6): 1704-1713 (2002) - 2000
- [j3]Murat Saraclar, Harriet J. Nock, Sanjeev Khudanpur:
Pronunciation modeling by sharing Gaussian densities across phonetic models. Comput. Speech Lang. 14(2): 137-160 (2000) - [j2]Sanjeev Khudanpur, Jun Wu:
Maximum entropy techniques for exploiting syntactic, semantic and collocational dependencies in language modeling. Comput. Speech Lang. 14(4): 355-372 (2000) - 1999
- [j1]Michael Riley, William Byrne, Michael Finke, Sanjeev Khudanpur, Andrej Ljolje, John W. McDonough, Harriet J. Nock, Murat Saraclar, Charles Wooters, George Zavaliagkos:
Stochastic pronunciation modelling from hand-labelled phonetic corpora. Speech Commun. 29(2-4): 209-224 (1999)
Conference and Workshop Papers
- 2024
- [c234]Ruizhe Huang, Mahsa Yarmohammadi, Jan Trmal, Jing Liu, Desh Raj, Leibny Paola García, Alexei V. Ivanov, Patrick Ehlen, Mingzhi Yu, Dan Povey, Sanjeev Khudanpur:
ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition. LREC/COLING 2024: 3700-3706 - [c233]Hexin Liu, Leibny Paola García, Xiangyu Zhang, Andy W. H. Khong, Sanjeev Khudanpur:
Enhancing Code-Switching Speech Recognition With Interactive Language Biases. ICASSP 2024: 10886-10890 - [c232]Ruizhe Huang, Xiaohui Zhang, Zhaoheng Ni, Li Sun, Moto Hira, Jeff Hwang, Vimal Manohar, Vineel Pratap, Matthew Wiesner, Shinji Watanabe, Daniel Povey, Sanjeev Khudanpur:
Less Peaky and More Accurate CTC Forced Alignment by Label Priors. ICASSP 2024: 11831-11835 - [c231]Amir Hussein, Brian Yan, Antonios Anastasopoulos, Shinji Watanabe, Sanjeev Khudanpur:
Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization. ICASSP 2024: 11971-11975 - [c230]Amir Hussein, Dorsa Zeinali, Ondrej Klejch, Matthew Wiesner, Brian Yan, Shammur Absar Chowdhury, Ahmed Ali, Shinji Watanabe, Sanjeev Khudanpur:
Speech Collage: Code-Switched Audio Generation by Collaging Monolingual Corpora. ICASSP 2024: 12006-12010 - [c229]Nathaniel R. Robinson, Raj Dabre, Ammon Shurtz, Rasul Dent, Onenamiyi Onesi, Claire Bizon Monroc, Loïc Grobol, Hasan Muhammad, Ashi Garg, Naome A. Etori, Vijay Murari Tiyyala, Olanrewaju Samuel, Matthew Dean Stutzman, Bismarck Bamfo Odoom, Sanjeev Khudanpur, Stephen D. Richardson, Kenton Murray:
Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages. NAACL-HLT 2024: 3083-3110 - [c228]Desh Raj, Matthew Wiesner, Matthew Maciejewski, Paola García, Daniel Povey, Sanjeev Khudanpur:
On Speaker Attribution with SURT. Odyssey 2024: 91-98 - 2023
- [c227]Dongji Gao, Hainan Xu, Desh Raj, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
Learning From Flawed Data: Weakly Supervised Automatic Speech Recognition. ASRU 2023: 1-8 - [c226]Martin Sustek, Sonal Joshi, Henry Li, Thomas Thebaud, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Joint Energy-Based Model for Robust Speech Classification System Against Dirty-Label Backdoor Poisoning Attacks. ASRU 2023: 1-8 - [c225]Thomas Thebaud, Sonal Joshi, Henry Li, Martin Sustek, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Clustering Unsupervised Representations as Defense Against Poisoning Attacks on Speech Commands Classification System. ASRU 2023: 1-8 - [c224]Dongji Gao, Jiatong Shi, Shun-Po Chuang, Leibny Paola García, Hung-Yi Lee, Shinji Watanabe, Sanjeev Khudanpur:
Euro: Espnet Unsupervised ASR Open-Source Toolkit. ICASSP 2023: 1-5 - [c223]Zili Huang, Desh Raj, Paola García, Sanjeev Khudanpur:
Adapting Self-Supervised Models to Multi-Talker Speech Recognition Using Speaker Embeddings. ICASSP 2023: 1-5 - [c222]Ruizhe Huang, Matthew Wiesner, Leibny Paola García-Perera, Daniel Povey, Jan Trmal, Sanjeev Khudanpur:
Building Keyword Search System from End-To-End Asr Systems. ICASSP 2023: 1-5 - [c221]Hexin Liu, Haihua Xu, Leibny Paola García, Andy W. H. Khong, Yi He, Sanjeev Khudanpur:
Reducing Language Confusion for Code-Switching Speech Recognition with Token-Level Language Diarization. ICASSP 2023: 1-5 - [c220]Chun Chieh Chang, Leibny Paola García-Perera, Sanjeev Khudanpur:
Crosslingual Handwritten Text Generation Using GANs. ICDAR Workshops (2) 2023: 285-301 - [c219]Dongji Gao, Matthew Wiesner, Hainan Xu, Leibny Paola García, Daniel Povey, Sanjeev Khudanpur:
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts. INTERSPEECH 2023: 924-928 - [c218]Desh Raj, Daniel Povey, Sanjeev Khudanpur:
GPU-accelerated Guided Source Separation for Meeting Transcription. INTERSPEECH 2023: 3507-3511 - [c217]Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur:
HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation. INTERSPEECH 2023: 4074-4078 - [c216]Yi Han Victoria Chua, Hexin Liu, Leibny Paola García, Fei Ting Woon, Jinyi Wong, Xiangyu Zhang, Sanjeev Khudanpur, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles:
MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization. INTERSPEECH 2023: 4109-4113 - [c215]Suzy J. Styles, Yi Han Victoria Chua, Fei Ting Woon, Hexin Liu, Leibny Paola García, Sanjeev Khudanpur, Andy W. H. Khong, Justin Dauwels:
Investigating model performance in language identification: beyond simple error statistics. INTERSPEECH 2023: 4129-4133 - [c214]Amir Hussein, Cihan Xiao, Neha Verma, Thomas Thebaud, Matthew Wiesner, Sanjeev Khudanpur:
JHU IWSLT 2023 Dialect Speech Translation System Description. IWSLT@ACL 2023: 283-290 - [c213]Henry Li Xinyuan, Neha Verma, Bismarck Bamfo Odoom, Ujvala Pradeep, Matthew Wiesner, Sanjeev Khudanpur:
JHU IWSLT 2023 Multilingual Speech Translation System Description. IWSLT@ACL 2023: 302-310 - 2022
- [c212]Zili Huang, Shinji Watanabe, Shu-Wen Yang, Paola García, Sanjeev Khudanpur:
Investigating Self-Supervised Learning for Speech Enhancement and Separation. ICASSP 2022: 6837-6841 - [c211]Matthew Wiesner, Desh Raj, Sanjeev Khudanpur:
Injecting Text and Cross-Lingual Supervision in Few-Shot Learning from Self-Supervised Models. ICASSP 2022: 8597-8601 - [c210]Hexin Liu, Leibny Paola García-Perera, Andy W. H. Khong, Suzy J. Styles, Sanjeev Khudanpur:
PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification. INTERSPEECH 2022: 2233-2237 - [c209]Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser. INTERSPEECH 2022: 5035-5039 - [c208]Yiwen Shao, Jesús Villalba, Sonal Joshi, Saurabh Kataria, Sanjeev Khudanpur, Najim Dehak:
Chunking Defense for Adversarial Attacks on ASR. INTERSPEECH 2022: 5045-5049 - [c207]Jinyi Yang, Amir Hussein, Matthew Wiesner, Sanjeev Khudanpur:
JHU IWSLT 2022 Dialect Speech Translation System Description. IWSLT@ACL 2022: 319-326 - [c206]Hexin Liu, Leibny Paola García-Perera, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles, Sanjeev Khudanpur:
Enhancing Language Identification Using Dual-Mode Model with Knowledge Distillation. Odyssey 2022: 248-254 - [c205]Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali, Sanjeev Khudanpur:
Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition. SLT 2022: 777-784 - 2021
- [c204]Matthew Maciejewski, Jing Shi, Shinji Watanabe, Sanjeev Khudanpur:
Training Noisy Single-Channel Speech Separation with Noisy Oracle Sources: A Large Gap and a Small Step. ICASSP 2021: 5774-5778 - [c203]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Streaming Transformers. ICASSP 2021: 5864-5868 - [c202]Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
An Asynchronous WFST-Based Decoder for Automatic Speech Recognition. ICASSP 2021: 6019-6023 - [c201]Ke Li, Daniel Povey, Sanjeev Khudanpur:
A Parallelizable Lattice Rescoring Strategy with Neural Language Models. ICASSP 2021: 6518-6522 - [c200]Hexin Liu, Leibny Paola García-Perera, Xinyi Zhang, Justin Dauwels, Andy W. H. Khong, Sanjeev Khudanpur, Suzy J. Styles:
End-to-End Language Diarization for Bilingual Code-Switching Speech. Interspeech 2021: 1489-1493 - [c199]Desh Raj, Sanjeev Khudanpur:
Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem. Interspeech 2021: 2351-2355 - [c198]Matthew Wiesner, Mousmita Sarma, Ashish Arora, Desh Raj, Dongji Gao, Ruizhe Huang, Supreet Preet, Moris Johnson, Zikra Iqbal, Nagendra Goel, Jan Trmal, Leibny Paola García-Perera, Sanjeev Khudanpur:
Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition. Interspeech 2021: 2906-2910 - [c197]Matthew Maciejewski, Shinji Watanabe, Sanjeev Khudanpur:
Speaker Verification-Based Evaluation of Single-Channel Speech Separation. Interspeech 2021: 3520-3524 - [c196]Guoguo Chen, Shuzhou Chai, Guan-Bo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Zhao You, Zhiyong Yan:
GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10, 000 Hours of Transcribed Audio. Interspeech 2021: 3670-3674 - [c195]Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur:
Learning Curricula for Multilingual Neural Machine Translation Training. MTSummit (1) 2021: 1-9 - [c194]Desh Raj, Zili Huang, Sanjeev Khudanpur:
Multi-Class Spectral Clustering with Overlaps for Speaker Diarization. SLT 2021: 582-589 - [c193]Desh Raj, Leibny Paola García-Perera, Zili Huang, Shinji Watanabe, Daniel Povey, Andreas Stolcke, Sanjeev Khudanpur:
DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs. SLT 2021: 881-888 - [c192]Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur:
Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora. WMT@EMNLP 2021: 1100-1109 - 2020
- [c191]Yuan Cao, Sanjeev Khudanpur:
Sample Selection for Large-scale MT Discriminative Training. AMTA 2020 - [c190]Xiaohui Zhang, Daniel Povey, Sanjeev Khudanpur:
OOV Recovery with Efficient 2nd Pass Decoding and Open-vocabulary Word-level RNNLM Rescoring for Hybrid ASR. ICASSP 2020: 6334-6338 - [c189]Zili Huang, Shinji Watanabe, Yusuke Fujita, Paola García, Yiwen Shao, Daniel Povey, Sanjeev Khudanpur:
Speaker Diarization with Region Proposal Network. ICASSP 2020: 6514-6518 - [c188]Ke Li, Zhe Liu, Tianxing He, Hongzhao Huang, Fuchun Peng, Daniel Povey, Sanjeev Khudanpur:
An Empirical Study of Transformer-Based Neural Language Model Adaptation. ICASSP 2020: 7934-7938 - [c187]Yiwen Shao, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR. INTERSPEECH 2020: 561-565 - [c186]Pegah Ghahramani, Hossein Hadian, Daniel Povey, Hynek Hermansky, Sanjeev Khudanpur:
An Alternative to MFCCs for ASR. INTERSPEECH 2020: 1664-1667 - [c185]Ke Li, Daniel Povey, Sanjeev Khudanpur:
Neural Language Modeling with Implicit Cache Pointers. INTERSPEECH 2020: 3625-3629 - [c184]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Alignment-Free Lattice-Free MMI. INTERSPEECH 2020: 4258-4262 - [c183]Ruizhe Huang, Ke Li, Ashish Arora, Daniel Povey, Sanjeev Khudanpur:
Efficient MDI Adaptation for n-Gram Language Models. INTERSPEECH 2020: 4916-4920 - 2019
- [c182]Zhehuai Chen, Mahsa Yarmohammadi, Hainan Xu, Hang Lv, Lei Xie, Daniel Povey, Sanjeev Khudanpur:
Incremental Lattice Determinization for WFST Decoders. ASRU 2019: 1-7 - [c181]Yiming Wang, Sanjeev Khudanpur, Tongfei Chen, Hainan Xu, Shuoyang Ding, Hang Lv, Yiwen Shao, Nanyun Peng, Lei Xie, Shinji Watanabe:
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit. ASRU 2019: 136-143 - [c180]Desh Raj, David Snyder, Daniel Povey, Sanjeev Khudanpur:
Probing the Information Encoded in X-Vectors. ASRU 2019: 726-733 - [c179]Matthew Wiesner, Oliver Adams, David Yarowsky, Jan Trmal, Sanjeev Khudanpur:
Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer. ASRU 2019: 1048-1054 - [c178]Saurabhchand Bhati, Chunxi Liu, Jesús Villalba, Jan Trmal, Sanjeev Khudanpur, Najim Dehak:
Bottom-Up Unsupervised Word Discovery via Acoustic Units. GlobalSIP 2019: 1-5 - [c177]David Snyder, Daniel Garcia-Romero, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev Khudanpur:
Speaker Recognition for Multi-speaker Conversations Using X-vectors. ICASSP 2019: 5796-5800 - [c176]Vimal Manohar, Szu-Jui Chen, Zhiqi Wang, Yusuke Fujita, Shinji Watanabe, Sanjeev Khudanpur:
Acoustic Modeling for Overlapping Speech Recognition: Jhu Chime-5 Challenge System. ICASSP 2019: 6665-6669 - [c175]Chun-Chieh Chang, Ashish Arora, Leibny Paola García-Perera, David Etter, Daniel Povey, Sanjeev Khudanpur:
Optical Character Recognition with Chinese and Korean Character Decomposition. WML@ICDAR 2019: 134-139 - [c174]Ashish Arora, Paola García, Shinji Watanabe, Vimal Manohar, Yiwen Shao, Sanjeev Khudanpur, Chun-Chieh Chang, Babak Rekabdar, Bagher BabaAli, Daniel Povey, David Etter, Desh Raj, Hossein Hadian, Jan Trmal:
Using ASR Methods for OCR. ICDAR 2019: 663-668 - [c173]Fei Wu, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
Advances in Automatic Speech Recognition for Child Speech Using Factored Time Delay Neural Network. INTERSPEECH 2019: 1-5 - [c172]Jiamin Xie, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
Multi-PLDA Diarization on Children's Speech. INTERSPEECH 2019: 376-380 - [c171]Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Fred Richardson, Suwon Shon, François Grondin, Réda Dehak, Leibny Paola García-Perera, Daniel Povey, Pedro A. Torres-Carrasquillo, Sanjeev Khudanpur, Najim Dehak:
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18. INTERSPEECH 2019: 1488-1492 - [c170]Daniel Garcia-Romero, David Snyder, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev Khudanpur:
x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition. INTERSPEECH 2019: 1493-1496 - [c169]Daniel Garcia-Romero, David Snyder, Shinji Watanabe, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev Khudanpur:
Speaker Recognition Benchmark Using the CHiME-5 Corpus. INTERSPEECH 2019: 1506-1510 - [c168]David Snyder, Jesús Villalba, Nanxin Chen, Daniel Povey, Gregory Sell, Najim Dehak, Sanjeev Khudanpur:
The JHU Speaker Recognition System for the VOiCES 2019 Challenge. INTERSPEECH 2019: 2468-2472 - [c167]Yiming Wang, David Snyder, Hainan Xu, Vimal Manohar, Phani Sankar Nidadavolu, Daniel Povey, Sanjeev Khudanpur:
The JHU ASR System for VOiCES from a Distance Challenge 2019. INTERSPEECH 2019: 2488-2492 - [c166]Matthew Wiesner, Adithya Renduchintala, Shinji Watanabe, Chunxi Liu, Najim Dehak, Sanjeev Khudanpur:
Pretraining by Backtranslation for End-to-End ASR in Low-Resource Settings. INTERSPEECH 2019: 4375-4379 - [c165]Jonathan D. Jones, Gregory D. Hager, Sanjeev Khudanpur:
Toward Computer Vision Systems That Understand Real-World Assembly Processes. WACV 2019: 426-434 - [c164]Matthew Maciejewski, Gregory Sell, Yusuke Fujita, Leibny Paola García-Perera, Shinji Watanabe, Sanjeev Khudanpur:
Analysis of Robustness of Deep Single-Channel Speech Separation Using Corpora Constructed From Multiple Domains. WASPAA 2019: 165-169 - 2018
- [c163]Cathryn S. Cortesa, Jonathan D. Jones, Gregory D. Hager, Sanjeev Khudanpur, Barbara Landau, Amy Lynne Shelton:
Constraints and Development in Children's Block Construction. CogSci 2018 - [c162]Vimal Manohar, Hossein Hadian, Daniel Povey, Sanjeev Khudanpur:
Semi-Supervised Training of Acoustic Models Using Lattice-Free MMI. ICASSP 2018: 4844-4848 - [c161]Neville Ryant, Elika Bergelson, Kenneth Church, Alejandrina Cristià, Jun Du, Sriram Ganapathy, Sanjeev Khudanpur, Diana Kowalski, Mahesh Krishnamoorthy, Rajat Kulshreshta, Mark Liberman, Yu-Ding Lu, Matthew Maciejewski, Florian Metze, Ján Profant, Lei Sun, Yu Tsao, Zhou Yu:
Enhancement and Analysis of Conversational Speech: JSALT 2017. ICASSP 2018: 5154-5158 - [c160]Matthew Maciejewski, David Snyder, Vimal Manohar, Najim Dehak, Sanjeev Khudanpur:
Characterizing Performance of Speaker Diarization Systems on Far-Field Speech Using Standard Methods. ICASSP 2018: 5244-5248 - [c159]David Snyder, Daniel Garcia-Romero, Gregory Sell, Daniel Povey, Sanjeev Khudanpur:
X-Vectors: Robust DNN Embeddings for Speaker Recognition. ICASSP 2018: 5329-5333 - [c158]Daniel Povey, Hossein Hadian, Pegah Ghahremani, Ke Li, Sanjeev Khudanpur:
A Time-Restricted Self-Attention Layer for ASR. ICASSP 2018: 5874-5878 - [c157]Hainan Xu, Tongfei Chen, Dongji Gao, Yiming Wang, Ke Li, Nagendra Goel, Yishay Carmiel, Daniel Povey, Sanjeev Khudanpur:
A Pruned Rnnlm Lattice-Rescoring Algorithm for Automatic Speech Recognition. ICASSP 2018: 5929-5933 - [c156]Lucas Ondel, Pierre Godard, Laurent Besacier, Elin Larsen, Mark Hasegawa-Johnson, Odette Scharenborg, Emmanuel Dupoux, Lukás Burget, François Yvon, Sanjeev Khudanpur:
Bayesian Models for Unit Discovery on a Very Low Resource Language. ICASSP 2018: 5939-5943 - [c155]Hainan Xu, Ke Li, Yiming Wang, Jian Wang, Shiyin Kang, Xie Chen, Daniel Povey, Sanjeev Khudanpur:
Neural Network Language Modeling with Letter-Based Features and Importance Sampling. ICASSP 2018: 6109-6113 - [c154]Hossein Hadian, Hossein Sameti, Daniel Povey, Sanjeev Khudanpur:
End-to-end Speech Recognition Using Lattice-free MMI. INTERSPEECH 2018: 12-16 - [c153]Pegah Ghahremani, Phani Sankar Nidadavolu, Nanxin Chen, Jesús Villalba, Daniel Povey, Sanjeev Khudanpur, Najim Dehak:
End-to-end Deep Neural Network Age Estimation. INTERSPEECH 2018: 277-281 - [c152]Pegah Ghahremani, Hossein Hadian, Hang Lv, Daniel Povey, Sanjeev Khudanpur:
Acoustic Modeling from Frequency Domain Representations of Speech. INTERSPEECH 2018: 1596-1600 - [c151]Gaofeng Cheng, Daniel Povey, Lu Huang, Ji Xu, Sanjeev Khudanpur, Yonghong Yan:
Output-Gate Projected Gated Recurrent Unit for Speech Recognition. INTERSPEECH 2018: 1793-1797 - [c150]Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Najim Dehak, Sanjeev Khudanpur:
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages. INTERSPEECH 2018: 2052-2056 - [c149]Zhehuai Chen, Justin Luitjens, Hainan Xu, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
A GPU-based WFST Decoder with Exact Lattice Generation. INTERSPEECH 2018: 2212-2216 - [c148]Gregory Sell, David Snyder, Alan McCree, Daniel Garcia-Romero, Jesús Villalba, Matthew Maciejewski, Vimal Manohar, Najim Dehak, Daniel Povey, Shinji Watanabe, Sanjeev Khudanpur:
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge. INTERSPEECH 2018: 2808-2812 - [c147]Ke Li, Hainan Xu, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
Recurrent Neural Network Language Model Adaptation for Conversational Speech Recognition. INTERSPEECH 2018: 3373-3377 - [c146]Daniel Povey, Gaofeng Cheng, Yiming Wang, Ke Li, Hainan Xu, Mahsa Yarmohammadi, Sanjeev Khudanpur:
Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks. INTERSPEECH 2018: 3743-3747 - [c145]David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Daniel Povey, Sanjeev Khudanpur:
Spoken Language Recognition using X-vectors. Odyssey 2018: 105-111 - [c144]Hossein Hadian, Daniel Povey, Hossein Sameti, Jan Trmal, Sanjeev Khudanpur:
Improving LF-MMI Using Unconstrained Supervisions for ASR. SLT 2018: 43-47 - [c143]Vimal Manohar, Pegah Ghahremani, Daniel Povey, Sanjeev Khudanpur:
A Teacher-Student Learning Approach for Unsupervised Domain Adaptation of Sequence-Trained ASR Models. SLT 2018: 250-257 - [c142]Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. SLT 2018: 656-663 - 2017
- [c141]Pegah Ghahremani, Vimal Manohar, Hossein Hadian, Daniel Povey, Sanjeev Khudanpur:
Investigation of transfer learning for ASR using LF-MMI trained neural networks. ASRU 2017: 279-286 - [c140]Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
JHU Kaldi system for Arabic MGB-3 ASR challenge using diarization, audio-transcript alignment and transfer learning. ASRU 2017: 346-352 - [c139]Cathryn S. Cortesa, Jonathan D. Jones, Gregory D. Hager, Sanjeev Khudanpur, Amy Lynne Shelton, Barbara Landau:
Characterizing spatial construction processes: Toward computational tools to understand cognition. CogSci 2017 - [c138]Tom Ko, Vijayaditya Peddinti, Daniel Povey, Michael L. Seltzer, Sanjeev Khudanpur:
A study on data augmentation of reverberant speech for robust speech recognition. ICASSP 2017: 5220-5224 - [c137]Chunxi Liu, Jinyi Yang, Ming Sun, Santosh Kesiraju, Alena Rott, Lucas Ondel, Pegah Ghahremani, Najim Dehak, Lukás Burget, Sanjeev Khudanpur:
An empirical evaluation of zero resource acoustic unit discovery. ICASSP 2017: 5305-5309 - [c136]Santosh Kesiraju, Raghavendra Pappagari, Lucas Ondel, Lukás Burget, Najim Dehak, Sanjeev Khudanpur, Jan Cernocký, Suryakanth V. Gangashetty:
Topic identification of spoken documents using unsupervised acoustic unit discovery. ICASSP 2017: 5745-5749 - [c135]Hossein Hadian, Daniel Povey, Hossein Sameti, Sanjeev Khudanpur:
Phone Duration Modeling for LVCSR Using Neural Networks. INTERSPEECH 2017: 518-522 - [c134]David Snyder, Daniel Garcia-Romero, Daniel Povey, Sanjeev Khudanpur:
Deep Neural Network Embeddings for Text-Independent Speaker Verification. INTERSPEECH 2017: 999-1003 - [c133]Gaofeng Cheng, Vijayaditya Peddinti, Daniel Povey, Vimal Manohar, Sanjeev Khudanpur, Yonghong Yan:
An Exploration of Dropout with LSTMs. INTERSPEECH 2017: 1586-1590 - [c132]Yiming Wang, Vijayaditya Peddinti, Hainan Xu, Xiaohui Zhang, Daniel Povey, Sanjeev Khudanpur:
Backstitch: Counteracting Finite-Sample Bias via Negative Steps. INTERSPEECH 2017: 1631-1635 - [c131]Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur:
Topic Identification for Speech Without ASR. INTERSPEECH 2017: 2501-2505 - [c130]Xiaohui Zhang, Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
Acoustic Data-Driven Lexicon Learning Based on a Greedy Pronunciation Selection Framework. INTERSPEECH 2017: 2541-2545 - [c129]Jan Trmal, Matthew Wiesner, Vijayaditya Peddinti, Xiaohui Zhang, Pegah Ghahremani, Yiming Wang, Vimal Manohar, Hainan Xu, Daniel Povey, Sanjeev Khudanpur:
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. INTERSPEECH 2017: 3597-3601 - 2016
- [c128]Guoguo Chen, Daniel Povey, Sanjeev Khudanpur:
Acoustic data-driven pronunciation lexicon generation for logographic languages. ICASSP 2016: 5350-5354 - [c127]Yu Zhang, Guoguo Chen, Dong Yu, Kaisheng Yao, Sanjeev Khudanpur, James R. Glass:
Highway long short-term memory RNNS for distant speech recognition. ICASSP 2016: 5755-5759 - [c126]Chunxi Liu, Preethi Jyothi, Hao Tang, Vimal Manohar, Rose Sloan, Tyler Kekona, Mark Hasegawa-Johnson, Sanjeev Khudanpur:
Adapting ASR for under-resourced languages using mismatched transcriptions. ICASSP 2016: 5840-5844 - [c125]Chunxi Liu, Aren Jansen, Sanjeev Khudanpur:
Context-dependent point process models for keyword search and detection-based ASR. ICASSP 2016: 6025-6029 - [c124]Yixin Gao, S. Swaroop Vedula, Gyusung I. Lee, Mija R. Lee, Sanjeev Khudanpur, Gregory D. Hager:
Unsupervised surgical data alignment with application to automatic activity annotation. ICRA 2016: 4158-4163 - [c123]Vijayaditya Peddinti, Vimal Manohar, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
Far-Field ASR Without Parallel Data. INTERSPEECH 2016: 1996-2000 - [c122]Daniel Povey, Vijayaditya Peddinti, Daniel Galvez, Pegah Ghahremani, Vimal Manohar, Xingyu Na, Yiming Wang, Sanjeev Khudanpur:
Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI. INTERSPEECH 2016: 2751-2755 - [c121]Pegah Ghahremani, Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
Acoustic Modelling from the Signal Domain Using CNNs. INTERSPEECH 2016: 3434-3438 - [c120]Eleanor Chodroff, Matthew Maciejewski, Jan Trmal, Sanjeev Khudanpur, John Godfrey:
New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification. LREC 2016 - [c119]David Snyder, Pegah Ghahremani, Daniel Povey, Daniel Garcia-Romero, Yishay Carmiel, Sanjeev Khudanpur:
Deep neural network-based speaker embeddings for end-to-end speaker verification. SLT 2016: 165-170 - 2015
- [c118]Vijayaditya Peddinti, Guoguo Chen, Vimal Manohar, Tom Ko, Daniel Povey, Sanjeev Khudanpur:
JHU ASpIRE system: Robust LVCSR with TDNNS, iVector adaptation and RNN-LMS. ASRU 2015: 539-546 - [c117]Gaurav Kumar, Graeme W. Blackwood, Jan Trmal, Daniel Povey, Sanjeev Khudanpur:
A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation. EMNLP 2015: 1902-1907 - [c116]Hynek Hermansky, Lukás Burget, Jordan Cohen, Emmanuel Dupoux, Naomi Feldman, John Godfrey, Sanjeev Khudanpur, Matthew Maciejewski, Sri Harish Reddy Mallidi, Anjali Menon, Tetsuji Ogawa, Vijayaditya Peddinti, Richard C. Rose, Richard M. Stern, Matthew Wiesner, Karel Veselý:
Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop. ICASSP 2015: 5009-5013 - [c115]Vassil Panayotov, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur:
Librispeech: An ASR corpus based on public domain audio books. ICASSP 2015: 5206-5210 - [c114]Eleanor Chodroff, John Godfrey, Sanjeev Khudanpur, Colin Wilson:
Structured variability in acoustic realization: a corpus study of voice onset time in American English stops. ICPhS 2015 - [c113]Guoguo Chen, Hainan Xu, Minhua Wu, Daniel Povey, Sanjeev Khudanpur:
Pronunciation and silence probability modeling for ASR. INTERSPEECH 2015: 533-537 - [c112]Hainan Xu, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur:
Modeling phonetic context with non-random forests for speech recognition. INTERSPEECH 2015: 2117-2121 - [c111]Vijayaditya Peddinti, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur:
Reverberation robust acoustic modeling using i-vectors with time delay neural networks. INTERSPEECH 2015: 2440-2444 - [c110]Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
Semi-supervised maximum mutual information training of deep neural network acoustic models. INTERSPEECH 2015: 2630-2634 - [c109]Vijayaditya Peddinti, Daniel Povey, Sanjeev Khudanpur:
A time delay neural network architecture for efficient modeling of long temporal contexts. INTERSPEECH 2015: 3214-3218 - [c108]Tom Ko, Vijayaditya Peddinti, Daniel Povey, Sanjeev Khudanpur:
Audio augmentation for speech recognition. INTERSPEECH 2015: 3586-3589 - [c107]Xiaohui Zhang, Daniel Povey, Sanjeev Khudanpur:
A diversity-penalizing ensemble training method for deep learning. INTERSPEECH 2015: 3590-3594 - [c106]Daniel Povey, Xiaohui Zhang, Sanjeev Khudanpur:
Parallel training of Deep Neural Networks with Natural Gradient and Parameter Averaging. ICLR (Workshop) 2015 - 2014
- [c105]Yuan Cao, Sanjeev Khudanpur:
Online Learning in Tensor Space. ACL (1) 2014: 666-675 - [c104]Jonathan Wintrode, Sanjeev Khudanpur:
Can You Repeat That? Using Word Repetition to Improve Spoken Term Detection. ACL (1) 2014: 1316-1325 - [c103]Xiaohui Zhang, Jan Trmal, Daniel Povey, Sanjeev Khudanpur:
Improving deep neural network acoustic models using generalized maxout networks. ICASSP 2014: 215-219 - [c102]Pegah Ghahremani, Bagher BabaAli, Daniel Povey, Korbinian Riedhammer, Jan Trmal, Sanjeev Khudanpur:
A pitch extraction algorithm tuned for automatic speech recognition. ICASSP 2014: 2494-2498 - [c101]Gaurav Kumar, Matt Post, Daniel Povey, Sanjeev Khudanpur:
Some insights from translating conversational telephone speech. ICASSP 2014: 3231-3235 - [c100]Jonathan Wintrode, Sanjeev Khudanpur:
Limited resource term detection for effective topic identification of speech. ICASSP 2014: 7118-7122 - [c99]Chunxi Liu, Aren Jansen, Guoguo Chen, Keith Kintzley, Jan Trmal, Sanjeev Khudanpur:
Low-resource open vocabulary keyword search using point process models. INTERSPEECH 2014: 2789-2793 - [c98]Gaurav Kumar, Yuan Cao, Ryan Cotterell, Chris Callison-Burch, Daniel Povey, Sanjeev Khudanpur:
Translations of the Callhome Egyptian Arabic corpus for conversational speech translation. IWSLT 2014 - [c97]Jonathan Wintrode, Sanjeev Khudanpur:
Combining local and broad topic context to improve term detection. SLT 2014: 442-447 - [c96]Jan Trmal, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur, Pegah Ghahremani, Xiaohui Zhang, Vimal Manohar, Chunxi Liu, Aren Jansen, Dietrich Klakow, David Yarowsky, Florian Metze:
A keyword search system using open source software. SLT 2014: 530-535 - 2013
- [c95]Guoguo Chen, Oguz Yilmaz, Jan Trmal, Daniel Povey, Sanjeev Khudanpur:
Using proxies for OOV keywords in the keyword search task. ASRU 2013: 416-421 - [c94]Aren Jansen, Emmanuel Dupoux, Sharon Goldwater, Mark Johnson, Sanjeev Khudanpur, Kenneth Church, Naomi Feldman, Hynek Hermansky, Florian Metze, Richard C. Rose, Mike Seltzer, Pascal Clark, Ian McGraw, Balakrishnan Varadarajan, Erin Bennett, Benjamin Börschinger, Justin T. Chiu, Ewan Dunbar, Abdellah Fourtassi, David Harwath, Chia-ying Lee, Keith D. Levin, Atta Norouzian, Vijayaditya Peddinti, Rachael Richardson, Thomas Schatz, Samuel Thomas:
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition. ICASSP 2013: 8111-8115 - [c93]Guoguo Chen, Sanjeev Khudanpur, Daniel Povey, Jan Trmal, David Yarowsky, Oguz Yilmaz:
Quantifying the value of pronunciation lexicons for keyword search in lowresource languages. ICASSP 2013: 8560-8564 - [c92]Matt Post, Gaurav Kumar, Adam Lopez, Damianos G. Karakos, Chris Callison-Burch, Sanjeev Khudanpur:
Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus. IWSLT 2013 - [c91]Narges Ahmidi, Yixin Gao, Benjamín Béjar Haro, S. Swaroop Vedula, Sanjeev Khudanpur, René Vidal, Gregory D. Hager:
String Motif-Based Description of Tool Motion for Detecting Skill and Gestures in Robotic Surgery. MICCAI (1) 2013: 26-33 - 2012
- [c90]Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur:
Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining. ACL (1) 2012: 175-183 - [c89]Puyang Xu, Sanjeev Khudanpur, Maider Lehr, Emily Tucker Prud'hommeaux, Nathan Glenn, Damianos G. Karakos, Brian Roark, Kenji Sagae, Murat Saraclar, Izhak Shafran, Daniel M. Bikel, Chris Callison-Burch, Yuan Cao, Keith B. Hall, Eva Hasler, Philipp Koehn, Adam Lopez, Matt Post, Darcey Riley:
Continuous space discriminative language modeling. ICASSP 2012: 2129-2132 - [c88]Kenji Sagae, Maider Lehr, Emily Tucker Prud'hommeaux, Puyang Xu, Nathan Glenn, Damianos G. Karakos, Sanjeev Khudanpur, Brian Roark, Murat Saraclar, Izhak Shafran, Daniel M. Bikel, Chris Callison-Burch, Yuan Cao, Keith B. Hall, Eva Hasler, Philipp Koehn, Adam Lopez, Matt Post, Darcey Riley:
Hallucinated n-best lists for discriminative language modeling. ICASSP 2012: 5001-5004 - [c87]Arda Çelebi, Hasim Sak, Erinç Dikici, Murat Saraclar, Maider Lehr, Emily Tucker Prud'hommeaux, Puyang Xu, Nathan Glenn, Damianos G. Karakos, Sanjeev Khudanpur, Brian Roark, Kenji Sagae, Izhak Shafran, Daniel M. Bikel, Chris Callison-Burch, Yuan Cao, Keith B. Hall, Eva Hasler, Philipp Koehn, Adam Lopez, Matt Post, Darcey Riley:
Semi-supervised discriminative language modeling for Turkish ASR. ICASSP 2012: 5025-5028 - [c86]Puyang Xu, Brian Roark, Sanjeev Khudanpur:
Phrasal Cohort Based Unsupervised Discriminative Language Modeling. INTERSPEECH 2012: 198-201 - [c85]Damianos G. Karakos, Brian Roark, Izhak Shafran, Kenji Sagae, Maider Lehr, Emily Tucker Prud'hommeaux, Puyang Xu, Nathan Glenn, Sanjeev Khudanpur, Murat Saraclar, Daniel M. Bikel, Mark Dredze, Chris Callison-Burch, Yuan Cao, Keith B. Hall, Eva Hasler, Philipp Koehn, Adam Lopez, Matt Post, Darcey Riley:
Deriving conversation-based features from unlabeled speech for discriminative language modeling. INTERSPEECH 2012: 202-205 - [c84]Scott Novotney, Ivan Bulyko, Richard M. Schwartz, Sanjeev Khudanpur, Owen Kimball:
Semi-Supervised Methods for Improving Keyword Search of Unseen Terms. INTERSPEECH 2012: 1215-1218 - [c83]Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur:
Efficient Structured Language Modeling for Speech Recognition. INTERSPEECH 2012: 1660-1663 - [c82]Lingling Tao, Ehsan Elhamifar, Sanjeev Khudanpur, Gregory D. Hager, René Vidal:
Sparse Hidden Markov Models for Surgical Gesture Classification and Skill Evaluation. IPCAI 2012: 167-177 - [c81]Brian Roark, Arda Çelebi, Erinç Dikici, Sanjeev Khudanpur, Maider Lehr, Emily Prud'hommeaux, Kenji Sagae, Murat Saraclar, Izhak Shafran, Puyang Xu:
Hallucinating system outputs for discriminative language modeling. MLSLP 2012 - [c80]Ariya Rastrow, Sanjeev Khudanpur, Mark Dredze:
Revisiting the Case for Explicit Syntactic Information in Language Models. WLM@NAACL-HLT 2012: 50-58 - 2011
- [c79]Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur:
Efficient discriminative training of long-span language models. ASRU 2011: 214-219 - [c78]Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur:
Adapting n-gram maximum entropy language models with conditional entropy regularization. ASRU 2011: 220-225 - [c77]Puyang Xu, Sanjeev Khudanpur, Asela Gunawardana:
Randomized maximum entropy language models. ASRU 2011: 226-230 - [c76]Damianos G. Karakos, Mark Dredze, Ken Ward Church, Aren Jansen, Sanjeev Khudanpur:
Estimating document frequencies in a speech corpus. ASRU 2011: 407-412 - [c75]Zhifei Li, Ziyuan Wang, Jason Eisner, Sanjeev Khudanpur, Brian Roark:
Minimum Imputed-Risk: Unsupervised Discriminative Training for Machine Translation. EMNLP 2011: 920-929 - [c74]Puyang Xu, Asela Gunawardana, Sanjeev Khudanpur:
Efficient Subsampling for Training Complex Language Models. EMNLP 2011: 1128-1136 - [c73]Balakrishnan Varadarajan, Sanjeev Khudanpur:
Learning and inference algorithms for partially observed structured switching vector autoregressive models. ICASSP 2011: 1281-1284 - [c72]Balakrishnan Varadarajan, Garimella S. V. S. Sivaram, Sanjeev Khudanpur:
Dirichlet Mixture Models of neural net posteriors for HMM-based speech recognition. ICASSP 2011: 5028-5031 - [c71]Ariya Rastrow, Markus Dreyer, Abhinav Sethy, Sanjeev Khudanpur, Bhuvana Ramabhadran, Mark Dredze:
Hill climbing on speech lattices: A new rescoring framework. ICASSP 2011: 5032-5035 - [c70]Tomás Mikolov, Stefan Kombrink, Lukás Burget, Jan Cernocký, Sanjeev Khudanpur:
Extensions of recurrent neural network language model. ICASSP 2011: 5528-5531 - [c69]Anoop Deoras, Tomás Mikolov, Stefan Kombrink, Martin Karafiát, Sanjeev Khudanpur:
Variational approximation of long-span language models for lvcsr. ICASSP 2011: 5532-5535 - [c68]Scott Novotney, Richard M. Schwartz, Sanjeev Khudanpur:
Unsupervised Arabic Dialect Adaptation with Self-Training. INTERSPEECH 2011: 541-544 - 2010
- [c67]Zhifei Li, Ziyuan Wang, Sanjeev Khudanpur, Jason Eisner:
Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets. COLING (Posters) 2010: 656-664 - [c66]Damianos G. Karakos, Jason Smith, Sanjeev Khudanpur:
Hypothesis ranking and two-pass approaches for machine translation system combination. ICASSP 2010: 5202-5205 - [c65]Tomás Mikolov, Martin Karafiát, Lukás Burget, Jan Cernocký, Sanjeev Khudanpur:
Recurrent neural network based language model. INTERSPEECH 2010: 1045-1048 - [c64]Saeedeh Momtazi, Sanjeev Khudanpur, Dietrich Klakow:
A Comparative Study of Word Co-occurrence for Term Clustering in Language Model-based Sentence Retrieval. HLT-NAACL 2010: 325-328 - [c63]Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Ann Irvine, Sanjeev Khudanpur, Lane Schwartz, Wren N. G. Thornton, Ziyuan Wang, Jonathan Weese, Omar Zaidan:
Joshua 2.0: A Toolkit for Parsing-Based Machine Translation with Syntax, Semirings, Discriminative Training and Other Goodies. WMT@ACL 2010: 133-137 - 2009
- [c62]Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N. G. Thornton, Jonathan Weese, Omar Zaidan:
Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation. ACL/IJCNLP (Software Demonstrations) 2009: 25-28 - [c61]Zhifei Li, Jason Eisner, Sanjeev Khudanpur:
Variational Decoding for Statistical Machine Translation. ACL/IJCNLP 2009: 593-601 - [c60]Puyang Xu, Damianos G. Karakos, Sanjeev Khudanpur:
Self-supervised discriminative training of statistical language models. ASRU 2009: 317-322 - [c59]Arnab Ghoshal, Sanjeev Khudanpur, Dietrich Klakow:
Impact of novel sources on content-based image and video retrieval. ICASSP 2009: 1937-1940 - [c58]Arnab Ghoshal, Martin Jansche, Sanjeev Khudanpur, Michael Riley, Morgan Ulinski:
WEB-derived pronunciations. ICASSP 2009: 4289-4292 - [c57]Christopher M. White, Ariya Rastrow, Sanjeev Khudanpur, Frederick Jelinek:
Unsupervised estimation of the language model scaling factor. INTERSPEECH 2009: 1195-1198 - [c56]Balakrishnan Varadarajan, Carol E. Reiley, Henry Lin, Sanjeev Khudanpur, Gregory D. Hager:
Data-Derived Models for Segmentation with Application to Surgical Assessment and Training. MICCAI (1) 2009: 426-434 - [c55]Zhifei Li, Sanjeev Khudanpur:
Efficient Extraction of Oracle-best Translations from Hypergraphs. HLT-NAACL (Short Papers) 2009: 9-12 - [c54]Dogan Can, Erica Cooper, Arnab Ghoshal, Martin Jansche, Sanjeev Khudanpur, Bhuvana Ramabhadran, Michael Riley, Murat Saraclar, Abhinav Sethy, Morgan Ulinski, Christopher M. White:
Web derived pronunciations for spoken term detection. SIGIR 2009: 83-90 - [c53]Zhifei Li, Chris Callison-Burch, Chris Dyer, Sanjeev Khudanpur, Lane Schwartz, Wren N. G. Thornton, Jonathan Weese, Omar Zaidan:
Joshua: An Open Source Toolkit for Parsing-Based Machine Translation. WMT@EACL 2009: 135-139 - 2008
- [c52]Damianos G. Karakos, Jason Eisner, Sanjeev Khudanpur, Markus Dreyer:
Machine Translation System Combination using ITG-based Alignments. ACL (2) 2008: 81-84 - [c51]Balakrishnan Varadarajan, Sanjeev Khudanpur, Emmanuel Dupoux:
Unsupervised Learning of Acoustic Sub-word Units. ACL (2) 2008: 165-168 - [c50]Zhifei Li, Sanjeev Khudanpur:
Large-scale Discriminative n-gram Language Models for Statistical Machine Translation. AMTA 2008: 133-142 - [c49]Lukás Burget, Petr Schwarz, Pavel Matejka, Mirko Hannemann, Ariya Rastrow, Christopher M. White, Sanjeev Khudanpur, Hynek Hermansky, Jan Cernocký:
Combination of strongly and weakly constrained recognizers for reliable detection of OOVS. ICASSP 2008: 4081-4084 - [c48]David Farris, Christopher M. White, Sanjeev Khudanpur:
Sample selection for automatic language identification. ICASSP 2008: 4225-4228 - [c47]Balakrishnan Varadarajan, Sanjeev Khudanpur:
Automatically learning speaker-independent acoustic subword units. INTERSPEECH 2008: 1333-1336 - [c46]Christopher M. White, Sanjeev Khudanpur, James K. Baker:
An investigation of acoustic models for multilingual code-switching. INTERSPEECH 2008: 2691-2694 - [c45]Damianos G. Karakos, Sanjeev Khudanpur, Carey E. Priebe:
Computation of Csiszár's mutual Information of order α. ISIT 2008: 2106-2110 - [c44]Carol E. Reiley, Henry C. Lin, Balakrishnan Varadarajan, Balázs Vágvölgyi, Sanjeev Khudanpur, David D. Yuh, Gregory D. Hager:
Automatic Recognition of Surgical Motions Using Statistical Modeling for Capturing Variability. MMVR 2008: 396-401 - [c43]Damianos G. Karakos, Sanjeev Khudanpur:
Sequential system combination for machine translation of speech. SLT 2008: 257-260 - [c42]Zhifei Li, Sanjeev Khudanpur:
A Scalable Decoder for Parsing-Based Machine Translation with Equivalent Language Model State Maintenance. SSST@ACL 2008: 10-18 - 2007
- [c41]Damianos G. Karakos, Sanjeev Khudanpur, Jason Eisner, Carey E. Priebe:
Iterative Denoising using Jensen-Renyi Divergences with an Application to Unsupervised Document Categorization. ICASSP (2) 2007: 509-512 - [c40]Yi Su, Frederick Jelinek, Sanjeev Khudanpur:
Large-scale random forest language models for speech recognition. INTERSPEECH 2007: 598-601 - [c39]Damianos G. Karakos, Sanjeev Khudanpur:
Error Bounds and Improved Probability Estimation using the Maximum Likelihood Set. ISIT 2007: 1851-1855 - [c38]Damianos G. Karakos, Jason Eisner, Sanjeev Khudanpur, Carey E. Priebe:
Cross-Instance Tuning of Unsupervised Document Clustering Algorithms. HLT-NAACL 2007: 252-259 - [c37]Markus Dreyer, Keith B. Hall, Sanjeev Khudanpur:
Comparing Reordering Constraints for SMT Using Efficient BLEU Oracle Computation. SSST@HLT-NAACL 2007: 103-110 - 2006
- [c36]Jimmy Lin, Damianos G. Karakos, Dina Demner-Fushman, Sanjeev Khudanpur:
Generative Content Models for Structural Analysis of Medical Abstracts. BioNLP@NAACL-HLT 2006: 65-72 - [c35]Arnab Ghoshal, Sanjeev Khudanpur:
Source Adaptation for Improved Content-Based Video Retrieval. ICASSP (2) 2006: 133-136 - [c34]Damianos G. Karakos, Sanjeev Khudanpur:
Language Modeling with the Maximum Likelihood Set: Complexity Issues and the Back-off Formula. ISIT 2006: 2814-2818 - [c33]Arnab Ghoshal, Sanjeev Khudanpur, João Magalhães, Simon E. Overell, Stefan M. Rüger, Alexei Yavlinsky:
Imperial College and Johns Hopkins University at TRECVID. TRECVID 2006 - 2005
- [c32]Damianos G. Karakos, Sanjeev Khudanpur, Jason Eisner, Carey E. Priebe:
Unsupervised classification via decision trees: an information-theoretic perspective. ICASSP (5) 2005: 1081-1084 - [c31]Giridharan Iyengar, Pinar Duygulu, Shaolei Feng, Pavel Ircing, Sanjeev Khudanpur, Dietrich Klakow, M. R. Krause, Raghavan Manmatha, Harriet J. Nock, D. Petkova, Brock Pytlik, Paola Virga:
Joint visual-text modeling for automatic retrieval of multimedia documents. ACM Multimedia 2005: 21-30 - [c30]Arnab Ghoshal, Pavel Ircing, Sanjeev Khudanpur:
Hidden Markov models for automatic annotation and content-based retrieval of images and video. SIGIR 2005: 544-551 - [c29]Brock Pytlik, Arnab Ghoshal, Damianos G. Karakos, Sanjeev Khudanpur:
TRECVID 2005 Experiment at Johns Hopkins University: Using Hidden Markov Models for Video Retrieval. TRECVID 2005 - 2004
- [c28]Woosung Kim, Sanjeev Khudanpur:
Cross-lingual latent semantic analysis for language modeling. ICASSP (1) 2004: 257-260 - [c27]Franz Josef Och, Daniel Gildea, Sanjeev Khudanpur, Anoop Sarkar, Kenji Yamada, Alexander M. Fraser, Shankar Kumar, Libin Shen, David Smith, Katherine Eng, Viren Jain, Zhen Jin, Dragomir R. Radev:
A Smorgasbord of Features for Statistical Machine Translation. HLT-NAACL 2004: 161-168 - [c26]Daqing He, Dina Demner-Fushman, Douglas W. Oard, Damianos G. Karakos, Sanjeev Khudanpur:
Improving Passage Retrieval Using Interactive Elicition and Statistical Modeling. TREC 2004 - 2003
- [c25]Paola Virga, Sanjeev Khudanpur:
Transliteration of Proper Names in Cross-Lingual Information Retrieval. NER@ACL 2003: 57-64 - [c24]Woosung Kim, Sanjeev Khudanpur:
Cross-Lingual Lexical Triggers in Statistical Language Modeling. EMNLP 2003 - [c23]Woosung Kim, Sanjeev Khudanpur:
Language model adaptation using cross-lingual information. INTERSPEECH 2003: 3129-3132 - [c22]Yonggang Deng, Sanjeev Khudanpur:
Latent Semantic Information in Maximum Entropy Language Models for Conversational Speech Recognition. HLT-NAACL 2003 - [c21]Douglas W. Oard, David S. Doermann, Bonnie J. Dorr, Daqing He, Philip Resnik, Amy Weinberg, William J. Byrne, Sanjeev Khudanpur, David Yarowsky, Anton Leuski, Philipp Koehn, Kevin Knight:
Desparately Seeking Cebuano. HLT-NAACL 2003 - [c20]Paola Virga, Sanjeev Khudanpur:
Transliteration of proper names in cross-language applications. SIGIR 2003: 365-366 - 2002
- [c19]Jun Wu, Sanjeev Khudanpur:
Building a topic-dependent maximum entropy model for very large corpora. ICASSP 2002: 777-780 - [c18]Sanjeev Khudanpur, Woosung Kim:
Using cross-language cues for story-specific language modeling. INTERSPEECH 2002: 513-516 - 2001
- [c17]Pavel Ircing, Pavel Krbec, Jan Hajic, Josef Psutka, Sanjeev Khudanpur, Frederick Jelinek, William Byrne:
On large vocabulary continuous speech recognition of highly inflectional language - czech. INTERSPEECH 2001: 487-490 - [c16]Woosung Kim, Sanjeev Khudanpur, Jun Wu:
Smoothing issues in the structured language model. INTERSPEECH 2001: 717-720 - [c15]Frederick Jelinek, William J. Byrne, Sanjeev Khudanpur, Barbora Hladká, Hermann Ney, Franz Josef Och, J. Curín, Josef Psutka:
Robust Knowledge Discovery from Parallel Speech and Text Sources. HLT 2001 - [c14]Helen M. Meng, Berlin Chen, Sanjeev Khudanpur, Gina-Anne Levow, Wai-Kit Lo, Douglas W. Oard, Patrick Schone, Karen Tang, Hsin-Min Wang, Jianqiang Wang:
Mandarin-English Information: Investigating Translingual Speech Retrieval. HLT 2001 - 2000
- [c13]William Byrne, Peter Beyerlein, Juan M. Huerta, Sanjeev Khudanpur, B. Marthi, John Morgan, Nino Peterek, Joe Picone, Dimitra Vergyri, W. Wang:
Towards language independent acoustic modeling. ICASSP 2000: 1029-1032 - [c12]Murat Saraçlar, Sanjeev Khudanpur:
Pronunciation ambiguity vs. pronunciation variability in speech recognition. ICASSP 2000: 1679-1682 - [c11]Jun Wu, Sanjeev Khudanpur:
Syntactic heads in statistical language modeling. ICASSP 2000: 1699-1702 - [c10]Jun Wu, Sanjeev Khudanpur:
Efficient training methods for maximum entropy language modeling. INTERSPEECH 2000: 114-118 - 1999
- [c9]Sanjeev Khudanpur, Jun Wu:
A maximum entropy language model integrating N-grams and topic dependencies for conversational speech recognition. ICASSP 1999: 553-556 - [c8]Vassilios Digalakis, Heather Collier, Sid Berkowitz, Adrian Corduneanu, Enrico Bocchieri, Ashvin Kannan, Constantinos Boulis, Sanjeev Khudanpur, William Byrne, Ananth Sankar:
Rapid speech recognizer adaptation to new speakers. ICASSP 1999: 765-768 - [c7]Ashvin Kannan, Sanjeev Khudanpur:
Tree-structured models of parameter dependence for rapid adaptation in large vocabulary conversational speech recognition. ICASSP 1999: 769-772 - [c6]Murat Saraclar, Harriet J. Nock, Sanjeev Khudanpur:
Pronunciation modeling by sharing gaussian densities across phonetic models. EUROSPEECH 1999: 515-518 - [c5]Jun Wu, Sanjeev Khudanpur:
Combining nonlocal, syntactic and n-gram dependencies in language modeling. EUROSPEECH 1999: 2179-2182 - [c4]William J. Byrne, Jan Hajic, Pavel Ircing, Frederick Jelinek, Sanjeev Khudanpur, Jerome McDonough, Nino Peterek, Josef Psutka:
Large Vocabulary Speech Recognition for Read and Broadcast Czech. TSD 1999: 235-240 - 1998
- [c3]William Byrne, Michael Finke, Sanjeev Khudanpur, John W. McDonough, Harriet J. Nock, Michael Riley, Murat Saraçlar, Charles Wooters, George Zavaliagkos:
Pronunciation modelling using a hand-labelled corpus for conversational speech recognition. ICASSP 1998: 313-316 - [c2]Vaibhava Goel, William Byrne, Sanjeev Khudanpur:
LVCSR rescoring with modified loss functions: a decision theoretic perspective. ICASSP 1998: 425-428 - 1997
- [c1]Ciprian Chelba, David Engle, Frederick Jelinek, Victor Jimenez, Sanjeev Khudanpur, Lidia Mangu, Harry Printz, Eric Ristad, Ronald Rosenfeld, Andreas Stolcke, Dekai Wu:
Structure and performance of a dependency language model. EUROSPEECH 1997: 2775-2778
Editorship
- 2012
- [e1]Bhuvana Ramabhadran, Sanjeev Khudanpur, Ebru Arisoy:
Proceedings of the Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, WLM@NAACL-HLT 2012, Montrèal, Canada, June 8, 2012. Association for Computational Linguistics 2012, ISBN 978-1-937284-20-6 [contents]
Informal and Other Publications
- 2024
- [i59]Desh Raj, Matthew Wiesner, Matthew Maciejewski, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
On Speaker Attribution with SURT. CoRR abs/2401.15676 (2024) - [i58]Nathaniel R. Robinson, Raj Dabre, Ammon Shurtz, Rasul Dent, Onenamiyi Onesi, Claire Bizon Monroc, Loïc Grobol, Hasan Muhammad, Ashi Garg, Naome A. Etori, Vijay Murari Tiyyala, Olanrewaju Samuel, Matthew Dean Stutzman, Bismarck Bamfo Odoom, Sanjeev Khudanpur, Stephen D. Richardson, Kenton Murray:
Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages. CoRR abs/2405.05376 (2024) - [i57]Ruizhe Huang, Xiaohui Zhang, Zhaoheng Ni, Li Sun, Moto Hira, Jeff Hwang, Vimal Manohar, Vineel Pratap, Matthew Wiesner, Shinji Watanabe, Daniel Povey, Sanjeev Khudanpur:
Less Peaky and More Accurate CTC Forced Alignment by Label Priors. CoRR abs/2406.02560 (2024) - [i56]Ruizhe Huang, Mahsa Yarmohammadi, Sanjeev Khudanpur, Daniel Povey:
Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation. CoRR abs/2407.10303 (2024) - [i55]Zexin Cai, Henry Li Xinyuan, Ashi Garg, Leibny Paola García-Perera, Kevin Duh, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner:
Privacy versus Emotion Preservation Trade-offs in Emotion-Preserving Speaker Anonymization. CoRR abs/2409.03655 (2024) - [i54]Henry Li Xinyuan, Zexin Cai, Ashi Garg, Kevin Duh, Leibny Paola García-Perera, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner:
HLTCOE JHU Submission to the Voice Privacy Challenge 2024. CoRR abs/2409.08913 (2024) - [i53]Henry Li Xinyuan, Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak, Sanjeev Khudanpur:
Clean Label Attacks against SLU Systems. CoRR abs/2409.08985 (2024) - [i52]Alexander Polok, Dominik Klement, Matthew Wiesner, Sanjeev Khudanpur, Jan Cernocký, Lukás Burget:
Target Speaker ASR with Whisper. CoRR abs/2409.09543 (2024) - 2023
- [i51]Suzy J. Styles, Yi Han Victoria Chua, Fei Ting Woon, Hexin Liu, Leibny Paola García-Perera, Sanjeev Khudanpur, Andy W. H. Khong, Justin Dauwels:
Investigating model performance in language identification: beyond simple error statistics. CoRR abs/2305.18925 (2023) - [i50]Dongji Gao, Matthew Wiesner, Hainan Xu, Leibny Paola García, Daniel Povey, Sanjeev Khudanpur:
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts. CoRR abs/2306.01031 (2023) - [i49]Desh Raj, Daniel Povey, Sanjeev Khudanpur:
SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition. CoRR abs/2306.10559 (2023) - [i48]Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur:
HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation. CoRR abs/2306.11252 (2023) - [i47]Samuele Cornell, Matthew Wiesner, Shinji Watanabe, Desh Raj, Xuankai Chang, Paola García, Yoshiki Masuyama, Zhong-Qiu Wang, Stefano Squartini, Sanjeev Khudanpur:
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios. CoRR abs/2306.13734 (2023) - [i46]Amir Hussein, Dorsa Zeinali, Ondrej Klejch, Matthew Wiesner, Brian Yan, Shammur Absar Chowdhury, Ahmed M. Ali, Shinji Watanabe, Sanjeev Khudanpur:
Speech collage: code-switched audio generation by collaging monolingual corpora. CoRR abs/2309.15674 (2023) - [i45]Amir Hussein, Brian Yan, Antonios Anastasopoulos, Shinji Watanabe, Sanjeev Khudanpur:
Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization. CoRR abs/2309.15686 (2023) - [i44]Dongji Gao, Hainan Xu, Desh Raj, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition. CoRR abs/2309.15796 (2023) - [i43]Hexin Liu, Leibny Paola García, Xiangyu Zhang, Andy W. H. Khong, Sanjeev Khudanpur:
Enhancing Code-switching Speech Recognition with Interactive Language Biases. CoRR abs/2309.16953 (2023) - 2022
- [i42]Hexin Liu, Leibny Paola García-Perera, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles, Sanjeev Khudanpur:
Enhance Language Identification using Dual-mode Model with Knowledge Distillation. CoRR abs/2203.03218 (2022) - [i41]Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser. CoRR abs/2204.03851 (2022) - [i40]Hexin Liu, Haihua Xu, Leibny Paola García, Andy W. H. Khong, Yi He, Sanjeev Khudanpur:
Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization. CoRR abs/2210.14567 (2022) - [i39]Zili Huang, Desh Raj, Paola García, Sanjeev Khudanpur:
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings. CoRR abs/2211.00482 (2022) - [i38]Dongji Gao, Jiatong Shi, Shun-Po Chuang, Leibny Paola García, Hung-yi Lee, Shinji Watanabe, Sanjeev Khudanpur:
EURO: ESPnet Unsupervised ASR Open-source Toolkit. CoRR abs/2211.17196 (2022) - [i37]Desh Raj, Daniel Povey, Sanjeev Khudanpur:
GPU-accelerated Guided Source Separation for Meeting Transcription. CoRR abs/2212.05271 (2022) - 2021
- [i36]Shota Horiguchi, Nelson Yalta, Paola García, Yuki Takashima, Yawen Xue, Desh Raj, Zili Huang, Yusuke Fujita, Shinji Watanabe, Sanjeev Khudanpur:
The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap. CoRR abs/2102.01363 (2021) - [i35]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Streaming Transformers. CoRR abs/2102.04488 (2021) - [i34]Ke Li, Daniel Povey, Sanjeev Khudanpur:
A Parallelizable Lattice Rescoring Strategy with Neural Language Models. CoRR abs/2103.05081 (2021) - [i33]Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur:
Learning Policies for Multilingual Training of Neural Machine Translation Systems. CoRR abs/2103.06964 (2021) - [i32]Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur:
Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora. CoRR abs/2103.06968 (2021) - [i31]Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
An Asynchronous WFST-Based Decoder For Automatic Speech Recognition. CoRR abs/2103.09063 (2021) - [i30]Piotr Zelasko, Sonal Joshi, Yiwen Shao, Jesús Villalba, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Adversarial Attacks and Defenses for Speech Recognition Systems. CoRR abs/2103.17122 (2021) - [i29]Desh Raj, Sanjeev Khudanpur:
Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem. CoRR abs/2104.01954 (2021) - [i28]Guoguo Chen, Shuzhou Chai, Guanbo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Yujun Wang, Zhao You, Zhiyong Yan:
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10, 000 Hours of Transcribed Audio. CoRR abs/2106.06909 (2021) - [i27]Matthew Wiesner, Desh Raj, Sanjeev Khudanpur:
Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models. CoRR abs/2110.04863 (2021) - [i26]Piotr Zelasko, Daniel Povey, Jan "Yenda" Trmal, Sanjeev Khudanpur:
Lhotse: a speech data representation library for the modern deep learning ecosystem. CoRR abs/2110.12561 (2021) - 2020
- [i25]Zili Huang, Shinji Watanabe, Yusuke Fujita, Paola García, Yiwen Shao, Daniel Povey, Sanjeev Khudanpur:
Speaker Diarization with Region Proposal Network. CoRR abs/2002.06220 (2020) - [i24]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Alignment-Free Lattice-Free MMI. CoRR abs/2005.08347 (2020) - [i23]Yiwen Shao, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR. CoRR abs/2005.09824 (2020) - [i22]Ashish Arora, Desh Raj, Aswin Shanmugam Subramanian, Ke Li, Bar Ben-Yair, Matthew Maciejewski, Piotr Zelasko, Paola García, Shinji Watanabe, Sanjeev Khudanpur:
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge. CoRR abs/2006.07898 (2020) - [i21]Ruizhe Huang, Ke Li, Ashish Arora, Daniel Povey, Sanjeev Khudanpur:
Efficient MDI Adaptation for n-gram Language Models. CoRR abs/2008.02385 (2020) - [i20]Matthew Maciejewski, Jing Shi, Shinji Watanabe, Sanjeev Khudanpur:
Training Noisy Single-Channel Speech Separation With Noisy Oracle Sources: A Large Gap and A Small Step. CoRR abs/2010.12430 (2020) - [i19]Desh Raj, Leibny Paola García-Perera, Zili Huang, Shinji Watanabe, Daniel Povey, Andreas Stolcke, Sanjeev Khudanpur:
DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs. CoRR abs/2011.01997 (2020) - [i18]Desh Raj, Jesús Villalba, Daniel Povey, Sanjeev Khudanpur:
Frustratingly Easy Noise-aware Training of Acoustic Models. CoRR abs/2011.02090 (2020) - [i17]Desh Raj, Zili Huang, Sanjeev Khudanpur:
Multi-class Spectral Clustering with Overlaps for Speaker Diarization. CoRR abs/2011.02900 (2020) - [i16]Jonathan D. Jones, Cathryn S. Cortesa, Amy Lynne Shelton, Barbara Landau, Sanjeev Khudanpur, Gregory D. Hager:
Fine-grained activity recognition for assembly videos. CoRR abs/2012.01392 (2020) - 2019
- [i15]Desh Raj, David Snyder, Daniel Povey, Sanjeev Khudanpur:
Probing the Information Encoded in x-vectors. CoRR abs/1909.06351 (2019) - [i14]Yiming Wang, Tongfei Chen, Hainan Xu, Shuoyang Ding, Hang Lv, Yiwen Shao, Nanyun Peng, Lei Xie, Shinji Watanabe, Sanjeev Khudanpur:
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit. CoRR abs/1909.08723 (2019) - 2018
- [i13]Lucas Ondel, Pierre Godard, Laurent Besacier, Elin Larsen, Mark Hasegawa-Johnson, Odette Scharenborg, Emmanuel Dupoux, Lukás Burget, François Yvon, Sanjeev Khudanpur:
Bayesian Models for Unit Discovery on a Very Low Resource Language. CoRR abs/1802.06053 (2018) - [i12]Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Sanjeev Khudanpur, Najim Dehak:
The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection. CoRR abs/1802.08731 (2018) - [i11]Zhehuai Chen, Justin Luitjens, Hainan Xu, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
A GPU-based WFST Decoder with Exact Lattice Generation. CoRR abs/1804.03243 (2018) - [i10]Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. CoRR abs/1807.06204 (2018) - [i9]Matthew Maciejewski, Gregory Sell, Leibny Paola García-Perera, Shinji Watanabe, Sanjeev Khudanpur:
Building Corpora for Single-Channel Speech Separation Across Multiple Domains. CoRR abs/1811.02641 (2018) - [i8]Matthew Wiesner, Adithya Renduchintala, Shinji Watanabe, Chunxi Liu, Najim Dehak, Sanjeev Khudanpur:
Low Resource Multi-modal Data Augmentation for End-to-end ASR. CoRR abs/1812.03919 (2018) - 2017
- [i7]Chunxi Liu, Jinyi Yang, Ming Sun, Santosh Kesiraju, Alena Rott, Lucas Ondel, Pegah Ghahremani, Najim Dehak, Lukás Burget, Sanjeev Khudanpur:
An Empirical Evaluation of Zero Resource Acoustic Unit Discovery. CoRR abs/1702.01360 (2017) - [i6]Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur:
Topic Identification for Speech without ASR. CoRR abs/1703.07476 (2017) - [i5]Jan Trmal, Gaurav Kumar, Vimal Manohar, Sanjeev Khudanpur, Matt Post, Paul McNamee:
Using of heterogeneous corpora for training of an ASR system. CoRR abs/1706.00321 (2017) - [i4]Xiaohui Zhang, Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework. CoRR abs/1706.03747 (2017) - 2015
- [i3]Yu Zhang, Guoguo Chen, Dong Yu, Kaisheng Yao, Sanjeev Khudanpur, James R. Glass:
Highway Long Short-Term Memory RNNs for Distant Speech Recognition. CoRR abs/1510.08983 (2015) - 2013
- [i2]Damianos G. Karakos, Mark Dredze, Sanjeev Khudanpur:
Estimating Confusions in the ASR Channel for Improved Topic-based Language Model Adaptation. CoRR abs/1303.5148 (2013) - 2009
- [i1]Christopher M. White, Sanjeev Khudanpur, Patrick J. Wolfe:
Likelihood-based semi-supervised model selection with applications to speech processing. CoRR abs/0911.3944 (2009)
Coauthor Index
aka: Pegah Ghahramani
aka: Dan Povey
aka: Murat Saraçlar
aka: Jan "Yenda" Trmal
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-14 21:02 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint