default search action
Desh Raj
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c24]Ruizhe Huang, Mahsa Yarmohammadi, Jan Trmal, Jing Liu, Desh Raj, Leibny Paola García, Alexei V. Ivanov, Patrick Ehlen, Mingzhi Yu, Dan Povey, Sanjeev Khudanpur:
ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition. LREC/COLING 2024: 3700-3706 - [c23]George August Wright, Umberto Cappellazzo, Salah Zaiem, Desh Raj, Lucas Ondel Yang, Daniele Falavigna, Mohamed Nabih Ali, Alessio Brutti:
Training Early-Exit Architectures for Automatic Speech Recognition: Fine-Tuning Pre-Trained Models or Training from Scratch. ICASSP Workshops 2024: 685-689 - [c22]Jennifer Drexler Fox, Desh Raj, Natalie Delworth, Quinn McNamara, Corey Miller, Migüel Jetté:
Updated Corpora and Benchmarks for Long-Form Speech Recognition. ICASSP 2024: 13246-13250 - [c21]Desh Raj, Matthew Wiesner, Matthew Maciejewski, Paola García, Daniel Povey, Sanjeev Khudanpur:
On Speaker Attribution with SURT. Odyssey 2024: 91-98 - [i22]Desh Raj, Matthew Wiesner, Matthew Maciejewski, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
On Speaker Attribution with SURT. CoRR abs/2401.15676 (2024) - [i21]Desh Raj:
Listening to Multi-talker Conversations: Modular and End-to-end Perspectives. CoRR abs/2402.08932 (2024) - [i20]Desh Raj, Gil Keren, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Faster Speech-LLaMA Inference with Multi-token Prediction. CoRR abs/2409.08148 (2024) - [i19]Yufeng Yang, Desh Raj, Ju Lin, Niko Moritz, Junteng Jia, Gil Keren, Egor Lakomkin, Yiteng Huang, Jacob Donley, Jay Mahadeokar, Ozlem Kalinli:
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses. CoRR abs/2409.11494 (2024) - 2023
- [j5]Desh Raj, Daniel Povey, Sanjeev Khudanpur:
SURT 2.0: Advances in Transducer-Based Multi-Talker Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3800-3813 (2023) - [c20]Dongji Gao, Hainan Xu, Desh Raj, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
Learning From Flawed Data: Weakly Supervised Automatic Speech Recognition. ASRU 2023: 1-8 - [c19]Zili Huang, Desh Raj, Paola García, Sanjeev Khudanpur:
Adapting Self-Supervised Models to Multi-Talker Speech Recognition Using Speaker Embeddings. ICASSP 2023: 1-5 - [c18]Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. ICASSP 2023: 1-5 - [c17]Desh Raj, Daniel Povey, Sanjeev Khudanpur:
GPU-accelerated Guided Source Separation for Meeting Transcription. INTERSPEECH 2023: 3507-3511 - [i18]Desh Raj, Daniel Povey, Sanjeev Khudanpur:
SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition. CoRR abs/2306.10559 (2023) - [i17]Samuele Cornell, Matthew Wiesner, Shinji Watanabe, Desh Raj, Xuankai Chang, Paola García, Yoshiki Masuyama, Zhong-Qiu Wang, Stefano Squartini, Sanjeev Khudanpur:
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios. CoRR abs/2306.13734 (2023) - [i16]George August Wright, Umberto Cappellazzo, Salah Zaiem, Desh Raj, Lucas Ondel Yang, Daniele Falavigna, Alessio Brutti:
Training dynamic models using early exits for automatic speech recognition on resource-constrained devices. CoRR abs/2309.09546 (2023) - [i15]Jennifer Drexler Fox, Desh Raj, Natalie Delworth, Quinn McNamara, Corey Miller, Migüel Jetté:
Updated Corpora and Benchmarks for Long-Form Speech Recognition. CoRR abs/2309.15013 (2023) - [i14]Dongji Gao, Hainan Xu, Desh Raj, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition. CoRR abs/2309.15796 (2023) - 2022
- [j4]Prateek Singh, Rajat Ujjainiya, Satyartha Prakash, Salwa Naushin, Viren Sardana, Nitin Bhatheja, Ajay Pratap Singh, Joydeb Barman, Kartik Kumar, Saurabh Gayali, Raju Khan, Birendra Singh Rawat, Karthik Bharadwaj Tallapaka, Mahesh Anumalla, Amit Lahiri, Susanta Kar, Vivek Bhosale, Mrigank Srivastava, Madhav Nilakanth Mugale, C. P. Pandey, Shaziya Khan, Shivani Katiyar, Desh Raj, Sharmeen Ishteyaque, Sonu Khanka, Ankita Rani, Promila, Jyotsna Sharma, Anuradha Seth, Mukul Dutta, Nishant Saurabh, Murugan Veerapandian, Ganesh Venkatachalam, Deepak Bansal, Dinesh Gupta, Prakash M. Halami, Muthukumar Serva Peddha, Ravindra P. Veeranna, Anirban Pal, Ranvijay Kumar Singh, Suresh Kumar Anandasadagopan, Parimala Karuppanan, Syed Nasar Rahman, Gopika Selvakumar, Venkatesan Subramanian, Malay Kumar Karmakar, Harish Kumar Sardana, Anamika Kothari, Devendra Singh Parihar, Anupma Thakur, Anas Saifi, Naman Gupta, Yogita Singh, Ritu Reddu, Rizul Gautam, Anuj Mishra, Avinash Mishra, Iranna Gogeri, Geethavani Rayasam, Yogendra Padwad, Vikram Patial, Vipin Hallan, Damanpreet Singh, Narendra Tirpude, Partha Chakrabarti, Sujay Krishna Maity, Dipyaman Ganguly, Ramakrishna Sistla, Narender Kumar Balthu, Kiran Kumar A, Siva Ranjith, B. Vijay Kumar, Piyush Singh Jamwal, Anshu Wali, Sajad Ahmed, Rekha Chouhan, Sumit G. Gandhi, Nancy Sharma, Garima Rai, Faisal Irshad, Vijay Lakshmi Jamwal, Masroor Ahmad Paddar, Sameer Ullah Khan, Fayaz Malik, Debashish Ghosh, Ghanshyam Thakkar, Saroj Kanta Barik, Prabhanshu Tripathi, Yatendra Kumar Satija, Sneha Mohanty, Md. Tauseef Khan, Umakanta Subudhi, Pradip Sen, Rashmi Kumar, Anshu Bhardwaj, Pawan Gupta, Deepak Sharma, Amit Tuli, Saumya Ray chaudhuri, Srinivasan Krishnamurthi, L. Prakash, Ch V. Rao, B. N. Singh, Arvindkumar Chaurasiya, Meera Chaurasiyar, Mayuri Bhadange, Bhagyashree Likhitkar, Sharada Mohite, Yogita Patil, Mahesh Kulkarni, Rakesh Joshi, Vaibhav Pandya, Sachin Mahajan, Amita Patil, Rachel Samson, Tejas Vare, Mahesh Dharne, Ashok Giri, Shilpa Paranjape, G. Narahari Sastry, Jatin Kalita, Tridip Phukan, Prasenjit Manna, Wahengbam Romi, Pankaj Bharali, Dibyajyoti Ozah, Ravi Kumar Sahu, Prachurjya Dutta, Moirangthem Goutam Singh, Gayatri Gogoi, Yasmin Begam Tapadar, Elapavalooru VSSK. Babu, Rajeev K. Sukumaran, Aishwarya R. Nair, Anoop Puthiyamadam, Prajeesh Kooloth Valappil, Adrash Velayudhan Pillai Prasannakumari, Kalpana Chodankar, Samir Damare, Ved Varun Agrawal, Kumardeep Chaudhary, Anurag Agrawal, Shantanu Sengupta, Debasis Dash:
A machine learning-based approach to determine infection status in recipients of BBV152 (Covaxin) whole-virion inactivated SARS-CoV-2 vaccine for serological surveys. Comput. Biol. Medicine 146: 105419 (2022) - [j3]Zili Huang, Marc Delcroix, Leibny Paola García-Perera, Shinji Watanabe, Desh Raj, Sanjeev Khudanpur:
Joint speaker diarization and speech recognition based on region proposal networks. Comput. Speech Lang. 72: 101316 (2022) - [c16]Desh Raj, Liang Lu, Zhuo Chen, Yashesh Gaur, Jinyu Li:
Continuous Streaming Multi-Talker ASR with Dual-Path Transducers. ICASSP 2022: 7317-7321 - [c15]Matthew Wiesner, Desh Raj, Sanjeev Khudanpur:
Injecting Text and Cross-Lingual Supervision in Few-Shot Learning from Self-Supervised Models. ICASSP 2022: 8597-8601 - [c14]Giovanni Morrone, Samuele Cornell, Desh Raj, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini:
Low-Latency Speech Separation Guided Diarization for Telephone Conversations. SLT 2022: 641-646 - [i13]Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. CoRR abs/2210.11588 (2022) - [i12]Zili Huang, Desh Raj, Paola García, Sanjeev Khudanpur:
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings. CoRR abs/2211.00482 (2022) - [i11]Desh Raj, Daniel Povey, Sanjeev Khudanpur:
GPU-accelerated Guided Source Separation for Meeting Transcription. CoRR abs/2212.05271 (2022) - 2021
- [c13]Katerina Zmolíková, Marc Delcroix, Desh Raj, Shinji Watanabe, Jan Cernocký:
Auxiliary Loss Function for Target Speech Extraction and Recognition with Weak Supervision Based on Speaker Characteristics. Interspeech 2021: 1464-1468 - [c12]Desh Raj, Sanjeev Khudanpur:
Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem. Interspeech 2021: 2351-2355 - [c11]Matthew Wiesner, Mousmita Sarma, Ashish Arora, Desh Raj, Dongji Gao, Ruizhe Huang, Supreet Preet, Moris Johnson, Zikra Iqbal, Nagendra Goel, Jan Trmal, Leibny Paola García-Perera, Sanjeev Khudanpur:
Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition. Interspeech 2021: 2906-2910 - [c10]Maokui He, Desh Raj, Zili Huang, Jun Du, Zhuo Chen, Shinji Watanabe:
Target-Speaker Voice Activity Detection with Improved i-Vector Estimation for Unknown Number of Speaker. Interspeech 2021: 3555-3559 - [c9]Desh Raj, Zili Huang, Sanjeev Khudanpur:
Multi-Class Spectral Clustering with Overlaps for Speaker Diarization. SLT 2021: 582-589 - [c8]Desh Raj, Leibny Paola García-Perera, Zili Huang, Shinji Watanabe, Daniel Povey, Andreas Stolcke, Sanjeev Khudanpur:
DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs. SLT 2021: 881-888 - [c7]Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Maokui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis. SLT 2021: 897-904 - [c6]Zhong-Qiu Wang, Hakan Erdogan, Scott Wisdom, Kevin W. Wilson, Desh Raj, Shinji Watanabe, Zhuo Chen, John R. Hershey:
Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement. SLT 2021: 905-911 - [i10]Shota Horiguchi, Nelson Yalta, Paola García, Yuki Takashima, Yawen Xue, Desh Raj, Zili Huang, Yusuke Fujita, Shinji Watanabe, Sanjeev Khudanpur:
The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap. CoRR abs/2102.01363 (2021) - [i9]Desh Raj, Sanjeev Khudanpur:
Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem. CoRR abs/2104.01954 (2021) - [i8]Desh Raj, Liang Lu, Zhuo Chen, Yashesh Gaur, Jinyu Li:
Continuous Streaming Multi-Talker ASR with Dual-path Transducers. CoRR abs/2109.08555 (2021) - [i7]Matthew Wiesner, Desh Raj, Sanjeev Khudanpur:
Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models. CoRR abs/2110.04863 (2021) - 2020
- [i6]Ashish Arora, Desh Raj, Aswin Shanmugam Subramanian, Ke Li, Bar Ben-Yair, Matthew Maciejewski, Piotr Zelasko, Paola García, Shinji Watanabe, Sanjeev Khudanpur:
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge. CoRR abs/2006.07898 (2020) - [i5]Desh Raj, Leibny Paola García-Perera, Zili Huang, Shinji Watanabe, Daniel Povey, Andreas Stolcke, Sanjeev Khudanpur:
DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs. CoRR abs/2011.01997 (2020) - [i4]Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Mao-Kui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis. CoRR abs/2011.02014 (2020) - [i3]Desh Raj, Jesús Villalba, Daniel Povey, Sanjeev Khudanpur:
Frustratingly Easy Noise-aware Training of Acoustic Models. CoRR abs/2011.02090 (2020) - [i2]Desh Raj, Zili Huang, Sanjeev Khudanpur:
Multi-class Spectral Clustering with Overlaps for Speaker Diarization. CoRR abs/2011.02900 (2020)
2010 – 2019
- 2019
- [c5]Desh Raj, David Snyder, Daniel Povey, Sanjeev Khudanpur:
Probing the Information Encoded in X-Vectors. ASRU 2019: 726-733 - [c4]Ashish Arora, Paola García, Shinji Watanabe, Vimal Manohar, Yiwen Shao, Sanjeev Khudanpur, Chun-Chieh Chang, Babak Rekabdar, Bagher BabaAli, Daniel Povey, David Etter, Desh Raj, Hossein Hadian, Jan Trmal:
Using ASR Methods for OCR. ICDAR 2019: 663-668 - [i1]Desh Raj, David Snyder, Daniel Povey, Sanjeev Khudanpur:
Probing the Information Encoded in x-vectors. CoRR abs/1909.06351 (2019) - 2018
- [j2]Shakaiba Majeed, Aditya Gupta, Desh Raj, Frank Chung-Hoon Rhee:
Uncertain fuzzy self-organization based clustering: interval type-2 fuzzy approach to adaptive resonance theory. Inf. Sci. 424: 69-90 (2018) - [j1]Desh Raj, Aditya Gupta, Bhuvnesh Garg, Kenil Tanna, Frank Chung-Hoon Rhee:
Analysis of Data Generated From Multidimensional Type-1 and Type-2 Fuzzy Membership Functions. IEEE Trans. Fuzzy Syst. 26(2): 681-693 (2018) - 2017
- [c3]Desh Raj, Sunil Kumar Sahu, Ashish Anand:
Learning local and global contexts using a convolutional recurrent network model for relation classification in biomedical text. CoNLL 2017: 311-321 - [c2]Desh Raj, Aditya Gupta, Kenil Tanna, Bhuvnesh Garg, Frank Chung-Hoon Rhee:
Principal component analysis approach in selecting type-1 and type-2 fuzzy membership functions for high-dimensional data. IFSA-SCIS 2017: 1-6 - 2016
- [c1]Desh Raj, Kenil Tanna, Bhuvnesh Garg, Frank Chung-Hoon Rhee:
Visual analysis and representations of type-2 fuzzy membership functions. FUZZ-IEEE 2016: 550-554
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-22 20:13 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint