


Sathvik Udupa
2020 – today
- 2025
[c18]Amartyaveer, Saurabh Kumar, Sumit Sharma, Sathvik Udupa, Sandhya Badiger, Abhayjeet Singh, Deekshitha G, Jesuraja Bandekar, Savitha Murthy, Prasanta Kumar Ghosh:
Improving Dialect Identification in Indian Languages Using Multimodal Features from Dialect Informed ASR. ICASSP 2025: 1-5
[i6]Sathvik Udupa, Shinji Watanabe, Petr Schwarz, Jan Cernocký:
Streaming Endpointer for Spoken Dialogue using Neural Audio Codecs and Label-Delayed Training. CoRR abs/2506.07081 (2025)
[i5]Sonal Kumar, Simon Sedlácek, Vaibhavi Lokegaonkar, Fernando López, Wenyi Yu, Nishit Anand, Hyeonggon Ryu, Lichang Chen, Maxim Plicka, Miroslav Hlavácek, William Fineas Ellingwood, Sathvik Udupa, Siyuan Hou, Allison Ferner, Sara Barahona, Cecilia Bolaños, Satish Rahi, Laura Herrera-Alarcón, Satvik Dixit, Rupali S. Patil, Soham Deshmukh, Lasha Koroshinadze, Yao Liu, Leibny Paola Garcia Perera, Eleni Zanou, Themos Stafylakis, Joon Son Chung, David Harwath, Chao Zhang, Dinesh Manocha, Alicia Lozano-Diez, Santosh Kesiraju, Sreyan Ghosh, Ramani Duraiswami:
MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence. CoRR abs/2508.13992 (2025)
- 2024
[c17]Abhayjeet Singh, Amala Nagireddi, Deekshitha G, Jesuraja Bandekar, Roopa R., Sandhya Badiger, Sathvik Udupa, Prasanta Kumar Ghosh, Hema A. Murthy, Pranaw Kumar, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich:
LIMMITS'24: Multi-Speaker, Multi-Lingual Indic TTS with Voice Cloning. ICASSP Workshops 2024: 61-62
[c16]Jesuraj Bandekar, Sathvik Udupa, Prasanta Kumar Ghosh:
Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss. INTERSPEECH 2024
[c15]Sathvik Udupa, Jesuraj Bandekar, Saurabh Kumar, Deekshitha G, Sandhya Badiger, Abhayjeet Singh, Savitha Murthy, Priyanka Pai, Srinivasa Raghavan K. M., Raoul Nanavati, Prasanta Kumar Ghosh:
Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models. INTERSPEECH 2024
[c14]Sathvik Udupa, Soumi Maiti, Prasanta Kumar Ghosh:
IndicMOS: Multilingual MOS Prediction for 7 Indian languages. INTERSPEECH 2024
- 2023
[c13]Sathvik Udupa, Jesuraja Bandekar, Deekshitha G, Saurabh Kumar, Prasanta Kumar Ghosh, Sandhya Badiger, Abhayjeet Singh, Savitha Murthy, Priyanka Pai, Srinivasa Raghavan K. M., Raoul Nanavati:
Gated Multi Encoders and Multitask Objectives for Dialectal Speech Recognition in Indian Languages. ASRU 2023: 1-8
[c12]Abhayjeet Singh, Amala Nagireddi, Deekshitha G, Jesuraja Bandekar, Roopa R., Sandhya Badiger, Sathvik Udupa, Prasanta Kumar Ghosh, Hema A. Murthy, Heiga Zen, Pranaw Kumar, Kamal Kant, Amol Bole, Bira Chandra Singh, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich:
Lightweight, Multi-Speaker, Multi-Lingual Indic Text-to-Speech. ICASSP 2023: 1-2
[c11]Sathvik Udupa, Prasanta Kumar Ghosh:
Real-Time MRI Video Synthesis from Time Aligned Phonemes with Sequence-to-Sequence Networks. ICASSP 2023: 1-5
[c10]Sathvik Udupa, C. Siddarth, Prasanta Kumar Ghosh:
Improved Acoustic-to-Articulatory Inversion Using Representations from Pretrained Self-Supervised Learning Models. ICASSP 2023: 1-5
[c9]Jesuraja Bandekar, Sathvik Udupa, Prasanta Kumar Ghosh:
Exploring a classification approach using quantised articulatory movements for acoustic to articulatory inversion. INTERSPEECH 2023: 5147-5151
[c8]Abhayjeet Singh, Anjali Jayakumar, Deekshitha G, Hitesh Kumar, Jesuraja Bandekar, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Prasanta Kumar Ghosh:
An End-to-End TTS Model in Chhattisgarhi, a Low-Resource Indian Language. SPECOM (2) 2023: 164-172
[c7]Abhayjeet Singh, Arjun Singh Mehta, Ashish Khuraishi K. S, Deekshitha G, Gauri Date, Jai Nanavati, Jesuraja Bandekar, Karnalius Basumatary, Karthika P, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Prasanta Kumar Ghosh, Prashanthi V, Priyanka Pai, Raoul Nanavati, Sai Praneeth Reddy Mora, Srinivasa Raghavan K. M.:
An ASR Corpus in Chhattisgarhi, a Low Resource Indian Language. SPECOM (2) 2023: 173-181
[i4]Abhayjeet Singh, Arjun Singh Mehta, Ashish Khuraishi K. S, Deekshitha G, Gauri Date, Jai Nanavati, Jesuraja Bandekar, Karnalius Basumatary, Karthika P, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Savitha, Prasanta Kumar Ghosh, Prashanthi V, Priyanka Pai, Raoul Nanavati, Rohan Saxena, Sai Praneeth Reddy Mora, Srinivasa Raghavan K. M.:
Model Adaptation for ASR in low-resource Indian Languages. CoRR abs/2307.07948 (2023)
- 2022
[c6]Sathvik Udupa, Aravind Illa, Prasanta Kumar Ghosh:
Streaming model for Acoustic to Articulatory Inversion with transformer networks. INTERSPEECH 2022: 625-629
[c5]Anish Bhanushali, Grant Bridgman, Deekshitha G, Prasanta Kumar Ghosh, Pratik Kumar, Saurabh Kumar, Adithya Raj Kolladath, Nithya Ravi, Aaditeshwar Seth, Ashish Seth, Abhayjeet Singh, Vrunda N. Sukhadia, Srinivasan Umesh, Sathvik Udupa, Lodagala V. S. V. Durga Prasad:
Gram Vaani ASR Challenge on spontaneous telephone speech recordings in regional variations of Hindi. INTERSPEECH 2022: 3548-3552
[c4]C. Siddarth, Sathvik Udupa, Prasanta Kumar Ghosh:
Watch Me Speak: 2D Visualization of Human Mouth during Speech. INTERSPEECH 2022: 3667-3668
[i3]Sathvik Udupa, C. Siddarth, Prasanta Kumar Ghosh:
Improved acoustic-to-articulatory inversion using representations from pretrained self-supervised learning models. CoRR abs/2210.16871 (2022)
[i2]Sathvik Udupa, Prasanta Kumar Ghosh:
Real-Time MRI Video synthesis from time aligned phonemes with sequence-to-sequence networks. CoRR abs/2210.16881 (2022)
- 2021
[c3]Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh:
Estimating Articulatory Movements in Speech Production with Transformer Networks. Interspeech 2021: 1154-1158
[c2]Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh:
Web Interface for Estimating Articulatory Movements in Speech Production from Acoustics and Text. Interspeech 2021: 4868-4869
[i1]Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh:
Estimating articulatory movements in speech production with transformer networks. CoRR abs/2104.05017 (2021)
- 2020
[c1]Jhansi Mallela, Aravind Illa, Suhas B. N., Sathvik Udupa, Yamini Belur, Atchayaram Nalini, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh:
Voice based classification of patients with Amyotrophic Lateral Sclerosis, Parkinson's Disease and Healthy Controls with CNN-LSTM using transfer learning. ICASSP 2020: 6784-6788
last updated on 2025-10-16 00:16 CEST by the dblp team
all metadata released as open data under CC0 1.0 license







