K. Sreenivasa Rao
Person information
- affiliation: Indian Institute of Technology Kharagpur, West Bengal, India
2020 – today
- 2024
- [j96] Priya Dharshini G, K. Sreenivasa Rao:
Transfer Accent Identification Learning for Enhancing Speech Emotion Recognition. Circuits Syst. Signal Process. 43(8): 5090-5120 (2024)
- [j95] Arijul Haque, Krothapalli Sreenivasa Rao:
Speech emotion recognition with transfer learning and multi-condition training for noisy environments. Int. J. Speech Technol. 27(2): 353-365 (2024)
- [j94] Arijul Haque, K. Sreenivasa Rao:
Hierarchical emotion recognition from speech using source, power spectral and prosodic features. Multim. Tools Appl. 83(7): 19629-19661 (2024)
- [j93] Abhijit Debnath, K. Sreenivasa Rao, Partha Pratim Das:
A multi-modal lecture video indexing and retrieval framework with multi-scale residual attention network and multi-similarity computation. Signal Image Video Process. 18(3): 1993-2006 (2024)
- [j92] Yagnavajjula Madhu Keerthana, Mittapalle Kiran Reddy, Paavo Alku, K. Sreenivasa Rao, Pabitra Mitra:
Automatic classification of neurological voice disorders using wavelet scattering features. Speech Commun. 157: 103040 (2024)
- [i18] Shalika Kumbham, Abhijit Debnath, Krothapalli Sreenivasa Rao:
Efficient Indexing of Meta-Data (Extracted from Educational Videos). CoRR abs/2401.01356 (2024)
- [i17] Debopriyo Banerjee, Krothapalli Sreenivasa Rao, Shamik Sural, Niloy Ganguly:
BOXREC: Recommending a Box of Preferred Outfits in Online Shopping. CoRR abs/2402.16660 (2024)
- [i16] Aravinda Reddy P. N, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra:
MLSD-GAN - Generating Strong High Quality Face Morphing Attacks using Latent Semantic Disentanglement. CoRR abs/2404.12679 (2024)
- [i15] Aravinda Reddy P. N, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra, Vinod Rathod:
Straight Through Gumbel Softmax Estimator based Bimodal Neural Architecture Search for Audio-Visual Deepfake Detection. CoRR abs/2406.13384 (2024)
- [i14] Aravinda Reddy P. N, Raghavendra Ramachandra, K. Sreenivasa Rao, Pabitra Mitra:
NeuralMultiling: A Novel Neural Architecture Search for Smartphone based Multilingual Speaker Verification. CoRR abs/2408.04362 (2024)
- 2023
- [j91] P. Sudhakar, K. Sreenivasa Rao, Pabitra Mitra:
Unsupervised spoken term discovery using pseudo lexical induction. Int. J. Speech Technol. 26(3): 801-816 (2023)
- [j90] Priya Dharshini G, K. Sreenivasa Rao:
Accent classification from an emotional speech in clean and noisy environments. Multim. Tools Appl. 82(3): 3485-3508 (2023)
- [j89] P. Sudhakar, K. Sreenivasa Rao, Pabitra Mitra:
A Novel Zero-Resource Spoken Term Detection Using Affinity Kernel Propagation with Acoustic Feature Map. SN Comput. Sci. 4(3): 310 (2023)
- [c85] P. N. Aravinda Reddy, K. Sreenivasa Rao, Raghavendra Ramachandra, Pabitra Mitra:
ExtSwap: Leveraging Extended Latent Mapper for Generating High Quality Face Swapping. CVIP (1) 2023: 219-230
- [c84] Abhijit Debnath, K. Sreenivasa Rao, Partha Pratim Das:
Similarity-based Multi-Modal Lecture Video Indexing and Retrieval with Deep Learning. ICCCNT 2023: 1-7
- [c83] P. Sudhakar, K. Sreenivasa Rao, Pabitra Mitra:
Self-Paced Pattern Augmentation for Spoken Term Detection in Zero-Resource. INTERSPEECH 2023: 1618-1622
- [c82] Saikat Biswas, Koushiki Dasgupta Chaudhuri, Pabitra Mitra, Krothapalli Sreenivasa Rao:
Relation Predictions in Comorbid Disease Centric Knowledge Graph Using Heterogeneous GNN Models. IWBBIO (2) 2023: 343-356
- [c81] P. Sudhakar, K. Sreenivasa Rao, Pabitra Mitra:
Unsupervised Discovery of Recurring Spoken Terms Using Diagonal Patterns. PReMI 2023: 61-69
- [i13] Aravinda Reddy P. N, K. Sreenivasa Rao, Raghavendra Ramachandra, Pabitra Mitra:
ExtSwap: Leveraging Extended Latent Mapper for Generating High Quality Face Swapping. CoRR abs/2310.12736 (2023)
- 2022
- [j88] Kishore Kumar Ravi, K. Sreenivasa Rao:
A novel approach to unsupervised pattern discovery in speech using Convolutional Neural Network. Comput. Speech Lang. 71: 101259 (2022)
- [j87] Kishore Kumar Ravi, Krothapalli Sreenivasa Rao:
Phoneme Segmentation-Based Unsupervised Pattern Discovery and Clustering of Speech Signals. Circuits Syst. Signal Process. 41(4): 2088-2117 (2022)
- [j86] Kumud Tripathi, K. Sreenivasa Rao:
CycleGAN-Based Speech Mode Transformation Model for Robust Multilingual ASR. Circuits Syst. Signal Process. 41(9): 5283-5305 (2022)
- [j85] Kumud Tripathi, K. Sreenivasa Rao:
Correction to: CycleGAN-Based Speech Mode Transformation Model for Robust Multilingual ASR. Circuits Syst. Signal Process. 41(9): 5306 (2022)
- [j84] Yagnavajjula Madhu Keerthana, K. Sreenivasa Rao, Pabitra Mitra:
Dysarthric speech detection from telephone quality speech using epoch-based pitch perturbation features. Int. J. Speech Technol. 25(4): 967-973 (2022)
- [j83] Kumud Tripathi, K. Sreenivasa Rao:
VOP detection for read and conversation speech using CWT coefficients and phone boundaries. J. Ambient Intell. Humaniz. Comput. 13(1): 105-116 (2022)
- [c80] Soumen Paul, Rounak Saha, Swarup Padhi, Srijoni Majumdar, Partha Pratim Das, K. Sreenivasa Rao:
NrityaManch: An Annotation and Retrieval System for Bharatanatyam Dance. FIRE 2022: 65-73
- [i12] Gurunath Reddy M, K. Sreenivasa Rao, Partha Pratim Das:
Melody Extraction from Polyphonic Music by Deep Learning Approaches: A Review. CoRR abs/2202.01078 (2022)
- 2021
- [j82] Hareesh Mandalapu, Aravinda Reddy P. N, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra, S. R. Mahadeva Prasanna, Christoph Busch:
Audio-Visual Biometric Recognition and Presentation Attack Detection: A Comprehensive Survey. IEEE Access 9: 37431-37455 (2021)
- [j81] Hareesh Mandalapu, Aravinda Reddy P. N, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra, S. R. Mahadeva Prasanna, Christoph Busch:
Multilingual Audio-Visual Smartphone Dataset and Evaluation. IEEE Access 9: 153240-153257 (2021)
- [j80] Pradeep Rengaswamy, Gurunath Reddy M., K. Sreenivasa Rao, Pallab Dasgupta:
hf0: A Hybrid Pitch Extraction Method for Multimodal Voice. Circuits Syst. Signal Process. 40(1): 262-275 (2021)
- [j79] Pradeep Rengaswamy, K. Sreenivasa Rao, Pallab Dasgupta:
SongF0: A Spectrum-Based Fundamental Frequency Estimation for Monophonic Songs. Circuits Syst. Signal Process. 40(2): 772-797 (2021)
- [j78] G. Sreeram, S. Pradeep, K. Sreenivasa Rao, B. Deevana Raju, Nikhat Parveen:
Moving ridge neuronal espionage network simulation for reticulum invasion sensing. Int. J. Pervasive Comput. Commun. 17(1): 64-77 (2021)
- [j77] Nirmalya Sen, Md. Sahidullah, Hemant A. Patil, Shyamal Kumar Das Mandal, Krothapalli Sreenivasa Rao, Tapan Kumar Basu:
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework. Int. J. Speech Technol. 24(4): 1067-1088 (2021)
- [j76] Kumud Tripathi, K. Sreenivasa Rao:
Robust vowel region detection method for multimode speech. Multim. Tools Appl. 80(9): 13615-13637 (2021)
- [j75] K. E. Manjunath, Srinivasa Raghavan K. M., K. Sreenivasa Rao, Dinesh Babu Jayagopi, V. Ramasubramanian:
Approaches for Multilingual Phone Recognition in Code-switched and Non-code-switched Scenarios Using Indian Languages. ACM Trans. Asian Low Resour. Lang. Inf. Process. 20(4): 55:1-55:19 (2021)
- [j74] Saikat Biswas, Pabitra Mitra, Krothapalli Sreenivasa Rao:
Relation Prediction of Co-Morbid Diseases Using Knowledge Graph Completion. IEEE ACM Trans. Comput. Biol. Bioinform. 18(2): 708-717 (2021)
- [c79] Soumava Paul, Gurunath Reddy M, K. Sreenivasa Rao, Partha Pratim Das:
Knowledge Distillation for Singing Voice Detection. Interspeech 2021: 4159-4163
- [i11] Hareesh Mandalapu, Aravinda Reddy P. N, Raghavendra Ramachandra, K. Sreenivasa Rao, Pabitra Mitra, S. R. Mahadeva Prasanna, Christoph Busch:
Audio-Visual Biometric Recognition and Presentation Attack Detection: A Comprehensive Survey. CoRR abs/2101.09725 (2021)
- [i10] Nirmalya Sen, Md. Sahidullah, Hemant A. Patil, Shyamal Kumar Das Mandal, Krothapalli Sreenivasa Rao, Tapan Kumar Basu:
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework. CoRR abs/2105.11728 (2021)
- [i9] Hareesh Mandalapu, Aravinda Reddy P. N, Raghavendra Ramachandra, K. Sreenivasa Rao, Pabitra Mitra, S. R. Mahadeva Prasanna, Christoph Busch:
Multilingual Audio-Visual Smartphone Dataset And Evaluation. CoRR abs/2109.04138 (2021)
- 2020
- [j73] Mittapalle Kiran Reddy, Paavo Alku, Krothapalli Sreenivasa Rao:
Detection of Specific Language Impairment in Children Using Glottal Source Features. IEEE Access 8: 15273-15279 (2020)
- [j72] Mittapalle Kiran Reddy, K. Sreenivasa Rao:
Excitation modelling using epoch features for statistical parametric speech synthesis. Comput. Speech Lang. 60 (2020)
- [j71] Mittapalle Kiran Reddy, K. Sreenivasa Rao:
DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features. Neural Process. Lett. 51(2): 2029-2042 (2020)
- [j70] Pradeep Rengaswamy, Mittapalle Kiran Reddy, Krothapalli Sreenivasa Rao, Pallab Dasgupta:
Robust f0 extraction from monophonic signals using adaptive sub-band filtering. Speech Commun. 116: 77-85 (2020)
- [j69] Kumud Tripathi, Mittapalle Kiran Reddy, K. Sreenivasa Rao:
Multilingual and multimode phone recognition system for Indian languages. Speech Commun. 119: 12-23 (2020)
- [j68] Harikrishna D. M., K. Sreenivasa Rao:
Children's Story Classification in Indian Languages Using Linguistic and Keyword-based Features. ACM Trans. Asian Low Resour. Lang. Inf. Process. 19(2): 30:1-30:22 (2020)
- [j67] Debopriyo Banerjee, Krothapalli Sreenivasa Rao, Shamik Sural, Niloy Ganguly:
BOXREC: Recommending a Box of Preferred Outfits in Online Shopping. ACM Trans. Intell. Syst. Technol. 11(6): 69:1-69:28 (2020)
- [c78] Gurunath Reddy M., K. Sreenivasa Rao, Partha Pratim Das:
Glottal Closure Instants Detection from EGG Signal by Classification Approach. INTERSPEECH 2020: 4891-4895
- [c77] Kumud Tripathi, K. Sreenivasa Rao:
Multilingual speech mode classification model for Indian languages. NCC 2020: 1-6
- [i8] Soumava Paul, Gurunath Reddy M, K. Sreenivasa Rao, Partha Pratim Das:
Knowledge Distillation for Singing Voice Detection. CoRR abs/2011.04297 (2020)
2010 – 2019
- 2019
- [b5] K. Sreenivasa Rao, N. P. Narendra:
Source Modeling Techniques for Quality Enhancement in Statistical Parametric Speech Synthesis - Studies in Speech Signal Processing, Natural Language Understanding, and Machine Learning. Springer 2019, ISBN 978-3-030-02758-2, pp. 1-129
- [j66] R. Pradeep, Mittapalle Kiran Reddy, K. Sreenivasa Rao:
LSTM-Based Robust Voicing Decision Applied to DNN-Based Speech Synthesis. Autom. Control. Comput. Sci. 53(4): 328-332 (2019)
- [j65] R. Pradeep, K. Sreenivasa Rao:
Incorporation of Manner of Articulation Constraint in LSTM for Speech Recognition. Circuits Syst. Signal Process. 38(8): 3482-3500 (2019)
- [j64] K. E. Manjunath, Dinesh Babu Jayagopi, K. Sreenivasa Rao, V. Ramasubramanian:
Development and analysis of multilingual phone recognition systems using Indian languages. Int. J. Speech Technol. 22(1): 157-168 (2019)
- [j63] Yagnavajjula Madhu Keerthana, Mittapalle Kiran Reddy, K. Sreenivasa Rao:
CWT-Based Approach for Epoch Extraction From Telephone Quality Speech. IEEE Signal Process. Lett. 26(8): 1107-1111 (2019)
- [c76] Gurunath Reddy M., K. Sreenivasa Rao, Partha Pratim Das:
Glottal Closure Instants Detection from Speech Signal by Deep Features Extracted from Raw Speech and Linear Prediction Residual. INTERSPEECH 2019: 156-160
- [i7] Pradeep Rengaswamy, Gurunath Reddy M., Krothapalli Sreenivasa Rao:
hf0: A hybrid pitch extraction method for multimodal voice. CoRR abs/1904.09765 (2019)
- [i6] Kumud Tripathi, K. Sreenivasa Rao:
VOP Detection for Read and Conversation Speech using CWT Coefficients and Phone Boundaries. CoRR abs/1908.08668 (2019)
- [i5] Kumud Tripathi, Mittapalle Kiran Reddy, K. Sreenivasa Rao:
Multilingual and Multimode Phone Recognition System for Indian Languages. CoRR abs/1908.09634 (2019)
- [i4] Mittapalle Kiran Reddy, K. Sreenivasa Rao:
DNN-based cross-lingual voice conversion using Bottleneck Features. CoRR abs/1909.03974 (2019)
- 2018
- [j62] K. E. Manjunath, K. Sreenivasa Rao:
Improvement of Phone Recognition Accuracy Using Articulatory Features. Circuits Syst. Signal Process. 37(2): 704-728 (2018)
- [j61] Gurunath Reddy M., K. Sreenivasa Rao:
Predominant Melody Extraction from Vocal Polyphonic Music Signal by Time-Domain Adaptive Filtering-Based Method. Circuits Syst. Signal Process. 37(7): 2911-2933 (2018)
- [j60] Mittapalle Kiran Reddy, Krothapalli Sreenivasa Rao:
Inverse filter based excitation model for HMM-based speech synthesis system. IET Signal Process. 12(4): 544-548 (2018)
- [j59] Jainath Yadav, K. Sreenivasa Rao:
Neural network and GMM based feature mappings for consonant-vowel recognition in emotional environment. Int. J. Speech Technol. 21(3): 421-433 (2018)
- [j58] Kumud Tripathi, K. Sreenivasa Rao:
Improvement of phone recognition accuracy using speech mode classification. Int. J. Speech Technol. 21(3): 489-500 (2018)
- [j57] Arup Kumar Dutta, K. Sreenivasa Rao:
Language identification using phase information. Int. J. Speech Technol. 21(3): 509-519 (2018)
- [j56] Prasenjit Dhara, K. Sreenivasa Rao:
Automatic note transcription system for Hindustani classical music. Int. J. Speech Technol. 21(4): 987-1003 (2018)
- [j55] Kishore Kumar Ravi, Lokendra Birla, K. Sreenivasa Rao:
A robust unsupervised pattern discovery and clustering of speech signals. Pattern Recognit. Lett. 116: 254-261 (2018)
- [j54] Jainath Yadav, Md. S. Fahad, K. Sreenivasa Rao:
Epoch detection from emotional speech signal using zero time windowing. Speech Commun. 96: 142-149 (2018)
- [c75] Kishore Kumar Ravi, Sandipan Sarkar, Pradeep Rengaswamy, K. Sreenivasa Rao:
Audio Mining: Unsupervised Spoken Term Detection over an Audio Database. ICACCI 2018: 514-518
- [c74] Kumud Tripathi, K. Sreenivasa Rao:
Discriminative sparse representation for speech mode classification. ICACCI 2018: 655-659
- [c73] Mittapalle Kiran Reddy, K. Sreenivasa Rao:
DNN-based Bilingual (Telugu-Hindi) Polyglot Speech Synthesis. ICACCI 2018: 1808-1811
- [c72] Tanumay Mandal, K. Sreenivasa Rao:
Robust Detection of Glottal Activity Using Unwrapped Phase Electroglottographic Signal. ICASSP 2018: 5584-9
- [c71] Pradeep Rengaswamy, K. Sreenivasa Rao:
Modifying LSTM Posteriors with Manner of Articulation Knowledge to Improve Speech Recognition Performance. ICMLA 2018: 769-772
- [c70] Kumud Tripathi, K. Sreenivasa Rao:
Analysis of sparse representation based feature on speech mode classification. INTERSPEECH 2018: 731-735
- [c69] Gurunath Reddy M., K. Sreenivasa Rao, Partha Pratim Das:
Harmonic-Percussive Source Separation of Polyphonic Music by Suppressing Impulsive Noise Events. INTERSPEECH 2018: 831-835
- [c68] K. E. Manjunath, K. Sreenivasa Rao, Dinesh Babu Jayagopi, V. Ramasubramanian:
Indian Languages ASR: A Multilingual Phone Recognition Framework with IPA Based Common Phone-set, Predicted Articulatory Features and Feature fusion. INTERSPEECH 2018: 1016-1020
- [c67] Tanumay Mandal, K. Sreenivasa Rao, Sanjay Kumar Gupta:
Classification of Disorders in Vocal Folds Using Electroglottographic Signal. INTERSPEECH 2018: 3002-3006
- [c66] Debopriyo Banerjee, Niloy Ganguly, Shamik Sural, Krothapalli Sreenivasa Rao:
One for the Road: Recommending Male Street Attire. PAKDD (3) 2018: 571-582
- [i3] R. Pradeep, K. Sreenivasa Rao:
Manner of Articulation Detection using Connectionist Temporal Classification to Improve Automatic Speech Recognition Performance. CoRR abs/1811.01644 (2018)
- [i2] Pradeep Rangan, K. Sreenivasa Rao:
Beam Search Decoding using Manner of Articulation Detection Knowledge Derived from Connectionist Temporal Classification. CoRR abs/1811.07720 (2018)
- [i1] Gurunath Reddy M., Tanumay Mandal, Krothapalli Sreenivasa Rao:
Glottal Closure Instants Detection From Pathological Acoustic Speech Signal Using Deep Learning. CoRR abs/1811.09956 (2018)
- 2017
- [j53] S. B. Sunil Kumar, Tanumay Mandal, K. Sreenivasa Rao:
Robust glottal activity detection using the phase of an electroglottographic signal. Biomed. Signal Process. Control. 36: 27-38 (2017)
- [j52] Dipanjan Nandi, Debadatta Pati, K. Sreenivasa Rao:
Implicit processing of LP residual for language identification. Comput. Speech Lang. 41: 68-87 (2017)
- [j51] Dipanjan Nandi, Debadatta Pati, K. Sreenivasa Rao:
Parametric representation of excitation source information for language identification. Comput. Speech Lang. 41: 88-115 (2017)
- [j50] N. P. Narendra, K. Sreenivasa Rao:
Generation of creaky voice for improving the quality of HMM-based speech synthesis. Comput. Speech Lang. 42: 38-58 (2017)
- [j49] N. P. Narendra, K. Sreenivasa Rao:
Parameterization of Excitation Signal for Improving the Quality of HMM-Based Speech Synthesis System. Circuits Syst. Signal Process. 36(9): 3650-3673 (2017)
- [j48] Arijul Haque, Krothapalli Sreenivasa Rao:
Modification of energy spectra, epoch parameters and prosody for emotion conversion in speech. Int. J. Speech Technol. 20(1): 15-25 (2017)
- [j47] Sourjya Sarkar, K. Sreenivasa Rao:
Supervector-based approaches in a discriminative framework for speaker verification in noisy environments. Int. J. Speech Technol. 20(2): 387-416 (2017)
- [j46] Mittapalle Kiran Reddy, K. Sreenivasa Rao:
Robust Pitch Extraction Method for the HMM-Based Speech Synthesis System. IEEE Signal Process. Lett. 24(8): 1133-1137 (2017)
- [c65] S. B. Sunil Kumar, K. Sreenivasa Rao, Tanumay Mandal:
Accurate Synchronization of Speech and EGG Signal Using Phase Information. INTERSPEECH 2017: 694-698
- 2016
- [j45] Jainath Yadav, K. Sreenivasa Rao:
Prosodic Mapping Using Neural Networks for Emotion Conversion in Hindi Language. Circuits Syst. Signal Process. 35(1): 139-162 (2016)
- [j44] V. Ramu Reddy, K. Sreenivasa Rao:
Prosody modeling for syllable based text-to-speech synthesis using feedforward neural networks. Neurocomputing 171: 1323-1334 (2016)
- [j43] K. Manjunath, K. Sreenivasa Rao:
Articulatory and excitation source features for speech recognition in read, extempore and conversation modes. Int. J. Speech Technol. 19(1): 121-134 (2016)
- [j42] N. P. Narendra, K. Sreenivasa Rao:
Time-domain deterministic plus noise model based hybrid source modeling for statistical parametric speech synthesis. Speech Commun. 77: 65-83 (2016)
- [j41] S. B. Sunil Kumar, K. Sreenivasa Rao:
Voice/non-voice detection using phase of zero frequency filtered speech signal. Speech Commun. 81: 90-103 (2016)
- [c64] R. Pradeep, K. Sreenivasa Rao:
Deep neural networks for kannada phoneme recognition. IC3 2016: 1-6
- [c63] Prasenjit Dhara, Pradeep Rengaswamy, K. Sreenivasa Rao:
Designing automatic note transcription system for Hindustani classical music. ICACCI 2016: 899-903
- [c62] Manish Kumar Rai, Neetish, Md. S. Fahad, Jainath Yadav, K. Sreenivasa Rao:
Language identification using PLDA based on i-vector in noisy environment. ICACCI 2016: 1014-1020
- [c61] Gurunath Reddy M., K. Sreenivasa Rao:
Predominant melody extraction from vocal polyphonic music signal by combined spectro-temporal method. ICASSP 2016: 455-459
- [c60] N. P. Narendra, K. Sreenivasa Rao:
A deterministic plus noise model of excitation signal using principal component analysis for parametric speech synthesis. ICASSP 2016: 5635-5639
- [c59] Kumud Tripathi, Parakrant Sarkar, K. Sreenivasa Rao:
Sentence Based Discourse Classification for Hindi Story Text-to-Speech (TTS) System. ICON 2016: 46-54
- [c58] Pradeep Rengaswamy, Gurunath Reddy M., K. Sreenivasa Rao, Pallab Dasgupta:
A Robust Non-Parametric and Filtering Based Approach for Glottal Closure Instant Detection. INTERSPEECH 2016: 1795-1799
- [c57] Gurunath Reddy M., K. Sreenivasa Rao:
Enhanced Harmonic Content and Vocal Note Based Predominant Melody Extraction from Vocal Polyphonic Music Signals. INTERSPEECH 2016: 3309-3313
- 2015
- [j40] N. P. Narendra, K. Sreenivasa Rao:
Robust Voicing Detection and \(F_{0}\) Estimation for HMM-Based Speech Synthesis. Circuits Syst. Signal Process. 34(8): 2597-2619 (2015)
- [j39] K. Manjunath, K. Sreenivasa Rao:
Source and system features for phone recognition. Int. J. Speech Technol. 18(2): 257-270 (2015)
- [j38] Dipanjan Nandi, Debadatta Pati, K. Sreenivasa Rao:
Implicit excitation source features for robust language identification. Int. J. Speech Technol. 18(3): 459-477 (2015)
- [j37] K. Sreenivasa Rao, Shashidhar G. Koolagudi:
Recognition of emotions from video using acoustic and facial features. Signal Image Video Process. 9(5): 1029-1045 (2015)
- [c56] Kishore Prahallad, Anandaswarup Vadapalli, Sai Krishna Rallabandi, Santosh Kesiraju, Hema A. Murthy, T. Nagarajan, Bira Chandra Singh, T. Sajani, K. Sreenivasa Rao, Suryakanth V. Gangashetty, Simon King, Keiichi Tokuda, Alan W. Black:
The Blizzard Challenge 2015. Blizzard Challenge 2015
- [c55] Randheer Bagi, Jainath Yadav, K. Sreenivasa Rao:
Improved recognition rate of language identification system in noisy environment. IC3 2015: 214-219
- [c54] Harikrishna D. M., Gurunath Reddy M., K. Sreenivasa Rao:
Multi-stage children story speech synthesis for Hindi. IC3 2015: 220-224
- [c53] Parakrant Sarkar, K. Sreenivasa Rao:
Analysis and modeling pauses for synthesis of storytelling speech based on discourse modes. IC3 2015: 225-230
- [c52] Arup Kumar Dutta, K. Sreenivasa Rao:
Robust language identification using Power Normalized Cepstral Coefficients. IC3 2015: 253-256
- [c51] Arijul Haque, K. Sreenivasa Rao:
Analysis and modification of spectral energy for neutral to sad emotion conversion. IC3 2015: 263-268
- [c50] Gurunath Reddy M., K. Sreenivasa Rao:
Neutral to happy emotion conversion by blending prosody and laughter. IC3 2015: 342-347
- [c49] Harikrishna D. M., K. Sreenivasa Rao:
Children story classification based on structure of the story. ICACCI 2015: 1485-1490
- [c48] R. Pradeep, Prasenjit Dhara, K. Sreenivasa Rao, Pallab Dasgupta:
Raga identification based on Normalized Note Histogram features. ICACCI 2015: 1491-1496
- [c47] Shashidhar G. Koolagudi, Shivakranthi B, K. Sreenivasa Rao, Pravin B. Ramteke:
Contribution of Telugu vowels in identifying emotions. ICAPR 2015: 1-6
- [c46] N. P. Narendra, K. Sreenivasa Rao:
Optimal residual frame based source modeling for HMM-based speech synthesis. ICAPR 2015: 1-5
- [c45] Rashmi Verma, Parakrant Sarkar, K. Sreenivasa Rao:
Conversion of neutral speech to storytelling style speech. ICAPR 2015: 1-6
- [c44] N. P. Narendra, K. Sreenivasa Rao:
Automatic detection of creaky voice using epoch parameters. INTERSPEECH 2015: 2347-2351
- [c43] N. P. Narendra, K. Sreenivasa Rao:
Hybrid Source Modeling Method Utilizing Optimal Residual Frames for HMM-based Speech Synthesis. MIKE 2015: 277-286
- [c42] Parakrant Sarkar, K. Sreenivasa Rao:
Data-driven pause prediction for speech synthesis in storytelling style speech. NCC 2015: 1-5
- 2014
- [b4] K. Sreenivasa Rao, Anil Kumar Vuppala:
Speech Processing in Mobile Environments. Springer Briefs in Electrical and Computer Engineering, Springer 2014, ISBN 978-3-319-03115-6, pp. i-xii, 1-121
- [j36] Sourjya Sarkar, K. Sreenivasa Rao:
Stochastic feature compensation methods for speaker verification in noisy environments. Appl. Soft Comput. 19: 198-214 (2014)
- [j35] K. Sreenivasa Rao, Dipanjan Nandi, Shashidhar G. Koolagudi:
Film segmentation and indexing using autoassociative neural networks. Int. J. Speech Technol. 17(1): 65-74 (2014)
- [j34] K. Sreenivasa Rao, Ketan Pachpande:
Segmentation, indexing and retrieval of TV broadcast news bulletins using Gaussian mixture models and vector quantization codebooks. Int. J. Speech Technol. 17(3): 259-269 (2014)
- [c41] Parakrant Sarkar, Arijul Haque, Arup Kumar Dutta, Gurunath Reddy M., Harikrishna D. M., Prasenjit Dhara, Rashmi Verma, N. P. Narendra, Sunil Kr. S. B., Jainath Yadav, K. Sreenivasa Rao:
Designing prosody rule-set for converting neutral TTS speech to storytelling style speech for Indian languages: Bengali, Hindi and Telugu. IC3 2014: 473-477
- [c40] Dipanjan Nandi, Arup Kumar Dutta, K. Sreenivasa Rao:
Significance of CV transition and steady vowel regions for language identification. IC3 2014: 513-517
- [c39] V. Ramu Reddy, Parakrant Sarkar, K. Sreenivasa Rao:
Duration Modeling by Multi-Models based on Vowel Production characteristics. ICON 2014: 39-47
- [c38] Sourjya Sarkar, K. Sreenivasa Rao:
A novel boosting algorithm for improved i-vector based speaker verification in noisy environments. INTERSPEECH 2014: 671-675
- [c37] K. E. Manjunath, K. Sreenivasa Rao:
Automatic Phonetic Transcription for read, extempore and conversation speech for an Indian language: Bengali. NCC 2014: 1-6
- 2013
- [b3] K. Sreenivasa Rao, Shashidhar G. Koolagudi:
Robust Emotion Recognition using Spectral and Prosodic Features. Springer Briefs in Electrical and Computer Engineering, Springer 2013, ISBN 978-1-4614-6359-7, pp. i-xii, 1-118
- [b2] K. Sreenivasa Rao, Shashidhar G. Koolagudi:
Emotion Recognition using Speech Features. Springer Briefs in Electrical and Computer Engineering, Springer 2013, ISBN 978-1-4614-5142-6, pp. i-xii, 1-124
- [j33] N. P. Narendra, K. Sreenivasa Rao:
Optimal weight tuning method for unit selection cost functions in syllable based text-to-speech synthesis. Appl. Soft Comput. 13(2): 773-781 (2013)
- [j32] V. Ramu Reddy, K. Sreenivasa Rao:
Two-stage intonation modeling using feedforward neural networks for syllable based text-to-speech synthesis. Comput. Speech Lang. 27(5): 1105-1126 (2013)
- [j31] K. Sreenivasa Rao, Shashidhar G. Koolagudi, Vempada Ramu Reddy:
Emotion recognition from speech using global and local prosodic features. Int. J. Speech Technol. 16(2): 143-160 (2013)
- [j30] Krothapalli Sreenivasa Rao, Shashidhar G. Koolagudi:
Characterization and recognition of emotions from speech using excitation source information. Int. J. Speech Technol. 16(2): 181-201 (2013)
- [j29] Anil Kumar Vuppala, K. Sreenivasa Rao:
Vowel onset point detection for noisy speech using spectral energy at formant frequencies. Int. J. Speech Technol. 16(2): 229-235 (2013)
- [j28] K. Sreenivasa Rao, Sudhamay Maity, V. Ramu Reddy:
Pitch synchronous and glottal closure based speech analysis for language recognition. Int. J. Speech Technol. 16(4): 413-430 (2013)
- [j27] V. Ramu Reddy, Sudhamay Maity, K. Sreenivasa Rao:
Identification of Indian languages using multi-level spectral and prosodic features. Int. J. Speech Technol. 16(4): 489-511 (2013)
- [j26] Avinash Kumar Singh, Jayanta Mukhopadhyay, K. Sreenivasa Rao, Kapinaiah Viswanath:
Classification of Infant Cries Using Dynamics of Epoch Features. J. Intell. Syst. 22(3): 351-364 (2013)
- [j25] K. Sreenivasa Rao, Anil Kumar Vuppala:
Non-uniform time scale modification using instants of significant excitation and vowel onset points. Speech Commun. 55(6): 745-756 (2013)
- [j24] Jainath Yadav, K. Sreenivasa Rao:
Detection of Vowel Offset Point From Speech Signal. IEEE Signal Process. Lett. 20(4): 299-302 (2013)
- [c36] Jainath Yadav, K. Sreenivasa Rao:
Analysis of detection of vowel offset point for coded speech. IC3 2013: 485-490
- [c35] V. Ramu Reddy, K. Sreenivasa Rao:
High quality text-to-speech synthesis system with efficient duration models developed using coding schemes based on vowel production characteristics. ISDA 2013: 7-12
- [c34] Nirmalya Sen, Hemant A. Patil, Shyamal Kr. Das Mandal, K. Sreenivasa Rao:
Importance of Utterance Partitioning in SVM Classifier with GMM Supervectors for Text-Independent Speaker Verification. MIKE 2013: 780-789
- [c33] S. B. Sunil Kumar, K. Sreenivasa Rao, Debadatta Pati:
Phonetic and Prosodically Rich Transcribed speech corpus in Indian languages: Bengali and Odia. O-COCOSDA/CASLRE 2013: 1-5
- [c32] K. E. Manjunath, K. Sreenivasa Rao, Debadatta Pati:
Development of phonetic engine for Indian languages: Bengali and Oriya. O-COCOSDA/CASLRE 2013: 1-6
- [c31] Dipanjan Nandi, Debadatta Pati, K. Sreenivasa Rao:
Language identification using Hilbert envelope and phase information of linear prediction residual. O-COCOSDA/CASLRE 2013: 1-6
- [c30] Hemant A. Patil, Tanvina B. Patel, Nirmesh J. Shah, Hardik B. Sailor, Raghava Krishnan, G. R. Kasthuri, T. Nagarajan, S. Lilly Christina, Naresh Kumar, Veera Raghavendra, S. Prahallad Kishore, S. R. Mahadeva Prasanna, Nagaraj Adiga, Sanasam Ranbir Singh, Anand Konjengbam, Pranaw Kumar, Bira Chandra Singh, S. L. Binil Kumar, T. G. Bhadran, T. Sajini, Arup Saha, Tulika Basu, K. Sreenivasa Rao, N. P. Narendra, Anil Kumar Sao, Rakesh Kumar, Pranhari Talukdar, Purnendu Acharyaa, Somnath Chandra, Swaran Lata, Hema A. Murthy:
A syllable-based framework for unit selection synthesis in 13 Indian languages. O-COCOSDA/CASLRE 2013: 1-8
- [c29] Sourjya Sarkar, K. Sreenivasa Rao:
Significance of utterance partitioning in GMM-SVM based speaker verification in varying background environment. O-COCOSDA/CASLRE 2013: 1-5
- [c28] Ravi Kalyan Bhakat, N. P. Narendra, Krothapalli Sreenivasa Rao:
Corpus Based Emotional Speech Synthesis in Hindi. PReMI 2013: 390-395
- [c27] Vempada Ramu Reddy, Krothapalli Sreenivasa Rao:
Duration Modeling Using Multi-model Based on Positional Information. PReMI 2013: 404-409
- 2012
- [b1] K. Sreenivasa Rao:
Predicting Prosody from Text for Text-to-Speech Synthesis. Springer Briefs in Electrical and Computer Engineering, Springer 2012, ISBN 978-1-4614-1337-0, pp. i-xii, 1-130
- [j23] Rabul Hussain Laskar, D. Chakrabarty, Fazal Ahmed Talukdar, K. Sreenivasa Rao, Kalyan Banerjee:
Comparing ANN and GMM in a voice conversion framework. Appl. Soft Comput. 12(11): 3332-3342 (2012)
- [j22] Anil Kumar Vuppala, K. Sreenivasa Rao, Saswat Chakrabarti:
Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points. Circuits Syst. Signal Process. 31(4): 1459-1474 (2012)
- [j21] Krothapalli Sreenivasa Rao:
Unconstrained Pitch Contour Modification Using Instants of Significant Excitation. Circuits Syst. Signal Process. 31(6): 2133-2152 (2012)
- [j20] Shashidhar G. Koolagudi, K. Sreenivasa Rao:
Emotion recognition from speech: a review. Int. J. Speech Technol. 15(2): 99-117 (2012)
- [j19] Shashidhar G. Koolagudi, K. Sreenivasa Rao:
Emotion recognition from speech using source, system, and prosodic features. Int. J. Speech Technol. 15(2): 265-289 (2012)
- [j18] Krothapalli Sreenivasa Rao, Jaynath Yadav, Sourjya Sarkar, Shashidhar G. Koolagudi, Anil Kumar Vuppala:
Neural network based feature transformation for emotion independent speaker identification. Int. J. Speech Technol. 15(3): 335-349 (2012)
- [j17] Rabul Hussain Laskar, Kalyan Banerjee, Fazal Ahmed Talukdar, K. Sreenivasa Rao:
A pitch synchronous approach to design voice conversion system using source-filter correlation. Int. J. Speech Technol. 15(3): 419-431 (2012)
- [j16] Shashidhar G. Koolagudi, Krothapalli Sreenivasa Rao:
Emotion recognition from speech using sub-syllabic and pitch synchronous spectral features. Int. J. Speech Technol. 15(4): 495-511 (2012)
- [j15] Anil Kumar Vuppala, Jainath Yadav, Saswat Chakrabarti, K. Sreenivasa Rao:
Vowel Onset Point Detection for Low Bit Rate Coded Speech. IEEE Trans. Speech Audio Process. 20(6): 1894-1903 (2012)
- [j14] N. P. Narendra, K. Sreenivasa Rao:
Syllable Specific Unit Selection Cost Functions for Text-to-Speech Synthesis. ACM Trans. Speech Lang. Process. 9(3): 5:1-5:24 (2012)
- [c26] Shashidhar G. Koolagudi, Shan E. Fatima, K. Sreenivasa Rao:
Speaker recognition in the case of emotional environment using transformation of speech features. CUBE 2012: 118-123
- [c25] Santosh Kumar Bharti, Shashidhar G. Koolagudi, K. Sreenivasa Rao, Ankur Choudhary, Binod Kumar:
Voice conversion using linear prediction coefficients and artificial neural network. CUBE 2012: 240-245
- [c24] V. Ramu Reddy, K. Sreenivasa Rao:
Intensity Modeling for Syllable Based Text-to-Speech Synthesis. IC3 2012: 106-117
- [c23] Krishnendu Ghosh, K. Sreenivasa Rao:
Data-Driven Phrase Break Prediction for Bengali Text-to-Speech System. IC3 2012: 118-129
- [c22] Shashidhar G. Koolagudi, Anurag Barthwal, Swati Devliyal, K. Sreenivasa Rao:
Real Life Emotion Classification from Speech Using Gaussian Mixture Models. IC3 2012: 250-261
- [c21] Shashidhar G. Koolagudi, Swati Devliyal, Anurag Barthwal, K. Sreenivasa Rao:
Emotion Recognition from Semi Natural Speech Using Artificial Neural Networks and Excitation Source Features. IC3 2012: 273-282
- [c20] Shashidhar G. Koolagudi, Deepika Rastogi, K. Sreenivasa Rao:
Spoken Language Identification Using Spectral Features. IC3 2012: 496-497
- [c19] V. Ramu Reddy, K. Sreenivasa Rao:
Better human computer interaction by enhancing the quality of text-to-speech synthesis. IHCI 2012: 1-6
- 2011
- [j13]K. Sreenivasa Rao, V. K. Saroj, Sudhamay Maity, Shashidhar G. Koolagudi:
Recognition of emotions from video using neural network models. Expert Syst. Appl. 38(10): 13181-13185 (2011) - [j12]K. Sreenivasa Rao:
Application of prosody models for developing speech systems in Indian languages. Int. J. Speech Technol. 14(1): 19-33 (2011) - [j11]Shashidhar G. Koolagudi, Rao Sreenivasa Krothapalli:
Two stage emotion recognition based on speaking rate. Int. J. Speech Technol. 14(1): 35-48 (2011) - [j10]N. P. Narendra, K. Sreenivasa Rao, Krishnendu Ghosh, Vempada Ramu Reddy, Sudhamay Maity:
Development of syllable-based text to speech synthesis system in Bengali. Int. J. Speech Technol. 14(3): 167 (2011) - [j9]Anil Kumar Vuppala, K. Sreenivasa Rao, Saswat Chakrabarti, P. Krishnamoorthy, S. R. M. Prasanna:
Recognition of consonant-vowel (CV) units under background noise using combined temporal and spectral preprocessing. Int. J. Speech Technol. 14(3): 259 (2011) - [c18]Anil Kumar Vuppala, K. Sreenivasa Rao, Saswat Chakrabarti:
Effect of Noise on Recognition of Consonant-Vowel (CV) Units. IC3 2011: 191-200 - [c17]Anil Kumar Vuppala, Jainath Yadav, K. Sreenivasa Rao, Saswat Chakrabarti:
Effect of Noise on Vowel Onset Point Detection. IC3 2011: 201-211 - [c16]Rahul Chauhan, Jainath Yadav, Shashidhar G. Koolagudi, K. Sreenivasa Rao:
Text Independent Emotion Recognition Using Spectral Features. IC3 2011: 359-370 - [c15]N. P. Narendra, K. Sreenivasa Rao:
Segment Specific Concatenation Cost for Syllable Based Bengali TTS. IC3 2011: 371-382 - 2010
- [j8]K. Sreenivasa Rao:
Voice conversion by mapping the speaker-specific features using pitch synchronous approach. Comput. Speech Lang. 24(3): 474-494 (2010) - [j7]Krothapalli S. Rao, Shashidhar G. Koolagudi:
Selection of Suitable Features for Modeling the Durations of Syllables. J. Softw. Eng. Appl. 3(12): 1107-1117 (2010) - [j6]Krothapalli Sreenivasa Rao:
Real Time Prosody Modification. J. Signal Inf. Process. 1(1): 50-62 (2010) - [c14]Anil Kumar Vuppala, Saswat Chakrabarti, K. Sreenivasa Rao:
Effect of Speech Coding on Recognition of Consonant-Vowel (CV) Units. IC3 (1) 2010: 284-294 - [c13]Shashidhar G. Koolagudi, Sudhin Ray, K. Sreenivasa Rao:
Emotion Classification Based on Speaking Rate. IC3 (1) 2010: 316-327
2000 – 2009
- 2009
- [j5]K. Sreenivasa Rao, B. Yegnanarayana:
Intonation modeling for Indian languages. Comput. Speech Lang. 23(2): 240-256 (2009) - [j4]K. Sreenivasa Rao, B. Yegnanarayana:
Duration modification using glottal closure instants and vowel onset points. Speech Commun. 51(12): 1263-1269 (2009) - [c12]Shashidhar G. Koolagudi, Sudhamay Maity, Anil Kumar Vuppala, Saswat Chakrabarti, K. Sreenivasa Rao:
IITKGP-SESC: Speech Database for Emotion Analysis. IC3 2009: 485-492 - [c11]K. Sreenivasa Rao, S. R. Mahadeva Prasanna, T. V. Sagar:
Significance of Word and Syllable Level Information for Expressive Speech Processing. ICAPR 2009: 159-162 - [c10]K. Sreenivasa Rao, Sudhamay Maity, Amol Taru, Shashidhar G. Koolagudi:
Unit Selection Using Linguistic, Prosodic and Spectral Distance for Developing Text-to-Speech System in Hindi. PReMI 2009: 531-536 - [c9]Shashidhar G. Koolagudi, K. Sreenivasa Rao:
Exploring Speech Features for Classifying Emotions along Valence Dimension. PReMI 2009: 537-542 - 2008
- [p1]K. Sreenivasa Rao:
Modeling Supra-Segmental Features of Syllables Using Neural Networks. Speech, Audio, Image and Biomedical Signal Processing using Neural Networks 2008: 71-95 - 2007
- [j3]K. Sreenivasa Rao, B. Yegnanarayana:
Modeling durations of syllables using neural networks. Comput. Speech Lang. 21(2): 282-295 (2007) - [j2]K. Sreenivasa Rao, S. R. Mahadeva Prasanna, Bayya Yegnanarayana:
Determination of Instants of Significant Excitation in Speech Using Hilbert Envelope and Group Delay Function. IEEE Signal Process. Lett. 14(10): 762-765 (2007) - [c8]K. Sreenivasa Rao, Rabul Hussain Laskar, Shashidhar G. Koolagudi:
Voice Transformation by Mapping the Features at Syllable Level. PReMI 2007: 479-486 - 2006
- [j1]K. Sreenivasa Rao, B. Yegnanarayana:
Prosody modification using instants of significant excitation. IEEE Trans. Speech Audio Process. 14(3): 972-980 (2006) - [c7]K. Sreenivasa Rao, B. Yegnanarayana:
Voice Conversion by Prosody and Vocal Tract Modification. ICIT 2006: 111-116 - 2004
- [c6]K. Sreenivasa Rao, B. Yegnanarayana:
Modeling syllable duration in Indian languages using neural networks. ICASSP (5) 2004: 313-316 - [c5]K. Sreenivasa Rao, S. R. Mahadeva Prasanna, B. Yegnanarayana:
Two-Stage Duration Model for Indian Languages Using Neural Networks. ICONIP 2004: 1179-1185 - [c4]Krothapalli Sreenivasa Rao, Bayya Yegnanarayana:
Intonation modeling for Indian languages. INTERSPEECH 2004: 733-736 - 2003
- [c3]K. Sreenivasa Rao, B. Yegnanarayana:
Prosodic manipulation using instants of significant excitation. ICASSP (1) 2003: 528-531 - [c2]K. Sreenivasa Rao, B. Yegnanarayana:
Prosodic manipulation using instants of significant excitation. ICME 2003: 389-392 - 2002
- [c1]B. Yegnanarayana, S. R. Mahadeva Prasanna, K. Sreenivasa Rao:
Speech enhancement using excitation source information. ICASSP 2002: 541-544
last updated on 2024-10-07 21:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license