


default search action
International Journal of Speech Technology, Volume 25
Volume 25, Number 1, March 2022
- Wided Bakari

, Mahmoud Neji:
A novel semantic and logical-based approach integrating RTE technique in the Arabic question-answering. 1-17 - Mohamed Morchid

:
Bidirectional internal memory gate recurrent neural networks for spoken language understanding. 19-27 - Rim Laatar

, Chafik Aloulou, Lamia Hadrich Belguith
:
Towards a historical dictionary for Arabic language. 29-41 - Eiman Alsharhan

, Allan Ramsay, Hanady Ahmed:
Evaluating the effect of using different transcription schemes in building a speech recognition system for Arabic. 43-56 - Ricky Mohanty, Bandi Kumar Mallik, Sandeep Singh Solanki:

Normalized approximate descent used for spike based automatic bird species recognition system. 57-65 - Ankit Kumar

, Rajesh Kumar Aggarwal:
Hindi speech recognition using time delay neural network acoustic modeling with i-vector adaptation. 67-78 - Fady K. Fahmy

, Hazem M. Abbas, Mahmoud I. Khalil
:
Boosting subjective quality of Arabic text-to-speech (TTS) using end-to-end deep architecture. 79-88 - Abir Masmoudi, Chafik Aloulou, Abdel Ghader Sidi Abdellahi, Lamia Hadrich Belguith

:
Automatic diacritization of Tunisian dialect text using SMT model. 89-104 - Aakshi Mittal, Mohit Dua

:
Automatic speaker verification systems and spoof detection techniques: review and analysis. 105-134 - Song-Il Mun, Chol-Jin Han, Hye-Song Hong:

Exploiting variable length segments with coarticulation effect in online speech recognition based on deep bidirectional recurrent neural network and context-sensitive segment. 135-146 - Sreehari Vanajavilas Ravindran

, Leena Mary:
Automatic short utterance speaker recognition using stationary wavelet coefficients of pitch synchronised LP residual. 147-161 - Achraf Benba

, Imane Laaqira, Abdelilah Jilbab
, Ahmed Hammouch:
Using novel method: Real Cepstral Discrete Cosine Transform, for detecting Parkinson from multiple system atrophy, other neurological diseases and healthy cases using voice analysis. 163-172 - Bidhan Barai

, Tapas Chakraborty
, Nibaran Das, Subhadip Basu
, Mita Nasipuri:
Closed-set speaker identification using VQ and GMM based models. 173-196 - Pavan Raju Kammili

, B. H. V. S. Ramakrishnam Raju, A. Sri Krishna:
Handling emotional speech: a prosody based data augmentation technique for improving neutral speech trained ASR systems. 197-204 - Hyok-Chol Ri, Chol Kim, Mok-Ran Jo:

A method for constructing Korean spontaneous spoken language corpus based on an imitation of abbreviated and transformed particles. 205-210 - Lee-Chung Kwek, Alan Wee-Chiat Tan, Heng-Siong Lim, Cheah Heng Tan, Khaled A. Alaghbari

:
Sparse representation and reproduction of speech signals in complex Fourier basis. 211-217 - Nikunj Tahilramani

, Ninad Bhatt:
Information hiding in proposed 10.6 kbps CS-ACELP based speech codec using Quantization Index Modulation. 219-230 - Gonzalo D. Sad, Lucas D. Terissi, Juan Carlos Gómez:

Complementary models for audio-visual speech classification. 231-249 - Tiantian Tang, Yanhua Long

, Yijie Li, Jiaen Liang:
Acoustic domain mismatch compensation in bird audio detection. 251-260 - Jiangyu Han

, Yan Shi, Yanhua Long
, Jiaen Liang:
Exploring single channel speech separation for short-time text-dependent speaker verification. 261-268 - Lallouani Bouchakour

, Mohamed Debyeche:
Noise-robust speech recognition in mobile network based on convolution neural networks. 269-277 - Khaled M. Abdelwahab

, Saied M. Abd El-atty, Ayman M. Brisha, Fathi E. Abd El-Samie:
Efficient cancelable speaker identification system based on a hybrid structure of DWT and SVD. 279-288 - Emilia Parada-Cabaleiro

, Anton Batliner, Alice Baird, Björn W. Schuller
:
Correction to: The perception of emotional cues by children in artificial background noise. 289
Volume 25, Number 2, June 2022
- (Withdrawn) Big Data Analytics integrated AAC Framework for English language teaching. 291-304

- Yubin Liu, C. B. Sivaparthipan

, Achyut Shankar:
Human-computer interaction based visual feedback system for augmentative and alternative communication. 305-314 - Ping Zhang, K. Deepa Thilak

, Renjith V. Ravi
:
Big data analytics and augmentative and alternative communication in EFL teaching. 315-329 - Wei Li, Xiaoli Qiu, Yang Li, Jing Ji, Xinxin Liu, Shuanzhu Li:

Towards a novel machine learning approach to support augmentative and alternative communication (AAC). 331-341 - Min Wang, BalaAnand Muthu

, C. B. Sivaparthipan
:
Smart assistance to dyslexia students using artificial intelligence based augmentative alternative communication. 343-353 - Xiang Lan, Zhongwang Cao, Le Yu:

Analyzing the mental states of the sports student based on augmentative communication with human-computer interaction. 355-365 - Wenjuan Hu, Premalatha R, R. S. Aiswarya:

Physical education system and training framework based on human-computer interaction for augmentative and alternative communication. 367-377 - (Withdrawn) Computer vision for facial analysis using human-computer interaction models. 379-389

- Man Liu:

English speech emotion recognition method based on speech recognition. 391-398 - Shanshan Yang, Ding Liu:

Automatic annotation method of VR speech corpus based on artificial intelligence. 399-407 - Ran Qian, Sudhakar Sengan, Sapna Juneja:

English language teaching based on big data analytics in augmentative and alternative communication system. 409-420 - Yanmei Huang, Qiang Mei, Mulan Hu, Thanjai Vadivel

, A. Daison Raj:
A voice-assisted intelligent software architecture based on deep game network. 421-433 - K. Meenakshi

, G. Maragatham:
AdvIris: a hybrid approach to detecting adversarial iris examples using wavelet transform. 435-441 - Khaled Lounnas

, Mourad Abbas
, Mohamed Lichouri
, Mohamed Hamidi
, Hassan Satori
, Hocine Teffahi:
Enhancement of spoken digits recognition for under-resourced languages: case of Algerian and Moroccan dialects. 443-455 - Lakshmi Srinivas Dendukuri

, Jakeer Hussain Shaik:
Emotional speech analysis and classification using variational mode decomposition. 457-469 - Mohamed Monir, Mona Kareem, Sami M. El-Dolil, Adel A. Saleeb, Adel S. El-Fishawy

, Mohamed Abd-Elsalam Nassar, Mohamed A. Zein Eldin, Fathi E. Abd El-Samie:
Cancelable speaker identification based on cepstral coefficients and comb filters. 471-492 - Hao Wu, Linkai Luo

, Hong Peng, Wei Wen:
A method of multi-models fusion for speaker recognition. 493-498 - Gautam Chakraborty

, Mridusmita Sharma, Navajit Saikia, Kandarpa Kumar Sarma:
Soft-computation based speech recognition system for Sylheti language. 499-509 - Pradeep Tiwari, Anand D. Darji:

Pertinent feature selection techniques for automatic emotion recognition in stressed speech. 511-526 - Girish Gidaye, Jagannath H. Nirmal, Kadria Ezzine, Mondher Frikha:

Unified wavelet-based framework for evaluation of voice impairment. 527-548 - Girish Gidaye, Jagannath H. Nirmal, Kadria Ezzine, Mondher Frikha:

Correction to: Unified wavelet-based framework for evaluation of voice impairment. 549
Volume 25, Number 3, September 2022
- D. Bhavana

, K. Kishore Kumar
, D. Ravi Tej:
Infrared and visible image fusion using latent low rank technique for surveillance applications. 551-560 - Basavoju Harish

, Mulpuri Santhi Sri Rukmini, Kosaraju Sivani
:
Design of MAC unit for digital filters in signal processing and communication. 561-565 - P. Ramakrishna, K. Hari Kishore

:
A low power reconfigurable ADC for bioimpedance monitroing system. 567-574 - (Withdrawn) Audio fingerprint analysis for speech processing using deep learning method. 575-581

- Rohit Lamba

, Tarun Gulati
, Hadeel Fahad Alharbi, Anurag Jain
:
A hybrid system for Parkinson's disease diagnosis using machine learning techniques. 583-593 - A. Vijayarani

, G. G. Lakshmi Priya:
Salient object detection based on adaptive recalibration technique through deep network. 595-604 - (Withdrawn) Nonlinear acoustic noise cancellation based automatic speech recognition system (NANC-ASR) with convolutional neural networks. 605-613

- (Withdrawn) Drought Prediction and Analysis of Water level based on satellite images Using Deep Convolutional Neural Network. 615-623

- (Withdrawn) Detecting adversarial attacks on audio-visual speech recognition using deep learning method. 625-631

- Niveditha V. R., Senthilnathan Palaniappan, K. Naresh, Chinmaya Kumar Nayak

, B. Swapna:
High speed low area decimation filter for hearing aid application. 633-639 - (Withdrawn) An adaptive speech signal processing for COVID-19 detection using deep learning approach. 641-649

- Hamsa A. Abdullah

, Raya K. Mohammed:
FPGA-based modified chaotic system for speech transmission. 651-657 - Zinah Abdulridha Abutiheen

, Enas Ali Mohammed, Mohsin Hasan Hussein
:
Behavior analysis in Arabic social media. 659-666 - Long Shi:

Application of big data language recognition technology and GPU parallel computing in English teaching visualization system. 667-677 - Samia Abd El-Moneim, Eman Abd El-Mordy, Mohamed Abd-Elsalam Nassar, Moawad I. Dessouky, Nabil A. Ismail, Adel S. El-Fishawy

, Sami A. El-Dolil, Ibrahim M. El-Dokany, Fathi E. Abd El-Samie:
Performance enhancement of text-independent speaker recognition in noisy and reverberation conditions using Radon transform with deep learning. 679-687 - Samia Abd El-Moneim, Mohamed Abd-Elsalam Nassar, Moawad I. Dessouky, Nabil A. Ismail, Adel S. El-Fishawy

, Fathi E. Abd El-Samie:
Cancellable template generation for speaker recognition based on spectrogram patch selection and deep convolutional neural networks. 689-696 - Sunil Kumar Koduri

, Kishore Kumar Tappeta:
Discrete cosine transform-based data hiding for speech bandwidth extension. 697-706 - Tulika Jha, Ramisetty Kavya, J. Jabez Christopher

, Vasan Arunachalam:
Machine learning techniques for speech emotion recognition using paralinguistic acoustic features. 707-725 - Anshul Kumar, Ankit Kumar Jain

:
Emotion detection in psychological texts by fine-tuning BERT using emotion-cause pair extraction. 727-743 - Rahul Kumar Jaiswal

, Sreenivasa Reddy Yeduri, Linga Reddy Cenkeramaddi
:
Single-channel speech enhancement using implicit Wiener filter for high-quality speech communication. 745-758 - Chinmay Maiti, Bibhas Chandra Dhara:

A blind audio watermarking based on singular value decomposition and quantization. 759-771 - Vijay M. Sardar

, Manisha L. Jadhav, Saurabh H. Deshmukh:
Timbre features with MEDIAN values for compensating intra-speaker variability in speaker identification of whispering sound. 773-782
Volume 25, Number 4, December 2022
- Praseetha V. M.

, P. P. Joby
:
Speech emotion recognition using data augmentation. 783-792 - S. Anjali Devi, S. Sivakumar

:
An efficient contextual glove feature extraction model on large textual databases. 793-802 - Shaikh Abdul Waheed

, P. Sheik Abdul Khader, A. Abdul Azeez Khan
, K. Javubar Sathick:
Feature extraction from behavioral styles of children for prediction of severity of stuttering using historical stuttering data. 803-815 - Yi Jiang, Erli Cheng, Yonghao Li, Yali Zhang:

Construction of complex environment speech signal communication system based on 5G and AI driven feature extraction techniques. 817-830 - V. Srinivasarao, Umesh Ghanekar:

A new double backward distributive weighted adaptive filtering approach for speech quality improvement. 831-836 - Ashok Kumar Konduru

, J. L. Mazher Iqbal:
Handling high dimensional features by ensemble learning for emotion identification from speech signal. 837-851 - Xiao Ye, Xin Lv:

Data analysis framework for visual interactive product design under the background of cloud social speech environment. 853-862 - Kunyu Li

, Xunxiang Li:
AI driven human-computer interaction design framework of virtual environment based on comprehensive semantic data analysis with feature extraction. 863-877 - Zong-Peng Kuo, Joy Iong-Zong Chen

:
To deploy trained speech with DNN-LSTM framework for controlling a smart wheeled-robot in limited learning circumstance. 879-891 - Haidong Xu:

Intelligent automobile auxiliary propagation system based on speech recognition and AI driven feature extraction techniques. 893-905 - Dinesh Kumar Anguraj, J. Anitha, S. John Justin Thangaraj

, L. Ramesh, Seetha Rama Krishna, D. Mythrayee:
Analysis of influencing features with spectral feature extraction and multi-class classification using deep neural network for speech recognition system. 907-920 - Lu Yang:

HTK-based speech recognition and corpus-based English vocabulary online guiding system. 921-931 - A. Kishore Kumar

, Shefali Waldekar, Md. Sahidullah, Goutam Kumar Saha:
Robust acoustic domain identification with its application to speaker diarization. 933-945 - Rahul Kumar Jaiswal

, Rajesh Kumar Dubey
:
Non-intrusive speech quality assessment using context-aware neural networks. 947-965 - Yagnavajjula Madhu Keerthana

, K. Sreenivasa Rao, Pabitra Mitra:
Dysarthric speech detection from telephone quality speech using epoch-based pitch perturbation features. 967-973 - Tiemin Mei

, Guorong He, Yandong Zhao, Jihan Dong:
Blind identification of the inverse of SIMO system and deconvolution with Kalman filter. 975-986 - Xuefei Wang, Yanhua Long

, Dongxing Xu:
Universal and accent-discriminative encoders for conformer-based accent-invariant speech recognition. 987-995 - Adnan Gutub

:
Integrity verification of Holy Quran verses recitation via incomplete watermarking authentication. 997-1011 - Chol-Jin Han, Un-Chol Ri, Song-Il Mun, Kang-Song Jang, Song-Yun Han:

An end-to-end TTS model with pronunciation predictor. 1013-1024 - Luciana Albuquerque

, António J. S. Teixeira
, Catarina Oliveira
, Daniela Figueiredo
:
Age and vowel classification improvement by the inclusion of vowel dynamic features. 1025-1040 - (Withdrawn) Speaker identification using hybrid neural network support vector machine classifier. 1041-1053


manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














