


Остановите войну!
for scientists:


default search action
John H. L. Hansen
Person information

- affiliation: University of Texas at Dallas
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j147]Shahram Ghorbani
, John H. L. Hansen
:
Domain Expansion for End-to-End Speech Recognition: Applications for Accent/Dialect Speech. IEEE ACM Trans. Audio Speech Lang. Process. 31: 762-774 (2023) - [i46]Mufan Sang, Yong Zhao, Gang Liu, John H. L. Hansen, Jian Wu:
Improving Transformer-based Networks With Locality For Automatic Speaker Verification. CoRR abs/2302.08639 (2023) - 2022
- [j146]Iván López-Espejo
, Zheng-Hua Tan
, John H. L. Hansen
, Jesper Jensen:
Deep Spoken Keyword Spotting: An Overview. IEEE Access 10: 4169-4199 (2022) - [j145]Rasa Lileikyte, Dwight Irvin
, John H. L. Hansen
:
Assessing child communication engagement and statistical speech patterns for American English via speech recognition in naturalistic active learning spaces. Speech Commun. 140: 98-108 (2022) - [j144]Zhenyu Wang, John H. L. Hansen
:
Multi-Source Domain Adaptation for Text-Independent Forensic Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 30: 60-75 (2022) - [j143]Vinay Kothapally
, John H. L. Hansen
:
SkipConvGAN: Monaural Speech Dereverberation Using Generative Adversarial Networks via Complex Time-Frequency Masking. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1600-1613 (2022) - [j142]Ria Ghosh
, Hussnain Ali, John H. L. Hansen
:
CCi-MOBILE: A Portable Real Time Speech Processing Platform for Cochlear Implant and Hearing Research. IEEE Trans. Biomed. Eng. 69(3): 1251-1263 (2022) - [j141]Yongkang Liu
, Ziran Wang
, Kyungtae Han
, Zhenyu Shou, Prashant Tiwari, John H. L. Hansen
:
Vision-Cloud Data Fusion for ADAS: A Lane Change Prediction Case Study. IEEE Trans. Intell. Veh. 7(2): 210-220 (2022) - [c403]Ria Ghosh, John H. L. Hansen:
Bimodal Cochlear Implant Processing based on Assisted Hearing algorithms with CCi-MOBILE: an open-source research platform. EMBC 2022: 4265-4268 - [c402]Mufan Sang, John H. L. Hansen:
Multi-Frequency Information Enhanced Channel Attention Module for Speaker Representation Learning. INTERSPEECH 2022: 321-325 - [c401]John H. L. Hansen, Zhenyu Wang:
Audio Anti-spoofing Using Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning. INTERSPEECH 2022: 376-380 - [c400]Jiamin Xie, John H. L. Hansen:
DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition. INTERSPEECH 2022: 1392-1396 - [c399]Avamarie Brueggeman, John H. L. Hansen:
Speaker Trait Enhancement for Cochlear Implant Users: A Case Study for Speaker Emotion Perception. INTERSPEECH 2022: 2268-2272 - [c398]Vinay Kothapally, John H. L. Hansen:
Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation. INTERSPEECH 2022: 2543-2547 - [c397]Szu-Jui Chen, Jiamin Xie, John H. L. Hansen:
FeaRLESS: Feature Refinement Loss for Ensembling Self-Supervised Learning Features in Robust End-to-end Speech Recognition. INTERSPEECH 2022: 3058-3062 - [c396]Satwik Dutta, Sarah Anne Tao, Jacob C. Reyna, Rebecca Elizabeth Hacker, Dwight W. Irvin, Jay F. Buzhardt, John H. L. Hansen:
Challenges remain in Building ASR for Spontaneous Preschool Children Speech in Naturalistic Educational Environments. INTERSPEECH 2022: 4322-4326 - [c395]Mu Yang, Kevin Hirschi, Stephen Daniel Looney, Okim Kang, John H. L. Hansen:
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment. INTERSPEECH 2022: 4481-4485 - [c394]Chelzy Belitz, John H. L. Hansen:
Challenges in Metadata Creation for Massive Naturalistic Team-Based Audio Data. INTERSPEECH 2022: 5210-5214 - [c393]Juliana N. Saba, John H. L. Hansen:
Speech Modification for Intelligibility in Cochlear Implant Listeners: Individual Effects of Vowel- and Consonant-Boosting. INTERSPEECH 2022: 5473-5477 - [e3]Hanseok Ko, John H. L. Hansen:
Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. ISCA 2022 [contents] - [i45]Zhenyu Wang, John H. L. Hansen:
Impact of Naturalistic Field Acoustic Environments on Forensic Text-independent Speaker Verification System. CoRR abs/2201.13246 (2022) - [i44]Mu Yang, Kevin Hirschi, Stephen D. Looney, Okim Kang, John H. L. Hansen:
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment. CoRR abs/2203.15937 (2022) - [i43]Szu-Jui Chen, Jiamin Xie, John H. L. Hansen:
FeaRLESS: Feature Refinement Loss for Ensembling Self-Supervised Learning Features in Robust End-to-end Speech Recognition. CoRR abs/2206.15056 (2022) - [i42]Mufan Sang, John H. L. Hansen:
Multi-Frequency Information Enhanced Channel Attention Module for Speaker Representation Learning. CoRR abs/2207.04540 (2022) - [i41]Wei Xia, John H. L. Hansen:
Data-driven Attention and Data-independent DCT based Global Context Modeling for Text-independent Speaker Recognition. CoRR abs/2208.02778 (2022) - [i40]Mu Yang, Andros Tjandra, Chunxi Liu, David Zhang, Duc Le, John H. L. Hansen, Ozlem Kalinli:
Learning ASR pathways: A sparse multilingual ASR model. CoRR abs/2209.05735 (2022) - [i39]Aditya Joglekar, John H. L. Hansen:
Fearless Steps Challenge Phase-1 Evaluation Plan. CoRR abs/2211.02051 (2022) - [i38]Zhenyu Wang, John H. L. Hansen:
Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning. CoRR abs/2211.09898 (2022) - [i37]Zhenyu Wang, John H. L. Hansen:
Multi-source Domain Adaptation for Text-independent Forensic Speaker Recognition. CoRR abs/2211.09913 (2022) - [i36]Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen:
Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise. CoRR abs/2211.10565 (2022) - [i35]Vinay Kothapally, John H. L. Hansen:
SkipConvGAN: Monaural Speech Dereverberation using Generative Adversarial Networks via Complex Time-Frequency Masking. CoRR abs/2211.12623 (2022) - [i34]Vinay Kothapally, John H. L. Hansen:
Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation. CoRR abs/2211.12632 (2022) - 2021
- [j140]Fahimeh Bahmaninezhad
, Chunlei Zhang, John H. L. Hansen
:
An investigation of domain adaptation in speaker embedding space for speaker recognition. Speech Commun. 129: 7-16 (2021) - [j139]Shivesh Ranjan
, John H. L. Hansen
:
Curriculum Learning based approaches for robust end-to-end far-field speech recognition. Speech Commun. 132: 123-131 (2021) - [j138]John H. L. Hansen
, Allen R. Stauffer, Wei Xia:
Nonlinear waveform distortion: Assessment and detection of clipping on speech data and systems. Speech Commun. 134: 20-31 (2021) - [j137]Midia Yousefi, John H. L. Hansen
:
Block-Based High Performance CNN Architectures for Frame-Level Overlapping Speech Detection. IEEE ACM Trans. Audio Speech Lang. Process. 29: 28-40 (2021) - [j136]Finnian Kelly, John H. L. Hansen
:
Analysis and Calibration of Lombard Effect and Whisper for Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 927-942 (2021) - [j135]Kazi Nazmul Haque
, Rajib Rana, Jiajun Liu
, John H. L. Hansen
, Nicholas Cummins, Carlos Busso
, Björn W. Schuller
:
Guided Generative Adversarial Neural Network for Representation Learning and Audio Generation Using Fewer Labelled Audio Data. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2575-2590 (2021) - [c392]Midia Yousefi, John H. L. Hansen:
Speaker Conditioning of Acoustic Models Using Affine Transformation for Multi-Speaker Speech Recognition. ASRU 2021: 283-288 - [c391]Szu-Jui Chen, Wei Xia, John H. L. Hansen:
Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora. ASRU 2021: 289-295 - [c390]Mufan Sang, Wei Xia, John H. L. Hansen:
DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning. ICASSP 2021: 6169-6173 - [c389]Prasanna V. Kothalkar, Sathvik Datla, Satwik Dutta, John H. L. Hansen, Yagmur Seven, Dwight Irvin, Jay Buzhardt:
Measuring Frequency of Child-directed WH-Question Words for Alternate Preschool Locations using Speech Recognition and Location Tracking Technologies. ICMI Companion 2021: 414-418 - [c388]Aditya Joglekar, Seyed Omid Sadjadi, Meena Chandra Shekar, Christopher Cieri, John H. L. Hansen:
Fearless Steps Challenge Phase-3 (FSC P3): Advancing SLT for Unseen Channel and Mission Data Across NASA Apollo Audio. Interspeech 2021: 986-990 - [c387]Midia Yousefi, John H. L. Hansen:
Real-Time Speaker Counting in a Cocktail Party Scenario Using Attention-Guided Convolutional Neural Network. Interspeech 2021: 1484-1488 - [c386]Ram C. M. C. Shekar, Chelzy Belitz, John H. L. Hansen:
Development of CNN-Based Cochlear Implant and Normal Hearing Sound Recognition Models Using Natural and Auralized Environmental Audio. SLT 2021: 728-733 - [c385]Hazem Younis, John H. L. Hansen:
Challenges in real-time-embedded IoT Command Recognition. WF-IoT 2021: 848-851 - [i33]Szu-Jui Chen, Wei Xia, John H. L. Hansen:
Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora. CoRR abs/2109.11086 (2021) - [i32]Midia Yousefi, John H. L. Hansen:
Real-time Speaker counting in a cocktail party scenario using Attention-guided Convolutional Neural Network. CoRR abs/2111.00316 (2021) - [i31]Midia Yousefi, John H. L. Hansen:
Single-channel speech separation using Soft-minimum Permutation Invariant Training. CoRR abs/2111.08635 (2021) - [i30]Iván López-Espejo, Zheng-Hua Tan, John H. L. Hansen, Jesper Jensen:
Deep Spoken Keyword Spotting: An Overview. CoRR abs/2111.10592 (2021) - [i29]Yongkang Liu, Ziran Wang, Kyungtae Han, Zhenyu Shou, Prashant Tiwari, John H. L. Hansen:
Vision-Cloud Data Fusion for ADAS: A Lane Change Prediction Case Study. CoRR abs/2112.04042 (2021) - 2020
- [c384]Ria Ghosh
, Ram Charan Chandra Shekar, John H. L. Hansen:
Portable Smart-Space Research Interface to Predetermine Environment Acoustics for Cochlear implant and Hearing aid users with CCi-MOBILE. EMBC 2020: 4221-4224 - [c383]Midia Yousefi, John H. L. Hansen:
Frame-Based Overlapping Speech Detection Using Convolutional Neural Networks. ICASSP 2020: 6744-6748 - [c382]Zhenyu Wang, John H. L. Hansen, Yanlu Xie:
A multi-view approach for Mandarin non-native mispronunciation verification. ICASSP 2020: 8079-8083 - [c381]Zhenyu Wang, Wei Xia, John H. L. Hansen:
Cross-Domain Adaptation with Discrepancy Minimization for Text-Independent Forensic Speaker Verification. INTERSPEECH 2020: 2257-2261 - [c380]Mufan Sang, Wei Xia, John H. L. Hansen:
Open-Set Short Utterance Forensic Speaker Verification Using Teacher-Student Network with Explicit Inductive Bias. INTERSPEECH 2020: 2262-2266 - [c379]Aditya Joglekar, John H. L. Hansen, Meena Chandra Shekhar, Abhijeet Sangwan:
FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data. INTERSPEECH 2020: 2617-2621 - [c378]Wei Xia, John H. L. Hansen:
Speaker Representation Learning Using Global Context Guided Channel and Time-Frequency Transformations. INTERSPEECH 2020: 3226-3230 - [c377]Vinay Kothapally, Wei Xia, Shahram Ghorbani, John H. L. Hansen, Wei Xue, Jing Huang:
SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation Using Optimally Smoothed Spectral Mapping. INTERSPEECH 2020: 3935-3939 - [c376]Kevin Hirschi, Okim Kang, Catia Cucchiarini, John H. L. Hansen, Keelan Evanini, Helmer Strik:
Mobile-Assisted Prosody Training for Limited English Proficiency: Learner Background and Speech Learning Pattern. INTERSPEECH 2020: 4452-4456 - [c375]Avamarie Brueggeman, John H. L. Hansen:
Effect of Spectral Complexity Reduction and Number of Instruments on Musical Enjoyment with Cochlear Implants. INTERSPEECH 2020: 4636-4640 - [c374]Yongkang Liu, Ziran Wang, Kyungtae Han, Zhenyu Shou, Prashant Tiwari, John H. L. Hansen:
Sensor Fusion of Camera and Cloud Digital Twin Information for Intelligent Vehicles. IV 2020: 182-187 - [c373]Rasa Lileikyte, Dwight Irvin, John H. L. Hansen:
Assessing Child Communication Engagement via Speech Recognition in Naturalistic Active Learning Spaces. Odyssey 2020: 396-401 - [i28]Yongkang Liu, Ziran Wang, Kyungtae Han, Zhenyu Shou, Prashant Tiwari, John H. L. Hansen:
Sensor Fusion of Camera and Cloud Digital Twin Information for Intelligent Vehicles. CoRR abs/2007.04350 (2020) - [i27]Vinay Kothapally, Wei Xia, Shahram Ghorbani, John H. L. Hansen, Wei Xue, Jing Huang:
SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation using Optimally Smoothed Spectral Mapping. CoRR abs/2007.09131 (2020) - [i26]Aditya Joglekar, John H. L. Hansen, Meena Chandra Shekhar, Abhijeet Sangwan:
FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data. CoRR abs/2008.06764 (2020) - [i25]Wei Xia, John H. L. Hansen:
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations. CoRR abs/2009.00768 (2020) - [i24]Zhenyu Wang, Wei Xia, John H. L. Hansen:
Cross-domain Adaptation with Discrepancy Minimization for Text-independent Forensic Speaker Verification. CoRR abs/2009.02444 (2020) - [i23]Zhenyu Wang, John H. L. Hansen, Yanlu Xie:
A multi-view approach for Mandarin non-native mispronunciation verification. CoRR abs/2009.02573 (2020) - [i22]Mufan Sang, Wei Xia, John H. L. Hansen:
Open-set Short Utterance Forensic Speaker Verification using Teacher-Student Network with Explicit Inductive Bias. CoRR abs/2009.09556 (2020) - [i21]Meemnur Rashid, Kaisar Ahmed Alman, Khaled Hasan, John H. L. Hansen, Taufiq Hasan:
Respiratory Distress Detection from Telephone Speech using Acoustic and Prosodic Features. CoRR abs/2011.09270 (2020) - [i20]Mufan Sang, Wei Xia, John H. L. Hansen:
DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning. CoRR abs/2012.06896 (2020)
2010 – 2019
- 2019
- [j134]John H. L. Hansen
, Maryam Najafian, Rasa Lileikyte, Dwight Irvin
, Beth S. Rous
:
Speech and language processing for assessing child-adult interaction based on diarization and location. Int. J. Speech Technol. 22(3): 697-709 (2019) - [j133]Seyedmahdad Mirsamadi
, John H. L. Hansen
:
Multi-domain adversarial training of neural network acoustic models for distant speech recognition. Speech Commun. 106: 21-30 (2019) - [c372]Michelle Bancroft, Reza Lotfian, John H. L. Hansen, Carlos Busso
:
Exploring the Intersection Between Speaker Verification and Emotion Recognition. ACII Workshops 2019: 337-342 - [c371]Shahram Ghorbani, Soheil Khorram, John H. L. Hansen:
Domain Expansion in DNN-Based Acoustic Models for Robust Speech Recognition. ASRU 2019: 107-113 - [c370]Salar Jafarlou, Soheil Khorram, Vinay Kothapally, John H. L. Hansen:
Analyzing Large Receptive Field Convolutional Networks for Distant Speech Recognition. ASRU 2019: 252-259 - [c369]John H. L. Hansen, Hussnain Ali, Juliana N. Saba, M. C. Ram Charan, Nursadul Mamun, Ria Ghosh, Avamarie Brueggeman:
CCi-MOBILE: Design and Evaluation of a Cochlear Implant and Hearing Aid Research Platform for Speech Scientists and Engineers. BHI 2019: 1-4 - [c368]Chunlei Zhang, Fahimeh Bahmaninezhad, Shivesh Ranjan, Harishchandra Dubey
, Wei Xia, John H. L. Hansen:
UTD-CRSS Systems for 2018 NIST Speaker Recognition Evaluation. ICASSP 2019: 5776-5780 - [c367]Wei Xia, Jing Huang, John H. L. Hansen:
Cross-lingual Text-independent Speaker Verification Using Unsupervised Adversarial Discriminative Domain Adaptation. ICASSP 2019: 5816-5820 - [c366]Chunlei Zhang, Qian Zhang, John H. L. Hansen:
Semi-supervised Learning with Generative Adversarial Networks for Arabic Dialect Identification. ICASSP 2019: 5986-5990 - [c365]Harishchandra Dubey
, Abhijeet Sangwan, John H. L. Hansen:
Transfer Learning Using Raw Waveform Sincnet for Robust Speaker Diarization. ICASSP 2019: 6296-6300 - [c364]Harishchandra Dubey, Abhijeet Sangwan, John H. L. Hansen:
Toeplitz Inverse Covariance Based Robust Speaker Clustering for Naturalistic Audio Streams. INTERSPEECH 2019: 416-420 - [c363]John H. L. Hansen, Aditya Joglekar, Meena Chandra Shekhar, Vinay Kothapally, Chengzhu Yu, Lakshmish Kaushik, Abhijeet Sangwan:
The 2019 Inaugural Fearless Steps Challenge: A Giant Leap for Naturalistic Audio. INTERSPEECH 2019: 1851-1855 - [c362]Chelzy Belitz, Hussnain Ali, John H. L. Hansen:
A Machine Learning Based Clustering Protocol for Determining Hearing Aid Initial Configurations from Pure-Tone Audiograms. INTERSPEECH 2019: 2325-2329 - [c361]Nursadul Mamun, Ria Ghosh
, John H. L. Hansen:
Quantifying Cochlear Implant Users' Ability for Speaker Identification Using CI Auditory Stimuli. INTERSPEECH 2019: 3118-3122 - [c360]Qing Wang, Pengcheng Guo, Sining Sun, Lei Xie, John H. L. Hansen:
Adversarial Regularization for End-to-End Robust Speaker Verification. INTERSPEECH 2019: 4010-4014 - [c359]Nursadul Mamun, Soheil Khorram, John H. L. Hansen:
Convolutional Neural Network-Based Speech Enhancement for Cochlear Implant Recipients. INTERSPEECH 2019: 4265-4269 - [c358]Midia Yousefi, Soheil Khorram, John H. L. Hansen:
Probabilistic Permutation Invariant Training for Speech Separation. INTERSPEECH 2019: 4604-4608 - [c357]Yongkang Liu, John H. L. Hansen:
Towards Complexity Level Classification of Driving Scenarios Using Environmental Information. ITSC 2019: 810-815 - [c356]Ekim Yurtsever
, Yongkang Liu, Jacob Lambert, Chiyomi Miyajima, Eijiro Takeuchi, Kazuya Takeda, John H. L. Hansen:
Risky Action Recognition in Lane Change Video Clips using Deep Spatiotemporal Networks with Segmentation Mask Transfer. ITSC 2019: 3100-3107 - [c355]Prasanna V. Kothalkar, Dwight Irvin, Ying Luo, Joanne Rojas, John Nash, Beth S. Rous
, John H. L. Hansen:
Tagging child-adult interactions in naturalistic, noisy, daylong school environments using i-vector based diarization system. SLaTE 2019: 89-93 - [i19]Yang Zheng, Izzat H. Izzat, John H. L. Hansen:
Exploring OpenStreetMap Availability for Driving Environment Understanding. CoRR abs/1903.04084 (2019) - [i18]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019) - [i17]Ekim Yurtsever, Yongkang Liu, Jacob Lambert, Chiyomi Miyajima, Eijiro Takeuchi, Kazuya Takeda, John H. L. Hansen:
Risky Action Recognition in Lane Change Video Clips using Deep Spatiotemporal Networks with Segmentation Mask Transfer. CoRR abs/1906.02859 (2019) - [i16]Nursadul Mamun, Soheil Khorram, John H. L. Hansen:
Convolutional Neural Network-based Speech Enhancement for Cochlear Implant Recipients. CoRR abs/1907.02526 (2019) - [i15]Harishchandra Dubey, Abhijeet Sangwan, John H. L. Hansen:
Toeplitz Inverse Covariance based Robust Speaker Clustering for Naturalistic Audio Streams. CoRR abs/1907.05584 (2019) - [i14]Nursadul Mamun, Ria Ghosh, John H. L. Hansen:
Quantifying Cochlear Implant Users' Ability for Speaker Identification using CI Auditory Stimuli. CoRR abs/1908.00031 (2019) - [i13]Wei Xia, Jing Huang, John H. L. Hansen:
Cross-lingual Text-independent Speaker Verification using Unsupervised Adversarial Discriminative Domain Adaptation. CoRR abs/1908.01447 (2019) - [i12]Midia Yousefi, Soheil Khorram, John H. L. Hansen:
Probabilistic Permutation Invariant Training for Speech Separation. CoRR abs/1908.01768 (2019) - [i11]Shahram Ghorbani, Soheil Khorram, John H. L. Hansen:
Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition. CoRR abs/1910.00565 (2019) - [i10]Salar Jafarlou, Soheil Khorram, Vinay Kothapally, John H. L. Hansen:
Analyzing Large Receptive Field Convolutional Networks for Distant Speech Recognition. CoRR abs/1910.07047 (2019) - [i9]Fahimeh Bahmaninezhad, Shi-Xiong Zhang, Yong Xu, Meng Yu, John H. L. Hansen, Dong Yu:
A Unified Framework for Speech Separation. CoRR abs/1912.07814 (2019) - 2018
- [j132]Abhinav Misra, John H. L. Hansen:
Modelling and compensation for language mismatch in speaker verification. Speech Commun. 96: 58-66 (2018) - [j131]John H. L. Hansen
, Hynek Boril:
On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks. Speech Commun. 101: 94-108 (2018) - [j130]Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen
:
Speech Activity Detection in Naturalistic Audio Environments: Fearless Steps Apollo Corpus. IEEE Signal Process. Lett. 25(9): 1290-1294 (2018) - [j129]Shivesh Ranjan
, John H. L. Hansen
:
Curriculum Learning Based Approaches for Noise Robust Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 26(1): 197-210 (2018) - [j128]Qian Zhang, John H. L. Hansen
:
Language/Dialect Recognition Based on Unsupervised Deep Learning. IEEE ACM Trans. Audio Speech Lang. Process. 26(5): 873-882 (2018) - [j127]Abhinav Misra, John H. L. Hansen
:
Maximum-Likelihood Linear Transformation for Unsupervised Domain Adaptation in Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 26(9): 1549-1558 (2018) - [j126]Chunlei Zhang
, Kazuhito Koishida, John H. L. Hansen
:
Text-Independent Speaker Verification Based on Triplet Convolutional Neural Network Embeddings. IEEE ACM Trans. Audio Speech Lang. Process. 26(9): 1633-1644 (2018) - [j125]Harishchandra Dubey
, Abhijeet Sangwan, John H. L. Hansen
:
Leveraging Frequency-Dependent Kernel and DIP-Based Clustering for Robust Speech Activity Detection in Naturalistic Audio Streams. IEEE ACM Trans. Audio Speech Lang. Process. 26(11): 2056-2071 (2018) - [c354]Prasanna V. Kothalkar, Johanna Rudolph, Christine Dollaghan, Jennifer McGlothlin, Thomas F. Campbell, John H. L. Hansen:
Automatic Screening to Detect 'At Risk' Child Speech Samples using a Clinical Group Verification framework. EMBC 2018: 4909-4913 - [c353]Harishchandra Dubey
, Abhijeet Sangwan, John H. L. Hansen:
Robust Feature Clustering for Unsupervised Speech Activity Detection. ICASSP 2018: 2726-2730 - [c352]Wei Xia, John H. L. Hansen:
Speaker Recognition with Nonlinear Distortion: Clipping Analysis and Impact. INTERSPEECH 2018: 746-750 - [c351]Fahimeh Bahmaninezhad,