


default search action
Zheng-Hua Tan
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j83]Andreas Jonas Fuglsig
, Zheng-Hua Tan
, Lars Søndergaard Bertelsen
, Jesper Jensen
, Jens Christian Lindof, Jan Østergaard
:
Joint Far- and Near-End Speech and Listening Enhancement With Minimum Processing. IEEE Access 12: 119983-120004 (2024) - [j82]Georg Ørnskov Rønsch, Iván López-Espejo
, Daniel Michelsanti
, Yuying Xie
, Petar Popovski
, Zheng-Hua Tan
:
Utilization of acoustic signals with generative Gaussian and autoencoder modeling for condition-based maintenance of injection moulds. Int. J. Comput. Integr. Manuf. 37(4): 438-453 (2024) - [j81]Philippe Gonzalez
, Zheng-Hua Tan
, Jan Østergaard
, Jesper Jensen
, Tommy Sonne Alstrøm
, Tobias May
:
The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems. IEEE Signal Process. Lett. 31: 2225-2229 (2024) - [j80]Yiming Zhang
, Ruoyi Du
, Zheng-Hua Tan
, Wenwu Wang
, Zhanyu Ma
:
Generating Accurate and Diverse Audio Captions Through Variational Autoencoder Framework. IEEE Signal Process. Lett. 31: 2520-2524 (2024) - [j79]Mathias Bach Pedersen
, Søren Holdt Jensen
, Zheng-Hua Tan
, Jesper Jensen
:
Data-Driven Non-Intrusive Speech Intelligibility Prediction Using Speech Presence Probability. IEEE ACM Trans. Audio Speech Lang. Process. 32: 55-67 (2024) - [j78]Philippe Gonzalez
, Zheng-Hua Tan
, Jan Østergaard
, Jesper Jensen
, Tommy Sonne Alstrøm
, Tobias May
:
Investigating the Design Space of Diffusion Models for Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4486-4500 (2024) - [c126]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafael Wisniewski, Mihály Petreczky:
PAC-Bayes Generalisation Bounds for Dynamical Systems including Stable RNNs. AAAI 2024: 11901-11909 - [c125]Yuying Xie, Michael Kuhlmann, Frederik Rautenberg, Zheng-Hua Tan, Reinhold Haeb-Umbach:
Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder. EUSIPCO 2024: 436-440 - [c124]M. Asjid Tanveer, Jesper Jensen, Zheng-Hua Tan, Jan Østergaard:
Envelope Based Deep Source Separation and EEG Auditory Attention Decoding for Speech and Music. EUSIPCO 2024: 872-876 - [c123]Andreas Jonas Fuglsig
, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard:
Joint Minimum Processing Beamforming and Near-End Listening Enhancement. ICASSP Workshops 2024: 485-489 - [c122]Holger Severin Bovbjerg, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan:
Self-Supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions. ICASSP 2024: 10126-10130 - [c121]Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm
, Tobias May
:
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler. ICASSP 2024: 10431-10435 - [c120]Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan:
Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners. ICLR 2024 - [c119]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
PAC-Bayesian Error Bound, via Rényi Divergence, for a Class of Linear Time-Invariant State-Space Models. ICML 2024 - [c118]Yuying Xie, Thomas Arildsen, Zheng-Hua Tan:
Complex Recurrent Variational Autoencoder for Speech Resynthesis and Enhancement. IJCNN 2024: 1-7 - [c117]Filippo Villani, Wai-Yip Chan, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen:
Near-End Listening Enhancement Using a Noise-Robust Linear Time-Invariant Filter. IWAENC 2024: 444-448 - [i70]Jacob Mørk, Holger Severin Bovbjerg, Gergely Kiss, Zheng-Hua Tan:
Noise-Robust Keyword Spotting through Self-supervised Pretraining. CoRR abs/2403.18560 (2024) - [i69]Sarthak Yadav, Zheng-Hua Tan:
Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations. CoRR abs/2406.02178 (2024) - [i68]Yiming Zhang, Xuenan Xu, Ruoyi Du, Haohe Liu, Yuan Dong, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma:
Zero-Shot Audio Captioning Using Soft and Hard Prompts. CoRR abs/2406.06295 (2024) - [i67]Sarthak Yadav, Sergios Theodoridis, Zheng-Hua Tan:
Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs. CoRR abs/2408.16568 (2024) - [i66]Gustav Wagner Zakarias, Lars Kai Hansen, Zheng-Hua Tan:
BiSSL: Bilevel Optimization for Self-Supervised Pre-Training and Fine-Tuning. CoRR abs/2410.02387 (2024) - 2023
- [j77]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
Explicit construction of the minimum error variance estimator for stochastic LTI-ss systems. Autom. 153: 111018 (2023) - [j76]Iván López-Espejo
, Amin Edraki, Wai-Yip Chan, Zheng-Hua Tan, Jesper Jensen:
On the deficiency of intelligibility metrics as proxies for subjective intelligibility. Speech Commun. 150: 9-22 (2023) - [j75]Sharon Gannot
, Zheng-Hua Tan
, Martin Haardt
, Nancy F. Chen
, Hoi-To Wai
, Ivan Tashev
, Walter Kellermann
, Justin Dauwels
:
Data Science Education: The Signal Processing Perspective [SP Education]. IEEE Signal Process. Mag. 40(7): 89-93 (2023) - [j74]Andreas Jonas Fuglsig
, Jesper Jensen
, Zheng-Hua Tan
, Lars Søndergaard Bertelsen
, Jens Christian Lindof, Jan Østergaard
:
Minimum Processing Near-End Listening Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2233-2245 (2023) - [j73]Yiming Zhang
, Hong Yu
, Ruoyi Du, Zheng-Hua Tan
, Wenwu Wang
, Zhanyu Ma
, Yuan Dong
:
ACTUAL: Audio Captioning With Caption Feature Space Regularization. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2643-2657 (2023) - [j72]Zhanyu Ma
, Xiaoou Lu, Jiyang Xie
, Zhen Yang
, Jing-Hao Xue
, Zheng-Hua Tan
, Bo Xiao
, Jun Guo
:
On the Comparisons of Decorrelation Approaches for Non-Gaussian Neutral Vector Variables. IEEE Trans. Neural Networks Learn. Syst. 34(4): 1823-1837 (2023) - [c116]Yuying Xie
, Thomas Arildsen, Zheng-Hua Tan:
Improved Disentangled Speech Representations Using Contrastive Learning in Factorized Hierarchical Variational Autoencoder. EUSIPCO 2023: 1330-1334 - [c115]Holger Severin Bovbjerg
, Zheng-Hua Tan:
Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining. ICASSP Workshops 2023: 1-5 - [c114]Iván López-Espejo
, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen:
Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting. ICASSP 2023: 1-5 - [c113]Daniel Michelsanti, Zheng-Hua Tan, Sergi Rotger-Griful, Jesper Jensen:
A Vision-Assisted Hearing Aid System Based on Deep Learning. ICASSP Workshops 2023: 1-4 - [c112]Cristian J. Vaca-Rubio, Pablo Ramirez-Espinosa, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho:
Radio Sensing with Large Intelligent Surface for 6G. ICASSP 2023: 1-5 - [c111]Juan Felipe Montesinos, Daniel Michelsanti, Gloria Haro, Zheng-Hua Tan, Jesper Jensen:
Speech inpainting: Context-based speech synthesis guided by video. INTERSPEECH 2023: 4459-4463 - [i65]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafael Wisniewski, Mihály Petreczky:
PAC-Bayesian bounds for learning LTI-ss systems with input from empirical loss. CoRR abs/2303.16816 (2023) - [i64]Juan F. Montesinos, Daniel Michelsanti, Gloria Haro, Zheng-Hua Tan, Jesper Jensen:
Speech inpainting: Context-based speech synthesis guided by video. CoRR abs/2306.00489 (2023) - [i63]Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan:
Masked Autoencoders with Multi-Window Attention Are Better Audio Learners. CoRR abs/2306.00561 (2023) - [i62]Andreas Jonas Fuglsig
, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard:
Joint Minimum Processing Beamforming and Near-end Listening Enhancement. CoRR abs/2309.11243 (2023) - [i61]Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler. CoRR abs/2312.02683 (2023) - [i60]Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
Investigating the Design Space of Diffusion Models for Speech Enhancement. CoRR abs/2312.04370 (2023) - [i59]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNs. CoRR abs/2312.09793 (2023) - [i58]Holger Severin Bovbjerg, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan:
Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions. CoRR abs/2312.16613 (2023) - 2022
- [j71]Iván López-Espejo
, Zheng-Hua Tan
, John H. L. Hansen
, Jesper Jensen:
Deep Spoken Keyword Spotting: An Overview. IEEE Access 10: 4169-4199 (2022) - [j70]Poul Hoang
, Zheng-Hua Tan
, Jan Mark de Haan, Jesper Jensen
:
The Minimum Overlap-Gap Algorithm for Speech Enhancement. IEEE Access 10: 14698-14716 (2022) - [j69]Bjørn Uttrup Dideriksen
, Kristoffer Derosche, Zheng-Hua Tan
:
iVAE-GAN: Identifiable VAE-GAN Models for Latent Representation Learning. IEEE Access 10: 48405-48418 (2022) - [j68]Mathias Bach Pedersen
, Asger Heidemann Andersen, Søren Holdt Jensen, Zheng-Hua Tan
, Jesper Jensen
:
Training Data-Driven Speech Intelligibility Predictors on Heterogeneous Listening Test Data. IEEE Access 10: 66175-66189 (2022) - [j67]Jiyang Xie
, Zhanyu Ma
, Jianjun Lei
, Guoqiang Zhang
, Jing-Hao Xue
, Zheng-Hua Tan
, Jun Guo
:
Advanced Dropout: A Model-Free Methodology for Bayesian Dropout Optimization. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 4605-4625 (2022) - [j66]Poul Hoang
, Jan Mark de Haan, Zheng-Hua Tan
, Jesper Jensen
:
Multichannel Speech Enhancement With Own Voice-Based Interfering Speech Suppression for Hearing Assistive Devices. IEEE ACM Trans. Audio Speech Lang. Process. 30: 706-720 (2022) - [c110]Cristian J. Vaca-Rubio, Dariush Salami, Petar Popovski, Elisabeth de Carvalho, Zheng-Hua Tan, Stephan Sigg:
User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars. EUSIPCO 2022: 1916-1920 - [c109]Iván López-Espejo
, Zheng-Hua Tan, Jesper Jensen:
An Experimental Study on Light Speech Features for Small-Footprint Keyword Spotting. IberSPEECH 2022: 131-135 - [c108]Andreas Jonas Fuglsig
, Jan Østergaard, Jesper Jensen, Lars Søndergaard Bertelsen, Peter Mariager, Zheng-Hua Tan:
Joint Far- and Near-End Speech Intelligibility Enhancement Based on the Approximated Speech Intelligibility Index. ICASSP 2022: 7752-7756 - [c107]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160 - [c106]Claus M. Larsen, Peter Koch
, Zheng-Hua Tan:
Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic Delay. INTERSPEECH 2022: 3759-3763 - [c105]Cristian J. Vaca-Rubio
, Roberto Pereira
, Xavier Mestre, David Gregoratti, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski
:
Floor Map Reconstruction Through Radio Sensing and Learning by a Large Intelligent Surface. MLSP 2022: 1-6 - [c104]Chien-Cheng Wu
, Zheng-Hua Tan, Cedomir Stefanovic:
AoI and Throughput Optimization for Hybrid Traffic in Cellular Uplink Using Reinforcement Learning. VTC Spring 2022: 1-6 - [i57]Achintya Kumar Sarkar, Zheng-Hua Tan:
On Training Targets and Activation Functions for Deep Representation Learning in Text-Dependent Speaker Verification. CoRR abs/2201.06426 (2022) - [i56]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022) - [i55]Cristian J. Vaca-Rubio, Dariush Salami, Petar Popovski, Elisabeth de Carvalho, Zheng-Hua Tan, Stephan Sigg:
User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars. CoRR abs/2205.10321 (2022) - [i54]Cristian J. Vaca-Rubio, Roberto Pereira, Xavier Mestre, David Gregoratti, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
Floor Map Reconstruction Through Radio Sensing and Learning By a Large Intelligent Surface. CoRR abs/2206.10750 (2022) - [i53]Holger Severin Bovbjerg, Zheng-Hua Tan:
Improving Label-Deficient Keyword Spotting Using Self-Supervised Pretraining. CoRR abs/2210.01703 (2022) - [i52]Andreas Jonas Fuglsig
, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard:
Minimum Processing Near-end Listening Enhancement. CoRR abs/2210.17154 (2022) - [i51]Christian Heider Nielsen, Zheng-Hua Tan:
Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise. CoRR abs/2211.01621 (2022) - [i50]Yuying Xie, Thomas Arildsen, Zheng-Hua Tan:
Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder. CoRR abs/2211.08191 (2022) - [i49]Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen:
Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise. CoRR abs/2211.10565 (2022) - [i48]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
PAC-Bayesian-Like Error Bound for a Class of Linear Time-Invariant Stochastic State-Space Models. CoRR abs/2212.14838 (2022) - 2021
- [j65]Achintya Kumar Sarkar
, Zheng-Hua Tan:
Self-segmentation of pass-phrase utterances for deep feature learning in text-dependent speaker verification. Comput. Speech Lang. 70: 101229 (2021) - [j64]Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Zheng-Hua Tan, Jing-Hao Xue, Jie Cao, Jun Guo:
Deep InterBoost networks for small-sample image classification. Neurocomputing 456: 492-503 (2021) - [j63]Cristian J. Vaca-Rubio
, Pablo Ramirez-Espinosa
, Kimmo Kansanen
, Zheng-Hua Tan
, Elisabeth de Carvalho
, Petar Popovski
:
Assessing Wireless Sensing Potential With Large Intelligent Surfaces. IEEE Open J. Commun. Soc. 2: 934-947 (2021) - [j62]Achintya Kumar Sarkar
, Zheng-Hua Tan
:
Vocal Tract Length Perturbation for Text-Dependent Speaker Verification With Autoregressive Prediction Coding. IEEE Signal Process. Lett. 28: 364-368 (2021) - [j61]Daniel Michelsanti
, Zheng-Hua Tan
, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu, Jesper Jensen:
An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1368-1396 (2021) - [j60]Iván López-Espejo
, Zheng-Hua Tan
, Jesper Jensen
:
A Novel Loss Function and Training Strategy for Noise-Robust Keyword Spotting. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2254-2266 (2021) - [c103]Chien-Cheng Wu
, Petar Popovski
, Zheng-Hua Tan, Cedomir Stefanovic:
Design of AoI-Aware 5G Uplink Scheduler Using Reinforcement Learning. 5GWF 2021: 176-181 - [c102]Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han
, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe
, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang:
Conferencingspeech Challenge: Towards Far-Field Multi-Channel Speech Enhancement for Video Conferencing. ASRU 2021: 679-686 - [c101]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Alireza Fakhrizadeh Esfahani, Mihály Petreczky:
PAC-Bayesian theory for stochastic LTI systems. CDC 2021: 6626-6633 - [c100]Poul Hoang
, Zheng-Hua Tan, Jan Mark de Haan, Jesper Jensen:
Joint Maximum Likelihood Estimation of Power Spectral Densities and Relative Acoustic Transfer Functions for Acoustic Beamforming. ICASSP 2021: 6119-6123 - [c99]Giovanni Morrone
, Daniel Michelsanti
, Zheng-Hua Tan, Jesper Jensen:
Audio-Visual Speech Inpainting with Deep Learning. ICASSP 2021: 6653-6657 - [c98]Morten Østergaard Nielsen, Jan Østergaard, Jesper Jensen, Zheng-Hua Tan:
Compression of DNNs Using Magnitude Pruning and Nonlinear Information Bottleneck Training. MLSP 2021: 1-6 - [c97]Yuying Xie
, Thomas Arildsen, Zheng-Hua Tan:
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective. MLSP 2021: 1-6 - [c96]Md. Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent:
UIAI System for Short-Duration Speaker Verification Challenge 2020. SLT 2021: 323-329 - [c95]Anders E. Kalør
, Daniel Michelsanti
, Federico Chiariotti
, Zheng-Hua Tan, Petar Popovski
:
Remote Anomaly Detection in Industry 4.0 Using Resource-Constrained Devices. SPAWC 2021: 251-255 - [i47]Achintya Kumar Sarkar, Md. Sahidullah, Zheng-Hua Tan:
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for Text-Dependent Speaker Verification. CoRR abs/2102.02074 (2021) - [i46]Deividas Eringis
, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Alireza Fakhrizadeh Esfahani, Mihály Petreczky:
PAC-Bayesian theory for stochastic LTI systems. CoRR abs/2103.12866 (2021) - [i45]Morten Kolbæk, Zheng-Hua Tan, Søren Holdt Jensen, Jesper Jensen:
On TasNet for Low-Latency Single-Speaker Speech Enhancement. CoRR abs/2103.14882 (2021) - [i44]Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang:
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing. CoRR abs/2104.00960 (2021) - [i43]Max Væhrens
, Andreas Jonas Fuglsig, Anders Post Jacobsen, Nicolai Almskou Rasmussen, Victor Mølbach Nissen, Joachim Roland Hejslet, Zheng-Hua Tan:
Improvement of Noise-Robust Single-Channel Voice Activity Detection with Spatial Pre-processing. CoRR abs/2104.05481 (2021) - [i42]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
Optimal Prediction of Unmeasured Output from Measurable Outputs In LTI Systems. CoRR abs/2109.02384 (2021) - [i41]Anders E. Kalør, Daniel Michelsanti, Federico Chiariotti, Zheng-Hua Tan, Petar Popovski:
Remote Anomaly Detection in Industry 4.0 Using Resource-Constrained Devices. CoRR abs/2110.05757 (2021) - [i40]Chien-Cheng Wu, Petar Popovski, Zheng-Hua Tan, Cedomir Stefanovic:
Design of AoI-Aware 5G Uplink Scheduler UsingReinforcement Learning. CoRR abs/2110.09995 (2021) - [i39]Andreas Jonas Fuglsig, Jan Østergaard, Jesper Jensen, Lars Søndergaard Bertelsen, Peter Mariager, Zheng-Hua Tan:
Joint Far- and Near-End Speech Intelligibility Enhancement based on the Approximated Speech Intelligibility Index. CoRR abs/2111.07759 (2021) - [i38]Iván López-Espejo, Zheng-Hua Tan, John H. L. Hansen, Jesper Jensen:
Deep Spoken Keyword Spotting: An Overview. CoRR abs/2111.10592 (2021) - 2020
- [j59]Zheng-Hua Tan, Achintya Kumar Sarkar
, Najim Dehak
:
rVAD: An unsupervised segment-based robust voice activity detection method. Comput. Speech Lang. 59: 1-21 (2020) - [j58]Bhaskar D. Rao, Zheng-Hua Tan
:
Highlights From the Machine Learning for Signal Processing Technical Committee [In the Spotlight]. IEEE Signal Process. Mag. 37(6): 200-202 (2020) - [j57]Morten Kolbæk
, Zheng-Hua Tan
, Søren Holdt Jensen, Jesper Jensen:
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 28: 825-838 (2020) - [j56]Iván López-Espejo
, Zheng-Hua Tan
, Jesper Jensen
:
Improved External Speaker-Robust Keyword Spotting for Hearing Assistive Devices. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1233-1247 (2020) - [j55]Juan M. Martín-Doñas
, Jesper Jensen
, Zheng-Hua Tan
, Angel M. Gomez
, Antonio M. Peinado
:
Online Multichannel Speech Enhancement Based on Recursive EM and DNN-Based Speech Presence Estimation. IEEE ACM Trans. Audio Speech Lang. Process. 28: 3080-3094 (2020) - [j54]Xiaoxu Li
, Dongliang Chang
, Zhanyu Ma
, Zheng-Hua Tan
, Jing-Hao Xue
, Jie Cao, Jingyi Yu, Jun Guo
:
OSLNet: Deep Small-Sample Classification With an Orthogonal Softmax Layer. IEEE Trans. Image Process. 29: 6482-6495 (2020) - [j53]Miklas Strøm Kristoffersen
, Sven Ewan Shepstone
, Zheng-Hua Tan
:
The Importance of Context When Recommending TV Content: Dataset and Algorithms. IEEE Trans. Multim. 22(6): 1531-1541 (2020) - [c94]Cristian J. Vaca-Rubio
, Pablo Ramirez-Espinosa
, Robin Jess Williams
, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski
:
A Primer on Large Intelligent Surface (LIS) for Wireless Sensing in an Industrial Setting. CrownCom 2020: 126-138 - [c93]Iván López-Espejo
, Zheng-Hua Tan, Jesper Jensen:
Exploring Filterbank Learning for Keyword Spotting. EUSIPCO 2020: 331-335 - [c92]Saeid Samizade, Zheng-Hua Tan, Chao Shen, Xiaohong Guan:
Adversarial Example Detection by Classification for Deep Speech Recognition. ICASSP 2020: 3102-3106 - [c91]Poul Hoang
, Zheng-Hua Tan, Thomas Lunner, Jan Mark de Haan, Jesper Jensen:
Maximum Likelihood Estimation of the Interference-Plus-Noise Cross Power Spectral Density Matrix for Own Voice Retrieval. ICASSP 2020: 6939-6943 - [c90]Zeyu Song, Dongliang Chang, Zhanyu Ma, Xiaoxu Li, Zheng-Hua Tan:
CC-Loss: Channel Correlation Loss for Image Classification. ICPR 2020: 7601-7608 - [c89]Daniel Michelsanti
, Olga Slizovskaia
, Gloria Haro
, Emilia Gómez, Zheng-Hua Tan, Jesper Jensen:
Vocoder-Based Speech Synthesis from Silent Videos. INTERSPEECH 2020: 3530-3534 - [i37]Miklas S. Kristoffersen
, Sven Ewan Shepstone, Zheng-Hua Tan:
Context-Aware Recommendations for Televisions Using Deep Embeddings with Relaxed N-Pairs Loss Objective. CoRR abs/2002.01554 (2020) - [i36]Daniel Michelsanti, Olga Slizovskaia, Gloria Haro, Emilia Gómez, Zheng-Hua Tan, Jesper Jensen:
Vocoder-Based Speech Synthesis from Silent Videos. CoRR abs/2004.02541 (2020) - [i35]Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Zheng-Hua Tan, Jing-Hao Xue, Jie Cao, Jingyi Yu, Jun Guo:
OSLNet: Deep Small-Sample Classification with an Orthogonal Softmax Layer. CoRR abs/2004.09033 (2020) - [i34]Achintya Kumar Sarkar, Zheng-Hua Tan:
On Bottleneck Features for Text-Dependent Speaker Verification Using X-vectors. CoRR abs/2005.07383 (2020) - [i33]Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Exploring Filterbank Learning for Keyword Spotting. CoRR abs/2006.00217 (2020) - [i32]Cristian J. Vaca-Rubio
, Pablo Ramirez-Espinosa, Robin Jess Williams
, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
A Primer on Large Intelligent Surface (LIS) for Wireless Sensing in an Industrial Setting. CoRR abs/2006.06563 (2020) - [i31]Achintya Kumar Sarkar, Himangshu Sarma, Priyanka Dwivedi, Zheng-Hua Tan:
Data augmentation enhanced speaker enrollment for text-dependent speaker verification. CoRR abs/2007.08004 (2020) - [i30]Md. Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent:
UIAI System for Short-Duration Speaker Verification Challenge 2020. CoRR abs/2007.13118 (2020) - [i29]