default search action

combined dblp search
author search
venue search
publication search

ask others

Zheng-Hua Tan

> Home > Persons

Person information

affiliation: Aalborg University, Department of Electronic Systems, Denmark
affiliation (PhD 1999): Shanghai Jiao Tong University, China

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2026
[j89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/access/VillaniCTOJ26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/VillaniCTOJ26
Filippo Villani, Wai-Yip Chan, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen:
Near-End Listening Enhancement via Combined Noise-Dependent Spectro-Temporal Energy Reallocation. IEEE Access 14: 43795-43812 (2026)
[j88]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/ZakariasHT26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/ZakariasHT26
Gustav Wagner Zakarias, Lars Kai Hansen, Zheng-Hua Tan:
BiSSL: Enhancing the Alignment Between Self-Supervised Pretraining and Downstream Fine-Tuning via Bilevel Optimization. Trans. Mach. Learn. Res. 2026 (2026)
[i89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2601-09448
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2601-09448
Ioannis Stylianou, Jon Francombe, Pablo Martínez-Nuevo, Sven Ewan Shepstone, Zheng-Hua Tan:
Population-Aligned Audio Reproduction With LLM-Based Equalizers. CoRR abs/2601.09448 (2026)
[i88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2602-16253
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2602-16253
Kevin Wilkinghoff, Keisuke Imoto, Zheng-Hua Tan:
How Much Does Machine Identity Matter in Anomalous Sound Detection at Test Time? CoRR abs/2602.16253 (2026)
[i87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2602-18777
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2602-18777
Kevin Wilkinghoff, Gordon Wichern, Jonathan Le Roux, Zheng-Hua Tan:
Mind the Gap: Detecting Cluster Exits for Robust Local Density-Based Score Normalization in Anomalous Sound Detection. CoRR abs/2602.18777 (2026)
2025
[j87]
- view
  authority control:
- export record
  dblp key:
  - journals/csysl/AhdabTL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csysl/AhdabTL25
Mohamad Al Ahdab, Zheng-Hua Tan, John Leth:
Distributions and Direct Parametrization for Stable Stochastic State-Space Models. IEEE Control. Syst. Lett. 9: 444-449 (2025)
[j86]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/XieT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/XieT25
Yuying Xie, Zheng-Hua Tan:
A survey of deep learning for complex speech spectrograms. Speech Commun. 175: 103319 (2025)
[c135]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KuhneKJBGBT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KuhneKJBGBT25
Nikolai Lund Kühne, Astrid H. F. Kitchena, Marie S. Jensen, Mikkel S. L. Brøndt, Martin Gonzalez, Christophe A. N. Biscio, Zheng-Hua Tan:
Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models. ICASSP 2025: 1-5
[c134]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LydakiT0G25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LydakiT0G25
Eleftheria Lydaki, Zheng-Hua Tan, Jesper Jensen, Meng Guo:
Deep Feedback Cancellation for Hearing Aids with Improved System Stability and Sound Quality. ICASSP 2025: 1-5
[c133]
- view
- export record
  dblp key:
  - conf/icml/AhdabLT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/AhdabLT25
Mohamad Al Ahdab, John Leth, Zheng-Hua Tan:
Optimal Sensor Scheduling and Selection for Continuous-Discrete Kalman Filtering with Auxiliary Dynamics. ICML 2025
[c132]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KuhneO0T25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KuhneO0T25
Nikolai Lund Kühne, Jan Østergaard, Jesper Jensen, Zheng-Hua Tan:
xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement. INTERSPEECH 2025
[c131]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VillaniCTO025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VillaniCTO025
Filippo Villani, Wai-Yip Chan, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen:
Analysis and Extension of a Near-End Listening Enhancement Method Based on Long-Term Fractile Noise Statistics. INTERSPEECH 2025
[c130]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YadavTT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YadavTT25
Sarthak Yadav, Sergios Theodoridis, Zheng-Hua Tan:
AxLSTMs: learning self-supervised audio representations with xLSTMs. INTERSPEECH 2025
[c129]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/YadavTT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/YadavTT25
Sarthak Yadav, Sergios Theodoridis, Zheng-Hua Tan:
AudioMAE++: Learning Better Masked Audio Representations with Swiglu FFNS. MLSP 2025: 1-6
[c128]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/BovbjergOJWT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/BovbjergOJWT25
Holger Severin Bovbjerg, Jan Østergaard, Jesper Jensen, Shinji Watanabe, Zheng-Hua Tan:
Learning Robust Spatial Representations from Binaural Audio through Feature Distillation. WASPAA 2025: 1-5
[i86]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-03184
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-03184
Holger Severin Bovbjerg, Jan Østergaard, Jesper Jensen, Zheng-Hua Tan:
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining. CoRR abs/2501.03184 (2025)
[i85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-03523
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-03523
Achintya kr. Sarkar, Priyanka Dwivedi, Zheng-Hua Tan:
Vocal Tract Length Warped Features for Spoken Keyword Spotting. CoRR abs/2501.03523 (2025)
[i84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-06146
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-06146
Nikolai Lund Kühne, Jan Østergaard, Jesper Jensen, Zheng-Hua Tan:
xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement. CoRR abs/2501.06146 (2025)
[i83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-10435
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-10435
Kevin Wilkinghoff, Takuya Fujimura, Keisuke Imoto, Jonathan Le Roux, Zheng-Hua Tan, Tomoki Toda:
Handling Domain Shifts for Anomalous Sound Detection: A Review of DCASE-Related Work. CoRR abs/2503.10435 (2025)
[i82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-14177
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-14177
Mohamad Al Ahdab, Zheng-Hua Tan, John Leth:
Distributions and Direct Parametrization for Stable Stochastic State-Space Models. CoRR abs/2503.14177 (2025)
[i81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-08694
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-08694
Yuying Xie, Zheng-Hua Tan:
A Survey of Deep Learning for Complex Speech Spectrograms. CoRR abs/2505.08694 (2025)
[i80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-00966
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-00966
Nikolai Lund Kühne, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan:
MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement. CoRR abs/2507.00966 (2025)
[i79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-10464
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-10464
Sarthak Yadav, Sergios Theodoridis, Zheng-Hua Tan:
AudioMAE++: learning better masked audio representations with SwiGLU FFNs. CoRR abs/2507.10464 (2025)
[i78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-11240
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-11240
Mohamad Al Ahdab, John Leth, Zheng-Hua Tan:
Optimal Sensor Scheduling and Selection for Continuous-Discrete Kalman Filtering with Auxiliary Dynamics. CoRR abs/2507.11240 (2025)
[i77]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-20914
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-20914
Holger Severin Bovbjerg, Jan Østergaard, Jesper Jensen, Shinji Watanabe, Zheng-Hua Tan:
Learning Robust Spatial Representations from Binaural Audio through Feature Distillation. CoRR abs/2508.20914 (2025)
[i76]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-13927
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-13927
Kevin Wilkinghoff, Zheng-Hua Tan:
DSpAST: Disentangled Representations for Spatial Audio Reasoning with Large Language Models. CoRR abs/2509.13927 (2025)
[i75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-18691
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-18691
Sarthak Yadav, Sergios Theodoridis, Zheng-Hua Tan:
An overview of neural architectures for self-supervised audio representation learning from masked spectrograms. CoRR abs/2509.18691 (2025)
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-24478
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-24478
Lasse Borgholt, Jakob D. Havtorn, Christian Igel, Lars Maaløe, Zheng-Hua Tan:
A Text-To-Text Alignment Algorithm for Better Evaluation of Modern Speech Recognition Systems. CoRR abs/2509.24478 (2025)
[i73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-01958
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-01958
Nikolai Lund Kühne, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan:
Exploring Resolution-Wise Shared Attention in Hybrid Mamba-U-Nets for Improved Cross-Corpus Speech Enhancement. CoRR abs/2510.01958 (2025)
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-15432
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-15432
Kevin Wilkinghoff, Alessia Cornaggia-Urrigshardt, Zheng-Hua Tan:
Quantization-Based Score Calibration for Few-Shot Keyword Spotting with Dynamic Time Warping in Noisy Environments. CoRR abs/2510.15432 (2025)
[i71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-17281
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-17281
Ioannis Stylianou, Achintya kr. Sarkar, Nauman Dawalatabad, James R. Glass, Zheng-Hua Tan:
LibriVAD: A Scalable Open Dataset with Deep Learning Benchmarks for Voice Activity Detection. CoRR abs/2512.17281 (2025)
2024
[j85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/access/FuglsigTBJLO24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/FuglsigTBJLO24
Andreas Jonas Fuglsig, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jesper Jensen, Jens Christian Lindof, Jan Østergaard:
Joint Far- and Near-End Speech and Listening Enhancement With Minimum Processing. IEEE Access 12: 119983-120004 (2024)
[j84]
- view
  authority control:
- export record
  dblp key:
  - journals/ijcim/RonschLMXPT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijcim/RonschLMXPT24
Georg Ørnskov Rønsch, Iván López-Espejo, Daniel Michelsanti, Yuying Xie, Petar Popovski, Zheng-Hua Tan:
Utilization of acoustic signals with generative Gaussian and autoencoder modeling for condition-based maintenance of injection moulds. Int. J. Comput. Integr. Manuf. 37(4): 438-453 (2024)
[j83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/spl/GonzalezTOJAM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/GonzalezTOJAM24
Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems. IEEE Signal Process. Lett. 31: 2225-2229 (2024)
[j82]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/ZhangDTWM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ZhangDTWM24
Yiming Zhang, Ruoyi Du, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma:
Generating Accurate and Diverse Audio Captions Through Variational Autoencoder Framework. IEEE Signal Process. Lett. 31: 2520-2524 (2024)
[j81]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/PedersenJTJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/PedersenJTJ24
Mathias Bach Pedersen, Søren Holdt Jensen, Zheng-Hua Tan, Jesper Jensen:
Data-Driven Non-Intrusive Speech Intelligibility Prediction Using Speech Presence Probability. IEEE ACM Trans. Audio Speech Lang. Process. 32: 55-67 (2024)
[j80]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LeerJTOB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LeerJTOB24
Peter Leer, Jesper Jensen, Zheng-Hua Tan, Jan Østergaard, Lars Bramsløw:
How to Train Your Ears: Auditory-Model Emulation for Large-Dynamic-Range Inputs and Mild-to-Severe Hearing Losses. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2006-2020 (2024)
[j79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/GonzalezTOJAM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GonzalezTOJAM24
Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
Investigating the Design Space of Diffusion Models for Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4486-4500 (2024)
[c127]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/EringisLTWP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/EringisLTWP24
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafael Wisniewski, Mihály Petreczky:
PAC-Bayes Generalisation Bounds for Dynamical Systems including Stable RNNs. AAAI 2024: 11901-11909
[c126]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/XieKRTH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/XieKRTH24
Yuying Xie, Michael Kuhlmann, Frederik Rautenberg, Zheng-Hua Tan, Reinhold Haeb-Umbach:
Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder. EUSIPCO 2024: 436-440
[c125]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/Tanveer0TO24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/Tanveer0TO24
M. Asjid Tanveer, Jesper Jensen, Zheng-Hua Tan, Jan Østergaard:
Envelope Based Deep Source Separation and EEG Auditory Attention Decoding for Speech and Music. EUSIPCO 2024: 872-876
[c124]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FuglsigJTBLO24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FuglsigJTBLO24
Andreas Jonas Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard:
Joint Minimum Processing Beamforming and Near-End Listening Enhancement. ICASSP Workshops 2024: 485-489
[c123]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Bovbjerg0OT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Bovbjerg0OT24
Holger Severin Bovbjerg, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan:
Self-Supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions. ICASSP 2024: 10126-10130
[c122]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GonzalezTO0AM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GonzalezTO0AM24
Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler. ICASSP 2024: 10431-10435
[c121]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/YadavTHT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YadavTHT24
Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan:
Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners. ICLR 2024
[c120]
- view
- export record
  dblp key:
  - conf/icml/EringisLTWP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/EringisLTWP24
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
PAC-Bayesian Error Bound, via Rényi Divergence, for a Class of Linear Time-Invariant State-Space Models. ICML 2024: 12560-12587
[c119]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/XieAT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/XieAT24
Yuying Xie, Thomas Arildsen, Zheng-Hua Tan:
Complex Recurrent Variational Autoencoder for Speech Resynthesis and Enhancement. IJCNN 2024: 1-7
[c118]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YadavT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YadavT24
Sarthak Yadav, Zheng-Hua Tan:
Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations. INTERSPEECH 2024
[c117]
- view
  authority control:
- export record
  dblp key:
  - conf/iwaenc/VillaniCTOJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/VillaniCTOJ24
Filippo Villani, Wai-Yip Chan, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen:
Near-End Listening Enhancement Using a Noise-Robust Linear Time-Invariant Filter. IWAENC 2024: 444-448
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-18560
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-18560
Jacob Mørk, Holger Severin Bovbjerg, Gergely Kiss, Zheng-Hua Tan:
Noise-Robust Keyword Spotting through Self-supervised Pretraining. CoRR abs/2403.18560 (2024)
[i69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02178
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02178
Sarthak Yadav, Zheng-Hua Tan:
Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations. CoRR abs/2406.02178 (2024)
[i68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-06295
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-06295
Yiming Zhang, Xuenan Xu, Ruoyi Du, Haohe Liu, Yuan Dong, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma:
Zero-Shot Audio Captioning Using Soft and Hard Prompts. CoRR abs/2406.06295 (2024)
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-16568
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-16568
Sarthak Yadav, Sergios Theodoridis, Zheng-Hua Tan:
Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs. CoRR abs/2408.16568 (2024)
[i66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-02387
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-02387
Gustav Wagner Zakarias, Lars Kai Hansen, Zheng-Hua Tan:
BiSSL: Bilevel Optimization for Self-Supervised Pre-Training and Fine-Tuning. CoRR abs/2410.02387 (2024)
2023
[j78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/automatica/EringisLTWP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/automatica/EringisLTWP23
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
Explicit construction of the minimum error variance estimator for stochastic LTI-ss systems. Autom. 153: 111018 (2023)
[j77]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/LopezEspejoECTJ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/LopezEspejoECTJ23
Iván López-Espejo, Amin Edraki, Wai-Yip Chan, Zheng-Hua Tan, Jesper Jensen:
On the deficiency of intelligibility metrics as proxies for subjective intelligibility. Speech Commun. 150: 9-22 (2023)
[j76]
- view
  authority control:
- export record
  dblp key:
  - journals/spm/GannotTHCWTKD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spm/GannotTHCWTKD23
Sharon Gannot, Zheng-Hua Tan, Martin Haardt, Nancy F. Chen, Hoi-To Wai, Ivan Tashev, Walter Kellermann, Justin Dauwels:
Data Science Education: The Signal Processing Perspective [SP Education]. IEEE Signal Process. Mag. 40(7): 89-93 (2023)
[j75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/Fuglsig0TBLO23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/Fuglsig0TBLO23
Andreas Jonas Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard:
Minimum Processing Near-End Listening Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2233-2245 (2023)
[j74]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangYDTWMD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangYDTWMD23
Yiming Zhang, Hong Yu, Ruoyi Du, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma, Yuan Dong:
ACTUAL: Audio Captioning With Caption Feature Space Regularization. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2643-2657 (2023)
[j73]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/MaLXYXTXG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/MaLXYXTXG23
Zhanyu Ma, Xiaoou Lu, Jiyang Xie, Zhen Yang, Jing-Hao Xue, Zheng-Hua Tan, Bo Xiao, Jun Guo:
On the Comparisons of Decorrelation Approaches for Non-Gaussian Neutral Vector Variables. IEEE Trans. Neural Networks Learn. Syst. 34(4): 1823-1837 (2023)
[c116]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/XieAT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/XieAT23
Yuying Xie, Thomas Arildsen, Zheng-Hua Tan:
Improved Disentangled Speech Representations Using Contrastive Learning in Factorized Hierarchical Variational Autoencoder. EUSIPCO 2023: 1330-1334
[c115]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BovbjergT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BovbjergT23
Holger Severin Bovbjerg, Zheng-Hua Tan:
Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining. ICASSP Workshops 2023: 1-5
[c114]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LopezEspejoSTJH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LopezEspejoSTJH23
Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen:
Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting. ICASSP 2023: 1-5
[c113]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MichelsantiTRJ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MichelsantiTRJ23
Daniel Michelsanti, Zheng-Hua Tan, Sergi Rotger-Griful, Jesper Jensen:
A Vision-Assisted Hearing Aid System Based on Deep Learning. ICASSP Workshops 2023: 1-4
[c112]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VacaRubioRKTC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VacaRubioRKTC23
Cristian J. Vaca-Rubio, Pablo Ramirez-Espinosa, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho:
Radio Sensing with Large Intelligent Surface for 6G. ICASSP 2023: 1-5
[c111]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MontesinosMHT023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MontesinosMHT023
Juan Felipe Montesinos, Daniel Michelsanti, Gloria Haro, Zheng-Hua Tan, Jesper Jensen:
Speech inpainting: Context-based speech synthesis guided by video. INTERSPEECH 2023: 4459-4463
[i65]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-16816
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-16816
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafael Wisniewski, Mihály Petreczky:
PAC-Bayesian bounds for learning LTI-ss systems with input from empirical loss. CoRR abs/2303.16816 (2023)
[i64]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-00489
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-00489
Juan F. Montesinos, Daniel Michelsanti, Gloria Haro, Zheng-Hua Tan, Jesper Jensen:
Speech inpainting: Context-based speech synthesis guided by video. CoRR abs/2306.00489 (2023)
[i63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-00561
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-00561
Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan:
Masked Autoencoders with Multi-Window Attention Are Better Audio Learners. CoRR abs/2306.00561 (2023)
[i62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-11243
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-11243
Andreas Jonas Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard:
Joint Minimum Processing Beamforming and Near-end Listening Enhancement. CoRR abs/2309.11243 (2023)
[i61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-02683
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-02683
Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler. CoRR abs/2312.02683 (2023)
[i60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-04370
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-04370
Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
Investigating the Design Space of Diffusion Models for Speech Enhancement. CoRR abs/2312.04370 (2023)
[i59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-09793
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09793
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNs. CoRR abs/2312.09793 (2023)
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-16613
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-16613
Holger Severin Bovbjerg, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan:
Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions. CoRR abs/2312.16613 (2023)
2022
[j72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/access/Lopez-EspejoTHJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/Lopez-EspejoTHJ22
Iván López-Espejo, Zheng-Hua Tan, John H. L. Hansen, Jesper Jensen:
Deep Spoken Keyword Spotting: An Overview. IEEE Access 10: 4169-4199 (2022)
[j71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/access/HoangTHJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/HoangTHJ22
Poul Hoang, Zheng-Hua Tan, Jan Mark de Haan, Jesper Jensen:
The Minimum Overlap-Gap Algorithm for Speech Enhancement. IEEE Access 10: 14698-14716 (2022)
[j70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/access/DideriksenDT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/DideriksenDT22
Bjørn Uttrup Dideriksen, Kristoffer Derosche, Zheng-Hua Tan:
iVAE-GAN: Identifiable VAE-GAN Models for Latent Representation Learning. IEEE Access 10: 48405-48418 (2022)
[j69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/access/PedersenAJTJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/PedersenAJTJ22
Mathias Bach Pedersen, Asger Heidemann Andersen, Søren Holdt Jensen, Zheng-Hua Tan, Jesper Jensen:
Training Data-Driven Speech Intelligibility Predictors on Heterogeneous Listening Test Data. IEEE Access 10: 66175-66189 (2022)
[j68]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/XieMLZXTG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/XieMLZXTG22
Jiyang Xie, Zhanyu Ma, Jianjun Lei, Guoqiang Zhang, Jing-Hao Xue, Zheng-Hua Tan, Jun Guo:
Advanced Dropout: A Model-Free Methodology for Bayesian Dropout Optimization. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 4605-4625 (2022)
[j67]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/HoangHTJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HoangHTJ22
Poul Hoang, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
Multichannel Speech Enhancement With Own Voice-Based Interfering Speech Suppression for Hearing Assistive Devices. IEEE ACM Trans. Audio Speech Lang. Process. 30: 706-720 (2022)
[c110]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/Vaca-RubioSPCTS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/Vaca-RubioSPCTS22
Cristian J. Vaca-Rubio, Dariush Salami, Petar Popovski, Elisabeth de Carvalho, Zheng-Hua Tan, Stephan Sigg:
User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars. EUSIPCO 2022: 1916-1920
[c109]
- view
  authority control:
- export record
  dblp key:
  - conf/iberspeech/LopezEspejoTJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iberspeech/LopezEspejoTJ22
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
An Experimental Study on Light Speech Features for Small-Footprint Keyword Spotting. IberSPEECH 2022: 131-135
[c108]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FuglsigOJBMT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FuglsigOJBMT22
Andreas Jonas Fuglsig, Jan Østergaard, Jesper Jensen, Lars Søndergaard Bertelsen, Peter Mariager, Zheng-Hua Tan:
Joint Far- and Near-End Speech Intelligibility Enhancement Based on the Approximated Speech Intelligibility Index. ICASSP 2022: 7752-7756
[c107]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuZGFDZHXTWQLYM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuZGFDZHXTWQLYM22
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160
[c106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Larsen0T22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Larsen0T22
Claus M. Larsen, Peter Koch, Zheng-Hua Tan:
Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic Delay. INTERSPEECH 2022: 3759-3763
[c105]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/Vaca-RubioPMGTC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/Vaca-RubioPMGTC22
Cristian J. Vaca-Rubio, Roberto Pereira, Xavier Mestre, David Gregoratti, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
Floor Map Reconstruction Through Radio Sensing and Learning by a Large Intelligent Surface. MLSP 2022: 1-6
[c104]
- view
  authority control:
- export record
  dblp key:
  - conf/vtc/WuTS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/vtc/WuTS22
Chien-Cheng Wu, Zheng-Hua Tan, Cedomir Stefanovic:
AoI and Throughput Optimization for Hybrid Traffic in Cellular Uplink Using Reinforcement Learning. VTC Spring 2022: 1-6
[i57]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-06426
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-06426
Achintya Kumar Sarkar, Zheng-Hua Tan:
On Training Targets and Activation Functions for Deep Representation Learning in Text-Dependent Speaker Verification. CoRR abs/2201.06426 (2022)
[i56]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-03647
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-03647
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022)
[i55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-10321
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-10321
Cristian J. Vaca-Rubio, Dariush Salami, Petar Popovski, Elisabeth de Carvalho, Zheng-Hua Tan, Stephan Sigg:
User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars. CoRR abs/2205.10321 (2022)
[i54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-10750
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-10750
Cristian J. Vaca-Rubio, Roberto Pereira, Xavier Mestre, David Gregoratti, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
Floor Map Reconstruction Through Radio Sensing and Learning By a Large Intelligent Surface. CoRR abs/2206.10750 (2022)
[i53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-01703
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-01703
Holger Severin Bovbjerg, Zheng-Hua Tan:
Improving Label-Deficient Keyword Spotting Using Self-Supervised Pretraining. CoRR abs/2210.01703 (2022)
[i52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-17154
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-17154
Andreas Jonas Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard:
Minimum Processing Near-end Listening Enhancement. CoRR abs/2210.17154 (2022)
[i51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-01621
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01621
Christian Heider Nielsen, Zheng-Hua Tan:
Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise. CoRR abs/2211.01621 (2022)
[i50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-08191
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-08191
Yuying Xie, Thomas Arildsen, Zheng-Hua Tan:
Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder. CoRR abs/2211.08191 (2022)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-10565
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-10565
Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen:
Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise. CoRR abs/2211.10565 (2022)
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-14838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-14838
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
PAC-Bayesian-Like Error Bound for a Class of Linear Time-Invariant Stochastic State-Space Models. CoRR abs/2212.14838 (2022)
2021
[j66]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/SarkarT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/SarkarT21
Achintya Kumar Sarkar, Zheng-Hua Tan:
Self-segmentation of pass-phrase utterances for deep feature learning in text-dependent speaker verification. Comput. Speech Lang. 70: 101229 (2021)
[j65]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/LiCMTXCG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/LiCMTXCG21
Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Zheng-Hua Tan, Jing-Hao Xue, Jie Cao, Jun Guo:
Deep InterBoost networks for small-sample image classification. Neurocomputing 456: 492-503 (2021)
[j64]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ojcs/Vaca-RubioRKTCP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ojcs/Vaca-RubioRKTCP21
Cristian J. Vaca-Rubio, Pablo Ramirez-Espinosa, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
Assessing Wireless Sensing Potential With Large Intelligent Surfaces. IEEE Open J. Commun. Soc. 2: 934-947 (2021)
[j63]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/SarkarT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/SarkarT21
Achintya Kumar Sarkar, Zheng-Hua Tan:
Vocal Tract Length Perturbation for Text-Dependent Speaker Verification With Autoregressive Prediction Coding. IEEE Signal Process. Lett. 28: 364-368 (2021)
[j62]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/MichelsantiTZXY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/MichelsantiTZXY21
Daniel Michelsanti, Zheng-Hua Tan, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu, Jesper Jensen:
An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1368-1396 (2021)
[j61]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/Lopez-EspejoTJ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/Lopez-EspejoTJ21
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
A Novel Loss Function and Training Strategy for Noise-Robust Keyword Spotting. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2254-2266 (2021)
[c103]
- view
  authority control:
- export record
  dblp key:
  - conf/5gwf/WuPTS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/5gwf/WuPTS21
Chien-Cheng Wu, Petar Popovski, Zheng-Hua Tan, Cedomir Stefanovic:
Design of AoI-Aware 5G Uplink Scheduler Using Reinforcement Learning. 5GWF 2021: 176-181
[c102]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/RaoFHXJHJXWWTBY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/RaoFHXJHJXWWTBY21
Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang:
Conferencingspeech Challenge: Towards Far-Field Multi-Channel Speech Enhancement for Video Conferencing. ASRU 2021: 679-686
[c101]
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/EringisLTWEP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/EringisLTWEP21
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Alireza Fakhrizadeh Esfahani, Mihály Petreczky:
PAC-Bayesian theory for stochastic LTI systems. CDC 2021: 6626-6633
[c100]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoangTH021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoangTH021
Poul Hoang, Zheng-Hua Tan, Jan Mark de Haan, Jesper Jensen:
Joint Maximum Likelihood Estimation of Power Spectral Densities and Relative Acoustic Transfer Functions for Acoustic Beamforming. ICASSP 2021: 6119-6123
[c99]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MorroneMT021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MorroneMT021
Giovanni Morrone, Daniel Michelsanti, Zheng-Hua Tan, Jesper Jensen:
Audio-Visual Speech Inpainting with Deep Learning. ICASSP 2021: 6653-6657
[c98]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/NielsenOJT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/NielsenOJT21
Morten Østergaard Nielsen, Jan Østergaard, Jesper Jensen, Zheng-Hua Tan:
Compression of DNNs Using Magnitude Pruning and Nonlinear Information Bottleneck Training. MLSP 2021: 1-6
[c97]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/XieAT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/XieAT21
Yuying Xie, Thomas Arildsen, Zheng-Hua Tan:
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective. MLSP 2021: 1-6
[c96]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SahidullahSVLSK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SahidullahSVLSK21
Md. Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent:
UIAI System for Short-Duration Speaker Verification Challenge 2020. SLT 2021: 323-329
[c95]
- view
  authority control:
- export record
  dblp key:
  - conf/spawc/KalorMCTP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/spawc/KalorMCTP21
Anders E. Kalør, Daniel Michelsanti, Federico Chiariotti, Zheng-Hua Tan, Petar Popovski:
Remote Anomaly Detection in Industry 4.0 Using Resource-Constrained Devices. SPAWC 2021: 251-255
[i47]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-02074
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-02074
Achintya Kumar Sarkar, Md. Sahidullah, Zheng-Hua Tan:
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for Text-Dependent Speaker Verification. CoRR abs/2102.02074 (2021)
[i46]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-12866
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-12866
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Alireza Fakhrizadeh Esfahani, Mihály Petreczky:
PAC-Bayesian theory for stochastic LTI systems. CoRR abs/2103.12866 (2021)
[i45]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-14882
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-14882
Morten Kolbæk, Zheng-Hua Tan, Søren Holdt Jensen, Jesper Jensen:
On TasNet for Low-Latency Single-Speaker Speech Enhancement. CoRR abs/2103.14882 (2021)
[i44]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-00960
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-00960
Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang:
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing. CoRR abs/2104.00960 (2021)
[i43]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-05481
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-05481
Max Væhrens, Andreas Jonas Fuglsig, Anders Post Jacobsen, Nicolai Almskou Rasmussen, Victor Mølbach Nissen, Joachim Roland Hejslet, Zheng-Hua Tan:
Improvement of Noise-Robust Single-Channel Voice Activity Detection with Spatial Pre-processing. CoRR abs/2104.05481 (2021)
[i42]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-02384
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-02384
Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
Optimal Prediction of Unmeasured Output from Measurable Outputs In LTI Systems. CoRR abs/2109.02384 (2021)
[i41]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05757
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05757
Anders E. Kalør, Daniel Michelsanti, Federico Chiariotti, Zheng-Hua Tan, Petar Popovski:
Remote Anomaly Detection in Industry 4.0 Using Resource-Constrained Devices. CoRR abs/2110.05757 (2021)
[i40]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-09995
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-09995
Chien-Cheng Wu, Petar Popovski, Zheng-Hua Tan, Cedomir Stefanovic:
Design of AoI-Aware 5G Uplink Scheduler UsingReinforcement Learning. CoRR abs/2110.09995 (2021)
[i39]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-07759
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-07759
Andreas Jonas Fuglsig, Jan Østergaard, Jesper Jensen, Lars Søndergaard Bertelsen, Peter Mariager, Zheng-Hua Tan:
Joint Far- and Near-End Speech Intelligibility Enhancement based on the Approximated Speech Intelligibility Index. CoRR abs/2111.07759 (2021)
[i38]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-10592
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-10592
Iván López-Espejo, Zheng-Hua Tan, John H. L. Hansen, Jesper Jensen:
Deep Spoken Keyword Spotting: An Overview. CoRR abs/2111.10592 (2021)
2020
[j60]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/TanSD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/TanSD20
Zheng-Hua Tan, Achintya Kumar Sarkar, Najim Dehak:
rVAD: An unsupervised segment-based robust voice activity detection method. Comput. Speech Lang. 59: 1-21 (2020)
[j59]
- view
  authority control:
- export record
  dblp key:
  - journals/spm/RaoT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spm/RaoT20
Bhaskar D. Rao, Zheng-Hua Tan:
Highlights From the Machine Learning for Signal Processing Technical Committee [In the Spotlight]. IEEE Signal Process. Mag. 37(6): 200-202 (2020)
[j58]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KolbaekTJJ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KolbaekTJJ20
Morten Kolbæk, Zheng-Hua Tan, Søren Holdt Jensen, Jesper Jensen:
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 28: 825-838 (2020)
[j57]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/Lopez-EspejoTJ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/Lopez-EspejoTJ20
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Improved External Speaker-Robust Keyword Spotting for Hearing Assistive Devices. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1233-1247 (2020)
[j56]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/Martin-Donas0TG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/Martin-Donas0TG20
Juan M. Martín-Doñas, Jesper Jensen, Zheng-Hua Tan, Angel M. Gomez, Antonio M. Peinado:
Online Multichannel Speech Enhancement Based on Recursive EM and DNN-Based Speech Presence Estimation. IEEE ACM Trans. Audio Speech Lang. Process. 28: 3080-3094 (2020)
[j55]
- view
  authority control:
- export record
  dblp key:
  - journals/tip/LiCMTXCYG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tip/LiCMTXCYG20
Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Zheng-Hua Tan, Jing-Hao Xue, Jie Cao, Jingyi Yu, Jun Guo:
OSLNet: Deep Small-Sample Classification With an Orthogonal Softmax Layer. IEEE Trans. Image Process. 29: 6482-6495 (2020)
[j54]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/KristoffersenST20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/KristoffersenST20
Miklas Strøm Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
The Importance of Context When Recommending TV Content: Dataset and Algorithms. IEEE Trans. Multim. 22(6): 1531-1541 (2020)
[c94]
- view
  authority control:
- export record
  dblp key:
  - conf/crowncom/Vaca-RubioRJKTC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/crowncom/Vaca-RubioRJKTC20
Cristian J. Vaca-Rubio, Pablo Ramirez-Espinosa, Robin Jess Williams, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
A Primer on Large Intelligent Surface (LIS) for Wireless Sensing in an Industrial Setting. CrownCom 2020: 126-138
[c93]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/Lopez-EspejoTJ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/Lopez-EspejoTJ20
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Exploring Filterbank Learning for Keyword Spotting. EUSIPCO 2020: 331-335
[c92]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SamizadeT0G20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SamizadeT0G20
Saeid Samizade, Zheng-Hua Tan, Chao Shen, Xiaohong Guan:
Adversarial Example Detection by Classification for Deep Speech Recognition. ICASSP 2020: 3102-3106
[c91]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoangTLH020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoangTLH020
Poul Hoang, Zheng-Hua Tan, Thomas Lunner, Jan Mark de Haan, Jesper Jensen:
Maximum Likelihood Estimation of the Interference-Plus-Noise Cross Power Spectral Density Matrix for Own Voice Retrieval. ICASSP 2020: 6939-6943
[c90]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/SongCMLT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/SongCMLT20
Zeyu Song, Dongliang Chang, Zhanyu Ma, Xiaoxu Li, Zheng-Hua Tan:
CC-Loss: Channel Correlation Loss for Image Classification. ICPR 2020: 7601-7608
[c89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MichelsantiSHGT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MichelsantiSHGT20
Daniel Michelsanti, Olga Slizovskaia, Gloria Haro, Emilia Gómez, Zheng-Hua Tan, Jesper Jensen:
Vocoder-Based Speech Synthesis from Silent Videos. INTERSPEECH 2020: 3530-3534
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-01554
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-01554
Miklas S. Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
Context-Aware Recommendations for Televisions Using Deep Embeddings with Relaxed N-Pairs Loss Objective. CoRR abs/2002.01554 (2020)
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-02541
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-02541
Daniel Michelsanti, Olga Slizovskaia, Gloria Haro, Emilia Gómez, Zheng-Hua Tan, Jesper Jensen:
Vocoder-Based Speech Synthesis from Silent Videos. CoRR abs/2004.02541 (2020)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-09033
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-09033
Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Zheng-Hua Tan, Jing-Hao Xue, Jie Cao, Jingyi Yu, Jun Guo:
OSLNet: Deep Small-Sample Classification with an Orthogonal Softmax Layer. CoRR abs/2004.09033 (2020)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-07383
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-07383
Achintya Kumar Sarkar, Zheng-Hua Tan:
On Bottleneck Features for Text-Dependent Speaker Verification Using X-vectors. CoRR abs/2005.07383 (2020)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-00217
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-00217
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Exploring Filterbank Learning for Keyword Spotting. CoRR abs/2006.00217 (2020)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-06563
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-06563
Cristian J. Vaca-Rubio, Pablo Ramirez-Espinosa, Robin Jess Williams, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
A Primer on Large Intelligent Surface (LIS) for Wireless Sensing in an Industrial Setting. CoRR abs/2006.06563 (2020)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-08004
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-08004
Achintya Kumar Sarkar, Himangshu Sarma, Priyanka Dwivedi, Zheng-Hua Tan:
Data augmentation enhanced speaker enrollment for text-dependent speaker verification. CoRR abs/2007.08004 (2020)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-13118
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-13118
Md. Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent:
UIAI System for Short-Duration Speaker Verification Challenge 2020. CoRR abs/2007.13118 (2020)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-09586
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-09586
Daniel Michelsanti, Zheng-Hua Tan, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu, Jesper Jensen:
An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation. CoRR abs/2008.09586 (2020)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-04556
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-04556
Giovanni Morrone, Daniel Michelsanti, Zheng-Hua Tan, Jesper Jensen:
Audio-Visual Speech Inpainting with Deep Learning. CoRR abs/2010.04556 (2020)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-05244
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-05244
Jiyang Xie, Zhanyu Ma, Guoqiang Zhang, Jing-Hao Xue, Zheng-Hua Tan, Jun Guo:
Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization. CoRR abs/2010.05244 (2020)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-05469
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-05469
Zeyu Song, Dongliang Chang, Zhanyu Ma, Xiaoxu Li, Zheng-Hua Tan:
CC-Loss: Channel Correlation Loss For Image Classification. CoRR abs/2010.05469 (2020)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-08465
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-08465
Cristian J. Vaca-Rubio, Pablo Ramirez-Espinosa, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
Assessing Wireless Sensing Potential with Large Intelligent Surfaces. CoRR abs/2011.08465 (2020)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-12536
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-12536
Achintya Kumar Sarkar, Zheng-Hua Tan:
Vocal Tract Length Perturbation for Text-Dependent Speaker Verification with Autoregressive Prediction Coding. CoRR abs/2011.12536 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/access/QiT19a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/QiT19a
Yonggang Qi, Zheng-Hua Tan:
SketchSegNet+: An End-to-End Learning of RNN for Multi-Class Sketch Semantic Segmentation. IEEE Access 7: 102717-102726 (2019)
[j52]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/MichelsantiTSJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/MichelsantiTSJ19
Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
Deep-learning-based audio-visual speech enhancement in presence of Lombard effect. Speech Commun. 115: 38-50 (2019)
[j51]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KolbaekTJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KolbaekTJ19
Morten Kolbaek, Zheng-Hua Tan, Jesper Jensen:
On the Relationship Between Short-Time Objective Intelligibility and Short-Time Spectral-Amplitude Mean-Square Error for Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 27(2): 283-295 (2019)
[j50]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SarkarTTSG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SarkarTTSG19
Achintya Kumar Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon, James R. Glass:
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 27(8): 1267-1279 (2019)
[c88]
- view
  authority control:
- export record
  dblp key:
  - conf/globalsip/HoangTHL019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/globalsip/HoangTHL019
Poul Hoang, Zheng-Hua Tan, Jan Mark de Haan, Thomas Lunner, Jesper Jensen:
Robust Bayesian and Maximum a Posteriori Beamforming for Hearing Assistive Devices. GlobalSIP 2019: 1-5
[c87]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MichelsantiTS019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MichelsantiTS019
Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
Effects of Lombard Reflex on the Performance of Deep-learning-based Audio-visual Speech Enhancement Systems. ICASSP 2019: 6615-6619
[c86]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MichelsantiTS019a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MichelsantiTS019a
Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
On Training Targets and Objective Functions for Deep-learning-based Audio-visual Speech Enhancement. ICASSP 2019: 8077-8081
[c85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Lopez-EspejoT019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Lopez-EspejoT019
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Keyword Spotting for Hearing Assistive Devices Robust to External Speakers. INTERSPEECH 2019: 3223-3227
[c84]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/XieMZXT019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/XieMZXT019
Jiyang Xie, Zhanyu Ma, Guoqiang Zhang, Jing-Hao Xue, Zheng-Hua Tan, Jun Guo:
Soft Dropout And Its Variational Bayes Approximation. MLSP 2019: 1-6
[c83]
- view
  authority control:
- export record
  dblp key:
  - conf/visapp/CoifmanRKST19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/visapp/CoifmanRKST19
Andrea Coifman, Péter Rohoska, Miklas S. Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
Subjective Annotations for Vision-based Attention Level Estimation. VISIGRAPP (5: VISAPP) 2019: 249-256
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-04554
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-04554
Achintya Kumar Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon, James R. Glass:
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification. CoRR abs/1905.04554 (2019)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-12605
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-12605
Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
Deep-Learning-Based Audio-Visual Speech Enhancement in Presence of Lombard Effect. CoRR abs/1905.12605 (2019)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-03588
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-03588
Zheng-Hua Tan, Achintya Kumar Sarkar, Najim Dehak:
rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method. CoRR abs/1906.03588 (2019)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-09417
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-09417
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Keyword Spotting for Hearing Assistive Devices Robust to External Speakers. CoRR abs/1906.09417 (2019)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-01019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-01019
Morten Kolbæk, Zheng-Hua Tan, Søren Holdt Jensen, Jesper Jensen:
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement. CoRR abs/1909.01019 (2019)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-06076
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-06076
Miklas S. Kristoffersen, Jacob L. Wieland, Sven Ewan Shepstone, Zheng-Hua Tan, Vinoba Vinayagamoorthy:
Deep Joint Embeddings of Context and Content for Recommendation. CoRR abs/1909.06076 (2019)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-10013
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-10013
Saeid Samizade, Zheng-Hua Tan, Chao Shen, Xiaohong Guan:
Adversarial Example Detection by Classification for Deep Speech Recognition. CoRR abs/1910.10013 (2019)
2018
[j49]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/SarkarT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/SarkarT18
Achintya Kumar Sarkar, Zheng-Hua Tan:
Incorporating pass-phrase dependent background models for text-dependent speaker verification. Comput. Speech Lang. 47: 259-271 (2018)
[j48]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/MaCTSTX18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/MaCTSTX18
Zhanyu Ma, Jen-Tzung Chien, Zheng-Hua Tan, Yi-Zhe Song, Jalil Taghia, Ming Xiao:
Recent advances in machine learning for non-Gaussian data processing. Neurocomputing 278: 1-3 (2018)
[j47]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/ChienLT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/ChienLT18
Jen-Tzung Chien, Chao-Hsi Lee, Zheng-Hua Tan:
Latent Dirichlet mixture model. Neurocomputing 278: 12-22 (2018)
[j46]
- view
  authority control:
- export record
  dblp key:
  - journals/ijsr/TanTDVSRH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijsr/TanTDVSRH18
Zheng-Hua Tan, Nicolai Bæk Thomsen, Xiaodong Duan, Evgenios Vlachos, Sven Ewan Shepstone, Morten Højfeldt Rasmussen, Jesper Lisby Højvang:
iSocioBot: A Multimodal Interactive Social Robot. Int. J. Soc. Robotics 10(1): 5-19 (2018)
[j45]
- view
  authority control:
- export record
  dblp key:
  - journals/prl/DuanT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/prl/DuanT18
Xiaodong Duan, Zheng-Hua Tan:
A spatial self-similarity based feature learning method for face recognition under varying poses. Pattern Recognit. Lett. 111: 109-116 (2018)
[j44]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/PengTLZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/PengTLZ18
Renhua Peng, Zheng-Hua Tan, Xiaodong Li, Chengshi Zheng:
A perceptually motivated LP residual estimator in noisy and reverberant environments. Speech Commun. 96: 129-141 (2018)
[j43]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/AndersenHT018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/AndersenHT018
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
Refinement and validation of the binaural short time objective intelligibility measure for spatially diverse conditions. Speech Commun. 102: 1-13 (2018)
[j42]
- view
  authority control:
- export record
  dblp key:
  - journals/taffco/ShepstoneTJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taffco/ShepstoneTJ18
Sven Ewan Shepstone, Zheng-Hua Tan, Søren Holdt Jensen:
Audio-Based Granularity-Adapted Emotion Classification. IEEE Trans. Affect. Comput. 9(2): 176-190 (2018)
[j41]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SahidullahTHKTP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SahidullahTHKTP18
Md. Sahidullah, Dennis Alexander Lehmann Thomsen, Rosa González Hautamäki, Tomi Kinnunen, Zheng-Hua Tan, Robert Parts, Martti Pitkänen:
Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones. IEEE ACM Trans. Audio Speech Lang. Process. 26(1): 44-56 (2018)
[j40]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/FarmaniPT018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/FarmaniPT018
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Bias-Compensated Informed Sound Source Localization Using Relative Transfer Functions. IEEE ACM Trans. Audio Speech Lang. Process. 26(7): 1271-1285 (2018)
[j39]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/AndersenHTJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/AndersenHTJ18
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
Nonintrusive Speech Intelligibility Prediction Using Convolutional Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 26(10): 1925-1939 (2018)
[j38]
- view
  authority control:
- export record
  dblp key:
  - journals/tce/ShepstoneTK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tce/ShepstoneTK18
Sven Ewan Shepstone, Zheng-Hua Tan, Miklas S. Kristoffersen:
Using Closed-Set Speaker Identification Score Confidence to Enhance Audio-Based Collaborative Filtering for Multiple Users. IEEE Trans. Consumer Electron. 64(1): 11-18 (2018)
[j37]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/MaXLTYG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/MaXLTYG18
Zhanyu Ma, Jing-Hao Xue, Arne Leijon, Zheng-Hua Tan, Zhen Yang, Jun Guo:
Decorrelation of Neutral Vector Variables: Theory and Applications. IEEE Trans. Neural Networks Learn. Syst. 29(1): 129-143 (2018)
[j36]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/YuTMMG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/YuTMMG18
Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Rainer Martin, Jun Guo:
Spoofing Detection in Automatic Speaker Verification Systems Using DNN Classifiers and Dynamic Acoustic Features. IEEE Trans. Neural Networks Learn. Syst. 29(10): 4633-4644 (2018)
[j35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/wpc/GuoTCZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/wpc/GuoTCZ18
Jun Guo, Zheng-Hua Tan, Sung Ho Cho, Guoqiang Zhang:
Wireless Personal Communications: Machine Learning for Big Data Processing in Mobile Internet. Wirel. Pers. Commun. 102(3): 2093-2098 (2018)
[c82]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KolbcekT018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KolbcekT018
Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Monaural Speech Enhancement Using Deep Neural Networks by Maximizing a Short-Time Objective Intelligibility Measure. ICASSP 2018: 5059-5063
[c81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FrederiksenVWTD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FrederiksenVWTD18
Peter Sibbern Frederiksen, Jesús Villalba, Shinji Watanabe, Zheng-Hua Tan, Najim Dehak:
Effectiveness of Single-Channel BLSTM Enhancement for Language Identification. INTERSPEECH 2018: 1823-1827
[c80]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/VlachosT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/VlachosT18
Evgenios Vlachos, Zheng-Hua Tan:
Public perception of android robots: Indications from an analysis of YouTube comments. IROS 2018: 1255-1260
[c79]
- view
  authority control:
- export record
  dblp key:
  - conf/ro-man/TrovatoPBCTBT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ro-man/TrovatoPBCTBT18
Gabriele Trovato, Renato Paredes, Javier Balvin, Francisco Cuéllar, Nicolai Bæk Thomsen, Søren Bech, Zheng-Hua Tan:
The Sound or Silence: Investigating the Influence of Robot Noise on Proxemics. RO-MAN 2018: 713-718
[c78]
- view
  authority control:
- export record
  dblp key:
  - conf/um/KristoffersenST18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/um/KristoffersenST18
Miklas S. Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
A Dataset for Inferring Contextual Preferences of Users Watching TV. UMAP 2018: 367-368
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-00604
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-00604
Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Monaural Speech Enhancement using Deep Neural Networks by Maximizing a Short-Time Objective Intelligibility Measure. CoRR abs/1802.00604 (2018)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-06764
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-06764
Ioannis T. Christou, Emmanouil Amolochitis, Zheng-Hua Tan:
A Parallel/Distributed Algorithmic Framework for Mining All Quantitative Association Rules. CoRR abs/1804.06764 (2018)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1806-08404
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-08404
Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
On the Equivalence between Objective Intelligibility and Mean-Squared Error for Deep Neural Network based Speech Enhancement. CoRR abs/1806.08404 (2018)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1808-00337
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-00337
Miklas S. Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
The Importance of Context When Recommending TV Content: Dataset and Algorithms. CoRR abs/1808.00337 (2018)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-06234
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-06234
Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
On Training Targets and Objective Functions for Deep-Learning-Based Audio-Visual Speech Enhancement. CoRR abs/1811.06234 (2018)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-06250
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-06250
Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
Effects of Lombard Reflex on the Performance of Deep-Learning-Based Audio-Visual Speech Enhancement Systems. CoRR abs/1811.06250 (2018)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1812-04949
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-04949
Andrea Coifman, Péter Rohoska, Miklas S. Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
Subjective Annotations for Vision-Based Attention Level Estimation. CoRR abs/1812.04949 (2018)
2017
[j34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/access/YuTZMG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/YuTZMG17
Hong Yu, Zheng-Hua Tan, Yiming Zhang, Zhanyu Ma, Jun Guo:
DNN Filter Bank Cepstral Coefficients for Spoofing Detection. IEEE Access 5: 4779-4787 (2017)
[j33]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KolbaekTJ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KolbaekTJ17
Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Speech Intelligibility Potential of General and Specialized Deep Neural Network Based Speech Enhancement Systems. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 149-163 (2017)
[j32]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/FarmaniPTJ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/FarmaniPTJ17
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Informed Sound Source Localization Using Relative Transfer Functions for Hearing Aid Applications. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 611-623 (2017)
[j31]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KolbaekYTJ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KolbaekYTJ17
Morten Kolbaek, Dong Yu, Zheng-Hua Tan, Jesper Jensen:
Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1901-1913 (2017)
[j30]
- view
  authority control:
- export record
  dblp key:
  - journals/wpc/PrasadTP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/wpc/PrasadTP17
Swati Prasad, Zheng-Hua Tan, Ramjee Prasad:
Frame Selection for Robust Speaker Identification: A Hybrid Approach. Wirel. Pers. Commun. 97(1): 933-950 (2017)
[j29]
- view
  authority control:
- export record
  dblp key:
  - journals/wpc/AstarasPT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/wpc/AstarasPT17
Stefanos Astaras, Aristodemos Pnevmatikakis, Zheng-Hua Tan:
Visual Detection of Events of Interest from Urban Activity. Wirel. Pers. Commun. 97(2): 1877-1888 (2017)
[c77]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuKT017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuKT017
Dong Yu, Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Permutation invariant training of deep models for speaker-independent multi-talker speech separation. ICASSP 2017: 241-245
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AndersenHT017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AndersenHT017
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
A non-intrusive Short-Time Objective Intelligibility measure. ICASSP 2017: 5085-5089
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KinnunenSFCHTST17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KinnunenSFCHTST17
Tomi Kinnunen, Md. Sahidullah, Mauro Falcone, Luca Costantini, Rosa González Hautamäki, Dennis Alexander Lehmann Thomsen, Achintya Kumar Sarkar, Zheng-Hua Tan, Héctor Delgado, Massimiliano Todisco, Nicholas W. D. Evans, Ville Hautamäki, Kong-Aik Lee:
RedDots replayed: A new replay spoofing attack corpus for text-dependent speaker verification research. ICASSP 2017: 5395-5399
[c74]
- view
  authority control:
- export record
  dblp key:
  - conf/ictai/DuanTTLJ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ictai/DuanTTLJ17
Xiaodong Duan, Nicolai Bæk Thomsen, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
Weighted Score Based Fast Converging CO-training with Application to Audio-Visual Person Identification. ICTAI 2017: 610-617
[c73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeHKLa17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeHKLa17
Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, Chunlei Zhang, Andreas Nautsch, Themos Stafylakis, Gang Liu, Mickaël Rouvier, Wei Rao, Federico Alegre, J. Ma, Man-Wai Mak, Achintya Kumar Sarkar, Héctor Delgado, Rahim Saeidi, Hagai Aronowitz, Aleksandr Sizov, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Bin Ma, Ville Vestman, Md. Sahidullah, M. Halonen, Anssi Kanervisto, Gaël Le Lan, Fahimeh Bahmaninezhad, Sergey Isadskiy, Christian Rathgeb, Christoph Busch, Georgios Tzimiropoulos, Q. Qian, Z. Wang, Q. Zhao, T. Wang, H. Li, J. Xue, S. Zhu, R. Jin, T. Zhao, Pierre-Michel Bousquet, Moez Ajili, Waad Ben Kheder, Driss Matrouf, Zhi Hao Lim, Chenglin Xu, Haihua Xu, Xiong Xiao, Eng Siong Chng, Benoit G. B. Fauve, Kaavya Sriskandaraja, Vidhyasaharan Sethu, W. W. Lin, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Massimiliano Todisco, Nicholas W. D. Evans, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Eliathamby Ambikairajah:
The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016. INTERSPEECH 2017: 1328-1332
[c72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuTMG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuTMG17
Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Jun Guo:
Adversarial Network Bottleneck Features for Noise Robust Speaker Verification. INTERSPEECH 2017: 1492-1496
[c71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MichelsantiT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MichelsantiT17
Daniel Michelsanti, Zheng-Hua Tan:
Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification. INTERSPEECH 2017: 2008-2012
[c70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SarkarSTK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SarkarSTK17
Achintya Kumar Sarkar, Md. Sahidullah, Zheng-Hua Tan, Tomi Kinnunen:
Improving Speaker Verification Performance in Presence of Spoofing Attacks Using Out-of-Domain Spoofed Data. INTERSPEECH 2017: 2611-2615
[c69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AndersenHT017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AndersenHT017
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
On the Use of Band Importance Weighting in the Short-Time Objective Intelligibility Measure. INTERSPEECH 2017: 2963-2967
[c68]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/KolbaekYT017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/KolbaekYT017
Morten Kolbaek, Dong Yu, Zheng-Hua Tan, Jesper Jensen:
Joint separation and denoising of noisy multi-talker speech using recurrent neural networks and permutation invariant training. MLSP 2017: 1-6
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/YuTMG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/YuTMG17
Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Jun Guo:
DNN Filter Bank Cepstral Coefficients for Spoofing Detection. CoRR abs/1702.03791 (2017)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/KolbaekYTJ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/KolbaekYTJ17
Morten Kolbæk, Dong Yu, Zheng-Hua Tan, Jesper Jensen:
Multi-talker Speech Separation and Tracing with Permutation Invariant Training of Deep Recurrent Neural Networks. CoRR abs/1703.06284 (2017)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SarkarT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SarkarT17
Achintya Kumar Sarkar, Zheng-Hua Tan:
Time-Contrastive Learning Based Unsupervised DNN Feature Extraction for Speaker Verification. CoRR abs/1704.02373 (2017)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/MaXLTYG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/MaXLTYG17
Zhanyu Ma, Jing-Hao Xue, Arne Leijon, Zheng-Hua Tan, Zhen Yang, Jun Guo:
Decorrelation of Neutral Vector Variables: Theory and Applications. CoRR abs/1705.10524 (2017)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/YuTMG17a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/YuTMG17a
Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Jun Guo:
Adversarial Network Bottleneck Features for Noise Robust Speaker Verification. CoRR abs/1706.03397 (2017)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1708-09588
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1708-09588
Morten Kolbæk, Dong Yu, Zheng-Hua Tan, Jesper Jensen:
Joint Separation and Denoising of Noisy Multi-talker Speech using Recurrent Neural Networks and Permutation Invariant Training. CoRR abs/1708.09588 (2017)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1709-01703
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1709-01703
Daniel Michelsanti, Zheng-Hua Tan:
Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification. CoRR abs/1709.01703 (2017)
2016
[j28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/access/MaYTG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/MaYTG16
Zhanyu Ma, Hong Yu, Zheng-Hua Tan, Jun Guo:
Text-Independent Speaker Identification Using the Histogram Transform Model. IEEE Access 4: 9733-9739 (2016)
[j27]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/MaTG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/MaTG16
Zhanyu Ma, Zheng-Hua Tan, Jun Guo:
Feature selection for neutral vector in EEG signal classification. Neurocomputing 174: 937-945 (2016)
[j26]
- view
  authority control:
- export record
  dblp key:
  - journals/ijsr/JochumVCNHT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijsr/JochumVCNHT16
Elizabeth Ann Jochum, Evgenios Vlachos, Anja Christoffersen, Sally Grindsted Nielsen, Ibrahim A. Hameed, Zheng-Hua Tan:
Using Theatre to Study Interaction with Care Robots. Int. J. Soc. Robotics 8(4): 457-470 (2016)
[j25]
- view
  authority control:
- export record
  dblp key:
  - journals/kais/ChristouAT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/kais/ChristouAT16
Ioannis T. Christou, Emmanouil Amolochitis, Zheng-Hua Tan:
AMORE: design and implementation of a commercial-strength parallel hybrid movie recommendation engine. Knowl. Inf. Syst. 47(3): 671-696 (2016)
[j24]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ShepstoneLLTJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ShepstoneLLTJ16
Sven Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Søren Holdt Jensen:
Total Variability Modeling Using Source-Specific Priors. IEEE ACM Trans. Audio Speech Lang. Process. 24(3): 504-517 (2016)
[j23]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/AndersenHTJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/AndersenHTJ16
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
Predicting the Intelligibility of Noisy and Nonlinearly Processed Binaural Speech. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 1908-1920 (2016)
[j22]
- view
  authority control:
- export record
  dblp key:
  - journals/wpc/KatsarakisPTP16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/wpc/KatsarakisPTP16
Nikolaos Katsarakis, Aristodemos Pnevmatikakis, Zheng-Hua Tan, Ramjee Prasad:
Improved Gaussian Mixture Models for Adaptive Foreground Segmentation. Wirel. Pers. Commun. 87(3): 629-643 (2016)
[c67]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/fusion/FarmaniHPTJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/fusion/FarmaniHPTJ16
Mojtaba Farmani, Richard Heusdens, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Concurrent localization of sound sources and dual-microphone sub-arrays using TOFs. FUSION 2016: 1931-1936
[c66]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FarmaniPTJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FarmaniPTJ16
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Informed Direction of Arrival estimation using a spherical-head model for Hearing Aid applications. ICASSP 2016: 360-364
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AndersenHTJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AndersenHTJ16
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
A method for predicting the intelligibility of noisy and non-linearly enhanced binaural speech. ICASSP 2016: 4995-4999
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/iecon/LinGJTVL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iecon/LinGJTVL16
Hengwei Lin, Josep M. Guerrero, Chenxi Jia, Zheng-Hua Tan, Juan C. Vasquez, Chengxi Liu:
Adaptive overcurrent protection for microgrids in extensive distribution systems. IECON 2016: 4042-4047
[c63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SarkarT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SarkarT16
Achintya Kumar Sarkar, Zheng-Hua Tan:
Text Dependent Speaker Verification Using Un-Supervised HMM-UBM and Temporal GMM-UBM. INTERSPEECH 2016: 425-429
[c62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KinnunenSKDTSTH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KinnunenSKDTSTH16
Tomi Kinnunen, Md. Sahidullah, Ivan Kukanov, Héctor Delgado, Massimiliano Todisco, Achintya Kumar Sarkar, Nicolai Bæk Thomsen, Ville Hautamäki, Nicholas W. D. Evans, Zheng-Hua Tan:
Utterance Verification for Text-Dependent Speaker Recognition: A Comparative Assessment Using the RedDots Corpus. INTERSPEECH 2016: 430-434
[c61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SahidullahDTYKE16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SahidullahDTYKE16
Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Hong Yu, Tomi Kinnunen, Nicholas W. D. Evans, Zheng-Hua Tan:
Integrated Spoofing Countermeasures and Automatic Speaker Verification: An Evaluation on ASVspoof 2015. INTERSPEECH 2016: 1700-1704
[c60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SahidullahHTKTH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SahidullahHTKTH16
Md. Sahidullah, Rosa González Hautamäki, Dennis Alexander Lehmann Thomsen, Tomi Kinnunen, Zheng-Hua Tan, Ville Hautamäki, Robert Parts, Martti Pitkänen:
Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech. INTERSPEECH 2016: 1720-1724
[c59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ThomsenTTLJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ThomsenTTLJ16
Nicolai Bæk Thomsen, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
Speaker-Dependent Dictionary-Based Speech Enhancement for Text-Dependent Speaker Verification. INTERSPEECH 2016: 1839-1843
[c58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KinnunenSKTST16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KinnunenSKTST16
Tomi Kinnunen, Alexey Sholokhov, Elie Khoury, Dennis Alexander Lehmann Thomsen, Md. Sahidullah, Zheng-Hua Tan:
HAPPY Team Entry to NIST OpenSAD Challenge: A Fusion of Short-Term Unsupervised and Segment i-Vector Based Speech Activity Detectors. INTERSPEECH 2016: 2992-2996
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/mipro/SunMADT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mipro/SunMADT16
Zongji Sun, Li Meng, Aladdin M. Ariyaeeinia, Xiaodong Duan, Zheng-Hua Tan:
Privacy protection performance of De-identified face images with and without background. MIPRO 2016: 1354-1359
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/ChienLT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/ChienLT16
Jen-Tzung Chien, Chao-Hsi Lee, Zheng-Hua Tan:
Dirichlet mixture allocation. MLSP 2016: 1-6
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/DelgadoTSSEKT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/DelgadoTSSEKT16
Héctor Delgado, Massimiliano Todisco, Md. Sahidullah, Achintya Kumar Sarkar, Nicholas W. D. Evans, Tomi Kinnunen, Zheng-Hua Tan:
Further optimisations of constant Q cepstral processing for integrated utterance and text-dependent speaker verification. SLT 2016: 179-185
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/KolboekTJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/KolboekTJ16
Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Speech enhancement using Long Short-Term Memory based recurrent Neural Networks for noise robust Speaker Verification. SLT 2016: 305-311
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/spline/Abou-ZleikhaCTJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/spline/Abou-ZleikhaCTJ16
Mohamed Abou-Zleikha, Mads Græsbøll Christensen, Zheng-Hua Tan, Søren Holdt Jensen:
Projecting emotional speech into arousal-valence space using pairwise preference learning. SPLINE 2016: 1-5
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/spline/AstarasPT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/spline/AstarasPT16
Stefanos Astaras, Aristodemos Pnevmatikakis, Zheng-Hua Tan:
Background subtraction for patterns of activities in cities. SPLINE 2016: 1-5
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/spline/ThomsenDTLJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/spline/ThomsenDTLJ16
Nicolai Bæk Thomsen, Xiaodong Duan, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
Improving the convergence of co-training for audio-visual person identification. SPLINE 2016: 1-5
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/spline/YuSTTMG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/spline/YuSTTMG16
Hong Yu, Achintya Kumar Sarkar, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Zhanyu Ma, Jun Guo:
Effect of multi-condition training and speech enhancement methods on spoofing detection. SPLINE 2016: 1-5
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/YuKTJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/YuKTJ16
Dong Yu, Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Permutation Invariant Training of Deep Models for Speaker-Independent Multi-talker Speech Separation. CoRR abs/1607.00325 (2016)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SarkarT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SarkarT16
Achintya Kumar Sarkar, Zheng-Hua Tan:
Incorporating Pass-Phrase Dependent Background Models for Text Dependent Speaker Verification. CoRR abs/1611.06423 (2016)
2015
[j21]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/QiGSXZT15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/QiGSXZT15
Yonggang Qi, Jun Guo, Yi-Zhe Song, Tao Xiang, Honggang Zhang, Zheng-Hua Tan:
Im2Sketch: Sketch generation by unconflicted perceptual grouping. Neurocomputing 165: 338-349 (2015)
[j20]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/JensenT15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/JensenT15
Jesper Jensen, Zheng-Hua Tan:
Minimum Mean-Square Error Estimation of Mel-Frequency Cepstral Features-A Theoretically Consistent Approach. IEEE ACM Trans. Audio Speech Lang. Process. 23(1): 186-197 (2015)
[j19]
- view
  authority control:
- export record
  dblp key:
  - journals/tsg/KouzelisTBPR15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tsg/KouzelisTBPR15
Konstantinos Kouzelis, Zheng-Hua Tan, Birgitte Bak-Jensen, Jayakrishnan Radhakrishna Pillai, Ewen Ritchie:
Estimation of Residential Heat Pump Consumption for Flexibility Market Applications. IEEE Trans. Smart Grid 6(4): 1852-1864 (2015)
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/Abou-ZleikhaTCJ15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/Abou-ZleikhaTCJ15
Mohamed Abou-Zleikha, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen:
A discriminative approach for speaker selection in speaker de-identification systems. EUSIPCO 2015: 2102-2106
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/globalsip/FarmaniPTJ15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/globalsip/FarmaniPTJ15
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Informed TDoA-based direction of arrival estimation for hearing aid applications. GlobalSIP 2015: 953-957
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FarmaniPTJ15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FarmaniPTJ15
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Maximum likelihood approach to "informed" Sound Source Localization for Hearing Aid applications. ICASSP 2015: 16-20
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FarmaniPTJ15a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FarmaniPTJ15a
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
On the influence of microphone array geometry on HRTF-based Sound Source Localization. ICASSP 2015: 439-443
[c45]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShepstoneLLTJ15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShepstoneLLTJ15
Sven Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Søren Holdt Jensen:
Source-specific informative prior for i-vector extraction. ICASSP 2015: 4185-4189
[c44]
- view
  authority control:
- export record
  dblp key:
  - conf/icip/DuanT15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icip/DuanT15
Xiaodong Duan, Zheng-Hua Tan:
A feature subtraction method for image based kinship verification under uncontrolled environments. ICIP 2015: 1573-1577
[c43]
- view
  authority control:
- export record
  dblp key:
  - conf/icip/DuanT15a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icip/DuanT15a
Xiaodong Duan, Zheng-Hua Tan:
Local feature learning for face recognition under varying poses. ICIP 2015: 2905-2909
[c42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AndersenHTJ15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AndersenHTJ15
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
A binaural short time objective intelligibility measure for noisy and enhanced speech. INTERSPEECH 2015: 2563-2567
[c41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KraljevskiTB15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KraljevskiTB15
Ivan Kraljevski, Zheng-Hua Tan, Maria Paola Bissiri:
Comparison of forced-alignment speech recognition and humans for generating reference VAD. INTERSPEECH 2015: 2937-2941
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/ThomsenTLJ15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/ThomsenTLJ15
Nicolai Bæk Thomsen, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
A heuristic approach for a social robot to navigate to a person based on audio and range information. IROS 2015: 5884-5890
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/isvc/DuanT15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isvc/DuanT15
Xiaodong Duan, Zheng-Hua Tan:
Neighbors Based Discriminative Feature Difference Learning for Kinship Verification. ISVC (2) 2015: 258-267
[c38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/medinfo/SchaarupHLTAH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/medinfo/SchaarupHLTAH15
Clara Schaarup, Gunnar Hartvigsen, Lars Bo Larsen, Zheng-Hua Tan, Eirik Årsand, Ole Kristian Hejlesen:
Assessing the Potential Use of Eye-Tracking Triangulation for Evaluating the Usability of an Online Diabetes Exercise System. MedInfo 2015: 84-88
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/mipro/KristensenTMG15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mipro/KristensenTMG15
Rasmus Lyngby Kristensen, Zheng-Hua Tan, Zhanyu Ma, Jun Guo:
Binary pattern flavored feature extractors for Facial Expression Recognition: An overview. MIPRO 2015: 1131-1137
2014
[j18]
- view
  authority control:
- export record
  dblp key:
  - journals/cee/TanK14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cee/TanK14
Zheng-Hua Tan, Ivan Kraljevski:
Joint variable frame rate and length analysis for speech recognition under adverse conditions. Comput. Electr. Eng. 40(7): 2139-2149 (2014)
[j17]
- view
  authority control:
- export record
  dblp key:
  - journals/expert/AmolochitisCT14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/expert/AmolochitisCT14
Emmanouil Amolochitis, Ioannis T. Christou, Zheng-Hua Tan:
Implementing a Commercial-Strength Parallel Hybrid Movie Recommendation Engine. IEEE Intell. Syst. 29(2): 92-96 (2014)
[j16]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/ShepstoneTJ14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/ShepstoneTJ14
Sven Ewan Shepstone, Zheng-Hua Tan, Søren Holdt Jensen:
Using Audio-Derived Affective Offset to Enhance TV Recommendation. IEEE Trans. Multim. 16(7): 1999-2010 (2014)
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/vlsisp/MaLTG14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/vlsisp/MaLTG14
Zhanyu Ma, Arne Leijon, Zheng-Hua Tan, Sheng Gao:
Predictive Distribution of the Dirichlet Mixture Model by Local Variational Inference. J. Signal Process. Syst. 74(3): 359-374 (2014)
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/wpc/KatsarakisPTP14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/wpc/KatsarakisPTP14
Nikos Katsarakis, Aristodemos Pnevmatikakis, Zheng-Hua Tan, Ramjee Prasad:
Combination of Multiple Measurement Cues for Visual Face Tracking. Wirel. Pers. Commun. 78(3): 1789-1810 (2014)
[c36]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/Abou-ZleikhaTCJ14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/Abou-ZleikhaTCJ14
Mohamed Abou-Zleikha, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen:
Cluster-based adaptation using density forest for HMM phone recognition. EUSIPCO 2014: 2065-2069
[c35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ifip12/Abou-ZleikhaTCJ14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ifip12/Abou-ZleikhaTCJ14
Mohamed Abou-Zleikha, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen:
Utilising Tree-Based Ensemble Learning for Speaker Segmentation. AIAI 2014: 50-59
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/interspeech/ThomsenTLJ14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ThomsenTLJ14
Nicolai Bæk Thomsen, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
Improving Robustness Against Environmental Sounds for Directing Attention of Social Robots. MA3HMI@INTERSPEECH 2014: 25-34
2013
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/ipm/AmolochitisCTP13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ipm/AmolochitisCTP13
Emmanouil Amolochitis, Ioannis T. Christou, Zheng-Hua Tan, Ramjee Prasad:
A heuristic hierarchical scheme for academic search and retrieval. Inf. Process. Manag. 49(6): 1326-1343 (2013)
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/tce/ShepstoneTJ13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tce/ShepstoneTJ13
Sven Ewan Shepstone, Zheng-Hua Tan, Søren Holdt Jensen:
Audio-based age and gender identification to enhance the recommendation of TV content. IEEE Trans. Consumer Electron. 59(3): 721-729 (2013)
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PlchotMMDMCGHMMSSTTZZ13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PlchotMMDMCGHMMSSTTZZ13
Oldrich Plchot, Spyros Matsoukas, Pavel Matejka, Najim Dehak, Jeff Z. Ma, Sandro Cumani, Ondrej Glembek, Hynek Hermansky, Sri Harish Reddy Mallidi, Nima Mesgarani, Richard M. Schwartz, Mehdi Soufifar, Zheng-Hua Tan, Samuel Thomas, Bing Zhang, Xinhui Zhou:
Developing a speaker identification system for the DARPA RATS project. ICASSP 2013: 6768-6772
[c32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShepstoneTJ13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShepstoneTJ13
Sven Ewan Shepstone, Zheng-Hua Tan, Søren Holdt Jensen:
Demographic recommendation by means of group profile elicitation using speaker age and gender recognition. INTERSPEECH 2013: 2827-2831
[c31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/slte/RasmussenT13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slte/RasmussenT13
Morten Højfeldt Rasmussen, Zheng-Hua Tan:
Fusing eye-gaze and speech recognition for tracking in an automatic reading tutor - a step in the right direction? SLaTE 2013: 112-115
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/vcip/QiGLZXST13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/vcip/QiGLZXST13
Yonggang Qi, Jun Guo, Yi Li, Honggang Zhang, Tao Xiang, Yi-Zhe Song, Zheng-Hua Tan:
Perceptual grouping via untangling Gestalt principles. VCIP 2013: 1-6
[c29]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/wpmc/PrasadTP13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wpmc/PrasadTP13
Swati Prasad, Zheng-Hua Tan, Ramjee Prasad:
Multi-frame rate based multiple-model training for robust speaker identification of disguised voice. WPMC 2013: 1-4
2012
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/cee/YuTW12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cee/YuTW12
Weichuan Yu, Zheng-Hua Tan, Yi Wan:
Guest Editors' Introduction to the Special Issue on "New Trends in Signal Processing and Biomedical Engineering". Comput. Electr. Eng. 38(1): 1-2 (2012)
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/MowlaeeSCTKFJ12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/MowlaeeSCTKFJ12
Pejman Mowlaee, Rahim Saeidi, Mads Græsbøll Christensen, Zheng-Hua Tan, Tomi Kinnunen, Pasi Fränti, Søren Holdt Jensen:
A Joint Approach for Single-Channel Speaker Identification and Speech Separation. IEEE Trans. Speech Audio Process. 20(9): 2586-2601 (2012)
[c28]
- no documents available
  - details & citations
- export record
  dblp key:
  - conf/icpram/AmolochitisCT12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpram/AmolochitisCT12
Emmanouil Amolochitis, Ioannis T. Christou, Zheng-Hua Tan:
PubSearch - A Hierarchical Heuristic Scheme for Ranking Academic Search Results. ICPRAM (2) 2012: 509-514
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/ssp/MaTP12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssp/MaTP12
Zhanyu Ma, Zheng-Hua Tan, Swati Prasad:
EEG signal classification with super-Dirichlet mixture model. SSP 2012: 440-443
2011
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/ijkl/PetreskiTGPPT11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijkl/PetreskiTGPPT11
Hristijan Petreski, Sofia Tsekeridou, Eri Giannaka, Neeli Rashmi Prasad, Ramjee Prasad, Zheng-Hua Tan:
Technology-enabled social learning: a review. Int. J. Knowl. Learn. 7(3/4): 253-270 (2011)
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/PetsatodisBTTP11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/PetsatodisBTTP11
Theodore Petsatodis, Christos Boukis, Fotios Talantzis, Zheng-Hua Tan, Ramjee Prasad:
Convex Combination of Multiple Statistical Models With Application to VAD. IEEE Trans. Speech Audio Process. 19(8): 2314-2327 (2011)
[c26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MowlaeeSTCKFJ11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MowlaeeSTCKFJ11
Pejman Mowlaee, Rahim Saeidi, Zheng-Hua Tan, Mads Græsbøll Christensen, Tomi Kinnunen, Pasi Fränti, Søren Holdt Jensen:
Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge. INTERSPEECH 2011: 677-680
[c25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PetsatodisTBTP11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PetsatodisTBTP11
Theodore Petsatodis, Fotios Talantzis, Christos Boukis, Zheng-Hua Tan, Ramjee Prasad:
Multi-Sensor Voice Activity Detection Based on Multiple Observation Hypothesis Testing. INTERSPEECH 2011: 2633-2636
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/isabel/BakopoulosTGTP11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isabel/BakopoulosTGTP11
Menelaos Bakopoulos, Sofia Tsekeridou, Eri Giannaka, Zheng-Hua Tan, Ramjee Prasad:
Mobile video annotation for enhanced rich media communication during emergency handling. ISABEL 2011: 32:1-32:5
[c23]
- view
  - electronic edition @ iscram.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscram/BakopoulosTGTP11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscram/BakopoulosTGTP11
Menelaos Bakopoulos, Sofia Tsekeridou, Eri Giannaka, Zheng-Hua Tan, Ramjee Prasad:
Command & control: Information merging, selective visualization and decision support for emergency handling. ISCRAM 2011
[c22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/slte/RasmussenMTLL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slte/RasmussenMTLL11
Morten Højfeldt Rasmussen, Jack Mostow, Zheng-Hua Tan, Børge Lindberg, Yuanpeng Li:
Evaluating tracking accuracy of an automatic reading tutor. SLaTE 2011: 17-20
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/slte/RasmussenLT11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slte/RasmussenLT11
Morten Højfeldt Rasmussen, Børge Lindberg, Zheng-Hua Tan:
Combining acoustic and language model miscue detection methods for adult dyslexic read speech. SLaTE 2011: 21-24
[c20]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/wpmc/PrasadTPCGD11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wpmc/PrasadTPCGD11
Swati Prasad, Zheng-Hua Tan, Ramjee Prasad, Alvaro Fuentes Cabrera, Ying Gu, Kim Dremstrup:
Feature selection strategy for classification of single-trial EEG elicited by motor imagery. WPMC 2011: 1-4
2010
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/TanHFGO10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/TanHFGO10
Zheng-Hua Tan, Reinhold Haeb-Umbach, Sadaoki Furui, James R. Glass, Maurizio Omologo:
Introduction to the Issue on Speech Processing for Natural Interaction With Intelligent Environments. IEEE J. Sel. Top. Signal Process. 4(5): 769-771 (2010)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/TanL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/TanL10
Zheng-Hua Tan, Børge Lindberg:
Low-Complexity Variable Frame Rate Analysis for Speech Recognition and Voice Activity Detection. IEEE J. Sel. Top. Signal Process. 4(5): 798-807 (2010)
[c19]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/SantoroPTM10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/SantoroPTM10
Francesco Santoro, Sergio Pedro, Zheng-Hua Tan, Thomas B. Moeslund:
Crowd analysis by using optical flow and density based clustering. EUSIPCO 2010: 269-273
[c18]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/AndersenAKPT10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/AndersenAKPT10
Martina Andersen, Rasmus S. Andersen, Nikos Katsarakis, Aristodemos Pnevmatikakis, Zheng-Hua Tan:
Three-dimensional adaptive sensing of people in a multi-camera setup. EUSIPCO 2010: 964-968
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MowlaeeSTCFJ10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MowlaeeSTCFJ10
Pejman Mowlaee, Rahim Saeidi, Zheng-Hua Tan, Mads Græsbøll Christensen, Pasi Fränti, Søren Holdt Jensen:
Joint single-channel speech separation and speaker identification. ICASSP 2010: 4430-4433
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/SaeidiMKTCJF10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/SaeidiMKTCJF10
Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen, Pasi Fränti:
Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals. ICPR 2010: 4565-4568
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaeidiMKTCJF10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaeidiMKTCJF10
Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen, Pasi Fränti:
Improving monaural speaker identification by double-talk detection. INTERSPEECH 2010: 1069-1072

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RasmussenTLJ09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RasmussenTLJ09
Morten Højfeldt Rasmussen, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
A system for detecting miscues in dyslexic read speech. INTERSPEECH 2009: 1467-1470
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanL09
Zheng-Hua Tan, Børge Lindberg:
High-accuracy, low-complexity voice activity detection based on a posteriori SNR weighted energy. INTERSPEECH 2009: 2231-2234
[r1]
- view
  - electronic edition @ igi-global.com
  - details & citations
- export record
  dblp key:
  - reference/dataware/Tan09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/reference/dataware/Tan09
Zheng-Hua Tan:
Audio and Speech Processing for Data Mining. Encyclopedia of Data Warehousing and Mining 2009: 98-103
2008
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/XuTDL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/XuTDL08
Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Robust Speech Recognition by Nonlocal Means Denoising Processing. IEEE Signal Process. Lett. 15: 701-704 (2008)
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/TanL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/TanL08
Zheng-Hua Tan, Børge Lindberg:
Speech Recognition on Mobile Devices. WMMP 2008: 221-237
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanL08
Zheng-Hua Tan, Børge Lindberg:
A posteriori SNR weighted energy based variable frame rate analysis for speech recognition. INTERSPEECH 2008: 1024-1027
2007
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TanDL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TanDL07
Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Exploiting Temporal Correlation of Speech for Error Robust and Bandwidth Flexible Distributed Speech Recognition. IEEE Trans. Speech Audio Process. 15(4): 1391-1403 (2007)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/XuDTL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XuDTL07
Haitian Xu, Paul Dalsgaard, Zheng-Hua Tan, Børge Lindberg:
Noise Condition-Dependent Training Based on Noise Classification and SNR Estimation. IEEE Trans. Speech Audio Process. 15(8): 2431-2443 (2007)
2006
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/tkde/Tan06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tkde/Tan06
Zheng-Hua Tan:
Fuzzy Metagraph and Its Combination with the Indexing Approach in Rule-Based Systems. IEEE Trans. Knowl. Data Eng. 18(6): 829-841 (2006)
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuTDL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuTDL06
Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Robust Speech Recognition From Noise-Type Based Feature Compensation and Model Interpolation in a Multiple Model Framework. ICASSP (1) 2006: 1141-1144
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanDL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanDL06
Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Robust speech recognition over mobile networks using combined weighted viterbi decoding and subvector based error concealment. INTERSPEECH 2006
2005
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/TanDL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/TanDL05
Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Automatic speech recognition over error-prone wireless networks. Speech Commun. 47(1-2): 220-242 (2005)
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuTDL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuTDL05
Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Robust speech recognition based on noise and SNR classification - a multiple-model framework. INTERSPEECH 2005: 977-980
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanDLX05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanDLX05
Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg, Haitian Xu:
Robust speech recognition in ubiquitous networking and context-aware computing. INTERSPEECH 2005: 2849-2852
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/mmsp/TanDL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmsp/TanDL05
Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Adaptive Multi-Frame-Rate Scheme for Distributed Speech Recognition Based on a Half Frame-Rate Front-End. MMSP 2005: 1-4
2004
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TanDL04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TanDL04
Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
A subvector-based error concealment algorithm for speech recognition over mobile networks. ICASSP (1) 2004: 57-60
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuTDL04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuTDL04
Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Spectral subtraction with full-wave rectification and likelihood controlled instantaneous noise estimation for robust speech recognition. INTERSPEECH 2004: 2085-2088
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanDL04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanDL04
Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
On the integration of speech recognition into personal networks. INTERSPEECH 2004: 2317-2320
2003
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TanDL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TanDL03
Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
OOV-detection and channel error protection for distributed speech recognition over wireless networks. ICASSP (1) 2003: 336-339
2002
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanD02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanD02
Zheng-Hua Tan, Paul Dalsgaard:
Channel error protection scheme for distributed speech recognition. INTERSPEECH 2002: 2225-2228
2001
[e1]
- view
  authority control:
- export record
  dblp key:
  - conf/interspeech/2001
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/2001
Paul Dalsgaard, Børge Lindberg, Henrik Benner, Zheng-Hua Tan:
EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001. ISCA 2001 [contents]

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.