default search action
Yu Tsao 0001
Person information
- affiliation: Academia Sinica, Research Center for Information Technology Innovation, Taipei, Taiwan
Other persons with the same name
- Yu Tsao — disambiguation page
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j74]Sheng-Yu Peng, I-Chun Liu, Yi-Heng Wu, Ting-Ju Lin, Chun-Jui Chen, Xiu-Zhu Li, Yong-Qi Cheng, Pin-Han Lin, Kuo-Hsuan Hung, Yu Tsao:
An SRAM-Based Reconfigurable Cognitive Computation Matrix for Sensor Edge Applications. IEEE J. Solid State Circuits 59(2): 636-648 (2024) - [j73]Enoch Hsin-Ho Huang, Rong Chao, Yu Tsao, Chao-Min Wu:
ElectrodeNet - A Deep-Learning-Based Sound Coding Strategy for Cochlear Implants. IEEE Trans. Cogn. Dev. Syst. 16(1): 346-357 (2024) - [j72]Syu-Siang Wang, Jia-Yang Chen, Bo-Ren Bai, Shih-Hau Fang, Yu Tsao:
Unsupervised Face-Masked Speech Enhancement Using Generative Adversarial Networks With Human-in-the-Loop Assessment Metrics. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3826-3837 (2024) - [c237]I-Chun Liu, Chun-Jui Chen, Xiu-Zhu Li, Yong-Qi Cheng, Chung-Wei Huang, Pin-Han Lin, Hsuan-Wei Pu, Sheng-Yu Peng, Yu Tsao:
The Multilayer Neural Network Implementation Using SRAM-Based Reconfigurable Cognitive Computation Matrices. AICAS 2024: 467-471 - [c236]Ryandhimas E. Zezario, Bo-Ren Brian Bai, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model. ICASSP 2024: 831-835 - [c235]Yu-Tung Liu, Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao:
SDEMG: Score-Based Diffusion Model for Surface Electromyographic Signal Denoising. ICASSP 2024: 1736-1740 - [c234]Haibin Wu, Heng-Cheng Kuo, Yu Tsao, Hung-Yi Lee:
Scalable Ensemble-Based Detection Method Against Adversarial Attacks For Speaker Verification. ICASSP 2024: 4670-4674 - [c233]Yuan Tseng, Layne Berry, Yiting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Poyao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Abdelrahman Mohamed, Chi-Luen Feng, Hung-Yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. ICASSP 2024: 6890-6894 - [c232]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-Based ASR. ICASSP 2024: 13116-13120 - [c231]Yi-Heng Lin, Wen-Hsuan Tseng, Li-Chin Chen, Ching-Ting Tan, Yu Tsao:
Lightly Weighted Automatic Audio Parameter Extraction for the Quality Assessment of Consensus Auditory-Perceptual Evaluation of Voice. ICCE 2024: 1-6 - [c230]Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao, Yu-Chiang Frank Wang:
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech. ICLR 2024 - [c229]Ryandhimas E. Zezario, Yu-Wen Chen, Szu-Wei Fu, Yu Tsao, Hsin-Min Wang, Chiou-Shann Fuh:
A Study On Incorporating Whisper For Robust Speech Assessment. ICME 2024: 1-6 - [c228]Li-Chin Chen, Jung-Nien Lai, Hung-En Lin, Hsien-Te Chen, Kuo-Hsuan Hung, Yu Tsao:
Prognosticating Lumbar Spinal Surgery Outcomes for Low Back Pain and Sciatica Patients by Utilizing Preoperative Assessments from Western and Eastern Medicine and Multimodal Fusion Learning Techniques. ICMHI 2024: 262-267 - [i155]Dyah A. M. G. Wisnu, Epri W. Pratiwi, Stefano Rini, Ryandhimas E. Zezario, Hsin-Min Wang, Yu Tsao:
HAAQI-Net: A non-intrusive neural music quality assessment model for hearing aids. CoRR abs/2401.01145 (2024) - [i154]Yu-Tung Liu, Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao:
SDEMG: Score-based Diffusion Model for Surface Electromyographic Signal Denoising. CoRR abs/2402.03808 (2024) - [i153]Cho-Yuan Lee, Kuan-Chen Wang, Kai-Chun Liu, Xugang Lu, Ping-Cheng Yeh, Yu Tsao:
A Non-Intrusive Neural Quality Assessment Model for Surface Electromyography Signals. CoRR abs/2402.05482 (2024) - [i152]Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao, Yu-Chiang Frank Wang:
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech. CoRR abs/2402.16321 (2024) - [i151]Tassadaq Hussain, Kia Dashtipour, Yu Tsao, Amir Hussain:
Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues. CoRR abs/2402.16394 (2024) - [i150]Jasper Kirton-Wingate, Shafique Ahmed, Adeel Hussain, Mandar Gogate, Kia Dashtipour, Jen-Cheng Hou, Tassadaq Hussain, Yu Tsao, Amir Hussain:
Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids. CoRR abs/2402.16757 (2024) - [i149]Ammarah Hashmi, Sahibzada Adil Shahzad, Chia-Wen Lin, Yu Tsao, Hsin-Min Wang:
Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes. CoRR abs/2405.04097 (2024) - [i148]Rong Chao, Wen-Huang Cheng, Moreno La Quatra, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Szu-Wei Fu, Yu Tsao:
An Investigation of Incorporating Mamba for Speech Enhancement. CoRR abs/2405.06573 (2024) - [i147]Whenty Ariyanti, Kai-Chun Liu, Kuan-Yu Chen, Yu Tsao:
Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer. CoRR abs/2405.08342 (2024) - [i146]Chun Yin, Tai-Shih Chi, Yu Tsao, Hsin-Min Wang:
SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models. CoRR abs/2406.08445 (2024) - [i145]Kuan-Chen Wang, You-Jin Li, Wei-Lun Chen, Yu-Wen Chen, Yi-Ching Wang, Ping-Cheng Yeh, Chao Zhang, Yu Tsao:
Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition. CoRR abs/2406.12699 (2024) - [i144]Wenze Ren, Yi-Cheng Lin, Huang-Cheng Chou, Haibin Wu, Yi-Chiao Wu, Chi-Chun Lee, Hung-yi Lee, Yu Tsao:
EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations. CoRR abs/2407.15458 (2024) - [i143]Muhammad Salman Khan, Moreno La Quatra, Kuo-Hsuan Hung, Szu-Wei Fu, Sabato Marco Siniscalchi, Yu Tsao:
Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-based Speech Enhancement. CoRR abs/2408.04773 (2024) - [i142]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR. CoRR abs/2409.02239 (2024) - [i141]Wen-Chin Huang, Szu-Wei Fu, Erica Cooper, Ryandhimas E. Zezario, Tomoki Toda, Hsin-Min Wang, Junichi Yamagishi, Yu Tsao:
The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction. CoRR abs/2409.07001 (2024) - [i140]Jiawei Du, I-Ming Lin, I-Hsiang Chiu, Xuanjun Chen, Haibin Wu, Wenze Ren, Yu Tsao, Hung-yi Lee, Jyh-Shing Roger Jang:
DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset. CoRR abs/2409.08731 (2024) - [i139]Chao-Han Huck Yang, Taejin Park, Yuan Gong, Yuanchao Li, Zhehuai Chen, Yen-Ting Lin, Chen Chen, Yuchen Hu, Kunal Dhawan, Piotr Zelasko, Chao Zhang, Yun-Nung Chen, Yu Tsao, Jagadeesh Balam, Boris Ginsburg, Sabato Marco Siniscalchi, Eng Siong Chng, Peter Bell, Catherine Lai, Shinji Watanabe, Andreas Stolcke:
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition. CoRR abs/2409.09785 (2024) - 2023
- [j71]Fei Chen, Yu Tsao:
Advances in biomedical signal processing for communication disorders. Biomed. Signal Process. Control. 80(Part): 104346 (2023) - [j70]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing. J. Open Source Softw. 8(91): 5403 (2023) - [j69]Chin-Yi Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
Multi-Target Extractor and Detector for Unknown-Number Speaker Diarization. IEEE Signal Process. Lett. 30: 638-642 (2023) - [j68]Ryandhimas E. Zezario, Szu-Wei Fu, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Deep Learning-Based Non-Intrusive Multi-Objective Speech Assessment Model With Cross-Domain Features. IEEE ACM Trans. Audio Speech Lang. Process. 31: 54-70 (2023) - [j67]Yen-Ju Lu, Chia-Yu Chang, Cheng Yu, Ching-Feng Liu, Jeih-weih Hung, Shinji Watanabe, Yu Tsao:
Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2738-2750 (2023) - [j66]Heng-Cheng Kuo, Yu-Peng Hsieh, Huan-Hsin Tseng, Chi-Te Wang, Shih-Hau Fang, Yu Tsao:
Toward Real-World Voice Disorder Classification. IEEE Trans. Biomed. Eng. 70(10): 2922-2932 (2023) - [j65]Tsai-Min Chen, Yuan-Hong Tsai, Huan-Hsin Tseng, Kai-Chun Liu, Jhih-Yu Chen, Chih-Han Huang, Guo-Yuan Li, Chun-Yen Shen, Yu Tsao:
SRECG: ECG Signal Super-Resolution Framework for Portable/Wearable Devices in Cardiac Arrhythmias Classification. IEEE Trans. Consumer Electron. 69(3): 250-260 (2023) - [c227]Hsin-Tien Chiang, Kuo-Hsuan Hung, Szu-Wei Fu, Heng-Cheng Kuo, Ming-Hsueh Tsai, Yu Tsao:
Study on the Correlation Between Objective Evaluations and Subjective Speech Quality and Intelligibility. ASRU 2023: 1-7 - [c226]Erica Cooper, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi:
The Voicemos Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains. ASRU 2023: 1-7 - [c225]Chi-Chang Lee, Hong-Wei Chen, Chu-Song Chen, Hsin-Min Wang, Tsung-Te Liu, Yu Tsao:
LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models. ASRU 2023: 1-8 - [c224]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Cross-Modal Alignment With Optimal Transport For CTC-Based ASR. ASRU 2023: 1-7 - [c223]Whenty Ariyanti, Kai-Chun Liu, Kuan-Yu Chen, Yu Tsao:
Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer. EMBC 2023: 1-4 - [c222]En-Ping Chu, Kai-Chun Liu, Chia-Yeh Hsieh, Chih-Ya Chang, Yu Tsao, Chia-Tai Chan:
Multi-Task Learning U-Net for Functional Shoulder Sub-Task Segmentation. EMBC 2023: 1-5 - [c221]I-Chun Chern, Kuo-Hsuan Hung, Yi-Ting Chen, Tassadaq Hussain, Mandar Gogate, Amir Hussain, Yu Tsao, Jen-Cheng Hou:
Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings. ICASSP Workshops 2023: 1-5 - [c220]Tin-Han Chi, Kai-Chun Liu, Chia-Yeh Hsieh, Yu Tsao, Chia-Tai Chan:
Prefallkd: Pre-Impact Fall Detection Via CNN-ViT Knowledge Distillation. ICASSP 2023: 1-5 - [c219]Chan-Jan Hsu, Ho-Lam Chung, Hung-Yi Lee, Yu Tsao:
T5lephone: Bridging Speech and Text Self-Supervised Models for Spoken Language Understanding Via Phoneme Level T5. ICASSP 2023: 1-5 - [c218]Jasper Kirton-Wingate, Shafique Ahmed, Mandar Gogate, Yu Tsao, Amir Hussain:
Towards Individualised Speech Enhancement: An SNR Preference Learning System for Multi-Modal Hearing Aids. ICASSP Workshops 2023: 1-5 - [c217]Hsin-Yi Lin, Huan-Hsin Tseng, Yu Tsao:
On the Robustness of Non-Intrusive Speech Quality Model by Adversarial Examples. ICASSP 2023: 1-5 - [c216]Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao:
ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks. ICASSP 2023: 1-5 - [c215]Chi-Chang Lee, Yu Tsao, Hsin-Min Wang, Chu-Song Chen:
D4AM: A General Denoising Framework for Downstream Acoustic Models. ICLR 2023 - [c214]Huan-Hsin Tseng, Hsin-Yi Lin, Kuo-Hsuan Hung, Yu Tsao:
Interpretations of Domain Adaptations via Layer Variational Analysis. ICLR 2023 - [c213]Li-Wei Chen, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech. INTERSPEECH 2023: 2473-2477 - [c212]Hao Yen, Pin-Jui Ku, Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Yu Tsao:
Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition. INTERSPEECH 2023: 3317-3321 - [c211]Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen, Shu-Wei Tsai, Tai-Shih Chi, Hsin-Min Wang, Yu Tsao:
Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features. INTERSPEECH 2023: 5018-5022 - [c210]Yung-Lun Chien, Hsin-Hao Chen, Ming-Chi Yen, Shu-Wei Tsai, Hsin-Min Wang, Yu Tsao, Tai-Shih Chi:
Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion. INTERSPEECH 2023: 5023-5026 - [c209]Chien-Pin Liu, Ju-Hsuan Li, En-Ping Chu, Chia-Yeh Hsieh, Kai-Chun Liu, Chia-Tai Chan, Yu Tsao:
Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks. MeMeA 2023: 1-5 - [c208]I-Chun Chern, Steffi Chern, Heng-Cheng Kuo, Huan-Hsin Tseng, Kuo-Hsuan Hung, Yu Tsao:
Voice Direction-Of-Arrival Conversion. MLSP 2023: 1-6 - [c207]Tsun-An Hsieh, Chao-Han Huck Yang, Pin-Yu Chen, Sabato Marco Siniscalchi, Yu Tsao:
Inference and Denoise: Causal Inference-Based Neural Speech Enhancement. MLSP 2023: 1-6 - [c206]Wen-Yuan Ting, Syu-Siang Wang, Yu Tsao, Borching Su:
IANS: Intelligibility-Aware Null-Steering Beamforming for Dual-Microphone Arrays. MLSP 2023: 1-6 - [c205]Chih-Hsing Chen, Kai-Chun Liu, Ting-Yang Lu, Chih-Ya Chang, Chia-Tai Chan, Yu Tsao:
Wearable-based Pain Assessment in Patients with Adhesive Capsulitis Using Machine Learning. NER 2023: 1-4 - [d2]Ying-Ren Chien, Po-Heng Chou, You-Jie Peng, Chun-Yuan Huang, Hen-Wai Tsao, Yu Tsao:
Cyclostationary Impulse Noise Dataset. IEEE DataPort, 2023 - [d1]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing (espnet-v.202310). Zenodo, 2023 - [i138]Yu-Wen Chen, Hsin-Min Wang, Yu Tsao:
BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm. CoRR abs/2301.04120 (2023) - [i137]Huan-Hsin Tseng, Hsin-Yi Lin, Kuo-Hsuan Hung, Yu Tsao:
Interpretations of Domain Adaptations via Layer Variational Analysis. CoRR abs/2302.01798 (2023) - [i136]Tin-Han Chi, Kai-Chun Liu, Chia-Yeh Hsieh, Yu Tsao, Chia-Tai Chan:
PreFallKD: Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation. CoRR abs/2303.03634 (2023) - [i135]Li-Chin Chen, Kuo-Hsuan Hung, Yi-Ju Tseng, Hsin-Yao Wang, Tse-Min Lu, Wei-Chieh Huang, Yu Tsao:
Self-supervised based general laboratory progress pretrained model for cardiovascular event detection. CoRR abs/2303.06980 (2023) - [i134]Li-Chin Chen, Jung-Nien Lai, Hung-En Lin, Hsien-Te Chen, Kuo-Hsuan Hung, Yu Tsao:
Preoperative Prognosis Assessment of Lumbar Spinal Surgery for Low Back Pain and Sciatica Patients based on Multimodalities and Multimodal Learning. CoRR abs/2303.09085 (2023) - [i133]Chien-Pin Liu, Ju-Hsuan Li, En-Ping Chu, Chia-Yeh Hsieh, Kai-Chun Liu, Chia-Tai Chan, Yu Tsao:
Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks. CoRR abs/2304.06335 (2023) - [i132]Enoch Hsin-Ho Huang, Rong Chao, Yu Tsao, Chao-Min Wu:
ElectrodeNet - A Deep Learning Based Sound Coding Strategy for Cochlear Implants. CoRR abs/2305.16753 (2023) - [i131]Yung-Lun Chien, Hsin-Hao Chen, Ming-Chi Yen, Shu-Wei Tsai, Hsin-Min Wang, Yu Tsao, Tai-Shih Chi:
Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion. CoRR abs/2306.06652 (2023) - [i130]Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen, Shu-Wei Tsai, Yu Tsao, Tai-Shih Chi, Hsin-Min Wang:
Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features. CoRR abs/2306.06653 (2023) - [i129]Li-Chin Chen, Yi-Heng Lin, Li-Ning Peng, Feng-Ming Wang, Yu-Hsin Chen, Po-Hsun Huang, Shang-Feng Yang, Yu Tsao:
Deep denoising autoencoder-based non-invasive blood flow detection for arteriovenous fistula. CoRR abs/2306.06865 (2023) - [i128]Ryandhimas E. Zezario, Bo-Ren Brian Bai, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model. CoRR abs/2308.09262 (2023) - [i127]Yu-Wen Chen, Julia Hirschberg, Yu Tsao:
Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement. CoRR abs/2309.01164 (2023) - [i126]Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids. CoRR abs/2309.09548 (2023) - [i125]Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. CoRR abs/2309.10787 (2023) - [i124]Shafique Ahmed, Chia-Wei Chen, Wenze Ren, Chin-Jou Li, Ernie Chu, Jun-Cheng Chen, Amir Hussain, Hsin-Min Wang, Yu Tsao, Jen-Cheng Hou:
Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement. CoRR abs/2309.11059 (2023) - [i123]Ryandhimas E. Zezario, Yu-Wen Chen, Szu-Wei Fu, Yu Tsao, Hsin-Min Wang, Chiou-Shann Fuh:
A Study on Incorporating Whisper for Robust Speech Assessment. CoRR abs/2309.12766 (2023) - [i122]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Cross-modal Alignment with Optimal Transport for CTC-based ASR. CoRR abs/2309.13650 (2023) - [i121]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR. CoRR abs/2309.16093 (2023) - [i120]Ammarah Hashmi, Sahibzada Adil Shahzad, Chia-Wen Lin, Yu Tsao, Hsin-Min Wang:
AVTENet: Audio-Visual Transformer-based Ensemble Network Exploiting Multiple Experts for Video Deepfake Detection. CoRR abs/2310.13103 (2023) - [i119]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Neural domain alignment for spoken language recognition based on optimal transport. CoRR abs/2310.13471 (2023) - [i118]Sahibzada Adil Shahzad, Ammarah Hashmi, Yan-Tsung Peng, Yu Tsao, Hsin-Min Wang:
AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Video Deepfake Detection. CoRR abs/2311.02733 (2023) - [i117]Hsin-Tien Chiang, Szu-Wei Fu, Hsin-Min Wang, Yu Tsao, John H. L. Hansen:
Multi-objective Non-intrusive Hearing-aid Speech Assessment Model. CoRR abs/2311.08878 (2023) - [i116]Yi-Heng Lin, Wen-Hsuan Tseng, Li-Chin Chen, Ching-Ting Tan, Yu Tsao:
Lightly Weighted Automatic Audio Parameter Extraction for the Quality Assessment of Consensus Auditory-Perceptual Evaluation of Voice. CoRR abs/2311.15582 (2023) - [i115]Chi-Chang Lee, Yu Tsao, Hsin-Min Wang, Chu-Song Chen:
D4AM: A General Denoising Framework for Downstream Acoustic Models. CoRR abs/2311.16595 (2023) - [i114]Chi-Chang Lee, Hong-Wei Chen, Chu-Song Chen, Hsin-Min Wang, Tsung-Te Liu, Yu Tsao:
LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models. CoRR abs/2311.16604 (2023) - [i113]Haibin Wu, Heng-Cheng Kuo, Yu Tsao, Hung-yi Lee:
Scalable Ensemble-based Detection Method against Adversarial Attacks for speaker verification. CoRR abs/2312.08622 (2023) - 2022
- [j64]Yu-Wen Chen, Kuo-Hsuan Hung, You-Jin Li, Alexander Chao-Fu Kang, Ya-Hsin Lai, Kai-Chun Liu, Szu-Wei Fu, Syu-Siang Wang, Yu Tsao:
CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application. IEEE Access 10: 46082-46099 (2022) - [j63]Yi Lin, Yu Tsao, Po-Jang Hsieh:
Neural correlates of individual differences in predicting ambiguous sounds comprehension level. NeuroImage 251: 119012 (2022) - [j62]Cheng-Hung Hu, Yu-Huai Peng, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang:
SVSNet: An End-to-End Speaker Voice Similarity Assessment Model. IEEE Signal Process. Lett. 29: 767-771 (2022) - [j61]Lichin Chen, Po-Hsun Chen, Richard Tzong-Han Tsai, Yu Tsao:
EPG2S: Speech Generation and Speech Enhancement Based on Electropalatography and Audio Signals Using Multimodal Learning. IEEE Signal Process. Lett. 29: 2582-2586 (2022) - [j60]Tassadaq Hussain, Wei-Chien Wang, Mandar Gogate, Kia Dashtipour, Yu Tsao, Xugang Lu, Ahsan Adeel, Amir Hussain:
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement. IEEE Trans. Artif. Intell. 3(5): 833-842 (2022) - [j59]Kai-Chun Liu, Kuo-Hsuan Hung, Chia-Yeh Hsieh, Hsiang-Yun Huang, Chia-Tai Chan, Yu Tsao:
Deep-Learning-Based Signal Enhancement of Low-Resolution Accelerometer for Fall Detection Systems. IEEE Trans. Cogn. Dev. Syst. 14(3): 1270-1281 (2022) - [j58]Yu-Chen Lin, Cheng Yu, Yi-Te Hsu, Szu-Wei Fu, Yu Tsao, Tei-Wei Kuo:
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1016-1031 (2022) - [j57]Shang-Yi Chuang, Hsin-Min Wang, Yu Tsao:
Improved Lite Audio-Visual Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1345-1359 (2022) - [c204]Chan-Jan Hsu, Hung-yi Lee, Yu Tsao:
XDBERT: Distilling Visual Information to BERT from Cross-Modal Systems to Improve Language Understanding. ACL (2) 2022: 479-489 - [c203]Syu-Siang Wang, Yu Tsao, Wei-Zhong Zheng, Hsiu-Wei Yeh, Pei-Chun Li, Shih-Hau Fang, Ying-Hui Lai:
Dysarthric Speech Enhancement Based on Convolution Neural Network. EMBC 2022: 60-64 - [c202]Tassadaq Hussain, Muhammad Diyan, Mandar Gogate, Kia Dashtipour, Ahsan Adeel, Yu Tsao, Amir Hussain:
A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning. EMBC 2022: 2581-2584 - [c201]Zicheng Feng, Yu Tsao, Fei Chen:
Recurrent Neural Network-based Estimation and Correction of Relative Transfer Function for Preserving Spatial Cues in Speech Separation. EUSIPCO 2022: 155-159 - [c200]Bo-Rong Chen, Hsin-Tien Chiang, Heng-Cheng Kuo, Yu Tsao, Yih-Chun Hu:
Key Generation with Ambient Audio. GLOBECOM 2022: 5510-5515 - [c199]Yu-Chen Lin, Tsun-An Hsieh, Kuo-Hsuan Hung, Cheng Yu, Harinath Garudadri, Yu Tsao, Tei-Wei Kuo:
Speech Recovery For Real-World Self-Powered Intermittent Devices. ICASSP 2022: 26-30 - [c198]Kuan-Chen Wang, Kai-Chun Liu, Hsin-Min Wang, Yu Tsao:
EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement. ICASSP 2022: 1116-1120 - [c197]Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe, Alexander Richard, Cheng Yu, Yu Tsao:
Conditional Diffusion Probabilistic Model for Speech Enhancement. ICASSP 2022: 7402-7406 - [c196]Szu-Wei Fu, Cheng Yu, Kuo-Hsuan Hung, Mirco Ravanelli, Yu Tsao:
MetricGAN-U: Unsupervised Speech Enhancement/ Dereverberation Based Only on Noisy/ Reverberated Speech. ICASSP 2022: 7412-7416 - [c195]Guan-Ting Lin, Chan-Jan Hsu, Da-Rong Liu, Hung-Yi Lee, Yu Tsao:
Analyzing The Robustness of Unsupervised Speech Recognition. ICASSP 2022: 8202-8206 - [c194]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Yu Tsao, Pin-Yu Chen:
When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing. ICASSP 2022: 8602-8606 - [c193]