


Остановите войну!
for scientists:


default search action
Dong Yu 0001
俞栋
Person information

- unicode name: 俞栋
- affiliation: Tencent AI Lab, China
- affiliation (1998 - 2017): Microsoft Research, Redmond, WA, USA
- affiliation (PhD): University of Idaho, Moscow, ID, USA
Other persons with the same name
- Dong Yu — disambiguation page
- Dong Yu 0002
— Xi'an Jiaotong University, Institution of Advanced Manufacturing and Technology, China
- Dong Yu 0003 — Beijing Language and Culture University, Beijing, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j58]Jinchuan Tian
, Jianwei Yu
, Chao Weng, Yuexian Zou
, Dong Yu
:
Integrating Lattice-Free MMI Into End-to-End Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 25-38 (2023) - [j57]Rongzhi Gu
, Shi-Xiong Zhang, Yuexian Zou
, Dong Yu
:
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 849-862 (2023) - [i144]Katerina Zmolíková, Marc Delcroix, Tsubasa Ochiai, Keisuke Kinoshita, Jan Cernocký, Dong Yu:
Neural Target Speech Extraction: An Overview. CoRR abs/2301.13341 (2023) - [i143]Dongchao Yang, Songxiang Liu, Rongjie Huang, Guangzhi Lei, Chao Weng, Helen Meng, Dong Yu:
InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt. CoRR abs/2301.13662 (2023) - [i142]Rongzhi Gu, Shi-Xiong Zhang, Dong Yu:
3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty. CoRR abs/2302.13462 (2023) - 2022
- [j56]Jiatong Shi
, Chunlei Zhang
, Chao Weng, Shinji Watanabe
, Meng Yu, Dong Yu
:
An investigation of neural uncertainty estimation for target speaker extraction equipped RNN transducer. Comput. Speech Lang. 73: 101327 (2022) - [j55]Aswin Shanmugam Subramanian
, Chao Weng, Shinji Watanabe
, Meng Yu, Dong Yu
:
Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition. Comput. Speech Lang. 75: 101360 (2022) - [j54]Chunlei Zhang
, Dong Yu
:
C3-DINO: Joint Contrastive and Non-Contrastive Self-Supervised Learning for Speaker Verification. IEEE J. Sel. Top. Signal Process. 16(6): 1273-1283 (2022) - [j53]Jinchuan Tian
, Jianwei Yu
, Chao Weng, Yuexian Zou
, Dong Yu
:
Improving Mandarin End-to-End Speech Recognition With Word N-Gram Language Model. IEEE Signal Process. Lett. 29: 812-816 (2022) - [j52]Linchao Bao
, Xiangkai Lin, Yajing Chen, Haoxian Zhang, Sheng Wang, Xuefei Zhe, Di Kang, Haozhi Huang, Xinwei Jiang, Jue Wang, Dong Yu, Zhengyou Zhang:
High-Fidelity 3D Digital Human Head Creation from RGB-D Selfies. ACM Trans. Graph. 41(1): 3:1-3:21 (2022) - [c247]Lisa Jin, Linfeng Song, Lifeng Jin, Dong Yu, Daniel Gildea:
Hierarchical Context Tagging for Utterance Rewriting. AAAI 2022: 10849-10857 - [c246]Chao Zhao, Wenlin Yao, Dian Yu, Kaiqiang Song, Dong Yu, Jianshu Chen:
Learning-by-Narrating: Narrative Pre-Training for Zero-Shot Dialogue Comprehension. ACL (2) 2022: 212-218 - [c245]Xiang Yue, Xiaoman Pan, Wenlin Yao, Dian Yu, Dong Yu, Jianshu Chen:
C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References. ACL (2) 2022: 371-377 - [c244]Irene Li, Linfeng Song, Kun Xu, Dong Yu:
Variational Graph Autoencoding as Cheap Supervision for AMR Coreference Resolution. ACL (1) 2022: 2790-2800 - [c243]Kaiqiang Song, Chen Li, Xiaoyang Wang, Dong Yu, Fei Liu:
Towards Abstractive Grounded Summarization of Podcast Transcripts. ACL (1) 2022: 4407-4418 - [c242]Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Claire Cardie:
Improving Machine Reading Comprehension with Contextualized Commonsense Knowledge. ACL (1) 2022: 8736-8747 - [c241]Sangwoo Cho, Kaiqiang Song, Xiaoyang Wang, Fei Liu, Dong Yu:
Toward Unifying Text Segmentation and Long Document Summarization. EMNLP 2022: 106-118 - [c240]Songyang Zhang, Linfeng Song, Lifeng Jin, Haitao Mi, Kun Xu, Dong Yu, Jiebo Luo:
Learning a Grammar Inducer from Massive Uncurated Instructional Videos. EMNLP 2022: 233-247 - [c239]Yinya Huang, Hongming Zhang, Ruixin Hong, Xiaodan Liang, Changshui Zhang, Dong Yu:
MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure. EMNLP 2022: 4698-4724 - [c238]Anton Ratnarajah, Shi-Xiong Zhang, Meng Yu, Zhenyu Tang, Dinesh Manocha, Dong Yu:
Fast-Rir: Fast Neural Diffuse Room Impulse Response Generator. ICASSP 2022: 571-575 - [c237]Yiwen Shao, Shi-Xiong Zhang, Dong Yu:
Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature. ICASSP 2022: 6067-6071 - [c236]Songxiang Liu, Shan Yang, Dan Su, Dong Yu:
Referee: Towards Reference-Free Cross-Speaker Style Transfer with Low-Quality Data for Expressive Speech Synthesis. ICASSP 2022: 6307-6311 - [c235]Brian Yan, Chunlei Zhang, Meng Yu, Shi-Xiong Zhang, Siddharth Dalmia, Dan Berrebbi, Chao Weng, Shinji Watanabe
, Dong Yu:
Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization. ICASSP 2022: 6412-6416 - [c234]Jiachen Lian, Chunlei Zhang, Dong Yu:
Robust Disentangled Variational Speech Representation Learning for Zero-Shot Voice Conversion. ICASSP 2022: 6572-6576 - [c233]Zhao You, Shulin Feng, Dan Su, Dong Yu:
Speechmoe2: Mixture-of-Experts Model with Improved Routing. ICASSP 2022: 7217-7221 - [c232]Disong Wang, Shan Yang, Dan Su, Xunying Liu, Dong Yu, Helen Meng:
VCVTS: Multi-Speaker Video-to-Speech Synthesis Via Cross-Modal Knowledge Transfer from Voice Conversion. ICASSP 2022: 7252-7256 - [c231]Dongpeng Ma, Yiwen Wang, Liqiang He, Mingjie Jin, Dan Su, Dong Yu:
DP-DWA: Dual-Path Dynamic Weight Attention Network With Streaming Dfsmn-San For Automatic Speech Recognition. ICASSP 2022: 7692-7696 - [c230]Jinchuan Tian, Jianwei Yu, Chao Weng, Shi-Xiong Zhang, Dan Su, Dong Yu, Yuexian Zou:
Consistent Training and Decoding for End-to-End Speech Recognition Using Lattice-Free MMI. ICASSP 2022: 7782-7786 - [c229]Chunlei Zhang, Jiatong Shi, Chao Weng, Meng Yu, Dong Yu:
Towards end-to-end Speaker Diarization with Generalized Neural Speaker Clustering. ICASSP 2022: 8372-8376 - [c228]Pei Chen, Wenlin Yao, Hongming Zhang, Xiaoman Pan, Dian Yu, Dong Yu, Jianshu Chen:
ZeroKBC: A Comprehensive Benchmark for Zero-Shot Knowledge Base Completion. ICDM (Workshops) 2022: 1-6 - [c227]Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu:
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis. ICLR 2022 - [c226]Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao:
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. IJCAI 2022: 4157-4163 - [c225]Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu:
Joint Neural AEC and Beamforming with Double-Talk Detection. INTERSPEECH 2022: 2528-2532 - [c224]Jiachen Lian, Chunlei Zhang, Gopala Krishna Anumanchipalli, Dong Yu:
Towards Improved Zero-shot Voice Conversion with Conditional DSVAE. INTERSPEECH 2022: 2598-2602 - [c223]Jinchuan Tian, Jianwei Yu, Chunlei Zhang, Yuexian Zou, Dong Yu:
LAE: Language-Aware Encoder for Monolingual and Multilingual ASR. INTERSPEECH 2022: 3178-3182 - [c222]Ziqian Dai, Jianwei Yu, Yan Wang, Nuo Chen, Yanyao Bian, Guangzhi Li, Deng Cai, Dong Yu:
Automatic Prosody Annotation with Pre-Trained Text-Speech Model. INTERSPEECH 2022: 5513-5517 - [c221]Zhao You, Shulin Feng, Dan Su, Dong Yu:
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition. ISCSLP 2022: 170-174 - [c220]Jianhua Tao, Jiangyan Yi, Cunhang Fan, Ruibo Fu, Shan Liang, Pengyuan Zhang, Haizhou Li, Helen Meng, Dong Yu, Masato Akagi:
DDAM '22: 1st International Workshop on Deepfake Detection for Audio Multimedia. ACM Multimedia 2022: 7405-7406 - [c219]Dian Yu, Ben Zhou, Dong Yu:
End-to-End Chinese Speaker Identification. NAACL-HLT 2022: 2274-2285 - [c218]Junyi Peng, Chunlei Zhang, Jan Honza Cernocký, Dong Yu:
Progressive Contrastive Learning for Self-Supervised Text-Independent Speaker Verification. Odyssey 2022: 17-24 - [c217]Jia Cui, Heng Lu, Wenjie Wang, Shiyin Kang, Liqiang He, Guangzhi Li, Dong Yu:
Efficient Text Analysis with Pre-Trained Neural Network Models. SLT 2022: 671-676 - [c216]Zhenyi Wang, Xiaoyang Wang, Li Shen, Qiuling Suo, Kaiqiang Song, Dong Yu, Yan Shen, Mingchen Gao:
Meta-learning without data via Wasserstein distributionally-robust model fusion. UAI 2022: 2045-2055 - [e1]Jianhua Tao, Haizhou Li, Helen Meng, Dong Yu, Masato Akagi, Jiangyan Yi, Cunhang Fan, Ruibo Fu, Shan Lian, Pengyuan Zhang:
DDAM@MM 2022: Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia, Lisboa, Portugal, 14 October 2022. ACM 2022, ISBN 978-1-4503-9496-3 [contents] - [i141]Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu:
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model. CoRR abs/2201.01995 (2022) - [i140]Songxiang Liu, Dan Su, Dong Yu:
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs. CoRR abs/2201.11972 (2022) - [i139]Disong Wang, Shan Yang, Dan Su, Xunying Liu, Dong Yu, Helen Meng:
VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion. CoRR abs/2202.09081 (2022) - [i138]Xiang Yue, Xiaoman Pan, Wenlin Yao, Dian Yu, Dong Yu, Jianshu Chen:
C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References. CoRR abs/2203.08928 (2022) - [i137]Chao Zhao, Wenlin Yao, Dian Yu, Kaiqiang Song, Dong Yu, Jianshu Chen:
Learning-by-Narrating: Narrative Pre-Training for Zero-Shot Dialogue Comprehension. CoRR abs/2203.10249 (2022) - [i136]Kaiqiang Song, Chen Li, Xiaoyang Wang, Dong Yu, Fei Liu:
Towards Abstractive Grounded Summarization of Podcast Transcripts. CoRR abs/2203.11425 (2022) - [i135]Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu:
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis. CoRR abs/2203.13508 (2022) - [i134]Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu:
Integrate Lattice-Free MMI into End-to-End Speech Recognition. CoRR abs/2203.15614 (2022) - [i133]Jiachen Lian, Chunlei Zhang, Dong Yu:
Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice Conversion. CoRR abs/2203.16705 (2022) - [i132]Zhao You, Shulin Feng, Dan Su, Dong Yu:
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition. CoRR abs/2204.03178 (2022) - [i131]Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao:
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. CoRR abs/2204.09934 (2022) - [i130]Lifeng Jin, Kun Xu, Linfeng Song, Dong Yu:
Distant finetuning with discourse relations for stance classification. CoRR abs/2204.12693 (2022) - [i129]Jiachen Lian, Chunlei Zhang, Gopala Krishna Anumanchipalli, Dong Yu:
Towards Improved Zero-shot Voice Conversion with Conditional DSVAE. CoRR abs/2205.05227 (2022) - [i128]Meng Yu, Yong Xu, Chunlei Zhang, Shi-Xiong Zhang, Dong Yu:
NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement. CoRR abs/2205.10401 (2022) - [i127]Jinchuan Tian, Jianwei Yu, Chunlei Zhang, Chao Weng, Yuexian Zou, Dong Yu:
LAE: Language-Aware Encoder for Monolingual and Multilingual ASR. CoRR abs/2206.02093 (2022) - [i126]Jiachen Lian, Chunlei Zhang, Gopala Krishna Anumanchipalli, Dong Yu:
UTTS: Unsupervised TTS with Conditional Disentangled Sequential Variational Auto-encoder. CoRR abs/2206.02512 (2022) - [i125]Ziqian Dai, Jianwei Yu, Yan Wang, Nuo Chen, Yanyao Bian, Guangzhi Li, Deng Cai, Dong Yu:
Automatic Prosody Annotation with Pre-Trained Text-Speech Model. CoRR abs/2206.07956 (2022) - [i124]Lisa Jin, Linfeng Song, Lifeng Jin, Dong Yu, Daniel Gildea:
Hierarchical Context Tagging for Utterance Rewriting. CoRR abs/2206.11218 (2022) - [i123]Dongchao Yang, Jianwei Yu, Helin Wang, Wen Wang, Chao Weng, Yuexian Zou, Dong Yu:
Diffsound: Discrete Diffusion Model for Text-to-sound Generation. CoRR abs/2207.09983 (2022) - [i122]Chunlei Zhang, Dong Yu:
C3-DINO: Joint Contrastive and Non-contrastive Self-Supervised Learning for Speaker Verification. CoRR abs/2208.07446 (2022) - [i121]Zhenhailong Wang, Xiaoman Pan, Dian Yu, Dong Yu, Jianshu Chen, Heng Ji:
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks. CoRR abs/2210.00185 (2022) - [i120]Ben Zhou, Dian Yu, Dong Yu, Dan Roth:
Cross-Lingual Speaker Identification Using Distant Supervision. CoRR abs/2210.05780 (2022) - [i119]Jinchuan Tian, Brian Yan, Jianwei Yu, Chao Weng, Dong Yu, Shinji Watanabe
:
Bayes risk CTC: Controllable CTC alignment in Sequence-to-Sequence tasks. CoRR abs/2210.07499 (2022) - [i118]Yue Yang, Wenlin Yao, Hongming Zhang, Xiaoyang Wang, Dong Yu, Jianshu Chen:
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination. CoRR abs/2210.12261 (2022) - [i117]Songyang Zhang
, Linfeng Song, Lifeng Jin, Haitao Mi, Kun Xu, Dong Yu, Jiebo Luo:
Learning a Grammar Inducer from Massive Uncurated Instructional Videos. CoRR abs/2210.12309 (2022) - [i116]Fei Wang, Kaiqiang Song, Hongming Zhang, Lifeng Jin, Sangwoo Cho, Wenlin Yao, Xiaoyang Wang, Muhao Chen, Dong Yu:
Salience Allocation as Guidance for Abstractive Summarization. CoRR abs/2210.12330 (2022) - [i115]Yinya Huang, Hongming Zhang, Ruixin Hong, Xiaodan Liang, Changshui Zhang, Dong Yu:
MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure. CoRR abs/2210.12487 (2022) - [i114]Sangwoo Cho, Kaiqiang Song, Xiaoyang Wang, Fei Liu, Dong Yu:
Toward Unifying Text Segmentation and Long Document Summarization. CoRR abs/2210.16422 (2022) - [i113]Xiaoman Pan, Wenlin Yao, Hongming Zhang, Dian Yu, Dong Yu, Jianshu Chen:
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models. CoRR abs/2210.16433 (2022) - [i112]Wenyue Hua, Lifeng Jin, Linfeng Song, Haitao Mi, Yongfeng Zhang, Dong Yu:
Discover, Explanation, Improvement: Automatic Slice Detection Framework for Natural Language Processing. CoRR abs/2211.04476 (2022) - [i111]Hongming Zhang, Wenlin Yao, Dong Yu:
Efficient Zero-shot Event Extraction with Context-Definition Alignment. CoRR abs/2211.05156 (2022) - [i110]Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu:
Deep Neural Mel-Subband Beamformer for In-car Speech Separation. CoRR abs/2211.12590 (2022) - [i109]Pei Chen, Wenlin Yao, Hongming Zhang, Xiaoman Pan, Dian Yu, Dong Yu, Jianshu Chen:
ZeroKBC: A Comprehensive Benchmark for Zero-Shot Knowledge Base Completion. CoRR abs/2212.03091 (2022) - [i108]Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu:
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation. CoRR abs/2212.08348 (2022) - 2021
- [j51]Rongzhi Gu
, Shi-Xiong Zhang, Yuexian Zou
, Dong Yu:
Complex Neural Spatial Filter: Enhancing Multi-Channel Target Speech Separation in Complex Domain. IEEE Signal Process. Lett. 28: 1370-1374 (2021) - [j50]Daniel Michelsanti
, Zheng-Hua Tan
, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu, Jesper Jensen:
An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1368-1396 (2021) - [j49]Jianwei Yu
, Shi-Xiong Zhang, Bo Wu, Shansong Liu
, Shoukang Hu
, Mengzhe Geng
, Xunying Liu
, Helen Meng, Dong Yu
:
Audio-Visual Multi-Channel Integration and Recognition of Overlapped Speech. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2067-2082 (2021) - [j48]Kun Xu
, Han Wu
, Linfeng Song, Haisong Zhang, Linqi Song
, Dong Yu:
Conversational Semantic Role Labeling. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2465-2475 (2021) - [j47]Zhuohuang Zhang
, Yong Xu, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Donald S. Williamson
, Dong Yu
:
Multi-Channel Multi-Frame ADL-MVDR for Target Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3526-3540 (2021) - [c215]Jun Wang, Max W. Y. Lam, Dan Su, Dong Yu:
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect. AAAI 2021: 13961-13969 - [c214]Lemao Liu, Haisong Zhang, Haiyun Jiang, Yangming Li, Enbo Zhao, Kun Xu, Linfeng Song, Suncong Zheng, Botong Zhou, Dick Zhu, Xiao Feng, Tao Chen, Tao Yang, Dong Yu, Feng Zhang, Zhanhui Kang, Shuming Shi:
TexSmart: A System for Enhanced Natural Language Understanding. ACL (demo) 2021: 1-10 - [c213]Tianqing Fang, Haojie Pan, Hongming Zhang, Yangqiu Song, Kun Xu, Dong Yu:
Do Boat and Ocean Suggest Beach? Dialogue Summarization with External Knowledge. AKBC 2021 - [c212]Huirong Huang, Zhiyong Wu, Shiyin Kang, Dongyang Dai, Jia Jia, Tianxiao Fu, Deyi Tuo, Guangzhi Lei, Peng Liu, Dan Su, Dong Yu, Helen Meng:
Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams. APSIPA ASC 2021: 1433-1437 - [c211]Liqiang He, Shulin Feng, Dan Su, Dong Yu:
Latency-Controlled Neural Architecture Search for Streaming Speech Recognition. ASRU 2021: 62-67 - [c210]Rongzhi Gu, Shi-Xiong Zhang, Meng Yu, Dong Yu:
3D Spatial Features for Multi-Channel Target Speech Separation. ASRU 2021: 996-1002 - [c209]Liwei Wang, Jing Huang, Yin Li, Kun Xu, Zhengyuan Yang, Dong Yu:
Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation. CVPR 2021: 14090-14100 - [c208]Dian Yu
, Kai Sun, Dong Yu, Claire Cardie:
Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question-Answering Data. EMNLP (Findings) 2021: 56-68 - [c207]Xintong Yu, Hongming Zhang, Yangqiu Song, Changshui Zhang, Kun Xu, Dong Yu:
Exophoric Pronoun Resolution in Dialogues with Topic Regularization. EMNLP (1) 2021: 3832-3845 - [c206]Jie Hao, Linfeng Song, Liwei Wang, Kun Xu, Zhaopeng Tu, Dong Yu:
RAST: Domain-Robust Dialogue Rewriting as Sequence Tagging. EMNLP (1) 2021: 4913-4924 - [c205]Lifeng Jin, Linfeng Song, Kun Xu, Dong Yu:
Instance-adaptive training with noise-robust losses against noisy labels. EMNLP (1) 2021: 5647-5663 - [c204]Wenlin Yao, Xiaoman Pan, Lifeng Jin, Jianshu Chen, Dian Yu, Dong Yu:
Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories. EMNLP (1) 2021: 7741-7751 - [c203]Jun Wang, Max W. Y. Lam, Dan Su, Dong Yu:
Contrastive Separative Coding for Self-Supervised Representation Learning. ICASSP 2021: 3865-3869 - [c202]Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu:
Sandglasset: A Light Multi-Granularity Self-Attentive Network for Time-Domain Speech Separation. ICASSP 2021: 5759-5763 - [c201]Zhuohuang Zhang, Yong Xu, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Dong Yu:
ADL-MVDR: All Deep Learning MVDR Beamformer for Target Speech Separation. ICASSP 2021: 6089-6093 - [c200]Xu Li, Na Li, Chao Weng, Xunying Liu, Dan Su, Dong Yu, Helen Meng:
Replay and Synthetic Speech Detection with Res2Net Architecture. ICASSP 2021: 6354-6358 - [c199]Chunlei Zhang, Meng Yu, Chao Weng, Dong Yu:
Towards Robust Speaker Verification with Target Speaker Enhancement. ICASSP 2021: 6693-6697 - [c198]Wei Xia, Chunlei Zhang, Chao Weng, Meng Yu, Dong Yu:
Self-Supervised Text-Independent Speaker Verification Using Prototypical Momentum Contrastive Learning. ICASSP 2021: 6723-6727 - [c197]Liqiang He, Dan Su, Dong Yu:
Learned Transferable Architectures Can Surpass Hand-Designed Architectures for Large Scale Speech Recognition. ICASSP 2021: 6788-6792 - [c196]Jiatong Shi, Chunlei Zhang, Chao Weng, Shinji Watanabe
, Meng Yu, Dong Yu:
Improving RNN Transducer with Target Speaker Extraction and Neural Uncertainty Estimation. ICASSP 2021: 6908-6912 - [c195]Aswin Shanmugam Subramanian
, Chao Weng, Shinji Watanabe
, Meng Yu, Yong Xu, Shi-Xiong Zhang, Dong Yu:
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization. ICASSP 2021: 8433-8437 - [c194]Max W. Y. Lam, Jun Wang, Chao Weng, Dan Su, Dong Yu:
Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition. Interspeech 2021: 316-320 - [c193]Helin Wang, Bo Wu, Lianwu Chen, Meng Yu, Jianwei Yu, Yong Xu, Shi-Xiong Zhang, Chao Weng, Dan Su, Dong Yu:
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation. Interspeech 2021: 1109-1113 - [c192]Xiyun Li, Yong Xu, Meng Yu, Shi-Xiong Zhang, Jiaming Xu, Bo Xu, Dong Yu:
MIMO Self-Attentive RNN Beamformer for Multi-Speaker Speech Separation. Interspeech 2021: 1119-1123 - [c191]Zhao You, Shulin Feng, Dan Su, Dong Yu:
SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts. Interspeech 2021: 2077-2081 - [c190]Meng Yu, Chunlei Zhang, Yong Xu, Shi-Xiong Zhang, Dong Yu:
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment. Interspeech 2021: 2142-2146 - [c189]Yong Xu, Zhuohuang Zhang, Meng Yu, Shi-Xiong Zhang, Dong Yu:
Generalized Spatio-Temporal RNN Beamformer for Target Speech Separation. Interspeech 2021: 3076-3080 - [c188]Saurabh Kataria, Shi-Xiong Zhang, Dong Yu:
Multi-Channel Speaker Verification for Single and Multi-Talker Speech. Interspeech 2021: 4608-4612 - [c187]Yuewen Cao, Songxiang Liu, Shiyin Kang, Na Hu, Peng Liu, Xunying Liu, Dan Su, Dong Yu, Helen Meng:
Exploring Cross-lingual Singing Voice Synthesis Using Speech Data. ISCSLP 2021: 1-5 - [c186]Songyang Zhang, Linfeng Song, Lifeng Jin, Kun Xu, Dong Yu, Jiebo Luo:
Video-aided Unsupervis