


Ya Li 0001
Person information
- affiliation: Beijing University of Posts and Telecommunications, School of Artificial Intelligence, Beijing, China
- affiliation (PhD 2012): Chinese Academy of Sciences (CAS), Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China
Other persons with the same name
- Ya Li — disambiguation page
2020 – today
- 2025
- [c68]Puyuan Guo, Tuo Hao, Wenxin Fu, Yingming Gao, Ya Li:
Controllable 3D Dance Generation Using Diffusion-Based Transformer U-Net. AAAI 2025: 3284-3292 - [c67]Huijun Lian, Zekai Sun, Keqi Chen, Yingming Gao, Ya Li:
Beyond Surface Simplicity: Revealing Hidden Reasoning Attributes for Precise Commonsense Diagnosis. ACL (1) 2025: 12820-12835 - [i20]Keqi Chen, Zekai Sun, Yuhua Wen, Huijun Lian, Yingming Gao, Ya Li:
Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling. CoRR abs/2503.03607 (2025) - [i19]Keqi Chen, Zekai Sun, Huijun Lian, Yingming Gao, Ya Li:
Psy-Copilot: Visual Chain of Thought for Counseling. CoRR abs/2503.03645 (2025) - [i18]Zheng Lian, Rui Liu, Kele Xu, Bin Liu, Xuefei Liu, Yazhou Zhang, Xin Liu, Yong Li, Zebang Cheng, Haolin Zuo, Ziyang Ma, Xiaojiang Peng, Xie Chen, Ya Li, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao:
MER 2025: When Affective Computing Meets Large Language Models. CoRR abs/2504.19423 (2025) - [i17]Jingwei Zhao, Yuhua Wen, Qifei Li, Minchi Hu, Yingying Zhou, Jingyao Xue, Junyang Wu, Yingming Gao, Zhengqi Wen, Jianhua Tao, Ya Li:
Deep Learning Approaches for Multimodal Intent Recognition: A Survey. CoRR abs/2507.22934 (2025) - 2024
- [j18]Mingyue Niu, Jianhua Tao, Yongwei Li, Yong Qin, Ya Li:
WavDepressionNet: Automatic Depression Level Prediction via Raw Speech Signals. IEEE Trans. Affect. Comput. 15(1): 285-296 (2024) - [j17]Yingming Gao, Peter Birkholz, Ya Li:
Articulatory Copy Synthesis Based on the Speech Synthesizer VocalTractLab and Convolutional Recurrent Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1845-1858 (2024) - [j16]Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li:
Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4700-4712 (2024) - [j15]Mingyue Niu, Ya Li, Jianhua Tao, Xiuzhuang Zhou, Björn W. Schuller:
DepressionMLP: A Multi-Layer Perceptron Architecture for Automatic Depression Level Prediction via Facial Keypoints and Action Units. IEEE Trans. Circuits Syst. Video Technol. 34(9): 8924-8938 (2024) - [c66]Yayue Deng, Jinlong Xue, Yukang Jia, Qifei Li, Yichen Han, Fengping Wang, Yingming Gao, Dengfeng Ke, Ya Li:
Concss: Contrastive-based Context Comprehension for Dialogue-Appropriate Prosody in Conversational Speech Synthesis. ICASSP 2024: 10706-10710 - [c65]Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, Jinlong Xue, Yichen Han, Ya Li:
Frame-Level Emotional State Alignment Method for Speech Emotion Recognition. ICASSP 2024: 11486-11490 - [c64]Yingming Gao, Hai Shuang, Xiaoli Feng, Jingwen Cheng, Linkai Peng, Ya Li, Jinsong Zhang, Min Liu:
A Preliminary Study on Automatic Pronunciation Error Detection for Hearing-impaired Children. ICCIP 2024: 632-637 - [c63]Bingsong Bai, Fengping Wang, Yingming Gao, Ya Li:
SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion. INTERSPEECH 2024 - [c62]Qifei Li, Yingming Gao, Yuhua Wen, Cong Wang, Ya Li:
Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition. INTERSPEECH 2024 - [c61]Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li:
Retrieval Augmented Generation in Prompt-based Text-to-Speech Synthesis with Context-Aware Contrastive Language-Audio Pretraining. INTERSPEECH 2024 - [c60]Jinlong Xue, Yayue Deng, Yicheng Han, Yingming Gao, Ya Li:
Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model. INTERSPEECH 2024 - [c59]Huijun Lian, Keqi Chen, Zekai Sun, Yingming Gao, Ya Li:
G2DiaR: Enhancing Commonsense Reasoning of LLMs with Graph-to-Dialogue & Reasoning. ISCSLP 2024: 214-218 - [c58]Fengping Wang, Bingsong Bai, Yayue Deng, Jinlong Xue, Yingming Gao, Ya Li:
ExpressiveSinger: Synthesizing Expressive Singing Voice as an Instrument. ISCSLP 2024: 304-308 - [c57]Ruibo Fu, Rui Liu, Chunyu Qiang, Yingming Gao, Yi Lu, Shuchen Shi, Tao Wang, Ya Li, Zhengqi Wen, Chen Zhang, Hui Bu, Yukun Liu, Xin Qi, Guanjun Li:
ICAGC 2024: Inspirational and Convincing Audio Generation Challenge 2024. ISCSLP 2024: 626-630 - [e1]Yanmin Qian, Qin Jin, Zhijian Ou, Zhenhua Ling, Zhiyong Wu, Ya Li, Lei Xie, Jianhua Tao:
14th IEEE International Symposium on Chinese Spoken Language Processing, ISCSLP 2024, Beijing, China, November 7-10, 2024. IEEE 2024, ISBN 979-8-3315-1682-6 [contents] - [i16]Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li:
Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation. CoRR abs/2401.01044 (2024) - [i15]Jinlong Xue, Yayue Deng, Yichen Han, Yingming Gao, Ya Li:
Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model. CoRR abs/2406.03706 (2024) - [i14]Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li:
Retrieval Augmented Generation in Prompt-based Text-to-Speech Synthesis with Context-Aware Contrastive Language-Audio Pretraining. CoRR abs/2406.03714 (2024) - [i13]Bingsong Bai, Fengping Wang, Yingming Gao, Ya Li:
SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion. CoRR abs/2406.05692 (2024) - [i12]Ruibo Fu, Rui Liu, Chunyu Qiang, Yingming Gao, Yi Lu, Shuchen Shi, Tao Wang, Ya Li, Zhengqi Wen, Chen Zhang, Hui Bu, Yukun Liu, Xin Qi, Guanjun Li:
ICAGC 2024: Inspirational and Convincing Audio Generation Challenge 2024. CoRR abs/2407.12038 (2024) - [i11]Qifei Li, Yingming Gao, Yuhua Wen, Cong Wang, Ya Li:
Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition. CoRR abs/2408.09438 (2024) - [i10]Zheng Lian, Haiyang Sun, Licai Sun, Lan Chen, Haoyu Chen, Hao Gu, Zhuofan Wen, Shun Chen, Siyuan Zhang, Hailiang Yao, Mingyu Xu, Kang Chen, Bin Liu, Rui Liu, Shan Liang, Ya Li, Jiangyan Yi, Jianhua Tao:
Open-vocabulary Multimodal Emotion Recognition: Dataset, Metric, and Benchmark. CoRR abs/2410.01495 (2024) - [i9]Hongming Guo, Ruibo Fu, Yizhong Geng, Shuai Liu, Shuchen Shi, Tao Wang, Chunyu Qiang, Chenxing Li, Ya Li, Zhengqi Wen, Yukun Liu, Xuefei Liu:
Mel-Refine: A Plug-and-Play Approach to Refine Mel-Spectrogram in Audio Generation. CoRR abs/2412.08577 (2024) - 2023
- [j14]Mingyue Niu, Ziping Zhao, Jianhua Tao, Ya Li, Björn W. Schuller:
Dual Attention and Element Recalibration Networks for Automatic Depression Level Prediction. IEEE Trans. Affect. Comput. 14(3): 1954-1965 (2023) - [j13]Weixin Li, Tiantian Cao, Chang Liu, Xue Tian, Ya Li, Xiaojie Wang, Xuan Dong:
Dual-Lens HDR using Guided 3D Exposure CNN and Guided Denoising Transformer. ACM Trans. Multim. Comput. Commun. Appl. 19(5): 158:1-158:20 (2023) - [c56]Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang:
M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis. ICASSP 2023: 1-5 - [c55]Cong Wang, Yingming Gao, Ya Li, Man Zhang:
GaitParse: Gait Parsing Algorithm with Self-Supervised Fine-Tuning for Gait Recognition. ICCIP 2023: 85-92 - [c54]Dong Wang, Qifei Li, Yingming Gao, Yong Liu, Ya Li:
Exploring the interpretability in speech-based adolescent depression detection by SHAP. ICCIP 2023: 562-567 - [c53]Qifei Li, Dong Wang, Yiming Ren, Yingming Gao, Ya Li:
FTA-net: A Frequency and Time Attention Network for Speech Depression Detection. INTERSPEECH 2023: 1723-1727 - [c52]Yayue Deng, Jinlong Xue, Fengping Wang, Yingming Gao, Ya Li:
CMCU-CSS: Enhancing Naturalness via Commonsense-based Multi-modal Context Understanding in Conversational Speech Synthesis. ACM Multimedia 2023: 6081-6089 - [c51]Qifei Li, Yingming Gao, Ya Li:
Mining High-quality Samples from Raw Data and Majority Voting Method for Multimodal Emotion Recognition. ACM Multimedia 2023: 9546-9550 - [i8]Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang:
M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis. CoRR abs/2305.02269 (2023) - [i7]Yayue Deng, Jinlong Xue, Yukang Jia, Qifei Li, Yichen Han, Fengping Wang, Yingming Gao, Dengfeng Ke, Ya Li:
CONCSS: Contrastive-based Context Comprehension for Dialogue-appropriate Prosody in Conversational Speech Synthesis. CoRR abs/2312.10358 (2023) - [i6]Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, Jinlong Xue, Yichen Han, Ya Li:
Frame-level emotional state alignment method for speech emotion recognition. CoRR abs/2312.16383 (2023) - 2022
- [j12]Mingyue Niu, Lang He, Ya Li, Bin Liu:
Depressioner: Facial dynamic representation for automatic depression level prediction. Expert Syst. Appl. 204: 117512 (2022) - [j11]Mingyue Niu, Ziping Zhao, Jianhua Tao, Ya Li, Björn W. Schuller:
Selective Element and Two Orders Vectorization Networks for Automatic Depression Severity Diagnosis via Facial Changes. IEEE Trans. Circuits Syst. Video Technol. 32(11): 8065-8077 (2022) - [c50]Ya Li, Mingyue Niu, Ziping Zhao, Jianhua Tao:
Automatic Depression Level Assessment from Speech By Long-Term Global Information Embedding. ICASSP 2022: 8507-8511 - [c49]Ziping Zhao, Zhen Gong, Mingyue Niu, Jiali Ma, Haishuai Wang, Zixing Zhang, Ya Li:
Automatic Respiratory Sound Classification Via Multi-Branch Temporal Convolutional Network. ICASSP 2022: 9102-9106 - [c48]Dengfeng Ke, Yayue Deng, Yukang Jia, Jinlong Xue, Qi Luo, Ya Li, Jianqing Sun, Jiaen Liang, Binghuai Lin:
Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis. ISCSLP 2022: 220-224 - [c47]Jinlong Xue, Yayue Deng, Yichen Han, Ya Li, Jianqing Sun, Jiaen Liang:
ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis. ISCSLP 2022: 230-234 - [c46]Yichen Han, Ya Li, Yingming Gao, Jinlong Xue, Songpo Wang, Lei Yang:
A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis. MMSP 2022: 1-6 - [i5]Yichen Han, Ya Li, Yingming Gao, Jinlong Xue, Songpo Wang, Lei Yang:
A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis. CoRR abs/2210.03335 (2022) - 2021
- [j10]Jianhua Tao, Jian Huang, Ya Li, Zheng Lian, Mingyue Niu:
Correction to: Semi-supervised Ladder Networks for Speech Emotion Recognition. Int. J. Autom. Comput. 18(4): 680 (2021) - 2020
- [j9]Zheng Lian, Ya Li, Jianhua Tao, Jian Huang, Mingyue Niu:
Expression Analysis Based on Face Regions in Real-world Conditions. Int. J. Autom. Comput. 17(1): 96-107 (2020)
2010 – 2019
- 2019
- [j8]Jianhua Tao, Jian Huang, Ya Li, Zheng Lian, Mingyue Niu:
Semi-supervised Ladder Networks for Speech Emotion Recognition. Int. J. Autom. Comput. 16(4): 437-448 (2019) - [c45]Mingyue Niu, Jianhua Tao, Ya Li, Jian Huang, Zheng Lian:
Discriminative Video Representation with Temporal Order for Micro-expression Recognition. ICASSP 2019: 2112-2116 - [i4]Zheng Lian, Ya Li, Jianhua Tao, Jian Huang:
Speech Emotion Recognition via Contrastive Loss under Siamese Networks. CoRR abs/1910.11174 (2019) - [i3]Zheng Lian, Ya Li, Jianhua Tao, Jian Huang, Mingyue Niu:
Expression Analysis Based on Face Regions in Read-world Conditions. CoRR abs/1911.05188 (2019) - 2018
- [j7]Yibin Zheng, Ya Li, Zhengqi Wen, Bin Liu, Jianhua Tao:
Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin. J. Signal Process. Syst. 90(7): 1039-1052 (2018) - [c44]Jian Huang, Ya Li, Jianhua Tao, Zheng Lian, Jiangyan Yi:
End-to-End Continuous Emotion Recognition from Video Using 3D Convlstm Networks. ICASSP 2018: 6837-6841 - [c43]Yibin Zheng, Jianhua Tao, Zhengqi Wen, Ya Li:
BLSTM-CRF Based End-to-End Prosodic Boundary Prediction with Context Sensitive Embeddings in a Text-to-Speech Front-End. INTERSPEECH 2018: 47-51 - [c42]Jian Huang, Ya Li, Jianhua Tao, Zhen Lian:
Speech Emotion Recognition from Variable-Length Inputs with Triplet Loss Function. INTERSPEECH 2018: 3673-3677 - [c41]Jian Huang, Ya Li, Jianhua Tao, Zheng Lian, Mingyue Niu, Minghao Yang:
Multimodal Continuous Emotion Recognition with Data Augmentation Using Recurrent Neural Networks. AVEC@MM 2018: 57-64 - [c40]Jian Huang, Ya Li, Jianhua Tao, Zheng Lian, Mingyue Niu, Minghao Yang:
Deep Learning for Continuous Multiple Time Series Annotations. AVEC@MM 2018: 91-98 - [i2]Zheng Lian, Ya Li, Jianhua Tao, Jian Huang:
Investigation of Multimodal Features, Classifiers and Fusion Methods for Emotion Recognition. CoRR abs/1809.06225 (2018) - 2017
- [j6]Ya Li, Jianhua Tao, Linlin Chao, Wei Bao, Yazhu Liu:
CHEAVD: a Chinese natural emotional audio-visual database. J. Ambient Intell. Humaniz. Comput. 8(6): 913-924 (2017) - [j5]Ya Li, Jianhua Tao, Wei Lai, Xiaoying Xu:
Quantitative intonation modeling of interrogative sentences for Mandarin speech synthesis. Speech Commun. 89: 92-102 (2017) - [c39]Jianhua Tao, Ruibo Fu, Yibin Zheng, Zhengqi Wen, Ya Li, Biu Liu:
The NLPR Speech Synthesis entry for Blizzard Challenge 2017. Blizzard Challenge 2017 - [c38]Yibin Zheng, Jianhua Tao, Zhengqi Wen, Ya Li, Bin Liu:
Investigating Efficient Feature Representation Methods and Training Objective for BLSTM-Based Phone Duration Prediction. INTERSPEECH 2017: 784-788 - [c37]Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Ya Li:
Distilling Knowledge from an Ensemble of Models for Punctuation Prediction. INTERSPEECH 2017: 2779-2783 - [c36]Jian Huang, Ya Li, Jianhua Tao, Zheng Lian, Zhengqi Wen, Minghao Yang, Jiangyan Yi:
Continuous Multimodal Emotion Prediction Based on Long Short Term Memory Recurrent Neural Network. AVEC@ACM Multimedia 2017: 11-18 - 2016
- [j4]Hao Che, Ya Li, Jianhua Tao, Zhengqi Wen:
Investigating Effect of Rich Syntactic Features on Mandarin Prosodic Boundaries Prediction. J. Signal Process. Syst. 82(2): 263-271 (2016) - [c35]Jianhua Tao, Yibin Zheng, Zhengqi Wen, Ya Li, Biu Liu:
BLSTM Guided Unit Selection Synthesis System for Blizzard Challenge 2016. Blizzard Challenge 2016 - [c34]Ya Li, Jianhua Tao, Björn W. Schuller, Shiguang Shan, Dongmei Jiang, Jia Jia:
MEC 2016: The Multimodal Emotion Recognition Challenge of CCPR 2016. CCPR (2) 2016: 667-678 - [c33]Linlin Chao, Jianhua Tao, Minghao Yang, Ya Li, Zhengqi Wen:
Long short term memory recurrent neural network based encoding method for emotion recognition in video. ICASSP 2016: 2752-2756 - [c32]Zhengqi Wen, Ya Li, Jianhua Tao:
The Parameterized Phoneme Identity Feature as a Continuous Real-Valued Vector for Neural Network Based Speech Synthesis. INTERSPEECH 2016: 2248-2252 - [c31]Yibin Zheng, Ya Li, Zhengqi Wen, Xingguang Ding, Jianhua Tao:
Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approach. INTERSPEECH 2016: 3201-3205 - [c30]Ye Bai, Jiangyan Yi, Hao Ni, Zhengqi Wen, Bin Liu, Ya Li, Jianhua Tao:
End-to-end keywords spotting based on connectionist temporal classification for Mandarin. ISCSLP 2016: 1-5 - [c29]Yibin Zheng, Ya Li, Zhengqi Wen, Bin Liu, Jianhua Tao:
Text-based sentential stress prediction using continuous lexical embedding for Mandarin speech synthesis. ISCSLP 2016: 1-5 - [c28]Yibin Zheng, Ya Li, Zhengqi Wen, Bin Liu, Jianhua Tao:
Investigating deep neural network adaptation for generating exclamatory and interrogative speech in Mandarin. ISCSLP 2016: 1-5 - [i1]Linlin Chao, Jianhua Tao, Minghao Yang, Ya Li, Zhengqi Wen:
Audio Visual Emotion Recognition with Temporal Alignment and Perception Attention. CoRR abs/1603.08321 (2016) - 2015
- [j3]Ya Li, Jianhua Tao, Keikichi Hirose, Xiaoying Xu, Wei Lai:
Hierarchical stress modeling and generation in mandarin for expressive Text-to-Speech. Speech Commun. 72: 59-73 (2015) - [c27]Ya Li, Linlin Chao, Yazhu Liu, Wei Bao, Jianhua Tao:
From simulated speech to natural speech, what are the robust features for emotion recognition? ACII 2015: 368-373 - [c26]Linlin Chao, Jianhua Tao, Minghao Yang, Ya Li:
Multi task sequence learning for depression scale prediction from video. ACII 2015: 526-531 - [c25]Ya Li, Nick Campbell, Jianhua Tao:
Voice quality: Not only about "you" but also about "your interlocutor". ICASSP 2015: 4739-4743 - [c24]Bin Liu, Jianhua Tao, Zhengqi Wen, Ya Li, Danish Bukhari:
A novel method of artificial bandwidth extension using deep architecture. INTERSPEECH 2015: 2598-2602 - [c23]Linlin Chao, Jianhua Tao, Minghao Yang, Ya Li, Zhengqi Wen:
Long Short Term Memory Recurrent Neural Network based Multimodal Dimensional Emotion Recognition. AVEC@ACM Multimedia 2015: 65-72 - 2014
- [c22]Ran Zhang, Jianhua Tao, Ya Li, Zhengqi Wen:
A novel hybrid mandarin speech synthesis system using different base units for model training and concatenation. ICASSP 2014: 295-299 - [c21]Hao Che, Jianhua Tao, Ya Li:
Improving Mandarin prosodic boundary prediction with rich syntactic features. INTERSPEECH 2014: 46-50 - [c20]Ran Zhang, Zhengqi Wen, Jianhua Tao, Ya Li, Bing Liu, Xiaoyan Lou:
A hierarchical viterbi algorithm for Mandarin hybrid speech synthesis system. INTERSPEECH 2014: 795-799 - [c19]Linlin Chao, Jianhua Tao, Minghao Yang, Ya Li:
Improving generation performance of speech emotion recognition by denoising autoencoders. ISCSLP 2014: 341-344 - [c18]Xin Xu, Ya Li, Xiaoying Xu, Zhengqi Wen, Hao Che, Shanfeng Liu, Jianhua Tao:
Survey on discriminative feature selection for speech emotion recognition. ISCSLP 2014: 345-349 - [c17]Xiaoying Xu, Huimin Wang, Ya Li, Wei Lai, Jianhua Tao:
The expression of emotions by text and speech. ISCSLP 2014: 353 - [c16]Wei Bao, Ya Li, Mingliang Gu, Jianhua Tao, Linlin Chao, Shanfeng Liu:
Combining prosodic and spectral features for Mandarin intonation recognition. ISCSLP 2014: 497-500 - [c15]Hao Che, Zhengqi Wen, Ya Li, Jianhua Tao:
Investigating effect of rich syntactic features on Mandarin prosodic phrase boundaries prediction. ISCSLP 2014: 501-505 - [c14]Shanfeng Liu, Zhengqi Wen, Ya Li, Jianhua Tao, Bin Liu:
Context features based pre-selection and weight prediction in concatenation speech synthesis system. ISCSLP 2014: 506-510 - [c13]Bin Liu, Jianhua Tao, Fuyuan Mo, Ya Li, Zhengqi Wen, Shanfeng Liu:
Efficient voice activity detection algorithm based on sub-band temporal envelope and sub-band long-term signal variability. ISCSLP 2014: 531-535 - [c12]Linlin Chao, Jianhua Tao, Minghao Yang, Ya Li, Zhengqi Wen:
Multi-scale Temporal Modeling for Dimensional Emotion Recognition in Video. AVEC@MM 2014: 11-18 - [c11]Wei Lai, Xiaoying Xu, Ya Li, Hao Che, Shanfeng Liu, Jianhua Tao:
Phonological influences on the realization of final lowering evidence from dialogue Chinese Mandarin. O-COCOSDA 2014: 1-6 - 2013
- [c10]Linlin Chao, Jianhua Tao, Minghao Yang, Ya Li:
Bayesian Inference Based Temporal Modeling for Naturalistic Affective Expression Classification. ACII 2013: 173-178 - [c9]Yang Wang, Jianhua Tao, Minghao Yang, Ya Li:
Extended Decision Tree with or Relationship for HMM-Based Speech Synthesis. ACPR 2013: 225-229 - [c8]Xiaoying Xu, Jianhua Tao, Ya Li:
On Constructing a Chinese Task-Oriental Subjectivity Lexicon. CLSW 2013: 546-554 - [c7]Ran Zhang, Jianhua Tao, Ya Li, Zhengqi Wen:
A novel unit selection method for concatenation speech system using similarity measure. O-COCOSDA/CASLRE 2013: 1-5 - 2012
- [j2]Minghao Yang, Jianhua Tao, Kaihui Mu, Ya Li, Jianfeng Che:
A multimodal approach of generating 3D human-like talking agent. J. Multimodal User Interfaces 5(1-2): 61-68 (2012) - 2011
- [j1]Jianhua Tao, Shifeng Pan, Minghao Yang, Ya Li, Kaihui Mu, Jianfeng Che:
Utterance independent bimodal emotion recognition in spontaneous communication. EURASIP J. Adv. Signal Process. 2011: 4 (2011) - [c6]Shifeng Pan, Jianhua Tao, Ya Li:
The CASIA Audio Emotion Recognition Method for Audio/Visual Emotion Challenge 2011. ACII (2) 2011: 388-395 - [c5]Xiaoying Xu, Ya Li, Jianhua Tao, Yingchao Lu:
The Stability Analysis of Disyllabic Stress in Mandarin Speech. ICPhS 2011: 2181-2184 - [c4]Ya Li, Jianhua Tao, Xiaoying Xu:
Hierarchical Stress Modeling in Mandarin Text-to-Speech. INTERSPEECH 2011: 2013-2016 - 2010
- [c3]Jianhua Tao, Shifeng Pan, Ya Li, Zhengqi Wen, Yang Wang:
The WISTON Text to Speech System for Blizzard Challenge 2010. Blizzard Challenge 2010 - [c2]Ya Li, Jianhua Tao, Meng Zhang, Shifeng Pan, Xiaoying Xu:
Text-based unstressed syllable prediction in Mandarin. INTERSPEECH 2010: 1752-1755
2000 – 2009
- 2009
- [c1]Jianhua Tao, Ya Li, Shifeng Pan, Meng Zhang, Hongjun Sun, Zhengqi Wen:
The WISTON Text-to-Speech System for Blizzard Challenge 2009. Blizzard Challenge 2009
last updated on 2025-09-17 01:09 CEST by the dblp team
all metadata released as open data under CC0 1.0 license