default search action

combined dblp search
author search
venue search
publication search

ask others

Xiaoda Yang

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/MaXZSJYZFH25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/MaXZSJYZFH25
Yuhang Ma, Wenting Xu, Chaoyi Zhao, Keqiang Sun, Qinfeng Jin, Xiaoda Yang, Zeng Zhao, Changjie Fan, Zhipeng Hu:
Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection. AAAI 2025: 6027-6035
[c7]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/0003BCZJJ0YYZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/0003BCZJJ0YYZ25
Wenrui Liu, Jionghao Bai, Xize Cheng, Jialong Zuo, Ziyue Jiang, Shengpeng Ji, Minghui Fang, Xiaoda Yang, Qian Yang, Zhou Zhao:
VoxpopuliTTS: a large-scale multilingual TTS corpus for zero-shot speech generation. COLING 2025: 10293-10297
[c6]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ChengHYLF0J0Z0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChengHYLF0J0Z0025
Xize Cheng, Ruofan Hu, Xiaoda Yang, Jingyu Lu, Dongjie Fu, Zehan Wang, Shengpeng Ji, Rongjie Huang, Boyang Zhang, Tao Jin, Zhou Zhao:
VoxDialogue: Can Spoken Dialogue Systems Understand Information Beyond Words? ICLR 2025
[c5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/YanLG0FY0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YanLG0FY0025
Weicai Yan, Wang Lin, Zirun Guo, Ye Wang, Fangming Feng, Xiaoda Yang, Zehan Wang, Tao Jin:
Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision. ICLR 2025
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/www/Hong00ZWCYDDZ025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/Hong00ZWCYDDZ025
Minjie Hong, Yan Xia, Zehan Wang, Jieming Zhu, Ye Wang, Sihang Cai, Xiaoda Yang, Quanyu Dai, Zhenhua Dong, Zhimeng Zhang, Zhou Zhao:
EAGER-LLM: Enhancing Large Language Models as Recommenders through Exogenous Behavior-Semantic Integration. WWW 2025: 2754-2762
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-01384
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-01384
Xize Cheng, Dongjie Fu, Xiaoda Yang, Minghui Fang, Ruofan Hu, Jingyu Lu, Jionghao Bai, Zehan Wang, Shengpeng Ji, Rongjie Huang, Linjun Li, Yu Chen, Tao Jin, Zhou Zhao:
OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios. CoRR abs/2501.01384 (2025)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-14735
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-14735
Minjie Hong, Yan Xia, Zehan Wang, Jieming Zhu, Ye Wang, Sihang Cai, Xiaoda Yang, Quanyu Dai, Zhenhua Dong, Zhimeng Zhang, Zhou Zhao:
EAGER-LLM: Enhancing Large Language Models as Recommenders through Exogenous Behavior-Semantic Integration. CoRR abs/2502.14735 (2025)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-18924
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-18924
Ziyue Jiang, Yi Ren, Ruiqi Li, Shengpeng Ji, Zhenhui Ye, Chen Zhang, Jionghao Bai, Xiaoda Yang, Jialong Zuo, Yu Zhang, Rui Liu, Xiang Yin, Zhou Zhao:
Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis. CoRR abs/2502.18924 (2025)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-09445
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-09445
Xiaoda Yang, Junyu Lu, Hongshun Qiu, Sijing Li, Hao Li, Shengpeng Ji, Xudong Tang, Jiayang Xu, Jiaqi Duan, Ziyue Jiang, Cong Lin, Sihang Cai, Zejian Xie, Zhuoyang Song, Songxin Zhang:
Astrea: A MOE-based Visual Understanding Model with Progressive Alignment. CoRR abs/2503.09445 (2025)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-02312
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-02312
Xiaoda Yang, Jiayang Xu, Kaixuan Luan, Xinyu Zhan, Hongshun Qiu, Shijun Shi, Hao Li, Shuai Yang, Li Zhang, Checheng Yu, Cewu Lu, Lixin Yang:
OmniCam: Unified Multimodal Video Generation via Camera Control. CoRR abs/2504.02312 (2025)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-13650
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-13650
Sijing Li, Tianwei Lin, Lingshuai Lin, Wenqiao Zhang, Jiang Liu, Xiaoda Yang, Juncheng Li, Yucheng He, Xiaohui Song, Jun Xiao, Yueting Zhuang, Beng Chin Ooi:
EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model. CoRR abs/2504.13650 (2025)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-21423
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-21423
Weicai Yan, Wang Lin, Zirun Guo, Ye Wang, Fangming Feng, Xiaoda Yang, Zehan Wang, Tao Jin:
Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision. CoRR abs/2504.21423 (2025)
2024
[c3]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/YangCDQH0JZHZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/YangCDQH0JZHZ024
Xiaoda Yang, Xize Cheng, Jiaqi Duan, Hongshun Qiu, Minjie Hong, Minghui Fang, Shengpeng Ji, Jialong Zuo, Zhiqing Hong, Zhimeng Zhang, Tao Jin:
AudioVSR: Enhancing Video Speech Recognition with Audio Data. EMNLP 2024: 15352-15361
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FuCYWZJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FuCYWZJ24
Dongjie Fu, Xize Cheng, Xiaoda Yang, Hanting Wang, Zhou Zhao, Tao Jin:
Boosting Speech Recognition Robustness to Modality-Distortion with Contrast-Augmented Prompts. ACM Multimedia 2024: 3838-3847
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangCF0ZJZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangCF0ZJZ024
Xiaoda Yang, Xize Cheng, Dongjie Fu, Minghui Fang, Jialong Zuo, Shengpeng Ji, Zhou Zhao, Tao Jin:
SyncTalklip: Highly Synchronized Lip-Readable Speaker Generation with Multi-Task Learning. ACM Multimedia 2024: 8149-8158
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-17507
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-17507
Minghui Fang, Shengpeng Ji, Jialong Zuo, Hai Huang, Yan Xia, Jieming Zhu, Xize Cheng, Xiaoda Yang, Wenrui Liu, Gang Wang, Zhenhua Dong, Zhou Zhao:
ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling. CoRR abs/2406.17507 (2024)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-16532
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-16532
Shengpeng Ji, Ziyue Jiang, Xize Cheng, Yifu Chen, Minghui Fang, Jialong Zuo, Qian Yang, Ruiqi Li, Ziang Zhang, Xiaoda Yang, Rongjie Huang, Yidi Jiang, Qian Chen, Siqi Zheng, Wen Wang, Zhou Zhao:
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling. CoRR abs/2408.16532 (2024)
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-13577
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-13577
Shengpeng Ji, Yifu Chen, Minghui Fang, Jialong Zuo, Jingyu Lu, Hanting Wang, Ziyue Jiang, Long Zhou, Shujie Liu, Xize Cheng, Xiaoda Yang, Zehan Wang, Qian Yang, Jian Li, Yidi Jiang, Jingzhen He, Yunfei Chu, Jin Xu, Zhou Zhao:
WavChat: A Survey of Spoken Dialogue Models. CoRR abs/2411.13577 (2024)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.