


default search action
Jun-Kun Chen
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[i35]Peidong Wang, Naoyuki Kanda, Jian Xue, Jinyu Li, Xiaofei Wang, Aswin Shanmugam Subramanian, Jun-Kun Chen, Sunit Sivasankaran, Xiong Xiao, Yong Zhao:
Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation. CoRR abs/2502.02683 (2025)
[i34]Abdelrahman Abouelenin, Atabak Ashfaq, Adam Atkinson, Hany Awadalla, Nguyen Bach, Jianmin Bao, Alon Benhaim, Martin Cai, Vishrav Chaudhary, Congcong Chen, Dong Chen, Dongdong Chen, Jun-Kun Chen, Weizhu Chen, Yen-Chun Chen, Yi-ling Chen, Qi Dai, Xiyang Dai, Ruchao Fan, Mei Gao, Min Gao, Amit Garg, Abhishek Goswami, Junheng Hao, Amr Hendy, Yuxuan Hu, Xin Jin, Mahmoud Khademi, Dongwoo Kim, Young Jin Kim, Gina Lee, Jinyu Li, Yunsheng Li, Chen Liang, Xihui Lin, Zeqi Lin, Mengchen Liu, Yang Liu, Gilsinia Lopez, Chong Luo, Piyush Madan, Vadim Mazalov, Arindam Mitra, Ali Mousavi, Anh Nguyen, Jing Pan, Daniel Perez-Becker, Jacob Platin, Thomas Portet, Kai Qiu, Bo Ren, Liliang Ren, Sambuddha Roy, Ning Shang, Yelong Shen, Saksham Singhal, Subhojit Som, Xia Song, Tetyana Sych, Praneetha Vaddamanu, Shuohang Wang, Yiming Wang, Zhenghao Wang, Haibin Wu, Haoran Xu, Weijian Xu, Yifan Yang, Ziyi Yang, Donghan Yu, Ishmam Zabir, Jianwen Zhang, Li Lyna Zhang, Yunan Zhang, Xiren Zhou:
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs. CoRR abs/2503.01743 (2025)
[i33]Yanming Zhang, Jun-Kun Chen, Jipeng Lyu, Yu-Xiong Wang:
V2Edit: Versatile Video Diffusion Editor for Videos and 3D Scenes. CoRR abs/2503.10634 (2025)
[i32]Jun-Kun Chen, Aayush Bansal, Minh Phuoc Vo, Yu-Xiong Wang:
Dress&Dance: Dress up and Dance as You Like It - Technical Preview. CoRR abs/2508.21070 (2025)
[i31]Jun-Kun Chen, Aayush Bansal, Minh Phuoc Vo, Yu-Xiong Wang:
Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image - Technical Preview. CoRR abs/2509.04450 (2025)- 2024
[c22]Linzhan Mou, Jun-Kun Chen, Yu-Xiong Wang:
Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion. CVPR 2024: 20176-20185
[c21]Jun-Kun Chen, Samuel Rota Bulò, Norman Müller, Lorenzo Porzi, Peter Kontschieder, Yu-Xiong Wang:
ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing. CVPR 2024: 21071-21080
[c20]Sara Papi
, Peidong Wang, Jun-Kun Chen, Jian Xue, Naoyuki Kanda, Jinyu Li, Yashesh Gaur:
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation. ICASSP 2024: 10381-10385
[c19]Mu Yang, Naoyuki Kanda, Xiaofei Wang, Jun-Kun Chen, Peidong Wang, Jian Xue, Jinyu Li, Takuya Yoshioka:
Diarist: Streaming Speech Translation with Speaker Diarization. ICASSP 2024: 10866-10870
[c18]Peidong Wang, Jian Xue, Jinyu Li, Jun-Kun Chen, Aswin Shanmugam Subramanian:
Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation. INTERSPEECH 2024
[c17]Jun-Kun Chen, Yu-Xiong Wang:
ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing. NeurIPS 2024
[c16]Xiuyu Yang, Yunze Man, Jun-Kun Chen, Yu-Xiong Wang:
SceneCraft: Layout-Guided 3D Scene Generation. NeurIPS 2024
[c15]Jiaqi Li, Dongmei Wang, Xiaofei Wang, Yao Qian, Long Zhou, Shujie Liu, Midia Yousefi, Canrun Li, Chung-Hsien Tsai, Zhen Xiao, Yanqing Liu, Jun-Kun Chen, Sheng Zhao, Jinyu Li, Zhizheng Wu, Michael Zeng:
Investigating Neural Audio Codecs For Speech Language Model-Based Speech Generation. SLT 2024: 554-561
[i30]Linzhan Mou, Jun-Kun Chen, Yu-Xiong Wang:
Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion. CoRR abs/2406.09402 (2024)
[i29]Jun-Kun Chen, Samuel Rota Bulò, Norman Müller, Lorenzo Porzi, Peter Kontschieder, Yu-Xiong Wang:
ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing. CoRR abs/2406.09404 (2024)
[i28]Peidong Wang, Jian Xue, Jinyu Li, Jun-Kun Chen, Aswin Shanmugam Subramanian:
Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation. CoRR abs/2406.10276 (2024)
[i27]Jiaqi Li, Dongmei Wang, Xiaofei Wang, Yao Qian, Long Zhou, Shujie Liu, Midia Yousefi, Canrun Li, Chung-Hsien Tsai, Zhen Xiao, Yanqing Liu, Jun-Kun Chen, Sheng Zhao, Jinyu Li, Zhizheng Wu, Michael Zeng:
Investigating Neural Audio Codecs for Speech Language Model-Based Speech Generation. CoRR abs/2409.04016 (2024)
[i26]Jun-Kun Chen, Jilin Mei, Liang Chen, Fangzhou Zhao, Yu Hu:
Proto-OOD: Enhancing OOD Object Detection with Prototype Feature Similarity. CoRR abs/2409.05466 (2024)
[i25]Xiuyu Yang, Yunze Man, Jun-Kun Chen, Yu-Xiong Wang:
SceneCraft: Layout-Guided 3D Scene Generation. CoRR abs/2410.09049 (2024)
[i24]Jun-Kun Chen, Yu-Xiong Wang:
ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing. CoRR abs/2411.05006 (2024)
[i23]Midia Yousefi, Yao Qian, Jun-Kun Chen, Gang Wang, Yanqing Liu, Dongmei Wang, Xiaofei Wang, Jian Xue:
Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages. CoRR abs/2411.07387 (2024)- 2023
[c14]Jun-Kun Chen, Jian Xue, Peidong Wang, Jing Pan, Jinyu Li:
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach. ASRU 2023: 1-7
[c13]Sara Papi, Peidong Wang, Jun-Kun Chen, Jian Xue, Jinyu Li, Yashesh Gaur:
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments. ASRU 2023: 1-8
[c12]Jun-Kun Chen, Jipeng Lyu, Yu-Xiong Wang:
NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds. CVPR 2023: 12439-12448
[c11]Yuanyi Zhong, Haoran Tang, Jun-Kun Chen, Yu-Xiong Wang:
Contrastive Learning Relies More on Spatial Inductive Bias Than Supervised Learning: An Empirical Study. ICCV 2023: 16281-16290
[i22]Jun-Kun Chen, Jipeng Lyu, Yu-Xiong Wang:
NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds. CoRR abs/2305.03049 (2023)
[i21]Sara Papi, Peidong Wang, Jun-Kun Chen, Jian Xue, Jinyu Li, Yashesh Gaur:
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments. CoRR abs/2307.03354 (2023)
[i20]Mu Yang, Naoyuki Kanda, Xiaofei Wang, Jun-Kun Chen, Peidong Wang, Jian Xue, Jinyu Li, Takuya Yoshioka:
DiariST: Streaming Speech Translation with Speaker Diarization. CoRR abs/2309.08007 (2023)
[i19]Jun-Kun Chen, Jian Xue, Peidong Wang, Jing Pan, Jinyu Li:
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach. CoRR abs/2310.04399 (2023)
[i18]Sara Papi, Peidong Wang, Jun-Kun Chen, Jian Xue, Naoyuki Kanda, Jinyu Li, Yashesh Gaur:
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation. CoRR abs/2310.14806 (2023)- 2022
[c10]Jun-Kun Chen, Yu-Xiong Wang:
PointTree: Transformation-Robust Point Cloud Encoder with Relaxed K-D Trees. ECCV (3) 2022: 105-120
[c9]He Bai
, Renjie Zheng, Jun-Kun Chen, Mingbo Ma, Xintong Li, Liang Huang:
A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing. ICML 2022: 1399-1411
[i17]Zhaocheng Zhu, Chence Shi, Zuobai Zhang, Shengchao Liu, Minghao Xu, Xinyu Yuan, Yangtian Zhang, Jun-Kun Chen, Huiyu Cai, Jiarui Lu, Chang Ma, Runcheng Liu, Louis-Pascal A. C. Xhonneux, Meng Qu, Jian Tang:
TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug Discovery. CoRR abs/2202.08320 (2022)
[i16]He Bai, Renjie Zheng, Jun-Kun Chen, Xintong Li, Mingbo Ma, Liang Huang:
A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing. CoRR abs/2203.09690 (2022)
[i15]Guangxu Xun, Mingbo Ma, Yuchen Bian, Xingyu Cai, Jiaji Huang, Renjie Zheng, Jun-Kun Chen, Jiahong Yuan, Kenneth Church, Liang Huang:
Data-Driven Adaptive Simultaneous Machine Translation. CoRR abs/2204.12672 (2022)
[i14]Hui Zhang, Tian Yuan, Jun-Kun Chen, Xintong Li, Renjie Zheng, Yuxin Huang, Xiaojie Chen, Enlei Gong, Zeyu Chen, Xiaoguang Hu, Dianhai Yu, Yanjun Ma, Liang Huang:
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit. CoRR abs/2205.12007 (2022)
[i13]Yuanyi Zhong, Haoran Tang, Jun-Kun Chen, Jian Peng, Yu-Xiong Wang:
Is Self-Supervised Learning More Robust Than Supervised Learning? CoRR abs/2206.05259 (2022)
[i12]Jun-Kun Chen, Yu-Xiong Wang:
PointTree: Transformation-Robust Point Cloud Encoder with Relaxed K-D Trees. CoRR abs/2208.05962 (2022)
[i11]Xiaoran Fan, Chao Pang, Tian Yuan, He Bai, Renjie Zheng, Pengfei Zhu, Shuohuan Wang, Jun-Kun Chen, Zeyu Chen, Liang Huang, Yu Sun, Hua Wu:
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech. CoRR abs/2211.03545 (2022)- 2021
[c8]Jun-Kun Chen, Mingbo Ma, Renjie Zheng, Liang Huang:
Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR. ACL/IJCNLP (Findings) 2021: 4618-4624
[c7]Jun-Kun Chen, Renjie Zheng, Atsuhito Kita, Mingbo Ma, Liang Huang:
Improving Simultaneous Translation by Incorporating Pseudo-References with Fewer Reorderings. EMNLP (1) 2021: 5857-5864
[c6]Meng Qu, Jun-Kun Chen, Louis-Pascal A. C. Xhonneux, Yoshua Bengio, Jian Tang:
RNNLogic: Learning Logic Rules for Reasoning on Knowledge Graphs. ICLR 2021
[c5]Renjie Zheng, Jun-Kun Chen, Mingbo Ma, Liang Huang:
Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation. ICML 2021: 12736-12746
[c4]Jun-Kun Chen, Mingbo Ma, Renjie Zheng, Liang Huang:
SpecRec: An Alternative Solution for Improving End-to-End Speech-to-Text Translation via Spectrogram Reconstruction. Interspeech 2021: 2232-2236
[i10]Renjie Zheng, Jun-Kun Chen, Mingbo Ma, Liang Huang:
Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation. CoRR abs/2102.05766 (2021)
[i9]Jun-Kun Chen, Mingbo Ma, Renjie Zheng, Liang Huang:
Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR. CoRR abs/2106.06636 (2021)- 2020
[i8]Meng Qu, Jun-Kun Chen, Louis-Pascal A. C. Xhonneux, Yoshua Bengio, Jian Tang:
RNNLogic: Learning Logic Rules for Reasoning on Knowledge Graphs. CoRR abs/2010.04029 (2020)
[i7]Jun-Kun Chen, Renjie Zheng, Atsuhito Kita, Mingbo Ma, Liang Huang:
Improving Simultaneous Translation with Pseudo References. CoRR abs/2010.11247 (2020)
[i6]Jun-Kun Chen, Mingbo Ma, Renjie Zheng, Liang Huang:
MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation. CoRR abs/2010.11445 (2020)
2010 – 2019
- 2019
[c3]Xin Wang, Jiawei Wu
, Jun-Kun Chen, Lei Li
, Yuan-Fang Wang, William Yang Wang:
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research. ICCV 2019: 4580-4590
[i5]Xin Wang, Jiawei Wu, Jun-Kun Chen, Lei Li, Yuan-Fang Wang, William Yang Wang:
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research. CoRR abs/1904.03493 (2019)
[i4]Zehui Lin, Pengfei Liu, Luyao Huang, Jun-Kun Chen, Xipeng Qiu, Xuanjing Huang:
DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks. CoRR abs/1907.11065 (2019)- 2018
[c2]Jun-Kun Chen, Xipeng Qiu, Pengfei Liu, Xuanjing Huang:
Meta Multi-Task Learning for Sequence Modeling. AAAI 2018: 5070-5077
[c1]Renjie Zheng, Jun-Kun Chen, Xipeng Qiu
:
Same Representation, Different Attentions: Shareable Sentence Representation Learning from Multiple Tasks. IJCAI 2018: 4616-4622
[i3]Jun-Kun Chen, Xipeng Qiu, Pengfei Liu, Xuanjing Huang:
Meta Multi-Task Learning for Sequence Modeling. CoRR abs/1802.08969 (2018)
[i2]Renjie Zheng, Jun-Kun Chen, Xipeng Qiu:
Same Representation, Different Attentions: Shareable Sentence Representation Learning from Multiple Tasks. CoRR abs/1804.08139 (2018)
[i1]Jun-Kun Chen, Kaiyu Chen, Xinchi Chen, Xipeng Qiu, Xuanjing Huang:
Exploring Shared Structures and Hierarchies for Multiple NLP Tasks. CoRR abs/1808.07658 (2018)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-10-29 01:48 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







