


Furu Wei
2020 – today
- 2023
- [j24]Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li:
Generative retrieval for conversational question answering. Inf. Process. Manag. 60(5): 103475 (2023) - [j23]Jian Yang, Yuwei Yin, Liqun Yang, Shuming Ma, Haoyang Huang, Dongdong Zhang, Furu Wei, Zhoujun Li:
GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1489-1498 (2023) - [j22]Zhiliang Peng, Li Dong, Hangbo Bao, Furu Wei, Qixiang Ye:
A Unified View of Masked Image Modeling. Trans. Mach. Learn. Res. 2023 (2023) - [c210]Minghao Li, Tengchao Lv, Jingye Chen, Lei Cui, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Zhoujun Li, Furu Wei:
TrOCR: Transformer-Based Optical Character Recognition with Pre-trained Models. AAAI 2023: 13094-13102 - [c209]Yuan Xie, Shaohan Huang, Tianyu Chen, Furu Wei:
MoEC: Mixture of Expert Clusters. AAAI 2023: 13807-13815 - [c208]Beiduo Chen, Shaohan Huang, Zihan Zhang, Wu Guo, Zhenhua Ling, Haizhen Huang, Furu Wei, Weiwei Deng, Qi Zhang:
Pre-training Language Model as a Multi-perspective Course Learner. ACL (Findings) 2023: 114-128 - [c207]Liang Wang, Nan Yang, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang, Rangan Majumder, Furu Wei:
SimLM: Pre-training with Representation Bottleneck for Dense Passage Retrieval. ACL (1) 2023: 2244-2258 - [c206]Ziheng Li, Shaohan Huang, Zihan Zhang, Zhi-Hong Deng, Qiang Lou, Haizhen Huang, Jian Jiao, Furu Wei, Weiwei Deng, Qi Zhang:
Dual-Alignment Pre-training for Cross-lingual Sentence Embedding. ACL (1) 2023: 3466-3478 - [c205]Damai Dai, Yutao Sun, Li Dong, Yaru Hao, Shuming Ma, Zhifang Sui, Furu Wei:
Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers. ACL (Findings) 2023: 4005-4019 - [c204]Yuxian Gu, Li Dong, Furu Wei, Minlie Huang:
Pre-Training to Learn in Context. ACL (1) 2023: 4849-4870 - [c203]Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li:
Multiview Identifiers Enhanced Generative Retrieval. ACL (1) 2023: 6636-6648 - [c202]Jian Yang, Shuming Ma, Li Dong, Shaohan Huang, Haoyang Huang, Yuwei Yin, Dongdong Zhang, Liqun Yang, Furu Wei, Zhoujun Li:
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator. ACL (1) 2023: 9394-9412 - [c201]Liang Chen, Shuming Ma, Dongdong Zhang, Furu Wei, Baobao Chang:
On the Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation. ACL (Findings) 2023: 9542-9558 - [c200]Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang, Alon Benhaim, Vishrav Chaudhary, Xia Song, Furu Wei:
A Length-Extrapolatable Transformer. ACL (1) 2023: 14590-14604 - [c199]Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei, Vishrav Chaudhary, Xia Song:
Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning. ACL (1) 2023: 15354-15373 - [c198]Jinghao Zhou, Li Dong, Zhe Gan, Lijuan Wang, Furu Wei:
Non-Contrastive Learning Meets Language-Image Pre-Training. CVPR 2023: 11028-11038 - [c197]Wei Huang, Zhiliang Peng, Li Dong, Furu Wei, Jianbin Jiao, Qixiang Ye:
Generic-to-Specific Distillation of Masked Autoencoders. CVPR 2023: 15996-16005 - [c196]Wenhui Wang, Hangbo Bao, Li Dong, Johan Bjorck, Zhiliang Peng, Qiang Liu, Kriti Aggarwal, Owais Khan Mohammed, Saksham Singhal, Subhojit Som, Furu Wei:
Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks. CVPR 2023: 19175-19186 - [c195]Jian Yang, Yuwei Yin, Shuming Ma, Liqun Yang, Hongcheng Guo, Haoyang Huang, Dongdong Zhang, Yutao Zeng, Zhoujun Li, Furu Wei:
HanoiT: Enhancing Context-aware Translation via Selective Context. DASFAA (3) 2023: 471-486 - [c194]Yuxin Fang, Li Dong, Hangbo Bao, Xinggang Wang, Furu Wei:
Corrupted Image Modeling for Self-Supervised Visual Pre-Training. ICLR 2023 - [c193]Zhixiong Han, Yaru Hao, Li Dong, Yutao Sun, Furu Wei:
Prototypical Calibration for Few-shot Learning of Language Models. ICLR 2023 - [c192]Weizhi Wang, Li Dong, Hao Cheng, Haoyu Song, Xiaodong Liu, Xifeng Yan, Jianfeng Gao, Furu Wei:
Visually-Augmented Language Modeling. ICLR 2023 - [c191]Haiteng Zhao, Shuming Ma, Dongdong Zhang, Zhi-Hong Deng, Furu Wei:
Are More Layers Beneficial to Graph Transformers? ICLR 2023 - [c190]Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Daniel Tompkins, Zhuo Chen, Wanxiang Che, Xiangzhan Yu, Furu Wei:
BEATs: Audio Pre-Training with Acoustic Tokenizers. ICML 2023: 5178-5193 - [c189]Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei:
Magneto: A Foundation Transformer. ICML 2023: 36077-36092 - [i199]Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei:
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers. CoRR abs/2301.02111 (2023) - [i198]Jian Yang, Yuwei Yin, Shuming Ma, Liqun Yang, Hongcheng Guo, Haoyang Huang, Dongdong Zhang, Yutao Zeng, Zhoujun Li, Furu Wei:
HanoiT: Enhancing Context-aware Translation via Selective Context. CoRR abs/2301.06825 (2023) - [i197]Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei:
Language Is Not All You Need: Aligning Perception with Language Models. CoRR abs/2302.14045 (2023) - [i196]Wei Huang, Zhiliang Peng, Li Dong, Furu Wei, Jianbin Jiao, Qixiang Ye:
Generic-to-Specific Distillation of Masked Autoencoders. CoRR abs/2302.14771 (2023) - [i195]Haiteng Zhao, Shuming Ma, Dongdong Zhang, Zhi-Hong Deng, Furu Wei:
Are More Layers Beneficial to Graph Transformers? CoRR abs/2303.00579 (2023) - [i194]Guangyue Peng, Tao Ge, Si-Qing Chen, Furu Wei, Houfeng Wang:
Semiparametric Language Models Are Scalable Continual Learners. CoRR abs/2303.01421 (2023) - [i193]Ziqiang Zhang, Long Zhou, Chengyi Wang, Sanyuan Chen, Yu Wu, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei:
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling. CoRR abs/2303.03926 (2023) - [i192]Liang Wang, Nan Yang, Furu Wei:
Query2doc: Query Expansion with Large Language Models. CoRR abs/2303.07678 (2023) - [i191]Daixuan Cheng, Shaohan Huang, Junyu Bi, Yuefeng Zhan, Jianfeng Liu, Yujing Wang, Hao Sun, Furu Wei, Denvy Deng, Qi Zhang:
UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation. CoRR abs/2303.08518 (2023) - [i190]Liang Chen, Shuming Ma, Dongdong Zhang, Furu Wei, Baobao Chang:
On the Pareto Front of Multilingual Neural Machine Translation. CoRR abs/2304.03216 (2023) - [i189]Nan Yang, Tao Ge, Liang Wang, Binxing Jiao, Daxin Jiang, Linjun Yang, Rangan Majumder, Furu Wei:
Inference with Reference: Lossless Acceleration of Large Language Models. CoRR abs/2304.04487 (2023) - [i188]Beiduo Chen, Shaohan Huang, Zihan Zhang, Wu Guo, Zhenhua Ling, Haizhen Huang, Furu Wei, Weiwei Deng, Qi Zhang:
Pre-training Language Model as a Multi-perspective Course Learner. CoRR abs/2305.03981 (2023) - [i187]Hongyuan Lu, Haoyang Huang, Dongdong Zhang, Haoran Yang, Wai Lam, Furu Wei:
Chain-of-Dictionary Prompting Elicits Translation in Large Language Models. CoRR abs/2305.06575 (2023) - [i186]Haoyang Huang, Tianyi Tang, Dongdong Zhang, Wayne Xin Zhao, Ting Song, Yan Xia, Furu Wei:
Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting. CoRR abs/2305.07004 (2023) - [i185]Yuxian Gu, Li Dong, Furu Wei, Minlie Huang:
Pre-Training to Learn in Context. CoRR abs/2305.09137 (2023) - [i184]Ziheng Li, Shaohan Huang, Zihan Zhang, Zhi-Hong Deng, Qiang Lou, Haizhen Huang, Jian Jiao, Furu Wei, Weiwei Deng, Qi Zhang:
Dual-Alignment Pre-training for Cross-lingual Sentence Embedding. CoRR abs/2305.09148 (2023) - [i183]Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei:
TextDiffuser: Diffusion Models as Text Painters. CoRR abs/2305.10855 (2023) - [i182]Liang Chen, Shuming Ma, Dongdong Zhang, Furu Wei, Baobao Chang:
On the Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation. CoRR abs/2305.10930 (2023) - [i181]Lan Jiang, Haoyang Huang, Dongdong Zhang, Rui Jiang, Furu Wei:
One-stop Training of Multiple Capacity Models. CoRR abs/2305.14066 (2023) - [i180]Tianyi Tang, Hongyuan Lu, Yuchen Eleanor Jiang, Haoyang Huang, Dongdong Zhang, Wayne Xin Zhao, Furu Wei:
Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing. CoRR abs/2305.15067 (2023) - [i179]Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei:
VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation. CoRR abs/2305.16107 (2023) - [i178]Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li:
Multiview Identifiers Enhanced Generative Retrieval. CoRR abs/2305.16675 (2023) - [i177]Weizhi Wang, Li Dong, Hao Cheng, Xiaodong Liu, Xifeng Yan, Jianfeng Gao, Furu Wei:
Augmenting Language Models with Long-Term Memory. CoRR abs/2306.07174 (2023) - [i176]Yuxian Gu, Li Dong, Furu Wei, Minlie Huang:
Knowledge Distillation of Large Language Models. CoRR abs/2306.08543 (2023) - [i175]Zhiliang Peng, Wenhui Wang, Li Dong, Yaru Hao, Shaohan Huang, Shuming Ma, Furu Wei:
Kosmos-2: Grounding Multimodal Large Language Models to the World. CoRR abs/2306.14824 (2023) - [i174]Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li:
Learning to Rank in Generative Retrieval. CoRR abs/2306.15222 (2023) - [i173]Jiayu Ding, Shuming Ma, Li Dong, Xingxing Zhang, Shaohan Huang, Wenhui Wang, Nanning Zheng, Furu Wei:
LongNet: Scaling Transformers to 1,000,000,000 Tokens. CoRR abs/2307.02486 (2023) - [i172]Zhenhailong Wang, Shaoguang Mao, Wenshan Wu, Tao Ge, Furu Wei, Heng Ji:
Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration. CoRR abs/2307.05300 (2023) - [i171]Tao Ge, Jing Hu, Xun Wang, Si-Qing Chen, Furu Wei:
In-context Autoencoder for Context Compression in a Large Language Model. CoRR abs/2307.06945 (2023) - [i170]Liang Wang, Nan Yang, Furu Wei:
Learning to Retrieve In-Context Examples for Large Language Models. CoRR abs/2307.07164 (2023) - [i169]Yutao Sun, Li Dong, Shaohan Huang, Shuming Ma, Yuqing Xia, Jilong Xue, Jianyong Wang, Furu Wei:
Retentive Network: A Successor to Transformer for Large Language Models. CoRR abs/2307.08621 (2023) - [i168]Guangyu Chen, Yu Wu, Shujie Liu, Tao Liu, Xiaoyong Du, Furu Wei:
WavMark: Watermarking for Audio Generation. CoRR abs/2308.12770 (2023) - [i167]Qingxiu Dong, Li Dong, Ke Xu, Guangyan Zhou, Yaru Hao, Zhifang Sui, Furu Wei:
Large Language Model for Science: A Study on P vs. NP. CoRR abs/2309.05689 (2023) - [i166]Daixuan Cheng, Shaohan Huang, Furu Wei:
Adapting Large Language Models via Reading Comprehension. CoRR abs/2309.09530 (2023)
- 2022
- [j21]Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Xiangzhan Yu, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. IEEE J. Sel. Top. Signal Process. 16(6): 1505-1518 (2022) - [j20]Haichao Zhu, Li Dong, Furu Wei, Bing Qin, Ting Liu:
Transforming Wikipedia Into Augmented Data for Query-Focused Summarization. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2357-2367 (2022) - [c188]Shusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei:
Sequence Level Contrastive Learning for Text Summarization. AAAI 2022: 11556-11565 - [c187]Shengqiang Zhang, Xingxing Zhang, Hangbo Bao, Furu Wei:
Attention Temperature Matters in Abstractive Summarization Distillation. ACL (1) 2022: 127-141 - [c186]Guanhua Chen, Shuming Ma, Yun Chen, Dongdong Zhang, Jia Pan, Wenping Wang, Furu Wei:
Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation. ACL (1) 2022: 142-157 - [c185]Ruipeng Jia, Xingxing Zhang, Yanan Cao, Zheng Lin, Shi Wang, Furu Wei:
Neural Label Search for Zero-Shot Multi-Lingual Extractive Summarization. ACL (1) 2022: 561-570 - [c184]Jing Qian, Li Dong, Yelong Shen, Furu Wei, Weizhu Chen:
Controllable Natural Language Generation with Contrastive Prefixes. ACL (Findings) 2022: 2912-2924 - [c183]Yiheng Xu, Tengchao Lv, Lei Cui, Guoxin Wang, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Furu Wei:
XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding. ACL (Findings) 2022: 3214-3224 - [c182]Tianyu Chen, Hangbo Bao, Shaohan Huang, Li Dong, Binxing Jiao, Daxin Jiang, Haoyi Zhou, Jianxin Li, Furu Wei:
THE-X: Privacy-Preserving Transformer Inference with Homomorphic Encryption. ACL (Findings) 2022: 3510-3520 - [c181]Junyi Ao, Rui Wang, Long Zhou, Chengyi Wang, Shuo Ren, Yu Wu, Shujie Liu, Tom Ko, Qing Li, Yu Zhang, Zhihua Wei, Yao Qian, Jinyu Li, Furu Wei:
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing. ACL (1) 2022: 5723-5738 - [c180]Junlong Li, Yiheng Xu, Lei Cui, Furu Wei:
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding. ACL (1) 2022: 6078-6087 - [c179]Haoyu Song, Li Dong, Weinan Zhang, Ting Liu, Furu Wei:
CLIP Models are Few-Shot Learners: Empirical Studies on VQA and Visual Entailment. ACL (1) 2022: 6088-6100 - [c178]Zewen Chi, Shaohan Huang, Li Dong, Shuming Ma, Bo Zheng, Saksham Singhal, Payal Bajaj, Xia Song, Xian-Ling Mao, Heyan Huang, Furu Wei:
XLM-E: Cross-lingual Language Model Pre-training via ELECTRA. ACL (1) 2022: 6170-6182 - [c177]Damai Dai, Li Dong, Shuming Ma, Bo Zheng, Zhifang Sui, Baobao Chang, Furu Wei:
StableMoE: Stable Routing Strategy for Mixture of Experts. ACL (1) 2022: 7085-7095 - [c176]Damai Dai, Li Dong, Yaru Hao, Zhifang Sui, Baobao Chang, Furu Wei:
Knowledge Neurons in Pretrained Transformers. ACL (1) 2022: 8493-8502 - [c175]Ze Liu, Han Hu, Yutong Lin, Zhuliang Yao, Zhenda Xie, Yixuan Wei, Jia Ning, Yue Cao, Zheng Zhang, Li Dong, Furu Wei, Baining Guo:
Swin Transformer V2: Scaling Up Capacity and Resolution. CVPR 2022: 11999-12009 - [c174]Jian Yang, Shaohan Huang, Shuming Ma, Yuwei Yin, Li Dong, Dongdong Zhang, Hongcheng Guo, Zhoujun Li, Furu Wei:
CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation. EMNLP (Findings) 2022: 486-496 - [c173]Jingye Chen, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei:
XDoc: Unified Pre-training for Cross-Format Document Understanding. EMNLP (Findings) 2022: 1006-1016 - [c172]Ziqiang Zhang, Long Zhou, Junyi Ao, Shujie Liu, Lirong Dai, Jinyu Li, Furu Wei:
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training. EMNLP 2022: 1663-1676 - [c171]Daixuan Cheng, Shaohan Huang, Jianfeng Liu, Yuefeng Zhan, Hao Sun, Furu Wei, Denvy Deng, Qi Zhang:
Snapshot-Guided Domain Adaptation for ELECTRA. EMNLP (Findings) 2022: 2226-2232 - [c170]Ting Jiang, Jian Jiao, Shaohan Huang, Zihan Zhang, Deqing Wang, Fuzhen Zhuang, Furu Wei, Haizhen Huang, Denvy Deng, Qi Zhang:
PromptBERT: Improving BERT Sentence Embeddings with Prompts. EMNLP 2022: 8826-8837 - [c169]Zekun Wang, Wenhui Wang, Haichao Zhu, Ming Liu, Bing Qin, Furu Wei:
Distilled Dual-Encoder Model for Vision-Language Understanding. EMNLP 2022: 8901-8913 - [c168]Tao Ge, Si-Qing Chen, Furu Wei:
EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation. EMNLP 2022: 10786-10798 - [c167]Lianzhe Huang, Shuming Ma, Dongdong Zhang, Furu Wei, Houfeng Wang:
Zero-shot Cross-lingual Transfer of Prompt-based Tuning with a Unified Multilingual Prompt. EMNLP 2022: 11488-11497 - [c166]Sanyuan Chen, Yu Wu, Chengyi Wang, Zhengyang Chen, Zhuo Chen, Shujie Liu, Jian Wu, Yao Qian, Furu Wei, Jinyu Li, Xiangzhan Yu:
Unispeech-Sat: Universal Speech Representation Learning With Speaker Aware Pre-Training. ICASSP 2022: 6152-6156 - [c165]Hangbo Bao, Li Dong, Songhao Piao, Furu Wei:
BEiT: BERT Pre-Training of Image Transformers. ICLR 2022 - [c164]Xin Sun, Tao Ge, Shuming Ma, Jingjing Li, Furu Wei, Houfeng Wang:
A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model. IJCAI 2022: 4367-4374 - [c163]Jian Yang, Yuwei Yin, Shuming Ma, Dongdong Zhang, Shuangzhi Wu, Hongcheng Guo, Zhoujun Li, Furu Wei:
UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation. IJCAI 2022: 4454-4460 - [c162]Jian Yang, Yuwei Yin, Shuming Ma, Dongdong Zhang, Zhoujun Li, Furu Wei:
High-resource Language-specific Training for Multilingual Neural Machine Translation. IJCAI 2022: 4461-4467 - [c161]Xuyang Jin, Tao Ge, Furu Wei:
Plug and Play Knowledge Distillation for kNN-LM with External Logits. AACL/IJCNLP (2) 2022: 463-469 - [c160]Chengyi Wang, Yiming Wang, Yu Wu, Sanyuan Chen, Jinyu Li, Shujie Liu, Furu Wei:
Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training. INTERSPEECH 2022: 2643-2647 - [c159]Shuo Ren, Shujie Liu, Yu Wu, Long Zhou, Furu Wei:
Speech Pre-training with Acoustic Piece. INTERSPEECH 2022: 2648-2652 - [c158]Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. INTERSPEECH 2022: 2658-2662 - [c157]Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Zhuo Chen, Peidong Wang, Gang Liu, Jinyu Li, Jian Wu, Xiangzhan Yu, Furu Wei:
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? INTERSPEECH 2022: 3699-3703 - [c156]Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-wise Permutation Invariant Training. INTERSPEECH 2022: 5383-5387 - [c155]Junlong Li, Yiheng Xu, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei:
DiT: Self-supervised Pre-training for Document Image Transformer. ACM Multimedia 2022: 3530-3539 - [c154]Yupan Huang, Tengchao Lv, Lei Cui, Yutong Lu, Furu Wei:
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking. ACM Multimedia 2022: 4083-4091 - [c153]Hangbo Bao, Wenhui Wang, Li Dong, Qiang Liu, Owais Khan Mohammed, Kriti Aggarwal, Subhojit Som, Songhao Piao, Furu Wei:
VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts. NeurIPS 2022 - [c152]Zewen Chi, Li Dong, Shaohan Huang, Damai Dai, Shuming Ma, Barun Patra, Saksham Singhal, Payal Bajaj, Xia Song, Xian-Ling Mao, Heyan Huang, Furu Wei:
On the Representation Collapse of Sparse Mixture of Experts. NeurIPS 2022 - [c151]Yunzhi Yao, Shaohan Huang, Li Dong, Furu Wei, Huajun Chen, Ningyu Zhang:
Kformer: Knowledge Injection in Transformer Feed-Forward Layers. NLPCC (1) 2022: 131-143 - [i165]Xu Zhang, Jian Yang, Haoyang Huang, Shuming Ma, Dongdong Zhang, Jinlong Li, Furu Wei:
SMDT: Selective Memory-Augmented Neural Document Translation. CoRR abs/2201.01631 (2022) - [i164]Juncheng Wan, Jian Yang, Shuming Ma, Dongdong Zhang, Weinan Zhang, Yong Yu, Furu Wei:
Phrase-level Adversarial Example Generation for Neural Machine Translation. CoRR abs/2201.02009 (2022) - [i163]Ting Jiang, Shaohan Huang, Zihan Zhang, Deqing Wang, Fuzhen Zhuang, Furu Wei, Haizhen Huang, Liangjie Zhang, Qi Zhang:
PromptBERT: Improving BERT Sentence Embeddings with Prompts. CoRR abs/2201.04337 (2022) - [i162]Yunzhi Yao, Shaohan Huang, Ningyu Zhang, Li Dong, Furu Wei, Huajun Chen:
Kformer: Knowledge Injection in Transformer Feed-Forward Layers. CoRR abs/2201.05742 (2022) - [i161]Xin Sun, Tao Ge, Shuming Ma, Jingjing Li, Furu Wei, Houfeng Wang:
A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model. CoRR abs/2201.10707 (2022) - [i160]Yuxin Fang, Li Dong, Hangbo Bao, Xinggang Wang, Furu Wei:
Corrupted Image Modeling for Self-Supervised Visual Pre-Training. CoRR abs/2202.03382 (2022) - [i159]Tao Ge, Furu Wei:
EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation. CoRR abs/2202.07959 (2022) - [i158]Da Yin, Li Dong, Hao Cheng, Xiaodong Liu, Kai-Wei Chang, Furu Wei, Jianfeng Gao:
A Survey of Knowledge-Intensive NLP with Pre-Trained Language Models. CoRR abs/2202.08772 (2022) - [i157]Lianzhe Huang, Shuming Ma, Dongdong Zhang, Furu Wei, Houfeng Wang:
Zero-shot Cross-lingual Transfer of Prompt-based Tuning with a Unified Multilingual Prompt. CoRR abs/2202.11451 (2022) - [i156]Jing Qian, Li Dong, Yelong Shen, Furu Wei, Weizhu Chen:
Controllable Natural Language Generation with Contrastive Prefixes. CoRR abs/2202.13257 (2022) - [i155]Hongyu Wang, Shuming Ma, Li Dong, Shaohan Huang, Dongdong Zhang, Furu Wei:
DeepNet: Scaling Transformers to 1,000 Layers. CoRR abs/2203.00555 (2022) - [i154]Junlong Li, Yiheng Xu, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei:
DiT: Self-supervised Pre-training for Document Image Transformer. CoRR abs/2203.02378 (2022) - [i153]