Furu Wei
2020 – today
- 2025
- [c257]Shaoguang Mao, Yuzhe Cai, Yan Xia, Wenshan Wu, Xun Wang, Fengyi Wang, Qiang Guan, Tao Ge, Furu Wei:
ALYMPICS: LLM Agents Meet Game Theory. COLING 2025: 2845-2866
- 2024
- [j31]Hangbo Bao, Li Dong, Wenhui Wang, Nan Yang, Songhao Piao, Furu Wei:
Fine-tuning pretrained transformer encoders for sequence-to-sequence learning. Int. J. Mach. Learn. Cybern. 15(5): 1711-1728 (2024)
- [j30]Hongyu Wang, Shuming Ma, Li Dong, Shaohan Huang, Dongdong Zhang, Furu Wei:
DeepNet: Scaling Transformers to 1,000 Layers. IEEE Trans. Pattern Anal. Mach. Intell. 46(10): 6761-6774 (2024)
- [j29]Ziqiang Zhang, Sanyuan Chen, Long Zhou, Yu Wu, Shuo Ren, Shujie Liu, Zhuoyuan Yao, Xun Gong, Li-Rong Dai, Jinyu Li, Furu Wei:
SpeechLM: Enhanced Speech Pre-Training With Unpaired Textual Data. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2177-2187 (2024)
- [j28]Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei:
VioLA: Conditional Language Models for Speech Recognition, Synthesis, and Translation. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3709-3716 (2024)
- [j27]Wei Huang, Zhiliang Peng, Li Dong, Furu Wei, Qixiang Ye, Jianbin Jiao:
Generic-to-Specific Distillation of Masked Autoencoders. IEEE Trans. Circuits Syst. Video Technol. 34(9): 8779-8793 (2024)
- [j26]Qiushi Zhu, Long Zhou, Ziqiang Zhang, Shujie Liu, Binxing Jiao, Jie Zhang, Li-Rong Dai, Daxin Jiang, Jinyu Li, Furu Wei:
VatLM: Visual-Audio-Text Pre-Training With Unified Masked Prediction for Speech Representation Learning. IEEE Trans. Multim. 26: 1055-1064 (2024)
- [c256]Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li:
Learning to Rank in Generative Retrieval. AAAI 2024: 8716-8723
- [c255]Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
Text Diffusion with Reinforced Conditioning. AAAI 2024: 14069-14077
- [c254]Liang Zhang, Qin Jin, Haoyang Huang, Dongdong Zhang, Furu Wei:
Respond in my Language: Mitigating Language Inconsistency in Response Generation based on Large Language Models. ACL (1) 2024: 4177-4192
- [c253]Haoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Furu Wei, Qi Zhang:
Se²: Sequential Example Selection for In-Context Learning. ACL (Findings) 2024: 5262-5284
- [c252]Tianyi Tang, Wenyang Luo, Haoyang Huang, Dongdong Zhang, Xiaolei Wang, Xin Zhao, Furu Wei, Ji-Rong Wen:
Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models. ACL (1) 2024: 5701-5715
- [c251]Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition. ACL (1) 2024: 7641-7660
- [c250]Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
ResLoRA: Identity Residual Mapping in Low-Rank Adaption. ACL (Findings) 2024: 8870-8884
- [c249]Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei:
Improving Text Embeddings with Large Language Models. ACL (1) 2024: 11897-11916
- [c248]Xin Cheng, Xun Wang, Tao Ge, Si-Qing Chen, Furu Wei, Dongyan Zhao, Rui Yan:
SCALE: Synergized Collaboration of Asymmetric Language Translation Engines. ACL (Findings) 2024: 15903-15918
- [c247]Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
Calibrating LLM-Based Evaluator. LREC/COLING 2024: 2638-2656
- [c246]Zonglin Yang, Li Dong, Xinya Du, Hao Cheng, Erik Cambria, Xiaodong Liu, Jianfeng Gao, Furu Wei:
Language Models as Inductive Reasoners. EACL (1) 2024: 209-225
- [c245]Hongyuan Lu, Haoyang Huang, Dongdong Zhang, Furu Wei, Wai Lam:
Revamping Multilingual Agreement Bidirectionally via Switched Back-translation for Multilingual Neural Machine Translation. EACL (Findings) 2024: 264-275
- [c244]Liang Wang, Nan Yang, Furu Wei:
Learning to Retrieve In-Context Examples for Large Language Models. EACL (1) 2024: 1752-1767
- [c243]Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei:
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering. ECCV (5) 2024: 386-402
- [c242]Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li:
LongEmbed: Extending Embedding Models for Long Context Retrieval. EMNLP 2024: 802-816
- [c241]Hongyuan Lu, Haoran Yang, Haoyang Huang, Dongdong Zhang, Wai Lam, Furu Wei:
Chain-of-Dictionary Prompting Elicits Translation in Large Language Models. EMNLP 2024: 958-976
- [c240]Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei:
Instruction Pre-Training: Language Models are Supervised Multitask Learners. EMNLP 2024: 2529-2550
- [c239]Shujie Hu, Long Zhou, Shujie Liu, Sanyuan Chen, Lingwei Meng, Hongkun Hao, Jing Pan, Xunying Liu, Jinyu Li, Sunit Sivasankaran, Linquan Liu, Furu Wei:
WavLLM: Towards Robust and Adaptive Speech Large Language Model. EMNLP (Findings) 2024: 4552-4572
- [c238]Tao Ge, Jing Hu, Lei Wang, Xun Wang, Si-Qing Chen, Furu Wei:
In-context Autoencoder for Context Compression in a Large Language Model. ICLR 2024
- [c237]Daixuan Cheng, Shaohan Huang, Furu Wei:
Adapting Large Language Models via Reading Comprehension. ICLR 2024
- [c236]Yuxian Gu, Li Dong, Furu Wei, Minlie Huang:
MiniLLM: Knowledge Distillation of Large Language Models. ICLR 2024
- [c235]Xichen Pan, Li Dong, Shaohan Huang, Zhiliang Peng, Wenhu Chen, Furu Wei:
Kosmos-G: Generating Images in Context with Multimodal Large Language Models. ICLR 2024
- [c234]Zhiliang Peng, Wenhui Wang, Li Dong, Yaru Hao, Shaohan Huang, Shuming Ma, Qixiang Ye, Furu Wei:
Grounding Multimodal Large Language Models to the World. ICLR 2024
- [c233]Xun Wu, Shaohan Huang, Furu Wei:
Mixture of LoRA Experts. ICLR 2024
- [c232]Dawei Zhu, Nan Yang, Liang Wang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li:
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training. ICLR 2024
- [c231]Zhengyang Tang, Xingxing Zhang, Benyou Wang, Furu Wei:
MathScale: Scaling Instruction Tuning for Mathematical Reasoning. ICML 2024
- [c230]Zhi Wang, Xun Wu, Shaohan Huang, Li Dong, Wenhui Wang, Shuming Ma, Furu Wei:
KOSMOS-E: Learning to Follow Instruction for Robotic Grasping. IROS 2024: 9510-9517
- [c229]Yuzhe Cai, Shaoguang Mao, Wenshan Wu, Zehua Wang, Yaobo Liang, Tao Ge, Chenfei Wu, Wang You, Ting Song, Yan Xia, Nan Duan, Furu Wei:
Low-code LLM: Graphical User Interface over Large Language Models. NAACL (Demonstrations) 2024: 12-25
- [c228]Zhenhailong Wang, Shaoguang Mao, Wenshan Wu, Tao Ge, Furu Wei, Heng Ji:
Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration. NAACL-HLT 2024: 257-279
- [c227]Tianyi Tang, Hongyuan Lu, Yuchen Jiang, Haoyang Huang, Dongdong Zhang, Wayne Xin Zhao, Tom Kocmi, Furu Wei:
Not All Metrics Are Guilty: Improving NLG Evaluation by Diversifying References. NAACL-HLT 2024: 6596-6610
- [c226]Xueguang Ma, Liang Wang, Nan Yang, Furu Wei, Jimmy Lin:
Fine-Tuning LLaMA for Multi-Stage Text Retrieval. SIGIR 2024: 2421-2425
- [i263]Hongkun Hao, Long Zhou, Shujie Liu, Jinyu Li, Shujie Hu, Rui Wang, Furu Wei:
Boosting Large Language Model for Speech Synthesis: An Empirical Study. CoRR abs/2401.00246 (2024)
- [i262]Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei:
Improving Text Embeddings with Large Language Models. CoRR abs/2401.00368 (2024)
- [i261]Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang:
Improving Domain Adaptation through Extended-Text Reading Comprehension. CoRR abs/2401.07284 (2024)
- [i260]Yadong Zhang, Shaoguang Mao, Tao Ge, Xun Wang, Yan Xia, Man Lan, Furu Wei:
K-Level Reasoning with Large Language Models. CoRR abs/2402.01521 (2024)
- [i259]Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei:
Multilingual E5 Text Embeddings: A Technical Report. CoRR abs/2402.05672 (2024)
- [i258]Niklas Muennighoff, Hongjin Su, Liang Wang, Nan Yang, Furu Wei, Tao Yu, Amanpreet Singh, Douwe Kiela:
Generative Representational Instruction Tuning. CoRR abs/2402.09906 (2024)
- [i257]Haoran Li, Qingxiu Dong, Zhengyang Tang, Chaojun Wang, Xingxing Zhang, Haoyang Huang, Shaohan Huang, Xiaolong Huang, Zeqiang Huang, Dongdong Zhang, Yuxian Gu, Xin Cheng, Xun Wang, Si-Qing Chen, Li Dong, Wei Lu, Zhifang Sui, Benyou Wang, Wai Lam, Furu Wei:
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models. CoRR abs/2402.13064 (2024)
- [i256]Haoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Furu Wei, Qi Zhang:
Se2: Sequential Example Selection for In-Context Learning. CoRR abs/2402.13874 (2024)
- [i255]Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
Text Diffusion with Reinforced Conditioning. CoRR abs/2402.14843 (2024)
- [i254]Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition. CoRR abs/2402.15754 (2024)
- [i253]Tianyi Tang, Wenyang Luo, Haoyang Huang, Dongdong Zhang, Xiaolei Wang, Xin Zhao, Furu Wei, Ji-Rong Wen:
Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models. CoRR abs/2402.16438 (2024)
- [i252]Yuxian Gu, Li Dong, Yaru Hao, Qingxiu Dong, Minlie Huang, Furu Wei:
Towards Optimal Learning of Language Models. CoRR abs/2402.17759 (2024)
- [i251]Shuming Ma, Hongyu Wang, Lingxiao Ma, Lei Wang, Wenhui Wang, Shaohan Huang, Li Dong, Ruiping Wang, Jilong Xue, Furu Wei:
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits. CoRR abs/2402.17764 (2024)
- [i250]Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
ResLoRA: Identity Residual Mapping in Low-Rank Adaption. CoRR abs/2402.18039 (2024)
- [i249]Zhengyang Tang, Xingxing Zhang, Benyou Wang, Furu Wei:
MathScale: Scaling Instruction Tuning for Mathematical Reasoning. CoRR abs/2403.02884 (2024)
- [i248]Shujie Hu, Long Zhou, Shujie Liu, Sanyuan Chen, Hongkun Hao, Jing Pan, Xunying Liu, Jinyu Li, Sunit Sivasankaran, Linquan Liu, Furu Wei:
WavLLM: Towards Robust and Adaptive Speech Large Language Model. CoRR abs/2404.00656 (2024)
- [i247]Yadong Zhang, Shaoguang Mao, Tao Ge, Xun Wang, Adrian de Wynter, Yan Xia, Wenshan Wu, Ting Song, Man Lan, Furu Wei:
LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models. CoRR abs/2404.01230 (2024)
- [i246]Wenshan Wu, Shaoguang Mao, Yadong Zhang, Yan Xia, Li Dong, Lei Cui, Furu Wei:
Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models. CoRR abs/2404.03622 (2024)
- [i245]Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li:
LongEmbed: Extending Embedding Models for Long Context Retrieval. CoRR abs/2404.12096 (2024)
- [i244]Xun Wu, Shaohan Huang, Furu Wei:
Mixture of LoRA Experts. CoRR abs/2404.13628 (2024)
- [i243]Xun Wu, Shaohan Huang, Wenhui Wang, Furu Wei:
Multi-Head Mixture-of-Experts. CoRR abs/2404.15045 (2024)
- [i242]Xun Wu, Shaohan Huang, Furu Wei:
Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation. CoRR abs/2404.15100 (2024)
- [i241]Jiawei Zhou, Li Dong, Furu Wei, Lei Chen:
Semi-Parametric Retrieval via Binary Token Index. CoRR abs/2405.01924 (2024)
- [i240]Yutao Sun, Li Dong, Yi Zhu, Shaohan Huang, Wenhui Wang, Shuming Ma, Quanlu Zhang, Jianyong Wang, Furu Wei:
You Only Cache Once: Decoder-Decoder Architectures for Language Models. CoRR abs/2405.05254 (2024)
- [i239]Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang:
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning. CoRR abs/2405.12130 (2024)
- [i238]Xin Cheng, Xun Wang, Xingxing Zhang, Tao Ge, Si-Qing Chen, Furu Wei, Huishuai Zhang, Dongyan Zhao:
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token. CoRR abs/2405.13792 (2024)
- [i237]Sanyuan Chen, Shujie Liu, Long Zhou, Yanqing Liu, Xu Tan, Jinyu Li, Sheng Zhao, Yao Qian, Furu Wei:
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers. CoRR abs/2406.05370 (2024)
- [i236]Bing Han, Long Zhou, Shujie Liu, Sanyuan Chen, Lingwei Meng, Yanming Qian, Yanqing Liu, Sheng Zhao, Jinyu Li, Furu Wei:
VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment. CoRR abs/2406.07855 (2024)
- [i235]Peizhong Gao, Ao Xie, Shaoguang Mao, Wenshan Wu, Yan Xia, Haipeng Mi, Furu Wei:
Meta Reasoning for Large Language Models. CoRR abs/2406.11698 (2024)
- [i234]Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei:
Instruction Pre-Training: Language Models are Supervised Multitask Learners. CoRR abs/2406.14491 (2024)
- [i233]Yixing Li, Yuxian Gu, Li Dong, Dequan Wang, Yu Cheng, Furu Wei:
Direct Preference Knowledge Distillation for Large Language Models. CoRR abs/2406.19774 (2024)
- [i232]Yadong Zhang, Shaoguang Mao, Wenshan Wu, Yan Xia, Tao Ge, Man Lan, Furu Wei:
Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning. CoRR abs/2407.06112 (2024)
- [i231]Lingwei Meng, Long Zhou, Shujie Liu, Sanyuan Chen, Bing Han, Shujie Hu, Yanqing Liu, Jinyu Li, Sheng Zhao, Xixin Wu, Helen Meng, Furu Wei:
Autoregressive Speech Synthesis without Vector Quantization. CoRR abs/2407.08551 (2024)
- [i230]Hongyu Wang, Shuming Ma, Ruiping Wang, Furu Wei:
Q-Sparse: All Large Language Models can be Fully Sparsely-Activated. CoRR abs/2407.10969 (2024)
- [i229]Johan Bjorck, Alon Benhaim, Vishrav Chaudhary, Furu Wei, Xia Song:
Scaling Optimal LR Across Token Horizon. CoRR abs/2409.19913 (2024)
- [i228]Tianzhu Ye, Li Dong, Yuqing Xia, Yutao Sun, Yi Zhu, Gao Huang, Furu Wei:
Differential Transformer. CoRR abs/2410.05258 (2024)
- [i227]Qingxiu Dong, Li Dong, Xingxing Zhang, Zhifang Sui, Furu Wei:
Self-Boosting Large Language Models with Synthetic Preference Data. CoRR abs/2410.06961 (2024)
- [i226]Yuxian Gu, Li Dong, Hongning Wang, Yaru Hao, Qingxiu Dong, Furu Wei, Minlie Huang:
Data Selection via Optimal Control for Language Models. CoRR abs/2410.07064 (2024)
- [i225]Fangru Lin, Shaoguang Mao, Emanuele La Malfa, Valentin Hofmann, Adrian de Wynter, Jing Yao, Si-Qing Chen, Michael J. Wooldridge, Furu Wei:
One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks. CoRR abs/2410.11005 (2024)
- [i224]Jinheng Wang, Hansong Zhou, Ting Song, Shaoguang Mao, Shuming Ma, Hongyu Wang, Yan Xia, Furu Wei:
1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs. CoRR abs/2410.16144 (2024)
- [i223]Haonan Chen, Liang Wang, Nan Yang, Yutao Zhu, Ziliang Zhao, Furu Wei, Zhicheng Dou:
Little Giants: Synthesizing High-Quality Embedding Data at Scale. CoRR abs/2410.18634 (2024)
- [i222]Hengyuan Zhang, Chenming Shang, Sizhe Wang, Dongdong Zhang, Feng Yao, Renliang Sun, Yiyao Yu, Yujiu Yang, Furu Wei:
ShifCon: Enhancing Non-Dominant Language Capabilities with a Shift-based Contrastive Framework. CoRR abs/2410.19453 (2024)
- [i221]Zongyi Li, Shujie Hu, Shujie Liu, Long Zhou, Jeongsoo Choi, Lingwei Meng, Xun Guo, Jinyu Li, Hefei Ling, Furu Wei:
ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation. CoRR abs/2410.20502 (2024)
- [i220]Lingjie Jiang, Shaohan Huang, Xun Wu, Furu Wei:
Textual Aesthetics in Large Language Models. CoRR abs/2411.02930 (2024)
- [i219]Hongyu Wang, Shuming Ma, Furu Wei:
BitNet a4.8: 4-bit Activations for 1-bit LLMs. CoRR abs/2411.04965 (2024)
- [i218]Shaohan Huang, Xun Wu, Shuming Ma, Furu Wei:
MH-MoE: Multi-Head Mixture-of-Experts. CoRR abs/2411.16205 (2024)
- [i217]Fangkai Jiao, Geyang Guo, Xingxing Zhang, Nancy F. Chen, Shafiq Joty, Furu Wei:
Preference Optimization for Reasoning with Pseudo Feedback. CoRR abs/2411.16345 (2024)
- [i216]Yaoyao Chang, Lei Cui, Li Dong, Shaohan Huang, Yangyu Huang, Yupan Huang, Scarlett Li, Tengchao Lv, Shuming Ma, Qinzheng Sun, Wenhui Wang, Furu Wei, Ying Xin, Mao Yang, Qiufeng Yin, Xingxing Zhang:
RedStone: Curating General, Code, Math, and QA Data for Large Language Models. CoRR abs/2412.03398 (2024)
- [i215]Yutao Sun, Hangbo Bao, Wenhui Wang, Zhiliang Peng, Li Dong, Shaohan Huang, Jianyong Wang, Furu Wei:
Multimodal Latent Language Modeling with Next-Token Diffusion. CoRR abs/2412.08635 (2024)
- 2023
- [j25]Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li:
Generative retrieval for conversational question answering. Inf. Process. Manag. 60(5): 103475 (2023)
- [j24]Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei:
Large Search Model: Redefining Search Stack in the Era of LLMs. SIGIR Forum 57(2): 23:1-23:16 (2023)
- [j23]Jian Yang, Yuwei Yin, Liqun Yang, Shuming Ma, Haoyang Huang, Dongdong Zhang, Furu Wei, Zhoujun Li:
GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1489-1498 (2023)
- [j22]Zhiliang Peng, Li Dong, Hangbo Bao, Furu Wei, Qixiang Ye:
A Unified View of Masked Image Modeling. Trans. Mach. Learn. Res. 2023 (2023)
- [c225]Minghao Li, Tengchao Lv, Jingye Chen, Lei Cui, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Zhoujun Li, Furu Wei:
TrOCR: Transformer-Based Optical Character Recognition with Pre-trained Models. AAAI 2023: 13094-13102
- [c224]Yuan Xie, Shaohan Huang, Tianyu Chen, Furu Wei:
MoEC: Mixture of Expert Clusters. AAAI 2023: 13807-13815
- [c223]Beiduo Chen, Shaohan Huang, Zihan Zhang, Wu Guo, Zhenhua Ling, Haizhen Huang, Furu Wei, Weiwei Deng, Qi Zhang:
Pre-training Language Model as a Multi-perspective Course Learner. ACL (Findings) 2023: 114-128
- [c222]Liang Wang, Nan Yang, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang, Rangan Majumder, Furu Wei:
SimLM: Pre-training with Representation Bottleneck for Dense Passage Retrieval. ACL (1) 2023: 2244-2258
- [c221]Ziheng Li, Shaohan Huang, Zihan Zhang, Zhi-Hong Deng, Qiang Lou, Haizhen Huang, Jian Jiao, Furu Wei, Weiwei Deng, Qi Zhang:
Dual-Alignment Pre-training for Cross-lingual Sentence Embedding. ACL (1) 2023: 3466-3478
- [c220]Damai Dai, Yutao Sun, Li Dong, Yaru Hao, Shuming Ma, Zhifang Sui, Furu Wei:
Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers. ACL (Findings) 2023: 4005-4019
- [c219]Yuxian Gu, Li Dong, Furu Wei, Minlie Huang:
Pre-Training to Learn in Context. ACL (1) 2023: 4849-4870
- [c218]Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li:
Multiview Identifiers Enhanced Generative Retrieval. ACL (1) 2023: 6636-6648
- [c217]Jian Yang, Shuming Ma, Li Dong, Shaohan Huang, Haoyang Huang, Yuwei Yin, Dongdong Zhang, Liqun Yang, Furu Wei, Zhoujun Li:
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator. ACL (1) 2023: 9394-9412
- [c216]Liang Chen, Shuming Ma, Dongdong Zhang, Furu Wei, Baobao Chang:
On the Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation. ACL (Findings) 2023: 9542-9558
- [c215]Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang, Alon Benhaim, Vishrav Chaudhary, Xia Song, Furu Wei:
A Length-Extrapolatable Transformer. ACL (1) 2023: 14590-14604
- [c214]Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei, Vishrav Chaudhary, Xia Song:
Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning. ACL (1) 2023: 15354-15373
- [c213]Jinghao Zhou, Li Dong, Zhe Gan, Lijuan Wang, Furu Wei:
Non-Contrastive Learning Meets Language-Image Pre-Training. CVPR 2023: 11028-11038
- [c212]Wei Huang, Zhiliang Peng, Li Dong, Furu Wei, Jianbin Jiao, Qixiang Ye:
Generic-to-Specific Distillation of Masked Autoencoders. CVPR 2023: 15996-16005
- [c211]Wenhui Wang, Hangbo Bao, Li Dong, Johan Bjorck, Zhiliang Peng, Qiang Liu, Kriti Aggarwal, Owais Khan Mohammed, Saksham Singhal, Subhojit Som, Furu Wei:
Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks. CVPR 2023: 19175-19186
- [c210]Jian Yang, Yuwei Yin, Shuming Ma, Liqun Yang, Hongcheng Guo, Haoyang Huang, Dongdong Zhang, Yutao Zeng, Zhoujun Li, Furu Wei:
HanoiT: Enhancing Context-aware Translation via Selective Context. DASFAA (3) 2023: 471-486
- [c209]Zhaoyang Wang, Shaohan Huang, Yuxuan Liu, Jiahai Wang, Minghui Song, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
Democratizing Reasoning Ability: Tailored Learning from Large Language Model. EMNLP 2023: 1948-1966
- [c208]Heming Xia, Tao Ge, Peiyi Wang, Si-Qing Chen, Furu Wei, Zhifang Sui:
Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation. EMNLP (Findings) 2023: 3909-3925
- [c207]