default search action

combined dblp search
author search
venue search
publication search

ask others

Yu Bai 0017

> Home > Persons

Person information

affiliation: Salesforce Research, Palo Alto, CA, USA
affiliation (PhD 2019): Stanford University, CA, USA

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tit/ChenMBYPW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tit/ChenMBYPW24
Minshuo Chen, Jie Meng, Yu Bai, Yinyu Ye, H. Vincent Poor, Mengdi Wang:
Efficient Reinforcement Learning With Impaired Observability: Learning to Act With Delayed and Missing State Observations. IEEE Trans. Inf. Theory 70(10): 7251-7272 (2024)
[c39]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/0004HMWXS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0004HMWXS024
Tianyu Guo, Wei Hu, Song Mei, Huan Wang, Caiming Xiong, Silvio Savarese, Yu Bai:
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations. ICLR 2024
[c38]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/GuoCWXW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GuoCWXW024
Jiacheng Guo, Minshuo Chen, Huan Wang, Caiming Xiong, Mengdi Wang, Yu Bai:
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight. ICLR 2024
[c37]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/Lin0M24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Lin0M24
Licong Lin, Yu Bai, Song Mei:
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining. ICLR 2024
[c36]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/ZhaoW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZhaoW024
Lei Zhao, Mengdi Wang, Yu Bai:
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective. ICML 2024
[i44]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-10941
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-10941
Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese:
Text2Data: Low-Resource Data Generation with Textual Control. CoRR abs/2402.10941 (2024)
[i43]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-05868
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-05868
Ruiqi Zhang, Licong Lin, Yu Bai, Song Mei:
Negative Preference Optimization: From Catastrophic Collapse to Effective Unlearning. CoRR abs/2404.05868 (2024)
2023
[c35]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/colt/WangL0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/WangL0023
Yuanhao Wang, Qinghua Liu, Yu Bai, Chi Jin:
Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation. COLT 2023: 2793-2848
[c34]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/0004K0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0004K0023
Yuanhao Wang, Dingwen Kong, Yu Bai, Chi Jin:
Learning Rationalizable Equilibria in Multiplayer Games. ICLR 2023
[c33]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/Chen0M23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Chen0M23
Fan Chen, Yu Bai, Song Mei:
Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms. ICLR 2023
[c32]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/XieF00K23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/XieF00K23
Tengyang Xie, Dylan J. Foster, Yu Bai, Nan Jiang, Sham M. Kakade:
The Role of Coverage in Online Reinforcement Learning. ICLR 2023
[c31]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/BhatnagarWX023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BhatnagarWX023
Aadyot Bhatnagar, Huan Wang, Caiming Xiong, Yu Bai:
Improved Online Conformal Prediction via Strongly Adaptive Online Learning. ICML 2023: 2337-2363
[c30]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/ChenWXMB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChenWXMB23
Fan Chen, Huan Wang, Caiming Xiong, Song Mei, Yu Bai:
Lower Bounds for Learning in Revealing POMDPs. ICML 2023: 5104-5161
[c29]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/BaiCWXM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BaiCWXM23
Yu Bai, Fan Chen, Huan Wang, Caiming Xiong, Song Mei:
Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection. NeurIPS 2023
[c28]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/Chen0PW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Chen0PW23
Minshuo Chen, Yu Bai, H. Vincent Poor, Mengdi Wang:
Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations. NeurIPS 2023
[c27]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/Fu00M23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Fu00M23
Hengyu Fu, Tianyu Guo, Yu Bai, Song Mei:
What can a Single Attention Layer Learn? A Study Through the Random Features Lens. NeurIPS 2023
[i42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-01333
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-01333
Fan Chen, Huan Wang, Caiming Xiong, Song Mei, Yu Bai:
Lower Bounds for Learning in Revealing POMDPs. CoRR abs/2302.01333 (2023)
[i41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-06606
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-06606
Yuanhao Wang, Qinghua Liu, Yu Bai, Chi Jin:
Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation. CoRR abs/2302.06606 (2023)
[i40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-07869
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-07869
Aadyot Bhatnagar, Huan Wang, Caiming Xiong, Yu Bai:
Improved Online Conformal Prediction via Strongly Adaptive Online Learning. CoRR abs/2302.07869 (2023)
[i39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-01243
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-01243
Minshuo Chen, Yu Bai, H. Vincent Poor, Mengdi Wang:
Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations. CoRR abs/2306.01243 (2023)
[i38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-04637
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-04637
Yu Bai, Fan Chen, Huan Wang, Caiming Xiong, Song Mei:
Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection. CoRR abs/2306.04637 (2023)
[i37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-02884
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-02884
Jiacheng Guo, Minshuo Chen, Huan Wang, Caiming Xiong, Mengdi Wang, Yu Bai:
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight. CoRR abs/2307.02884 (2023)
[i36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-11353
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-11353
Hengyu Fu, Tianyu Guo, Yu Bai, Song Mei:
What can a Single Attention Layer Learn? A Study Through the Random Features Lens. CoRR abs/2307.11353 (2023)
[i35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-08566
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-08566
Licong Lin, Yu Bai, Song Mei:
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining. CoRR abs/2310.08566 (2023)
[i34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-10616
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-10616
Tianyu Guo, Wei Hu, Song Mei, Huan Wang, Caiming Xiong, Silvio Savarese, Yu Bai:
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations. CoRR abs/2310.10616 (2023)
[i33]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-00054
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-00054
Lei Zhao, Mengdi Wang, Yu Bai:
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? CoRR abs/2312.00054 (2023)
2022
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ChoubeyBWLR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ChoubeyBWLR22
Prafulla Kumar Choubey, Yu Bai, Chien-Sheng Wu, Wenhao Liu, Nazneen Rajani:
Conformal Predictor for Improving Zero-Shot Text Classification Efficiency. EMNLP 2022: 3027-3034
[c25]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/BaiMWZX22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BaiMWZX22
Yu Bai, Song Mei, Huan Wang, Yingbo Zhou, Caiming Xiong:
Efficient and Differentiable Conformal Prediction with General Function Classes. ICLR 2022
[c24]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/SongMB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SongMB22
Ziang Song, Song Mei, Yu Bai:
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently? ICLR 2022
[c23]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/BaiJMY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BaiJMY22
Yu Bai, Chi Jin, Song Mei, Tiancheng Yu:
Near-Optimal Learning of Extensive-Form Games with Imperfect Information. ICML 2022: 1337-1382
[c22]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/00170MSY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/00170MSY22
Yu Bai, Chi Jin, Song Mei, Ziang Song, Tiancheng Yu:
Efficient Phi-Regret Minimization in Extensive-Form Games via Online Mirror Descent. NeurIPS 2022
[c21]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/Nichani0L22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Nichani0L22
Eshaan Nichani, Yu Bai, Jason D. Lee:
Identifying good directions to escape the NTK regime and efficiently learn low-degree plus sparse polynomials. NeurIPS 2022
[c20]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/SongM022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SongM022
Ziang Song, Song Mei, Yu Bai:
Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games. NeurIPS 2022
[c19]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/ZhangLWX0022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhangLWX0022
Runyu Zhang, Qinghua Liu, Huan Wang, Caiming Xiong, Na Li, Yu Bai:
Policy Optimization for Markov Games: Unified Framework and Faster Convergence. NeurIPS 2022
[c18]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/uai/LuoBBZWXSESP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/LuoBBZWXSESP22
Rachel Luo, Aadyot Bhatnagar, Yu Bai, Shengjia Zhao, Huan Wang, Caiming Xiong, Silvio Savarese, Stefano Ermon, Edward Schmerling, Marco Pavone:
Local calibration: metrics and recalibration. UAI 2022: 1286-1295
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-01752
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-01752
Yu Bai, Chi Jin, Song Mei, Tiancheng Yu:
Near-Optimal Learning of Extensive-Form Games with Imperfect Information. CoRR abs/2202.01752 (2022)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-11091
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-11091
Yu Bai, Song Mei, Huan Wang, Yingbo Zhou, Caiming Xiong:
Efficient and Differentiable Conformal Prediction with General Function Classes. CoRR abs/2202.11091 (2022)
[i30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-07223
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-07223
Ziang Song, Song Mei, Yu Bai:
Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games. CoRR abs/2205.07223 (2022)
[i29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-15294
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-15294
Yu Bai, Chi Jin, Song Mei, Ziang Song, Tiancheng Yu:
Efficient Φ-Regret Minimization in Extensive-Form Games via Online Mirror Descent. CoRR abs/2205.15294 (2022)
[i28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-02640
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-02640
Runyu Zhang, Qinghua Liu, Huan Wang, Caiming Xiong, Na Li, Yu Bai:
Policy Optimization for Markov Games: Unified Framework and Faster Convergence. CoRR abs/2206.02640 (2022)
[i27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-03688
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-03688
Eshaan Nichani, Yu Bai, Jason D. Lee:
Identifying good directions to escape the NTK regime and efficiently learn low-degree plus sparse polynomials. CoRR abs/2206.03688 (2022)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-11745
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-11745
Fan Chen, Song Mei, Yu Bai:
Unified Algorithms for RL with Decision-Estimation Coefficients: No-Regret, PAC, and Reward-Free Learning. CoRR abs/2209.11745 (2022)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-14990
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-14990
Fan Chen, Yu Bai, Song Mei:
Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms. CoRR abs/2209.14990 (2022)
[i24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-04157
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-04157
Tengyang Xie, Dylan J. Foster, Yu Bai, Nan Jiang, Sham M. Kakade:
The Role of Coverage in Online Reinforcement Learning. CoRR abs/2210.04157 (2022)
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-11402
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-11402
Yuanhao Wang, Dingwen Kong, Yu Bai, Chi Jin:
Learning Rationalizable Equilibria in Multiplayer Games. CoRR abs/2210.11402 (2022)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-12619
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-12619
Prafulla Kumar Choubey, Yu Bai, Chien-Sheng Wu, Wenhao Liu, Nazneen Rajani:
Conformal Predictor for Improving Zero-shot Text Classification Efficiency. CoRR abs/2210.12619 (2022)
2021
[c17]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/aistats/YinBW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/YinBW21
Ming Yin, Yu Bai, Yu-Xiang Wang:
Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning. AISTATS 2021: 1567-1575
[c16]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/BaiCZZLKWX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BaiCZZLKWX21
Yu Bai, Minshuo Chen, Pan Zhou, Tuo Zhao, Jason D. Lee, Sham M. Kakade, Huan Wang, Caiming Xiong:
How Important is the Train-Validation Split in Meta-Learning? ICML 2021: 543-553
[c15]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/BaiMWX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BaiMWX21
Yu Bai, Song Mei, Huan Wang, Caiming Xiong:
Don't Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification. ICML 2021: 566-576
[c14]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/LiuYBJ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LiuYBJ21
Qinghua Liu, Tiancheng Yu, Yu Bai, Chi Jin:
A Sharp Analysis of Model-based Reinforcement Learning with Self-Play. ICML 2021: 7001-7010
[c13]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/YangBM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangBM21
Zitong Yang, Yu Bai, Song Mei:
Exact Gap between Generalization Error and Uniform Convergence in Random Feature Models. ICML 2021: 11704-11715
[c12]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/YinBW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YinBW21
Ming Yin, Yu Bai, Yu-Xiang Wang:
Near-Optimal Offline Reinforcement Learning via Double Variance Reduction. NeurIPS 2021: 7677-7688
[c11]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/BaiMWX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BaiMWX21
Yu Bai, Song Mei, Huan Wang, Caiming Xiong:
Understanding the Under-Coverage Bias in Uncertainty Estimation. NeurIPS 2021: 18307-18319
[c10]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/BaiJWX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BaiJWX21
Yu Bai, Chi Jin, Huan Wang, Caiming Xiong:
Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games. NeurIPS 2021: 25799-25811
[c9]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/XieJWXB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/XieJWXB21
Tengyang Xie, Nan Jiang, Huan Wang, Caiming Xiong, Yu Bai:
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning. NeurIPS 2021: 27395-27407
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-01748
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-01748
Ming Yin, Yu Bai, Yu-Xiang Wang:
Near-Optimal Offline Reinforcement Learning via Double Variance Reduction. CoRR abs/2102.01748 (2021)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-07856
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-07856
Yu Bai, Song Mei, Huan Wang, Caiming Xiong:
Don't Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification. CoRR abs/2102.07856 (2021)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-10809
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-10809
Rachel Luo, Aadyot Bhatnagar, Huan Wang, Caiming Xiong, Silvio Savarese, Yu Bai, Shengjia Zhao, Stefano Ermon:
Localized Calibration: Metrics and Recalibration. CoRR abs/2102.10809 (2021)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-11494
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-11494
Yu Bai, Chi Jin, Huan Wang, Caiming Xiong:
Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games. CoRR abs/2102.11494 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2103-04554
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-04554
Zitong Yang, Yu Bai, Song Mei:
Exact Gap between Generalization Error and Uniform Convergence in Random Feature Models. CoRR abs/2103.04554 (2021)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-04895
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-04895
Tengyang Xie, Nan Jiang, Huan Wang, Caiming Xiong, Yu Bai:
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning. CoRR abs/2106.04895 (2021)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-05515
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-05515
Yu Bai, Song Mei, Huan Wang, Caiming Xiong:
Understanding the Under-Coverage Bias in Uncertainty Estimation. CoRR abs/2106.05515 (2021)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-04184
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04184
Ziang Song, Song Mei, Yu Bai:
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently? CoRR abs/2110.04184 (2021)
2020
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/BaiL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BaiL20
Yu Bai, Jason D. Lee:
Beyond Linearization: On Quadratic and Higher-Order Approximation of Wide Neural Networks. ICLR 2020
[c7]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/BaiJ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BaiJ20
Yu Bai, Chi Jin:
Provable Self-Play Algorithms for Competitive Reinforcement Learning. ICML 2020: 551-560
[c6]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/BaiJY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BaiJY20
Yu Bai, Chi Jin, Tiancheng Yu:
Near-Optimal Reinforcement Learning with Self-Play. NeurIPS 2020
[c5]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/ChenBLZWXS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenBLZWXS20
Minshuo Chen, Yu Bai, Jason D. Lee, Tuo Zhao, Huan Wang, Caiming Xiong, Richard Socher:
Towards Understanding Hierarchical Learning: Benefits of Neural Representations. NeurIPS 2020
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-04010
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-04010
Yu Bai, Ben Krause, Huan Wang, Caiming Xiong, Richard Socher:
Taylorized Training: Towards Better Approximation of Neural Network Training at Finite Width. CoRR abs/2002.04010 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-04017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-04017
Yu Bai, Chi Jin:
Provable Self-Play Algorithms for Competitive Reinforcement Learning. CoRR abs/2002.04017 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2006-12007
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-12007
Yu Bai, Chi Jin, Tiancheng Yu:
Near-Optimal Reinforcement Learning with Self-Play. CoRR abs/2006.12007 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2006-13436
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-13436
Minshuo Chen, Yu Bai, Jason D. Lee, Tuo Zhao, Huan Wang, Caiming Xiong, Richard Socher:
Towards Understanding Hierarchical Learning: Benefits of Neural Representations. CoRR abs/2006.13436 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2007-03760
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-03760
Ming Yin, Yu Bai, Yu-Xiang Wang:
Near Optimal Provable Uniform Convergence in Off-Policy Evaluation for Reinforcement Learning. CoRR abs/2007.03760 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-01604
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-01604
Qinghua Liu, Tiancheng Yu, Yu Bai, Chi Jin:
A Sharp Analysis of Model-based Reinforcement Learning with Self-Play. CoRR abs/2010.01604 (2020)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-05843
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-05843
Yu Bai, Minshuo Chen, Pan Zhou, Tuo Zhao, Jason D. Lee, Sham M. Kakade, Huan Wang, Caiming Xiong:
How Important is the Train-Validation Split in Meta-Learning? CoRR abs/2010.05843 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c4]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/BaiJS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BaiJS19
Yu Bai, Qijia Jiang, Ju Sun:
Subgradient Descent Learns Orthogonal Dictionaries. ICLR (Poster) 2019
[c3]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/BaiMR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BaiMR19
Yu Bai, Tengyu Ma, Andrej Risteski:
Approximability of Discriminators Implies Diversity in GANs. ICLR (Poster) 2019
[c2]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/BaiWL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BaiWL19
Yu Bai, Yu-Xiang Wang, Edo Liberty:
ProxQuant: Quantized Neural Networks via Proximal Operators. ICLR (Poster) 2019
[c1]
- view
- export record
  dblp key:
  - conf/nips/BaiXJW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BaiXJW19
Yu Bai, Tengyang Xie, Nan Jiang, Yu-Xiang Wang:
Provably Efficient Q-Learning with Low Switching Cost. NeurIPS 2019: 8002-8011
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1903-00184
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-00184
Yu Bai, John C. Duchi, Song Mei:
Proximal algorithms for constrained composite optimization, with applications to solving low-rank SDPs. CoRR abs/1903.00184 (2019)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1905-12849
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-12849
Yu Bai, Tengyang Xie, Nan Jiang, Yu-Xiang Wang:
Provably Efficient Q-Learning with Low Switching Cost. CoRR abs/1905.12849 (2019)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-01619
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-01619
Yu Bai, Jason D. Lee:
Beyond Linearization: On Quadratic and Higher-Order Approximation of Wide Neural Networks. CoRR abs/1910.01619 (2019)
2018
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1806-10586
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-10586
Yu Bai, Tengyu Ma, Andrej Risteski:
Approximability of Discriminators Implies Diversity in GANs. CoRR abs/1806.10586 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1810-00861
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-00861
Yu Bai, Yu-Xiang Wang, Edo Liberty:
ProxQuant: Quantized Neural Networks via Proximal Operators. CoRR abs/1810.00861 (2018)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1810-10702
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-10702
Yu Bai, Qijia Jiang, Ju Sun:
Subgradient Descent Learns Orthogonal Dictionaries. CoRR abs/1810.10702 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.