Sherry Yang 0001
Person information
- unicode name: 杨梦娇
- affiliation: Google DeepMind
- affiliation (PhD): University of California, Berkeley, CA, USA
Other persons with the same name
- Sherry Yang — disambiguation page
- Sherry Yang 0002 — Oregon Institute of Technology, Klamath Falls, OR, USA (and 1 more)
- Sherry Yang 0003 — Synopsys Inc., Mountain View, CA, USA
- Mengjiao Yang (aka: Meng-Jiao Yang) — disambiguation page
- Mengjiao Yang 0002 — Chinese Academy of Sciences, Institute of Mountain Hazards and Environment, Chengdu, China (and 1 more)
- Mengjiao Yang 0003 — Qufu Normal University, School of Computer Science, Rizhao, China
2020 – today
- 2024
- [c25] Yilun Du, Sherry Yang, Pete Florence, Fei Xia, Ayzaan Wahid, Brian Ichter, Pierre Sermanet, Tianhe Yu, Pieter Abbeel, Joshua B. Tenenbaum, Leslie Pack Kaelbling, Andy Zeng, Jonathan Tompson: Video Language Planning. ICLR 2024
- [c24] Sherry Yang, KwangHwan Cho, Amil Merchant, Pieter Abbeel, Dale Schuurmans, Igor Mordatch, Ekin Dogus Cubuk: Scalable Diffusion for Materials Generation. ICLR 2024
- [c23] Sherry Yang, Yilun Du, Bo Dai, Dale Schuurmans, Joshua B. Tenenbaum, Pieter Abbeel: Probabilistic Adaptation of Black-Box Text-to-Video Models. ICLR 2024
- [c22] Sherry Yang, Yilun Du, Seyed Kamyar Seyed Ghasemipour, Jonathan Tompson, Leslie Pack Kaelbling, Dale Schuurmans, Pieter Abbeel: Learning Interactive Real-World Simulators. ICLR 2024
- [c21] David Venuto, Mohammad Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand: Code as Reward: Empowering Reinforcement Learning with VLMs. ICML 2024
- [c20] Sherry Yang, Jacob C. Walker, Jack Parker-Holder, Yilun Du, Jake Bruce, André Barreto, Pieter Abbeel, Dale Schuurmans: Position: Video as the New Language for Real-World Decision Making. ICML 2024
- [i29] David Venuto, Mohammad Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand: Code as Reward: Empowering Reinforcement Learning with VLMs. CoRR abs/2402.04764 (2024)
- [i28] Sherry Yang, Jacob C. Walker, Jack Parker-Holder, Yilun Du, Jake Bruce, André Barreto, Pieter Abbeel, Dale Schuurmans: Video as the New Language for Real-World Decision Making. CoRR abs/2402.17139 (2024)
- [i27] Shicong Cen, Jincheng Mei, Katayoon Goshvadi, Hanjun Dai, Tong Yang, Sherry Yang, Dale Schuurmans, Yuejie Chi, Bo Dai: Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF. CoRR abs/2405.19320 (2024)
- [i26] Hanjun Dai, Bethany Wang, Xingchen Wan, Bo Dai, Sherry Yang, Azade Nova, Pengcheng Yin, Phitchaya Mangpo Phothilimthana, Charles Sutton, Dale Schuurmans: UQE: A Query Engine for Unstructured Databases. CoRR abs/2407.09522 (2024)
- [i25] Sherry Yang, Simon L. Batzner, Ruiqi Gao, Muratahan Aykol, Alexander L. Gaunt, Brendan McMorrow, Danilo J. Rezende, Dale Schuurmans, Igor Mordatch, Ekin D. Cubuk: Generative Hierarchical Materials Search. CoRR abs/2409.06762 (2024)
- 2023
- [c19] Charlie Snell, Ilya Kostrikov, Yi Su, Sherry Yang, Sergey Levine: Offline RL for Natural Language Generation with Implicit Language Q Learning. ICLR 2023
- [c18] Sherry Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum: Dichotomy of Control: Separating What You Can Control from What You Cannot. ICLR 2023
- [c17] David Venuto, Sherry Yang, Pieter Abbeel, Doina Precup, Igor Mordatch, Ofir Nachum: Multi-Environment Pretraining Enables Transfer to Action Limited Datasets. ICML 2023: 35024-35036
- [c16] Yilun Du, Sherry Yang, Bo Dai, Hanjun Dai, Ofir Nachum, Josh Tenenbaum, Dale Schuurmans, Pieter Abbeel: Learning Universal Policies via Text-Guided Video Generation. NeurIPS 2023
- [i24] Yilun Du, Mengjiao Yang, Bo Dai, Hanjun Dai, Ofir Nachum, Joshua B. Tenenbaum, Dale Schuurmans, Pieter Abbeel: Learning Universal Policies via Text-Guided Video Generation. CoRR abs/2302.00111 (2023)
- [i23] Sherry Yang, Ofir Nachum, Yilun Du, Jason Wei, Pieter Abbeel, Dale Schuurmans: Foundation Models for Decision Making: Problems, Methods, and Opportunities. CoRR abs/2303.04129 (2023)
- [i22] Mengjiao Yang, Yilun Du, Bo Dai, Dale Schuurmans, Joshua B. Tenenbaum, Pieter Abbeel: Probabilistic Adaptation of Text-to-Video Models. CoRR abs/2306.01872 (2023)
- [i21] Mengjiao Yang, Yilun Du, Kamyar Ghasemipour, Jonathan Tompson, Dale Schuurmans, Pieter Abbeel: Learning Interactive Real-World Simulators. CoRR abs/2310.06114 (2023)
- [i20] Yilun Du, Mengjiao Yang, Pete Florence, Fei Xia, Ayzaan Wahid, Brian Ichter, Pierre Sermanet, Tianhe Yu, Pieter Abbeel, Joshua B. Tenenbaum, Leslie Pack Kaelbling, Andy Zeng, Jonathan Tompson: Video Language Planning. CoRR abs/2310.10625 (2023)
- [i19] Mengjiao Yang, KwangHwan Cho, Amil Merchant, Pieter Abbeel, Dale Schuurmans, Igor Mordatch, Ekin Dogus Cubuk: Scalable Diffusion for Materials Generation. CoRR abs/2311.09235 (2023)
- 2022
- [c15] Mengjiao Yang, Bo Dai, Ofir Nachum, George Tucker, Dale Schuurmans: Offline Policy Selection under Uncertainty. AISTATS 2022: 4376-4396
- [c14] Mengjiao Yang, Sergey Levine, Ofir Nachum: TRAIL: Near-Optimal Imitation Learning with Suboptimal Data. ICLR 2022
- [c13] Hanjun Dai, Mengjiao Yang, Yuan Xue, Dale Schuurmans, Bo Dai: Marginal Distribution Adaptation for Discrete Sets via Module-Oriented Divergence Minimization. ICML 2022: 4605-4617
- [c12] Tianjun Zhang, Tongzheng Ren, Mengjiao Yang, Joseph Gonzalez, Dale Schuurmans, Bo Dai: Making Linear MDPs Practical via Contrastive Representation Learning. ICML 2022: 26447-26466
- [c11] Charlie Snell, Sherry Yang, Justin Fu, Yi Su, Sergey Levine: Context-Aware Language Modeling for Goal-Oriented Dialogue Systems. NAACL-HLT (Findings) 2022: 2351-2366
- [c10] Siddharth Verma, Justin Fu, Sherry Yang, Sergey Levine: CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning. NAACL-HLT 2022: 4471-4491
- [c9] Kuang-Huei Lee, Ofir Nachum, Mengjiao Yang, Lisa Lee, Daniel Freeman, Sergio Guadarrama, Ian Fischer, Winnie Xu, Eric Jang, Henryk Michalewski, Igor Mordatch: Multi-Game Decision Transformers. NeurIPS 2022
- [c8] Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum: Chain of Thought Imitation with Procedure Cloning. NeurIPS 2022
- [i18] Siddharth Verma, Justin Fu, Mengjiao Yang, Sergey Levine: CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning. CoRR abs/2204.08426 (2022)
- [i17] Charlie Snell, Mengjiao Yang, Justin Fu, Yi Su, Sergey Levine: Context-Aware Language Modeling for Goal-Oriented Dialogue Systems. CoRR abs/2204.10198 (2022)
- [i16] Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum: Chain of Thought Imitation with Procedure Cloning. CoRR abs/2205.10816 (2022)
- [i15] Kuang-Huei Lee, Ofir Nachum, Mengjiao Yang, Lisa Lee, Daniel Freeman, Winnie Xu, Sergio Guadarrama, Ian Fischer, Eric Jang, Henryk Michalewski, Igor Mordatch: Multi-Game Decision Transformers. CoRR abs/2205.15241 (2022)
- [i14] Charlie Snell, Ilya Kostrikov, Yi Su, Mengjiao Yang, Sergey Levine: Offline RL for Natural Language Generation with Implicit Language Q Learning. CoRR abs/2206.11871 (2022)
- [i13] Tianjun Zhang, Tongzheng Ren, Mengjiao Yang, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai: Making Linear MDPs Practical via Contrastive Representation Learning. CoRR abs/2207.07150 (2022)
- [i12] Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum: Dichotomy of Control: Separating What You Can Control from What You Cannot. CoRR abs/2210.13435 (2022)
- [i11] David Venuto, Sherry Yang, Pieter Abbeel, Doina Precup, Igor Mordatch, Ofir Nachum: Multi-Environment Pretraining Enables Transfer to Action Limited Datasets. CoRR abs/2211.13337 (2022)
- 2021
- [c7] Haoming Jiang, Bo Dai, Mengjiao Yang, Tuo Zhao, Wei Wei: Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach. EMNLP (1) 2021: 7419-7451
- [c6] Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Mengjiao Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Le Paine: Benchmarks for Deep Off-Policy Evaluation. ICLR 2021
- [c5] Mengjiao Yang, Ofir Nachum: Representation Matters: Offline Pretraining for Sequential Decision Making. ICML 2021: 11784-11794
- [c4] Hongyu Ren, Hanjun Dai, Zihang Dai, Mengjiao Yang, Jure Leskovec, Dale Schuurmans, Bo Dai: Combiner: Full Attention Transformer with Sparse Computation Cost. NeurIPS 2021: 22470-22482
- [c3] Ofir Nachum, Mengjiao Yang: Provable Representation Learning for Imitation with Contrastive Fourier Features. NeurIPS 2021: 30100-30112
- [i10] Mengjiao Yang, Ofir Nachum: Representation Matters: Offline Pretraining for Sequential Decision Making. CoRR abs/2102.05815 (2021)
- [i9] Haoming Jiang, Bo Dai, Mengjiao Yang, Wei Wei, Tuo Zhao: Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach. CoRR abs/2102.10242 (2021)
- [i8] Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Mengjiao Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Le Paine: Benchmarks for Deep Off-Policy Evaluation. CoRR abs/2103.16596 (2021)
- [i7] Ofir Nachum, Mengjiao Yang: Provable Representation Learning for Imitation with Contrastive Fourier Features. CoRR abs/2105.12272 (2021)
- [i6] Hongyu Ren, Hanjun Dai, Zihang Dai, Mengjiao Yang, Jure Leskovec, Dale Schuurmans, Bo Dai: Combiner: Full Attention Transformer with Sparse Computation Cost. CoRR abs/2107.05768 (2021)
- [i5] Mengjiao Yang, Sergey Levine, Ofir Nachum: TRAIL: Near-Optimal Imitation Learning with Suboptimal Data. CoRR abs/2110.14770 (2021)
- 2020
- [c2] Mengjiao Yang, Bo Dai, Hanjun Dai, Dale Schuurmans: Energy-Based Processes for Exchangeable Data. ICML 2020: 10681-10692
- [c1] Mengjiao Yang, Ofir Nachum, Bo Dai, Lihong Li, Dale Schuurmans: Off-Policy Evaluation via the Regularized Lagrangian. NeurIPS 2020
- [i4] Mengjiao Yang, Bo Dai, Hanjun Dai, Dale Schuurmans: Energy-Based Processes for Exchangeable Data. CoRR abs/2003.07521 (2020)
- [i3] Mengjiao Yang, Ofir Nachum, Bo Dai, Lihong Li, Dale Schuurmans: Off-Policy Evaluation via the Regularized Lagrangian. CoRR abs/2007.03438 (2020)
- [i2] Mengjiao Yang, Bo Dai, Ofir Nachum, George Tucker, Dale Schuurmans: Offline Policy Selection under Uncertainty. CoRR abs/2012.06919 (2020)
2010 – 2019
- 2018
- [j1] Yunming Zhang, Mengjiao Yang, Riyadh Baghdadi, Shoaib Kamil, Julian Shun, Saman P. Amarasinghe: GraphIt: a high-performance graph DSL. Proc. ACM Program. Lang. 2(OOPSLA): 121:1-121:30 (2018)
- [i1] Yunming Zhang, Mengjiao Yang, Riyadh Baghdadi, Shoaib Kamil, Julian Shun, Saman P. Amarasinghe: GraphIt - A High-Performance DSL for Graph Analytics. CoRR abs/1805.00923 (2018)
last updated on 2024-11-13 23:51 CET by the dblp team
all metadata released as open data under CC0 1.0 license