Publications - Sherry Yang's Homepage

Publications

Please see my google scholar for an up-to-date list of publications. * denotes equal contribution

2025

Introducing Marin: An Open Lab for Building Foundation Models.

David Hall, Ahmed Ahmed, Christopher Chou, Abhinav Garg, Rohith Kuditipudi, Will Held, Nikil Ravi, Herumb Shandilya, Jason Wang Jason Bolton, Siddharth Karamcheti, Suhas Kotha, Tony Lee, Nelson Liu, Joel Niklaus, Ashwin Ramaswami, Kamyar Salahi, Kaiyue Wen, Chi Heem Wong, Sherry Yang, Ivan Zhou, Percy Liang.

Reinforcement Learning for Machine Learning Engineering Agents.

Sherry Yang, Joy He-Yueya, Percy Liang

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities . [website]

Gemini Team, 2025.

Evaluating Policies in a World Model. [website]

Julian Quevedo, Percy Liang, Sherry Yang.

arXiv preprint, 2025.

Object-centric 3D Motion Field for Robot Learning from Human Videos. [website]

Zhao-Heng Yin, Sherry Yang, Pieter Abbeel.

arXiv preprint, 2025.

MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering [code] [website]

Rushi Qiang, Yuchen Zhuang, Yinghao Li, Rongzhi Zhang, Changhao Li, Ian Shu-Hei Wong, Sherry Yang, Percy Liang, Chao Zhang, Bo Dai.

arXiv preprint, 2025.

System of Agentic AI for the Discovery of Metal-Organic Frameworks

Theo Jaffrelot Inizan, Sherry Yang, Aaron Kaplan, Yen-hsu Lin, Jian Yin, Saber Mirzaei, Mona Abdelgaid, Ali H Alawadhi, KwangHwan Cho, Zhiling Zheng, Ekin Dogus Cubuk, Christian Borgs, Jennifer T Chayes, Kristin A Persson, Omar M Yaghi.

arXiv preprint, 2025.

Generative Model for Enhancing Reticular Material Discovery

Theo Jaffrelot Inizan, Aaron Kaplan, Sherry Yang, Yen-hsu Lin, Mona Abdelgaid, Jian Yin, Zhiling Zheng, Saber Mirzaei, Ali H Alawadhi, Ekin Dogus Cubuk, Christian Borgs, Jennifer T Chayes, Kristin A Persson, Omar M Yaghi.

AI4X 2025. Oral.

2024

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF

Shicong Cen, Jincheng Mei, Katayoon Goshvadi, Hanjun Dai, Tong Yang, Sherry Yang, Dale Schuurmans, Yuejie Chi, Bo Dai.

International Conference on Learning Representations (ICLR), 2025.

Generative Hierarchical Materials Search [website]

Sherry Yang, Simon Batzner, Ruiqi Gao, Muratahan Aykol, Alexander L Gaunt, Brendan McMorrow, Danilo J Rezende, Dale Schuurmans, Igor Mordatch, Ekin D Cubuk.

Advances in Neural Information Processing Systems (NeurIPS), 2024.

VideoAgent: Self-improving Video Generation [website]

Achint Soni, Sreyas Venkataraman, Abhranil Chandra, Sebastian Fischmeister, Percy Liang, Bo Dai, Sherry Yang.

RLC Workshop on RL Beyond Reward, 2025. Spotlight.

Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback [website]

Hiroki Furuta, Heiga Zen, Dale Schuurmans, Aleksandra Faust, Yutaka Matsuo, Percy Liang, Sherry Yang.

arXiv preprint, 2024.

UQE: A Query Engine for Unstructured Databases

Hanjun Dai, Bethany Wang, Xingchen Wan, Bo Dai, Sherry Yang, Azade Nova, Pengcheng Yin, Mangpo Phothilimthana, Charles Sutton, Dale Schuurmans.

Advances in Neural Information Processing Systems (NeurIPS), 2024.

Video as the New Language for Real-World Decision Making

Sherry Yang, Jacob Walker, Jack Parker-Holder, Yilun Du, Jake Bruce, Andre Barreto, Pieter Abbeel, Dale Schuurmans.

International Conference on Machine Learning (ICML), 2024.

Code as Reward: Empowering Reinforcement Learning with VLMs

David Venuto, Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand.

International Conference on Machine Learning (ICML), 2024.

Learning Interactive Real-World Simulators [website]

Sherry Yang, Yilun Du, Kamyar Ghasemipour, Jonathan Tompson, Leslie Kaelbling, Dale Schuurmans, Pieter Abbeel.

International Conference on Learning Representations (ICLR), 2024. Outstanding Paper Award.

NeurIPS workshop on Instruction Tuning and Instruction Following, 2023. Best Paper.

Scalable Diffusion for Materials Generation [website]

Sherry Yang, KwangHwan Cho, Amil Merchant, Pieter Abbeel, Dale Schuurmans, Igor Mordatch, Ekin Dogus Cubuk.

International Conference on Learning Representations (ICLR), 2024.

NeurIPS workshop on AI4Mat, 2023. Spotlight.

Video Language Planning [website]

Yilun Du, Sherry Yang, Pete Florence, Fei Xia, Ayzaan Wahid, Brian Ichter, Pierre Sermanet, Tianhe Yu, Pieter Abbeel, Joshua B. Tenenbaum, Leslie Kaelbling, Andy Zeng, Jonathan Tompson.

International Conference on Learning Representations (ICLR), 2024.

Probabilistic Adaptation of Text-to-Video Models [website]

Sherry Yang*, Yilun Du*, Bo Dai, Dale Schuurmans, Joshua B. Tenenbaum, Pieter Abbeel.

International Conference on Learning Representations (ICLR), 2024.

2023

Foundation Models for Decision Making: Problems, Methods, and Opportunities [website]

Sherry Yang, Ofir Nachum, Yilun Du, Jason Wei, Pieter Abbeel, Dale Schuurmans.

Learning Universal Policies via Text-Guided Video Generation [website] [slides] [recording]

Yilun Du*, Sherry Yang*, Bo Dai, Hanjun Dai, Ofir Nachum, Joshua B. Tenenbaum, Dale Schuurmans, Pieter Abbeel.

Advances in Neural Information Processing Systems (NeurIPS), 2023. Spotlight.

Dichotomy of control: Separating what you can control from what you cannot [code] [slides] [recording]

Sherry Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum.

International Conference on Learning Representations (ICLR), 2023. Oral.

Multi-Environment Pretraining Enables Transfer to Action Limited Datasets

David Venuto*, Sherry Yang*, Pieter Abbeel, Doina Precup, Igor Mordatch, Ofir Nachum.

International Conference on Machine Learning (ICML), 2023.

Offline RL for Natural Language Generation with Implicit Language Q Learning [code] [website]

Charlie Snell, Ilya Kostrikov, Yi Su, Sherry Yang, Sergey Levine.

International Conference on Learning Representations (ICLR), 2023.

2022

Chain of Thought Imitation Learning with Procedure Cloning [code] [slides] [poster] [website]

Sherry Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum.

Advances in Neural Information Processing Systems (NeurIPS), 2022.

Multi-Game Decision Transformers [code] [blog]

Kuang-Huei Lee, Ofir Nachum, Sherry Yang, Lisa Lee, Daniel Freeman, Winnie Xu, Sergio Guadarrama, Ian Fischer, Eric Jang, Henryk Michalewski, Igor Mordatch.

Advances in Neural Information Processing Systems (NeurIPS), 2022. Oral.

Making linear mdps practical via contrastive representation learning

Tianjun Zhang, Tongzheng Ren, Sherry Yang, Joseph Gonzalez, Dale Schuurmans, Bo Dai.

International Conference on Machine Learning (ICML), 2022.

Marginal Distribution Adaptation for Discrete Sets via Module-Oriented Divergence Minimization

Hanjun Dai, Sherry Yang, Yuan Xue, Dale Schuurmans, Bo Dai.

International Conference on Machine Learning (ICML), 2022.

Offline Policy Selection under Uncertainty [code] [slides] [recording]

Sherry Yang*, Bo Dai*, Ofir Nachum*, George Tucker, Dale Schuurmans.

Artificial Intelligence and Statistics Conference (AISTATS), 2022.

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning [website]

Siddharth Verma, Justin Fu, Sherry Yang, Sergey Levine.

Findings of the Association for Computational Linguistics: NAACL, 2022. Oral.

Context-Aware Language Modeling for Goal-Oriented Dialogue Systems [website]

Charlie Snell, Sherry Yang, Justin Fu, Yi Su, Sergey Levine.

Findings of the Association for Computational Linguistics: NAACL, 2022

2021

Combiner: Full Attention Transformer with Sparse Computation Cost [code]

Hongyu Ren, Hanjun Dai, Zihang Dai, Sherry Yang, Jure Leskovec, Dale Schuurmans, Bo Dai.

Advances in Neural Information Processing Systems (NeurIPS), 2021.

Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach [code]

Haoming Jiang, Bo Dai, Sherry Yang, Tuo Zhao, Wei Wei.

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021. Oral.

Benchmarks for Deep Off-Policy Evaluation [code]

Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Sherry Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Thomas Paine.

International Conference on Learning Representations (ICLR), 2021.

TRAIL: Near-Optimal Imitation Learning with Suboptimal Data [code] [slides] [recording] [website]

Sherry Yang, Sergey Levine, Ofir Nachum.

International Conference on Learning Representations (ICLR), 2021.

Provable Representation Learning for Imitation with Contrastive Fourier Features [code]

Ofir Nachum, Sherry Yang.

Advances in Neural Information Processing Systems (NeurIPS), 2021.

Representation Matters: Offline Pretraining for Sequential Decision Making [code] [slides]

Sherry Yang, Ofir Nachum.

International Conference on Machine Learning (ICML), 2021.

2020

Off-Policy Evaluation via the Regularized Lagrangian [code] [slides]