


default search action
Yuda Song 0001
Person information
- affiliation: Carnegie Mellon University, PA, USA
Other persons with the same name
- Yuda Song 0002
— Zhejiang University, Hangzhou, China
- Yuda Song 0003 — Xiaomi Inc.
SPARQL queries 
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [i15]Zhaoyi Zhou, Yuda Song, Andrea Zanette:
Accelerating Unbiased LLM Evaluation via Synthetic Feedback. CoRR abs/2502.10563 (2025) - 2024
- [c13]Yifei Zhou, Ayush Sekhari, Yuda Song, Wen Sun:
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees. ICLR 2024 - [c12]Yuda Song, Drew Bagnell, Aarti Singh:
Hybrid Reinforcement Learning from Offline Observation Alone. ICML 2024 - [c11]Yuda Song, Lili Wu, Dylan J. Foster, Akshay Krishnamurthy:
Rich-Observation Reinforcement Learning with Continuous Latent Dynamics. ICML 2024 - [c10]Yuda Song, Gokul Swamy, Aarti Singh, J. Andrew Bagnell, Wen Sun:
The Importance of Online Data: Understanding Preference Fine-tuning via Coverage. NeurIPS 2024 - [i14]Yuda Song, Lili Wu, Dylan J. Foster, Akshay Krishnamurthy:
Rich-Observation Reinforcement Learning with Continuous Latent Dynamics. CoRR abs/2405.19269 (2024) - [i13]Yuda Song, Gokul Swamy, Aarti Singh, J. Andrew Bagnell, Wen Sun:
Understanding Preference Fine-Tuning Through the Lens of Coverage. CoRR abs/2406.01462 (2024) - [i12]Yuda Song, J. Andrew Bagnell, Aarti Singh:
Hybrid Reinforcement Learning from Offline Observation Alone. CoRR abs/2406.07253 (2024) - [i11]Yuda Song, Hanlin Zhang, Carson Eisenach, Sham M. Kakade, Dean P. Foster, Udaya Ghai:
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models. CoRR abs/2412.02674 (2024) - 2023
- [c9]Alekh Agarwal, Yuda Song, Wen Sun, Kaiwen Wang, Mengdi Wang, Xuezhou Zhang:
Provable Benefits of Representational Transfer in Reinforcement Learning. COLT 2023: 2114-2187 - [c8]Yuda Song, Yifei Zhou, Ayush Sekhari, Drew Bagnell, Akshay Krishnamurthy, Wen Sun:
Hybrid RL: Using both offline and online data can make RL efficient. ICLR 2023 - [c7]Chengzhuo Ni, Yuda Song, Xuezhou Zhang, Zihan Ding, Chi Jin, Mengdi Wang:
Representation Learning for Low-rank General-sum Markov Games. ICLR 2023 - [c6]Anirudh Vemula, Yuda Song, Aarti Singh, Drew Bagnell, Sanjiban Choudhury:
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms. ICML 2023: 34978-35005 - [i10]Anirudh Vemula, Yuda Song, Aarti Singh, J. Andrew Bagnell, Sanjiban Choudhury:
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms. CoRR abs/2303.00694 (2023) - [i9]Yifei Zhou, Ayush Sekhari, Yuda Song, Wen Sun:
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees. CoRR abs/2311.08384 (2023) - 2022
- [c5]Ye Yuan, Yuda Song
, Zhengyi Luo, Wen Sun, Kris M. Kitani:
Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design. ICLR 2022 - [c4]Xuezhou Zhang, Yuda Song, Masatoshi Uehara, Mengdi Wang, Alekh Agarwal, Wen Sun:
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach. ICML 2022: 26517-26547 - [c3]Yuda Song
, Ye Yuan, Wen Sun, Kris Kitani:
Online No-regret Model-Based Meta RL for Personalized Navigation. L4DC 2022: 166-179 - [i8]Xuezhou Zhang, Yuda Song, Masatoshi Uehara, Mengdi Wang, Alekh Agarwal, Wen Sun:
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach. CoRR abs/2202.00063 (2022) - [i7]Yuda Song, Ye Yuan, Wen Sun, Kris Kitani:
Online No-regret Model-Based Meta RL for Personalized Navigation. CoRR abs/2204.01925 (2022) - [i6]Alekh Agarwal, Yuda Song
, Wen Sun, Kaiwen Wang, Mengdi Wang
, Xuezhou Zhang:
Provable Benefits of Representational Transfer in Reinforcement Learning. CoRR abs/2205.14571 (2022) - [i5]Yuda Song, Yifei Zhou, Ayush Sekhari, J. Andrew Bagnell, Akshay Krishnamurthy, Wen Sun:
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient. CoRR abs/2210.06718 (2022) - [i4]Chengzhuo Ni, Yuda Song, Xuezhou Zhang, Chi Jin, Mengdi Wang
:
Representation Learning for General-sum Low-rank Markov Games. CoRR abs/2210.16976 (2022) - 2021
- [c2]Yuda Song, Wen Sun:
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration. ICML 2021: 9801-9811 - [i3]Yuda Song, Wen Sun:
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration. CoRR abs/2107.07410 (2021) - [i2]Ye Yuan, Yuda Song, Zhengyi Luo, Wen Sun, Kris Kitani:
Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design. CoRR abs/2110.03659 (2021) - 2020
- [c1]Yuda Song, Aditi Mavalankar, Wen Sun, Sicun Gao:
Provably Efficient Model-based Policy Adaptation. ICML 2020: 9088-9098 - [i1]Yuda Song, Aditi Mavalankar, Wen Sun, Sicun Gao:
Provably Efficient Model-based Policy Adaptation. CoRR abs/2006.08051 (2020)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-03-22 00:01 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint