Xianyuan Zhan
Person information
- affiliation: Tsinghua University, Beijing, China
2020 – today
- 2024
- [j9] Yu Luo, Fuchun Sun, Tianying Ji, Xianyuan Zhan: Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies. RLJ 2: 733-762 (2024)
- [j8] Xiaoyun Feng, Li Jiang, Xudong Yu, Haoran Xu, Xiaoyan Sun, Jie Wang, Xianyuan Zhan, Wai Kin Chan: Curriculum Goal-Conditioned Imitation for Offline Reinforcement Learning. IEEE Trans. Games 16(1): 102-112 (2024)
- [c29] Huiling Qin, Xianyuan Zhan, Yuanxun Li, Yu Zheng: FlexSSL : A Generic and Efficient Framework for Semi-Supervised Learning. ECAI 2024: 1455-1462
- [c28] Xiao Hu, Jianxiong Li, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang: Query-Policy Misalignment in Preference-Based Reinforcement Learning. ICLR 2024
- [c27] Liyuan Mao, Haoran Xu, Weinan Zhang, Xianyuan Zhan: ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update. ICLR 2024
- [c26] Guan Wang, Sijie Cheng, Xianyuan Zhan, Xiangang Li, Sen Song, Yang Liu: OpenChat: Advancing Open-source Language Models with Mixed-Quality Data. ICLR 2024
- [c25] Yinan Zheng, Jianxiong Li, Dongjie Yu, Yujie Yang, Shengbo Eben Li, Xianyuan Zhan, Jingjing Liu: Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model. ICLR 2024
- [c24] Tianying Ji, Yu Luo, Fuchun Sun, Xianyuan Zhan, Jianwei Zhang, Huazhe Xu: Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic. ICML 2024
- [c23] Jianxiong Li, Jinliang Zheng, Yinan Zheng, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan: DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning. ICML 2024
- [c22] Yu Luo, Tianying Ji, Fuchun Sun, Jianwei Zhang, Huazhe Xu, Xianyuan Zhan: OMPO: A Unified Framework for RL under Policy and Dynamics Shifts. ICML 2024
- [c21] Yu Luo, Tianying Ji, Fuchun Sun, Jianwei Zhang, Huazhe Xu, Xianyuan Zhan: Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL. ICML 2024
- [c20] Hanfei Geng, Yi Sun, Yuanzhe Li, Jichao Leng, Xiangyu Zhu, Xianyuan Zhan, Yuanchun Li, Feng Zhao, Yunxin Liu: TESLA: Thermally Safe, Load-Aware, and Energy-Efficient Cooling Control System for Data Centers. ICPP 2024: 939-949
- [c19] Haoyi Niu, Jianming Hu, Guyue Zhou, Xianyuan Zhan: A Comprehensive Survey of Cross-Domain Policy Transfer for Embodied Agents. IJCAI 2024: 8197-8206
- [i38] Yinan Zheng, Jianxiong Li, Dongjie Yu, Yujie Yang, Shengbo Eben Li, Xianyuan Zhan, Jingjing Liu: Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model. CoRR abs/2401.10700 (2024)
- [i37] Liyuan Mao, Haoran Xu, Weinan Zhang, Xianyuan Zhan: ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update. CoRR abs/2402.00348 (2024)
- [i36] Haoyi Niu, Jianming Hu, Guyue Zhou, Xianyuan Zhan: A Comprehensive Survey of Cross-Domain Policy Transfer for Embodied Agents. CoRR abs/2402.04580 (2024)
- [i35] Jianxiong Li, Jinliang Zheng, Yinan Zheng, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan: DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning. CoRR abs/2402.18137 (2024)
- [i34] Wenjun Zou, Yao Lyu, Jie Li, Yujie Yang, Shengbo Eben Li, Jingliang Duan, Xianyuan Zhan, Jingjing Liu, Yaqin Zhang, Keqiang Li: Policy Bifurcation in Safe Reinforcement Learning. CoRR abs/2403.12847 (2024)
- [i33] Yu Luo, Tianying Ji, Fuchun Sun, Jianwei Zhang, Huazhe Xu, Xianyuan Zhan: Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL. CoRR abs/2405.18520 (2024)
- [i32] Yu Luo, Tianying Ji, Fuchun Sun, Jianwei Zhang, Huazhe Xu, Xianyuan Zhan: OMPO: A Unified Framework for RL under Policy and Dynamics Shifts. CoRR abs/2405.19080 (2024)
- [i31] Jinliang Zheng, Jianxiong Li, Sijie Cheng, Yinan Zheng, Jiaming Li, Jihao Liu, Yu Liu, Jingjing Liu, Xianyuan Zhan: Instruction-Guided Visual Masking. CoRR abs/2405.19783 (2024)
- [i30] Yu Luo, Fuchun Sun, Tianying Ji, Xianyuan Zhan: Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies. CoRR abs/2406.18053 (2024)
- [i29] Liyuan Mao, Haoran Xu, Weinan Zhang, Xianyuan Zhan, Amy Zhang: Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning. CoRR abs/2407.20109 (2024)
- [i28] Haoyi Niu, Qimao Chen, Tenglong Liu, Jianxiong Li, Guyue Zhou, Yi Zhang, Jianming Hu, Xianyuan Zhan: xTED: Cross-Domain Policy Adaptation via Diffusion-Based Trajectory Editing. CoRR abs/2409.08687 (2024)
- [i27] Jianxiong Li, Zhihao Wang, Jinliang Zheng, Xiaoai Zhou, Guanming Wang, Guanglu Song, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Junzhi Yu, Xianyuan Zhan: Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning. CoRR abs/2410.01529 (2024)
- 2023
- [j7] Huiling Qin, Xianyuan Zhan, Yu Zheng: CSCAD: Correlation Structure-Based Collective Anomaly Detection in Complex System. IEEE Trans. Knowl. Data Eng. 35(5): 4634-4645 (2023)
- [j6] Shuhan Liu, Di Weng, Yuan Tian, Zikun Deng, Haoran Xu, Xiangyu Zhu, Honglei Yin, Xianyuan Zhan, Yingcai Wu: ECoalVis: Visual Analysis of Control Strategies in Coal-fired Power Plants. IEEE Trans. Vis. Comput. Graph. 29(1): 1091-1101 (2023)
- [c18] Xiangsen Wang, Xianyuan Zhan: Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization. AAMAS 2023: 2781-2783
- [c17] Li Jiang, Xiangsen Wang, Aidong Yang, Xidong Wang, Xiaojia Jin, Wei Wang, Xiaozhou Ye, Ye Ouyang, Xianyuan Zhan: An Efficient Multi-Agent Optimization Approach for Coordinated Massive MIMO Beamforming. ICC 2023: 5632-5638
- [c16] Jianxiong Li, Xiao Hu, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang: Mind the Gap: Offline Policy Optimization for Imperfect Rewards. ICLR 2023
- [c15] Jianxiong Li, Xianyuan Zhan, Haoran Xu, Xiangyu Zhu, Jingjing Liu, Ya-Qin Zhang: When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning. ICLR 2023
- [c14] Haoran Xu, Li Jiang, Jianxiong Li, Zhuoran Yang, Zhaoran Wang, Wai Kin Victor Chan, Xianyuan Zhan: Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization. ICLR 2023
- [c13] Peng Cheng, Xianyuan Zhan, Zhi-Hao Wu, Wenjia Zhang, Youfang Lin, Shoucheng Song, Han Wang, Li Jiang: Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL. NeurIPS 2023
- [c12] Xiangsen Wang, Haoran Xu, Yinan Zheng, Xianyuan Zhan: Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization. NeurIPS 2023
- [i26] Jianxiong Li, Xiao Hu, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang: Mind the Gap: Offline Policy Optimization for Imperfect Rewards. CoRR abs/2302.01667 (2023)
- [i25] Haoran Xu, Li Jiang, Jianxiong Li, Zhuoran Yang, Zhaoran Wang, Wai Kin Victor Chan, Xianyuan Zhan: Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization. CoRR abs/2303.15810 (2023)
- [i24] Fang Wu, Huiling Qin, Siyuan Li, Stan Z. Li, Xianyuan Zhan, Jinbo Xu: InstructBio: A Large-scale Semi-supervised Learning Paradigm for Biochemical Problems. CoRR abs/2304.03906 (2023)
- [i23] Jianxiong Li, Xiao Hu, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Ya-Qin Zhang: PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning. CoRR abs/2305.15669 (2023)
- [i22] Xiao Hu, Jianxiong Li, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang: Query-Policy Misalignment in Preference-Based Reinforcement Learning. CoRR abs/2305.17400 (2023)
- [i21] Tianying Ji, Yu Luo, Fuchun Sun, Xianyuan Zhan, Jianwei Zhang, Huazhe Xu: Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic. CoRR abs/2306.02865 (2023)
- [i20] Peng Cheng, Xianyuan Zhan, Zhihao Wu, Wenjia Zhang, Shoucheng Song, Han Wang, Youfang Lin, Li Jiang: Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL. CoRR abs/2306.04220 (2023)
- [i19] Xiangsen Wang, Xianyuan Zhan: Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization. CoRR abs/2306.08900 (2023)
- [i18] Xiangsen Wang, Haoran Xu, Yinan Zheng, Xianyuan Zhan: Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization. CoRR abs/2307.11620 (2023)
- [i17] Guan Wang, Sijie Cheng, Xianyuan Zhan, Xiangang Li, Sen Song, Yang Liu: OpenChat: Advancing Open-source Language Models with Mixed-Quality Data. CoRR abs/2309.11235 (2023)
- [i16] Haoyi Niu, Tianying Ji, Bingqi Liu, Haocheng Zhao, Xiangyu Zhu, Jianying Zheng, Pengfei Huang, Guyue Zhou, Jianming Hu, Xianyuan Zhan: H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps. CoRR abs/2309.12716 (2023)
- [i15] Jianxiong Li, Shichao Lin, Tianyu Shi, Chujie Tian, Yu Mei, Jian Song, Xianyuan Zhan, Ruimin Li: A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning. CoRR abs/2311.15920 (2023)
- [i14] Huiling Qin, Xianyuan Zhan, Yuanxun Li, Yu Zheng: FlexSSL : A Generic and Efficient Framework for Semi-Supervised Learning. CoRR abs/2312.16892 (2023)
- 2022
- [c11] Xianyuan Zhan, Haoran Xu, Yue Zhang, Xiangyu Zhu, Honglei Yin, Yu Zheng: DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning. AAAI 2022: 4680-4688
- [c10] Haoran Xu, Xianyuan Zhan, Xiangyu Zhu: Constraints Penalized Q-learning for Safe Offline Reinforcement Learning. AAAI 2022: 8753-8760
- [c9] Wenjia Zhang, Haoran Xu, Haoyi Niu, Peng Cheng, Ming Li, Heming Zhang, Guyue Zhou, Xianyuan Zhan: Discriminator-Guided Model-Based Offline Imitation Learning. CoRL 2022: 1266-1276
- [c8] Qiying Yu, Jieming Lou, Xianyuan Zhan, Qizhang Li, Wangmeng Zuo, Yang Liu, Jingjing Liu: Adversarial Contrastive Learning via Asymmetric InfoNCE. ECCV (5) 2022: 53-69
- [c7] Haoran Xu, Xianyuan Zhan, Honglei Yin, Huiling Qin: Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations. ICML 2022: 24725-24742
- [c6] Xianyuan Zhan, Xiangyu Zhu, Haoran Xu: Model-Based Offline Planning with Trajectory Pruning. IJCAI 2022: 3716-3722
- [c5] Haoyi Niu, Shubham Sharma, Yiwen Qiu, Ming Li, Guyue Zhou, Jianming Hu, Xianyuan Zhan: When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning. NeurIPS 2022
- [c4] Haoran Xu, Li Jiang, Jianxiong Li, Xianyuan Zhan: A Policy-Guided Imitation Approach for Offline Reinforcement Learning. NeurIPS 2022
- [i13] Jianxiong Li, Xianyuan Zhan, Haoran Xu, Xiangyu Zhu, Jingjing Liu, Ya-Qin Zhang: Distance-Sensitive Offline Reinforcement Learning. CoRR abs/2205.11027 (2022)
- [i12] Haoyi Niu, Shubham Sharma, Yiwen Qiu, Ming Li, Guyue Zhou, Jianming Hu, Xianyuan Zhan: When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning. CoRR abs/2206.13464 (2022)
- [i11] Wenjia Zhang, Haoran Xu, Haoyi Niu, Peng Cheng, Ming Li, Heming Zhang, Guyue Zhou, Xianyuan Zhan: Discriminator-Guided Model-Based Offline Imitation Learning. CoRR abs/2207.00244 (2022)
- [i10] Qiying Yu, Jieming Lou, Xianyuan Zhan, Qizhang Li, Wangmeng Zuo, Yang Liu, Jingjing Liu: Adversarial Contrastive Learning via Asymmetric InfoNCE. CoRR abs/2207.08374 (2022)
- [i9] Haoran Xu, Xianyuan Zhan, Honglei Yin, Huiling Qin: Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations. CoRR abs/2207.10050 (2022)
- [i8] Haoran Xu, Li Jiang, Jianxiong Li, Xianyuan Zhan: A Policy-Guided Imitation Approach for Offline Reinforcement Learning. CoRR abs/2210.08323 (2022)
- 2021
- [c3] Huiling Qin, Songyu Ke, Xiaodu Yang, Haoran Xu, Xianyuan Zhan, Yu Zheng: Robust Spatio-Temporal Purchase Prediction via Deep Meta Learning. AAAI 2021: 4312-4319
- [c2] Huiling Qin, Xianyuan Zhan, Yuanxun Li, Xiaodu Yang, Yu Zheng: Network-Wide Traffic States Imputation Using Self-interested Coalitional Learning. KDD 2021: 1370-1378
- [i7] Xianyuan Zhan, Haoran Xu, Yue Zhang, Yusen Huo, Xiangyu Zhu, Honglei Yin, Yu Zheng: DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning. CoRR abs/2102.11492 (2021)
- [i6] Xianyuan Zhan, Xiangyu Zhu, Haoran Xu: Model-Based Offline Planning with Trajectory Pruning. CoRR abs/2105.07351 (2021)
- [i5] Huiling Qin, Xianyuan Zhan, Yu Zheng: CSCAD: Correlation Structure-based Collective Anomaly Detection in Complex System. CoRR abs/2105.14476 (2021)
- [i4] Haoran Xu, Xianyuan Zhan, Xiangyu Zhu: Constraints Penalized Q-Learning for Safe Offline Reinforcement Learning. CoRR abs/2107.09003 (2021)
- [i3] Haoran Xu, Xianyuan Zhan, Jianxiong Li, Honglei Yin: Offline Reinforcement Learning with Soft Behavior Regularization. CoRR abs/2110.07395 (2021)
- [i2] Jin Li, Xianyuan Zhan, Zixu Xiao, Guyue Zhou: Efficient Robotic Manipulation Through Offline-to-Online Reinforcement Learning and Goal-Aware State Information. CoRR abs/2110.10905 (2021)
- [i1] Guan Wang, Haoyi Niu, Desheng Zhu, Jianming Hu, Xianyuan Zhan, Guyue Zhou: ModEL: A Modularized End-to-end Reinforcement Learning Framework for Autonomous Driving. CoRR abs/2110.11573 (2021)
2010 – 2019
- 2019
- [j5] Jonatan Zischg, Christopher Klinkhamer, Xianyuan Zhan, P. Suresh C. Rao, Robert Sitzenfrei: A Century of Topological Coevolution of Complex Infrastructure Networks in an Alpine City. Complex. 2019: 2096749:1-2096749:16 (2019)
- [j4] Hemant Gehlot, Xianyuan Zhan, Xinwu Qian, Christopher Thompson, Milind Kulkarni, Satish V. Ukkusuri: A-RESCUE 2.0: A High-Fidelity, Parallel, Agent-Based Evacuation Simulator. J. Comput. Civ. Eng. 33(2) (2019)
- 2017
- [j3] Xianyuan Zhan, Yu Zheng, Xiuwen Yi, Satish V. Ukkusuri: Citywide Traffic Volume Estimation Using Trajectory Data. IEEE Trans. Knowl. Data Eng. 29(2): 272-285 (2017)
- 2016
- [j2] Samiul Hasan, Satish V. Ukkusuri, Xianyuan Zhan: Understanding Social Influence in Activity Location Choice and Lifestyle Patterns Using Geolocation Data from Social Media. Frontiers ICT 3: 10 (2016)
- [j1] Xianyuan Zhan, Xinwu Qian, Satish V. Ukkusuri: A Graph-Based Approach to Measuring the Efficiency of an Urban Taxi Service System. IEEE Trans. Intell. Transp. Syst. 17(9): 2479-2489 (2016)
- 2013
- [c1] Samiul Hasan, Xianyuan Zhan, Satish V. Ukkusuri: Understanding urban human activity and mobility patterns using large-scale location-based data from online social media. UrbComp@KDD 2013: 6:1-6:8
last updated on 2025-01-15 21:45 CET by the dblp team
all metadata released as open data under CC0 1.0 license