Yi Wu 0013
Person information
- affiliation: Tsinghua University, Institute of Interdisciplinary Information Sciences (IIIS), Beijing, China
- affiliation (PhD 2019): University of California, Berkeley, CA, USA
- affiliation: Microsoft Research Asia
Other persons with the same name
- Yi Wu — disambiguation page
- Yi Wu 0001 — Nanjing University of Science and Technology, China (and 2 more)
- Yi Wu 0002 — Google (and 3 more)
- Yi Wu 0003 — National University of Defense Technology, Changsha
- Yi Wu 0004 — Ericsson China, Beijing, China (and 2 more)
- Yi Wu 0005 — Intel Corporation (and 1 more)
- Yi Wu 0006 — Agder University College, Norway (and 1 more)
- Yi Wu 0007 — Tianjin University, Department of Information Management and Management Science, China (and 1 more)
- Yi Wu 0008 — Xi'an Jiaotong University, State Key Lab of Electrical Insulation and Power Equipment, China
- Yi Wu 0009 — Ericsson Sweden, Lund (and 1 more)
- Yi Wu 0010 — Fujian Normal University, College of Photonic and Electronic Engineering, Fuzhou, China (and 1 more)
- Yi Wu 0011 — State Grid Shanghai Electrical Power Research Institute, China (and 1 more)
- Yi Wu 0012 — Zhejiang University, China
- Yi Wu 0014 — Third Military Medical University, Chongqing, China
- Yi Wu 0015 — Beijing Institute of Technology
- Yi Wu 0016 (aka: Yi Alice Wu) — Oracle (and 1 more)
- Yi Wu 0017 — Harbin Engineering University, College of Automation, China
- Yi Wu 0018 — University of Science and Technology of China, Big Data and Decision Lab, Hefei, China
- Yi Wu 0019 — University of Science and Technology of China, Key Lab of Computing and Communication Software, Hefei, China
- Yi Wu 0020 — University of Tennessee, Department of Electrical Engineering and Computer Science, Knoxville, TN, USA
- Yi Wu 0021 — Heilongjiang University, School of Data Science and Technology, Harbin, China
Other persons with a similar name
- Soon-Yi Wu
- Tianyi Wu (aka: Tian-Yi Wu) — disambiguation page
- Ting-Yi Wu
- Yi-Chiao Wu
- Yi-Chieh Wu
- Yi-Leh Wu
- Yi-Ta Wu
- Yi-fang Brook Wu (aka: Brook Wu, Yi-Fang Wu) — New Jersey Institute of Technology, Newark, USA
- Yiming Wu (aka: Yi-ming Wu, Yi-Ming Wu) — disambiguation page
- Yi Yuan-Wu (aka: Yuan-Wu Yi)
2020 – today
- 2024
- [j6] Shuzhen Li, Yuxin Chen, Xuesong Chen, Ruiyang Gao, Yupeng Zhang, Chao Yu, Yunfei Li, Ziyi Ye, Weijun Huang, Hongliang Yi, Yue Leng, Yi Wu:
SleepNetZero: Zero-Burden Zero-Shot Reliable Sleep Staging with Neural Networks Based on Ballistocardiograms. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 8(4): 185:1-185:25 (2024)
- [j5] Zikun Li, Jinjun Peng, Yixuan Mei, Sina Lin, Yi Wu, Oded Padon, Zhihao Jia:
Quarl: A Learning-Based Quantum Circuit Optimizer. Proc. ACM Program. Lang. 8(OOPSLA1): 555-582 (2024)
- [j4] Botian Xu, Feng Gao, Chao Yu, Ruize Zhang, Yi Wu, Yu Wang:
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control. IEEE Robotics Autom. Lett. 9(3): 2838-2844 (2024)
- [c59] Jiayu Chen, Zelai Xu, Yunfei Li, Chao Yu, Jiaming Song, Huazhong Yang, Fei Fang, Yu Wang, Yi Wu:
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning. AAAI 2024: 11320-11328
- [c58] Jijia Liu, Chao Yu, Jiaxuan Gao, Yuqing Xie, Qingmin Liao, Yi Wu, Yu Wang:
LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination. AAMAS 2024: 1219-1228
- [c57] Zhicheng Zhang, Yancheng Liang, Yi Wu, Fei Fang:
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure. AAMAS 2024: 2085-2093
- [c56] Yihuan Mao, Chengjie Wu, Xi Chen, Hao Hu, Ji Jiang, Tianze Zhou, Tangjie Lv, Changjie Fan, Zhipeng Hu, Yi Wu, Yujing Hu, Chongjie Zhang:
Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets. ICLR 2024
- [c55] Zhiyu Mei, Wei Fu, Jiaxuan Gao, Guangju Wang, Huanchen Zhang, Yi Wu:
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores. ICLR 2024
- [c54] Feng Gao, Liangzhi Shi, Shenao Zhang, Zhaoran Wang, Yi Wu:
Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable Simulations. ICML 2024
- [c53] Zelai Xu, Chao Yu, Fei Fang, Yu Wang, Yi Wu:
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game. ICML 2024
- [c52] Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu:
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study. ICML 2024
- [c51] Ying Yuan, Haichuan Che, Yuzhe Qin, Binghao Huang, Zhao-Heng Yin, Kang-Won Lee, Yi Wu, Soo-Chul Lim, Xiaolong Wang:
Robot Synesthesia: In-Hand Manipulation with Visuotactile Sensing. ICRA 2024: 6558-6565
- [c50] Shusheng Xu, Huaijie Wang, Yutao Ouyang, Jiaxuan Gao, Zhiyu Mei, Chao Yu, Yi Wu:
LAGOON: Language-Guided Motion Control. ICRA 2024: 9743-9750
- [c49] Zhi Su, Xiaoyu Huang, Daniel Felipe Ordoñez Apraez, Yunfei Li, Zhongyu Li, Qiayuan Liao, Giulio Turrisi, Massimiliano Pontil, Claudio Semini, Yi Wu, Koushil Sreenath:
Leveraging Symmetry in RL-based Legged Locomotion Control. IROS 2024: 6899-6906
- [i68] Zhi Su, Xiaoyu Huang, Daniel Felipe Ordoñez Apraez, Yunfei Li, Zhongyu Li, Qiayuan Liao, Giulio Turrisi, Massimiliano Pontil, Claudio Semini, Yi Wu, Koushil Sreenath:
Leveraging Symmetry in RL-based Legged Locomotion Control. CoRR abs/2403.17320 (2024)
- [i67] Yutao Ouyang, Jinhan Li, Yunfei Li, Zhongyu Li, Chao Yu, Koushil Sreenath, Yi Wu:
Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models. CoRR abs/2404.05291 (2024)
- [i66] Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu:
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study. CoRR abs/2404.10719 (2024)
- [i65] Zhicheng Zhang, Yancheng Liang, Yi Wu, Fei Fang:
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure. CoRR abs/2405.00902 (2024)
- [i64] Shu-Ang Yu, Chao Yu, Feng Gao, Yi Wu, Yu Wang:
FlightBench: A Comprehensive Benchmark of Spatial Planning Methods for Quadrotors. CoRR abs/2406.05687 (2024)
- [i63] Zhiyu Mei, Wei Fu, Kaiwei Li, Guangju Wang, Huanchen Zhang, Yi Wu:
ReaLHF: Optimized RLHF Training for Large Language Models through Parameter Reallocation. CoRR abs/2406.14088 (2024)
- [i62] Jiaxuan Gao, Shusheng Xu, Wenjie Ye, Weilin Liu, Chuyi He, Wei Fu, Zhiyu Mei, Guangju Wang, Yi Wu:
On Designing Effective RL Reward at Training Time for LLM Reasoning. CoRR abs/2410.15115 (2024)
- [i61] Chao Yu, Hong Lu, Jiaxuan Gao, Qixin Tan, Xinting Yang, Yu Wang, Yi Wu, Eugene Vinitsky:
Few-shot In-Context Preference Learning Using Large Language Models. CoRR abs/2410.17233 (2024)
- [i60] Yuqing Xie, Chao Yu, Hongzhi Zang, Feng Gao, Wenhao Tang, Jingyi Huang, Jiayu Chen, Botian Xu, Yi Wu, Yu Wang:
Multi-UAV Behavior-based Formation with Static and Dynamic Obstacles Avoidance via Reinforcement Learning. CoRR abs/2410.18495 (2024)
- [i59] Shuzhen Li, Yuxin Chen, Xuesong Chen, Ruiyang Gao, Yupeng Zhang, Chao Yu, Yunfei Li, Ziyi Ye, Weijun Huang, Hongliang Yi, Yue Leng, Yi Wu:
SleepNetZero: Zero-Burden Zero-Shot Reliable Sleep Staging With Neural Networks Based on Ballistocardiograms. CoRR abs/2410.22646 (2024)
- [i58] Jiayu Chen, Chao Yu, Yuqing Xie, Feng Gao, Yinuo Chen, Shu'ang Yu, Wenhao Tang, Shilong Ji, Mo Mu, Yi Wu, Huazhong Yang, Yu Wang:
What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study. CoRR abs/2412.11764 (2024)
- 2023
- [j3] Shusheng Xu, Yancheng Liang, Yunfei Li, Simon Shaolei Du, Yi Wu:
Beyond Information Gain: An Empirical Benchmark for Low-Switching-Cost Reinforcement Learning. Trans. Mach. Learn. Res. 2023 (2023)
- [j2] Runlong Zhou, Zelin He, Yuandong Tian, Yi Wu, Simon Shaolei Du:
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization. Trans. Mach. Learn. Res. 2023 (2023)
- [c48] Rui Zhao, Jinming Song, Yufeng Yuan, Haifeng Hu, Yang Gao, Yi Wu, Zhongqian Sun, Wei Yang:
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination. AAAI 2023: 6145-6153
- [c47] Kevin Du, Ian Gemp, Yi Wu, Yingying Wu:
AlphaSnake: Policy Iteration on a Nondeterministic NP-Hard Markov Decision Process (Student Abstract). AAAI 2023: 16204-16205
- [c46] Jing Wang, Meichen Song, Feng Gao, Boyi Liu, Zhaoran Wang, Yi Wu:
Differentiable Arbitrating in Zero-sum Markov Games. AAMAS 2023: 1034-1043
- [c45] Zelai Xu, Yancheng Liang, Chao Yu, Yu Wang, Yi Wu:
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games. AAMAS 2023: 1053-1061
- [c44] Chao Yu, Xinyi Yang, Jiaxuan Gao, Jiayu Chen, Yunfei Li, Jijia Liu, Yunfei Xiang, Ruixin Huang, Huazhong Yang, Yi Wu, Yu Wang:
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration. AAMAS 2023: 1107-1115
- [c43] Chao Yu, Jiaxuan Gao, Weilin Liu, Botian Xu, Hao Tang, Jiaqi Yang, Yu Wang, Yi Wu:
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased. ICLR 2023
- [c42] Yixuan Mei, Jiaxuan Gao, Weirui Ye, Shaohuai Liu, Yang Gao, Yi Wu:
SpeedyZero: Mastering Atari with Limited Data and Time. ICLR 2023
- [c41] Yunfei Li, Chaoyi Pan, Huazhe Xu, Xiaolong Wang, Yi Wu:
Efficient Bimanual Handover and Rearrangement via Symmetry-Aware Actor-Critic Learning. ICRA 2023: 3867-3874
- [c40] Weihua Du, Jinglun Zhao, Chao Yu, Xingcheng Yao, Zimeng Song, Siyang Wu, Ruifeng Luo, Zhiyuan Liu, Xianzhong Zhao, Yi Wu:
Automatic Truss Design with Reinforcement Learning. IJCAI 2023: 3659-3667
- [c39] Yingying Wu, Shusheng Xu, Shing-Tung Yau, Yi Wu:
PhyloTransformer: A Self-supervised Discriminative Model for SARS-CoV-2 Viral Mutation Prediction Based on a Multi-head Self-attention Mechanism. KDH@IJCAI 2023
- [c38] Wei Fu, Weihua Du, Jingwei Li, Sunli Chen, Jingzhao Zhang, Yi Wu:
Iteratively Learn Diverse Strategies with State Distance Information. NeurIPS 2023
- [i57] Chao Yu, Xinyi Yang, Jiaxuan Gao, Jiayu Chen, Yunfei Li, Jijia Liu, Yunfei Xiang, Ruixin Huang, Huazhong Yang, Yi Wu, Yu Wang:
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration. CoRR abs/2301.03398 (2023)
- [i56] Chao Yu, Jiaxuan Gao, Weilin Liu, Botian Xu, Hao Tang, Jiaqi Yang, Yu Wang, Yi Wu:
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased. CoRR abs/2302.01605 (2023)
- [i55] Jing Wang, Meichen Song, Feng Gao, Boyi Liu, Zhaoran Wang, Yi Wu:
Differentiable Arbitrating in Zero-sum Markov Games. CoRR abs/2302.10058 (2023)
- [i54] Qian Luo, Yunfei Li, Yi Wu:
Grounding Object Relations in Language-Conditioned Robotic Manipulation with Semantic-Spatial Reasoning. CoRR abs/2303.17919 (2023)
- [i53] Shusheng Xu, Huaijie Wang, Jiaxuan Gao, Yutao Ouyang, Chao Yu, Yi Wu:
Language-Guided Generation of Physically Realistic Robot Motion and Control. CoRR abs/2306.10518 (2023)
- [i52] Weihua Du, Jinglun Zhao, Chao Yu, Xingcheng Yao, Zimeng Song, Siyang Wu, Ruifeng Luo, Zhiyuan Liu, Xianzhong Zhao, Yi Wu:
Automatic Truss Design with Reinforcement Learning. CoRR abs/2306.15182 (2023)
- [i51] Zhiyu Mei, Wei Fu, Guangju Wang, Huanchen Zhang, Yi Wu:
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores. CoRR abs/2306.16688 (2023)
- [i50] Zikun Li, Jinjun Peng, Yixuan Mei, Sina Lin, Yi Wu, Oded Padon, Zhihao Jia:
Quarl: A Learning-Based Quantum Circuit Optimizer. CoRR abs/2307.10120 (2023)
- [i49] Yancheng Liang, Jiajie Zhang, Hui Li, Xiaochen Liu, Yi Hu, Yong Wu, Jinyao Zhang, Yongyan Liu, Yi Wu:
DeRisk: An Effective Deep Learning Framework for Credit Risk Prediction over Real-World Financial Data. CoRR abs/2308.03704 (2023)
- [i48] Botian Xu, Feng Gao, Chao Yu, Ruize Zhang, Yi Wu, Yu Wang:
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control. CoRR abs/2309.12825 (2023)
- [i47] Zelai Xu, Yancheng Liang, Chao Yu, Yu Wang, Yi Wu:
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games. CoRR abs/2310.03354 (2023)
- [i46] Jiayu Chen, Zelai Xu, Yunfei Li, Chao Yu, Jiaming Song, Huazhong Yang, Fei Fang, Yu Wang, Yi Wu:
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning. CoRR abs/2310.04796 (2023)
- [i45] Hongyu Wang, Shuming Ma, Li Dong, Shaohan Huang, Huaijie Wang, Lingxiao Ma, Fan Yang, Ruiping Wang, Yi Wu, Furu Wei:
BitNet: Scaling 1-bit Transformers for Large Language Models. CoRR abs/2310.11453 (2023)
- [i44] Wei Fu, Weihua Du, Jingwei Li, Sunli Chen, Jingzhao Zhang, Yi Wu:
Iteratively Learn Diverse Strategies with State Distance Information. CoRR abs/2310.14509 (2023)
- [i43] Zelai Xu, Chao Yu, Fei Fang, Yu Wang, Yi Wu:
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game. CoRR abs/2310.18940 (2023)
- [i42] Yunfei Li, Jinhan Li, Wei Fu, Yi Wu:
Learning Agile Bipedal Motions on a Quadrupedal Robot. CoRR abs/2311.05818 (2023)
- [i41] Ying Yuan, Haichuan Che, Yuzhe Qin, Binghao Huang, Zhao-Heng Yin, Kang-Won Lee, Yi Wu, Soo-Chul Lim, Xiaolong Wang:
Robot Synesthesia: In-Hand Manipulation with Visuotactile Sensing. CoRR abs/2312.01853 (2023)
- [i40] Jijia Liu, Chao Yu, Jiaxuan Gao, Yuqing Xie, Qingmin Liao, Yi Wu, Yu Wang:
LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination. CoRR abs/2312.15224 (2023)
- 2022
- [c37] Shusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei:
Sequence Level Contrastive Learning for Text Summarization. AAAI 2022: 11556-11565
- [c36] Chao Yu, Xinyi Yang, Jiaxuan Gao, Huazhong Yang, Yu Wang, Yi Wu:
Learning Efficient Multi-agent Cooperative Visual Exploration. ECCV (39) 2022: 497-515
- [c35] Zihan Zhou, Wei Fu, Bingliang Zhang, Yi Wu:
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization. ICLR 2022
- [c34] Wei Fu, Chao Yu, Zelai Xu, Jiaqi Yang, Yi Wu:
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning. ICML 2022: 6863-6877
- [c33] Yunfei Li, Tian Gao, Jiaqi Yang, Huazhe Xu, Yi Wu:
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning. ICML 2022: 12765-12781
- [c32] Yunfei Li, Tao Kong, Lei Li, Yi Wu:
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets. ICRA 2022: 7469-7476
- [c31] Shusheng Xu, Huaijie Wang, Yi Wu:
Grounded Reinforcement Learning: Learning to Win the Game under Human Commands. NeurIPS 2022
- [c30] Chao Yu, Akash Velu, Eugene Vinitsky, Jiaxuan Gao, Yu Wang, Alexandre M. Bayen, Yi Wu:
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games. NeurIPS 2022
- [c29] Zhecheng Yuan, Zhengrong Xue, Bo Yuan, Xueqian Wang, Yi Wu, Yang Gao, Huazhe Xu:
Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning. NeurIPS 2022
- [i39] Runlong Zhou, Yuandong Tian, Yi Wu, Simon S. Du:
Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems. CoRR abs/2202.05423 (2022)
- [i38] Zihan Zhou, Wei Fu, Bingliang Zhang, Yi Wu:
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization. CoRR abs/2204.02246 (2022)
- [i37] Yunfei Li, Tao Kong, Lei Li, Yi Wu:
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets. CoRR abs/2204.05509 (2022)
- [i36] Wei Fu, Chao Yu, Zelai Xu, Jiaqi Yang, Yi Wu:
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2206.07505 (2022)
- [i35] Yunfei Li, Tian Gao, Jiaqi Yang, Huazhe Xu, Yi Wu:
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning. CoRR abs/2206.12030 (2022)
- [i34] Kevin Du, Ian Gemp, Yi Wu, Yingying Wu:
AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process. CoRR abs/2211.09622 (2022)
- [i33] Zhecheng Yuan, Zhengrong Xue, Bo Yuan, Xueqian Wang, Yi Wu, Yang Gao, Huazhe Xu:
Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning. CoRR abs/2212.08860 (2022)
- 2021
- [j1] Yining Wang, Yi Wu, Simon S. Du:
Near-Linear Time Local Polynomial Nonparametric Estimation with Box Kernels. INFORMS J. Comput. 33(4): 1339-1353 (2021)
- [c28] Wei Fu, Chao Yu, Yunfei Li, Yi Wu:
Unlocking the Potential of MAPPO with Asynchronous Optimization. CICAI (2) 2021: 395-407
- [c27] Yunfei Li, Yilin Wu, Huazhe Xu, Xiaolong Wang, Yi Wu:
Solving Compositional Reinforcement Learning Problems via Task Reduction. ICLR 2021
- [c26] Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Shaolei Du, Yu Wang, Yi Wu:
Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization. ICLR 2021
- [c25] Weizhe Chen, Zihan Zhou, Yi Wu, Fei Fang:
Temporal Induced Self-Play for Stochastic Bayesian Games. IJCAI 2021: 96-103
- [c24] Yunfei Li, Tao Kong, Lei Li, Yifeng Li, Yi Wu:
Learning to Design and Construct Bridge without Blueprint. IROS 2021: 2398-2405
- [c23] Jiayu Chen, Yuanxin Zhang, Yuanfan Xu, Huimin Ma, Huazhong Yang, Jiaming Song, Yu Wang, Yi Wu:
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems. NeurIPS 2021: 9681-9693
- [c22] Shusheng Xu, Yichen Liu, Xiaoyu Yi, Siyuan Zhou, Huizi Li, Yi Wu:
Native Chinese Reader: A Dataset Towards Native-Level Chinese Machine Reading Comprehension. NeurIPS Datasets and Benchmarks 2021
- [c21] Tianjun Zhang, Huazhe Xu, Xiaolong Wang, Yi Wu, Kurt Keutzer, Joseph E. Gonzalez, Yuandong Tian:
NovelD: A Simple yet Effective Exploration Criterion. NeurIPS 2021: 25217-25230
- [i32] Chao Yu, Akash Velu, Eugene Vinitsky, Yu Wang, Alexandre M. Bayen, Yi Wu:
The Surprising Effectiveness of MAPPO in Cooperative, Multi-Agent Games. CoRR abs/2103.01955 (2021)
- [i31] Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon S. Du, Yu Wang, Yi Wu:
Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization. CoRR abs/2103.04564 (2021)
- [i30] Yunfei Li, Yilin Wu, Huazhe Xu, Xiaolong Wang, Yi Wu:
Solving Compositional Reinforcement Learning Problems via Task Reduction. CoRR abs/2103.07607 (2021)
- [i29] Minghao Zhang, Pingcheng Jian, Yi Wu, Huazhe Xu, Xiaolong Wang:
Disentangled Attention as Intrinsic Regularization for Bimanual Multi-Object Manipulation. CoRR abs/2106.05907 (2021)
- [i28] Yunfei Li, Tao Kong, Lei Li, Yifeng Li, Yi Wu:
Learning to Design and Construct Bridge without Blueprint. CoRR abs/2108.02439 (2021)
- [i27] Weizhe Chen, Zihan Zhou, Yi Wu, Fei Fang:
Temporal Induced Self-Play for Stochastic Bayesian Games. CoRR abs/2108.09444 (2021)
- [i26] Shusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei:
Sequence Level Contrastive Learning for Text Summarization. CoRR abs/2109.03481 (2021)
- [i25] Chao Yu, Xinyi Yang, Jiaxuan Gao, Huazhong Yang, Yu Wang, Yi Wu:
Learning Efficient Multi-Agent Cooperative Visual Exploration. CoRR abs/2110.05734 (2021)
- [i24] Yingying Wu, Shusheng Xu, Shing-Tung Yau, Yi Wu:
PhyloTransformer: A Discriminative Model for Mutation Prediction Based on a Multi-head Self-attention Mechanism. CoRR abs/2111.01969 (2021)
- [i23] Jiayu Chen, Yuanxin Zhang, Yuanfan Xu, Huimin Ma, Huazhong Yang, Jiaming Song, Yu Wang, Yi Wu:
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems. CoRR abs/2111.04613 (2021)
- [i22] Weilin Liu, Ye Mu, Chao Yu, Xuefei Ning, Zhong Cao, Yi Wu, Shuang Liang, Huazhong Yang, Yu Wang:
Multi-Agent Vulnerability Discovery for Autonomous Driving with Hazard Arbitration Reward. CoRR abs/2112.06185 (2021)
- [i21] Shusheng Xu, Yancheng Liang, Yunfei Li, Simon Shaolei Du, Yi Wu:
A Benchmark for Low-Switching-Cost Reinforcement Learning. CoRR abs/2112.06424 (2021)
- [i20] Shusheng Xu, Yichen Liu, Xiaoyu Yi, Siyuan Zhou, Huizi Li, Yi Wu:
Native Chinese Reader: A Dataset Towards Native-Level Chinese Machine Reading Comprehension. CoRR abs/2112.06494 (2021)
- [i19] Rui Zhao, Jinming Song, Haifeng Hu, Yang Gao, Yi Wu, Zhongqian Sun, Yang Wei:
Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination. CoRR abs/2112.11701 (2021)
- 2020
- [c20] Shusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei, Ming Zhou:
Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers. EMNLP (Findings) 2020: 1784-1795
- [c19] Tonghan Wang, Jianhao Wang, Yi Wu, Chongjie Zhang:
Influence-Based Multi-Agent Exploration. ICLR 2020
- [c18] Bowen Baker, Ingmar Kanitscheider, Todor M. Markov, Yi Wu, Glenn Powell, Bob McGrew, Igor Mordatch:
Emergent Tool Use From Multi-Agent Autocurricula. ICLR 2020
- [c17] Qian Long, Zihan Zhou, Abhinav Gupta, Fei Fang, Yi Wu, Xiaolong Wang:
Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning. ICLR 2020
- [c16] Ruihan Yang, Huazhe Xu, Yi Wu, Xiaolong Wang:
Multi-Task Reinforcement Learning with Soft Modularization. NeurIPS 2020
- [i18] Qian Long, Zihan Zhou, Abhinav Gupta, Fei Fang, Yi Wu, Xiaolong Wang:
Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning. CoRR abs/2003.10423 (2020)
- [i17] Ruihan Yang, Huazhe Xu, Yi Wu, Xiaolong Wang:
Multi-Task Reinforcement Learning with Soft Modularization. CoRR abs/2003.13661 (2020)
- [i16] Shusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei, Ming Zhou:
Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers. CoRR abs/2010.08242 (2020)
- [i15] Tianjun Zhang, Huazhe Xu, Xiaolong Wang, Yi Wu, Kurt Keutzer, Joseph E. Gonzalez, Yuandong Tian:
Multi-Agent Collaboration via Reward Attribution Decomposition. CoRR abs/2010.08531 (2020)
- [i14] Tianjun Zhang, Huazhe Xu, Xiaolong Wang, Yi Wu, Kurt Keutzer, Joseph E. Gonzalez, Yuandong Tian:
BeBold: Exploration Beyond the Boundary of Explored Regions. CoRR abs/2012.08621 (2020)
2010 – 2019
- 2019
- [c15] Yufei Wang, Zheyuan Ryan Shi, Lantao Yu, Yi Wu, Rohit Singh, Lucas Joppa, Fei Fang:
Deep Reinforcement Learning for Green Security Games with Real-Time Information. AAAI 2019: 1401-1408
- [c14] Shihui Li, Yi Wu, Xinyue Cui, Honghua Dong, Fei Fang, Stuart Russell:
Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient. AAAI 2019: 4213-4220
- [c13] Yi Wu, Yuxin Wu, Aviv Tamar, Stuart Russell, Georgia Gkioxari, Yuandong Tian:
Bayesian Relational Memory for Semantic Visual Navigation. ICCV 2019: 2769-2779
- [i13] Yi Wu, Yuxin Wu, Aviv Tamar, Stuart Russell, Georgia Gkioxari, Yuandong Tian:
Bayesian Relational Memory for Semantic Visual Navigation. CoRR abs/1909.04306 (2019)
- [i12] Bowen Baker, Ingmar Kanitscheider, Todor M. Markov, Yi Wu, Glenn Powell, Bob McGrew, Igor Mordatch:
Emergent Tool Use From Multi-Agent Autocurricula. CoRR abs/1909.07528 (2019)
- [i11] Tonghan Wang, Jianhao Wang, Yi Wu, Chongjie Zhang:
Influence-Based Multi-Agent Exploration. CoRR abs/1910.05512 (2019)
- 2018
- [c12] Lantao Yu, Yi Wu, Rohit Singh, Lucas Joppa, Fei Fang:
Deep Reinforcement Learning for Green Security Game with Online Information. AAAI Workshops 2018: 325-333
- [c11] Yi Wu, Yuxin Wu, Georgia Gkioxari, Yuandong Tian:
Building Generalizable Agents with a Realistic and Rich 3D Environment. ICLR (Workshop) 2018
- [c10] Yi Wu, Siddharth Srivastava, Nicholas Hay, Simon S. Du, Stuart Russell:
Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms. ICML 2018: 5339-5348
- [c9] Tongzhou Wang, Yi Wu, Dave Moore, Stuart J. Russell:
Meta-Learning MCMC Proposals. NeurIPS 2018: 4150-4160
- [i10] Yi Wu, Yuxin Wu, Georgia Gkioxari, Yuandong Tian:
Building Generalizable Agents with a Realistic and Rich 3D Environment. CoRR abs/1801.02209 (2018)
- [i9] Yining Wang, Yi Wu, Simon S. Du:
Near-Linear Time Local Polynomial Nonparametric Estimation. CoRR abs/1802.09578 (2018)
- [i8] Yi Wu, Siddharth Srivastava, Nicholas Hay, Simon S. Du, Stuart Russell:
Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms. CoRR abs/1806.02027 (2018)
- [i7] Yi Wu, Yuxin Wu, Aviv Tamar, Stuart Russell, Georgia Gkioxari, Yuandong Tian:
Learning and Planning with a Semantic Model. CoRR abs/1809.10842 (2018)
- [i6] Yufei Wang, Zheyuan Ryan Shi, Lantao Yu, Yi Wu, Rohit Singh, Lucas Joppa, Fei Fang:
Deep Reinforcement Learning for Green Security Games with Real-Time Information. CoRR abs/1811.02483 (2018)
- 2017
- [c8] Yusuf Bugra Erol, Yi Wu, Lei Li, Stuart Russell:
A Nearly-Black-Box Online Algorithm for Joint Parameter and State Estimation in Temporal Models. AAAI 2017: 1861-1869
- [c7] Yi Wu, David Bamman, Stuart Russell:
Adversarial Training for Relation Extraction. EMNLP 2017: 1778-1783
- [c6] Aviv Tamar, Yi Wu, Garrett Thomas, Sergey Levine, Pieter Abbeel:
Value Iteration Networks. IJCAI 2017: 4949-4953
- [c5] Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, Igor Mordatch:
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments. NIPS 2017: 6379-6390
- [i5] Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, Igor Mordatch:
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments. CoRR abs/1706.02275 (2017)
- [i4] Tongzhou Wang, Yi Wu, David A. Moore, Stuart J. Russell:
Neural Block Sampling. CoRR abs/1708.06040 (2017)
- 2016
- [c4] Yi Wu, Lei Li, Stuart Russell, Rastislav Bodík:
Swift: Compiled Inference for Probabilistic Programming Languages. IJCAI 2016: 3637-3645
- [c3] Aviv Tamar, Sergey Levine, Pieter Abbeel, Yi Wu, Garrett Thomas:
Value Iteration Networks. NIPS 2016: 2146-2154
- [i3] Yusuf Bugra Erol, Yi Wu, Lei Li, Stuart Russell:
Towards Practical Bayesian Parameter and State Estimation. CoRR abs/1603.08988 (2016)
- [i2] Yi Wu, Lei Li, Stuart Russell, Rastislav Bodík:
Swift: Compiled Inference for Probabilistic Programming Languages. CoRR abs/1606.09242 (2016)
- 2015
- [c2] Yi Wu, David P. Wipf, Jeong-Min Yun:
Understanding and Evaluating Sparse Linear Discriminant Analysis. AISTATS 2015
- 2012
- [c1] David P. Wipf, Yi Wu:
Dual-Space Analysis of the Sparse Linear Model. NIPS 2012: 1754-1762
- [i1] David P. Wipf, Yi Wu:
Dual-Space Analysis of the Sparse Linear Model. CoRR abs/1207.2422 (2012)
last updated on 2025-01-27 20:48 CET by the dblp team
all metadata released as open data under CC0 1.0 license