Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods (Q113424403)
Jump to navigation
Jump to search
scientific article published on 04 July 2022
Language | Label | Description | Also known as |
---|---|---|---|
default for all languages | No label defined |
||
English | Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods |
scientific article published on 04 July 2022 |
Statements
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods (English)
Xin Guo
Anran Hu
Junzi Zhang