Need help with rl_learn?
Click the “chat” button below for chat support from the developer who created it, or find similar developers for support.

About the developer

applenob
220 Stars 87 Forks 68 Commits 0 Opened issues

Description

我的强化学习笔记和学习材料:book: still updating ... ...

Services available

!
?

Need anything else?

Contributors list

[WIP]强化学习的学习仓库

这是我个人学习强化学习的时候收集的比较经典的学习资料、笔记和代码,分享给所有人。

为了直接在GitHub上用markdown文件看公式,推荐安装chrome插件:MathJax Plugin for Github

入门指南

课程笔记

实验目录

所有的实验源代码都在

lib
目录下,来自dennybritz。在原先代码的基础上,增加了对实验背景的具体介绍、代码和公式的对照。
  • Gridworld:对应MDPDynamic Programming
  • Blackjack:对应Model FreeMonte Carlo的Planning和Controlling
  • Windy Gridworld:对应Model FreeTemporal DifferenceOn-Policy ControllingSARSA
  • Cliff Walking:对应Model FreeTemporal DifferenceOff-Policy ControllingQ-learning
  • Mountain Car:对应Q表格很大无法处理(state空间连续)的Q-Learning with Linear Function Approximation
  • Atari:对应Deep-Q Learning

其他重要学习资料:

We use cookies. If you continue to browse the site, you agree to the use of cookies. For more information on our use of cookies please see our Privacy Policy.