week4_approx_rl

Materials

More materials

  • [recommended] How to actually do deep reinforcement learning by J. Schulman - pdf
  • [recommended] An overview of deep reinforcement learning - arxiv
  • DQN and modifications - lecture by J. Schulman - video
    • interactive demos in your browser: demo1(karpathy), demo2(Hünermann)
  • Reinforcement learning architectures list - repo
  • Article on dueling DQN - arxiv
  • Article on double DQN - arxiv
  • Article on prioritized experience replay - arxiv
  • Article on bootstrap DQN - pdf, summary
  • Article on asynchronous methods in deep RL - arxiv
  • Successor representations for reinforcement learning - article, video
  • Video on asynchronous methods (Mnih) - video

DQN tutorials

  • [in pytorch] A great series starting from simple DQN to all the cool new stuff - url
  • A guide to deep RL from ~scratch (nervana blog) - url
  • Building deep q-network from ~scratch (blog) - url
  • Another guide to DQN from ~scratch (blog) - url

Practice

From now on there are two tracks: theano and tensorflow. We'll also add pytorch support soon.

You can choose whichever track you want, but unless you're expertly familiar with your framework, we recommend starting by completing the task in lasagne and only then reproducing your solution in your framework of choice.

Begin with seminar_<framework>.ipynb and then proceed with homework_<framework>.ipynb.

__Note: you're not required to submit assignments in all frameworks. Pick one and go with it. Maybe switch occasionally if you want more challenge.__
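Whichever framework you pick, the core of the assignment is the same one-step Q-learning target the DQN papers above build on. As a quick framework-agnostic refresher (this is a minimal numpy sketch, not the course's actual code; function and variable names here are made up for illustration):

```python
import numpy as np

def td_target(rewards, next_q_values, dones, gamma=0.99):
    """One-step Q-learning target: r + gamma * max_a' Q(s', a').

    The bootstrap term is zeroed out on terminal transitions
    via the (1 - done) mask.
    """
    return rewards + gamma * (1.0 - dones) * next_q_values.max(axis=1)

# Toy batch: 2 transitions, 3 actions each.
rewards = np.array([1.0, 0.0])
next_q = np.array([[0.5, 2.0, 1.0],   # best next action has Q = 2.0
                   [3.0, 0.0, 1.0]])  # terminal, so next_q is ignored
dones = np.array([0.0, 1.0])
print(td_target(rewards, next_q, dones))  # [2.98, 0.0]
```

In the notebooks you will compute this target with your framework's tensors instead of numpy, and regress the predicted Q-values toward it; the stabilizing tricks from the reading list (target networks, double DQN, prioritized replay) are all variations on how this target is formed and sampled.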