Links on all articles mentioned during the lecture could be found in "References" at the very end of the presentation slides. All other interesing links which contribute to the topic of POMDP are presented below
- Our lecture and seminar (russian)
- A lecture on basics by Andrew NG (english, LQ) - video
- A lecture on basics by 5vision (russian) - video
- [alternative] Chalkboard-style 2-part lecture by B. Ravindran. - part1, part2
- [alternative] Yet another mini-lecture touching on POMDP by S.S. Baveja - video
- DRQN lecture by Fritz448 (russian) - video
- Data efficient learning in continous POMDP
- Managing wind farms with bayesian POMDP
- Bayesian learning and decision-making in dynamic environments
Theano:
- Optionally go through
theano_recurrence_tutorial.ipynb
- Seminar task is in
practice_theano.ipynb
- Homework assignment is in
homework_common.ipynb
Tensorflow:
- Look through
atari_util.py
andenv_pool.py
- Seminar task is in
practice_tensorflow.ipynb
- Homework assignment is in
homework_common.ipynb
Homework is platform and framewerk independent, so choose the ones which suit you best, but pay attention on how many you will need to implement youself in case of nonstandart ones.