Skip to content

Commit

Permalink
add week5 hse deadline
Browse files Browse the repository at this point in the history
  • Loading branch information
justheuristic authored Mar 7, 2017
1 parent 74764b7 commit 0ce5402
Showing 1 changed file with 5 additions and 4 deletions.
9 changes: 5 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -82,13 +82,14 @@ Taught on-campus in [HSE](https://cs.hse.ru) and [Yandex SDA](https://yandexdata
* Homework - convnets on MNIST or simple deep q-learning
* HSE homework deadline _23.59 8.03.17_

* __week5__ Deep reinforcement learning
* Lecture: Deep Q-learning/sarsa/whatever. Heuristics & motivation behind them: experience replay, target networks, double/dueling/bootstrap DQN, etc.
* Seminar: DQN on atari
* Homework - convnets on MNIST or simple deep q-learning
* HSE homework deadline _23.59 15.03.17_

## Future lectures:

* __week5__ Deep reinforcement learning (coming 6.03.2017)
* Lecture: Deep Q-learning/sarsa/whatever. Heuristics & motivation behind them: experience replay, target networks, double/dueling/bootstrap DQN, etc.
* Seminar: Double DQN, Dueling DQN, experience replay on atari

* __week6__ Policy gradient methods (coming 13.03.2017)
* Lecture: Motivation for policy-based, policy gradient, logderivative trick, REINFORCE/crossentropy method, variance theorem(advantage), advantage actor-critic (incl.n-step advantage), off-policy actor-critic (off-PAC), natural gradients(briefly), continuous action space(teaser).
* Seminar: a2c Vs qlearning for MountainCar/Doom, entropy regularization & tricks, simple demo with continuous action spaces
Expand Down

0 comments on commit 0ce5402

Please sign in to comment.