- Slides
- Our lecture, seminar
- The only relevant video-lecture we could find - video
- Will hopefully record our lecture in english soon!
- Self-critical sequence traning original article
- An awesome post explaining attention and long-term memory models.
- BLEU and CIDEr articles.
- Image captioning
- Other articles on reinforcement learning for natural language:
- task-oriented conversation system
- generating dialogues
- sequential adversarial networks (a.k.a. SeqGAN)
- A large overview for machine translation (touching on RL, including RL failures) - arxiv
- How not to evaluate conversation models - arxiv
- Overview of other non-games applications ("that article again") - arxiv
As usual, go to practice_theano.ipynb or practice_tf.ipynb and follow instructions from there.