Practical_RL/week8_scst at 39472d1b98f889e6b48b6bbc51d89f7cf03e78fb · olgaiv39/Practical_RL · GitHub


Materials

Slides
Our lecture, seminar
The only relevant video-lecture we could find - video
Will hopefully record our lecture in English soon!
Self-critical sequence training original article

More materials

An awesome post explaining attention and long-term memory models.
BLEU and CIDEr articles.
Image captioning
- MSCOCO captioning challenge
- Captioning baseline notebook
Other articles on reinforcement learning for natural language:
- task-oriented conversation system
- generating dialogues
- sequential adversarial networks (a.k.a. SeqGAN)
- A large overview for machine translation (touching on RL, including RL failures) - arxiv
- How not to evaluate conversation models - arxiv
Overview of other non-game applications ("that article again") - arxiv

Homework

As usual, open practice_theano.ipynb or practice_tf.ipynb and follow the instructions there.