- Slides
- Our lecture, seminar
- The only relevant video-lecture we could find - video
- We will hopefully record our lecture in English soon!
- Self-critical sequence training original article
- An awesome post explaining attention and long-term memory models.
- BLEU and CIDEr articles.
- Image captioning
- Other articles on reinforcement learning for natural language:
- task-oriented conversation system
- generating dialogues
- sequential adversarial networks (a.k.a. SeqGAN)
- A broad overview of machine translation (touching on RL, including RL failures) - article
- How not to evaluate conversation models - article
- Overview of other non-games applications ("that article again") - https://arxiv.org/abs/1701.07274
As usual, go to practice_theano.ipynb or practice_tf.ipynb and follow instructions from there.
Other frameworks: your task remains the same as in the main track:
- Implement or borrow a seq2seq model for the same translation task
- Neat tensorflow repo
- Important: this repo uses a simplified phoneme dictionary - make sure you change the preprocessing phase so you can meaningfully compare results.
- Implement self-critical sequence training (= basic policy gradient with a special baseline; see the notebook)
- Beat the baseline (main notebook: step6)
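For intuition, the self-critical baseline boils down to weighting each sampled sequence's log-probability by how much its reward beats the greedy (test-time) rollout's reward. A minimal sketch in plain Python - the function names here are illustrative, not the notebook's actual interface:

```python
def scst_advantages(sample_rewards, greedy_rewards):
    """Self-critical advantage per sequence: r(sample) - r(greedy).

    Samples that beat the greedy rollout get positive weight
    (their log-probabilities are pushed up); worse ones get negative.
    """
    return [rs - rg for rs, rg in zip(sample_rewards, greedy_rewards)]


def scst_surrogate_loss(logp_samples, sample_rewards, greedy_rewards):
    """Surrogate objective: mean of -(r_sample - r_greedy) * log p(sample).

    Minimizing this with your framework's autodiff implements the
    policy gradient with the self-critical baseline.
    """
    n = len(logp_samples)
    return -sum((rs - rg) * lp
                for lp, rs, rg in zip(logp_samples,
                                      sample_rewards,
                                      greedy_rewards)) / n
```

Note that the greedy reward enters only as a constant baseline: no gradient flows through it, which is what makes the estimator unbiased.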
Even if you decide to use custom frameworks, it is highly recommended that you reuse evaluation code (e.g. min Levenshtein) from the main notebook to avoid confusion.
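If you do end up reimplementing the evaluation rather than reusing the notebook's code, the metric in question is plain edit distance. A minimal sketch (the notebook's own implementation may differ in details):

```python
def levenshtein(a, b):
    """Classic dynamic-programming edit distance: the minimum number of
    single-character insertions, deletions, and substitutions turning a into b.
    Uses a rolling row, so memory is O(len(b))."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]
```

Whatever implementation you use, make sure both tracks compute the distance over the same token units (e.g. phonemes vs. characters), or the comparison is meaningless.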