Skip to content

A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs. Models: word2vec, CNNs, transformer, gpt.

License

Notifications You must be signed in to change notification settings

Tekrific/deeplearning-nlp-models

 
 

Repository files navigation

deeplearning-nlp-models

Coveralls github Travis (.com) CodeFactor Grade GitHub

A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch.

This is presented as an (incomplete) starting point for those interested in getting into the weeds of DL architectures in NLP. Annotated models are presented along with some notes.

There are links to run these models on colab with GPUs 🌩️ via notebooks.

Current models: word2vec, CNNs, transformer, gpt. (Work in progress)

Meta

BERT: Reading. Comprehending.

Note: These are toy versions of each model.

Contents

Models

These NLP models are presented chronologically and, as you might expect, build off each other.

Model Class Model Year
Embeddings
1. Word2Vec Embeddings (Self-Supervised Learning) 2013
CNNs
2. CNN-based Text Classification (Binary Classification) 2014
Transformers
3. The O.G. Transformer (Machine Translation) 2017
4. OpenAI's GPT Model (Language Model) 2018, 2019, 2020

Features

This repository has the following features:

  • model overviews: A brief overview of each model's motivation and design are provided in separate README.md files.
  • Jupyter notebooks (easy to run on colab w/ GPUs): Jupyter notebooks showing how to run the models and some simple analyses of the model results.
  • self-contained: Tokenizers, dataset loaders, dictionaries, and all the custom utilities required for each problem.

Endgame

After reviewing these models, the world's your oyster in terms of other models to explore:

Char-RNN, BERT, ELMO, XLNET, all the other BERTs, BART, Performer, T5, etc....

Roadmap

Future models to implement:

  • Char-RNN (Kaparthy)
  • BERT

Future repo features:

  • Tensorboard plots
  • Val set demonstrations
  • Saving checkpoints/ loading models
  • BPE (from either openai/gpt-2 or facebook's fairseq library)

Setup

You can install the repo using pip:

pip install git+https://github.com/will-thompson-k/deeplearning-nlp-models 

Structure

Here is a breakdown of the repository:

  • nlpmodels/models: The model code for each paper.
  • nlpmodels/utils: Contains all the auxiliary classes related to building a model, including datasets, vocabulary, tokenizers, samplers and trainer classes. (Note: Most of the non-model files are thrown into utils. I would advise against that in a larger repo.)
  • tests: Light (and by no means comprehensive) coverage.
  • notebooks: Contains the notebooks and write-ups for each model implementation.

A few useful commands:

  • make test: Run the full suite of tests (you can also use setup.py test and run_tests.sh).
  • make test_light: Run all tests except the regression tests.
  • make lint: If you really like linting code (also can run run_pylint.sh).

Requirements

Python 3.6+

Here are the package requirements (found in requirements.txt)

  • numpy==1.19.1
  • tqdm==4.50.2
  • torch==1.7.0
  • datasets==1.0.2
  • torchtext==0.8.0

Citation

@misc{deeplearning-nlp-models,
  author = {Thompson, Will},
  url = {https://github.com/will-thompson-k/deeplearning-nlp-models},
  year = {2020}
}

License

MIT

About

A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs. Models: word2vec, CNNs, transformer, gpt.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 66.8%
  • Python 33.1%
  • Other 0.1%