Machado de Assis - Transformer-based Language Model

Installation

Create a virtual environment and install the dependencies:

python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Dataset

The dataset is a collection of books by the Brazilian writer Machado de Assis. It was downloaded from Kaggle.

After downloading the dataset, unzip it into the data folder, then run the following command to merge the books into a single .txt file, which is used to train the transformer model.

python data.py
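
For readers who want a feel for what this preparation step involves, here is a minimal sketch of merging a folder of text files into one training file. It is not the contents of data.py: the function name `merge_text_files`, the `*.txt` file pattern, and the UTF-8 encoding are all assumptions made for illustration.

```python
from pathlib import Path


def merge_text_files(src_dir, out_path, encoding="utf-8"):
    """Concatenate every .txt file under src_dir into a single file.

    Returns the number of characters written. Hypothetical helper;
    the real data.py may work differently.
    """
    src = Path(src_dir)
    out = Path(out_path).resolve()
    total = 0
    with open(out, "w", encoding=encoding) as fout:
        # Sort so the merged corpus is reproducible across runs.
        for path in sorted(src.rglob("*.txt")):
            if path.resolve() == out:
                continue  # skip the output file if it lives inside src_dir
            text = path.read_text(encoding=encoding, errors="replace").strip()
            # Separate books with a blank line.
            fout.write(text + "\n\n")
            total += len(text) + 2
    return total
```

Calling `merge_text_files("data", "data/corpus.txt")` would then write one combined file; both paths are placeholders, not names taken from the repository.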

Training

To train the simple bigram model, run the following command:

python train_bigram.py
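
As background, a bigram model predicts the next token from the previous token alone. The sketch below shows the idea with a count-based, character-level model in plain Python. It is an illustration under assumptions, not the code in train_bigram.py, which may well use a learned, neural formulation instead; the names `train_bigram` and `generate` are hypothetical.

```python
import random
from collections import Counter, defaultdict


def train_bigram(text):
    """Count how often each character follows each other character."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(text, text[1:]):
        counts[prev][nxt] += 1
    return counts


def generate(counts, start, length, seed=0):
    """Sample up to `length` characters, each conditioned only on the previous one."""
    rng = random.Random(seed)
    out = [start]
    for _ in range(length):
        followers = counts.get(out[-1])
        if not followers:
            break  # the last character was never seen with a successor
        chars = list(followers.keys())
        weights = list(followers.values())
        # Sample in proportion to the observed follow-up counts.
        out.append(rng.choices(chars, weights=weights, k=1)[0])
    return "".join(out)
```

Because the model looks back only one character, its samples capture local letter patterns but no longer-range structure, which is the gap the transformer model is meant to close.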

To train the transformer model, run the following command:

python train.py
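
To give a rough picture of how a single .txt file can feed a language model, the sketch below encodes text as integer ids and samples training pairs in which the target is the input shifted by one position. Character-level tokenization and the names `build_vocab`, `encode`, `decode`, and `get_batch` are assumptions for illustration; train.py may tokenize and batch differently.

```python
import random


def build_vocab(text):
    """Map each distinct character to an integer id, and back."""
    chars = sorted(set(text))
    stoi = {ch: i for i, ch in enumerate(chars)}
    itos = {i: ch for ch, i in stoi.items()}
    return stoi, itos


def encode(text, stoi):
    return [stoi[ch] for ch in text]


def decode(ids, itos):
    return "".join(itos[i] for i in ids)


def get_batch(ids, block_size, batch_size, rng):
    """Sample (inputs, targets) windows; requires len(ids) > block_size."""
    xs, ys = [], []
    for _ in range(batch_size):
        start = rng.randrange(len(ids) - block_size)
        xs.append(ids[start:start + block_size])
        # Targets are the same window shifted one position to the right.
        ys.append(ids[start + 1:start + block_size + 1])
    return xs, ys
```

In a real training loop these lists would typically be converted to tensors, with `block_size` acting as the context length of the model.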

The weights of the transformer model are saved in the weights folder. Training this model takes about 15 minutes on an NVIDIA RTX 4090 GPU.
