Skip to content

A very simplified transformer language model in C++

License

Notifications You must be signed in to change notification settings

VincentSe/FemtoGPT

Repository files navigation

FemtoGPT

A very simplified transformer language model in C++.

This is inspired by Andrej Karpathy's work:

The training text is a subset of Shakespeare's plays, instead of the whole internet. Also this is a character based language model, instead of word based. PyTorch and Tensorflow are not used, to study what is costs to implement gradients manually.

Open FemtoGPT.sln in Visual Studio, then build and execute FemtoGPT.exe to start training this language model.

About

A very simplified transformer language model in C++

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages