Implementation of Universal Transformer in PyTorch
Updated Nov 19, 2018 - Python
Implementation of Universal Transformer in PyTorch
Implementation of the Adaptive Computation Time (ACT) function used in RNNs (https://arxiv.org/pdf/1603.08983.pdf) and in the Universal Transformer network (https://arxiv.org/pdf/1807.03819.pdf)
Universal Transformers implementation in PyTorch
Unofficial implementation of the Universal Transformer (https://arxiv.org/abs/1807.03819)
GPTs trained on a Shakespeare dataset. Includes a small 10.8M-parameter GPT following Andrej Karpathy's video lecture and a Universal Transformer with Adaptive Computation Time.
Simple from-scratch implementations of transformer-based models that match the state of the art.
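Several of the descriptions above mention Adaptive Computation Time. As a rough illustration only, the following pure-Python sketch loosely follows the halting rule from the ACT paper linked above: per-step halting probabilities are accumulated until they reach 1 - epsilon, the final step receives the remainder, and the intermediate states are averaged with those weights. The names `act_weights` and `act_combine`, the `epsilon` default, and the use of plain lists instead of tensors are assumptions made for this example; none of the listed repositories is claimed to work this way.

```python
def act_weights(halting_probs, epsilon=0.01):
    """Per-step weights for one position (hypothetical helper).

    Steps are consumed until the running sum of halting probabilities
    reaches 1 - epsilon, or until the supplied steps run out. The last
    step used receives the remainder so the weights sum to 1.
    """
    weights = []
    cumulative = 0.0
    last = len(halting_probs) - 1
    for step, h in enumerate(halting_probs):
        if cumulative + h >= 1.0 - epsilon or step == last:
            weights.append(1.0 - cumulative)  # remainder goes to the halting step
            break
        weights.append(h)
        cumulative += h
    return weights


def act_combine(states, halting_probs, epsilon=0.01):
    """Weighted average of intermediate states (lists of floats)."""
    weights = act_weights(halting_probs, epsilon)
    dim = len(states[0])
    # zip stops at the shorter sequence, so unused later states are ignored
    return [sum(w * s[i] for w, s in zip(weights, states)) for i in range(dim)]


if __name__ == "__main__":
    probs = [0.3, 0.5, 0.4]
    states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    print(act_weights(probs))
    print(act_combine(states, probs))
```

In a tensor-based model the same bookkeeping would typically be done per position and batched, with a cap on the number of steps; the step cap here is simply the length of `halting_probs`.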