I have developed a compact GPT model capable of predicting subsequent words from input sequences, drawing inspiration from Andrej Karpathy
- Learn the encode-decode technique by character level
- Build a model that can predict the next word based on a given sentence
- Using the Transformer technique from Attention Is All You Need paper
- Train model on The Raven poem written by Edgar Allan Poe
- Test model