Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
ok i tried bringing back original init again and this time it makes a…
… ton of difference and works much better than default. i'm not sure what was different with my earlier experiment where i saw a slight regression. may try to dissect commits later, for now merged the original mingpt init (following gpt-2 paper) as default.
- Loading branch information