Issues: karpathy/minbpe
Issues list
One problem in the annotations of test_wikipedia_example in the tests/test_tokenizer file
#93
opened Nov 18, 2024 by
donglinkang2021
The regular expressions break all scripts with combining marks in the middle of the syllable
#73
opened May 12, 2024 by
ajaykg
Would using prompts that contain concatenated words to reduce token count negatively affect results
#61
opened Mar 28, 2024 by
hatgit
Huggingface already has an efficient implementation of this?
#58
opened Mar 19, 2024 by
laurislopata
Using minBPE token encoded sentence vectors need to be padded
#56
opened Mar 19, 2024 by
elevateclub