⭐ Star AlbumentationsX on GitHub — 307+ stars and counting!

Star on GitHub
karpathy

minbpe

karpathy/minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

10,426stars
Forks
1,032
Open issues
58
Watchers
10,426
Size
0.3 MB
PythonMIT License
Created: Feb 16, 2024
Updated: Apr 14, 2026
Last push: Jul 1, 2024