⭐ Star AlbumentationsX on GitHub — 448+ stars and counting!

Star on GitHub
karpathy

minbpe

karpathy/minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

10,509stars
Forks
1,052
Open issues
56
Watchers
10,509
Size
0.3 MB
PythonMIT License
Created: Feb 16, 2024
Updated: May 28, 2026
Last push: Jul 1, 2024