karpathy/minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Stars: 10,337
Forks: 1,012
Open issues: 59
Watchers: 10,337
Size: 0.3 MB
Language: Python
License: MIT License
Created: Feb 16, 2024
Updated: Feb 28, 2026
Last push: Jul 1, 2024