minbpe
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
10,509stars
Forks
1,052
Open issues
56
Watchers
10,509
Size
0.3 MB
PythonMIT License
Created: Feb 16, 2024
Updated: May 28, 2026
Last push: Jul 1, 2024