lit-llama
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
6,080stars
Forks
520
Open issues
107
Watchers
6,080
Size
1.7 MB
PythonApache License 2.0
Created: Mar 22, 2023
Updated: Apr 14, 2026
Last push: Jul 1, 2025