lit-llama
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
6,083stars
Forks
521
Open issues
107
Watchers
6,083
Size
1.7 MB
PythonApache License 2.0
Created: Mar 22, 2023
Updated: Feb 26, 2026
Last push: Jul 1, 2025