llm-compressor
vllm-project/llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
3,287stars
Forks
522
Open issues
125
Watchers
3,287
Size
35.5 MB
PythonApache License 2.0
compressionquantization
Created: Jun 20, 2024
Updated: May 26, 2026
Last push: May 26, 2026