llm-compressor
vllm-project/llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
3,078 stars
Forks
482
Open issues
130
Watchers
3,078
Size
34.4 MB
Python
Apache License 2.0
compression, quantization
Created: Jun 20, 2024
Updated: Apr 14, 2026
Last push: Apr 14, 2026