production-stack
vllm-project/production-stack
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Stars: 2,356
Forks: 410
Open issues: 172
Watchers: 2,356
Size: 8.1 MB
Language: Python
License: Apache License 2.0
Created: Jan 21, 2025
Updated: May 26, 2026
Last push: May 25, 2026