FlexGen
oobabooga/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
8stars
Forks
1
Open issues
0
Watchers
8
Size
36.6 MB
Apache License 2.0
Created: Apr 5, 2023
Updated: Oct 7, 2025
Last push: Apr 5, 2023
Fork
⭐ Star AlbumentationsX on GitHub — 307+ stars and counting!
Star on GitHuboobabooga/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.