AutoAWQ
oobabooga/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
3stars
Forks
0
Open issues
0
Watchers
3
Size
7.5 MB
PythonMIT License
Created: Jul 24, 2024
Updated: Sep 28, 2024
Last push: Sep 28, 2024
Fork