SepLLM
HKUDS/SepLLM
[ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"
567stars
Forks
46
Open issues
2
Watchers
567
Size
514.2 MB
Python
inference-speedlarge-language-modelsllms
Created: Dec 11, 2024
Updated: Apr 11, 2026
Last push: Jul 29, 2025