PAIR
unclecode/PAIR
Beyond single-shot evaluation: Measuring LLM capabilities through collaborative iteration
Stars: 5
Forks: 0
Open issues: 0
Watchers: 5
Size: 0.0 MB
Apache License 2.0
Created: Jan 23, 2025
Updated: Apr 29, 2025
Last push: Jan 23, 2025