critic-rubrics
OpenHands/critic-rubrics
Official repo for the paper "A Rubric-Supervised Critic from Sparse Real-World Outcomes". A type-safe, function-calling-based LLM-as-judge evaluation framework for agent behavior prediction and analysis.
Stars
10
Forks
5
Open issues
0
Watchers
10
Size
0.4 MB
Python
Created: Aug 18, 2025
Updated: Mar 20, 2026
Last push: Mar 5, 2026