critic-rubrics
OpenHands/critic-rubrics
Official repo for paper "A Rubric-Supervised Critic from Sparse Real-World Outcomes". Type-safe function-calling-based LLM-as-judge evaluation framework for agent behavior prediction and analysis.
15stars
Forks
6
Open issues
2
Watchers
15
Size
0.5 MB
Python
Created: Aug 18, 2025
Updated: May 26, 2026
Last push: May 26, 2026