langchain-ai/agentevals
Readymade evaluators for agent trajectories