FoundationAgents

VR-Bench

FoundationAgents/VR-Bench

We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench shows that fine-tuned video models consistently outperform strong VLMs on long-horizon spatial planning tasks.

Stars: 53
Forks: 4
Open issues: 0
Watchers: 53
Size: 7.2 MB
Language: Python
License: MIT License
Topics: gemini-pro, gpt-5, sora2, veo3, video-reasoning, wanxiang
Created: Nov 12, 2025
Updated: Feb 26, 2026
Last push: Feb 4, 2026