⭐ Star AlbumentationsX on GitHub — 307+ stars and counting!

Star on GitHub
FoundationAgents

VR-Bench

FoundationAgents/VR-Bench

We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench shows that fine-tuned video models consistently outperform strong VLMs on long-horizon spatial planning tasks.

58stars
Forks
6
Open issues
0
Watchers
58
Size
7.2 MB
PythonMIT License
gemini-progpt-5sora2veo3video-reasoningwanxiang
Created: Nov 12, 2025
Updated: Apr 7, 2026
Last push: Feb 4, 2026