⭐ Star AlbumentationsX on GitHub — 448+ stars and counting!

Star on GitHub
FoundationAgents

VR-Bench

FoundationAgents/VR-Bench

We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench shows that fine-tuned video models consistently outperform strong VLMs on long-horizon spatial planning tasks.

61stars
Forks
6
Open issues
0
Watchers
61
Size
7.2 MB
PythonMIT License
gemini-progpt-5sora2veo3video-reasoningwanxiang
Created: Nov 12, 2025
Updated: May 13, 2026
Last push: Feb 4, 2026