GRPO-CARE
TencentARC/GRPO-CARE
[ACL2026 Findings] GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning
83stars
Forks
2
Open issues
5
Watchers
83
Size
3.9 MB
PythonApache License 2.0
Created: Jun 18, 2025
Updated: May 18, 2026
Last push: Jun 23, 2025