Star AlbumentationsX on GitHub — it powers this leaderboard
521xueweihan/OmniParser
A simple screen parsing tool towards pure vision based GUI agent