NanmiCoder/Video2TextGPT
Converts the audio track of a video into text.