This is the repo for the Video-LLaMA project, which is working on empowering large language models with video and audio understanding capabilities. Video-LLaMA is built on top of BLIP-2 and MiniGPT-4.
一些您可能无法访问的结果已被隐去。
显示无法访问的结果一些您可能无法访问的结果已被隐去。
显示无法访问的结果