Abstract: Video Moment Retrieval is a common task to evaluate the performance of visual-language models-it involves localising start and end times of moments in videos from query sentences. The ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果一些您可能无法访问的结果已被隐去。
显示无法访问的结果